; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC02G028030 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC02G028030
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptionhomeobox protein knotted-1-like LET6
Genome locationCiama_Chr02:2350185..2354139
RNA-Seq ExpressionCaUC02G028030
SyntenyCaUC02G028030
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR005539 - ELK domain
IPR005540 - KNOX1
IPR005541 - KNOX2
IPR008422 - Homeobox KN domain
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589145.1 Homeobox protein SBH1, partial [Cucurbita argyrosperma subsp. sororia]9.7e-13083.39Show/hide
Query:  SSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKY
        ++NN++ +     F+P ++          T +AK+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VA G+CR   RGDDP LDQFMEAYCEML+KY
Subjt:  SSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKY

Query:  EQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIE-VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ
        EQELTKPFKEAM+FFSRIESQLK LA    SSDG+ELVGQNECSKEIE VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ
Subjt:  EQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIE-VDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ

Query:  QLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
        QLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+ICNPFSMDCSSTLF
Subjt:  QLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF

XP_004144513.1 homeobox protein knotted-1-like LET6 [Cucumis sativus]4.5e-15986.36Show/hide
Query:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINST-NNPQTLDDNNHNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNNND---L
        MEGGGSSGSSCSFMATCNS TNNNN+N  + MMMMIN T NNPQT D        NKMFLPLSWSS++S    NNN QTQ  +SAF+PQ +TNN++   +
Subjt:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINST-NNPQTLDDNNHNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNNND---L

Query:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL
          TSTSKAKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVATGSCR AG G+DPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLK  
Subjt:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL

Query:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
        AVSSDGFELVGQNECSKEIEVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
Subjt:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL

Query:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST
        DLKQINNWFINQRKRHWKP+EDMQFVVMDA HPHYYLDNVICNPFSMDCSS+
Subjt:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST

XP_008455471.1 PREDICTED: homeobox protein knotted-1-like LET6 [Cucumis melo]1.4e-15785.03Show/hide
Query:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINST-NNPQTLDDNNHNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNN---NDL
        MEGGGSSGSSCSFMATCNS TNNNN+N  + MMMMIN T NNPQT D        NKMFLPLSWSS++S    NNN QTQ  +S F+PQ +TNN   +D 
Subjt:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINST-NNPQTLDDNNHNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNN---NDL

Query:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL
          TSTSKAKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVATGSCR AG G+DPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLK  
Subjt:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL

Query:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
        AVSSDGF+LV QN+CSKEIEVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
Subjt:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL

Query:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
        DLKQINNWFINQRKRHWKP+EDMQFVVMD  HPHYYLDNVICNPFSMDCSS+ F
Subjt:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF

XP_022136261.1 homeobox protein SBH1-like [Momordica charantia]1.8e-14780.91Show/hide
Query:  MEGGGSSGSS--CSFMATCNSTNNNNTN---TPMMMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSK
        ME GGSSGSS   SFMA     NNN+++   + M+MM +S+NN   + DNN+NN NNK+F+PLSWS+TS+      QRVSAFVPQ   NN++    ++ +
Subjt:  MEGGGSSGSS--CSFMATCNSTNNNNTN---TPMMMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSK

Query:  AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVS---S
        AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVA GSCR   RGDDPALDQFMEAYCEML+KYEQEL+KPFKEAMLFFSRIESQLK LAVS   S
Subjt:  AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVS---S

Query:  DGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQ
        DG ELVGQ+ECSKE+EVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSE+QKVALAESTGLDLKQ
Subjt:  DGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQ

Query:  INNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC-NPFSMDCSSTLF
        INNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC NPFSMDCSSTLF
Subjt:  INNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC-NPFSMDCSSTLF

XP_038887406.1 homeobox protein knotted-1-like LET6 [Benincasa hispida]7.6e-17592.82Show/hide
Query:  MEGGGSSGSSCSFMATCNS----TNNNNTNTPMMMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSS--NNNAQTQRVSAFVPQSNTNNNDLLTTSTS
        MEGGGSSGSSCSFMATCNS    TNNNN+NTPMMMM NS NNPQTLDD    NNNNKMFLPLSWSS++S  NNNAQTQRVSAFVP  +TNNND+LTTSTS
Subjt:  MEGGGSSGSSCSFMATCNS----TNNNNTNTPMMMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSS--NNNAQTQRVSAFVPQSNTNNNDLLTTSTS

Query:  KAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSSDG
        KAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCR AGRGDDPALDQFMEAYCEMLTKYEQELTKPF+EAMLFFSRIESQLK LAV SDG
Subjt:  KAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSSDG

Query:  FELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQIN
        FELV QNECSKEIEVDMN+NYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQIN
Subjt:  FELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQIN

Query:  NWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
        NWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
Subjt:  NWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF

TrEMBL top hitse value%identityAlignment
A0A0A0K5G5 Uncharacterized protein2.2e-15986.36Show/hide
Query:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINST-NNPQTLDDNNHNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNNND---L
        MEGGGSSGSSCSFMATCNS TNNNN+N  + MMMMIN T NNPQT D        NKMFLPLSWSS++S    NNN QTQ  +SAF+PQ +TNN++   +
Subjt:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINST-NNPQTLDDNNHNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNNND---L

Query:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL
          TSTSKAKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVATGSCR AG G+DPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLK  
Subjt:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL

Query:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
        AVSSDGFELVGQNECSKEIEVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
Subjt:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL

Query:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST
        DLKQINNWFINQRKRHWKP+EDMQFVVMDA HPHYYLDNVICNPFSMDCSS+
Subjt:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST

A0A1S3C0Z5 homeobox protein knotted-1-like LET67.0e-15885.03Show/hide
Query:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINST-NNPQTLDDNNHNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNN---NDL
        MEGGGSSGSSCSFMATCNS TNNNN+N  + MMMMIN T NNPQT D        NKMFLPLSWSS++S    NNN QTQ  +S F+PQ +TNN   +D 
Subjt:  MEGGGSSGSSCSFMATCNS-TNNNNTN--TPMMMMINST-NNPQTLDDNNHNNNNNKMFLPLSWSSTSS----NNNAQTQR-VSAFVPQSNTNN---NDL

Query:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL
          TSTSKAKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVATGSCR AG G+DPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLK  
Subjt:  LTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTL

Query:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
        AVSSDGF+LV QN+CSKEIEVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL
Subjt:  AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGL

Query:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
        DLKQINNWFINQRKRHWKP+EDMQFVVMD  HPHYYLDNVICNPFSMDCSS+ F
Subjt:  DLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF

A0A6J1C313 homeobox protein SBH1-like8.5e-14880.91Show/hide
Query:  MEGGGSSGSS--CSFMATCNSTNNNNTN---TPMMMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSK
        ME GGSSGSS   SFMA     NNN+++   + M+MM +S+NN   + DNN+NN NNK+F+PLSWS+TS+      QRVSAFVPQ   NN++    ++ +
Subjt:  MEGGGSSGSS--CSFMATCNSTNNNNTN---TPMMMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSK

Query:  AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVS---S
        AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVA GSCR   RGDDPALDQFMEAYCEML+KYEQEL+KPFKEAMLFFSRIESQLK LAVS   S
Subjt:  AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVS---S

Query:  DGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQ
        DG ELVGQ+ECSKE+EVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSE+QKVALAESTGLDLKQ
Subjt:  DGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQ

Query:  INNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC-NPFSMDCSSTLF
        INNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC NPFSMDCSSTLF
Subjt:  INNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC-NPFSMDCSSTLF

A0A6J1ERR4 homeobox protein knotted-1-like LET61.5e-12879.29Show/hide
Query:  DDNNHNN-----NNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGS
        D NN NN     NN KMFLP        N++ Q+ R                       K+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VA G+
Subjt:  DDNNHNN-----NNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGS

Query:  CRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIE-VDMNENYIDPQAEEKELKGQLLRKY
        CR   RGDDP LDQFMEAYCEML+KYEQELTKPFKEAM+FFSRIESQLK LA    SSDG+ELVGQNECSKEIE VDMNENYIDPQAEEKELKGQLLRKY
Subjt:  CRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIE-VDMNENYIDPQAEEKELKGQLLRKY

Query:  SGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPF
        SGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+ICNPF
Subjt:  SGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPF

Query:  SMDCSSTLF
        SMDCSSTLF
Subjt:  SMDCSSTLF

A0A6J1JKT8 homeobox protein knotted-1-like LET68.9e-12977.6Show/hide
Query:  MMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQ
        ++M +  NN      N+ ++NN KMFLP        N++ QT R                      AK+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQ
Subjt:  MMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQ

Query:  ACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIE-VDMNENYIDPQAEEKEL
        AC VA G+CR   RGDDP LDQFMEAYCEML+KYEQELTKPFKEAM+FFSRIESQLK LA    SSDG+ELVGQNECSKEIE VDMNENYIDPQAEEKEL
Subjt:  ACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIE-VDMNENYIDPQAEEKEL

Query:  KGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYL
        KGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYL
Subjt:  KGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYL

Query:  DNVICNPFSMDCSSTLF
        DN+ICNPFSMDCSSTLF
Subjt:  DNVICNPFSMDCSSTLF

SwissProt top hitse value%identityAlignment
O22299 Homeobox protein knotted-1-like LET66.2e-10358.38Show/hide
Query:  MEGG--GSSGSSCSFMATCNSTNNNNTNT----------------PMMMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSN
        MEGG  G++ +SC  M       NNN N                 PMMMM+       T ++N   +NNN +FLP  +   ++NNN          PQ +
Subjt:  MEGG--GSSGSSCSFMATCNSTNNNNTNT----------------PMMMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSN

Query:  TNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGR------GDDPALDQFMEAYCEMLTKYEQELTKPFKEAML
         N+    ++S+ K+KIMAHP + RLL AY+NCQK+GAPPEVVARLE+ CA +    R +        G+DPALDQFMEAYCEMLTKYEQEL+KPFKEAM+
Subjt:  TNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGR------GDDPALDQFMEAYCEMLTKYEQELTKPFKEAML

Query:  FFSRIESQLK--TLAVSSDGFELVGQ---NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKW
        F SRIE Q K  TLA +S     +G+      S + EVD+N ++IDPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQL+DWW RH KW
Subjt:  FFSRIESQLK--TLAVSSDGFELVGQ---NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKW

Query:  PYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTL
        PYPSESQK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYY+DNV+ N F MD + +L
Subjt:  PYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTL

O65034 Homeobox protein knotted-1-like 121.2e-7761Show/hide
Query:  KAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGD--DPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSS
        KAKIMAHP +  LLAAY++CQKVGAPPEV+ RL      A    RP GR D  DP LDQFMEAYC ML KY +ELT+P  EAM F  R+ESQL T+A  +
Subjt:  KAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGD--DPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSS

Query:  DG--------FELVGQNECSKEIEVDMN----EN---YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSES
         G            G++EC    E DM+    EN    IDP+AE+KELK QLL+KYSGYL SL+QEF KKKK GKLPKEARQ+LL WW  HYKWPYPSE+
Subjt:  DG--------FELVGQNECSKEIEVDMN----EN---YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSES

Query:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMD
        +K+ALAESTGLD KQINNWFINQRKRHWKPSEDM FV+M+  HP       +  PF  D
Subjt:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMD

P46608 Homeobox protein SBH16.6e-10563.05Show/hide
Query:  SSCSFMATCNSTNNNNTNTPMMMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTS---KAKIMAHPLFP
        ++C F+  C    +N+T TP +M+ N+ NN +T DD+N+NN        L +    S+++             N NNN   ++S+S   KAKIMAHP + 
Subjt:  SSCSFMATCNSTNNNNTNTPMMMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTS---KAKIMAHPLFP

Query:  RLLAAYVNCQKVGAPPEVVARLEQACAVA---TGSCRPAGR---GDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSSDGF--EL
        RLLAAYVNCQKVGAPPEVVARLE+ACA A    G    AG    G+DPALDQFMEAYCEMLTKYEQEL+KP KEAMLF  RIE Q K L +SS  F    
Subjt:  RLLAAYVNCQKVGAPPEVVARLEQACAVA---TGSCRPAGR---GDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSSDGF--EL

Query:  VGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWF
         G    S E +VD++ N IDPQAE+++LKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLL+WW+RHYKWPYPSESQK+ALAESTGLD KQINNWF
Subjt:  VGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWF

Query:  INQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCS
        INQRKRHWKPSEDMQFVVMD +HPHYY+DNV+ NPF MD S
Subjt:  INQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCS

Q38874 Homeobox protein SHOOT MERISTEMLESS2.3e-9756.32Show/hide
Query:  GSSGSSCSFMATCNSTNNNNTNTPMMMMI------------NSTNNPQTLDD---NNHNNNNNKMFL-PLSWSSTSSNNNAQTQRVSAFVPQ---SNTNN
        GS+ +SC         N++    PMMMM+            +  +  Q  D     +H+  ++ +FL  L+    + N  A +   S+  P       ++
Subjt:  GSSGSSCSFMATCNSTNNNNTNTPMMMMI------------NSTNNPQTLDD---NNHNNNNNKMFL-PLSWSSTSSNNNAQTQRVSAFVPQ---SNTNN

Query:  NDLL-----------TTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQAC---AVATGSCRPAG-RGDDPALDQFMEAYCEMLTKYEQELTKP
        N+++           ++++ KAKIMAHP + RLLAAYVNCQKVGAPPEVVARLE+AC   A A  S  P G  G+DP LDQFMEAYCEML KYEQEL+KP
Subjt:  NDLL-----------TTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQAC---AVATGSCRPAG-RGDDPALDQFMEAYCEMLTKYEQELTKP

Query:  FKEAMLFFSRIESQLKTLAVSS-DGFELVGQ------NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLD
        FKEAM+F  R+E Q K+L++SS   F   G+      N  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLLD
Subjt:  FKEAMLFFSRIESQLKTLAVSS-DGFELVGQ------NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLD

Query:  WWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD-CSSTL
        WWSRHYKWPYPSE QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HP HY++DNV+ NPF MD  SST+
Subjt:  WWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD-CSSTL

Q9M6D9 Homeobox protein SHOOT MERISTEMLESS4.1e-9970.77Show/hide
Query:  LLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQAC---AVATGSCRPAGR-GDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIES
        +L++   KAKIMAHP + RLL AYVNCQKVGAPPEV ARLE+ C   A A  S  P G  G+DP LDQFMEAYCEML KYEQEL+KPFKEAM+F   +E 
Subjt:  LLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQAC---AVATGSCRPAGR-GDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIES

Query:  QLKTLAVSSDGFELVGQ------NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSES
        Q K+L++SS      G+      N  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLLDWWSRHYKWPYPSE 
Subjt:  QLKTLAVSSDGFELVGQ------NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSES

Query:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD
        QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HP HY++ NV+ NPF +D
Subjt:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD

Arabidopsis top hitse value%identityAlignment
AT1G23380.1 KNOTTED1-like homeobox gene 61.1e-6246.69Show/hide
Query:  NAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQ--------ACAVATGSCRPAGRGDDPALDQFMEAYCEM
        +A +   ++  P+    N+D ++ +  KAKI  HP +PRLL AY++CQKVGAPPE+   LE+           V   SC     G DP LD+FME YC++
Subjt:  NAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQ--------ACAVATGSCRPAGRGDDPALDQFMEAYCEM

Query:  LTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLP
        L KY+ +L +PF EA  F ++IE QL+ L     S+ G    G     +E+    +E   D   + E+++LK +LLRK+   + +LK EF KKKK GKLP
Subjt:  LTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLP

Query:  KEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
        +EARQ LLDWW+ HYKWPYP+E  K+ALA++TGLD KQINNWFINQRKRHWKPSE+M F +MD +   ++ +
Subjt:  KEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT1G23380.2 KNOTTED1-like homeobox gene 63.5e-6146.35Show/hide
Query:  NAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQK--VGAPPEVVARLEQ--------ACAVATGSCRPAGRGDDPALDQFMEAYC
        +A +   ++  P+    N+D ++ +  KAKI  HP +PRLL AY++CQK  VGAPPE+   LE+           V   SC     G DP LD+FME YC
Subjt:  NAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAAYVNCQK--VGAPPEVVARLEQ--------ACAVATGSCRPAGRGDDPALDQFMEAYC

Query:  EMLTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGK
        ++L KY+ +L +PF EA  F ++IE QL+ L     S+ G    G     +E+    +E   D   + E+++LK +LLRK+   + +LK EF KKKK GK
Subjt:  EMLTKYEQELTKPFKEAMLFFSRIESQLKTLAV---SSDGFELVGQNECSKEIEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGK

Query:  LPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
        LP+EARQ LLDWW+ HYKWPYP+E  K+ALA++TGLD KQINNWFINQRKRHWKPSE+M F +MD +   ++ +
Subjt:  LPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT1G62360.1 KNOX/ELK homeobox transcription factor1.6e-9856.32Show/hide
Query:  GSSGSSCSFMATCNSTNNNNTNTPMMMMI------------NSTNNPQTLDD---NNHNNNNNKMFL-PLSWSSTSSNNNAQTQRVSAFVPQ---SNTNN
        GS+ +SC         N++    PMMMM+            +  +  Q  D     +H+  ++ +FL  L+    + N  A +   S+  P       ++
Subjt:  GSSGSSCSFMATCNSTNNNNTNTPMMMMI------------NSTNNPQTLDD---NNHNNNNNKMFL-PLSWSSTSSNNNAQTQRVSAFVPQ---SNTNN

Query:  NDLL-----------TTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQAC---AVATGSCRPAG-RGDDPALDQFMEAYCEMLTKYEQELTKP
        N+++           ++++ KAKIMAHP + RLLAAYVNCQKVGAPPEVVARLE+AC   A A  S  P G  G+DP LDQFMEAYCEML KYEQEL+KP
Subjt:  NDLL-----------TTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQAC---AVATGSCRPAG-RGDDPALDQFMEAYCEMLTKYEQELTKP

Query:  FKEAMLFFSRIESQLKTLAVSS-DGFELVGQ------NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLD
        FKEAM+F  R+E Q K+L++SS   F   G+      N  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLLD
Subjt:  FKEAMLFFSRIESQLKTLAVSS-DGFELVGQ------NECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLD

Query:  WWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD-CSSTL
        WWSRHYKWPYPSE QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HP HY++DNV+ NPF MD  SST+
Subjt:  WWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD-CSSTL

AT1G70510.1 KNOTTED-like from Arabidopsis thaliana 21.6e-6149.22Show/hide
Query:  DLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA--------VATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFF
        D  + S  K+KI +HPL+PRLL  Y++CQKVGAP E+   LE+           VA  SC     G DP LD+FME YC++L KY+ +L +PF EA  F 
Subjt:  DLLTTSTSKAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA--------VATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFF

Query:  SRIESQLKTL--------AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYK
        ++IE QL+ L        A+S DG   V  +E  +E + D+  +    ++ +++LK QLLRK+  ++ SLK EF KKKK GKLP+EARQ LLDWW+ H K
Subjt:  SRIESQLKTL--------AVSSDGFELVGQNECSKEIEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYK

Query:  WPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
        WPYP+E  K++LAE TGLD KQINNWFINQRKRHWKPSE+M F +MD ++  ++ +
Subjt:  WPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT4G08150.1 KNOTTED-like from Arabidopsis thaliana9.5e-7550.43Show/hide
Query:  SSGSSCSFMATCNSTNNNNTNTPMMMMIN-STNNPQTLD-----DNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLT-TSTSKAKI
        ++ +S ++    N+TNNNN +   M+  + S+  PQT +     D++  NNNN   +    SS+  N+ +   R      ++N NNND ++     KAKI
Subjt:  SSGSSCSFMATCNSTNNNNTNTPMMMMIN-STNNPQTLD-----DNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLT-TSTSKAKI

Query:  MAHPLFPRLLAAYVNCQKVGAPPEVVARL-------EQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVS
        +AHP +  LL AY++CQK+GAPP+VV R+       E     +T S   + R  DP LDQFMEAYC+ML KY +ELT+P +EAM F  RIESQL  L  S
Subjt:  MAHPLFPRLLAAYVNCQKVGAPPEVVARL-------EQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVS

Query:  S----DGFELVGQNECSKEIEVDMNEN------YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVA
             +  +    N  S + E + N         IDP+AE++ELK  LL+KYSGYL SLKQE  KKKK GKLPKEARQ+LL WW  HYKWPYPSES+KVA
Subjt:  S----DGFELVGQNECSKEIEVDMNEN------YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVA

Query:  LAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDA-AHPHY---YLD
        LAESTGLD KQINNWFINQRKRHWKPSEDMQF+VMD   HPH+   Y+D
Subjt:  LAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDA-AHPHY---YLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGAGGAGGATCCAGTGGAAGCAGCTGTTCATTCATGGCTACTTGTAACAGCACCAACAACAATAATACTAACACACCCATGATGATGATGATCAATAGTACTAA
TAATCCTCAAACGTTGGACGACAACAACCACAACAACAACAACAACAAGATGTTCTTGCCTTTGTCTTGGTCTTCTACTTCTTCCAACAATAATGCTCAAACTCAAAGGG
TTTCTGCTTTTGTTCCTCAATCTAATACCAATAATAATGATCTTCTTACCACTTCCACTTCCAAAGCTAAAATTATGGCTCATCCTCTCTTCCCTCGCCTCCTCGCTGCC
TACGTCAACTGCCAAAAGGTGGGTGCGCCGCCGGAAGTGGTGGCGAGGCTAGAGCAGGCGTGTGCCGTGGCGACGGGAAGCTGTAGGCCGGCGGGACGTGGGGATGATCC
AGCGTTGGATCAGTTCATGGAGGCTTATTGTGAGATGTTGACCAAATATGAACAAGAGTTGACCAAACCTTTTAAAGAAGCAATGCTCTTCTTCTCAAGAATTGAGTCTC
AGCTGAAAACCCTAGCAGTTTCTTCTGATGGTTTCGAGTTGGTTGGGCAAAACGAGTGCTCGAAGGAGATTGAGGTGGATATGAACGAAAACTACATAGACCCTCAAGCC
GAAGAGAAGGAACTCAAAGGCCAACTTCTACGCAAATACAGCGGATATCTTGGGAGCCTAAAACAAGAGTTTTTGAAGAAGAAAAAGAATGGGAAGTTGCCAAAAGAAGC
TAGACAACAATTGCTCGACTGGTGGAGTCGACACTACAAATGGCCATATCCCTCGGAGTCGCAAAAGGTGGCGTTGGCGGAGTCGACGGGGCTAGACTTGAAGCAGATCA
ATAATTGGTTTATTAACCAAAGAAAGCGCCATTGGAAGCCATCAGAGGATATGCAGTTTGTGGTGATGGATGCTGCTCATCCACATTACTATTTGGACAATGTCATATGC
AATCCTTTCTCTATGGATTGTTCTTCTACTCTTTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGAGGAGGATCCAGTGGAAGCAGCTGTTCATTCATGGCTACTTGTAACAGCACCAACAACAATAATACTAACACACCCATGATGATGATGATCAATAGTACTAA
TAATCCTCAAACGTTGGACGACAACAACCACAACAACAACAACAACAAGATGTTCTTGCCTTTGTCTTGGTCTTCTACTTCTTCCAACAATAATGCTCAAACTCAAAGGG
TTTCTGCTTTTGTTCCTCAATCTAATACCAATAATAATGATCTTCTTACCACTTCCACTTCCAAAGCTAAAATTATGGCTCATCCTCTCTTCCCTCGCCTCCTCGCTGCC
TACGTCAACTGCCAAAAGGTGGGTGCGCCGCCGGAAGTGGTGGCGAGGCTAGAGCAGGCGTGTGCCGTGGCGACGGGAAGCTGTAGGCCGGCGGGACGTGGGGATGATCC
AGCGTTGGATCAGTTCATGGAGGCTTATTGTGAGATGTTGACCAAATATGAACAAGAGTTGACCAAACCTTTTAAAGAAGCAATGCTCTTCTTCTCAAGAATTGAGTCTC
AGCTGAAAACCCTAGCAGTTTCTTCTGATGGTTTCGAGTTGGTTGGGCAAAACGAGTGCTCGAAGGAGATTGAGGTGGATATGAACGAAAACTACATAGACCCTCAAGCC
GAAGAGAAGGAACTCAAAGGCCAACTTCTACGCAAATACAGCGGATATCTTGGGAGCCTAAAACAAGAGTTTTTGAAGAAGAAAAAGAATGGGAAGTTGCCAAAAGAAGC
TAGACAACAATTGCTCGACTGGTGGAGTCGACACTACAAATGGCCATATCCCTCGGAGTCGCAAAAGGTGGCGTTGGCGGAGTCGACGGGGCTAGACTTGAAGCAGATCA
ATAATTGGTTTATTAACCAAAGAAAGCGCCATTGGAAGCCATCAGAGGATATGCAGTTTGTGGTGATGGATGCTGCTCATCCACATTACTATTTGGACAATGTCATATGC
AATCCTTTCTCTATGGATTGTTCTTCTACTCTTTTCTGA
Protein sequenceShow/hide protein sequence
MEGGGSSGSSCSFMATCNSTNNNNTNTPMMMMINSTNNPQTLDDNNHNNNNNKMFLPLSWSSTSSNNNAQTQRVSAFVPQSNTNNNDLLTTSTSKAKIMAHPLFPRLLAA
YVNCQKVGAPPEVVARLEQACAVATGSCRPAGRGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMLFFSRIESQLKTLAVSSDGFELVGQNECSKEIEVDMNENYIDPQA
EEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC
NPFSMDCSSTLF