; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014916 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014916
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptionhomeobox protein knotted-1-like LET6
Genome locationchr12:5849358..5852543
RNA-Seq ExpressionLag0014916
SyntenyLag0014916
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR005539 - ELK domain
IPR005540 - KNOX1
IPR005541 - KNOX2
IPR008422 - Homeobox KN domain
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589145.1 Homeobox protein SBH1, partial [Cucurbita argyrosperma subsp. sororia]1.0e-13783.44Show/hide
Query:  ILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSC
        I+  HNNNN+SDS  NN KMFLP                        ND  +  TCRAK+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VAAG+C
Subjt:  ILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSC

Query:  RVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYS
        R  ARGDDP LDQFMEAYCEML+KYEQELTKPFKEAMVFFSRIESQLKALA SSS+SDG ELVGQNECSKE+E VDMNENYIDPQAEEKELKGQLLRKYS
Subjt:  RVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYS

Query:  GYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFS
        GYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+ICNPFS
Subjt:  GYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFS

Query:  MDCSSTLF
        MDCSSTLF
Subjt:  MDCSSTLF

XP_004144513.1 homeobox protein knotted-1-like LET6 [Cucumis sativus]1.1e-13976.32Show/hide
Query:  GSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHN--------APRVSAFVPQ--LNNNN
        G  + GS+ SF+    +  NN+NSN   +S  +MMI+ T       HNN    D     NKMFLPLSWSS+T +N           +SAF+PQ   NN+N
Subjt:  GSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHN--------APRVSAFVPQ--LNNNN

Query:  D--LHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQ
        D  +   ST +AKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVA GSCR A  G+DPALDQFMEAYCEMLTKYEQELTKPFKEAM+FFSRIESQ
Subjt:  D--LHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQ

Query:  LKALAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVA
        LKA AVS   SDG ELVGQNECSKE+EVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVA
Subjt:  LKALAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVA

Query:  LAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST
        LAESTGLDLKQINNWFINQRKRHWKP+EDMQFVVMDA HPHYYLDNVICNPFSMDCSS+
Subjt:  LAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST

XP_022136261.1 homeobox protein SBH1-like [Momordica charantia]2.0e-15482.07Show/hide
Query:  MERGSSGSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHN
        MERG S  S+GG  SSF+ GF  NNNN +S+   S   +      N QIL+  NNNNN      NNK+F+PLSWS+ +   A RVSAFVPQ  +NN  HN
Subjt:  MERGSSGSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHN

Query:  -NSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALA
          ++CRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVA GSCR  ARGDDPALDQFMEAYCEML+KYEQEL+KPFKEAM+FFSRIESQLKALA
Subjt:  -NSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALA

Query:  VSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAEST
        VSSSTSDGCELVGQ+ECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSE+QKVALAEST
Subjt:  VSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAEST

Query:  GLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC-NPFSMDCSSTLF
        GLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC NPFSMDCSSTLF
Subjt:  GLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC-NPFSMDCSSTLF

XP_022988875.1 homeobox protein knotted-1-like LET6 [Cucurbita maxima]1.0e-13783.44Show/hide
Query:  ILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSC
        I+  HNNNN+SDS  NN KMFLP                        ND  +  TCRAK+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VAAG+C
Subjt:  ILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSC

Query:  RVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYS
        R  ARGDDP LDQFMEAYCEML+KYEQELTKPFKEAMVFFSRIESQLKALA SSS+SDG ELVGQNECSKE+E VDMNENYIDPQAEEKELKGQLLRKYS
Subjt:  RVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYS

Query:  GYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFS
        GYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+ICNPFS
Subjt:  GYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFS

Query:  MDCSSTLF
        MDCSSTLF
Subjt:  MDCSSTLF

XP_038887406.1 homeobox protein knotted-1-like LET6 [Benincasa hispida]6.3e-14879.05Show/hide
Query:  GSSAGGSTSSFIP---GFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHN-----APRVSAFVPQLN-NNND
        G  + GS+ SF+          NN+NSNT +    +MM ++ NPQ L+           NNNNKMFLPLSWSS+T +N       RVSAFVP  + NNND
Subjt:  GSSAGGSTSSFIP---GFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHN-----APRVSAFVPQLN-NNND

Query:  LHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKA
        +   ST +AKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVA GSCR A RGDDPALDQFMEAYCEMLTKYEQELTKPF+EAM+FFSRIESQLKA
Subjt:  LHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKA

Query:  LAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAE
        LAV    SDG ELV QNECSKE+EVDMN+NYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAE
Subjt:  LAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAE

Query:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
        STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
Subjt:  STGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF

TrEMBL top hitse value%identityAlignment
A0A0A0K5G5 Uncharacterized protein5.2e-14076.32Show/hide
Query:  GSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHN--------APRVSAFVPQ--LNNNN
        G  + GS+ SF+    +  NN+NSN   +S  +MMI+ T       HNN    D     NKMFLPLSWSS+T +N           +SAF+PQ   NN+N
Subjt:  GSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHN--------APRVSAFVPQ--LNNNN

Query:  D--LHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQ
        D  +   ST +AKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVA GSCR A  G+DPALDQFMEAYCEMLTKYEQELTKPFKEAM+FFSRIESQ
Subjt:  D--LHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQ

Query:  LKALAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVA
        LKA AVS   SDG ELVGQNECSKE+EVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVA
Subjt:  LKALAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVA

Query:  LAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST
        LAESTGLDLKQINNWFINQRKRHWKP+EDMQFVVMDA HPHYYLDNVICNPFSMDCSS+
Subjt:  LAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSST

A0A1S3C0Z5 homeobox protein knotted-1-like LET61.1e-13774.52Show/hide
Query:  GSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHN--------APRVSAFVPQLNNNN--
        G  + GS+ SF+    +  NN+NSN   +S  +MMI+ TN       NN    D     NKMFLPLSWSS+T +N           +S F+PQ + NN  
Subjt:  GSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHN--------APRVSAFVPQLNNNN--

Query:  --DLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQ
          D    ST +AKIMAHPLFPRLL AYVNCQKVGAPPEVVARLEQACAVA GSCR A  G+DPALDQFMEAYCEMLTKYEQELTKPFKEAM+FFSRIESQ
Subjt:  --DLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQ

Query:  LKALAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVA
        LKA AVS   SDG +LV QN+CSKE+EVDMNENYIDPQAE KELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVA
Subjt:  LKALAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVA

Query:  LAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF
        LAESTGLDLKQINNWFINQRKRHWKP+EDMQFVVMD  HPHYYLDNVICNPFSMDCSS+ F
Subjt:  LAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTLF

A0A6J1C313 homeobox protein SBH1-like9.8e-15582.07Show/hide
Query:  MERGSSGSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHN
        MERG S  S+GG  SSF+ GF  NNNN +S+   S   +      N QIL+  NNNNN      NNK+F+PLSWS+ +   A RVSAFVPQ  +NN  HN
Subjt:  MERGSSGSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHN

Query:  -NSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALA
          ++CRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVA GSCR  ARGDDPALDQFMEAYCEML+KYEQEL+KPFKEAM+FFSRIESQLKALA
Subjt:  -NSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALA

Query:  VSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAEST
        VSSSTSDGCELVGQ+ECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQ+LLDWWSRHYKWPYPSE+QKVALAEST
Subjt:  VSSSTSDGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAEST

Query:  GLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC-NPFSMDCSSTLF
        GLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC NPFSMDCSSTLF
Subjt:  GLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVIC-NPFSMDCSSTLF

A0A6J1ERR4 homeobox protein knotted-1-like LET68.3e-13882.47Show/hide
Query:  ILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSC
        I+  HNN+NN+DS +NN KMFLP                        ND  +  +CR K+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VAAG+C
Subjt:  ILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSC

Query:  RVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYS
        R  ARGDDP LDQFMEAYCEML+KYEQELTKPFKEAMVFFSRIESQLKALA SSS+SDG ELVGQNECSKE+E VDMNENYIDPQAEEKELKGQLLRKYS
Subjt:  RVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYS

Query:  GYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFS
        GYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+ICNPFS
Subjt:  GYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFS

Query:  MDCSSTLF
        MDCSSTLF
Subjt:  MDCSSTLF

A0A6J1JKT8 homeobox protein knotted-1-like LET64.9e-13883.44Show/hide
Query:  ILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSC
        I+  HNNNN+SDS  NN KMFLP                        ND  +  TCRAK+MAHPLFPRLLA+YVNCQKVGAPP+VVARLEQAC VAAG+C
Subjt:  ILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSC

Query:  RVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYS
        R  ARGDDP LDQFMEAYCEML+KYEQELTKPFKEAMVFFSRIESQLKALA SSS+SDG ELVGQNECSKE+E VDMNENYIDPQAEEKELKGQLLRKYS
Subjt:  RVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVE-VDMNENYIDPQAEEKELKGQLLRKYS

Query:  GYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFS
        GYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDN+ICNPFS
Subjt:  GYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFS

Query:  MDCSSTLF
        MDCSSTLF
Subjt:  MDCSSTLF

SwissProt top hitse value%identityAlignment
O22299 Homeobox protein knotted-1-like LET61.6e-10658.78Show/hide
Query:  MERGSSGSSAGGSTSSFIPGFPNNNNNDNSN---------TALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQ
        ME GSSG+++         G   NNNN+N N         T  +   +MM+    P +     NNNN+++S NNN +FLP                F+  
Subjt:  MERGSSGSSAGGSTSSFIPGFPNNNNNDNSN---------TALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQ

Query:  LNNNNDLHNN----STCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAAR------GDDPALDQFMEAYCEMLTKYEQELTKPF
         NNNN   +N    S+ ++KIMAHP + RLL AY+NCQK+GAPPEVVARLE+ CA +A   R ++       G+DPALDQFMEAYCEMLTKYEQEL+KPF
Subjt:  LNNNNDLHNN----STCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAAR------GDDPALDQFMEAYCEMLTKYEQELTKPF

Query:  KEAMVFFSRIESQLKALAVSSSTSDGC---ELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWW
        KEAMVF SRIE Q KAL ++ ++S      E + +N  S E EVD+N ++IDPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQL+DWW
Subjt:  KEAMVFFSRIESQLKALAVSSSTSDGC---ELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWW

Query:  SRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTL
         RH KWPYPSESQK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYY+DNV+ N F MD + +L
Subjt:  SRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCSSTL

O65034 Homeobox protein knotted-1-like 122.8e-7459.46Show/hide
Query:  RAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGD--DPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALA--V
        +AKIMAHP +  LLAAY++CQKVGAPPEV+ RL      A    R   R D  DP LDQFMEAYC ML KY +ELT+P  EAM F  R+ESQL  +A   
Subjt:  RAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGD--DPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALA--V

Query:  SSSTSDGCELV---GQNECSKEVEVDMN----EN---YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSES
            +    L+   G++EC    E DM+    EN    IDP+AE+KELK QLL+KYSGYL SL+QEF KKKK GKLPKEARQ+LL WW  HYKWPYPSE+
Subjt:  SSSTSDGCELV---GQNECSKEVEVDMN----EN---YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSES

Query:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMD
        +K+ALAESTGLD KQINNWFINQRKRHWKPSEDM FV+M+  HP       +  PF  D
Subjt:  QKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMD

P46608 Homeobox protein SBH11.2e-10462.24Show/hide
Query:  PNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNN---DLHNNSTCRAKIMAHPLFPRLL
        P+NNNN N+N     +      T +  +   HNNN   D  NNNN   L   +  +  H+    +      NNNN      ++S  +AKIMAHP + RLL
Subjt:  PNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNN---DLHNNSTCRAKIMAHPLFPRLL

Query:  AAYVNCQKVGAPPEVVARLEQACAVAA------GSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVG
        AAYVNCQKVGAPPEVVARLE+ACA AA       +   +  G+DPALDQFMEAYCEMLTKYEQEL+KP KEAM+F  RIE Q K L +SSS     E  G
Subjt:  AAYVNCQKVGAPPEVVARLEQACAVAA------GSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVG

Query:  QNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFIN
            S E +VD++ N IDPQAE+++LKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLL+WW+RHYKWPYPSESQK+ALAESTGLD KQINNWFIN
Subjt:  QNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFIN

Query:  QRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCS
        QRKRHWKPSEDMQFVVMD +HPHYY+DNV+ NPF MD S
Subjt:  QRKRHWKPSEDMQFVVMDAAHPHYYLDNVICNPFSMDCS

Q38874 Homeobox protein SHOOT MERISTEMLESS4.1e-10269.6Show/hide
Query:  LNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCR----VAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVF
        +N  +   ++++ +AKIMAHP + RLLAAYVNCQKVGAPPEVVARLE+AC+ AA +          G+DP LDQFMEAYCEML KYEQEL+KPFKEAMVF
Subjt:  LNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCR----VAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVF

Query:  FSRIESQLKALAVSSSTS----DGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYK
          R+E Q K+L++SS +S        +   N  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLLDWWSRHYK
Subjt:  FSRIESQLKALAVSSSTS----DGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYK

Query:  WPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD-CSSTL
        WPYPSE QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HP HY++DNV+ NPF MD  SST+
Subjt:  WPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD-CSSTL

Q9M6D9 Homeobox protein SHOOT MERISTEMLESS3.3e-9964.38Show/hide
Query:  SSATPHNAPRVSAFVPQLNNNNDL---------HNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCR----VAARGDDPALDQ
        SS +P +     +F+ ++N+ N+L          ++   +AKIMAHP + RLL AYVNCQKVGAPPEV ARLE+ C+ AA +        + G+DP LDQ
Subjt:  SSATPHNAPRVSAFVPQLNNNNDL---------HNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCR----VAARGDDPALDQ

Query:  FMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDG---CELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFL
        FMEAYCEML KYEQEL+KPFKEAMVF   +E Q K+L++SS +S G     +   N  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+
Subjt:  FMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDG---CELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFL

Query:  KKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD
        KK+K GKLPKEARQQLLDWWSRHYKWPYPSE QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HP HY++ NV+ NPF +D
Subjt:  KKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD

Arabidopsis top hitse value%identityAlignment
AT1G23380.1 KNOTTED1-like homeobox gene 65.6e-6241.18Show/hide
Query:  NDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQK
        +D S   +S  ++M        +      N  SD   ++  + + +S  S+        ++  P++  N+D  + +  +AKI  HP +PRLL AY++CQK
Subjt:  NDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQK

Query:  VGAPPEVVARLEQ--------ACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSK
        VGAPPE+   LE+           V   SC     G DP LD+FME YC++L KY+ +L +PF EA  F ++IE QL+ L     ++ G    G     +
Subjt:  VGAPPEVVARLEQ--------ACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSK

Query:  EVEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKR
        E+    +E   D   + E+++LK +LLRK+   + +LK EF KKKK GKLP+EARQ LLDWW+ HYKWPYP+E  K+ALA++TGLD KQINNWFINQRKR
Subjt:  EVEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKR

Query:  HWKPSEDMQFVVMDAAHPHYYLD
        HWKPSE+M F +MD +   ++ +
Subjt:  HWKPSEDMQFVVMDAAHPHYYLD

AT1G23380.2 KNOTTED1-like homeobox gene 61.8e-6040.92Show/hide
Query:  NDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQK
        +D S   +S  ++M        +      N  SD   ++  + + +S  S+        ++  P++  N+D  + +  +AKI  HP +PRLL AY++CQK
Subjt:  NDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQK

Query:  --VGAPPEVVARLEQ--------ACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNEC
          VGAPPE+   LE+           V   SC     G DP LD+FME YC++L KY+ +L +PF EA  F ++IE QL+ L     ++ G    G    
Subjt:  --VGAPPEVVARLEQ--------ACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNEC

Query:  SKEVEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQR
         +E+    +E   D   + E+++LK +LLRK+   + +LK EF KKKK GKLP+EARQ LLDWW+ HYKWPYP+E  K+ALA++TGLD KQINNWFINQR
Subjt:  SKEVEVDMNENYID--PQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQR

Query:  KRHWKPSEDMQFVVMDAAHPHYYLD
        KRHWKPSE+M F +MD +   ++ +
Subjt:  KRHWKPSEDMQFVVMDAAHPHYYLD

AT1G62360.1 KNOX/ELK homeobox transcription factor2.9e-10369.6Show/hide
Query:  LNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCR----VAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVF
        +N  +   ++++ +AKIMAHP + RLLAAYVNCQKVGAPPEVVARLE+AC+ AA +          G+DP LDQFMEAYCEML KYEQEL+KPFKEAMVF
Subjt:  LNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCR----VAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVF

Query:  FSRIESQLKALAVSSSTS----DGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYK
          R+E Q K+L++SS +S        +   N  S E EVDMN  ++DPQAE++ELKGQLLRKYSGYLGSLKQEF+KK+K GKLPKEARQQLLDWWSRHYK
Subjt:  FSRIESQLKALAVSSSTS----DGCELVGQNECSKEVEVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYK

Query:  WPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD-CSSTL
        WPYPSE QK+ALAESTGLD KQINNWFINQRKRHWKPSEDMQFVVMDA HP HY++DNV+ NPF MD  SST+
Subjt:  WPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHP-HYYLDNVICNPFSMD-CSSTL

AT1G70510.1 KNOTTED-like from Arabidopsis thaliana 25.6e-6246.97Show/hide
Query:  SAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA--------VAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQEL
        S  +P++    D  + S  ++KI +HPL+PRLL  Y++CQKVGAP E+   LE+           VA  SC     G DP LD+FME YC++L KY+ +L
Subjt:  SAFVPQLNNNNDLHNNSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQACA--------VAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQEL

Query:  TKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQ--AEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLL
         +PF EA  F ++IE QL+ L    +++      G     +E+  D +    D Q  + +++LK QLLRK+  ++ SLK EF KKKK GKLP+EARQ LL
Subjt:  TKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSKEVEVDMNENYIDPQ--AEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLL

Query:  DWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD
        DWW+ H KWPYP+E  K++LAE TGLD KQINNWFINQRKRHWKPSE+M F +MD ++  ++ +
Subjt:  DWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDAAHPHYYLD

AT4G08150.1 KNOTTED-like from Arabidopsis thaliana3.4e-7547.77Show/hide
Query:  SSGSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAF----VPQLNNNNDLHN
        ++ ++   ++S++ PG+ N NNN++ +  +       + +  PQ  E    +++   +NNNN      + SS   H +  + A         NNN+++ +
Subjt:  SSGSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAF----VPQLNNNNDLHN

Query:  NSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQA-----CAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQL
            +AKI+AHP +  LL AY++CQK+GAPP+VV R+  A           +  V+A   DP LDQFMEAYC+ML KY +ELT+P +EAM F  RIESQL
Subjt:  NSTCRAKIMAHPLFPRLLAAYVNCQKVGAPPEVVARLEQA-----CAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQL

Query:  KALAVSS----STSDGCELVGQNECSKEVEVDMNEN------YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWP
          L  S     +  DG      N  S + E + N         IDP+AE++ELK  LL+KYSGYL SLKQE  KKKK GKLPKEARQ+LL WW  HYKWP
Subjt:  KALAVSS----STSDGCELVGQNECSKEVEVDMNEN------YIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWP

Query:  YPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDA-AHPHY---YLD
        YPSES+KVALAESTGLD KQINNWFINQRKRHWKPSEDMQF+VMD   HPH+   Y+D
Subjt:  YPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMDA-AHPHY---YLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCGAGGATCCAGTGGAAGCAGTGCCGGTGGCAGCACCAGCTCTTTCATTCCTGGATTTCCAAACAACAACAATAATGATAACTCTAACACTGCTCTTTCTTCTTC
AGCCATCATGATGATCGATACTACTAATCCCCAAATATTGGAACAACACAACAATAACAACAACAGCGACAGCAGCAACAACAATAACAAGATGTTTTTGCCTTTGTCTT
GGTCGTCTGCTACTCCCCACAACGCTCCAAGGGTTTCTGCTTTTGTCCCTCAGCTCAATAACAACAATGATCTTCATAACAACTCCACTTGCCGAGCTAAAATCATGGCT
CATCCCCTCTTTCCTCGCCTCCTCGCAGCCTACGTCAACTGCCAGAAGGTGGGTGCACCGCCGGAAGTGGTGGCGAGGCTAGAGCAGGCGTGCGCGGTGGCCGCCGGGAG
CTGTAGGGTGGCGGCACGTGGGGATGATCCAGCGCTGGATCAGTTCATGGAAGCCTACTGTGAAATGTTGACCAAGTACGAACAAGAGTTGACCAAACCTTTCAAAGAAG
CAATGGTTTTCTTCTCAAGAATCGAGTCTCAGCTCAAAGCCCTAGCTGTTTCTTCTTCTACTTCTGATGGTTGTGAACTGGTCGGGCAAAACGAGTGTTCGAAGGAGGTC
GAGGTCGATATGAATGAAAACTACATTGACCCTCAAGCTGAAGAGAAGGAACTCAAAGGCCAACTTTTGCGCAAATACAGCGGATATCTCGGCAGCCTAAAGCAAGAGTT
TCTTAAAAAGAAGAAGAATGGAAAGCTTCCGAAAGAAGCTCGACAGCAACTTCTCGACTGGTGGAGCCGACACTATAAATGGCCATATCCCTCGGAGTCGCAAAAGGTGG
CGCTGGCGGAGTCGACGGGGCTGGACTTGAAGCAGATCAATAACTGGTTCATCAACCAAAGAAAGCGCCATTGGAAGCCATCAGAGGATATGCAGTTTGTGGTGATGGAT
GCTGCTCATCCACACTACTATTTGGACAATGTCATCTGCAATCCCTTCTCTATGGATTGTTCTTCCACTCTTTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCGAGGATCCAGTGGAAGCAGTGCCGGTGGCAGCACCAGCTCTTTCATTCCTGGATTTCCAAACAACAACAATAATGATAACTCTAACACTGCTCTTTCTTCTTC
AGCCATCATGATGATCGATACTACTAATCCCCAAATATTGGAACAACACAACAATAACAACAACAGCGACAGCAGCAACAACAATAACAAGATGTTTTTGCCTTTGTCTT
GGTCGTCTGCTACTCCCCACAACGCTCCAAGGGTTTCTGCTTTTGTCCCTCAGCTCAATAACAACAATGATCTTCATAACAACTCCACTTGCCGAGCTAAAATCATGGCT
CATCCCCTCTTTCCTCGCCTCCTCGCAGCCTACGTCAACTGCCAGAAGGTGGGTGCACCGCCGGAAGTGGTGGCGAGGCTAGAGCAGGCGTGCGCGGTGGCCGCCGGGAG
CTGTAGGGTGGCGGCACGTGGGGATGATCCAGCGCTGGATCAGTTCATGGAAGCCTACTGTGAAATGTTGACCAAGTACGAACAAGAGTTGACCAAACCTTTCAAAGAAG
CAATGGTTTTCTTCTCAAGAATCGAGTCTCAGCTCAAAGCCCTAGCTGTTTCTTCTTCTACTTCTGATGGTTGTGAACTGGTCGGGCAAAACGAGTGTTCGAAGGAGGTC
GAGGTCGATATGAATGAAAACTACATTGACCCTCAAGCTGAAGAGAAGGAACTCAAAGGCCAACTTTTGCGCAAATACAGCGGATATCTCGGCAGCCTAAAGCAAGAGTT
TCTTAAAAAGAAGAAGAATGGAAAGCTTCCGAAAGAAGCTCGACAGCAACTTCTCGACTGGTGGAGCCGACACTATAAATGGCCATATCCCTCGGAGTCGCAAAAGGTGG
CGCTGGCGGAGTCGACGGGGCTGGACTTGAAGCAGATCAATAACTGGTTCATCAACCAAAGAAAGCGCCATTGGAAGCCATCAGAGGATATGCAGTTTGTGGTGATGGAT
GCTGCTCATCCACACTACTATTTGGACAATGTCATCTGCAATCCCTTCTCTATGGATTGTTCTTCCACTCTTTTCTGA
Protein sequenceShow/hide protein sequence
MERGSSGSSAGGSTSSFIPGFPNNNNNDNSNTALSSSAIMMIDTTNPQILEQHNNNNNSDSSNNNNKMFLPLSWSSATPHNAPRVSAFVPQLNNNNDLHNNSTCRAKIMA
HPLFPRLLAAYVNCQKVGAPPEVVARLEQACAVAAGSCRVAARGDDPALDQFMEAYCEMLTKYEQELTKPFKEAMVFFSRIESQLKALAVSSSTSDGCELVGQNECSKEV
EVDMNENYIDPQAEEKELKGQLLRKYSGYLGSLKQEFLKKKKNGKLPKEARQQLLDWWSRHYKWPYPSESQKVALAESTGLDLKQINNWFINQRKRHWKPSEDMQFVVMD
AAHPHYYLDNVICNPFSMDCSSTLF