; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0011208 (gene) of Chayote v1 genome

Gene IDSed0011208
OrganismSechium edule (Chayote v1)
DescriptionARM repeat superfamily protein
Genome locationLG01:56066075..56070533
RNA-Seq ExpressionSed0011208
SyntenySed0011208
Gene Ontology termsGO:0006468 - protein phosphorylation (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0004672 - protein kinase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR011989 - Armadillo-like helical
IPR016024 - Armadillo-type fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570835.1 hypothetical protein SDJN03_29750, partial [Cucurbita argyrosperma subsp. sororia]1.7e-19973.43Show/hide
Query:  LEAAHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ--------------------------------------
        L+ AHHPS P  ELFDISTTVDPSY+ISLIRKLLP  ASNL NS G  D D    + TNMD+                                      
Subjt:  LEAAHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ--------------------------------------

Query:  -EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLD
         E ACP   +Q ISS +E VWE+YGCILWDLS++KSHA+LMV NLVLEVLSANLMVSQSVRVMEI LGIIGNLACHEVPMKHIV KSGLI+TIV+QL LD
Subjt:  -EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLD

Query:  DAQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERS
        DAQCLCEVCRLL AGLQSSEC IWAEALNSEHVLSRILWVSENTLN QLIEKSVGLL  IIESQQE  H+LLP LMKLGLSSVLFNLF+FEMKI  NERS
Subjt:  DAQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERS

Query:  GERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDA
         ERYSILD ILR++EALSGI+EHSQE CSNK  FQLVC L+KLPDAFEVSSSC+SAV+LIANILSDVPDLA DMSQDLSFLQGLLDIF FAGDDLEARDA
Subjt:  GERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDA

Query:  VWSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGSD-MRDEYRVEDVD
        VWSIIARILV V+E A+S+PR+FEYVSLLVSKTDLIEDDLL+ R+TE NK+EDGLTS C K NSRCISLRRIIAILN WTASKDEG+  +R EYR ED++
Subjt:  VWSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGSD-MRDEYRVEDVD

Query:  VNRLLNCCRKYS---------YLTILH
        VNRLL+CC K+S          LTILH
Subjt:  VNRLLNCCRKYS---------YLTILH

XP_004145826.1 uncharacterized protein LOC101215373 [Cucumis sativus]5.2e-20175Show/hide
Query:  AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ---------------------------------------EA
        AHHPS P  E+FDISTTVDPSY+ISLIRKLLP  ASN  NS GNG HD G  +   MD+                                       E 
Subjt:  AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ---------------------------------------EA

Query:  ACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQ
        ACP   +Q ISS +EKVWE+YGCILWDLS+++S A+LMV NLVLEVLSANLMVSQSVRVMEI LGIIGNLACHEVPMKHIV KSGLI+TIVSQL LDDAQ
Subjt:  ACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQ

Query:  CLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSGER
        CLCEVCRLL  GLQSSECVIWAEALNSEHVLSRILWVSENTLN QLIEKSVGLL  IIESQQE  H+LL  LMKLGLSSVLFNLF+FEMKI  NERS ER
Subjt:  CLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSGER

Query:  YSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAVWS
        +SILDVILR++EALSG +EHS+E+CSNK+LFQLV +L+KLPDAFEVSSSC+SAVVLIANILSDVPDLAF+MSQDLSFLQGLLDIF F GDD EARDAVWS
Subjt:  YSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAVWS

Query:  IIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGSDMRDEYRVEDVDVNRL
        IIARILVRVQEN +S+P+LFEYVSLLVSKTDLIEDDLL+  +TESNKEEDG+TS C K NSRCISLRRII+ILNHWTASKDEG+D+RDEY +EDVDVNRL
Subjt:  IIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGSDMRDEYRVEDVDVNRL

Query:  LNCCRKYS
        L CC K+S
Subjt:  LNCCRKYS

XP_008458655.1 PREDICTED: uncharacterized protein LOC103497988 isoform X4 [Cucumis melo]1.8e-20178.38Show/hide
Query:  AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQEAACPTLD------------QQHISSQQEKVWEDYGCILWD
        AHHPS P  ELFDISTTVDPSY+ISLIRKLLP  ASN  NS  NG HD G  +   MD++ +    D            +Q ISS +EKVWE+YGCILWD
Subjt:  AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQEAACPTLD------------QQHISSQQEKVWEDYGCILWD

Query:  LSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNS
        LS+++S A+LMV NLVLEVLSANLMVSQSVRVMEI LGIIGNLACHEVPMKHIV KSGLI+TIVSQL LDDAQCLCEVCRLL  GLQSSECVIWAEALN 
Subjt:  LSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNS

Query:  EHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSN
        EHVLSRILWVSENTLN QLIEKSVGLL  IIES QE  H LLP LMKLGLSSVLFNLF+FEMKI  NERS ER+SILDVILR++E LSGI+EHS E+CSN
Subjt:  EHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSN

Query:  KDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLV
        K+LFQLV +L+KLPDAFEVSSSC+SAVVLIANILSDVPDLAF+MSQDLSFLQGL D F FAGDDLEARDAVWSIIARILVRVQEN +S+P+L EYVSLLV
Subjt:  KDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLV

Query:  SKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGSDMRDEYRVEDVDVNRLLNCCRKYS
        SKTDLIEDDLL+  +TESNKEEDG+TS C K NSRCISLRRII+ILNHWTASKDEG+D+RDEY VEDVDVNRLL CC K+S
Subjt:  SKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGSDMRDEYRVEDVDVNRLLNCCRKYS

XP_022944140.1 uncharacterized protein LOC111448685 isoform X1 [Cucurbita moschata]7.5e-20073.47Show/hide
Query:  SESEPHLEA-------AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ-------------------------
        +E +P LE+       AHHPS P  ELFDISTTVDPSY+ISLIRKLLP  ASNL NS G  D D    + TNMD+                         
Subjt:  SESEPHLEA-------AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ-------------------------

Query:  --------------EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKS
                      E ACP   +Q ISS +E VWE+YGCILWDLS++KSHA+LMV NLVLEVLSANLMVSQSVRVMEI LGIIGNLACHEVPMKHIV KS
Subjt:  --------------EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKS

Query:  GLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNL
        GLI+ IV+QL LDDAQCLCEVCRLL AGL SSEC IWAEALNSEHVLSRILWVSENTLN QLIEKSVGLL  IIESQQE  H+LLP LMKLGLSS LFNL
Subjt:  GLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNL

Query:  FAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDI
        F+FEMKI  NERS ERYSILD ILR++EALSGI+EHSQE CSNK LFQLV  L+KLPDAFEVSSSC+SAV+LIANILSDVPDLAFDMSQDLSFLQGLLDI
Subjt:  FAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDI

Query:  FLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGS
        F FAGDDLEARDAVWSIIARILV V+E A+S+PR+FEYVSLLVSKTDLIEDDLL+ R+TE NK+EDGLTS C K NSRCISLRRIIAILN WT SKDEG+
Subjt:  FLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGS

Query:  DMRDEYRVEDVDVNRLLNCCRKYS
        D+RDEYR ED+DVNRLL+CC K+S
Subjt:  DMRDEYRVEDVDVNRLLNCCRKYS

XP_022986281.1 uncharacterized protein LOC111484077 [Cucurbita maxima]5.2e-20173.47Show/hide
Query:  SESEPHLEA-------AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ-------------------------
        +E +P LE+       AHHPS P  ELFDISTTVDPSY+ISLIRKLLP  ASNL NS G  D D G  + TNMD+                         
Subjt:  SESEPHLEA-------AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ-------------------------

Query:  --------------EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKS
                      E ACP   +Q ISS +E VWE+YGCILWDLS++KSHA+LMV NLVLEVLSANLMVSQSVRVMEI LGIIGNLACHEVPMKHIV KS
Subjt:  --------------EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKS

Query:  GLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNL
        GLI+TIV+QL LDDAQCLCEVCRLL AGLQSSEC IWA ALNSEHVLSRILWVSENTLN QLIEKSVGLL  IIESQQE  H+LLP LMKLGLSS LFNL
Subjt:  GLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNL

Query:  FAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDI
        F+FEMKI  NERS ERYSILD ILR++EALSGI+EHSQE CSNK LFQLVC L+KLPDAFEVSSSC+SAV+LIANILSD+PDLAFDMSQDLSFLQGLLDI
Subjt:  FAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDI

Query:  FLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGS
        F FAGDDLEARDAVWSIIARILV V+E A+S+PR+FE VSLLVSKTDLIEDDLL+ R+TE NK+EDGLTS C K NSRCISL RIIAILN W ASKDEG+
Subjt:  FLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGS

Query:  DMRDEYRVEDVDVNRLLNCCRKYS
        D+RDEYR ED+DVNRLL+CC K+S
Subjt:  DMRDEYRVEDVDVNRLLNCCRKYS

TrEMBL top hitse value%identityAlignment
A0A0A0KDI1 Uncharacterized protein2.5e-20175Show/hide
Query:  AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ---------------------------------------EA
        AHHPS P  E+FDISTTVDPSY+ISLIRKLLP  ASN  NS GNG HD G  +   MD+                                       E 
Subjt:  AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ---------------------------------------EA

Query:  ACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQ
        ACP   +Q ISS +EKVWE+YGCILWDLS+++S A+LMV NLVLEVLSANLMVSQSVRVMEI LGIIGNLACHEVPMKHIV KSGLI+TIVSQL LDDAQ
Subjt:  ACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQ

Query:  CLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSGER
        CLCEVCRLL  GLQSSECVIWAEALNSEHVLSRILWVSENTLN QLIEKSVGLL  IIESQQE  H+LL  LMKLGLSSVLFNLF+FEMKI  NERS ER
Subjt:  CLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSGER

Query:  YSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAVWS
        +SILDVILR++EALSG +EHS+E+CSNK+LFQLV +L+KLPDAFEVSSSC+SAVVLIANILSDVPDLAF+MSQDLSFLQGLLDIF F GDD EARDAVWS
Subjt:  YSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAVWS

Query:  IIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGSDMRDEYRVEDVDVNRL
        IIARILVRVQEN +S+P+LFEYVSLLVSKTDLIEDDLL+  +TESNKEEDG+TS C K NSRCISLRRII+ILNHWTASKDEG+D+RDEY +EDVDVNRL
Subjt:  IIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGSDMRDEYRVEDVDVNRL

Query:  LNCCRKYS
        L CC K+S
Subjt:  LNCCRKYS

A0A1S3C8G6 uncharacterized protein LOC103497988 isoform X11.2e-19874.61Show/hide
Query:  AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ---------------------------------------EA
        AHHPS P  ELFDISTTVDPSY+ISLIRKLLP  ASN  NS  NG HD G  +   MD+                                       E 
Subjt:  AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ---------------------------------------EA

Query:  ACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQ
        AC    +Q ISS +EKVWE+YGCILWDLS+++S A+LMV NLVLEVLSANLMVSQSVRVMEI LGIIGNLACHEVPMKHIV KSGLI+TIVSQL LDDAQ
Subjt:  ACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQ

Query:  CLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSGER
        CLCEVCRLL  GLQSSECVIWAEALN EHVLSRILWVSENTLN QLIEKSVGLL  IIES QE  H LLP LMKLGLSSVLFNLF+FEMKI  NERS ER
Subjt:  CLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSGER

Query:  YSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAVWS
        +SILDVILR++E LSGI+EHS E+CSNK+LFQLV +L+KLPDAFEVSSSC+SAVVLIANILSDVPDLAF+MSQDLSFLQGL D F FAGDDLEARDAVWS
Subjt:  YSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAVWS

Query:  IIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGSDMRDEYRVEDVDVNRL
        IIARILVRVQEN +S+P+L EYVSLLVSKTDLIEDDLL+  +TESNKEEDG+TS C K NSRCISLRRII+ILNHWTASKDEG+D+RDEY VEDVDVNRL
Subjt:  IIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGSDMRDEYRVEDVDVNRL

Query:  LNCCRKYS
        L CC K+S
Subjt:  LNCCRKYS

A0A1S3C9L3 uncharacterized protein LOC103497988 isoform X48.7e-20278.38Show/hide
Query:  AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQEAACPTLD------------QQHISSQQEKVWEDYGCILWD
        AHHPS P  ELFDISTTVDPSY+ISLIRKLLP  ASN  NS  NG HD G  +   MD++ +    D            +Q ISS +EKVWE+YGCILWD
Subjt:  AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQEAACPTLD------------QQHISSQQEKVWEDYGCILWD

Query:  LSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNS
        LS+++S A+LMV NLVLEVLSANLMVSQSVRVMEI LGIIGNLACHEVPMKHIV KSGLI+TIVSQL LDDAQCLCEVCRLL  GLQSSECVIWAEALN 
Subjt:  LSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNS

Query:  EHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSN
        EHVLSRILWVSENTLN QLIEKSVGLL  IIES QE  H LLP LMKLGLSSVLFNLF+FEMKI  NERS ER+SILDVILR++E LSGI+EHS E+CSN
Subjt:  EHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSN

Query:  KDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLV
        K+LFQLV +L+KLPDAFEVSSSC+SAVVLIANILSDVPDLAF+MSQDLSFLQGL D F FAGDDLEARDAVWSIIARILVRVQEN +S+P+L EYVSLLV
Subjt:  KDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLV

Query:  SKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGSDMRDEYRVEDVDVNRLLNCCRKYS
        SKTDLIEDDLL+  +TESNKEEDG+TS C K NSRCISLRRII+ILNHWTASKDEG+D+RDEY VEDVDVNRLL CC K+S
Subjt:  SKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGSDMRDEYRVEDVDVNRLLNCCRKYS

A0A6J1FYH6 uncharacterized protein LOC111448685 isoform X13.6e-20073.47Show/hide
Query:  SESEPHLEA-------AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ-------------------------
        +E +P LE+       AHHPS P  ELFDISTTVDPSY+ISLIRKLLP  ASNL NS G  D D    + TNMD+                         
Subjt:  SESEPHLEA-------AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ-------------------------

Query:  --------------EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKS
                      E ACP   +Q ISS +E VWE+YGCILWDLS++KSHA+LMV NLVLEVLSANLMVSQSVRVMEI LGIIGNLACHEVPMKHIV KS
Subjt:  --------------EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKS

Query:  GLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNL
        GLI+ IV+QL LDDAQCLCEVCRLL AGL SSEC IWAEALNSEHVLSRILWVSENTLN QLIEKSVGLL  IIESQQE  H+LLP LMKLGLSS LFNL
Subjt:  GLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNL

Query:  FAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDI
        F+FEMKI  NERS ERYSILD ILR++EALSGI+EHSQE CSNK LFQLV  L+KLPDAFEVSSSC+SAV+LIANILSDVPDLAFDMSQDLSFLQGLLDI
Subjt:  FAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDI

Query:  FLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGS
        F FAGDDLEARDAVWSIIARILV V+E A+S+PR+FEYVSLLVSKTDLIEDDLL+ R+TE NK+EDGLTS C K NSRCISLRRIIAILN WT SKDEG+
Subjt:  FLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGS

Query:  DMRDEYRVEDVDVNRLLNCCRKYS
        D+RDEYR ED+DVNRLL+CC K+S
Subjt:  DMRDEYRVEDVDVNRLLNCCRKYS

A0A6J1J751 uncharacterized protein LOC1114840772.5e-20173.47Show/hide
Query:  SESEPHLEA-------AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ-------------------------
        +E +P LE+       AHHPS P  ELFDISTTVDPSY+ISLIRKLLP  ASNL NS G  D D G  + TNMD+                         
Subjt:  SESEPHLEA-------AHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQ-------------------------

Query:  --------------EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKS
                      E ACP   +Q ISS +E VWE+YGCILWDLS++KSHA+LMV NLVLEVLSANLMVSQSVRVMEI LGIIGNLACHEVPMKHIV KS
Subjt:  --------------EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKS

Query:  GLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNL
        GLI+TIV+QL LDDAQCLCEVCRLL AGLQSSEC IWA ALNSEHVLSRILWVSENTLN QLIEKSVGLL  IIESQQE  H+LLP LMKLGLSS LFNL
Subjt:  GLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNL

Query:  FAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDI
        F+FEMKI  NERS ERYSILD ILR++EALSGI+EHSQE CSNK LFQLVC L+KLPDAFEVSSSC+SAV+LIANILSD+PDLAFDMSQDLSFLQGLLDI
Subjt:  FAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDI

Query:  FLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGS
        F FAGDDLEARDAVWSIIARILV V+E A+S+PR+FE VSLLVSKTDLIEDDLL+ R+TE NK+EDGLTS C K NSRCISL RIIAILN W ASKDEG+
Subjt:  FLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKDEGS

Query:  DMRDEYRVEDVDVNRLLNCCRKYS
        D+RDEYR ED+DVNRLL+CC K+S
Subjt:  DMRDEYRVEDVDVNRLLNCCRKYS

SwissProt top hitse value%identityAlignment
Q6DCP5 Protein saal15.0e-0530.37Show/hide
Query:  CILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECV-IW
        C +WD+S  +  A  +      E+L   ++ S+  R+ EIC+GI+GN++C + P   I     L    +  LS  D   L E  RLL   L  +E    W
Subjt:  CILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECV-IW

Query:  AEALNSE-HVLSRILWVSENTLNSQLIEKSVGLLL
        AE       V   + ++  ++ N  L+ K VG LL
Subjt:  AEALNSE-HVLSRILWVSENTLNSQLIEKSVGLLL

Q6NVK9 Protein saal11.5e-0430.22Show/hide
Query:  EDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRL-LTAGLQSSE
        ED  C +WD+S  +  A  +      E+L   +  S+  R+ E+C+GI+GN++C   P   I T   L   ++  LS  D   L E  RL LT   Q+  
Subjt:  EDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRL-LTAGLQSSE

Query:  CVIWAEALNSE-HVLSRILWVSENTLNSQLIEKSVGLLL
           W E    +  V   + ++  ++ N  L+ K VG LL
Subjt:  CVIWAEALNSE-HVLSRILWVSENTLNSQLIEKSVGLLL

Q803M5 Protein saal19.1e-0724.15Show/hide
Query:  CILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRLLTAGL-QSSECVIW
        C +WD++  K  A  +      ++L   +  S + R+ EIC+GI+GN+AC       +   S L + ++  L  +D   L E CRLL   L Q+    +W
Subjt:  CILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRLLTAGL-QSSECVIW

Query:  AEALNSEH-VLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSS--------------VLFNLFAFEMKIFENERSGERYSILDV
         E +  +  V S + ++  ++ N  L+ K   LL  + +  +E        LMK  +S+              +L +L     ++       E    L+V
Subjt:  AEALNSEH-VLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSS--------------VLFNLFAFEMKIFENERSGERYSILDV

Query:  ILRSIEALSGIDEHSQEICSNKD----LFQLVCNLI
         L S++ L+ ++E  Q + S++     ++  VC L+
Subjt:  ILRSIEALSGIDEHSQEICSNKD----LFQLVCNLI

Q96ER3 Protein SAAL12.5e-0424.1Show/hide
Query:  DQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEV
        D++ ++   E++ E+  C +WD+S  +  A  +      ++    L  S+  R+ EIC+GI+GN+AC +     I +   L   ++  L   D   L E 
Subjt:  DQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEV

Query:  CRLLTAGLQSSECV-IWAEALNSEH--VLSRILWVSENTLNSQLIEKSVGLLL---------AIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFE
         RLL   L  +E   +W E +  EH  +   I ++  ++ N  L+ K VG ++          ++E  +  A   L    +      +F L    ++  +
Subjt:  CRLLTAGLQSSECV-IWAEALNSEH--VLSRILWVSENTLNSQLIEKSVGLLL---------AIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFE

Query:  NERSGERYSILDVILRSIEALSGIDEHSQEIC----SNKDLFQLVCNLI
          RS E    LDV +  ++ L+ +D+  Q I     + KD++ L+ +L+
Subjt:  NERSGERYSILDVILRSIEALSGIDEHSQEIC----SNKDLFQLVCNLI

Q9D2C2 Protein SAAL13.8e-0523.55Show/hide
Query:  CILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECV-IW
        C +WD+S  +  A  +      ++    L  S   R+ EIC+GI+GN+AC     + I         ++  L   D   L E CRLL   L  +E   +W
Subjt:  CILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECV-IW

Query:  AEALNSEH--VLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGL-----------SSVLFNLFAFEMKIFENERSGERYSILDVIL
           +  EH  V + + ++  ++ N  L+ K   ++  + +  ++   ++L ++ K                 +F++    ++  +  RS E    LDV +
Subjt:  AEALNSEH--VLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGL-----------SSVLFNLFAFEMKIFENERSGERYSILDVIL

Query:  RSIEALSGIDEHSQEICSNKD--------LFQLVCNLIKLPD
        R ++ L+ +D+  Q I    D        LF LVC+    PD
Subjt:  RSIEALSGIDEHSQEICSNKD--------LFQLVCNLIKLPD

Arabidopsis top hitse value%identityAlignment
AT5G22820.1 ARM repeat superfamily protein1.4e-11449.36Show/hide
Query:  ESSESEPHLEAAHHPSPPSHELFDISTTVDPSYVISLIRKLLP----------------PTASNLTNSSGNG--DHDCGTPAATNM------------DQ
        E SE+E     +HHP PP  ELFDISTTVDPSY+ISLIRKLLP                     +   SGNG  +   G P + ++            + 
Subjt:  ESSESEPHLEAAHHPSPPSHELFDISTTVDPSYVISLIRKLLP----------------PTASNLTNSSGNG--DHDCGTPAATNM------------DQ

Query:  EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDD
         ++CP    Q  SS     WED+GC+LWDL+++++HA+LMV NL+LEVL ANLMVS+S R+ EICLGII NLACHE  +KHI + +G+++T+V QL LDD
Subjt:  EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDD

Query:  AQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSG
         QCL EVCR+LT GL  + C  WA  L S+ +L  ILW++ENTLN  LIEKSVGLLL IIE Q E   +L+P LM LGL+S+L NL +FEM     ER  
Subjt:  AQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSG

Query:  ERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAV
        ERY +L++ILR+IEALS  D +S+EICS+K+LFQLVC+L+KL D  EV++SCV+  VLIAN+LS+  D   ++ +D SFL+GL     FA DD+EAR A+
Subjt:  ERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAV

Query:  WSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRII
        W++IAR+L RV E+ I+   L +Y+ +L+S  D+IEDD L+ +L +SN+  +   S  +K ++R I++   I
Subjt:  WSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRII

AT5G22820.2 ARM repeat superfamily protein7.9e-12348.14Show/hide
Query:  ESSESEPHLEAAHHPSPPSHELFDISTTVDPSYVISLIRKLLP----------------PTASNLTNSSGNG--DHDCGTPAATNM------------DQ
        E SE+E     +HHP PP  ELFDISTTVDPSY+ISLIRKLLP                     +   SGNG  +   G P + ++            + 
Subjt:  ESSESEPHLEAAHHPSPPSHELFDISTTVDPSYVISLIRKLLP----------------PTASNLTNSSGNG--DHDCGTPAATNM------------DQ

Query:  EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDD
         ++CP    Q  SS     WED+GC+LWDL+++++HA+LMV NL+LEVL ANLMVS+S R+ EICLGII NLACHE  +KHI + +G+++T+V QL LDD
Subjt:  EAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHAQLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDD

Query:  AQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSG
         QCL EVCR+LT GL  + C  WA  L S+ +L  ILW++ENTLN  LIEKSVGLLL IIE Q E   +L+P LM LGL+S+L NL +FEM     ER  
Subjt:  AQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQLIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSG

Query:  ERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAV
        ERY +L++ILR+IEALS  D +S+EICS+K+LFQLVC+L+KL D  EV++SCV+  VLIAN+LS+  D   ++ +D SFL+GL     FA DD+EAR A+
Subjt:  ERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVVLIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAV

Query:  WSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKD--EGSDMRDEYRVEDVD
        W++IAR+L RV E+ I+   L +Y+ +L+S  D+IEDD L+ +L +SN+  +   S  +K ++R I++++I +ILN+W A K+  +   +     +   D
Subjt:  WSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCISLRRIIAILNHWTASKD--EGSDMRDEYRVEDVD

Query:  VNRLLNCCRKY
        V RL +CC +Y
Subjt:  VNRLLNCCRKY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCTGGAGTCGTCAGAATCGGAGCCACACCTTGAAGCCGCTCATCACCCCTCCCCTCCATCTCACGAGTTGTTTGATATCTCTACGACGGTCGATCCTAGCTATGT
TATATCTCTCATACGGAAACTTCTACCACCTACTGCATCTAACCTAACCAATTCTTCTGGAAATGGAGATCACGACTGTGGCACGCCAGCTGCAACCAATATGGATCAAG
AAGCTGCTTGCCCTACATTGGACCAACAACATATTTCATCTCAACAAGAAAAGGTCTGGGAAGACTATGGCTGCATTCTGTGGGATCTTTCTTCGACGAAATCCCACGCA
CAACTTATGGTTAACAACCTTGTCCTTGAAGTTCTGTCTGCAAACCTTATGGTTTCTCAATCTGTGCGTGTTATGGAGATTTGCCTTGGAATTATTGGAAACCTGGCCTG
CCATGAAGTTCCCATGAAACATATAGTCACCAAGAGTGGGTTGATATCAACCATTGTGAGCCAGCTGTCTCTAGATGATGCTCAATGTTTATGTGAAGTTTGCAGGTTAT
TAACTGCGGGTCTTCAAAGTAGTGAATGTGTCATATGGGCTGAGGCTTTGAATTCTGAGCATGTTCTTTCTCGTATTCTATGGGTTTCTGAGAACACCTTAAATTCACAA
CTTATAGAAAAGAGTGTCGGGTTACTATTAGCCATCATTGAAAGTCAGCAAGAAGCCGCTCACATTCTTCTACCGTTTTTGATGAAGTTGGGTTTGTCAAGTGTTTTGTT
CAACCTTTTTGCTTTTGAGATGAAAATATTTGAAAATGAAAGATCAGGTGAAAGGTATTCAATTTTGGACGTGATTCTTCGTTCAATTGAAGCTCTCTCCGGAATTGATG
AACATTCTCAAGAAATATGTTCTAACAAAGATCTTTTTCAGCTTGTTTGTAATCTAATCAAATTGCCAGATGCATTTGAGGTTTCCAGTTCTTGTGTCAGTGCCGTGGTT
TTGATTGCAAATATTTTGTCAGATGTACCTGATCTAGCCTTTGACATGTCTCAGGATTTGTCTTTCCTACAAGGCTTACTAGATATATTCTTGTTTGCTGGGGATGATTT
AGAGGCACGTGATGCTGTTTGGAGCATCATTGCCAGGATACTGGTTCGTGTTCAAGAAAATGCGATTAGCAAACCAAGGCTATTTGAGTACGTGTCATTACTAGTGAGTA
AGACTGATCTCATCGAGGATGATCTTCTAGAACAACGTCTTACAGAATCAAATAAAGAAGAGGATGGATTGACCTCTACCTGCATGAAACCAAACTCTAGATGTATATCT
TTAAGGCGGATAATTGCTATTTTAAATCATTGGACTGCTTCAAAGGATGAGGGGTCGGACATGAGAGACGAATATCGTGTTGAAGATGTAGATGTCAATAGATTGCTGAA
TTGTTGCCGCAAATATTCTTACCTCACCATTCTACACAATCCAGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGCTGGAGTCGTCAGAATCGGAGCCACACCTTGAAGCCGCTCATCACCCCTCCCCTCCATCTCACGAGTTGTTTGATATCTCTACGACGGTCGATCCTAGCTATGT
TATATCTCTCATACGGAAACTTCTACCACCTACTGCATCTAACCTAACCAATTCTTCTGGAAATGGAGATCACGACTGTGGCACGCCAGCTGCAACCAATATGGATCAAG
AAGCTGCTTGCCCTACATTGGACCAACAACATATTTCATCTCAACAAGAAAAGGTCTGGGAAGACTATGGCTGCATTCTGTGGGATCTTTCTTCGACGAAATCCCACGCA
CAACTTATGGTTAACAACCTTGTCCTTGAAGTTCTGTCTGCAAACCTTATGGTTTCTCAATCTGTGCGTGTTATGGAGATTTGCCTTGGAATTATTGGAAACCTGGCCTG
CCATGAAGTTCCCATGAAACATATAGTCACCAAGAGTGGGTTGATATCAACCATTGTGAGCCAGCTGTCTCTAGATGATGCTCAATGTTTATGTGAAGTTTGCAGGTTAT
TAACTGCGGGTCTTCAAAGTAGTGAATGTGTCATATGGGCTGAGGCTTTGAATTCTGAGCATGTTCTTTCTCGTATTCTATGGGTTTCTGAGAACACCTTAAATTCACAA
CTTATAGAAAAGAGTGTCGGGTTACTATTAGCCATCATTGAAAGTCAGCAAGAAGCCGCTCACATTCTTCTACCGTTTTTGATGAAGTTGGGTTTGTCAAGTGTTTTGTT
CAACCTTTTTGCTTTTGAGATGAAAATATTTGAAAATGAAAGATCAGGTGAAAGGTATTCAATTTTGGACGTGATTCTTCGTTCAATTGAAGCTCTCTCCGGAATTGATG
AACATTCTCAAGAAATATGTTCTAACAAAGATCTTTTTCAGCTTGTTTGTAATCTAATCAAATTGCCAGATGCATTTGAGGTTTCCAGTTCTTGTGTCAGTGCCGTGGTT
TTGATTGCAAATATTTTGTCAGATGTACCTGATCTAGCCTTTGACATGTCTCAGGATTTGTCTTTCCTACAAGGCTTACTAGATATATTCTTGTTTGCTGGGGATGATTT
AGAGGCACGTGATGCTGTTTGGAGCATCATTGCCAGGATACTGGTTCGTGTTCAAGAAAATGCGATTAGCAAACCAAGGCTATTTGAGTACGTGTCATTACTAGTGAGTA
AGACTGATCTCATCGAGGATGATCTTCTAGAACAACGTCTTACAGAATCAAATAAAGAAGAGGATGGATTGACCTCTACCTGCATGAAACCAAACTCTAGATGTATATCT
TTAAGGCGGATAATTGCTATTTTAAATCATTGGACTGCTTCAAAGGATGAGGGGTCGGACATGAGAGACGAATATCGTGTTGAAGATGTAGATGTCAATAGATTGCTGAA
TTGTTGCCGCAAATATTCTTACCTCACCATTCTACACAATCCAGAATAG
Protein sequenceShow/hide protein sequence
MELESSESEPHLEAAHHPSPPSHELFDISTTVDPSYVISLIRKLLPPTASNLTNSSGNGDHDCGTPAATNMDQEAACPTLDQQHISSQQEKVWEDYGCILWDLSSTKSHA
QLMVNNLVLEVLSANLMVSQSVRVMEICLGIIGNLACHEVPMKHIVTKSGLISTIVSQLSLDDAQCLCEVCRLLTAGLQSSECVIWAEALNSEHVLSRILWVSENTLNSQ
LIEKSVGLLLAIIESQQEAAHILLPFLMKLGLSSVLFNLFAFEMKIFENERSGERYSILDVILRSIEALSGIDEHSQEICSNKDLFQLVCNLIKLPDAFEVSSSCVSAVV
LIANILSDVPDLAFDMSQDLSFLQGLLDIFLFAGDDLEARDAVWSIIARILVRVQENAISKPRLFEYVSLLVSKTDLIEDDLLEQRLTESNKEEDGLTSTCMKPNSRCIS
LRRIIAILNHWTASKDEGSDMRDEYRVEDVDVNRLLNCCRKYSYLTILHNPE