; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0024889 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0024889
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionWAPL domain-containing protein
Genome locationchr10:6764054..6770285
RNA-Seq ExpressionLag0024889
SyntenyLag0024889
Gene Ontology termsGO:0007063 - regulation of sister chromatid cohesion (biological process)
InterPro domainsIPR011989 - Armadillo-like helical
IPR022771 - Wings apart-like protein, C-terminal
IPR039874 - Wings apart-like protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022954611.1 uncharacterized protein LOC111456825 isoform X1 [Cucurbita moschata]0.0e+0088.17Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        MA+TIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMA EVKAPRIGHKLL LR DSD LQSTTK LDSSSSAIFSKV
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK
        EEILVSCKEIKSR IDTSTT RPELCPKWIALL IEKACLTTISLEE SGAIRKTGGDFKEKLRELGGLD+VFEVA DCHSNMEDA + N L+SL+LLLK
Subjt:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK

Query:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGE----SNRKITLSSSNS
        CLKIMENATFLSKDNQSHLLGIKRN+E +GTPQSFTEIMLN+IKILSGLYLRKSS AGL  EKS H LDGSCNTSKV AEAD      +NRKITLSSSNS
Subjt:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGE----SNRKITLSSSNS

Query:  KTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQD
        KTWCNTKST+SD SSII+QNMRSAT RL+NSLTTSGTT TSL NTSFFKM QRSSTSGSSSV SRS DTGAT LN  PVGKINHLDF EGCE  LSEDQD
Subjt:  KTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQD

Query:  PFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQ
        PFAFDEGD +PSKWELLS+KE KSRAKK VVKFRDLENG  SQ MT+EKESIGGESHH NE SCLTPF+EE FSLVADCLLTSIKVLMNLTNDN+VGCQQ
Subjt:  PFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQ

Query:  IASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPL
        IASCGG+ETMCSLIANHFPSF STSSTLNDLK+HTS L+FEP ND HLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPS HG E+GH+NVIPL
Subjt:  IASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPL

Query:  ICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAV
        ICSIFLANQ ASDGVGDGQ+LPWNEEVALLEGEKEAEKMIVEAY+ALLLAFLSTES GIRDAIVDCLP+H L+ILVPVLERF+AFHLTLNMISPETHKAV
Subjt:  ICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAV

Query:  TEVIESCRNS
        TEVIESCRNS
Subjt:  TEVIESCRNS

XP_022954612.1 uncharacterized protein LOC111456825 isoform X2 [Cucurbita moschata]0.0e+0088.67Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        MA+TIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMA EVKAPRIGHKLL LR DSD LQSTTK LDSSSSAIFSKV
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK
        EEILVSCKEIKSR IDTSTT RPELCPKWIALL IEKACLTTISLEE SGAIRKTGGDFKEKLRELGGLD+VFEVA DCHSNMEDA + N L+SL+LLLK
Subjt:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK

Query:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWC
        CLKIMENATFLSKDNQSHLLGIKRN+E +GTPQSFTEIMLN+IKILSGLYLRKSS AGL  EKS H LDGSCNTSKV AEAD  +NRKITLSSSNSKTWC
Subjt:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWC

Query:  NTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAF
        NTKST+SD SSII+QNMRSAT RL+NSLTTSGTT TSL NTSFFKM QRSSTSGSSSV SRS DTGAT LN  PVGKINHLDF EGCE  LSEDQDPFAF
Subjt:  NTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAF

Query:  DEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASC
        DEGD +PSKWELLS+KE KSRAKK VVKFRDLENG  SQ MT+EKESIGGESHH NE SCLTPF+EE FSLVADCLLTSIKVLMNLTNDN+VGCQQIASC
Subjt:  DEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASC

Query:  GGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSI
        GG+ETMCSLIANHFPSF STSSTLNDLK+HTS L+FEP ND HLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPS HG E+GH+NVIPLICSI
Subjt:  GGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSI

Query:  FLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVI
        FLANQ ASDGVGDGQ+LPWNEEVALLEGEKEAEKMIVEAY+ALLLAFLSTES GIRDAIVDCLP+H L+ILVPVLERF+AFHLTLNMISPETHKAVTEVI
Subjt:  FLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVI

Query:  ESCRNS
        ESCRNS
Subjt:  ESCRNS

XP_022994335.1 uncharacterized protein LOC111490088 isoform X1 [Cucurbita maxima]0.0e+0088.45Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        MA+TIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMA EVKAPRIGHKLL LR DSD LQSTTK LDSSSSAIFSKV
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK
        EEILVSCKEIKSR IDTSTT RPELCPKWIALL IEKACLTTISLEE SGAIRKTGGDFKEKLRELGGLD+VFEVA DCHSNMEDA + N LQSL+LLLK
Subjt:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK

Query:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGE----SNRKITLSSSNS
        CLKIMENATFLSKDNQSHLLGIKRN+E +GTPQSFTEIMLN+IKILSGLYLRKSS AGL  EKSAH LDGSCNTSKV AEAD      +NRKITLSSSNS
Subjt:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGE----SNRKITLSSSNS

Query:  KTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQD
        KTWCNTKST+ D SSII+QNMRSAT RL+NSLTTSGTT TSL NTSFFKM QRSSTSGSSSVTSRS DTGAT LN  PVGKINHLDF EGCE  LSEDQD
Subjt:  KTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQD

Query:  PFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQ
        PFAFDEGD +PSKWELLS+KE KSRAKK VVKFRDLENG  SQ MT EKESIGGESHH NE SCLTPF+EE FSLVADCLLTSIKVLMNLTNDN+VGCQQ
Subjt:  PFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQ

Query:  IASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPL
        IASCGG+ETMCSLIANHFPSF STSSTLNDLK+HTS L+FEP ND HLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPS HG E+GH+NVIPL
Subjt:  IASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPL

Query:  ICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAV
        ICSIFLANQ ASDGVGDGQ+LPWNEEVALLEGEKEAEKMIVEAY+ALLLAFLSTES GIRDAIVDCLP+H L+ILVPVLERFVAFHLTLNMISPETHKAV
Subjt:  ICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAV

Query:  TEVIESCRNS
        TEVIESCR S
Subjt:  TEVIESCRNS

XP_022994336.1 uncharacterized protein LOC111490088 isoform X2 [Cucurbita maxima]0.0e+0088.95Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        MA+TIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMA EVKAPRIGHKLL LR DSD LQSTTK LDSSSSAIFSKV
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK
        EEILVSCKEIKSR IDTSTT RPELCPKWIALL IEKACLTTISLEE SGAIRKTGGDFKEKLRELGGLD+VFEVA DCHSNMEDA + N LQSL+LLLK
Subjt:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK

Query:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWC
        CLKIMENATFLSKDNQSHLLGIKRN+E +GTPQSFTEIMLN+IKILSGLYLRKSS AGL  EKSAH LDGSCNTSKV AEAD  +NRKITLSSSNSKTWC
Subjt:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWC

Query:  NTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAF
        NTKST+ D SSII+QNMRSAT RL+NSLTTSGTT TSL NTSFFKM QRSSTSGSSSVTSRS DTGAT LN  PVGKINHLDF EGCE  LSEDQDPFAF
Subjt:  NTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAF

Query:  DEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASC
        DEGD +PSKWELLS+KE KSRAKK VVKFRDLENG  SQ MT EKESIGGESHH NE SCLTPF+EE FSLVADCLLTSIKVLMNLTNDN+VGCQQIASC
Subjt:  DEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASC

Query:  GGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSI
        GG+ETMCSLIANHFPSF STSSTLNDLK+HTS L+FEP ND HLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPS HG E+GH+NVIPLICSI
Subjt:  GGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSI

Query:  FLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVI
        FLANQ ASDGVGDGQ+LPWNEEVALLEGEKEAEKMIVEAY+ALLLAFLSTES GIRDAIVDCLP+H L+ILVPVLERFVAFHLTLNMISPETHKAVTEVI
Subjt:  FLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVI

Query:  ESCRNS
        ESCR S
Subjt:  ESCRNS

XP_023542580.1 uncharacterized protein LOC111802444 isoform X2 [Cucurbita pepo subsp. pepo]0.0e+0088.67Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        MA+TIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMA EVKAPRIGHKLL LR DSD LQSTTK LDSSSSAIFSKV
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK
        EEILVSCKEIKSR IDTSTT RPELCPKWIALL IEKACLTTISLEE SGAIRKTGGDFKEKLRELGGLD+VFEVA DCHSNMEDA + N LQSL+LLLK
Subjt:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK

Query:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWC
        CLKIMENATFLSKDNQSHLLGIKRN+E +GTPQSFTEIMLN+IKILSGLYLRKSS AGL  EKSAH LDGSCNTSKV AEAD  +NRKITLSSSNSKTWC
Subjt:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWC

Query:  NTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAF
        N KST+SD SSII+QNMRSAT RL+NSLTTSGTT TSL NTSFFKM QRSSTSGSSSVTSRS DTGAT LN  P GKINH DF+EGCE  LSEDQDPFAF
Subjt:  NTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAF

Query:  DEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASC
        DEGD +PSKWELLS+KE+KSRAKK VVKFRDLENG  SQ MT EKESIGGESHH NE SCLTPF+EE FSLVADCLLTSIKVLMNLTNDN+ GCQQIASC
Subjt:  DEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASC

Query:  GGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSI
        GG+ETMCSLIANHFPSF STSSTLNDLKVHTS L+FEP ND HLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPS HG E+G++NVIPLICSI
Subjt:  GGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSI

Query:  FLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVI
        FLANQ ASDGVGDGQ+LPWNEEVALLEGEKEAEKMIVEAY+ALLLAFLSTES GIRDAIVDCLP+H L+ILVPVLERFVAFHLTLNMISPETHKAVTEVI
Subjt:  FLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVI

Query:  ESCRNS
        ESCRNS
Subjt:  ESCRNS

TrEMBL top hitse value%identityAlignment
A0A6J1GRC5 uncharacterized protein LOC111456825 isoform X10.0e+0088.17Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        MA+TIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMA EVKAPRIGHKLL LR DSD LQSTTK LDSSSSAIFSKV
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK
        EEILVSCKEIKSR IDTSTT RPELCPKWIALL IEKACLTTISLEE SGAIRKTGGDFKEKLRELGGLD+VFEVA DCHSNMEDA + N L+SL+LLLK
Subjt:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK

Query:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGE----SNRKITLSSSNS
        CLKIMENATFLSKDNQSHLLGIKRN+E +GTPQSFTEIMLN+IKILSGLYLRKSS AGL  EKS H LDGSCNTSKV AEAD      +NRKITLSSSNS
Subjt:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGE----SNRKITLSSSNS

Query:  KTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQD
        KTWCNTKST+SD SSII+QNMRSAT RL+NSLTTSGTT TSL NTSFFKM QRSSTSGSSSV SRS DTGAT LN  PVGKINHLDF EGCE  LSEDQD
Subjt:  KTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQD

Query:  PFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQ
        PFAFDEGD +PSKWELLS+KE KSRAKK VVKFRDLENG  SQ MT+EKESIGGESHH NE SCLTPF+EE FSLVADCLLTSIKVLMNLTNDN+VGCQQ
Subjt:  PFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQ

Query:  IASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPL
        IASCGG+ETMCSLIANHFPSF STSSTLNDLK+HTS L+FEP ND HLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPS HG E+GH+NVIPL
Subjt:  IASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPL

Query:  ICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAV
        ICSIFLANQ ASDGVGDGQ+LPWNEEVALLEGEKEAEKMIVEAY+ALLLAFLSTES GIRDAIVDCLP+H L+ILVPVLERF+AFHLTLNMISPETHKAV
Subjt:  ICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAV

Query:  TEVIESCRNS
        TEVIESCRNS
Subjt:  TEVIESCRNS

A0A6J1GRE8 uncharacterized protein LOC111456825 isoform X20.0e+0088.67Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        MA+TIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMA EVKAPRIGHKLL LR DSD LQSTTK LDSSSSAIFSKV
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK
        EEILVSCKEIKSR IDTSTT RPELCPKWIALL IEKACLTTISLEE SGAIRKTGGDFKEKLRELGGLD+VFEVA DCHSNMEDA + N L+SL+LLLK
Subjt:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK

Query:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWC
        CLKIMENATFLSKDNQSHLLGIKRN+E +GTPQSFTEIMLN+IKILSGLYLRKSS AGL  EKS H LDGSCNTSKV AEAD  +NRKITLSSSNSKTWC
Subjt:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWC

Query:  NTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAF
        NTKST+SD SSII+QNMRSAT RL+NSLTTSGTT TSL NTSFFKM QRSSTSGSSSV SRS DTGAT LN  PVGKINHLDF EGCE  LSEDQDPFAF
Subjt:  NTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAF

Query:  DEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASC
        DEGD +PSKWELLS+KE KSRAKK VVKFRDLENG  SQ MT+EKESIGGESHH NE SCLTPF+EE FSLVADCLLTSIKVLMNLTNDN+VGCQQIASC
Subjt:  DEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASC

Query:  GGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSI
        GG+ETMCSLIANHFPSF STSSTLNDLK+HTS L+FEP ND HLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPS HG E+GH+NVIPLICSI
Subjt:  GGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSI

Query:  FLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVI
        FLANQ ASDGVGDGQ+LPWNEEVALLEGEKEAEKMIVEAY+ALLLAFLSTES GIRDAIVDCLP+H L+ILVPVLERF+AFHLTLNMISPETHKAVTEVI
Subjt:  FLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVI

Query:  ESCRNS
        ESCRNS
Subjt:  ESCRNS

A0A6J1JYU8 uncharacterized protein LOC111490088 isoform X20.0e+0088.95Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        MA+TIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMA EVKAPRIGHKLL LR DSD LQSTTK LDSSSSAIFSKV
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK
        EEILVSCKEIKSR IDTSTT RPELCPKWIALL IEKACLTTISLEE SGAIRKTGGDFKEKLRELGGLD+VFEVA DCHSNMEDA + N LQSL+LLLK
Subjt:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK

Query:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWC
        CLKIMENATFLSKDNQSHLLGIKRN+E +GTPQSFTEIMLN+IKILSGLYLRKSS AGL  EKSAH LDGSCNTSKV AEAD  +NRKITLSSSNSKTWC
Subjt:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWC

Query:  NTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAF
        NTKST+ D SSII+QNMRSAT RL+NSLTTSGTT TSL NTSFFKM QRSSTSGSSSVTSRS DTGAT LN  PVGKINHLDF EGCE  LSEDQDPFAF
Subjt:  NTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAF

Query:  DEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASC
        DEGD +PSKWELLS+KE KSRAKK VVKFRDLENG  SQ MT EKESIGGESHH NE SCLTPF+EE FSLVADCLLTSIKVLMNLTNDN+VGCQQIASC
Subjt:  DEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASC

Query:  GGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSI
        GG+ETMCSLIANHFPSF STSSTLNDLK+HTS L+FEP ND HLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPS HG E+GH+NVIPLICSI
Subjt:  GGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSI

Query:  FLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVI
        FLANQ ASDGVGDGQ+LPWNEEVALLEGEKEAEKMIVEAY+ALLLAFLSTES GIRDAIVDCLP+H L+ILVPVLERFVAFHLTLNMISPETHKAVTEVI
Subjt:  FLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVI

Query:  ESCRNS
        ESCR S
Subjt:  ESCRNS

A0A6J1K0X5 uncharacterized protein LOC111490088 isoform X10.0e+0088.45Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        MA+TIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMA EVKAPRIGHKLL LR DSD LQSTTK LDSSSSAIFSKV
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK
        EEILVSCKEIKSR IDTSTT RPELCPKWIALL IEKACLTTISLEE SGAIRKTGGDFKEKLRELGGLD+VFEVA DCHSNMEDA + N LQSL+LLLK
Subjt:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK

Query:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGE----SNRKITLSSSNS
        CLKIMENATFLSKDNQSHLLGIKRN+E +GTPQSFTEIMLN+IKILSGLYLRKSS AGL  EKSAH LDGSCNTSKV AEAD      +NRKITLSSSNS
Subjt:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGE----SNRKITLSSSNS

Query:  KTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQD
        KTWCNTKST+ D SSII+QNMRSAT RL+NSLTTSGTT TSL NTSFFKM QRSSTSGSSSVTSRS DTGAT LN  PVGKINHLDF EGCE  LSEDQD
Subjt:  KTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQD

Query:  PFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQ
        PFAFDEGD +PSKWELLS+KE KSRAKK VVKFRDLENG  SQ MT EKESIGGESHH NE SCLTPF+EE FSLVADCLLTSIKVLMNLTNDN+VGCQQ
Subjt:  PFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQ

Query:  IASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPL
        IASCGG+ETMCSLIANHFPSF STSSTLNDLK+HTS L+FEP ND HLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPS HG E+GH+NVIPL
Subjt:  IASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPL

Query:  ICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAV
        ICSIFLANQ ASDGVGDGQ+LPWNEEVALLEGEKEAEKMIVEAY+ALLLAFLSTES GIRDAIVDCLP+H L+ILVPVLERFVAFHLTLNMISPETHKAV
Subjt:  ICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAV

Query:  TEVIESCRNS
        TEVIESCR S
Subjt:  TEVIESCRNS

A0A6J1KJD1 uncharacterized protein LOC111495758 isoform X20.0e+0087.82Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        MA+TIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPN VSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSD LQST  RLDSSSSAI SKV
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK
        EEILVSCKEIKSR+ D  T  RPELCPKWIALLTIEKACLTTISLEE SGA+RK GGDFKEKLRELGGLD+VFEVA DCHSN+EDA Y NFLQSLMLLLK
Subjt:  EEILVSCKEIKSRTIDTSTT-RPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLK

Query:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWC
        CLKIMENATFLSK+NQSHLLGIKRN+EG+GTPQSFTEIMLNVIKILSGLYLRKSSAAGL  EK A  +DGS  TSK+ AEAD E+NRKITL SSN KTWC
Subjt:  CLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWC

Query:  NTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAF
        NTKST SDKSSII+QNMRSATARL+N+LT SGTTSTSL N+SFFKMRQR  TSGSSSVTSRSTD GAT LN  PV K NH D    CE  LSEDQDPFAF
Subjt:  NTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAF

Query:  DEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASC
        DEGD EPSKWELLSQKEKKSRAKKGVVKFRDLENG KSQ MT EKESI GESH  NEIS L  FNEEGF+LVADCLLTSIKVLMNLTNDNHVGCQQIASC
Subjt:  DEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASC

Query:  GGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSI
        GGLETMCSLIANHFPSF STSSTLN LK HT SLEFE QN+KHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPS+HGPEKGH+NVIPLICSI
Subjt:  GGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSI

Query:  FLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVI
        FLANQGAS+GVG+G++LPWNEEVALLEGEKEAEKMIVEAY+ALLLAFLSTESQGIRDAIVDCLPDH+LAILVPVLERFVAFHLTLNMISPETHK VTEVI
Subjt:  FLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVI

Query:  ESCRNS
        ESCR S
Subjt:  ESCRNS

SwissProt top hitse value%identityAlignment
F4I7C7 Wings apart-like protein 18.9e-16548.91Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        ++++IIDA+L LS DD  SNLAAATLF+ LT DGQD+H +ESP C+ FLIKLLKP++  + E K   IG KLL L  D D  +   K  D SSS I S+V
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIK-SRTIDTSTTRPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNME-----------DAGYG
        +E+LV+CKE++ + +  T TTRPEL  KW+ALL +E+AC++ IS ++TSG+++KTGG+FKEKLRELGGLD+V EV  DCH+ ME           +    
Subjt:  EEILVSCKEIK-SRTIDTSTTRPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNME-----------DAGYG

Query:  NFLQSLMLLLKCLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKI
           QSLMLLLKCLKIMENATFLS DNQ+HLLG K+ +    +  SFTE+ ++VIK+LSGL+LR   ++  T   ++H+ +G  + S +      E+NRK+
Subjt:  NFLQSLMLLLKCLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKI

Query:  T--LSSSNSKTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGS--SSVTSRSTDTGATTLNIPPVGKINHLDFSE
        T  + + +S T+    S S+   S+  ++          +  +   +S S    +  K R  S+ SGS    + S  +D   TTL     G+     F E
Subjt:  T--LSSSNSKTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGS--SSVTSRSTDTGATTLNIPPVGKINHLDFSE

Query:  GCEPTLSEDQDPFAFDEGDFEPSKWELLSQKEKKSRA--KKGVVK----------FRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVA
           P   E +DPFAFD  D++PSKW ++S  +KKSRA  KKG  K          F   E     +  + E+ S    S  L    C    +EE   L+ 
Subjt:  GCEPTLSEDQDPFAFDEGDFEPSKWELLSQKEKKSRA--KKGVVK----------FRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVA

Query:  DCLLTSIKVLMNLTNDNHVGCQQIASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRL
        DCLLT++KVLMNLTNDN VGC+Q+  C GLE+M  LIA HFPSF + S   ++++   SS     + DK+LTDQELDFLVAILGLLVNLVE+DG NRSRL
Subjt:  DCLLTSIKVLMNLTNDNHVGCQQIASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRL

Query:  ASASVLIPSLHGPEKGHTNVIPLICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVP
        ASASV I      ++    +IPL+CSIFL NQG+++   +      ++E A+LEGEKEAEKMIVEAY+ALLLAFLSTES+ IR++I D LP  NLAILVP
Subjt:  ASASVLIPSLHGPEKGHTNVIPLICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVP

Query:  VLERFVAFHLTLNMISPETHKAVTEVIESCRN
        VLERFVAFH+TLNMI PETHKAV  VIESC++
Subjt:  VLERFVAFHLTLNMISPETHKAVTEVIESCRN

Q65Z40 Wings apart-like protein homolog3.2e-0531.91Show/hide
Query:  LTIEKACLTTISLEETSGAIRKTGGD-FKEKLRELGGLDSVFEVANDC--HSNMEDAGYGNFLQSLMLLLKCLKIMENATFLSKDNQSHLLGIK
        L +E      +++E       K  GD FKE+LR LGGLD + +   +C  H + +D      + SL    +CL+++E+ T  + +NQS+L+  K
Subjt:  LTIEKACLTTISLEETSGAIRKTGGD-FKEKLRELGGLDSVFEVANDC--HSNMEDAGYGNFLQSLMLLLKCLKIMENATFLSKDNQSHLLGIK

Q7Z5K2 Wings apart-like protein homolog4.2e-0525.52Show/hide
Query:  VADCLLTSIKVLMNLTNDNHVGCQQIASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHL-TDQELDFLVAILGLLVNLVEKDGHNR
        V DC+   I VL+NLTNDN                          +GST +   D  + T +L    Q  K+L  +Q  D  V  LGLL+NLVE    NR
Subjt:  VADCLLTSIKVLMNLTNDNHVGCQQIASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHL-TDQELDFLVAILGLLVNLVEKDGHNR

Query:  SRL------ASASVLIPSLHGPEK----GHTNVIPLICSIFLANQGAS-------------------DGVGDGQ----ALPW------------------
          L       S    I S  G +     G  + +  +  +FL  + A+                   D  G+ Q     + W                  
Subjt:  SRL------ASASVLIPSLHGPEK----GHTNVIPLICSIFLANQGAS-------------------DGVGDGQ----ALPW------------------

Query:  NEEV----ALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVIE
        +EE+    AL    K  E  IV +Y ALLL  L  ES      + + LP+ + +I+  +L++F++F      +     K+++ VIE
Subjt:  NEEV----ALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVIE

Q9C951 Wings apart-like protein 29.3e-15446.08Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        ++++IIDA+LGL  DD  SNLAAATLF++LT DGQDDH +ESPN + FL+KLL+P++S + +VK   IG +LL +  D D  +      D SS  I  + 
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIKSRTIDT---STTRPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFL------
        +EILV+CKE+  R ID+      RPEL  KW+ALL +EKACL+ IS ++TSG ++K+GG FKEKLRELGGLD+VF+V  DCH+ ME     + L      
Subjt:  EEILVSCKEIKSRTIDT---STTRPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFL------

Query:  -----QSLMLLLKCLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAH---HLDGSCN-------TSKV
             QSLMLLLKCLKIMENATFLS +NQ HLL + +++    +  SFTE+M++VIKILSGL LR         EK  H   HL  +         +S  
Subjt:  -----QSLMLLLKCLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAH---HLDGSCN-------TSKV

Query:  CAEADGESNRKITLSSSNSKTW---CNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPP
        C+     S + +++S  N   +   C+T      +SS++            +++     T+T+ +NT  F  R  S  SG S   +R++ T  ++     
Subjt:  CAEADGESNRKITLSSSNSKTW---CNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPP

Query:  VGKINHLDFSEGCEPTLSEDQDPFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGG---------ESHHLNEISCLTPFN
          K+ +         +  + QDPF+FD  D  PS+W +  QK+ K + +KG   +RD ++    Q  ++++ES  G           HH+ E   LT   
Subjt:  VGKINHLDFSEGCEPTLSEDQDPFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGG---------ESHHLNEISCLTPFN

Query:  EEG-FSLVADCLLTSIKVLMNLTNDNHVGCQQIASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVE
        ++G   L++DCLLT++KVLMNLTN N VGC+++A+CGGLE+M  L+  HFPSF + S   + ++  T       Q DKHLTDQELDFLVAILGLLVNLVE
Subjt:  EEG-FSLVADCLLTSIKVLMNLTNDNHVGCQQIASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVE

Query:  KDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLP
        K+G NRSRLA+ASV I +  G +    ++IPL+CSIFL N+G++D   +      ++E A+LE EKEAEKMIVEAY+ALLLAFLSTES+ IR+AI D LP
Subjt:  KDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLP

Query:  DHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVIESCR
          ++AILVPVL+RFVAFH TL+MI PETHK V EVIESC+
Subjt:  DHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVIESCR

Arabidopsis top hitse value%identityAlignment
AT1G11060.1 WAPL (Wings apart-like protein regulation of heterochromatin) protein6.4e-16648.91Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        ++++IIDA+L LS DD  SNLAAATLF+ LT DGQD+H +ESP C+ FLIKLLKP++  + E K   IG KLL L  D D  +   K  D SSS I S+V
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIK-SRTIDTSTTRPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNME-----------DAGYG
        +E+LV+CKE++ + +  T TTRPEL  KW+ALL +E+AC++ IS ++TSG+++KTGG+FKEKLRELGGLD+V EV  DCH+ ME           +    
Subjt:  EEILVSCKEIK-SRTIDTSTTRPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNME-----------DAGYG

Query:  NFLQSLMLLLKCLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKI
           QSLMLLLKCLKIMENATFLS DNQ+HLLG K+ +    +  SFTE+ ++VIK+LSGL+LR   ++  T   ++H+ +G  + S +      E+NRK+
Subjt:  NFLQSLMLLLKCLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKI

Query:  T--LSSSNSKTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGS--SSVTSRSTDTGATTLNIPPVGKINHLDFSE
        T  + + +S T+    S S+   S+  ++          +  +   +S S    +  K R  S+ SGS    + S  +D   TTL     G+     F E
Subjt:  T--LSSSNSKTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGS--SSVTSRSTDTGATTLNIPPVGKINHLDFSE

Query:  GCEPTLSEDQDPFAFDEGDFEPSKWELLSQKEKKSRA--KKGVVK----------FRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVA
           P   E +DPFAFD  D++PSKW ++S  +KKSRA  KKG  K          F   E     +  + E+ S    S  L    C    +EE   L+ 
Subjt:  GCEPTLSEDQDPFAFDEGDFEPSKWELLSQKEKKSRA--KKGVVK----------FRDLENGCKSQAMTNEKESIGGESHHLNEISCLTPFNEEGFSLVA

Query:  DCLLTSIKVLMNLTNDNHVGCQQIASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRL
        DCLLT++KVLMNLTNDN VGC+Q+  C GLE+M  LIA HFPSF + S   ++++   SS     + DK+LTDQELDFLVAILGLLVNLVE+DG NRSRL
Subjt:  DCLLTSIKVLMNLTNDNHVGCQQIASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVEKDGHNRSRL

Query:  ASASVLIPSLHGPEKGHTNVIPLICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVP
        ASASV I      ++    +IPL+CSIFL NQG+++   +      ++E A+LEGEKEAEKMIVEAY+ALLLAFLSTES+ IR++I D LP  NLAILVP
Subjt:  ASASVLIPSLHGPEKGHTNVIPLICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLPDHNLAILVP

Query:  VLERFVAFHLTLNMISPETHKAVTEVIESCRN
        VLERFVAFH+TLNMI PETHKAV  VIESC++
Subjt:  VLERFVAFHLTLNMISPETHKAVTEVIESCRN

AT1G61030.1 WAPL (Wings apart-like protein regulation of heterochromatin) protein6.6e-15546.08Show/hide
Query:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV
        ++++IIDA+LGL  DD  SNLAAATLF++LT DGQDDH +ESPN + FL+KLL+P++S + +VK   IG +LL +  D D  +      D SS  I  + 
Subjt:  MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKV

Query:  EEILVSCKEIKSRTIDT---STTRPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFL------
        +EILV+CKE+  R ID+      RPEL  KW+ALL +EKACL+ IS ++TSG ++K+GG FKEKLRELGGLD+VF+V  DCH+ ME     + L      
Subjt:  EEILVSCKEIKSRTIDT---STTRPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFL------

Query:  -----QSLMLLLKCLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAH---HLDGSCN-------TSKV
             QSLMLLLKCLKIMENATFLS +NQ HLL + +++    +  SFTE+M++VIKILSGL LR         EK  H   HL  +         +S  
Subjt:  -----QSLMLLLKCLKIMENATFLSKDNQSHLLGIKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAH---HLDGSCN-------TSKV

Query:  CAEADGESNRKITLSSSNSKTW---CNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPP
        C+     S + +++S  N   +   C+T      +SS++            +++     T+T+ +NT  F  R  S  SG S   +R++ T  ++     
Subjt:  CAEADGESNRKITLSSSNSKTW---CNTKSTSSDKSSIIAQNMRSATARLENSLTTSGTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPP

Query:  VGKINHLDFSEGCEPTLSEDQDPFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGG---------ESHHLNEISCLTPFN
          K+ +         +  + QDPF+FD  D  PS+W +  QK+ K + +KG   +RD ++    Q  ++++ES  G           HH+ E   LT   
Subjt:  VGKINHLDFSEGCEPTLSEDQDPFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAMTNEKESIGG---------ESHHLNEISCLTPFN

Query:  EEG-FSLVADCLLTSIKVLMNLTNDNHVGCQQIASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVE
        ++G   L++DCLLT++KVLMNLTN N VGC+++A+CGGLE+M  L+  HFPSF + S   + ++  T       Q DKHLTDQELDFLVAILGLLVNLVE
Subjt:  EEG-FSLVADCLLTSIKVLMNLTNDNHVGCQQIASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDFLVAILGLLVNLVE

Query:  KDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLP
        K+G NRSRLA+ASV I +  G +    ++IPL+CSIFL N+G++D   +      ++E A+LE EKEAEKMIVEAY+ALLLAFLSTES+ IR+AI D LP
Subjt:  KDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVDCLP

Query:  DHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVIESCR
          ++AILVPVL+RFVAFH TL+MI PETHK V EVIESC+
Subjt:  DHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVIESCR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAAGACAATAATTGATGCTGTTTTAGGTCTTAGCTTTGACGACTCAGCCAGCAATCTAGCTGCCGCAACTCTTTTTTACATTTTGACGGGTGATGGACAAGATGA
TCACCTTCTGGAATCACCAAATTGTGTTAGTTTTTTAATTAAATTGTTGAAACCAATTCTCTCTATGGCTGCTGAAGTGAAAGCACCGAGAATTGGCCATAAGCTTTTAG
TACTTCGAACAGATTCTGACACTCTACAAAGTACAACAAAAAGATTGGACTCCAGTTCTTCTGCAATTTTTTCAAAAGTTGAAGAAATTCTTGTAAGTTGCAAGGAAATA
AAATCAAGAACCATAGACACCAGCACAACTAGGCCAGAATTGTGTCCAAAATGGATTGCATTACTGACTATCGAGAAAGCTTGCTTGACTACCATTTCCCTTGAAGAAAC
ATCTGGTGCTATAAGAAAAACTGGAGGCGACTTCAAGGAAAAATTGCGAGAGCTAGGAGGACTTGACTCAGTCTTTGAGGTTGCCAATGATTGCCATTCCAATATGGAGG
ATGCAGGATATGGAAACTTTCTGCAGAGCCTGATGCTGCTTTTGAAATGCTTAAAGATAATGGAAAATGCCACATTCCTTAGTAAAGATAACCAGAGTCATTTGCTTGGA
ATTAAAAGAAATGTGGAGGGTCGAGGAACACCACAATCTTTCACTGAAATCATGTTAAATGTCATCAAGATTCTTTCGGGTCTCTATTTACGCAAAAGTTCTGCTGCTGG
GTTAACTACTGAGAAGTCAGCTCATCACCTTGATGGGTCTTGTAATACTTCCAAAGTGTGTGCTGAGGCAGATGGCGAATCAAACAGAAAGATAACTCTATCAAGCAGTA
ATTCAAAGACATGGTGCAACACCAAGAGTACCTCATCTGACAAGAGCTCTATTATAGCCCAGAACATGAGGAGTGCCACTGCTCGGTTAGAAAATTCTCTAACAACTTCT
GGAACTACTAGCACTTCATTGGCAAATACCAGTTTCTTCAAGATGAGACAAAGATCTTCCACATCTGGTTCATCCAGTGTAACATCAAGAAGTACTGATACTGGAGCAAC
TACATTGAATATTCCACCTGTGGGAAAAATTAATCATCTTGATTTCTCAGAAGGTTGTGAGCCTACCCTTTCAGAGGACCAGGATCCTTTTGCTTTTGACGAGGGTGATT
TCGAACCCTCTAAATGGGAGTTACTTTCACAGAAAGAGAAGAAATCTCGGGCTAAAAAAGGGGTGGTCAAATTTAGAGATCTCGAGAATGGATGTAAATCTCAGGCGATG
ACGAATGAGAAAGAATCAATTGGTGGAGAAAGTCATCACTTGAATGAAATTTCATGCTTAACACCCTTTAATGAGGAGGGATTCAGTCTAGTAGCTGACTGCCTTCTTAC
TTCCATCAAGGTTTTGATGAACTTGACCAATGACAATCATGTTGGCTGTCAACAAATTGCTTCCTGTGGAGGATTGGAAACTATGTGTTCACTGATTGCCAACCATTTTC
CTTCATTCGGCTCCACTTCATCCACCTTAAATGACTTAAAAGTGCATACATCAAGTCTTGAATTTGAGCCTCAGAACGACAAGCATCTAACTGATCAAGAGCTTGATTTT
CTTGTTGCGATTTTGGGCCTGCTTGTGAACTTGGTGGAGAAGGATGGTCATAACAGATCACGGCTTGCTTCAGCCAGTGTTTTGATACCTAGCTTACATGGACCAGAAAA
GGGTCATACCAACGTAATTCCACTAATATGTTCCATCTTTCTGGCCAACCAAGGAGCAAGCGACGGAGTTGGAGACGGGCAGGCTTTGCCTTGGAATGAGGAGGTAGCTC
TTCTGGAAGGTGAAAAGGAAGCAGAAAAAATGATTGTTGAAGCTTATGCAGCACTACTTCTAGCATTTCTTTCAACCGAAAGCCAGGGCATACGCGATGCAATCGTCGAC
TGTCTTCCAGATCACAACCTAGCAATTCTCGTGCCAGTTTTGGAGCGATTTGTGGCATTTCATTTGACATTGAACATGATTTCTCCGGAGACACATAAAGCCGTAACCGA
AGTGATTGAATCATGTAGAAATTCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAAGACAATAATTGATGCTGTTTTAGGTCTTAGCTTTGACGACTCAGCCAGCAATCTAGCTGCCGCAACTCTTTTTTACATTTTGACGGGTGATGGACAAGATGA
TCACCTTCTGGAATCACCAAATTGTGTTAGTTTTTTAATTAAATTGTTGAAACCAATTCTCTCTATGGCTGCTGAAGTGAAAGCACCGAGAATTGGCCATAAGCTTTTAG
TACTTCGAACAGATTCTGACACTCTACAAAGTACAACAAAAAGATTGGACTCCAGTTCTTCTGCAATTTTTTCAAAAGTTGAAGAAATTCTTGTAAGTTGCAAGGAAATA
AAATCAAGAACCATAGACACCAGCACAACTAGGCCAGAATTGTGTCCAAAATGGATTGCATTACTGACTATCGAGAAAGCTTGCTTGACTACCATTTCCCTTGAAGAAAC
ATCTGGTGCTATAAGAAAAACTGGAGGCGACTTCAAGGAAAAATTGCGAGAGCTAGGAGGACTTGACTCAGTCTTTGAGGTTGCCAATGATTGCCATTCCAATATGGAGG
ATGCAGGATATGGAAACTTTCTGCAGAGCCTGATGCTGCTTTTGAAATGCTTAAAGATAATGGAAAATGCCACATTCCTTAGTAAAGATAACCAGAGTCATTTGCTTGGA
ATTAAAAGAAATGTGGAGGGTCGAGGAACACCACAATCTTTCACTGAAATCATGTTAAATGTCATCAAGATTCTTTCGGGTCTCTATTTACGCAAAAGTTCTGCTGCTGG
GTTAACTACTGAGAAGTCAGCTCATCACCTTGATGGGTCTTGTAATACTTCCAAAGTGTGTGCTGAGGCAGATGGCGAATCAAACAGAAAGATAACTCTATCAAGCAGTA
ATTCAAAGACATGGTGCAACACCAAGAGTACCTCATCTGACAAGAGCTCTATTATAGCCCAGAACATGAGGAGTGCCACTGCTCGGTTAGAAAATTCTCTAACAACTTCT
GGAACTACTAGCACTTCATTGGCAAATACCAGTTTCTTCAAGATGAGACAAAGATCTTCCACATCTGGTTCATCCAGTGTAACATCAAGAAGTACTGATACTGGAGCAAC
TACATTGAATATTCCACCTGTGGGAAAAATTAATCATCTTGATTTCTCAGAAGGTTGTGAGCCTACCCTTTCAGAGGACCAGGATCCTTTTGCTTTTGACGAGGGTGATT
TCGAACCCTCTAAATGGGAGTTACTTTCACAGAAAGAGAAGAAATCTCGGGCTAAAAAAGGGGTGGTCAAATTTAGAGATCTCGAGAATGGATGTAAATCTCAGGCGATG
ACGAATGAGAAAGAATCAATTGGTGGAGAAAGTCATCACTTGAATGAAATTTCATGCTTAACACCCTTTAATGAGGAGGGATTCAGTCTAGTAGCTGACTGCCTTCTTAC
TTCCATCAAGGTTTTGATGAACTTGACCAATGACAATCATGTTGGCTGTCAACAAATTGCTTCCTGTGGAGGATTGGAAACTATGTGTTCACTGATTGCCAACCATTTTC
CTTCATTCGGCTCCACTTCATCCACCTTAAATGACTTAAAAGTGCATACATCAAGTCTTGAATTTGAGCCTCAGAACGACAAGCATCTAACTGATCAAGAGCTTGATTTT
CTTGTTGCGATTTTGGGCCTGCTTGTGAACTTGGTGGAGAAGGATGGTCATAACAGATCACGGCTTGCTTCAGCCAGTGTTTTGATACCTAGCTTACATGGACCAGAAAA
GGGTCATACCAACGTAATTCCACTAATATGTTCCATCTTTCTGGCCAACCAAGGAGCAAGCGACGGAGTTGGAGACGGGCAGGCTTTGCCTTGGAATGAGGAGGTAGCTC
TTCTGGAAGGTGAAAAGGAAGCAGAAAAAATGATTGTTGAAGCTTATGCAGCACTACTTCTAGCATTTCTTTCAACCGAAAGCCAGGGCATACGCGATGCAATCGTCGAC
TGTCTTCCAGATCACAACCTAGCAATTCTCGTGCCAGTTTTGGAGCGATTTGTGGCATTTCATTTGACATTGAACATGATTTCTCCGGAGACACATAAAGCCGTAACCGA
AGTGATTGAATCATGTAGAAATTCCTGA
Protein sequenceShow/hide protein sequence
MAKTIIDAVLGLSFDDSASNLAAATLFYILTGDGQDDHLLESPNCVSFLIKLLKPILSMAAEVKAPRIGHKLLVLRTDSDTLQSTTKRLDSSSSAIFSKVEEILVSCKEI
KSRTIDTSTTRPELCPKWIALLTIEKACLTTISLEETSGAIRKTGGDFKEKLRELGGLDSVFEVANDCHSNMEDAGYGNFLQSLMLLLKCLKIMENATFLSKDNQSHLLG
IKRNVEGRGTPQSFTEIMLNVIKILSGLYLRKSSAAGLTTEKSAHHLDGSCNTSKVCAEADGESNRKITLSSSNSKTWCNTKSTSSDKSSIIAQNMRSATARLENSLTTS
GTTSTSLANTSFFKMRQRSSTSGSSSVTSRSTDTGATTLNIPPVGKINHLDFSEGCEPTLSEDQDPFAFDEGDFEPSKWELLSQKEKKSRAKKGVVKFRDLENGCKSQAM
TNEKESIGGESHHLNEISCLTPFNEEGFSLVADCLLTSIKVLMNLTNDNHVGCQQIASCGGLETMCSLIANHFPSFGSTSSTLNDLKVHTSSLEFEPQNDKHLTDQELDF
LVAILGLLVNLVEKDGHNRSRLASASVLIPSLHGPEKGHTNVIPLICSIFLANQGASDGVGDGQALPWNEEVALLEGEKEAEKMIVEAYAALLLAFLSTESQGIRDAIVD
CLPDHNLAILVPVLERFVAFHLTLNMISPETHKAVTEVIESCRNS