; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy2G017505 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy2G017505
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionCAAX amino terminal protease
Genome locationGy14Chr2:27523011..27525834
RNA-Seq ExpressionCsGy2G017505
SyntenyCsGy2G017505
Gene Ontology termsGO:0071586 - CAAX-box protein processing (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004222 - metalloendopeptidase activity (molecular function)
InterPro domainsIPR003675 - Type II CAAX prenyl endopeptidase Rce1-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004143052.1 uncharacterized protein LOC101207590 isoform X1 [Cucumis sativus]4.21e-171100Show/hide
Query:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN
        MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN
Subjt:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN

Query:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP
        YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP
Subjt:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP

Query:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW
        LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW
Subjt:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW

XP_008444402.1 PREDICTED: uncharacterized protein LOC103487740 isoform X1 [Cucumis melo]3.23e-15292.19Show/hide
Query:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN
        MNLL  NYRC ST+A STFN FTWRNS FMGRKGIGL NL+RVLPAGLS RSNVK KV AKRKSARRLERNREE SITSSSADDNAQEVKMNSSDSSPKN
Subjt:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN

Query:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP
        +LINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLV LISSSR  LLK WPDFAESSEAANRQVLTSL+P
Subjt:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP

Query:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW
        LDYAVVA LPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW
Subjt:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW

XP_011649506.1 uncharacterized protein LOC101207590 isoform X2 [Cucumis sativus]6.53e-145100Show/hide
Query:  KGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKNYLINISSRSSVLQACIITSGLIAALGVIIRQV
        KGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKNYLINISSRSSVLQACIITSGLIAALGVIIRQV
Subjt:  KGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKNYLINISSRSSVLQACIITSGLIAALGVIIRQV

Query:  SHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQPLDYAVVAFLPGISEELLFRGALIPLLGFNWAS
        SHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQPLDYAVVAFLPGISEELLFRGALIPLLGFNWAS
Subjt:  SHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQPLDYAVVAFLPGISEELLFRGALIPLLGFNWAS

Query:  VVVTAAIFGILHLGGGRKYSFAIW
        VVVTAAIFGILHLGGGRKYSFAIW
Subjt:  VVVTAAIFGILHLGGGRKYSFAIW

XP_022996702.1 uncharacterized protein LOC111491872 isoform X1 [Cucurbita maxima]4.71e-13683.72Show/hide
Query:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN
        MNLL IN RC ST+A STFN  TWRNS F+GRK  GL +++RVLP GL  RSN K KV AKRK AR+LER  EEVSI SSS DDNAQ++KMNSSDSS KN
Subjt:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN

Query:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP
         LINISSRSSV+QACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFE+ QLQLI GLVVLISSSRF LLK WPDFAESSEAANRQVLTSLQP
Subjt:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP

Query:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIWYS
        +DYA+VAFLPGISEELLFRGALIPLLGFNWASV++TAAIFGILHLGGGRKYSFAIW S
Subjt:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIWYS

XP_038886747.1 uncharacterized protein LOC120076873 isoform X1 [Benincasa hispida]1.46e-14186.33Show/hide
Query:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN
        MNLL IN RC ST+  STFN FTWRNS FMGRK IGLCN++RVLP GL  RSNVK KV AKRKSAR+LER  EE  ITSSSADDNAQ+V+MN SDSSPKN
Subjt:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN

Query:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP
         +INISSRSSVL+ACIITSGLIAALGVIIRQVSH ASIEGLPVIDCTSEVSFSFE+ QLQLIIGLVVLISSSRF LLK WPDFAESSEAANRQVLTSLQP
Subjt:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP

Query:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW
        LDY VVAFLPGISEELLFRGAL+PLLGFNWASVVVTAAIFG+LHLGGGRKYSFAIW
Subjt:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW

TrEMBL top hitse value%identityAlignment
A0A0A0LKL7 Uncharacterized protein2.04e-171100Show/hide
Query:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN
        MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN
Subjt:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN

Query:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP
        YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP
Subjt:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP

Query:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW
        LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW
Subjt:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW

A0A1S3BA79 uncharacterized protein LOC103487740 isoform X11.57e-15292.19Show/hide
Query:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN
        MNLL  NYRC ST+A STFN FTWRNS FMGRKGIGL NL+RVLPAGLS RSNVK KV AKRKSARRLERNREE SITSSSADDNAQEVKMNSSDSSPKN
Subjt:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN

Query:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP
        +LINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLV LISSSR  LLK WPDFAESSEAANRQVLTSL+P
Subjt:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP

Query:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW
        LDYAVVA LPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW
Subjt:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIW

A0A6J1HC53 uncharacterized protein LOC111462659 isoform X12.97e-13483.33Show/hide
Query:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN
        MNLL IN RC ST+A STFN  TWRNS F+GRK  GL +++RVLP GL  RSNVK +V AKRK AR+LER  EEVSI SSS DDNAQ++KMNSSDSS KN
Subjt:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN

Query:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP
         LINISSRSSV+QACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFE+ QLQLI GLVVLISSSRF LLK WPDFAESSEAANRQVLTSLQP
Subjt:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP

Query:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIWYS
        +DYA+VAFLPGISEELLFRGALIPLLGFNWASV++TAAIFGILHLGGGRKYSF IW S
Subjt:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIWYS

A0A6J1K2R3 uncharacterized protein LOC111491872 isoform X12.28e-13683.72Show/hide
Query:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN
        MNLL IN RC ST+A STFN  TWRNS F+GRK  GL +++RVLP GL  RSN K KV AKRK AR+LER  EEVSI SSS DDNAQ++KMNSSDSS KN
Subjt:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN

Query:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP
         LINISSRSSV+QACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFE+ QLQLI GLVVLISSSRF LLK WPDFAESSEAANRQVLTSLQP
Subjt:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP

Query:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIWYS
        +DYA+VAFLPGISEELLFRGALIPLLGFNWASV++TAAIFGILHLGGGRKYSFAIW S
Subjt:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIWYS

A0A6J1KBS8 uncharacterized protein LOC111491872 isoform X25.11e-13282.56Show/hide
Query:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN
        MNLL IN RC ST+A STFN  TWRNS F+GRK  GL +++RVLP GL      K KV AKRK AR+LER  EEVSI SSS DDNAQ++KMNSSDSS KN
Subjt:  MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKN

Query:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP
         LINISSRSSV+QACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFE+ QLQLI GLVVLISSSRF LLK WPDFAESSEAANRQVLTSLQP
Subjt:  YLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQP

Query:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIWYS
        +DYA+VAFLPGISEELLFRGALIPLLGFNWASV++TAAIFGILHLGGGRKYSFAIW S
Subjt:  LDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIWYS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G26085.1 CAAX amino terminal protease family protein2.0e-5655.56Show/hide
Query:  RSNVKTKVCAKRKSARRLERNREE------VSITSSS-ADDNAQEVKMNSSDSSPKNYLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPV
        R     +  + RKS ++L+R  ++       ++T    +    +E +++SS S     +   + R  VLQAC +TSGL+AALG+IIR+ SHVAS EGL V
Subjt:  RSNVKTKVCAKRKSARRLERNREE------VSITSSS-ADDNAQEVKMNSSDSSPKNYLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPV

Query:  IDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQPLDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGIL
         DC+ +V F FE W L LI G+VV ISSSRF LLK+WPDFA+SSEAANRQ+LTSL+PLDY VVA LPGISEELLFRGAL+PL G NW  +V    IFG+L
Subjt:  IDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQPLDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGIL

Query:  HLGGGRKYSFAIWYSI
        HLG GRKYSFA+W SI
Subjt:  HLGGGRKYSFAIWYSI

AT3G26085.2 CAAX amino terminal protease family protein2.0e-5656.67Show/hide
Query:  KVCAKRKSARRLERNREE------VSITSSS-ADDNAQEVKMNSSDSSPKNYLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSE
        +  + RKS ++L+R  ++       ++T    +    +E +++SS S     +   + R  VLQAC +TSGL+AALG+IIR+ SHVAS EGL V DC+ +
Subjt:  KVCAKRKSARRLERNREE------VSITSSS-ADDNAQEVKMNSSDSSPKNYLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSE

Query:  VSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQPLDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGR
        V F FE W L LI G+VV ISSSRF LLK+WPDFA+SSEAANRQ+LTSL+PLDY VVA LPGISEELLFRGAL+PL G NW  +V    IFG+LHLG GR
Subjt:  VSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQPLDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGILHLGGGR

Query:  KYSFAIWYSI
        KYSFA+W SI
Subjt:  KYSFAIWYSI

AT3G26085.3 CAAX amino terminal protease family protein2.0e-5655.56Show/hide
Query:  RSNVKTKVCAKRKSARRLERNREE------VSITSSS-ADDNAQEVKMNSSDSSPKNYLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPV
        R     +  + RKS ++L+R  ++       ++T    +    +E +++SS S     +   + R  VLQAC +TSGL+AALG+IIR+ SHVAS EGL V
Subjt:  RSNVKTKVCAKRKSARRLERNREE------VSITSSS-ADDNAQEVKMNSSDSSPKNYLINISSRSSVLQACIITSGLIAALGVIIRQVSHVASIEGLPV

Query:  IDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQPLDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGIL
         DC+ +V F FE W L LI G+VV ISSSRF LLK+WPDFA+SSEAANRQ+LTSL+PLDY VVA LPGISEELLFRGAL+PL G NW  +V    IFG+L
Subjt:  IDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQPLDYAVVAFLPGISEELLFRGALIPLLGFNWASVVVTAAIFGIL

Query:  HLGGGRKYSFAIWYSI
        HLG GRKYSFA+W SI
Subjt:  HLGGGRKYSFAIWYSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATTTGCTCGCTATAAACTATCGATGCACATCAACGGACGCCGGTTCCACCTTCAATCTATTCACATGGCGGAATTCTATTTTCATGGGTAGGAAGGGTATTGGCCT
CTGCAATTTACGAAGGGTTCTCCCAGCTGGATTATCTTGTAGATCAAATGTAAAGACAAAAGTTTGTGCAAAGCGGAAATCCGCGAGGAGATTGGAAAGAAATCGTGAGG
AAGTTTCTATAACGTCCTCTTCTGCTGATGACAATGCTCAAGAAGTGAAGATGAACTCTTCTGATAGTTCACCAAAGAACTACTTAATTAATATCTCCTCAAGAAGTTCT
GTGCTTCAGGCTTGCATTATTACTTCTGGTTTGATTGCTGCTTTGGGTGTAATAATTCGACAGGTATCTCATGTTGCATCGATAGAGGGATTGCCAGTGATTGACTGCAC
TTCGGAAGTATCATTTAGTTTTGAGGTGTGGCAACTTCAGTTGATCATAGGACTGGTTGTTCTAATATCTTCATCCCGCTTTTTCCTGTTGAAAACGTGGCCAGACTTTG
CTGAATCCAGTGAAGCAGCTAATCGGCAGGTGCTCACTTCTCTTCAACCTTTAGATTATGCGGTAGTTGCCTTTTTGCCTGGGATTAGCGAGGAATTGCTTTTCCGTGGT
GCATTGATACCGCTCTTGGGATTCAACTGGGCAAGTGTCGTCGTGACAGCTGCCATTTTTGGCATTCTACACTTGGGTGGTGGCCGGAAGTATTCGTTTGCTATATGGTA
TTCCATATCTCTTATGTGA
mRNA sequenceShow/hide mRNA sequence
TCCTCTTTGGGATCATTTTATCCTAAACCGAGTCCAGTTGAAGCTCAACCGGATCTGGGACAACCGTAAATCCGTTATGAATTTGCTCGCTATAAACTATCGATGCACAT
CAACGGACGCCGGTTCCACCTTCAATCTATTCACATGGCGGAATTCTATTTTCATGGGTAGGAAGGGTATTGGCCTCTGCAATTTACGAAGGGTTCTCCCAGCTGGATTA
TCTTGTAGATCAAATGTAAAGACAAAAGTTTGTGCAAAGCGGAAATCCGCGAGGAGATTGGAAAGAAATCGTGAGGAAGTTTCTATAACGTCCTCTTCTGCTGATGACAA
TGCTCAAGAAGTGAAGATGAACTCTTCTGATAGTTCACCAAAGAACTACTTAATTAATATCTCCTCAAGAAGTTCTGTGCTTCAGGCTTGCATTATTACTTCTGGTTTGA
TTGCTGCTTTGGGTGTAATAATTCGACAGGTATCTCATGTTGCATCGATAGAGGGATTGCCAGTGATTGACTGCACTTCGGAAGTATCATTTAGTTTTGAGGTGTGGCAA
CTTCAGTTGATCATAGGACTGGTTGTTCTAATATCTTCATCCCGCTTTTTCCTGTTGAAAACGTGGCCAGACTTTGCTGAATCCAGTGAAGCAGCTAATCGGCAGGTGCT
CACTTCTCTTCAACCTTTAGATTATGCGGTAGTTGCCTTTTTGCCTGGGATTAGCGAGGAATTGCTTTTCCGTGGTGCATTGATACCGCTCTTGGGATTCAACTGGGCAA
GTGTCGTCGTGACAGCTGCCATTTTTGGCATTCTACACTTGGGTGGTGGCCGGAAGTATTCGTTTGCTATATGGTATTCCATATCTCTTATGTGAATTTCTTATGATATG
CCCGATGTAGAAACTCTATCCTCAGCAATAGATCATTTATCAGATCCACTTAACTGCACAGTTCATATGCCTTCGCTCTTACAATATGCTGATGGGTAAAATTATAGAAA
CTACCTGTGAAAAATCAGTGTGTCTGTAAATGCTGTAGAAAAAACTGTTTTGGATTGTTCAAAACAAAATTATGATTAATCCTACTCTGAAAGTTCCATCCATTTTTGTT
GAGCAGGGCAACTTTTGTTGGGCTTGCATATGGCTATGCGACCATAGAGTCCTCCAGTATCGTTGTGCCGATGGCTTCTCATGCATTGAATAATCTGGTTGGAGGAATTC
TGTGGACCTACGAATCAAGGTCTTTAGAGAATCTTGAGGATTGAAAACGATGTAAACGAACTGAAAGCTGCTGATTGGTCTCCTCGAATGTGTGTACGCACTCATAGGAT
CTAAAATAAACATAGTTTAACTAAAACATATACAATGTATTAGCAGTCCCGAGATGTGCGATTCAAATTTAAGTGAGTAAATACATTGTGTATATAATAACTATTCATCA
ATAACAGATATTTTATTTCTAAACTTTTTATCTCTGTCG
Protein sequenceShow/hide protein sequence
MNLLAINYRCTSTDAGSTFNLFTWRNSIFMGRKGIGLCNLRRVLPAGLSCRSNVKTKVCAKRKSARRLERNREEVSITSSSADDNAQEVKMNSSDSSPKNYLINISSRSS
VLQACIITSGLIAALGVIIRQVSHVASIEGLPVIDCTSEVSFSFEVWQLQLIIGLVVLISSSRFFLLKTWPDFAESSEAANRQVLTSLQPLDYAVVAFLPGISEELLFRG
ALIPLLGFNWASVVVTAAIFGILHLGGGRKYSFAIWYSISLM