; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi08G016230 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi08G016230
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionCAAX amino terminal protease
Genome locationchr08:24166764..24169833
RNA-Seq ExpressionLsi08G016230
SyntenyLsi08G016230
Gene Ontology termsGO:0016485 - protein processing (biological process)
GO:0080120 - CAAX-box protein maturation (biological process)
GO:0016020 - membrane (cellular component)
GO:0004175 - endopeptidase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598256.1 hypothetical protein SDJN03_08034, partial [Cucurbita argyrosperma subsp. sororia]6.4e-8170.5Show/hide
Query:  TGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSH
        TGL +VQRVLPTGL +RSNVKPKVYAKRK ARKLERT EEVSI SS  DDNAQD+KMNSSD+S KNRLINISSRSSV+QACIITSGLIAALGVIIRQ S 
Subjt:  TGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSH

Query:  SNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLL
          HV  +E    +D       ++  SFEM QLQLI GLVVLISSSRFLLLK WPDFAESSEAA +QVLTSLQP+DYA+VAFLPGISE             
Subjt:  SNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLL

Query:  HLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF
                   ELLFRGAL+PLLGFNWASVV+TAAIFG+LHLGGGRKYSFAIW  ++ L +
Subjt:  HLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF

XP_004143052.1 uncharacterized protein LOC101207590 isoform X1 [Cucumis sativus]8.4e-8170.61Show/hide
Query:  GTGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDS
        G GLCN++RVLP GL  RSNVK KV AKRKSAR+LER  EEVSITSS ADDNAQ+VKMNSSD+SPKN LINISSRSSVLQACIITSGLIAALGVIIRQ S
Subjt:  GTGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDS

Query:  HSNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFL
           HV  +E    +D       ++  SFE+ QLQLIIGLVVLISSSRF LLK WPDFAESSEAA +QVLTSLQPLDYAVVAFLPGISE            
Subjt:  HSNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFL

Query:  LHLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF
                    ELLFRGAL+PLLGFNWASVVVTAAIFG+LHLGGGRKYSFAIW  ++ L +
Subjt:  LHLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF

XP_011649506.1 uncharacterized protein LOC101207590 isoform X2 [Cucumis sativus]8.4e-8170.61Show/hide
Query:  GTGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDS
        G GLCN++RVLP GL  RSNVK KV AKRKSAR+LER  EEVSITSS ADDNAQ+VKMNSSD+SPKN LINISSRSSVLQACIITSGLIAALGVIIRQ S
Subjt:  GTGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDS

Query:  HSNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFL
           HV  +E    +D       ++  SFE+ QLQLIIGLVVLISSSRF LLK WPDFAESSEAA +QVLTSLQPLDYAVVAFLPGISE            
Subjt:  HSNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFL

Query:  LHLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF
                    ELLFRGAL+PLLGFNWASVVVTAAIFG+LHLGGGRKYSFAIW  ++ L +
Subjt:  LHLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF

XP_038886747.1 uncharacterized protein LOC120076873 isoform X1 [Benincasa hispida]3.9e-8673.36Show/hide
Query:  GLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSHS
        GLCNVQRVLP GLC+RSNVKPKVYAKRKSARKLERTNEE  ITSS ADDNAQDV+MN SD+SPKNR+INISSRSSVL+ACIITSGLIAALGVIIRQ SH 
Subjt:  GLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSHS

Query:  NHVGKVEHEFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLLHL
          +  +      S      ++  SFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAA +QVLTSLQPLDY VVAFLPGISE               
Subjt:  NHVGKVEHEFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLLHL

Query:  STSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF
                 ELLFRGAL+PLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIW  ++ L +
Subjt:  STSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF

XP_038886748.1 uncharacterized protein LOC120076873 isoform X2 [Benincasa hispida]3.9e-8673.36Show/hide
Query:  GLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSHS
        GLCNVQRVLP GLC+RSNVKPKVYAKRKSARKLERTNEE  ITSS ADDNAQDV+MN SD+SPKNR+INISSRSSVL+ACIITSGLIAALGVIIRQ SH 
Subjt:  GLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSHS

Query:  NHVGKVEHEFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLLHL
          +  +      S      ++  SFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAA +QVLTSLQPLDY VVAFLPGISE               
Subjt:  NHVGKVEHEFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLLHL

Query:  STSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF
                 ELLFRGAL+PLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIW  ++ L +
Subjt:  STSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF

TrEMBL top hitse value%identityAlignment
A0A0A0LKL7 Uncharacterized protein4.1e-8170.61Show/hide
Query:  GTGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDS
        G GLCN++RVLP GL  RSNVK KV AKRKSAR+LER  EEVSITSS ADDNAQ+VKMNSSD+SPKN LINISSRSSVLQACIITSGLIAALGVIIRQ S
Subjt:  GTGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDS

Query:  HSNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFL
           HV  +E    +D       ++  SFE+ QLQLIIGLVVLISSSRF LLK WPDFAESSEAA +QVLTSLQPLDYAVVAFLPGISE            
Subjt:  HSNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFL

Query:  LHLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF
                    ELLFRGAL+PLLGFNWASVVVTAAIFG+LHLGGGRKYSFAIW  ++ L +
Subjt:  LHLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF

A0A5A7V132 Uncharacterized protein2.2e-7969.47Show/hide
Query:  GTGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDS
        G GL N+QRVLP GL +RSNVKPKV+AKRKSAR+LER  EE SITSS ADDNAQ+VKMNSSD+SPKN LINISSRSSVLQACIITSGLIAALGVIIRQ S
Subjt:  GTGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDS

Query:  HSNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFL
           HV  +E    +D       ++  SFE+ QLQLIIGLV LISSSR LLLK WPDFAESSEAA +QVLTSL+PLDYAVVA LPGISE            
Subjt:  HSNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFL

Query:  LHLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF
                    ELLFRGAL+PLLGFNWASVVVTAAIFG+LHLGGGRKYSFAIW  ++ L +
Subjt:  LHLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF

A0A6J1BQ63 uncharacterized protein LOC111004632 isoform X14.5e-8070.47Show/hide
Query:  GTGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDS
        G GLC +QR  PTGLC+ SNVKP VYA+RKSARKLER  EEVS TS  AD+NA DVKMNSSD+SPKN L NISSRSSVLQAC ITSGLIAALGVIIRQ S
Subjt:  GTGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDS

Query:  HSNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFL
           HV  +E    +D       ++  SFEMRQLQLI GLVVLISSSR++LLK WPDFAESSEAA +QVLTSLQP+DY VVAFLPGISEV+K         
Subjt:  HSNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFL

Query:  LHLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIW
                    ELLFRGAL+PLLGFNWASV+VTAAIFGVLHLGGGRKYSFAIW
Subjt:  LHLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIW

A0A6J1K2R3 uncharacterized protein LOC111491872 isoform X11.5e-8069.73Show/hide
Query:  TGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSH
        TGL +VQRVLPTGL +RSN KPKV+AKRK ARKLERT EEVSI SS  DDNAQD+KMNSSD+S KNRLINISSRSSV+QACIITSGLIAALGVIIRQ S 
Subjt:  TGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSH

Query:  SNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLL
          HV  +E    +D       ++  SFEMRQLQLI GLVVLISSSRFLLLK WPDFAESSEAA +QVLTSLQP+DYA+VAFLPGISE             
Subjt:  SNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLL

Query:  HLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF
                   ELLFRGAL+PLLGFNWASV++TAAIFG+LHLGGGRKYSFAIW  ++ L +
Subjt:  HLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF

A0A6J1K7J1 uncharacterized protein LOC111491872 isoform X41.5e-8069.73Show/hide
Query:  TGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSH
        TGL +VQRVLPTGL +RSN KPKV+AKRK ARKLERT EEVSI SS  DDNAQD+KMNSSD+S KNRLINISSRSSV+QACIITSGLIAALGVIIRQ S 
Subjt:  TGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSH

Query:  SNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLL
          HV  +E    +D       ++  SFEMRQLQLI GLVVLISSSRFLLLK WPDFAESSEAA +QVLTSLQP+DYA+VAFLPGISE             
Subjt:  SNHVGKVEH-EFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLL

Query:  HLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF
                   ELLFRGAL+PLLGFNWASV++TAAIFG+LHLGGGRKYSFAIW  ++ L +
Subjt:  HLSTSCSHSEYELLFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G26085.1 CAAX amino terminal protease family protein9.1e-4145Show/hide
Query:  AKRKSARKLER-------------TNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSHSNHVGKVEHEFM
        + RKS +KL+R             T+EEVS          ++ +++SS +     +   + R  VLQAC +TSGL+AALG+IIR+   ++HV   E   +
Subjt:  AKRKSARKLER-------------TNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSHSNHVGKVEHEFM

Query:  DSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLLHLSTSCSHSEYEL
               + +   FE   L LI G+VV ISSSRFLLLK+WPDFA+SSEAA +Q+LTSL+PLDY VVA LPGISE                        EL
Subjt:  DSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLLHLSTSCSHSEYEL

Query:  LFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIW
        LFRGAL+PL G NW  +V    IFG+LHLG GRKYSFA+W
Subjt:  LFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIW

AT3G26085.2 CAAX amino terminal protease family protein9.1e-4145Show/hide
Query:  AKRKSARKLER-------------TNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSHSNHVGKVEHEFM
        + RKS +KL+R             T+EEVS          ++ +++SS +     +   + R  VLQAC +TSGL+AALG+IIR+   ++HV   E   +
Subjt:  AKRKSARKLER-------------TNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSHSNHVGKVEHEFM

Query:  DSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLLHLSTSCSHSEYEL
               + +   FE   L LI G+VV ISSSRFLLLK+WPDFA+SSEAA +Q+LTSL+PLDY VVA LPGISE                        EL
Subjt:  DSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLLHLSTSCSHSEYEL

Query:  LFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIW
        LFRGAL+PL G NW  +V    IFG+LHLG GRKYSFA+W
Subjt:  LFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIW

AT3G26085.3 CAAX amino terminal protease family protein9.1e-4145Show/hide
Query:  AKRKSARKLER-------------TNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSHSNHVGKVEHEFM
        + RKS +KL+R             T+EEVS          ++ +++SS +     +   + R  VLQAC +TSGL+AALG+IIR+   ++HV   E   +
Subjt:  AKRKSARKLER-------------TNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSHSNHVGKVEHEFM

Query:  DSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLLHLSTSCSHSEYEL
               + +   FE   L LI G+VV ISSSRFLLLK+WPDFA+SSEAA +Q+LTSL+PLDY VVA LPGISE                        EL
Subjt:  DSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLLHLSTSCSHSEYEL

Query:  LFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIW
        LFRGAL+PL G NW  +V    IFG+LHLG GRKYSFA+W
Subjt:  LFRGALLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGGTACTGGCCTCTGCAATGTACAAAGGGTTCTACCCACTGGATTATGCACTAGGTCAAATGTAAAGCCAAAAGTTTATGCAAAGCGGAAATCCGCGAGGAAATT
GGAAAGAACGAATGAGGAAGTTTCTATAACGTCCTCTTGTGCTGATGATAATGCTCAAGATGTGAAGATGAACTCTTCTGATAATTCACCAAAGAACCGCCTGATTAATA
TCTCCTCAAGAAGTTCTGTGCTTCAGGCTTGCATAATTACTTCTGGTTTGATTGCTGCTTTGGGTGTAATAATTCGACAGGATAGTCATTCTAATCACGTAGGAAAAGTG
GAACATGAGTTTATGGACAGCCCTTACAAGTTACACTTGCAAATTGAAATTAGTTTCGAGATGAGGCAACTTCAGTTGATTATAGGACTGGTTGTTCTAATATCTTCATC
CCGATTTTTACTGTTGAAAGCATGGCCAGACTTTGCAGAGTCTAGTGAAGCAGCCTATCAACAGGTGCTCACTTCTCTTCAACCTTTAGATTATGCGGTAGTAGCCTTTT
TGCCCGGGATTAGCGAGGTGAGCAAAGTATACCAACACATATCTCATTTTCTTCTACATCTATCTACTTCTTGTTCTCACTCAGAATATGAATTGCTTTTCCGTGGCGCA
TTGTTACCACTCTTGGGATTCAACTGGGCAAGTGTGGTGGTGACAGCCGCCATTTTTGGTGTTCTACACTTGGGTGGTGGCCGGAAGTATTCATTTGCAATATGGTATCC
ATATCTATATCTGAATTTCACATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGGTACTGGCCTCTGCAATGTACAAAGGGTTCTACCCACTGGATTATGCACTAGGTCAAATGTAAAGCCAAAAGTTTATGCAAAGCGGAAATCCGCGAGGAAATT
GGAAAGAACGAATGAGGAAGTTTCTATAACGTCCTCTTGTGCTGATGATAATGCTCAAGATGTGAAGATGAACTCTTCTGATAATTCACCAAAGAACCGCCTGATTAATA
TCTCCTCAAGAAGTTCTGTGCTTCAGGCTTGCATAATTACTTCTGGTTTGATTGCTGCTTTGGGTGTAATAATTCGACAGGATAGTCATTCTAATCACGTAGGAAAAGTG
GAACATGAGTTTATGGACAGCCCTTACAAGTTACACTTGCAAATTGAAATTAGTTTCGAGATGAGGCAACTTCAGTTGATTATAGGACTGGTTGTTCTAATATCTTCATC
CCGATTTTTACTGTTGAAAGCATGGCCAGACTTTGCAGAGTCTAGTGAAGCAGCCTATCAACAGGTGCTCACTTCTCTTCAACCTTTAGATTATGCGGTAGTAGCCTTTT
TGCCCGGGATTAGCGAGGTGAGCAAAGTATACCAACACATATCTCATTTTCTTCTACATCTATCTACTTCTTGTTCTCACTCAGAATATGAATTGCTTTTCCGTGGCGCA
TTGTTACCACTCTTGGGATTCAACTGGGCAAGTGTGGTGGTGACAGCCGCCATTTTTGGTGTTCTACACTTGGGTGGTGGCCGGAAGTATTCATTTGCAATATGGTATCC
ATATCTATATCTGAATTTCACATGATATGCCTGATGTAGAAACTCTATCCTCAGCAATAGATCACGAATCAGATCCTTTTAACTGCACAGTCCATATGCCTTTGCTCTTA
CAATATGCTCTGATCAGCTGATGGTTAAAAATTTGGCTAAATTATAAAAACTACCTTTAACTTTGCCCATTCTTTCAAAACTATCTCTATACCTTCAAAAGTTGCAACAT
TGCCTTTAACCTTTCATAATTGTTTCAAAATTAAATAGAAAAATCATTATGACATTGGACAGTCAGAGTGTTTGTCTTTGCTCTTGGAATGCTGAATGTTTTGTTGTGAT
AGCCCACTGTTGTTTGTAAATGGTGTAAGAAAAAACGGTGTTGGACTCTTTAAAACAAAATTATGATTAATCCTGTTCTCAAAGTTCCATCCATTATGTTGGGCAGGGCA
ACTTTTGTTGGACTCACCTATGGTTATGCGACTCTAGAGTCCTCCAGTATCGTTGTGCCGATAGCTTCTCATGCATTGAATAATCTGGTTGGAGGAATTCTGTGGCGCTA
CGAGTCAAGATCTTTGGAGAATCCTGAAGATTAAAAATTATCTACAAAGCTGATTTTTGTATATGTAGCTCAATTGTGTATATACATTGTATAAATAATTTCTAATAAGT
AGTTGTGACCATAGCATTAAATTTTGATCGGCATGTCGATATTTCTATGTTTTCATGGGTTCGACATCAACATGAGGGATGAATTGACATTTCTATTATACCGTAG
Protein sequenceShow/hide protein sequence
MAGTGLCNVQRVLPTGLCTRSNVKPKVYAKRKSARKLERTNEEVSITSSCADDNAQDVKMNSSDNSPKNRLINISSRSSVLQACIITSGLIAALGVIIRQDSHSNHVGKV
EHEFMDSPYKLHLQIEISFEMRQLQLIIGLVVLISSSRFLLLKAWPDFAESSEAAYQQVLTSLQPLDYAVVAFLPGISEVSKVYQHISHFLLHLSTSCSHSEYELLFRGA
LLPLLGFNWASVVVTAAIFGVLHLGGGRKYSFAIWYPYLYLNFT