; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG06G008280 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG06G008280
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionlysine-rich arabinogalactan protein 19
Genome locationCG_Chr06:11519612..11520613
RNA-Seq ExpressionClCG06G008280
SyntenyClCG06G008280
Gene Ontology termsGO:0005886 - plasma membrane (cellular component)
InterPro domainsIPR038793 - Lysine-rich arabinogalactan protein 19


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN61400.1 hypothetical protein Csa_005957 [Cucumis sativus]2.8e-8083.84Show/hide
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPP
        MASKSW F VICL FLF S+  QAPAQPPSLPTVSP P+TPPPT  T PPLSTPPP  VAPSTSPTIP   PPQSSP+STPSLPPALPPPPATPPPLPPP
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPP

Query:  LPPPQVSPTPSQAPPVPAPTLA--PLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL
        LPPPQVSPTP   PPV APTLA  PLPTPPTP  SPPT APVAAPE SPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSP+L
Subjt:  LPPPQVSPTPSQAPPVPAPTLA--PLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL

Query:  NGGDPMKGKVGTWTRVVLGFIFLVATTGF
        NGGDP+K KVG W++V LGFIFLVATTGF
Subjt:  NGGDPMKGKVGTWTRVVLGFIFLVATTGF

XP_008457186.2 PREDICTED: lysine-rich arabinogalactan protein 19 [Cucumis melo]2.1e-8084.28Show/hide
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPP
        MASK W F VICL FLF SAI QAPAQPPSLPTVSP PSTPPPT  TPPPLSTPPP  VAPSTSP IP   PPQSSP+STPSLPPA PPPPATPPPLPPP
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPP

Query:  LPPPQVSPTPSQAPPVPAPTLAPLPTP--PTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL
        LPPPQVSP P   PPV APTLAPLP P  PTP  SPPT APVA+PELSPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSP+L
Subjt:  LPPPQVSPTPSQAPPVPAPTLAPLPTP--PTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL

Query:  NGGDPMKGKVGTWTRVVLGFIFLVATTGF
        + GDPMKGKVGTWT+V LGFIFLVATTGF
Subjt:  NGGDPMKGKVGTWTRVVLGFIFLVATTGF

XP_011649058.1 lysine-rich arabinogalactan protein 19 [Cucumis sativus]2.8e-8083.84Show/hide
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPP
        MASKSW F VICL FLF S+  QAPAQPPSLPTVSP P+TPPPT  T PPLSTPPP  VAPSTSPTIP   PPQSSP+STPSLPPALPPPPATPPPLPPP
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPP

Query:  LPPPQVSPTPSQAPPVPAPTLA--PLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL
        LPPPQVSPTP   PPV APTLA  PLPTPPTP  SPPT APVAAPE SPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSP+L
Subjt:  LPPPQVSPTPSQAPPVPAPTLA--PLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL

Query:  NGGDPMKGKVGTWTRVVLGFIFLVATTGF
        NGGDP+K KVG W++V LGFIFLVATTGF
Subjt:  NGGDPMKGKVGTWTRVVLGFIFLVATTGF

XP_022158972.1 lysine-rich arabinogalactan protein 19 [Momordica charantia]4.9e-7781.03Show/hide
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPL-STPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPP
        MASKSWP LVI L FLF  AIGQAPAQPPSLPT S  PST  PT  TPPPL STPPPN VAPSTSPT+PPPQPPQ +P+STPSLPP LPPPPATPPPLP 
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPL-STPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPP

Query:  PLPPPQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPEL--SPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPN
          PPPQVSPTP+QAP VPAPTLAPLP PP P+  PPT APVAAP L  SPAPAPGKH+ +RHHKHKKH APAPAPT+PSPPAPPTVVDSQDSTAPAPSPN
Subjt:  PLPPPQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPEL--SPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPN

Query:  LNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF
        LNGGDPM+GKV TWTRVVLGFI LVATT   F
Subjt:  LNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF

XP_038875574.1 lysine-rich arabinogalactan protein 19-like [Benincasa hispida]5.9e-9189.96Show/hide
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPP
        MASKSWPF VICLGFLFP AIGQAP QPP+LPTVSP PSTPPPT TTPPPLSTPPP  VAPSTSPTIPPPQP    PISTPSLPPALPPPPATPPPLPPP
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPP

Query:  LPPPQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNG
        LPPPQVSPTPSQAPPVPAPTLAPL TPPTPM SPPT AP+A PEL+PAPAPGKHK RRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNG
Subjt:  LPPPQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNG

Query:  GDPMKGKVGTWTRVVLGFIFLVATTGFKF
        GDPMKGKV T +RVVLGFIFLVATTGF F
Subjt:  GDPMKGKVGTWTRVVLGFIFLVATTGFKF

TrEMBL top hitse value%identityAlignment
A0A0A0LHJ4 Uncharacterized protein1.3e-8083.84Show/hide
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPP
        MASKSW F VICL FLF S+  QAPAQPPSLPTVSP P+TPPPT  T PPLSTPPP  VAPSTSPTIP   PPQSSP+STPSLPPALPPPPATPPPLPPP
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPP

Query:  LPPPQVSPTPSQAPPVPAPTLA--PLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL
        LPPPQVSPTP   PPV APTLA  PLPTPPTP  SPPT APVAAPE SPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSP+L
Subjt:  LPPPQVSPTPSQAPPVPAPTLA--PLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL

Query:  NGGDPMKGKVGTWTRVVLGFIFLVATTGF
        NGGDP+K KVG W++V LGFIFLVATTGF
Subjt:  NGGDPMKGKVGTWTRVVLGFIFLVATTGF

A0A1S3C506 lysine-rich arabinogalactan protein 191.0e-8084.28Show/hide
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPP
        MASK W F VICL FLF SAI QAPAQPPSLPTVSP PSTPPPT  TPPPLSTPPP  VAPSTSP IP   PPQSSP+STPSLPPA PPPPATPPPLPPP
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPP

Query:  LPPPQVSPTPSQAPPVPAPTLAPLPTP--PTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL
        LPPPQVSP P   PPV APTLAPLP P  PTP  SPPT APVA+PELSPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSP+L
Subjt:  LPPPQVSPTPSQAPPVPAPTLAPLPTP--PTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL

Query:  NGGDPMKGKVGTWTRVVLGFIFLVATTGF
        + GDPMKGKVGTWT+V LGFIFLVATTGF
Subjt:  NGGDPMKGKVGTWTRVVLGFIFLVATTGF

A0A6J1DYJ7 lysine-rich arabinogalactan protein 192.4e-7781.03Show/hide
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPL-STPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPP
        MASKSWP LVI L FLF  AIGQAPAQPPSLPT S  PST  PT  TPPPL STPPPN VAPSTSPT+PPPQPPQ +P+STPSLPP LPPPPATPPPLP 
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPL-STPPPN-VAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPP

Query:  PLPPPQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPEL--SPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPN
          PPPQVSPTP+QAP VPAPTLAPLP PP P+  PPT APVAAP L  SPAPAPGKH+ +RHHKHKKH APAPAPT+PSPPAPPTVVDSQDSTAPAPSPN
Subjt:  PLPPPQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPEL--SPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPN

Query:  LNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF
        LNGGDPM+GKV TWTRVVLGFI LVATT   F
Subjt:  LNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF

A0A6J1EVZ4 lysine-rich arabinogalactan protein 19-like1.3e-7074.17Show/hide
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPNV-APSTSPTIPPPQPPQSSPISTPSLPPA----------LPPP
        MASKS   LVICL FLFP AI QAPAQPPSLPT +P PSTPPPT T      TPPPNV APS +PTIPPPQPPQ+ P+STPSLPP           LPPP
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPNV-APSTSPTIPPPQPPQSSPISTPSLPPA----------LPPP

Query:  PATPPPLPPPLPPPQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ-DS
        PATPPPLPPPLP PQ+SPTP Q        +APLPTPPTP+ SPP LAPVA PEL+PAPAPGKHK RR HKHKKHQAPAPAP VPSPPAPPTVVDSQ DS
Subjt:  PATPPPLPPPLPPPQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ-DS

Query:  TAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF
        TAPAPSPNLNGGDPMK KVG W RVVLGFI +VA+T F F
Subjt:  TAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF

A0A6J1IGY8 lysine-rich arabinogalactan protein 19-like8.7e-7274.38Show/hide
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPNV-APSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPL---
        MASKS P LVICL FLFP AI QAPAQPPSLPT +P PSTPPPT T      TPPPNV APS +PTIPPPQPPQ+ P+STPSLPP LPPPP TPPPL   
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPNV-APSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPL---

Query:  ---PPPLPPPQVSPTPSQAPPVPAP-------TLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ
           PPPLPPP  +P P   PP+PAP        +APLPTPPTP+ S P LAPVA PEL+PAPAPGKHK RR HKHKKHQAPAPAP VPSPPAPPTVVDSQ
Subjt:  ---PPPLPPPQVSPTPSQAPPVPAP-------TLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ

Query:  DSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF
        DSTAPAPSPNLNGGDPMK KVGTW R+VLGFI +VATT F F
Subjt:  DSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF

SwissProt top hitse value%identityAlignment
Q9S740 Lysine-rich arabinogalactan protein 195.1e-2145.89Show/hide
Query:  SAIGQAPAQPPSLPTVSPLPSTPPPTATTPP------------------PLSTPPPNVAPSTSPTIPPPQPPQSSPISTPSL--PPALPPP-PATPPPLP
        S +      PP      P  + PPPT TTPP                  P S P P VAP  SP  PPPQPPQS P S P++  PP  PPP P +PPP P
Subjt:  SAIGQAPAQPPSLPTVSPLPSTPPPTATTPP------------------PLSTPPPNVAPSTSPTIPPPQPPQSSPISTPSL--PPALPPP-PATPPPLP

Query:  PPLPPPQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKK-HQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPN
           PP   SP P+ A P PAP   P    P+P+S PP          +PAPAP KHKR+  HKHK+ H APAPAP  PSPP+PP + D QD TAPAPSPN
Subjt:  PPLPPPQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKK-HQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPN

Query:  LNGG---DPMKGKVGTWTRVVLGFIFLVATT
         NGG   + +KG+   W    L  +FL+A T
Subjt:  LNGG---DPMKGKVGTWTRVVLGFIFLVATT

Arabidopsis top hitse value%identityAlignment
AT1G68725.1 arabinogalactan protein 193.7e-2245.89Show/hide
Query:  SAIGQAPAQPPSLPTVSPLPSTPPPTATTPP------------------PLSTPPPNVAPSTSPTIPPPQPPQSSPISTPSL--PPALPPP-PATPPPLP
        S +      PP      P  + PPPT TTPP                  P S P P VAP  SP  PPPQPPQS P S P++  PP  PPP P +PPP P
Subjt:  SAIGQAPAQPPSLPTVSPLPSTPPPTATTPP------------------PLSTPPPNVAPSTSPTIPPPQPPQSSPISTPSL--PPALPPP-PATPPPLP

Query:  PPLPPPQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKK-HQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPN
           PP   SP P+ A P PAP   P    P+P+S PP          +PAPAP KHKR+  HKHK+ H APAPAP  PSPP+PP + D QD TAPAPSPN
Subjt:  PPLPPPQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKK-HQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPN

Query:  LNGG---DPMKGKVGTWTRVVLGFIFLVATT
         NGG   + +KG+   W    L  +FL+A T
Subjt:  LNGG---DPMKGKVGTWTRVVLGFIFLVATT

AT2G14890.1 arabinogalactan protein 99.0e-0540.43Show/hide
Query:  SKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPNVAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPPLPP
        ++S+   VIC+  L     GQAP  PP+     P P+TPPP A TPPP+S PPP    +TSP      PP ++P    S PP   PPPATPPP+  P PP
Subjt:  SKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPNVAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPPLPP

Query:  PQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTA
        P  SP P+  PPV  P  APL +PP  + +P   AP   P+ SP+P+P               AP P+    SP   PT V+ Q+  +
Subjt:  PQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTA

AT2G14890.2 arabinogalactan protein 99.0e-0541.4Show/hide
Query:  SKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPNVAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPPLPP
        ++S+   VIC+  L     GQAP  PP+     P P+TPPP A TPPP+S PPP    +TSP      PP ++P    S PP   PPPATPPP+  P PP
Subjt:  SKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPNVAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPPLPP

Query:  PQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDS
        P  SP P+  PPV  P  APL +PP  + +P   AP   P+ SP+P+P               AP P+    SP   PT V+ Q S
Subjt:  PQVSPTPSQAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCAAATCATGGCCTTTTTTGGTAATTTGCCTTGGCTTTCTATTCCCTTCCGCGATCGGCCAGGCCCCGGCACAACCACCAAGCTTGCCAACTGTCTCACCACT
GCCTTCCACCCCGCCTCCGACCGCAACAACTCCTCCACCCTTGTCAACACCACCGCCTAATGTTGCACCATCAACAAGCCCAACCATTCCACCACCCCAACCACCACAAA
GTTCACCTATCTCAACCCCGTCGCTACCACCAGCATTGCCACCTCCCCCTGCTACGCCTCCGCCATTGCCACCTCCTTTACCACCACCACAAGTCTCTCCAACTCCTTCA
CAGGCACCACCGGTGCCAGCTCCCACCCTAGCACCATTACCAACTCCGCCAACACCAATGTCTTCACCACCAACGCTAGCACCAGTCGCGGCACCAGAGTTATCCCCTGC
ACCTGCGCCAGGAAAGCATAAGCGTAGGAGGCATCACAAACATAAGAAGCATCAAGCACCGGCCCCGGCCCCAACCGTCCCGAGCCCACCAGCACCACCTACAGTGGTGG
ACTCACAAGATAGCACAGCGCCAGCGCCATCACCAAATCTGAATGGAGGAGACCCAATGAAAGGAAAGGTAGGAACATGGACTAGAGTTGTACTGGGATTCATATTTCTA
GTTGCTACCACAGGCTTCAAATTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCAAATCATGGCCTTTTTTGGTAATTTGCCTTGGCTTTCTATTCCCTTCCGCGATCGGCCAGGCCCCGGCACAACCACCAAGCTTGCCAACTGTCTCACCACT
GCCTTCCACCCCGCCTCCGACCGCAACAACTCCTCCACCCTTGTCAACACCACCGCCTAATGTTGCACCATCAACAAGCCCAACCATTCCACCACCCCAACCACCACAAA
GTTCACCTATCTCAACCCCGTCGCTACCACCAGCATTGCCACCTCCCCCTGCTACGCCTCCGCCATTGCCACCTCCTTTACCACCACCACAAGTCTCTCCAACTCCTTCA
CAGGCACCACCGGTGCCAGCTCCCACCCTAGCACCATTACCAACTCCGCCAACACCAATGTCTTCACCACCAACGCTAGCACCAGTCGCGGCACCAGAGTTATCCCCTGC
ACCTGCGCCAGGAAAGCATAAGCGTAGGAGGCATCACAAACATAAGAAGCATCAAGCACCGGCCCCGGCCCCAACCGTCCCGAGCCCACCAGCACCACCTACAGTGGTGG
ACTCACAAGATAGCACAGCGCCAGCGCCATCACCAAATCTGAATGGAGGAGACCCAATGAAAGGAAAGGTAGGAACATGGACTAGAGTTGTACTGGGATTCATATTTCTA
GTTGCTACCACAGGCTTCAAATTCTAATTTTGAAGAGAGAATTATTCACAAGGAACCCATTATGGTTCATAAAATTATAGAATTGGCCCAGAATCTGTGAG
Protein sequenceShow/hide protein sequence
MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTVSPLPSTPPPTATTPPPLSTPPPNVAPSTSPTIPPPQPPQSSPISTPSLPPALPPPPATPPPLPPPLPPPQVSPTPS
QAPPVPAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFL
VATTGFKF