CuGenDBv2

Clc06G08905 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc06G08905
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: lysine-rich arabinogalactan protein 19-like
Genome location: ClcChr06:10959988..10960921
RNA-Seq Expression: Clc06G08905
Synteny: Clc06G08905
Gene Ontology terms: GO:0005886 - plasma membrane (cellular component)
InterPro domains: IPR038793 - Lysine-rich arabinogalactan protein 19
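
The genome location field gives a chromosome name and a start..end range. As a minimal sketch, the span of the locus can be computed from that string, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state its coordinate convention). The helper name `parse_location` is hypothetical and not part of any CuGenDB tooling.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("ClcChr06:10959988..10960921")

# Assumption: 1-based inclusive coordinates, so both endpoints are counted.
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the locus spans 934 bp, which is longer than the 693-nt CDS listed in the Sequences section.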


Homology
GenBank top hits (E-value, % identity, alignment)
KGN61400.1 hypothetical protein Csa_005957 [Cucumis sativus] (E-value: 1.4e-47; identity: 57.08%)
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPT--------GLTTAFHP--ASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLP
        MASKSW F VICL FLF S+  QAPAQPPSLPT          T A  P  ++ P  ++ P       +   Q+ P   P       S P         P
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPT--------GLTTAFHP--ASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLP

Query:  PATASAIATSFTTTTSLSNSFTGHHRCHAPTLA--PLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ
        P     +     + T L           APTLA  PLPTPPTP  SPPT APVAAPE SPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQ
Subjt:  PATASAIATSFTTTTSLSNSFTGHHRCHAPTLA--PLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ

Query:  DSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGF
        DSTAPAPSP+LNGGDP+K KVG W++V LGFIFLVATTGF
Subjt:  DSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGF

XP_008457186.2 PREDICTED: lysine-rich arabinogalactan protein 19 [Cucumis melo] (E-value: 8.2e-48; identity: 56.67%)
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHP----------ASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLP
        MASK W F VICL FLF SAI QAPAQPPSLPT       P          ++ P  ++ P       +   Q+ P   P       S P  +      P
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHP----------ASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLP

Query:  PATASAIATSFTTTTSLSNSFTGHHRCHAPTLAPLPTP--PTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ
        P     +     +   L           APTLAPLP P  PTP  SPPT APVA+PELSPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQ
Subjt:  PATASAIATSFTTTTSLSNSFTGHHRCHAPTLAPLPTP--PTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ

Query:  DSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGF
        DSTAPAPSP+L+ GDPMKGKVGTWT+V LGFIFLVATTGF
Subjt:  DSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGF

XP_011649058.1 lysine-rich arabinogalactan protein 19 [Cucumis sativus] (E-value: 1.4e-47; identity: 57.08%)
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPT--------GLTTAFHP--ASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLP
        MASKSW F VICL FLF S+  QAPAQPPSLPT          T A  P  ++ P  ++ P       +   Q+ P   P       S P         P
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPT--------GLTTAFHP--ASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLP

Query:  PATASAIATSFTTTTSLSNSFTGHHRCHAPTLA--PLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ
        P     +     + T L           APTLA  PLPTPPTP  SPPT APVAAPE SPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQ
Subjt:  PATASAIATSFTTTTSLSNSFTGHHRCHAPTLA--PLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ

Query:  DSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGF
        DSTAPAPSP+LNGGDP+K KVG W++V LGFIFLVATTGF
Subjt:  DSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGF

XP_022974214.1 lysine-rich arabinogalactan protein 19-like [Cucurbita maxima] (E-value: 2.2e-48; identity: 57.38%)
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHP-----ASRPQQLLHPCQHHRL-MLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLPPATA
        MASKS P LVICL FLFP AI QAPAQPPSLPT       P      + P  +  P     +      Q  P   P+   V L  P         PPAT 
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHP-----ASRPQQLLHPCQHHRL-MLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLPPATA

Query:  SAIATSFTTTTSLSNSFTGHHRCHAP-TLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAP
          +     T   L            P  +APLPTPPTP+ S P LAPVA PEL+PAPAPGKHK RR HKHKKHQAPAPAP VPSPPAPPTVVDSQDSTAP
Subjt:  SAIATSFTTTTSLSNSFTGHHRCHAP-TLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAP

Query:  APSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF
        APSPNLNGGDPMK KVGTW R+VLGFI +VATT F F
Subjt:  APSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF

XP_038875574.1 lysine-rich arabinogalactan protein 19-like [Benincasa hispida] (E-value: 5.8e-54; identity: 60.17%)
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHPASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKV------HLSQPRRYHQHCHLPPATA
        MASKSWPF VICLGFLFP AIGQAP QPP+LP   T +  P++ P  +  P              P   P    V       +  P+       LPPA  
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHPASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKV------HLSQPRRYHQHCHLPPATA

Query:  SAIATSFTTTTSL-----SNSFTGHHRCHAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQD
           AT       L     S + +      APTLAPL TPPTPM SPPT AP+A PEL+PAPAPGKHK RRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQD
Subjt:  SAIATSFTTTTSL-----SNSFTGHHRCHAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQD

Query:  STAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF
        STAPAPSPNLNGGDPMKGKV T +RVVLGFIFLVATTGF F
Subjt:  STAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF

TrEMBL top hits (E-value, % identity, alignment)
A0A0A0LHJ4 Uncharacterized protein (E-value: 6.7e-48; identity: 57.08%)
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPT--------GLTTAFHP--ASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLP
        MASKSW F VICL FLF S+  QAPAQPPSLPT          T A  P  ++ P  ++ P       +   Q+ P   P       S P         P
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPT--------GLTTAFHP--ASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLP

Query:  PATASAIATSFTTTTSLSNSFTGHHRCHAPTLA--PLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ
        P     +     + T L           APTLA  PLPTPPTP  SPPT APVAAPE SPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQ
Subjt:  PATASAIATSFTTTTSLSNSFTGHHRCHAPTLA--PLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ

Query:  DSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGF
        DSTAPAPSP+LNGGDP+K KVG W++V LGFIFLVATTGF
Subjt:  DSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGF

A0A1S3C506 lysine-rich arabinogalactan protein 19 (E-value: 4.0e-48; identity: 56.67%)
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHP----------ASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLP
        MASK W F VICL FLF SAI QAPAQPPSLPT       P          ++ P  ++ P       +   Q+ P   P       S P  +      P
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHP----------ASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLP

Query:  PATASAIATSFTTTTSLSNSFTGHHRCHAPTLAPLPTP--PTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ
        P     +     +   L           APTLAPLP P  PTP  SPPT APVA+PELSPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQ
Subjt:  PATASAIATSFTTTTSLSNSFTGHHRCHAPTLAPLPTP--PTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ

Query:  DSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGF
        DSTAPAPSP+L+ GDPMKGKVGTWT+V LGFIFLVATTGF
Subjt:  DSTAPAPSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGF

A0A6J1DYJ7 lysine-rich arabinogalactan protein 19 (E-value: 2.8e-46; identity: 57.81%)
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHPASRPQQLLHPCQHHRLMLHHQQAQPFHHPN-HHKVHLSQPRRYH--QHCHLPPATASAI
        MASKSWP LVI L FLF  AIGQAPAQPPSLPT         S P   L P       L          P+    V   QP +        LPP      
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHPASRPQQLLHPCQHHRLMLHHQQAQPFHHPN-HHKVHLSQPRRYH--QHCHLPPATASAI

Query:  AT--SFTTTTSLSNSFTGHHRCHAPTLAPLPTPPTPMSSPPTLAPVAAPEL--SPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAP
        AT         +S +        APTLAPLP PP P+  PPT APVAAP L  SPAPAPGKH+ +RHHKHKKH APAPAPT+PSPPAPPTVVDSQDSTAP
Subjt:  AT--SFTTTTSLSNSFTGHHRCHAPTLAPLPTPPTPMSSPPTLAPVAAPEL--SPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAP

Query:  APSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF
        APSPNLNGGDPM+GKV TWTRVVLGFI LVATT   F
Subjt:  APSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF

A0A6J1EVZ4 lysine-rich arabinogalactan protein 19-like (E-value: 1.4e-45; identity: 57.08%)
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHPASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHL-SQPRRYHQHCHLPPATASAIAT
        MASKS   LVICL FLFP AI QAPAQPPSLPT       P   P   + P  +           P   P    V   S P  +      PP T   +  
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHPASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHL-SQPRRYHQHCHLPPATASAIAT

Query:  SFTTTTSLSNSFTGHHRCHAP-TLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ-DSTAPAPSP
           T   L            P  +APLPTPPTP+ SPP LAPVA PEL+PAPAPGKHK RR HKHKKHQAPAPAP VPSPPAPPTVVDSQ DSTAPAPSP
Subjt:  SFTTTTSLSNSFTGHHRCHAP-TLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ-DSTAPAPSP

Query:  NLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF
        NLNGGDPMK KVG W RVVLGFI +VA+T F F
Subjt:  NLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF

A0A6J1IGY8 lysine-rich arabinogalactan protein 19-like (E-value: 1.0e-48; identity: 57.38%)
Query:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHP-----ASRPQQLLHPCQHHRL-MLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLPPATA
        MASKS P LVICL FLFP AI QAPAQPPSLPT       P      + P  +  P     +      Q  P   P+   V L  P         PPAT 
Subjt:  MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHP-----ASRPQQLLHPCQHHRL-MLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLPPATA

Query:  SAIATSFTTTTSLSNSFTGHHRCHAP-TLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAP
          +     T   L            P  +APLPTPPTP+ S P LAPVA PEL+PAPAPGKHK RR HKHKKHQAPAPAP VPSPPAPPTVVDSQDSTAP
Subjt:  SAIATSFTTTTSLSNSFTGHHRCHAP-TLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAP

Query:  APSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF
        APSPNLNGGDPMK KVGTW R+VLGFI +VATT F F
Subjt:  APSPNLNGGDPMKGKVGTWTRVVLGFIFLVATTGFKF

SwissProt top hits (E-value, % identity, alignment)
Q9FPR2 Lysine-rich arabinogalactan protein 18 (E-value: 7.5e-04; identity: 39.57%)
Query:  PPATASAIATSFTTTTSLSNSFTGHHRCHAPTLAPLPTPP-------TPMSSPPTLAPVA----APELS-----PAPAPGKHKRRRHHKHKKHQ-APAPA
        P A+AS+   S  +   +S S         PT  P  +PP       +P+SSPP  APVA    AP  +     PAPAP KHK+    K KKHQ APAPA
Subjt:  PPATASAIATSFTTTTSLSNSFTGHHRCHAPTLAPLPTPP-------TPMSSPPTLAPVA----APELS-----PAPAPGKHKRRRHHKHKKHQ-APAPA

Query:  PTVPSPPAPPTVVDSQDSTAPAPSPNLNGGDPMKGKVGT
        P +  PPAPPT     +S A +P P+    D   G   T
Subjt:  PTVPSPPAPPTVVDSQDSTAPAPSPNLNGGDPMKGKVGT

Q9S740 Lysine-rich arabinogalactan protein 19 (E-value: 2.1e-14; identity: 36.62%)
Query:  PSAIGQAPAQPPSLPTGLTTAFHPASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLPPATASAIATSFTTTTSLSNSFTGHHRC
        P +  Q PA P + P  +T    PA +   ++ P             QP   P      +S P         PPA  S   T  +   + +         
Subjt:  PSAIGQAPAQPPSLPTGLTTAFHPASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLPPATASAIATSFTTTTSLSNSFTGHHRC

Query:  HAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKK-HQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNGG---DPMKGKVGTWT
         +P  AP   PP P+S PP  AP +   L PAPAP   K +R HKHK+ H APAPAP  PSPP+PP + D QD TAPAPSPN NGG   + +KG+   W 
Subjt:  HAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKK-HQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNGG---DPMKGKVGTWT

Query:  RVVLGFIFLVATT
           L  +FL+A T
Subjt:  RVVLGFIFLVATT

Arabidopsis top hits (E-value, % identity, alignment)
AT1G68725.1 arabinogalactan protein 19 (E-value: 1.5e-15; identity: 36.62%)
Query:  PSAIGQAPAQPPSLPTGLTTAFHPASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLPPATASAIATSFTTTTSLSNSFTGHHRC
        P +  Q PA P + P  +T    PA +   ++ P             QP   P      +S P         PPA  S   T  +   + +         
Subjt:  PSAIGQAPAQPPSLPTGLTTAFHPASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLPPATASAIATSFTTTTSLSNSFTGHHRC

Query:  HAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKK-HQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNGG---DPMKGKVGTWT
         +P  AP   PP P+S PP  AP +   L PAPAP   K +R HKHK+ H APAPAP  PSPP+PP + D QD TAPAPSPN NGG   + +KG+   W 
Subjt:  HAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKK-HQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNGG---DPMKGKVGTWT

Query:  RVVLGFIFLVATT
           L  +FL+A T
Subjt:  RVVLGFIFLVATT

AT4G37450.1 arabinogalactan protein 18 (E-value: 5.3e-05; identity: 39.57%)
Query:  PPATASAIATSFTTTTSLSNSFTGHHRCHAPTLAPLPTPP-------TPMSSPPTLAPVA----APELS-----PAPAPGKHKRRRHHKHKKHQ-APAPA
        P A+AS+   S  +   +S S         PT  P  +PP       +P+SSPP  APVA    AP  +     PAPAP KHK+    K KKHQ APAPA
Subjt:  PPATASAIATSFTTTTSLSNSFTGHHRCHAPTLAPLPTPP-------TPMSSPPTLAPVA----APELS-----PAPAPGKHKRRRHHKHKKHQ-APAPA

Query:  PTVPSPPAPPTVVDSQDSTAPAPSPNLNGGDPMKGKVGT
        P +  PPAPPT     +S A +P P+    D   G   T
Subjt:  PTVPSPPAPPTVVDSQDSTAPAPSPNLNGGDPMKGKVGT


Sequences
CDS sequence
ATGGCTTCCAAATCATGGCCTTTTTTGGTAATTTGCCTTGGCTTTCTATTCCCTTCCGCGATCGGCCAGGCCCCGGCACAACCACCAAGCTTGCCAACTGGTCTC
ACCACTGCCTTCCACCCCGCCTCCCGACCGCAACAACTCCTCCACCCTTGTCAACACCACCGCCTAATGTTGCACCATCAACAAGCCCAACCATTCCACCACCCC
AACCACCACAAAGTTCACCTATCTCAACCCCGTCGCTACCACCAGCATTGCCACCTCCCCCCTGCTACCGCCTCCGCCATTGCCACCTCCTTTACCACCACCACA
AGTCTCTCCAACTCCTTCACAGGCCACCACCGGTGCCATGCTCCCACCCTAGCACCATTACCAACTCCGCCAACACCAATGTCTTCACCACCAACGCTAGCACCA
GTCGCGGCACCAGAGTTATCCCCTGCACCTGCGCCAGGAAAGCATAAGCGTAGGAGGCATCACAAACATAAGAAGCATCAAGCACCGGCCCCGGCCCCAACCGTC
CCGAGCCCACCAGCACCACCTACAGTGGTGGACTCACAAGATAGCACAGCGCCAGCGCCATCACCAAATCTGAATGGAGGAGACCCAATGAAAGGAAAGGTGGGA
ACATGGACTAGAGTTGTACTGGGATTCATATTTCTAGTTGCTACCACAGGCTTCAAATTCTAA
mRNA sequence
ATGGCTTCCAAATCATGGCCTTTTTTGGTAATTTGCCTTGGCTTTCTATTCCCTTCCGCGATCGGCCAGGCCCCGGCACAACCACCAAGCTTGCCAACTGGTCTC
ACCACTGCCTTCCACCCCGCCTCCCGACCGCAACAACTCCTCCACCCTTGTCAACACCACCGCCTAATGTTGCACCATCAACAAGCCCAACCATTCCACCACCCC
AACCACCACAAAGTTCACCTATCTCAACCCCGTCGCTACCACCAGCATTGCCACCTCCCCCCTGCTACCGCCTCCGCCATTGCCACCTCCTTTACCACCACCACA
AGTCTCTCCAACTCCTTCACAGGCCACCACCGGTGCCATGCTCCCACCCTAGCACCATTACCAACTCCGCCAACACCAATGTCTTCACCACCAACGCTAGCACCA
GTCGCGGCACCAGAGTTATCCCCTGCACCTGCGCCAGGAAAGCATAAGCGTAGGAGGCATCACAAACATAAGAAGCATCAAGCACCGGCCCCGGCCCCAACCGTC
CCGAGCCCACCAGCACCACCTACAGTGGTGGACTCACAAGATAGCACAGCGCCAGCGCCATCACCAAATCTGAATGGAGGAGACCCAATGAAAGGAAAGGTGGGA
ACATGGACTAGAGTTGTACTGGGATTCATATTTCTAGTTGCTACCACAGGCTTCAAATTCTAA
Protein sequence
MASKSWPFLVICLGFLFPSAIGQAPAQPPSLPTGLTTAFHPASRPQQLLHPCQHHRLMLHHQQAQPFHHPNHHKVHLSQPRRYHQHCHLPPATASAIATSFTTTT
SLSNSFTGHHRCHAPTLAPLPTPPTPMSSPPTLAPVAAPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNGGDPMKGKVG
TWTRVVLGFIFLVATTGFKF
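
The CDS and protein sequences above can be cross-checked by translating the CDS in frame. The following is a minimal sketch assuming the standard nuclear genetic code; the `translate` function and the `CODON_TABLE` construction are illustrative and not part of any CuGenDB tooling. Only the final wrapped line of the CDS is used as input here; the full CDS can be pasted in the same way, since whitespace and line breaks are stripped.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Final wrapped line of the CDS; the preceding 630 nt are a multiple of 3,
# so this fragment starts in frame.
cds_tail = "ACATGGACTAGAGTTGTACTGGGATTCATATTTCTAGTTGCTACCACAGGCTTCAAATTCTAA"
print(translate(cds_tail))
```

The printed fragment should match the last line of the protein sequence, TWTRVVLGFIFLVATTGFKF, with the terminal TAA read as the stop codon.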