CuGenDBv2

Cla97C09G162010 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C09G162010
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: actin-related protein 8-like
Genome location: Cla97Chr09:101085..101741
RNA-Seq Expression: Cla97C09G162010
Synteny: Cla97C09G162010
Gene Ontology terms: NA
InterPro domains: IPR008480 - Protein of unknown function DUF761, plant
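
The genome location in this record is written as `chromosome:start..end`. As a minimal sketch, the helper below parses that string and reports the span length. The function names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'chromosome:start..end' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


if __name__ == "__main__":
    seqid, start, end = parse_location("Cla97Chr09:101085..101741")
    print(seqid, span_length(start, end))  # Cla97Chr09 657
```

Under the inclusive-coordinate assumption the span is 657 bp.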


Homology
GenBank top hits (e value, % identity, alignment)
XP_022953927.1 uncharacterized protein LOC111456334 [Cucurbita moschata]; e value 1.0e-79; % identity 74.43
Query:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFP
        M+NNLPVI+K++W LVRVAYFLLRKGISKSKL+LDLNLMMKRGKIAGKAI+NLMFHHHYHG A+P+S   SA QLP  VG D+YEF+CS+SPAFP+LHFP
Subjt:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFP

Query:  GFHLGNGKRRRNQNH-NSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAAD
         FH    KRRRNQNH +SFFACAHAP TLDDDAA VNAVKA VEI N H  ASSPS V  S            VRQLRITDSPFPL DANAD  VDKAAD
Subjt:  GFHLGNGKRRRNQNH-NSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAAD

Query:  EFISRFYKELRLQKTAEEN
        E+ISRFYKELRLQ TA+EN
Subjt:  EFISRFYKELRLQKTAEEN

XP_022964083.1 uncharacterized protein LOC111464220 [Cucurbita moschata]; e value 1.9e-91; % identity 84.51
Query:  IAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYH-GAASPTST-ALSATQLPFPVGADEYEFSCSNSPAFPTLHFPGFHLG
        +AKKVWNLVRV YFLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHHYH  AASP+S+   SA QLPFPVGADEYEFSCSNSPAFP  H       
Subjt:  IAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYH-GAASPTST-ALSATQLPFPVGADEYEFSCSNSPAFPTLHFPGFHLG

Query:  NGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAADEFISRF
         GKRRRNQNHNSFFACAHAP+TLDDDAA VNAV AVVEILNNH  ASS  P+PASPALPGFG TPRRVRQLRITDSPFPLQDANADPLVDKAADEFISRF
Subjt:  NGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAADEFISRF

Query:  YKELRLQKTAEEN
        YKELRLQKTA+EN
Subjt:  YKELRLQKTAEEN

XP_023000392.1 uncharacterized protein LOC111494650 [Cucurbita maxima]; e value 5.3e-97; % identity 85
Query:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTST-ALSATQLPFPVGADEYEFSCSNSPAFPTLHF
        M+NNLP++AKKVWNLVRVAYFLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHHYH AASP+S+   SA QLPFP+GADEYEFSCSNSPAFP  H 
Subjt:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTST-ALSATQLPFPVGADEYEFSCSNSPAFPTLHF

Query:  PGFHLGNGKRRRNQNHNSFFACAHAPETLDDD-AATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAA
               GKRRRNQNHNSFFACAHAP+TLDDD AA VNAV AVVEILNNH  ASS  PVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAA
Subjt:  PGFHLGNGKRRRNQNHNSFFACAHAPETLDDD-AATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAA

Query:  DEFISRFYKELRLQKTAEEN
        DEFISRFYKELRLQKTA+EN
Subjt:  DEFISRFYKELRLQKTAEEN

XP_023514516.1 uncharacterized protein LOC111778773 [Cucurbita pepo subsp. pepo]; e value 1.5e-91; % identity 83.26
Query:  IAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYH--GAASPTSTA--LSATQLPFPVGADEYEFSCSNSPAFPTLHFPGFH
        +AKKVWNLVRVAYFLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHHYH   AASP+S++   S  QLPFP+GADEYEFSCSNSPAFP  H     
Subjt:  IAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYH--GAASPTSTA--LSATQLPFPVGADEYEFSCSNSPAFPTLHFPGFH

Query:  LGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAADEFIS
           GKRRRNQNHN FFACAHAP+TLDDDAA VNAV AVVEILNNH  ASS  P+PASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAADEFIS
Subjt:  LGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAADEFIS

Query:  RFYKELRLQKTAEEN
        RFYKELRLQKTA+EN
Subjt:  RFYKELRLQKTAEEN

XP_023548437.1 uncharacterized protein LOC111807092 [Cucurbita pepo subsp. pepo]; e value 1.7e-79; % identity 74.43
Query:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFP
        M+NNLPVI+K++W LVRVAYFLLRKGISKSKL+LDLNLMMKRGKIAGKAI+NLMFHHHYHG A+P+S   SA QLP  VG D+YEF+CS+SPAFP+LHFP
Subjt:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFP

Query:  GFHLGNGKRRRNQNH-NSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAAD
         FH    KRRRNQNH +SFFACAHAP TLDDDAA VNAVKA VEI N H  ASSPS V              RVRQLRITDSPFPL DANAD  VDKAAD
Subjt:  GFHLGNGKRRRNQNH-NSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAAD

Query:  EFISRFYKELRLQKTAEEN
        E+ISRFYKELRLQ TA+EN
Subjt:  EFISRFYKELRLQKTAEEN

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K800 Uncharacterized protein; e value 6.2e-67; % identity 67.42
Query:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKL-VLDLNLMMKRGKIAGKAISNLMFHHHY--HGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTL
        M++N+PVIAKKVWNLVRVAYFLLRKGISKSK+ +LDLNLMMKRGKIAGKAISNLMF HHY  H    P        QLPF V AD+YEFSCSN+P+    
Subjt:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKL-VLDLNLMMKRGKIAGKAISNLMFHHHY--HGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTL

Query:  HFPGFHLGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKA
            +H    +RR N NHNSFFACAHAP+TLDDD  T+NA+KAVV+ILNN    + P   P+SPA       P  VRQLRITDSPFPLQD NADPLVDKA
Subjt:  HFPGFHLGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKA

Query:  ADEFISRFYKELRLQKTAEEN
        ADEFISRFYKEL LQKT + N
Subjt:  ADEFISRFYKELRLQKTAEEN

A0A6J1GPL9 uncharacterized protein LOC111456334; e value 4.9e-80; % identity 74.43
Query:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFP
        M+NNLPVI+K++W LVRVAYFLLRKGISKSKL+LDLNLMMKRGKIAGKAI+NLMFHHHYHG A+P+S   SA QLP  VG D+YEF+CS+SPAFP+LHFP
Subjt:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFP

Query:  GFHLGNGKRRRNQNH-NSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAAD
         FH    KRRRNQNH +SFFACAHAP TLDDDAA VNAVKA VEI N H  ASSPS V  S            VRQLRITDSPFPL DANAD  VDKAAD
Subjt:  GFHLGNGKRRRNQNH-NSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAAD

Query:  EFISRFYKELRLQKTAEEN
        E+ISRFYKELRLQ TA+EN
Subjt:  EFISRFYKELRLQKTAEEN

A0A6J1HJS6 uncharacterized protein LOC111464220; e value 9.4e-92; % identity 84.51
Query:  IAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYH-GAASPTST-ALSATQLPFPVGADEYEFSCSNSPAFPTLHFPGFHLG
        +AKKVWNLVRV YFLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHHYH  AASP+S+   SA QLPFPVGADEYEFSCSNSPAFP  H       
Subjt:  IAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYH-GAASPTST-ALSATQLPFPVGADEYEFSCSNSPAFPTLHFPGFHLG

Query:  NGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAADEFISRF
         GKRRRNQNHNSFFACAHAP+TLDDDAA VNAV AVVEILNNH  ASS  P+PASPALPGFG TPRRVRQLRITDSPFPLQDANADPLVDKAADEFISRF
Subjt:  NGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAADEFISRF

Query:  YKELRLQKTAEEN
        YKELRLQKTA+EN
Subjt:  YKELRLQKTAEEN

A0A6J1JMV1 uncharacterized protein LOC111488328; e value 9.2e-79; % identity 73.85
Query:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFP
        M+NNLPVI+K++W LVRVAYFLLRKGISKSKL+LDLNLMMKRGKIAGKAI+NLMFHHHYHG A+P+S   SA QLP  VG D+YEF+CS+SPAFP+LHFP
Subjt:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFP

Query:  GFHLGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAADE
         FH    KRRRNQNH SFFACAHAP+TLDDDAA   AVKA VEI N H  ASSPS V  S            VRQLRITDSPFPL DANAD  VDKAADE
Subjt:  GFHLGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAADE

Query:  FISRFYKELRLQKTAEEN
        +ISRFYKELRLQ TA+EN
Subjt:  FISRFYKELRLQKTAEEN

A0A6J1KDI8 uncharacterized protein LOC111494650; e value 2.6e-97; % identity 85
Query:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTST-ALSATQLPFPVGADEYEFSCSNSPAFPTLHF
        M+NNLP++AKKVWNLVRVAYFLLRKGISKSKL+LDLNLM KRGK+AGKAISNLMFHHHYH AASP+S+   SA QLPFP+GADEYEFSCSNSPAFP  H 
Subjt:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTST-ALSATQLPFPVGADEYEFSCSNSPAFPTLHF

Query:  PGFHLGNGKRRRNQNHNSFFACAHAPETLDDD-AATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAA
               GKRRRNQNHNSFFACAHAP+TLDDD AA VNAV AVVEILNNH  ASS  PVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAA
Subjt:  PGFHLGNGKRRRNQNHNSFFACAHAPETLDDD-AATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAA

Query:  DEFISRFYKELRLQKTAEEN
        DEFISRFYKELRLQKTA+EN
Subjt:  DEFISRFYKELRLQKTAEEN

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G52140.1 unknown protein; e value 1.7e-40; % identity 46.61
Query:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYH-GAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHF
        MD N+P I+KK+WN+VR   +++RKG+SK+KL+ D N  +KRGK       NLMFH      A S  S AL+AT         EYEFSCSN+P +     
Subjt:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYH-GAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHF

Query:  PGFHLGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSP----VPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPL--
          F   N    R ++HN+ F C   P+TLDDD A   A +AV+E+LN  G   + +P    V  SP  PGFG+TP  VR LR+TDSPFPL   N D    
Subjt:  PGFHLGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSP----VPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPL--

Query:  -VDKAADEFISRFYKELRLQK
         VDKAAD+FI +FYK L  QK
Subjt:  -VDKAADEFISRFYKELRLQK

AT3G16330.1 unknown protein; e value 1.8e-34; % identity 43.67
Query:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFP
        M+ N+  I+KK+ N+VR   ++L KGISK KL+ D N  +KRGK       NLMFH+       P S   S  Q       +EYEFSCS++P +    FP
Subjt:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFP

Query:  GFHLGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPA---------SPALPGFGRTPRRVRQLRITDSPFPLQDAN--
         F++   K++   +HNS F+C  AP TLDDD    +  +AV+E+LN+ G     S  PA         SP LPGFGR+   VR LR+TDSPFPL++    
Subjt:  GFHLGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPA---------SPALPGFGRTPRRVRQLRITDSPFPLQDAN--

Query:  ADPLVDKAADEFISRFYKELRLQKTAEEN
        A+  VDKAADEFI +FYK L  QK   E+
Subjt:  ADPLVDKAADEFISRFYKELRLQKTAEEN

AT4G29110.1 unknown protein; e value 1.2e-19; % identity 35
Query:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFP
        M+ N  V AK++W +VR+ + +L+ G  K+KL+LDLNLM+KRG    KAI+NL       G+   +S++      PF               AF      
Subjt:  MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFP

Query:  GFHLGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHG------AASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQD-ANADPL
           +   KRR +  ++            +++ A   AVK V E+L  +        ++  SP+  SPA          VRQLR+TDSPFPL D  + D +
Subjt:  GFHLGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHG------AASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQD-ANADPL

Query:  VDKAADEFISRFYKELRLQK
        VDKAA+EFI +FYK L+LQK
Subjt:  VDKAADEFISRFYKELRLQK

AT4G32860.1 unknown protein; e value 1.1e-04; % identity 26.98
Query:  VIAKKVWNLVRVAYFLLRK--GISKSKLV--LDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGAD-EYEFSCSNSP---AFPTLH
        V  KK+ +L ++  F ++K    S+ KL+  LD +L+ KRGKI  K+++  +   H      P+      +    PV    EYEFSCS++P   ++ T  
Subjt:  VIAKKVWNLVRVAYFLLRK--GISKSKLV--LDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGAD-EYEFSCSNSP---AFPTLH

Query:  FPGFHLGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAA
          G       RR N +HN         +         N +  V + + +   A++  P  AS                    S   ++  +    VD+AA
Subjt:  FPGFHLGNGKRRRNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAA

Query:  DEFISRFYKELRLQK
        +EFI  FY++LRLQK
Subjt:  DEFISRFYKELRLQK
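
The alignments above follow the usual BLAST pairwise layout, where the middle line shows a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. As a rough sketch, the function below tallies those three cases for one alignment segment. The name `midline_stats` is hypothetical, and the code assumes the middle line has already been cut to the aligned columns (the 8-character label offset removed, internal and trailing spaces kept).

```python
def midline_stats(midline: str) -> dict[str, int]:
    """Count identities, positives and columns in a BLAST-style middle line."""
    identities = sum(ch.isalpha() for ch in midline)  # letters mark identical residues
    conservative = midline.count("+")                 # '+' marks similar residues
    return {
        "identities": identities,
        "positives": identities + conservative,
        "columns": len(midline),
    }


if __name__ == "__main__":
    # Middle line of the last segment of the AT4G32860.1 alignment.
    stats = midline_stats("+EFI  FY++LRLQK")
    print(stats)  # {'identities': 10, 'positives': 13, 'columns': 15}
```

The % identity values listed for each hit cover the whole alignment, so a single segment's ratio will generally differ from them.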


Sequences
CDS sequence
ATGGATAATAATCTTCCAGTGATAGCAAAAAAGGTTTGGAATTTGGTTCGTGTTGCTTATTTCCTTCTCCGTAAAGGCATCTCCAAGAGCAAACTCGTGCTCGACCTCAA
TCTCATGATGAAACGTGGCAAAATCGCCGGCAAAGCCATTAGCAATCTCATGTTTCACCACCACTACCACGGCGCCGCTTCCCCTACCTCCACCGCCCTTTCCGCCACTC
AACTCCCCTTCCCCGTCGGTGCTGACGAGTACGAATTCAGCTGCAGCAATAGCCCCGCTTTCCCCACCCTCCACTTCCCTGGCTTCCACCTTGGCAATGGCAAACGCCGC
CGTAACCAAAACCACAACTCCTTCTTCGCCTGTGCTCACGCGCCTGAAACACTTGACGACGATGCCGCTACCGTTAATGCCGTTAAGGCCGTTGTCGAGATTCTTAACAA
CCACGGAGCCGCGTCCTCACCATCCCCTGTCCCGGCTTCACCAGCTCTCCCAGGATTCGGCCGGACTCCGAGGAGAGTCCGCCAGCTCAGGATAACCGACTCGCCGTTCC
CTCTACAAGACGCCAACGCGGATCCATTAGTGGACAAAGCGGCCGATGAATTTATCAGTAGGTTTTACAAGGAGCTCAGGCTCCAAAAAACGGCCGAGGAGAACTGA
mRNA sequence
ATGGATAATAATCTTCCAGTGATAGCAAAAAAGGTTTGGAATTTGGTTCGTGTTGCTTATTTCCTTCTCCGTAAAGGCATCTCCAAGAGCAAACTCGTGCTCGACCTCAA
TCTCATGATGAAACGTGGCAAAATCGCCGGCAAAGCCATTAGCAATCTCATGTTTCACCACCACTACCACGGCGCCGCTTCCCCTACCTCCACCGCCCTTTCCGCCACTC
AACTCCCCTTCCCCGTCGGTGCTGACGAGTACGAATTCAGCTGCAGCAATAGCCCCGCTTTCCCCACCCTCCACTTCCCTGGCTTCCACCTTGGCAATGGCAAACGCCGC
CGTAACCAAAACCACAACTCCTTCTTCGCCTGTGCTCACGCGCCTGAAACACTTGACGACGATGCCGCTACCGTTAATGCCGTTAAGGCCGTTGTCGAGATTCTTAACAA
CCACGGAGCCGCGTCCTCACCATCCCCTGTCCCGGCTTCACCAGCTCTCCCAGGATTCGGCCGGACTCCGAGGAGAGTCCGCCAGCTCAGGATAACCGACTCGCCGTTCC
CTCTACAAGACGCCAACGCGGATCCATTAGTGGACAAAGCGGCCGATGAATTTATCAGTAGGTTTTACAAGGAGCTCAGGCTCCAAAAAACGGCCGAGGAGAACTGA
Protein sequence
MDNNLPVIAKKVWNLVRVAYFLLRKGISKSKLVLDLNLMMKRGKIAGKAISNLMFHHHYHGAASPTSTALSATQLPFPVGADEYEFSCSNSPAFPTLHFPGFHLGNGKRR
RNQNHNSFFACAHAPETLDDDAATVNAVKAVVEILNNHGAASSPSPVPASPALPGFGRTPRRVRQLRITDSPFPLQDANADPLVDKAADEFISRFYKELRLQKTAEEN
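
The protein sequence corresponds to the CDS read codon by codon. As an illustrative sketch, the snippet below translates a nucleotide string with the standard genetic code and checks the first and last codons of the CDS listed above. The names `CODON_TABLE` and `translate` are hypothetical, and the use of the standard nuclear code is an assumption, since the page does not name a translation table.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGGATAATAATCTTCCAGTGATAGCAAAA"))  # first 30 nt: MDNNLPVIAK
    print(translate("GCCGAGGAGAACTGA"))                 # last 15 nt: AEEN
```

Joining the six CDS lines into one string and passing it to `translate` should give the full protein in the same way.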