CuGenDBv2

Cucsat.G18026 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G18026
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Integral membrane protein hemolysin-III homolog
Genome location: ctg3345:3705797..3709824
RNA-Seq Expression: Cucsat.G18026
Synteny: Cucsat.G18026
Gene Ontology terms:
GO:0006351 - transcription, DNA-templated (biological process)
GO:0070176 - DRM complex (cellular component)
InterPro domains: NA
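
The genome location in this record follows a `contig:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the location string can be parsed and the span length computed as below. The names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tooling.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


contig, start, end = parse_location("ctg3345:3705797..3709824")
print(contig, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene span is longer than the 1017-nt CDS listed further down, which would be consistent with introns or untranslated regions in the gene model.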


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004139970.1 uncharacterized protein LOC101211824 isoform X3 [Cucumis sativus] (e-value: 1.55e-186, % identity: 100)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
        VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

XP_022986425.1 uncharacterized protein LOC111484175 [Cucurbita maxima] (e-value: 7.10e-143, % identity: 81.06)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGN GKDVPSQEKQLQISAKKTA RDLQNDN   ASNCTGSSPLLKE G  SD IKVSGN       PA+PSHLHSSTSN++NGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNS CD+T+IK +YPNL+KLG LA T HLKSQ KELQNHC  AFAPFPMVS +NA  KPSVPHH+GK GIN   AESNFH APST PS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
             GWKNLQWEDRYHQLQLLLNKLDQSDQ+DYLQVL SLSSVELSRHAVELE+RSIQLSLEE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

XP_031743509.1 uncharacterized protein LOC101211824 isoform X1 [Cucumis sativus] (e-value: 1.45e-240, % identity: 100)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN
        VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN

Query:  LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR
        LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR
Subjt:  LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR

XP_031743510.1 uncharacterized protein LOC101211824 isoform X2 [Cucumis sativus] (e-value: 5.46e-238, % identity: 99.7)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN
        VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE VLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN

Query:  LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR
        LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR
Subjt:  LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR

XP_038878250.1 uncharacterized protein LOC120070536 [Benincasa hispida] (e-value: 1.85e-157, % identity: 87.12)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGN GKDV SQEKQLQISAKKTA RDLQNDN   ASNC GSSPLLKE GT SDIIKVSGNKRA PV PASPSHLHSS SN+ANGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNS C NTS KA+YPNL+KLG LA T HLKSQ KELQNHC QAFAPFPMVS +NAP KPSVPHH+GKCG NLA AESNF SAPST PS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
        VGIP GWKNLQWEDRYHQLQLLLNKLDQSDQ+DYLQVL SLSSVELSRHAVELEKRSIQLSLEE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KAB4 Uncharacterized protein (e-value: 7.03e-241, % identity: 100)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN
        VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN

Query:  LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR
        LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR
Subjt:  LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR

A0A6J1CFY6 uncharacterized protein LOC111011167 (e-value: 4.00e-142, % identity: 81.44)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQK IDSKFSEYGHGN GKDVP  EKQLQISAKKTA RDLQN+N   ASNCTGS PLLKE G GSD IKVS NKR   V P SP HLHSSTSN+ANGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNS  D+TSIKA+YPNL+KLG L  TVHLKSQ KEL+NHC  AFAPFP+V  +NA   PSVPHH+GK GINLA AESNFHSA ST PS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
        VGIP GWKNLQWEDRYHQLQLLLNKLDQSDQ+DYLQVL SLSSVELSRHAV LEKRSIQLSLEE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

A0A6J1FY79 uncharacterized protein LOC111448407 (e-value: 4.62e-141, % identity: 80.3)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGN GKDVPSQEKQLQISAKKTA RDLQNDN   ASNCTGSSPLLKE G  SD IKVSGN       PA+PSHLHSSTSN++NGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKS+ADIGKNS CD+T+IK +YPNL+KLG LA T HLKSQ KELQ  C  AFAPFPMVS +NA  KPSVPHH+GK GIN A AESNFH APST PS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
             GWKNLQWEDRYHQLQLLLNKLDQSDQ+DYLQVL SLSSVELSRHAVELE+RSIQLSLEE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

A0A6J1G8C0 uncharacterized protein LOC111451757 (e-value: 1.56e-110, % identity: 69.7)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSK S     N GK+ P+ EKQLQISAKKTA RDLQNDN  +ASNCTGSSPLLKE G  SD IKVSGN +  PV+  SP  L SSTSN+  GHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VY+RRKSDADI K+S CD++SIKA+Y +  KLG LA TVHLKSQ KELQ+HC  AFAPF MVS +NA  KPSVPH   K GINLA AES+F SA      
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
              WKNLQWE RYHQL+LLLNKL+QSDQ+DYLQVL SLSSVELSRHAVELEKRSI LS EE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

A0A6J1JE12 uncharacterized protein LOC111484175 (e-value: 3.44e-143, % identity: 81.06)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGN GKDVPSQEKQLQISAKKTA RDLQNDN   ASNCTGSSPLLKE G  SD IKVSGN       PA+PSHLHSSTSN++NGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNS CD+T+IK +YPNL+KLG LA T HLKSQ KELQNHC  AFAPFPMVS +NA  KPSVPHH+GK GIN   AESNFH APST PS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
             GWKNLQWEDRYHQLQLLLNKLDQSDQ+DYLQVL SLSSVELSRHAVELE+RSIQLSLEE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT2G45250.1 Integral membrane protein hemolysin-III homolog (e-value: 5.4e-14, % identity: 55.56)
Query:  LQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNL--VINSLKIT
        L WE+RY  LQ+LLNKL+QSD+ D++Q+L SLSS ELS+HAV+LEKRSIQ SLEE     ++  +  L+ L   +NS+K T
Subjt:  LQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNL--VINSLKIT

AT2G45250.2 Integral membrane protein hemolysin-III homolog (e-value: 6.4e-15, % identity: 71.43)
Query:  LQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEE
        L WE+RY  LQ+LLNKL+QSD+ D++Q+L SLSS ELS+HAV+LEKRSIQ SLEEE
Subjt:  LQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEE

AT4G38280.1 BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog (TAIR:AT2G45250.1) (e-value: 1.1e-14, % identity: 52.04)
Query:  SAPSTFPSVGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNL--VINSLKIT
        S+P+  P+   P   K L WE+RY  LQ+LLNKL+QSD+ D++Q+L SLSS ELS+HAV+LEKRSIQ SLEE     ++  +  L+ L   +NSLK T
Subjt:  SAPSTFPSVGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNL--VINSLKIT
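
The % identity values listed with each hit can be cross-checked against the middle line of the corresponding alignment. The sketch below assumes the usual BLAST-style convention, in which a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is a hypothetical helper. Because leading or trailing spaces on the middle line may be lost when the page is copied as text, the alignment length can be passed explicitly (for example, the length of the Query line).

```python
def percent_identity(midline, aligned_length=None):
    """Percent of alignment columns whose midline character is a letter.

    Assumes BLAST-style midlines: letter = identity, '+' = positive,
    space = mismatch or gap.
    """
    if aligned_length is None:
        aligned_length = len(midline)
    if aligned_length <= 0:
        raise ValueError("alignment length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Middle line of the AT2G45250.2 alignment, copied from this record.
midline = "L WE+RY  LQ+LLNKL+QSD+ D++Q+L SLSS ELS+HAV+LEKRSIQ SLEEE"
print(f"{percent_identity(midline):.2f}")
```

For the AT2G45250.2 middle line this counts 40 identical positions over 56 columns, in line with the 71.43 reported for that hit.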


Sequences
CDS sequence
ATGGTTCAGAAATCCATTGACTCCAAATTCAGTGAATATGGACATGGAAATTTTGGGAAGGACGTGCCTTCTCAGGAAAAGCAACTGCAAATTTCTGCGAAGAAGACAGC
ATCAAGGGATTTGCAAAATGATAATATGGCCATAGCTTCAAATTGTACTGGAAGTTCTCCTCTTTTGAAGGAAATAGGTACCGGTAGTGACATCATTAAAGTTTCTGGTA
ACAAGAGAGCCTTACCAGTCTACCCTGCAAGTCCATCTCATCTCCATTCTTCAACTTCTAATTCTGCAAATGGGCATCTTGTTTATGTCCGTAGAAAATCTGATGCGGAT
ATAGGGAAGAATAGTTCTTGTGATAATACAAGCATAAAAGCTAATTATCCAAATCTGAACAAACTTGGTTCACTAGCTGTAACTGTGCATCTCAAATCCCAGGCTAAGGA
GCTGCAGAATCATTGCGTGCAAGCATTTGCTCCTTTTCCAATGGTGTCTTCCGTGAATGCACCTAGAAAACCTTCAGTTCCTCATCACATGGGAAAGTGTGGCATCAATT
TAGCCGTAGCAGAATCGAACTTCCATTCTGCACCTTCTACTTTCCCTTCAGTAGGCATCCCAGTAGGATGGAAAAATTTGCAGTGGGAAGACAGATATCATCAGTTGCAG
TTGTTATTGAATAAATTGGACCAATCAGATCAACGTGATTATCTTCAGGTGCTCGGATCGTTGTCATCGGTTGAACTTAGTAGACATGCAGTTGAATTGGAAAAGAGATC
CATTCAGCTCTCGCTTGAGGAAGAAGTCTTGGATTTGGACATATCAAATATTCAGAGGTTGGATAATTTGGTAATCAATAGCTTAAAGATCACTAGGGAGAGGTTCACGT
TGGCACCTGAAGTTTCAAACCTTCTACATTCTGGATTGCTGCTAGGACATGTGTCAGAGAAGAGCTCAAACAGGCAAATGCAGAATTGGAGTCAACAACACAGAAACTGG
AGAAGGAGAAAATTGAGTTGCAGGTAG
mRNA sequence
ATGGTTCAGAAATCCATTGACTCCAAATTCAGTGAATATGGACATGGAAATTTTGGGAAGGACGTGCCTTCTCAGGAAAAGCAACTGCAAATTTCTGCGAAGAAGACAGC
ATCAAGGGATTTGCAAAATGATAATATGGCCATAGCTTCAAATTGTACTGGAAGTTCTCCTCTTTTGAAGGAAATAGGTACCGGTAGTGACATCATTAAAGTTTCTGGTA
ACAAGAGAGCCTTACCAGTCTACCCTGCAAGTCCATCTCATCTCCATTCTTCAACTTCTAATTCTGCAAATGGGCATCTTGTTTATGTCCGTAGAAAATCTGATGCGGAT
ATAGGGAAGAATAGTTCTTGTGATAATACAAGCATAAAAGCTAATTATCCAAATCTGAACAAACTTGGTTCACTAGCTGTAACTGTGCATCTCAAATCCCAGGCTAAGGA
GCTGCAGAATCATTGCGTGCAAGCATTTGCTCCTTTTCCAATGGTGTCTTCCGTGAATGCACCTAGAAAACCTTCAGTTCCTCATCACATGGGAAAGTGTGGCATCAATT
TAGCCGTAGCAGAATCGAACTTCCATTCTGCACCTTCTACTTTCCCTTCAGTAGGCATCCCAGTAGGATGGAAAAATTTGCAGTGGGAAGACAGATATCATCAGTTGCAG
TTGTTATTGAATAAATTGGACCAATCAGATCAACGTGATTATCTTCAGGTGCTCGGATCGTTGTCATCGGTTGAACTTAGTAGACATGCAGTTGAATTGGAAAAGAGATC
CATTCAGCTCTCGCTTGAGGAAGAAGTCTTGGATTTGGACATATCAAATATTCAGAGGTTGGATAATTTGGTAATCAATAGCTTAAAGATCACTAGGGAGAGGTTCACGT
TGGCACCTGAAGTTTCAAACCTTCTACATTCTGGATTGCTGCTAGGACATGTGTCAGAGAAGAGCTCAAACAGGCAAATGCAGAATTGGAGTCAACAACACAGAAACTGG
AGAAGGAGAAAATTGAGTTGCAGGTAG
Protein sequence
MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHLVYVRRKSDAD
IGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCVQAFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPSVGIPVGWKNLQWEDRYHQLQ
LLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSNLLHSGLLLGHVSEKSSNRQMQNWSQQHRNW
RRRKLSCR
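
A quick consistency check between the CDS and protein sequences listed in this record is that the CDS length is a multiple of three and, once a terminal stop codon is set aside, has one codon per residue. The sketch below is a minimal illustration under the assumption of the standard stop codons (TAA, TAG, TGA). The function name `cds_matches_protein_length` is hypothetical, and the check compares lengths only; it does not translate the sequence.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def cds_matches_protein_length(cds, protein):
    """True if the CDS has one codon per residue, plus an optional stop."""
    # Remove line breaks and spaces left over from wrapped sequence text.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split())
    if len(cds) % 3 != 0:
        return False
    has_stop = cds[-3:] in STOP_CODONS
    coding_codons = len(cds) // 3 - (1 if has_stop else 0)
    return coding_codons == len(protein)


# Last 33 nt of the CDS and last 10 residues of the protein in this record.
cds_tail = "AACTGGAGAAGGAGAAAATTGAGTTGCAGGTAG"
protein_tail = "NWRRRKLSCR"
print(cds_matches_protein_length(cds_tail, protein_tail))
```

The same function can be applied to the full CDS and protein sequences by pasting the wrapped lines as they appear, since whitespace is removed before the comparison.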