CuGenDBv2

CSPI06G09710 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI06G09710
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Integral membrane protein hemolysin-III homolog
Genome location: Chr6:8330261..8333663
RNA-Seq Expression: CSPI06G09710
Synteny: CSPI06G09710
Gene Ontology terms: GO:0006351 - transcription, DNA-templated (biological process)
                     GO:0070176 - DRM complex (cellular component)
InterPro domains: NA
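
For scripted use of this record, the genome location string can be split into its parts. The following is a minimal sketch, assuming the `Chr:start..end` layout shown above and 1-based inclusive coordinates (an assumption; the record does not state the convention). `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split a 'Chr6:8330261..8333663'-style string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("Chr6:8330261..8333663")
# Gene span in bp, assuming both endpoints are included (assumption).
span = end - start + 1
print(chrom, start, end, span)
```

Under the inclusive-coordinate assumption the gene region spans 3403 bp.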


Homology
GenBank top hits (e value, % identity, alignment)
XP_004139970.1 uncharacterized protein LOC101211824 isoform X3 [Cucumis sativus] (e value: 9.0e-144; % identity: 99.24)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHC+Q FAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
        VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

XP_022986425.1 uncharacterized protein LOC111484175 [Cucurbita maxima] (e value: 1.3e-110; % identity: 80.68)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGN GKDVPSQEKQLQISAKKTA RDLQNDN   ASNCTGSSPLLKE G  SD IKVSGN       PA+PSHLHSSTSN++NGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNS CD+T+IK +YPNL+KLG LA T HLKSQ KELQNHC   FAPFPMVS +NA  KPSVPHH+GK GIN   AESNFH APST PS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
             GWKNLQWEDRYHQLQLLLNKLDQSDQ+DYLQVL SLSSVELSRHAVELE+RSIQLSLEE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

XP_031743509.1 uncharacterized protein LOC101211824 isoform X1 [Cucumis sativus] (e value: 5.6e-186; % identity: 99.41)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHC+Q FAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN
        VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN

Query:  LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR
        LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR
Subjt:  LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR

XP_031743510.1 uncharacterized protein LOC101211824 isoform X2 [Cucumis sativus] (e value: 4.0e-184; % identity: 99.11)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHC+Q FAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN
        VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSL EEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN

Query:  LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR
        LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR
Subjt:  LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR

XP_038878250.1 uncharacterized protein LOC120070536 [Benincasa hispida] (e value: 4.3e-122; % identity: 86.74)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGN GKDV SQEKQLQISAKKTA RDLQNDN   ASNC GSSPLLKE GT SDIIKVSGNKRA PV PASPSHLHSS SN+ANGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNS C NTS KA+YPNL+KLG LA T HLKSQ KELQNHC Q FAPFPMVS +NAP KPSVPHH+GKCG NLA AESNF SAPST PS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
        VGIP GWKNLQWEDRYHQLQLLLNKLDQSDQ+DYLQVL SLSSVELSRHAVELEKRSIQLSLEE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KAB4 Uncharacterized protein (e value: 2.7e-186; % identity: 99.41)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHC+Q FAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN
        VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSN

Query:  LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR
        LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR
Subjt:  LLHSGLLLGHVSEKSSNRQMQNWSQQHRNWRRRKLSCR

A0A6J1CFY6 uncharacterized protein LOC111011167 (e value: 1.8e-110; % identity: 81.06)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQK IDSKFSEYGHGN GKDVP  EKQLQISAKKTA RDLQN+N   ASNCTGS PLLKE G GSD IKVS NKR   V P SP HLHSSTSN+ANGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNS  D+TSIKA+YPNL+KLG L  TVHLKSQ KEL+NHC   FAPFP+V  +NA   PSVPHH+GK GINLA AESNFHSA ST PS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
        VGIP GWKNLQWEDRYHQLQLLLNKLDQSDQ+DYLQVL SLSSVELSRHAV LEKRSIQLSLEE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

A0A6J1FY79 uncharacterized protein LOC111448407 (e value: 2.7e-109; % identity: 79.92)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGN GKDVPSQEKQLQISAKKTA RDLQNDN   ASNCTGSSPLLKE G  SD IKVSGN       PA+PSHLHSSTSN++NGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKS+ADIGKNS CD+T+IK +YPNL+KLG LA T HLKSQ KELQ  C   FAPFPMVS +NA  KPSVPHH+GK GIN A AESNFH APST PS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
             GWKNLQWEDRYHQLQLLLNKLDQSDQ+DYLQVL SLSSVELSRHAVELE+RSIQLSLEE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

A0A6J1G8C0 uncharacterized protein LOC111451757 (e value: 2.1e-85; % identity: 65.4)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSK S     N GK+ P+ EKQLQISAKKTA RDLQNDN  +ASNCTGSSPLLKE G  SD IKVSGN +  PV+  SP  L SSTSN+  GHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VY+RRKSDADI K+S CD++SIKA+Y   +KLG LA TVHLKSQ KELQ+HC   FAPF MVS +NA  KPSVPH   K GINLA AES+F SA      
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE--EVLDLDISNIQRLDNLVINSLKI
              WKNLQWE RYHQL+LLLNKL+QSDQ+DYLQVL SLSSVELSRHAVELEKRSI LS EE  E+  + + N+  L N  +N++K+
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE--EVLDLDISNIQRLDNLVINSLKI

A0A6J1JE12 uncharacterized protein LOC111484175 (e value: 6.3e-111; % identity: 80.68)
Query:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL
        MVQKSIDSKFSEYGHGN GKDVPSQEKQLQISAKKTA RDLQNDN   ASNCTGSSPLLKE G  SD IKVSGN       PA+PSHLHSSTSN++NGHL
Subjt:  MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHL

Query:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS
        VYVRRKSDADIGKNS CD+T+IK +YPNL+KLG LA T HLKSQ KELQNHC   FAPFPMVS +NA  KPSVPHH+GK GIN   AESNFH APST PS
Subjt:  VYVRRKSDADIGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPS

Query:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE
             GWKNLQWEDRYHQLQLLLNKLDQSDQ+DYLQVL SLSSVELSRHAVELE+RSIQLSLEE
Subjt:  VGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G45250.1 Integral membrane protein hemolysin-III homolog (e value: 5.4e-14; % identity: 55.56)
Query:  LQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNL--VINSLKIT
        L WE+RY  LQ+LLNKL+QSD+ D++Q+L SLSS ELS+HAV+LEKRSIQ SLEE     ++  +  L+ L   +NS+K T
Subjt:  LQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNL--VINSLKIT

AT2G45250.2 Integral membrane protein hemolysin-III homolog (e value: 6.4e-15; % identity: 71.43)
Query:  LQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEE
        L WE+RY  LQ+LLNKL+QSD+ D++Q+L SLSS ELS+HAV+LEKRSIQ SLEEE
Subjt:  LQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEE
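
The % identity value can be related to the alignment block. In BLAST-style output the middle line repeats a residue where query and subject are identical, shows `+` for a conservative substitution, and is blank otherwise. The following is a minimal sketch, assuming that convention and that identity is taken over the full aligned length. `percent_identity` is a hypothetical helper, and the two strings are copied from the AT2G45250.2 alignment above.

```python
def percent_identity(query, midline):
    """Percent of aligned positions where the midline shows an identical residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # Letters on the midline mark identities; '+' and spaces do not count.
    matches = sum(ch.isalpha() for ch in midline)
    return 100.0 * matches / len(midline)

query   = "LQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEE"
midline = "L WE+RY  LQ+LLNKL+QSD+ D++Q+L SLSS ELS+HAV+LEKRSIQ SLEEE"
print(round(percent_identity(query, midline), 2))
```

For this block the sketch counts 40 identical positions out of 56, which agrees with the 71.43 listed for AT2G45250.2.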

AT4G38280.1 BEST Arabidopsis thaliana protein match is: Integral membrane protein hemolysin-III homolog (TAIR:AT2G45250.1) (e value: 1.1e-14; % identity: 52.04)
Query:  SAPSTFPSVGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNL--VINSLKIT
        S+P+  P+   P   K L WE+RY  LQ+LLNKL+QSD+ D++Q+L SLSS ELS+HAV+LEKRSIQ SLEE     ++  +  L+ L   +NSLK T
Subjt:  SAPSTFPSVGIPVGWKNLQWEDRYHQLQLLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNL--VINSLKIT


Sequences
CDS sequence
ATGGTTCAGAAATCCATTGACTCCAAATTCAGTGAATATGGACATGGAAATTTTGGGAAGGACGTGCCTTCTCAGGAAAAGCAACTGCAAATTTCTGCGAAGAAGACAGC
ATCAAGGGATTTGCAAAATGACAATATGGCCATAGCTTCAAATTGTACTGGAAGTTCTCCTCTTTTGAAGGAAATAGGTACCGGTAGTGACATCATTAAAGTTTCTGGTA
ACAAGAGAGCCTTACCAGTCTACCCTGCAAGTCCATCTCATCTCCATTCTTCAACTTCTAATTCTGCAAATGGGCATCTTGTTTATGTCCGTAGAAAATCTGATGCGGAT
ATAGGGAAGAATAGTTCTTGTGATAATACAAGCATAAAAGCTAATTATCCAAATCTAAACAAACTTGGTTCACTAGCTGTAACTGTGCATCTCAAATCCCAGGCTAAGGA
GCTGCAGAATCATTGCTTGCAAACATTTGCTCCTTTTCCAATGGTGTCTTCCGTGAATGCACCTAGAAAACCTTCAGTTCCTCATCACATGGGAAAGTGTGGCATCAATT
TAGCCGTAGCAGAATCGAACTTCCATTCTGCACCTTCTACTTTCCCTTCAGTAGGCATCCCAGTAGGATGGAAAAATTTGCAGTGGGAAGACAGATATCATCAGTTGCAG
TTGTTATTGAATAAATTGGACCAATCAGATCAACGTGATTATCTTCAGGTGCTCGGATCGTTGTCATCAGTTGAACTTAGTAGACATGCAGTTGAATTGGAAAAGAGATC
CATTCAGCTCTCGCTTGAGGAAGAAGTCTTGGATTTGGACATATCAAATATTCAGAGGTTGGATAATTTGGTAATCAATAGCTTAAAGATCACTAGGGAGAGGTTCACGT
TGGCACCTGAAGTTTCAAACCTTCTACATTCTGGATTGCTGCTAGGACATGTGTCAGAGAAGAGCTCAAACAGGCAAATGCAGAATTGGAGTCAACAACACAGAAACTGG
AGAAGGAGAAAATTGAGTTGCAGGTAG
mRNA sequence
ACAATTCTATGATTAATCAATAATATCGTAAATTCAATTCTCTAACAAAAATTCCTATCGCAATGTAATTCTAAAAATCCTTATTAGTCCCAAACGGTGATTTGTAGTTT
TGGTTTGGACCCTTGAGAAACCATAACTGTGATCCATGTGTCTATCTGCAAGGCATTACGCACTATATCCCTCATCTTCTTAAACATTTTGCGTCCACTTTTTATTCTCC
TGAGCAAATTCGAATACATTTTCATGGTAATTTGATGAACGAGCATTGAAGAAATTGTATGACCTTCATAACTGAACGTTTGAATTCCACATGGCTCTCCCTTCTCCATC
ATAGCGTTTCCAATGAGCCAAGATGGTTCAGAAATCCATTGACTCCAAATTCAGTGAATATGGACATGGAAATTTTGGGAAGGACGTGCCTTCTCAGGAAAAGCAACTGC
AAATTTCTGCGAAGAAGACAGCATCAAGGGATTTGCAAAATGACAATATGGCCATAGCTTCAAATTGTACTGGAAGTTCTCCTCTTTTGAAGGAAATAGGTACCGGTAGT
GACATCATTAAAGTTTCTGGTAACAAGAGAGCCTTACCAGTCTACCCTGCAAGTCCATCTCATCTCCATTCTTCAACTTCTAATTCTGCAAATGGGCATCTTGTTTATGT
CCGTAGAAAATCTGATGCGGATATAGGGAAGAATAGTTCTTGTGATAATACAAGCATAAAAGCTAATTATCCAAATCTAAACAAACTTGGTTCACTAGCTGTAACTGTGC
ATCTCAAATCCCAGGCTAAGGAGCTGCAGAATCATTGCTTGCAAACATTTGCTCCTTTTCCAATGGTGTCTTCCGTGAATGCACCTAGAAAACCTTCAGTTCCTCATCAC
ATGGGAAAGTGTGGCATCAATTTAGCCGTAGCAGAATCGAACTTCCATTCTGCACCTTCTACTTTCCCTTCAGTAGGCATCCCAGTAGGATGGAAAAATTTGCAGTGGGA
AGACAGATATCATCAGTTGCAGTTGTTATTGAATAAATTGGACCAATCAGATCAACGTGATTATCTTCAGGTGCTCGGATCGTTGTCATCAGTTGAACTTAGTAGACATG
CAGTTGAATTGGAAAAGAGATCCATTCAGCTCTCGCTTGAGGAAGAAGTCTTGGATTTGGACATATCAAATATTCAGAGGTTGGATAATTTGGTAATCAATAGCTTAAAG
ATCACTAGGGAGAGGTTCACGTTGGCACCTGAAGTTTCAAACCTTCTACATTCTGGATTGCTGCTAGGACATGTGTCAGAGAAGAGCTCAAACAGGCAAATGCAGAATTG
GAGTCAACAACACAGAAACTGGAGAAGGAGAAAATTGAGTTGCAGGTAGGGACAGAGAAGGAAGTAAACAGAAGGTCAAGTGACTGGTCATTCAATTTGAGGAGGAGCGA
TTAAGAGGGACGAGTTAGAGAGCTACCTGAACAGAATGTCTCACTACAAAGAGAGGTTTCATTTTTAAACAAGATGGAAACAGAGAACAGAACTATAACAACTAATCTCG
AGCAAGATATGCAAAATATTGTGGACCTAACAGCTAGAATTGAAGAAAATAAATATTTACAACCAAATCTCTTCTAAATAAGAAGAAGATTACAAGGGGAGCAAATCGAA
GGTATGGATTGCATCAGAAAGAATTATGAGGAGAACGAGAAAGAGTGCAAAGAATTAAATAAAACAATTTCAAGGTTGTCAAGG
Protein sequence
MVQKSIDSKFSEYGHGNFGKDVPSQEKQLQISAKKTASRDLQNDNMAIASNCTGSSPLLKEIGTGSDIIKVSGNKRALPVYPASPSHLHSSTSNSANGHLVYVRRKSDAD
IGKNSSCDNTSIKANYPNLNKLGSLAVTVHLKSQAKELQNHCLQTFAPFPMVSSVNAPRKPSVPHHMGKCGINLAVAESNFHSAPSTFPSVGIPVGWKNLQWEDRYHQLQ
LLLNKLDQSDQRDYLQVLGSLSSVELSRHAVELEKRSIQLSLEEEVLDLDISNIQRLDNLVINSLKITRERFTLAPEVSNLLHSGLLLGHVSEKSSNRQMQNWSQQHRNW
RRRKLSCR
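
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code and translation in frame from the first base. `translate` is a hypothetical helper written for illustration, and only short fragments copied from the start and end of the CDS are used here.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGGTTCAGAAATCCATTGAC"))  # first seven codons of the CDS
print(translate("TGCAGGTAG"))              # last three codons, ending in a stop
```

The two fragments translate to `MVQKSID` and `CR`, consistent with the start and end of the protein sequence listed above. Passing the full CDS (with line breaks) should work the same way, since whitespace is removed before translation.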