CuGenDBv2

Cucsat.G2633 (gene) of Cucumber (B10) v3 genome

Gene ID: Cucsat.G2633
Organism: Cucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
Description: Unknown protein
Genome location: ctg1006:481750..483560
RNA-Seq Expression: Cucsat.G2633
Synteny: Cucsat.G2633
Gene Ontology terms: NA
InterPro domains: NA
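
The genome location in this record follows a `contig:start..end` pattern. A minimal Python sketch for splitting such a string into its parts is given here. It assumes 1-based inclusive coordinates, which the record does not state, and `Location` and `parse_location` are hypothetical names introduced only for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Number of bases covered, counting both endpoints.
        return self.end - self.start + 1


# Hypothetical pattern for strings such as "ctg1006:481750..483560".
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse 'contig:start..end' into a Location, or raise ValueError."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["contig"], int(match["start"]), int(match["end"]))


loc = parse_location("ctg1006:481750..483560")
print(loc.contig, loc.start, loc.end, loc.span)  # ctg1006 481750 483560 1811
```

Under the inclusive-coordinate assumption the gene spans 1811 bases on contig ctg1006.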


Homology
GenBank top hits
XP_004139833.1 uncharacterized protein LOC101214550 [Cucumis sativus] (e-value: 4.05e-163, % identity: 100)
Query:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK
        MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK
Subjt:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPM
        FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPM
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPM

Query:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
Subjt:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

XP_008447115.1 PREDICTED: uncharacterized protein LOC103489640 [Cucumis melo] (e-value: 6.17e-158, % identity: 98.77)
Query:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK
        MAAEVSSLIRVLAGYK+DDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK
Subjt:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS-STNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAP
        FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS STNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAA KEIQEEEAAEIRNSAAP
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS-STNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAP

Query:  MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
Subjt:  MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

XP_022952111.1 uncharacterized protein LOC111454876 [Cucurbita moschata] (e-value: 2.73e-132, % identity: 84.11)
Query:  MAAEVSSLIRVLAG-----YKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMIN
        MAA+VS+LIRVLAG     Y D+DNRT LGNGQ+ T LVTRDLLGQSSNL  SQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSK  QM N
Subjt:  MAAEVSSLIRVLAG-----YKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMIN

Query:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS-----STNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEE-
        QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS     STNYASVCTLDKVKSALERADKELVKKRSSLWKS SSPSYSSSSS+AA  KEIQEEE 
Subjt:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS-----STNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEE-

Query:  AAEIRNSAA----PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        A E RN AA    PMAVGC GCLSYVLV KNNPRCPRCNSVVPL ++KKPRIDLNMSI
Subjt:  AAEIRNSAA----PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

XP_022969452.1 uncharacterized protein LOC111468450 [Cucurbita maxima] (e-value: 5.42e-133, % identity: 86.9)
Query:  MAAEVSSLIRVLAG--YKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTE
        MAA+VSSLIRVLAG  Y D+DNRT LGNGQ+ T LVTRDLLGQSSNL  SQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QM NQTE
Subjt:  MAAEVSSLIRVLAG--YKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTE

Query:  SKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS-PSS----TNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEE-AAE
        SKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS PSS    TNYASVCTLDKVKSALERADKELVKKRSSLWKS SSPSYSSSSS+AA  KEIQEEE A E
Subjt:  SKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS-PSS----TNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEE-AAE

Query:  IRN-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
         RN +AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL ++KKPRIDLNMSI
Subjt:  IRN-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

XP_038887388.1 LOW QUALITY PROTEIN: uncharacterized protein LOC120077543 [Benincasa hispida] (e-value: 2.33e-138, % identity: 88.48)
Query:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK
        MAAEVSSLIRVLAGYKD+DNRTALGNGQ+STALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNS+SKQ +MINQTESK
Subjt:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPM
        FQDLNFPPSPSKRTLNL NETSLDLKLTSSPS+    SVCTLDKVKSALERADKELVKKRSS       PS        +A KEIQEEEAAEIRNSAAPM
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPM

Query:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
Subjt:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

TrEMBL top hits
A0A0A0K3J9 Uncharacterized protein (e-value: 1.96e-163, % identity: 100)
Query:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK
        MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK
Subjt:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPM
        FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPM
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPM

Query:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
Subjt:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

A0A1S3BGM6 uncharacterized protein LOC103489640 (e-value: 2.99e-158, % identity: 98.77)
Query:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK
        MAAEVSSLIRVLAGYK+DDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK
Subjt:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS-STNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAP
        FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS STNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAA KEIQEEEAAEIRNSAAP
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS-STNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAP

Query:  MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
Subjt:  MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

A0A6J1EXD9 uncharacterized protein LOC111437042 (e-value: 3.29e-131, % identity: 85.19)
Query:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK
        MAA+VSSLIRVLAGYKD DNR AL   ++STAL TRDLLGQSSNL DSQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QM NQTESK
Subjt:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESK

Query:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPM
         QDLNFPPSPSKRTLNLFNETSLDL LTSS   TNYASVCTLDKVKSALERADKEL+KKRS+LWKS SSPS         A KEIQEEEAAE R SAAPM
Subjt:  FQDLNFPPSPSKRTLNLFNETSLDLKLTSSPSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPM

Query:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        AVGCPGCLSYVLV KNNPRCPRCNSVVPLP+IKKPRIDLN+SI
Subjt:  AVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

A0A6J1GJC7 uncharacterized protein LOC111454876 (e-value: 1.32e-132, % identity: 84.11)
Query:  MAAEVSSLIRVLAG-----YKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMIN
        MAA+VS+LIRVLAG     Y D+DNRT LGNGQ+ T LVTRDLLGQSSNL  SQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSK  QM N
Subjt:  MAAEVSSLIRVLAG-----YKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMIN

Query:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS-----STNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEE-
        QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS     STNYASVCTLDKVKSALERADKELVKKRSSLWKS SSPSYSSSSS+AA  KEIQEEE 
Subjt:  QTESKFQDLNFPPSPSKRTLNLFNETSLDLKLTSSPS-----STNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEE-

Query:  AAEIRNSAA----PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
        A E RN AA    PMAVGC GCLSYVLV KNNPRCPRCNSVVPL ++KKPRIDLNMSI
Subjt:  AAEIRNSAA----PMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

A0A6J1HWE1 uncharacterized protein LOC111468450 (e-value: 2.62e-133, % identity: 86.9)
Query:  MAAEVSSLIRVLAG--YKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTE
        MAA+VSSLIRVLAG  Y D+DNRT LGNGQ+ T LVTRDLLGQSSNL  SQELDLDLQVP+GWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQ QM NQTE
Subjt:  MAAEVSSLIRVLAG--YKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTE

Query:  SKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS-PSS----TNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEE-AAE
        SKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS PSS    TNYASVCTLDKVKSALERADKELVKKRSSLWKS SSPSYSSSSS+AA  KEIQEEE A E
Subjt:  SKFQDLNFPPSPSKRTLNLFNETSLDLKLTSS-PSS----TNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEE-AAE

Query:  IRN-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
         RN +AAPMAVGC GCLSYVLV KNNPRCPRCNSVVPL ++KKPRIDLNMSI
Subjt:  IRN-SAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G16500.1 unknown protein (e-value: 1.2e-44, % identity: 44.96)
Query:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDST-ALVTRDLLGQSSNL-------TDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSD----S
        MAA+VSSL+R+L+ +KDD        G  ST AL+TRDLLG    +         S ELDLD+QVP GWEKRLDLKSGKVY+Q+     S  +S      
Subjt:  MAAEVSSLIRVLAGYKDDDNRTALGNGQDST-ALVTRDLLGQSSNL-------TDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSD----S

Query:  KQIQMINQTESKFQDLNFPP----SPSKRTLNLF---NETSLDLKLTSS----------------PSSTNYASVCTLDKVKSALERADKELVKKRSSLWK
              NQT  +FQDLN PP     P+K  L+LF   ++TSL+LKL  S                 S +  +SVCTLDKVK ALERA+K+  K++S    
Subjt:  KQIQMINQTESKFQDLNFPP----SPSKRTLNLF---NETSLDLKLTSS----------------PSSTNYASVCTLDKVKSALERADKELVKKRSSLWK

Query:  SASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI
              Y  ++S+  A               A+ +A GCPGCLSYV V KNNP+CPRC+S VPLP +KKP+IDLN+S+
Subjt:  SASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLNMSI

AT1G79160.1 unknown protein (e-value: 5.8e-50, % identity: 50.39)
Query:  MAAEVSSLIRVLAGYKDDDNRTALGNG-QDSTALVTRDLL--GQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQT
        MAA+VSSL+R+L+GYKDD        G + S AL+TRDLL  G+      S ELDLDLQVPTG+EKRLDLKSGKVY+QR  +  S   +++ Q    NQT
Subjt:  MAAEVSSLIRVLAGYKDDDNRTALGNG-QDSTALVTRDLL--GQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQT

Query:  ESKFQDLNFPPSPSKRT--LNLFNETSLDLKL-----TSSPSSTNYASVCTLDKVKSALERADKE--LVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEE
           FQDLNFPP     +  LNLF++T+ +LKL     +S P+++N  SVCTLDKVKSALERA+++  + KKR    +S     Y    + A         
Subjt:  ESKFQDLNFPPSPSKRT--LNLFNETSLDLKL-----TSSPSSTNYASVCTLDKVKSALERADKE--LVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEE

Query:  EAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPT---IKKPRIDLNMSI
                A+P+  GCPGCLSYVLVM NNP+CPRC+++VPLPT    KKP+IDLN+SI
Subjt:  EAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPT---IKKPRIDLNMSI

AT5G06270.1 unknown protein (e-value: 8.2e-04, % identity: 32.22)
Query:  SSLWKSASSPSYSSSSS-----SAAAGKEIQEEEAAEIRNSAAP-----MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLN
        SS  +   SPS S+++S     S+    E+ ++E + +R S +P     + VGCP CL YV++ +++P+CP+C S V L  + +   + N
Subjt:  SSLWKSASSPSYSSSSS-----SAAAGKEIQEEEAAEIRNSAAP-----MAVGCPGCLSYVLVMKNNPRCPRCNSVVPLPTIKKPRIDLN

AT5G22270.1 unknown protein (e-value: 1.6e-04, % identity: 42.86)
Query:  SPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPM-AVGCPGCLSYVLV-MKNNPRCPRCNSVVPL
        SPS  SS  + ++   +  EEA      A  M  VGCP C+ Y++  ++N+PRCPRCNS V L
Subjt:  SPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPM-AVGCPGCLSYVLV-MKNNPRCPRCNSVVPL
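
The % identity values reported for the hits in this section are conventionally the number of identical aligned positions divided by the alignment length, with gap columns counted in the length. A minimal sketch under that assumption follows; the page does not state its exact formula, and `percent_identity` and the toy strings are hypothetical and not taken from this record.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of alignment columns where both rows carry the same residue.

    Both strings must be rows of the same gapped alignment ('-' marks a gap),
    so they must have equal, non-zero length.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    if not query:
        raise ValueError("alignment is empty")
    # A column counts as identical only if the residues match and are not gaps.
    matches = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return 100.0 * matches / len(query)


# Toy alignment (hypothetical): 6 identical columns out of 8.
print(round(percent_identity("MAAEV-SL", "MAADVKSL"), 2))  # 75.0
```

Conservative substitutions, shown as `+` in an alignment midline, do not count as identities in this sketch.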


Sequences
CDS sequence
ATGGCTGCCGAAGTCAGCAGTCTGATTCGGGTACTCGCCGGGTACAAGGACGATGATAATCGCACAGCCCTCGGTAATGGCCAAGATTCAACGGCTCTCGTTACTCGCGA
TTTGCTCGGTCAATCTTCCAACCTTACCGACTCTCAAGAATTAGACCTCGACTTGCAAGTTCCCACCGGCTGGGAGAAAAGACTCGACTTGAAGTCAGGAAAAGTTTACA
TACAGAGGAGTCAAACGCCGGATTCTCCTCTGAATTCAGATTCAAAACAAATCCAAATGATCAATCAAACAGAATCCAAATTCCAGGATTTGAATTTCCCTCCATCTCCT
TCAAAACGAACATTAAATCTCTTCAACGAAACCAGTTTGGATTTGAAATTGACATCGTCCCCGTCCTCCACCAATTACGCCAGCGTTTGTACTCTGGATAAGGTGAAATC
TGCTCTGGAAAGGGCCGACAAGGAGTTGGTAAAGAAACGCTCTTCCCTATGGAAATCAGCTTCATCCCCGTCGTACTCCTCATCCTCATCTTCCGCGGCGGCGGGAAAGG
AAATTCAAGAAGAAGAAGCGGCGGAAATTAGAAACTCGGCGGCGCCGATGGCGGTGGGTTGTCCCGGATGTTTATCGTATGTATTAGTAATGAAAAATAACCCTCGATGT
CCTCGTTGCAACTCTGTTGTTCCATTGCCCACCATCAAGAAACCTCGGATTGATCTGAACATGTCCATATAA
mRNA sequence
ATGGCTGCCGAAGTCAGCAGTCTGATTCGGGTACTCGCCGGGTACAAGGACGATGATAATCGCACAGCCCTCGGTAATGGCCAAGATTCAACGGCTCTCGTTACTCGCGA
TTTGCTCGGTCAATCTTCCAACCTTACCGACTCTCAAGAATTAGACCTCGACTTGCAAGTTCCCACCGGCTGGGAGAAAAGACTCGACTTGAAGTCAGGAAAAGTTTACA
TACAGAGGAGTCAAACGCCGGATTCTCCTCTGAATTCAGATTCAAAACAAATCCAAATGATCAATCAAACAGAATCCAAATTCCAGGATTTGAATTTCCCTCCATCTCCT
TCAAAACGAACATTAAATCTCTTCAACGAAACCAGTTTGGATTTGAAATTGACATCGTCCCCGTCCTCCACCAATTACGCCAGCGTTTGTACTCTGGATAAGGTGAAATC
TGCTCTGGAAAGGGCCGACAAGGAGTTGGTAAAGAAACGCTCTTCCCTATGGAAATCAGCTTCATCCCCGTCGTACTCCTCATCCTCATCTTCCGCGGCGGCGGGAAAGG
AAATTCAAGAAGAAGAAGCGGCGGAAATTAGAAACTCGGCGGCGCCGATGGCGGTGGGTTGTCCCGGATGTTTATCGTATGTATTAGTAATGAAAAATAACCCTCGATGT
CCTCGTTGCAACTCTGTTGTTCCATTGCCCACCATCAAGAAACCTCGGATTGATCTGAACATGTCCATATAA
Protein sequence
MAAEVSSLIRVLAGYKDDDNRTALGNGQDSTALVTRDLLGQSSNLTDSQELDLDLQVPTGWEKRLDLKSGKVYIQRSQTPDSPLNSDSKQIQMINQTESKFQDLNFPPSP
SKRTLNLFNETSLDLKLTSSPSSTNYASVCTLDKVKSALERADKELVKKRSSLWKSASSPSYSSSSSSAAAGKEIQEEEAAEIRNSAAPMAVGCPGCLSYVLVMKNNPRC
PRCNSVVPLPTIKKPRIDLNMSI
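
The protein sequence in this record should correspond to a translation of the CDS. A minimal Python sketch for checking that relationship follows. It assumes the standard genetic code (NCBI translation table 1) applies to this gene; `translate` is a hypothetical helper, and only short fragments from the start and end of the CDS are used for brevity.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G and the third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 15 bases of the CDS and its last 12 bases, as listed in this record.
print(translate("ATGGCTGCCGAAGTC"))  # MAAEV
print(translate("ATGTCCATATAA"))     # MSI
```

The two fragments reproduce the first five residues (MAAEV) and the last three residues (MSI) of the listed protein sequence. Passing the full CDS, with its line breaks removed, would allow the complete comparison.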