; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G4811 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G4811
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionUnknown protein
Genome locationctg1227:4028120..4030007
RNA-Seq ExpressionCucsat.G4811
SyntenyCucsat.G4811
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK06673.1 uncharacterized protein E5676_scaffold453G001600 [Cucumis melo var. makuwa]1.59e-12593.33Show/hide
Query:  MSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPP-LTRTKSLDLNPE
        MSPRISFSHDFS TEPIPVEQRPNSRSKSSGF SSFDFDFCIPECSDHESSSADEIFS GKILPLEIKKKPEDQRLEHSSLNHHSPP LTRTKSLDLNPE
Subjt:  MSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPP-LTRTKSLDLNPE

Query:  KCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHRNGLKNSQQCSSSSS
        KCLKK+PS KEIK TGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGS+SNN+KR+PLSKDGVNQKQSS RNGLKN QQCSSSS 
Subjt:  KCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHRNGLKNSQQCSSSSS

Query:  TGFQKPPLNK
        TGFQKPPL K
Subjt:  TGFQKPPLNK

XP_004138637.1 uncharacterized protein LOC101215741 [Cucumis sativus]6.35e-152100Show/hide
Query:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHS
        MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHS
Subjt:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHS

Query:  PPLTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHR
        PPLTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHR
Subjt:  PPLTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHR

Query:  NGLKNSQQCSSSSSTGFQKPPLNK
        NGLKNSQQCSSSSSTGFQKPPLNK
Subjt:  NGLKNSQQCSSSSSTGFQKPPLNK

XP_008441254.1 PREDICTED: uncharacterized protein LOC103485440 [Cucumis melo]2.83e-13492.89Show/hide
Query:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHS
        MAIE AVS DIPVPAMSPRISFSHDFS TEPIPVEQRPNSRSKSSGF SSFDFDFCIPECSDHESSSADEIFS GKILPLEIKKKPEDQRLEHSSLNHHS
Subjt:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHS

Query:  PP-LTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSH
        PP LTRTKSLDLNPEKCLKK+PS KEIK TGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGS+SNN+KR+PLSKDGVNQKQSS 
Subjt:  PP-LTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSH

Query:  RNGLKNSQQCSSSSSTGFQKPPLNK
        RNGLKN QQCSSSS TGFQKPPL K
Subjt:  RNGLKNSQQCSSSSSTGFQKPPLNK

XP_022953388.1 uncharacterized protein LOC111455949 isoform X2 [Cucurbita moschata]1.01e-9776.75Show/hide
Query:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQ--RLEHSSLNH
        MAIE AVS DIPV A+SPRISFSHDF H E IPVEQRPNSRS SS F SSFDFDFCI ECS  ESSSADEIFS GKILPLEIKKK E+   RL+ SS ++
Subjt:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQ--RLEHSSLNH

Query:  HSPPLTRTKSLDLNPEKCLKKNPSLKEIK-GTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQS
        HSPPLTR KSLD N EKCLKK+ S KEIK    SDSEEKQ+  SN KSFW FKRSSSCGSGYTRSLCPLPLLSRSNSTGSA N +KR+ LSKDGV QKQS
Subjt:  HSPPLTRTKSLDLNPEKCLKKNPSLKEIK-GTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQS

Query:  SHRNGLKNSQQCSSSSSTGFQKPPLNKK
        SHRN  KNSQ CSSS   G+QKPPL KK
Subjt:  SHRNGLKNSQQCSSSSSTGFQKPPLNKK

XP_038884074.1 uncharacterized protein LOC120075010 [Benincasa hispida]3.94e-10980.87Show/hide
Query:  MAIETAVSSDIPVPA-MSPRISFSHDFSHTEPIPVEQRPNSRSKSS-GFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNH
        MAI+ AVS DI V A MSPRISFSHDFS TEPIPVEQRP+SRSKSS GF SSFDFDFCIPECSDHESSSADEIFS GKILPL+IKKKP D+ L+HSS NH
Subjt:  MAIETAVSSDIPVPA-MSPRISFSHDFSHTEPIPVEQRPNSRSKSS-GFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNH

Query:  HSPPLTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQ----NTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQ
            LTRTKSLDLN +KCLKK+PSLKEIK T +DSEEKQ    N+NSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASN +KR+PLSKDGV+Q
Subjt:  HSPPLTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQ----NTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQ

Query:  KQSSHRNGLKNSQQCSSSSSTGFQKPPLNK
        KQSSHRN  KNS QCSSS   G+QKPPL K
Subjt:  KQSSHRNGLKNSQQCSSSSSTGFQKPPLNK

TrEMBL top hitse value%identityAlignment
A0A0A0LMU8 Uncharacterized protein3.07e-152100Show/hide
Query:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHS
        MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHS
Subjt:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHS

Query:  PPLTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHR
        PPLTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHR
Subjt:  PPLTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHR

Query:  NGLKNSQQCSSSSSTGFQKPPLNK
        NGLKNSQQCSSSSSTGFQKPPLNK
Subjt:  NGLKNSQQCSSSSSTGFQKPPLNK

A0A1S3B315 uncharacterized protein LOC1034854401.37e-13492.89Show/hide
Query:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHS
        MAIE AVS DIPVPAMSPRISFSHDFS TEPIPVEQRPNSRSKSSGF SSFDFDFCIPECSDHESSSADEIFS GKILPLEIKKKPEDQRLEHSSLNHHS
Subjt:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHS

Query:  PP-LTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSH
        PP LTRTKSLDLNPEKCLKK+PS KEIK TGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGS+SNN+KR+PLSKDGVNQKQSS 
Subjt:  PP-LTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSH

Query:  RNGLKNSQQCSSSSSTGFQKPPLNK
        RNGLKN QQCSSSS TGFQKPPL K
Subjt:  RNGLKNSQQCSSSSSTGFQKPPLNK

A0A5D3C420 Uncharacterized protein7.70e-12693.33Show/hide
Query:  MSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPP-LTRTKSLDLNPE
        MSPRISFSHDFS TEPIPVEQRPNSRSKSSGF SSFDFDFCIPECSDHESSSADEIFS GKILPLEIKKKPEDQRLEHSSLNHHSPP LTRTKSLDLNPE
Subjt:  MSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPP-LTRTKSLDLNPE

Query:  KCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHRNGLKNSQQCSSSSS
        KCLKK+PS KEIK TGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGS+SNN+KR+PLSKDGVNQKQSS RNGLKN QQCSSSS 
Subjt:  KCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHRNGLKNSQQCSSSSS

Query:  TGFQKPPLNK
        TGFQKPPL K
Subjt:  TGFQKPPLNK

A0A6J1GN78 uncharacterized protein LOC111455949 isoform X24.90e-9876.75Show/hide
Query:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQ--RLEHSSLNH
        MAIE AVS DIPV A+SPRISFSHDF H E IPVEQRPNSRS SS F SSFDFDFCI ECS  ESSSADEIFS GKILPLEIKKK E+   RL+ SS ++
Subjt:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQ--RLEHSSLNH

Query:  HSPPLTRTKSLDLNPEKCLKKNPSLKEIK-GTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQS
        HSPPLTR KSLD N EKCLKK+ S KEIK    SDSEEKQ+  SN KSFW FKRSSSCGSGYTRSLCPLPLLSRSNSTGSA N +KR+ LSKDGV QKQS
Subjt:  HSPPLTRTKSLDLNPEKCLKKNPSLKEIK-GTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQS

Query:  SHRNGLKNSQQCSSSSSTGFQKPPLNKK
        SHRN  KNSQ CSSS   G+QKPPL KK
Subjt:  SHRNGLKNSQQCSSSSSTGFQKPPLNKK

E5GB42 Uncharacterized protein1.37e-13492.89Show/hide
Query:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHS
        MAIE AVS DIPVPAMSPRISFSHDFS TEPIPVEQRPNSRSKSSGF SSFDFDFCIPECSDHESSSADEIFS GKILPLEIKKKPEDQRLEHSSLNHHS
Subjt:  MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHS

Query:  PP-LTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSH
        PP LTRTKSLDLNPEKCLKK+PS KEIK TGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGS+SNN+KR+PLSKDGVNQKQSS 
Subjt:  PP-LTRTKSLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSH

Query:  RNGLKNSQQCSSSSSTGFQKPPLNK
        RNGLKN QQCSSSS TGFQKPPL K
Subjt:  RNGLKNSQQCSSSSSTGFQKPPLNK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G48780.1 unknown protein6.8e-1234.68Show/hide
Query:  RISFSHDFSHTE--PIPVEQRPN--SRSKSSGFGSSFDFDFCIPECSD-HESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPPLTRTKSLDLNP
        RISFS D   ++  P PV +      R ++    S+ DF+F I    D  +SS ADEIF+ G ILP  +        +      +  PP+T + S     
Subjt:  RISFSHDFSHTE--PIPVEQRPN--SRSKSSGFGSSFDFDFCIPECSD-HESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPPLTRTKSLDLNP

Query:  EKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSL-CPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHRNGLKNSQQCSSS
         + L    S KE  G  S +        +SKSFW FKRSSS      +SL C  P L+RSNSTGS +N+ KR+ L    VN  + S R+   N+ Q    
Subjt:  EKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSL-CPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHRNGLKNSQQCSSS

Query:  SSTGFQKPPLNKKSPTQVLPGP
          TG +       S   VL GP
Subjt:  SSTGFQKPPLNKKSPTQVLPGP

AT1G67050.1 unknown protein2.0e-3548.4Show/hide
Query:  MSPRISFSHDFSHTEPIPVEQRP--NSRSKSSGFGSSFDFDFCIP------ECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPPLTRTK
        MSPRISFS DF  ++ IP+E+RP  +S SK S   SS DFDFCIP      E  D  S SADE+FS GKILP EIKKKPE  + E       S P +R +
Subjt:  MSPRISFSHDFSHTEPIPVEQRP--NSRSKSSGFGSSFDFDFCIP------ECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPPLTRTK

Query:  SLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSS--CGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHRNGLKN
            N E         ++       +EEK NT    KSFW FKRSSS  CGS Y RSLCPLPLL+RSNSTGS S+  K+S   K   + K     +   +
Subjt:  SLDLNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSS--CGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHRNGLKN

Query:  SQQCSSSSSTGFQKPPLNK
        S   SS S+ GF KPPL K
Subjt:  SQQCSSSSSTGFQKPPLNK

AT3G18300.1 unknown protein8.3e-1033.5Show/hide
Query:  RISFSHDFSHTEP-IPVEQRPNS---RSKSSGFGSSFDFDFCIPECSD-HESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPPLTRTKSLDLN-
        R SF+ D   ++   P+EQ+P+    R  +    S+ DF+F I    D  +SS ADEIF+ G ILP+   +      +      +  PP+    +L    
Subjt:  RISFSHDFSHTEP-IPVEQRPNS---RSKSSGFGSSFDFDFCIPECSD-HESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPPLTRTKSLDLN-

Query:  -------PEKCLKKNPSLKEIKGT----GSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSL-CPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHR
               PE   K   S+KE +G+    GS +        +SKSFW FKRSSS      +SL C  P L+RSNSTGS +  + +  + +D +N K SS R
Subjt:  -------PEKCLKKNPSLKEIKGT----GSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSL-CPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHR

Query:  NGL
        +G+
Subjt:  NGL

AT5G38320.1 unknown protein8.3e-1037.5Show/hide
Query:  SPRISFSHDFSHTEPIPVEQRPN-SRSKSSGFGSSFDFDFCIPE--CSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPPLTRTKSL---D
        SPRISFS+DF H E IP+EQR + S    S F   F  +F IP    S   S SA+E F+ GKILP+E+KK PE   +  S  + +   L R + +   D
Subjt:  SPRISFSHDFSHTEPIPVEQRPN-SRSKSSGFGSSFDFDFCIPE--CSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPPLTRTKSL---D

Query:  LNP--------EKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSF
          P        ++  +    L     TGS+S + Q ++S+S SF
Subjt:  LNP--------EKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSF

AT5G38320.2 unknown protein1.3e-1052.7Show/hide
Query:  SPRISFSHDFSHTEPIPVEQRPN-SRSKSSGFGSSFDFDFCIPE--CSDHESSSADEIFSQGKILPLEIKKKPE
        SPRISFS+DF H E IP+EQR + S    S F   F  +F IP    S   S SA+E F+ GKILP+E+KK PE
Subjt:  SPRISFSHDFSHTEPIPVEQRPN-SRSKSSGFGSSFDFDFCIPE--CSDHESSSADEIFSQGKILPLEIKKKPE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATCGAAACCGCCGTTTCTTCTGACATTCCCGTTCCCGCAATGAGCCCTAGAATTTCATTCTCTCACGATTTCTCCCACACCGAACCTATCCCCGTCGAACAACG
CCCTAATTCCCGATCCAAATCCTCCGGTTTCGGTTCCAGCTTCGATTTCGATTTCTGCATTCCTGAATGTTCCGATCACGAATCCTCCTCCGCCGATGAAATTTTCTCTC
AAGGAAAAATCCTTCCACTCGAAATCAAGAAGAAACCCGAAGACCAGCGACTCGAACACTCTTCTTTAAATCATCATTCTCCTCCATTGACGCGGACGAAATCGCTCGAT
CTCAATCCGGAGAAATGCTTGAAGAAAAATCCATCGTTGAAGGAAATCAAGGGTACGGGGAGTGATTCTGAAGAGAAACAAAATACTAATTCTAATTCCAAATCCTTTTG
GCGTTTCAAAAGAAGCAGTAGTTGTGGGTCTGGTTATACTCGTAGCTTATGTCCTTTACCGCTTCTCTCACGAAGCAATTCCACTGGCTCTGCTTCCAATAATATGAAGC
GATCGCCATTGTCCAAAGACGGTGTAAATCAAAAGCAAAGCTCTCATAGAAATGGGTTGAAAAATTCGCAGCAGTGTTCGTCTTCTTCATCAACGGGATTTCAGAAACCT
CCATTGAACAAGAAAAGCCCAACTCAAGTCTTACCAGGCCCAAGACTAAGCTGTAAATCAAGGCCCAAACTCCTACCGGAAACATCTTCATCTCTTACTCCTGCCTGGGA
AAACGCGGGCCGTGACGTAGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCATCGAAACCGCCGTTTCTTCTGACATTCCCGTTCCCGCAATGAGCCCTAGAATTTCATTCTCTCACGATTTCTCCCACACCGAACCTATCCCCGTCGAACAACG
CCCTAATTCCCGATCCAAATCCTCCGGTTTCGGTTCCAGCTTCGATTTCGATTTCTGCATTCCTGAATGTTCCGATCACGAATCCTCCTCCGCCGATGAAATTTTCTCTC
AAGGAAAAATCCTTCCACTCGAAATCAAGAAGAAACCCGAAGACCAGCGACTCGAACACTCTTCTTTAAATCATCATTCTCCTCCATTGACGCGGACGAAATCGCTCGAT
CTCAATCCGGAGAAATGCTTGAAGAAAAATCCATCGTTGAAGGAAATCAAGGGTACGGGGAGTGATTCTGAAGAGAAACAAAATACTAATTCTAATTCCAAATCCTTTTG
GCGTTTCAAAAGAAGCAGTAGTTGTGGGTCTGGTTATACTCGTAGCTTATGTCCTTTACCGCTTCTCTCACGAAGCAATTCCACTGGCTCTGCTTCCAATAATATGAAGC
GATCGCCATTGTCCAAAGACGGTGTAAATCAAAAGCAAAGCTCTCATAGAAATGGGTTGAAAAATTCGCAGCAGTGTTCGTCTTCTTCATCAACGGGATTTCAGAAACCT
CCATTGAACAAGAAAAGCCCAACTCAAGTCTTACCAGGCCCAAGACTAAGCTGTAAATCAAGGCCCAAACTCCTACCGGAAACATCTTCATCTCTTACTCCTGCCTGGGA
AAACGCGGGCCGTGACGTAGCGTAA
Protein sequenceShow/hide protein sequence
MAIETAVSSDIPVPAMSPRISFSHDFSHTEPIPVEQRPNSRSKSSGFGSSFDFDFCIPECSDHESSSADEIFSQGKILPLEIKKKPEDQRLEHSSLNHHSPPLTRTKSLD
LNPEKCLKKNPSLKEIKGTGSDSEEKQNTNSNSKSFWRFKRSSSCGSGYTRSLCPLPLLSRSNSTGSASNNMKRSPLSKDGVNQKQSSHRNGLKNSQQCSSSSSTGFQKP
PLNKKSPTQVLPGPRLSCKSRPKLLPETSSSLTPAWENAGRDVA