; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh14G014910 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh14G014910
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionUnknown protein
Genome locationCmo_Chr14:11993404..11994120
RNA-Seq ExpressionCmoCh14G014910
SyntenyCmoCh14G014910
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7018455.1 hypothetical protein SDJN02_20323, partial [Cucurbita argyrosperma subsp. argyrosperma]3.5e-12699.58Show/hide
Query:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC
        MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASI GGNSNVSTSYHKTPVFSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES

Query:  GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

XP_022955886.1 uncharacterized protein LOC111457737 [Cucurbita moschata]4.1e-127100Show/hide
Query:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC
        MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES

Query:  GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

XP_022979917.1 uncharacterized protein LOC111479467 [Cucurbita maxima]6.1e-12396.64Show/hide
Query:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC
        MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVST YHKTP+FSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRI RQATDKSLPVVQRP PV  ECVVVP+SPERQSA ASVLEETKGSPVKEVGES
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES

Query:  GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

XP_023528171.1 uncharacterized protein LOC111791162 [Cucurbita pepo subsp. pepo]8.8e-12295.44Show/hide
Query:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC
        MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALV+VLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASI+GGNSNVST YHKTP+FSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRI RQATDKSLPVVQRP P+ DECVVVPLSPERQSAAASVLEETKGSPVKEVGES
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES

Query:  GPGKEVDGDHQKGLATKVLPDVLGFFNSR---LWSLWSPNL
        GPGKEVDGDHQKGLATKVLPDVLGFFNSR   LWSLWSPNL
Subjt:  GPGKEVDGDHQKGLATKVLPDVLGFFNSR---LWSLWSPNL

XP_038904588.1 uncharacterized protein LOC120090946 [Benincasa hispida]3.4e-9778.78Show/hide
Query:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNV-----STSYHKTPVFSC
        MEIKHKGK+HPSPSSSPPSSSSSVFKLLP AILALVS+LSLD+REVLAYMIARSIQSSA TST  SRKKS +KASINGGN NV     +T+YHKTP+FSC
Subjt:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNV-----STSYHKTPVFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVK
        DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKPKKN G+GKRRDRI RQ T K+LPV+Q P PV DECV VPL  ER         E +GSPVK
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVK

Query:  EVGESGPGKEV--DGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        EV ESGP +EV   GDH+KGL TKVLPDVLGF NSRLWSLWSPNL
Subjt:  EVGESGPGKEV--DGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

TrEMBL top hitse value%identityAlignment
A0A0A0KTW2 Uncharacterized protein6.9e-8874.9Show/hide
Query:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTS-THDSRKKSAKKASINGGNSNV--------STSYHKTP
        MEIKHK K+HPSP   PPSSSSSVFKLLPAAILAL S+LSLD+REVLAYMIARSIQSSA TS T  SRKKS KK  IN GNSNV        +T+YHKTP
Subjt:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTS-THDSRKKSAKKASINGGNSNV--------STSYHKTP

Query:  VFSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQAT--DKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEET
        +FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKPKKNTG+GKRRDRI RQ +  +K+LPVV  PT V DECV VPLSP         + E 
Subjt:  VFSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQAT--DKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEET

Query:  KGSPVKEVGESGPGKE--VDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        +GS VKEV ESGP  E    G+HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  KGSPVKEVGESGPGKE--VDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A1S3BXS0 uncharacterized protein LOC1034945675.8e-8773.12Show/hide
Query:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSA-VTSTHDSRKKSAKKASINGGNSNV-------STSYHKTPV
        MEIKHK K+HPSP   PPSSSSSVFKLLPAAILAL S+LSLD+REVLAYMIARSIQSSA +TST  SRKKS KK SIN GNSNV       +T+YHKTP+
Subjt:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSA-VTSTHDSRKKSAKKASINGGNSNV-------STSYHKTPV

Query:  FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQAT-----DKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLE
        FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKPKKN G+GKRRDRI RQ +     +K+LPV+  PT V DECV V LSP         + 
Subjt:  FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQAT-----DKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLE

Query:  ETKGSPVKEVGESGPGKE--VDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        E +GS VKEV E+GP  E    G+HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  ETKGSPVKEVGESGPGKE--VDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A5A7TRT9 Uncharacterized protein2.6e-8773.12Show/hide
Query:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSA-VTSTHDSRKKSAKKASINGGNSNV-------STSYHKTPV
        MEIKHK K+HPSP   PPSSSSSVFKLLPAAILAL S+LSLD+REVLAYMIARSIQSSA +TST  SRKKS KK SIN GNSNV       +T+YHKTP+
Subjt:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSA-VTSTHDSRKKSAKKASINGGNSNV-------STSYHKTPV

Query:  FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQAT-----DKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLE
        FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLT+GEKPKKN G+GKRRDRI RQ +     +K+LPV+  PT V DECV V LSP         + 
Subjt:  FSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQAT-----DKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLE

Query:  ETKGSPVKEVGESGPGKE--VDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        E +GS VKEV E+GP  E    G+HQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  ETKGSPVKEVGESGPGKE--VDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A6J1GV32 uncharacterized protein LOC1114577372.0e-127100Show/hide
Query:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC
        MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES

Query:  GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

A0A6J1IQ09 uncharacterized protein LOC1114794673.0e-12396.64Show/hide
Query:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC
        MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVST YHKTP+FSCDCFYC
Subjt:  MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYC

Query:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES
        YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRI RQATDKSLPVVQRP PV  ECVVVP+SPERQSA ASVLEETKGSPVKEVGES
Subjt:  YTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGES

Query:  GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
        GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL
Subjt:  GPGKEVDGDHQKGLATKVLPDVLGFFNSRLWSLWSPNL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G12020.1 unknown protein5.3e-2443.42Show/hide
Query:  EIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYCY
        ++  KG VHPSP      S+  +  LLP AI +L +VLS ++REVLAY+I+ +  S     T    K  A K ++   +S         P+F CDCF CY
Subjt:  EIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYCY

Query:  TAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKN-TGKGKRRDRISRQAT
        T+YW RWDSSP+R+LIH+ I+AFED L   +  KKN TGK  RR R  + ++
Subjt:  TAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKN-TGKGKRRDRISRQAT

AT1G24270.1 unknown protein6.1e-2846.39Show/hide
Query:  MEIKHKGKVHPSP-----SSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSC
        M++  KGKVHPSP     SSS    S SVFKLL +AIL LVSVLS ++ EVLAY+I RS+ ++ V S    R                    HK P+  C
Subjt:  MEIKHKGKVHPSP-----SSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSC

Query:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGE-----KPKKNTGKGKRRDRISRQATDKSL
         CF CYT+YW +WDSS NRELI+Q IEAFEDHLT  E       KKN  + K+ +    Q  +KS+
Subjt:  DCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGE-----KPKKNTGKGKRRDRISRQATDKSL

AT1G62422.1 unknown protein6.5e-2242.11Show/hide
Query:  KGKVHPSPSSSPPS--SSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYCYTA
        KG VHPSP   PP+  +      LLP AIL+LV+ LS+++REVLAY+I+ S  S+ +     SR K  K+ +            H +P+F CDCF CYT+
Subjt:  KGKVHPSPSSSPPS--SSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYCYTA

Query:  YWCRWDSSPNRELIHQAIEAFEDHLTSGEKPK-KNTGKGKRRDRISRQATDK
        YW RWD+SP R+LIH+ I+A+ED L   +K K +    GK   R++   T +
Subjt:  YWCRWDSSPNRELIHQAIEAFEDHLTSGEKPK-KNTGKGKRRDRISRQATDK

AT5G13090.1 unknown protein1.8e-4042.39Show/hide
Query:  MEIKHKGKVHPSPSSSPPSSSS---------------SVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVST
        M++K KGKV+PSP   P SSSS               SV KLLPA IL LVSVLS +EREVLAY+I R    S       S+ K+ KK       SN S+
Subjt:  MEIKHKGKVHPSPSSSPPSSSS---------------SVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVST

Query:  SYHKTPVFSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGK-GKRRDRISRQATDKSLPVVQRPT--------PVVD---ECVVVP
          HK PVF C+CF CYT YW RWDSSPNRELIH+ IEAFE+H        ++  K GK++++  R+ TD       R T        PVV+   E  V  
Subjt:  SYHKTPVFSCDCFYCYTAYWCRWDSSPNRELIHQAIEAFEDHLTSGEKPKKNTGK-GKRRDRISRQATDKSLPVVQRPT--------PVVD---ECVVVP

Query:  LSPERQSAAASVLEETKGSPVKEVGESGPGKEVDG------------DHQKGLATKVLPDVLGFFNSRLWSLWSPN
         S        S  E  +G P  E+     G+E +                KGLA KVLPDVLG F+S  W LW+PN
Subjt:  LSPERQSAAASVLEETKGSPVKEVGESGPGKEVDG------------DHQKGLATKVLPDVLGFFNSRLWSLWSPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATTAAGCACAAAGGTAAAGTGCACCCATCGCCGTCGTCCTCGCCGCCCTCATCTTCCTCCTCTGTCTTCAAGCTTCTTCCGGCCGCCATTTTAGCGCTAGTTTC
AGTTCTTTCTCTCGATGAGCGTGAAGTCTTGGCCTACATGATCGCTAGGTCTATTCAATCCTCTGCAGTCACCTCCACTCACGATTCCAGGAAGAAATCGGCTAAAAAAG
CTTCGATCAATGGCGGAAACAGTAATGTTAGTACTAGTTATCATAAAACTCCTGTGTTTAGTTGCGATTGCTTCTACTGCTACACTGCCTACTGGTGCCGCTGGGACTCC
TCTCCTAATCGCGAACTAATCCACCAGGCGATCGAGGCGTTTGAAGATCACTTAACCAGCGGCGAGAAGCCGAAGAAGAACACTGGAAAAGGAAAGAGGAGAGACAGAAT
CAGCCGTCAAGCTACTGACAAATCTCTGCCTGTTGTTCAACGTCCAACGCCGGTGGTCGATGAGTGCGTCGTCGTTCCGCTATCGCCGGAACGTCAATCAGCGGCGGCGT
CTGTGTTGGAGGAGACAAAAGGAAGTCCGGTGAAGGAAGTTGGAGAGAGCGGTCCGGGAAAGGAAGTGGACGGCGACCACCAGAAGGGTTTGGCGACGAAGGTACTTCCG
GACGTGTTAGGGTTTTTCAATTCTCGTTTGTGGAGTCTGTGGAGTCCGAATCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATTAAGCACAAAGGTAAAGTGCACCCATCGCCGTCGTCCTCGCCGCCCTCATCTTCCTCCTCTGTCTTCAAGCTTCTTCCGGCCGCCATTTTAGCGCTAGTTTC
AGTTCTTTCTCTCGATGAGCGTGAAGTCTTGGCCTACATGATCGCTAGGTCTATTCAATCCTCTGCAGTCACCTCCACTCACGATTCCAGGAAGAAATCGGCTAAAAAAG
CTTCGATCAATGGCGGAAACAGTAATGTTAGTACTAGTTATCATAAAACTCCTGTGTTTAGTTGCGATTGCTTCTACTGCTACACTGCCTACTGGTGCCGCTGGGACTCC
TCTCCTAATCGCGAACTAATCCACCAGGCGATCGAGGCGTTTGAAGATCACTTAACCAGCGGCGAGAAGCCGAAGAAGAACACTGGAAAAGGAAAGAGGAGAGACAGAAT
CAGCCGTCAAGCTACTGACAAATCTCTGCCTGTTGTTCAACGTCCAACGCCGGTGGTCGATGAGTGCGTCGTCGTTCCGCTATCGCCGGAACGTCAATCAGCGGCGGCGT
CTGTGTTGGAGGAGACAAAAGGAAGTCCGGTGAAGGAAGTTGGAGAGAGCGGTCCGGGAAAGGAAGTGGACGGCGACCACCAGAAGGGTTTGGCGACGAAGGTACTTCCG
GACGTGTTAGGGTTTTTCAATTCTCGTTTGTGGAGTCTGTGGAGTCCGAATCTTTAA
Protein sequenceShow/hide protein sequence
MEIKHKGKVHPSPSSSPPSSSSSVFKLLPAAILALVSVLSLDEREVLAYMIARSIQSSAVTSTHDSRKKSAKKASINGGNSNVSTSYHKTPVFSCDCFYCYTAYWCRWDS
SPNRELIHQAIEAFEDHLTSGEKPKKNTGKGKRRDRISRQATDKSLPVVQRPTPVVDECVVVPLSPERQSAAASVLEETKGSPVKEVGESGPGKEVDGDHQKGLATKVLP
DVLGFFNSRLWSLWSPNL