; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh02G010920 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh02G010920
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionUnknown protein
Genome locationCmo_Chr02:6625079..6626354
RNA-Seq ExpressionCmoCh02G010920
SyntenyCmoCh02G010920
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605736.1 hypothetical protein SDJN03_03053, partial [Cucurbita argyrosperma subsp. sororia]3.9e-12597.91Show/hide
Query:  MSGGVDMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFSQQN
        MSGGVDMSLPKEQE LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMY YFLSRVAFPRL+GEEPTVFSQQN
Subjt:  MSGGVDMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFSQQN

Query:  KLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAALFVG
        KLLQ YVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTL EWLREEFAKEDKKPAALFVG
Subjt:  KLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAALFVG

Query:  RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
Subjt:  RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

XP_022958042.1 uncharacterized protein LOC111459394 [Cucurbita moschata]1.2e-129100Show/hide
Query:  MSGGVDMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFSQQN
        MSGGVDMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFSQQN
Subjt:  MSGGVDMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFSQQN

Query:  KLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAALFVG
        KLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAALFVG
Subjt:  KLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAALFVG

Query:  RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
Subjt:  RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

XP_022995262.1 uncharacterized protein LOC111490863 [Cucurbita maxima]3.6e-12395.04Show/hide
Query:  MSGGVDMSLPKEQELLHEETCDPK---QQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFS
        M GGVD+SL KEQELLHEETCDPK   QQQHRGMKLGRKKPAVFLSFRHLNMLA+MVIFSASGLVCAEDLAFVVLSI+Y YFLSRVAFPRLEGEEPTVFS
Subjt:  MSGGVDMSLPKEQELLHEETCDPK---QQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFS

Query:  QQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAAL
        QQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEG+AWNDRFSTPIRVFVPVFYNSRRIFTL EWLREEFAKEDKKPAAL
Subjt:  QQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAAL

Query:  FVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        FVGRALAVINMALWSFNLFGFLLPVYLPKAF+RYYSLSKSKD
Subjt:  FVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

XP_023534699.1 uncharacterized protein LOC111796187 [Cucurbita pepo subsp. pepo]3.0e-12597.5Show/hide
Query:  MSGGVDMSLPKEQELLHEETCDPK-QQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFSQQ
        MSGGVDMSLPKE+ELLHEETCDPK QQQHRGMKLGRKKPAVFLSFRHL MLALMVIFSASGLVCAEDLAFVVLSIMY YFLSRVAFPRLEGEEPTVFSQQ
Subjt:  MSGGVDMSLPKEQELLHEETCDPK-QQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFSQQ

Query:  NKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAALFV
        NKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTL EWLREEFAKEDKKPAALF+
Subjt:  NKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAALFV

Query:  GRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        GRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
Subjt:  GRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

XP_038906981.1 uncharacterized protein LOC120092829 [Benincasa hispida]4.3e-9274.41Show/hide
Query:  MSGGV-----DMSLPKEQELLHEETCDPKQQQHRGMKL-----GRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRL-E
        MSGGV     D+SLPKEQE LH+E  D K QQ +G+KL     GR+K A FLSFR LN LA+++IFSASG+VCAEDLAFVV S+MY YF+SRVAFP L +
Subjt:  MSGGV-----DMSLPKEQELLHEETCDPKQQQHRGMKL-----GRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRL-E

Query:  GEEPTVFSQQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAK
          EPTVFS +NK+L+ YV FAAVVGLFLPIAYIL GFF++D+EGIKAASPHVFLLASQVFMEGVA  DRFSTPIRVFVPVFYNSRRIFTL EWLR+EF K
Subjt:  GEEPTVFSQQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAK

Query:  EDKKPAA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        EDK+ +     L VGRALAV NMALWSFNLFGFLLPVYLP+AFKRYYSL KSKD
Subjt:  EDKKPAA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

TrEMBL top hitse value%identityAlignment
A0A0A0KK26 Uncharacterized protein7.4e-9072.69Show/hide
Query:  MSGGV-----DMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGE-EPT
        MSGGV     D+SLPKEQE +H+E  DPKQ    G  +GR+K A FLS R LN LA+++IFSASG+VCAEDL FV+ SIMY YFLSRVAFPR+ G  +  
Subjt:  MSGGV-----DMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGE-EPT

Query:  VFSQQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKP
        VF  +N++L+ YVLFAA+VGLFLP+AYIL GFF++D+EGIKAASPHVFLLASQVFMEGVA ND FSTPIRVFVPVFYNSRRIFTL EWLR EFAKEDK+ 
Subjt:  VFSQQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKP

Query:  AA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        +     L VGRALAV NMALWSFNLFGFLLPVYLP+AFKRYYSL KSKD
Subjt:  AA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

A0A1S3AUS6 uncharacterized protein LOC1034829071.9e-9073.09Show/hide
Query:  MSGGV-----DMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGE-EPT
        MSGGV     D+SLPKEQE +H+E  DPKQ       +GR+K A FLS R LN LA+++IFSASG+VCAEDLAFVV SIMY YF+SRVAFPR+ G  +  
Subjt:  MSGGV-----DMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGE-EPT

Query:  VFSQQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKP
        VF  +N++L+ YVLFAA++GLFLPIAYIL GFF++D+EGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVFVPVFYNSRRIFTL EWLR+EFAKEDK+ 
Subjt:  VFSQQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKP

Query:  AA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        +     L VGRALAV NMALWSFNLFGFLLPVYLP+AFKRYYSL KSKD
Subjt:  AA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

A0A5D3C472 Uncharacterized protein1.9e-9073.09Show/hide
Query:  MSGGV-----DMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGE-EPT
        MSGGV     D+SLPKEQE +H+E  DPKQ       +GR+K A FLS R LN LA+++IFSASG+VCAEDLAFVV SIMY YF+SRVAFPR+ G  +  
Subjt:  MSGGV-----DMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGE-EPT

Query:  VFSQQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKP
        VF  +N++L+ YVLFAA++GLFLPIAYIL GFF++D+EGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVFVPVFYNSRRIFTL EWLR+EFAKEDK+ 
Subjt:  VFSQQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKP

Query:  AA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        +     L VGRALAV NMALWSFNLFGFLLPVYLP+AFKRYYSL KSKD
Subjt:  AA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

A0A6J1H0U3 uncharacterized protein LOC1114593945.6e-130100Show/hide
Query:  MSGGVDMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFSQQN
        MSGGVDMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFSQQN
Subjt:  MSGGVDMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFSQQN

Query:  KLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAALFVG
        KLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAALFVG
Subjt:  KLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAALFVG

Query:  RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
Subjt:  RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

A0A6J1JYC0 uncharacterized protein LOC1114908631.7e-12395.04Show/hide
Query:  MSGGVDMSLPKEQELLHEETCDPK---QQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFS
        M GGVD+SL KEQELLHEETCDPK   QQQHRGMKLGRKKPAVFLSFRHLNMLA+MVIFSASGLVCAEDLAFVVLSI+Y YFLSRVAFPRLEGEEPTVFS
Subjt:  MSGGVDMSLPKEQELLHEETCDPK---QQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFS

Query:  QQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAAL
        QQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEG+AWNDRFSTPIRVFVPVFYNSRRIFTL EWLREEFAKEDKKPAAL
Subjt:  QQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAAL

Query:  FVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        FVGRALAVINMALWSFNLFGFLLPVYLPKAF+RYYSLSKSKD
Subjt:  FVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27990.1 unknown protein7.2e-2936.73Show/hide
Query:  LNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPR--LEGEEPTVFSQQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLL
        L ++A +++FSASGLV   D+ F   + +Y   LSR+AFP   +    P VF + +KL + YV+    +GLFLP+AY+L GF + D   +++A+PH+FLL
Subjt:  LNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPR--LEGEEPTVFSQQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLL

Query:  ASQVFMEGV-AWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAA-------LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYY
        + Q+  E V +    FS P+R  VP+ Y   RIF +  W ++ +  +     A        + GR LA+ N+  +  NL  FL+P +LP+AF++Y+
Subjt:  ASQVFMEGV-AWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAA-------LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYY

AT5G23920.1 unknown protein6.1e-4440.77Show/hide
Query:  KEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTY-FLSRVAFPRLEGEEPTVFSQ-QNKLLQQYVL
        +E++L   +   P+    RG+ + RK+  VFLSF        M++ +A GLV   ++AFV+L  +Y Y FLSR AFPR + E+    S  +NKL Q Y L
Subjt:  KEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTY-FLSRVAFPRLEGEEPTVFSQ-QNKLLQQYVL

Query:  FAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKK--PAALFVGRALAVI
          A++GL  P+ YI  G ++ D  G  AA+PH+FLL+ Q F E + ++D++S PI +  PVFYN+RRIF L +W++ EF+   +   P  L+ GR +A +
Subjt:  FAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKK--PAALFVGRALAVI

Query:  NMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        N  +W +NLFG LLPV+LP++ + Y+S     D
Subjt:  NMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

AT5G52420.1 unknown protein3.7e-5746.56Show/hide
Query:  MSGGV-----DMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFP--RLEGEEP
        MSGGV     D++LPKE+E  H       Q Q         KPA F SFR LN+LA++++ SASGLV  +D  F +L+++Y +FLS++ FP       + 
Subjt:  MSGGV-----DMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFP--RLEGEEP

Query:  TVFSQQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKED--
         + S  NK+ + YV  A +VGL +PI YI  G  +DD+ G+ AA+PHVFLLASQ+FMEG+A    FS P R+ VP+ YN+RR+ TL EW+  EF++ED  
Subjt:  TVFSQQNKLLQQYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKED--

Query:  --KKPAALFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSK
               ++ G+ LA  N+ +WSFNLFG L+PVYLP+AFKRYY   K
Subjt:  --KKPAALFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCGGCGGAGTGGACATGAGCCTCCCAAAGGAGCAAGAACTCCTCCACGAAGAAACATGCGACCCAAAGCAGCAGCAGCATCGGGGGATGAAATTGGGGCGGAAGAA
GCCGGCGGTGTTTCTGAGTTTCCGCCACCTGAATATGCTGGCGTTGATGGTGATCTTCTCTGCCAGCGGATTGGTGTGTGCAGAGGACTTGGCGTTTGTGGTGCTTTCGA
TTATGTACACGTACTTCCTTTCCAGAGTGGCGTTTCCAAGGCTGGAGGGCGAGGAGCCGACGGTGTTCAGCCAGCAGAACAAGCTGCTCCAGCAGTACGTGTTGTTCGCC
GCCGTGGTGGGGCTGTTCCTCCCCATAGCCTACATTTTAGTAGGATTCTTTCAAGATGATCAAGAGGGCATCAAAGCCGCTTCTCCCCATGTCTTTCTTCTCGCCAGCCA
GGTTTTCATGGAAGGCGTGGCATGGAACGACAGATTCTCGACGCCAATCCGCGTGTTTGTGCCAGTTTTCTACAACTCAAGGAGGATCTTCACCCTCTTTGAGTGGCTGC
GGGAAGAGTTCGCCAAAGAAGATAAGAAACCAGCGGCTCTGTTCGTCGGAAGGGCACTTGCTGTAATCAACATGGCGCTTTGGAGCTTCAATCTGTTTGGGTTCTTGTTG
CCTGTGTATCTGCCAAAAGCCTTCAAGAGATATTACTCTCTTTCGAAATCCAAAGATTGA
mRNA sequenceShow/hide mRNA sequence
CCACCACCACGTGTTCATGGAACCAACCAGCAGCCGCCACTTGCCCACTTTCCTTGCCACACTCCTTCTCAAACTGAAACTCCAATCTCCATTACCTTTCTCCTCCTTCA
CCCATTTCACTTCTCCTCTGTAAAAGGAAGGAAAAAAAAAAAATCAGACGAAGAAAAATGTCCGGCGGAGTGGACATGAGCCTCCCAAAGGAGCAAGAACTCCTCCACGA
AGAAACATGCGACCCAAAGCAGCAGCAGCATCGGGGGATGAAATTGGGGCGGAAGAAGCCGGCGGTGTTTCTGAGTTTCCGCCACCTGAATATGCTGGCGTTGATGGTGA
TCTTCTCTGCCAGCGGATTGGTGTGTGCAGAGGACTTGGCGTTTGTGGTGCTTTCGATTATGTACACGTACTTCCTTTCCAGAGTGGCGTTTCCAAGGCTGGAGGGCGAG
GAGCCGACGGTGTTCAGCCAGCAGAACAAGCTGCTCCAGCAGTACGTGTTGTTCGCCGCCGTGGTGGGGCTGTTCCTCCCCATAGCCTACATTTTAGTAGGATTCTTTCA
AGATGATCAAGAGGGCATCAAAGCCGCTTCTCCCCATGTCTTTCTTCTCGCCAGCCAGGTTTTCATGGAAGGCGTGGCATGGAACGACAGATTCTCGACGCCAATCCGCG
TGTTTGTGCCAGTTTTCTACAACTCAAGGAGGATCTTCACCCTCTTTGAGTGGCTGCGGGAAGAGTTCGCCAAAGAAGATAAGAAACCAGCGGCTCTGTTCGTCGGAAGG
GCACTTGCTGTAATCAACATGGCGCTTTGGAGCTTCAATCTGTTTGGGTTCTTGTTGCCTGTGTATCTGCCAAAAGCCTTCAAGAGATATTACTCTCTTTCGAAATCCAA
AGATTGAGCGCTCTTCCTTTGGGATAAAACATAAAGTAATCTTACTTGGTTGCAACTACAAGGATTTGTACCTTCCAAGAAGCACTAAATTACCCACTAATTGCATGATT
CTAAGCTTATATCTAATGGTTCTATGCTTTTAC
Protein sequenceShow/hide protein sequence
MSGGVDMSLPKEQELLHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYTYFLSRVAFPRLEGEEPTVFSQQNKLLQQYVLFA
AVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLFEWLREEFAKEDKKPAALFVGRALAVINMALWSFNLFGFLL
PVYLPKAFKRYYSLSKSKD