; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg08403 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg08403
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionTransmembrane protein
Genome locationCarg_Chr02:5912597..5913566
RNA-Seq ExpressionCarg08403
SyntenyCarg08403
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605736.1 hypothetical protein SDJN03_03053, partial [Cucurbita argyrosperma subsp. sororia]1.7e-128100Show/hide
Query:  MSGGVDMSLPKEQELHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFSQQNK
        MSGGVDMSLPKEQELHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFSQQNK
Subjt:  MSGGVDMSLPKEQELHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFSQQNK

Query:  LLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAALFVGR
        LLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAALFVGR
Subjt:  LLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAALFVGR

Query:  ALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        ALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
Subjt:  ALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

XP_022958042.1 uncharacterized protein LOC111459394 [Cucurbita moschata]8.5e-12597.91Show/hide
Query:  MSGGVDMSLPKEQE-LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFSQQN
        MSGGVDMSLPKEQE LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMY YFLSRVAFPRL+GEEPTVFSQQN
Subjt:  MSGGVDMSLPKEQE-LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFSQQN

Query:  KLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAALFVG
        KLLQ YVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTL EWLREEFAKEDKKPAALFVG
Subjt:  KLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAALFVG

Query:  RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
Subjt:  RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

XP_022995262.1 uncharacterized protein LOC111490863 [Cucurbita maxima]4.4e-12194.63Show/hide
Query:  MSGGVDMSLPKEQE-LHEETCDPK---QQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFS
        M GGVD+SL KEQE LHEETCDPK   QQQHRGMKLGRKKPAVFLSFRHLNMLA+MVIFSASGLVCAEDLAFVVLSI+YMYFLSRVAFPRL+GEEPTVFS
Subjt:  MSGGVDMSLPKEQE-LHEETCDPK---QQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFS

Query:  QQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAAL
        QQNKLLQ YVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEG+AWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAAL
Subjt:  QQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAAL

Query:  FVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        FVGRALAVINMALWSFNLFGFLLPVYLPKAF+RYYSLSKSKD
Subjt:  FVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

XP_023534699.1 uncharacterized protein LOC111796187 [Cucurbita pepo subsp. pepo]3.6e-12397.08Show/hide
Query:  MSGGVDMSLPKEQE-LHEETCDPK-QQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFSQQ
        MSGGVDMSLPKE+E LHEETCDPK QQQHRGMKLGRKKPAVFLSFRHL MLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRL+GEEPTVFSQQ
Subjt:  MSGGVDMSLPKEQE-LHEETCDPK-QQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFSQQ

Query:  NKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAALFV
        NKLLQ YVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAALF+
Subjt:  NKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAALFV

Query:  GRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        GRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
Subjt:  GRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

XP_038906981.1 uncharacterized protein LOC120092829 [Benincasa hispida]7.3e-9275.2Show/hide
Query:  MSGGV-----DMSLPKEQE-LHEETCDPKQQQHRGMKL-----GRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRL-Q
        MSGGV     D+SLPKEQE LH+E  D K QQ +G+KL     GR+K A FLSFR LN LA+++IFSASG+VCAEDLAFVV S+MYMYF+SRVAFP L  
Subjt:  MSGGV-----DMSLPKEQE-LHEETCDPKQQQHRGMKL-----GRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRL-Q

Query:  GEEPTVFSQQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAK
          EPTVFS +NK+L+LYV FAAVVGLFLPIAYIL GFF++D+EGIKAASPHVFLLASQVFMEGVA  DRFSTPIRVFVPVFYNSRRIFTL EWLR+EF K
Subjt:  GEEPTVFSQQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAK

Query:  EDKKPAA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        EDK+ +     L VGRALAV NMALWSFNLFGFLLPVYLP+AFKRYYSL KSKD
Subjt:  EDKKPAA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

TrEMBL top hitse value%identityAlignment
A0A0A0KK26 Uncharacterized protein5.6e-9073.49Show/hide
Query:  MSGGV-----DMSLPKEQE-LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGE-EPT
        MSGGV     D+SLPKEQE +H+E  DPKQ    G  +GR+K A FLS R LN LA+++IFSASG+VCAEDL FV+ SIMYMYFLSRVAFPR+ G  +  
Subjt:  MSGGV-----DMSLPKEQE-LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGE-EPT

Query:  VFSQQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKP
        VF  +N++L+LYVLFAA+VGLFLP+AYIL GFF++D+EGIKAASPHVFLLASQVFMEGVA ND FSTPIRVFVPVFYNSRRIFTL EWLR EFAKEDK+ 
Subjt:  VFSQQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKP

Query:  AA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        +     L VGRALAV NMALWSFNLFGFLLPVYLP+AFKRYYSL KSKD
Subjt:  AA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

A0A1S3AUS6 uncharacterized protein LOC1034829071.5e-9073.9Show/hide
Query:  MSGGV-----DMSLPKEQE-LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGE-EPT
        MSGGV     D+SLPKEQE +H+E  DPKQ       +GR+K A FLS R LN LA+++IFSASG+VCAEDLAFVV SIMYMYF+SRVAFPR+ G  +  
Subjt:  MSGGV-----DMSLPKEQE-LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGE-EPT

Query:  VFSQQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKP
        VF  +N++L+LYVLFAA++GLFLPIAYIL GFF++D+EGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVFVPVFYNSRRIFTL EWLR+EFAKEDK+ 
Subjt:  VFSQQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKP

Query:  AA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        +     L VGRALAV NMALWSFNLFGFLLPVYLP+AFKRYYSL KSKD
Subjt:  AA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

A0A5D3C472 Uncharacterized protein1.5e-9073.9Show/hide
Query:  MSGGV-----DMSLPKEQE-LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGE-EPT
        MSGGV     D+SLPKEQE +H+E  DPKQ       +GR+K A FLS R LN LA+++IFSASG+VCAEDLAFVV SIMYMYF+SRVAFPR+ G  +  
Subjt:  MSGGV-----DMSLPKEQE-LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGE-EPT

Query:  VFSQQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKP
        VF  +N++L+LYVLFAA++GLFLPIAYIL GFF++D+EGIKAASPHVFLLASQVFMEGVA NDRFSTPIRVFVPVFYNSRRIFTL EWLR+EFAKEDK+ 
Subjt:  VFSQQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKP

Query:  AA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        +     L VGRALAV NMALWSFNLFGFLLPVYLP+AFKRYYSL KSKD
Subjt:  AA----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

A0A6J1H0U3 uncharacterized protein LOC1114593944.1e-12597.91Show/hide
Query:  MSGGVDMSLPKEQE-LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFSQQN
        MSGGVDMSLPKEQE LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMY YFLSRVAFPRL+GEEPTVFSQQN
Subjt:  MSGGVDMSLPKEQE-LHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFSQQN

Query:  KLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAALFVG
        KLLQ YVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTL EWLREEFAKEDKKPAALFVG
Subjt:  KLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAALFVG

Query:  RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
Subjt:  RALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

A0A6J1JYC0 uncharacterized protein LOC1114908632.1e-12194.63Show/hide
Query:  MSGGVDMSLPKEQE-LHEETCDPK---QQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFS
        M GGVD+SL KEQE LHEETCDPK   QQQHRGMKLGRKKPAVFLSFRHLNMLA+MVIFSASGLVCAEDLAFVVLSI+YMYFLSRVAFPRL+GEEPTVFS
Subjt:  MSGGVDMSLPKEQE-LHEETCDPK---QQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFS

Query:  QQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAAL
        QQNKLLQ YVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEG+AWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAAL
Subjt:  QQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAAL

Query:  FVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
        FVGRALAVINMALWSFNLFGFLLPVYLPKAF+RYYSLSKSKD
Subjt:  FVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27990.1 unknown protein1.9e-2933.2Show/hide
Query:  GVDMSLPKEQELHEETCDPKQQQHRGM-KLGRKKPAVFLSFR--HLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPR--LQGEEPTVFSQQ
        GV +++    ++ +     ++ +  GM  +G     +  S R   L ++A +++FSASGLV   D+ F   + +Y+  LSR+AFP   +    P VF + 
Subjt:  GVDMSLPKEQELHEETCDPKQQQHRGM-KLGRKKPAVFLSFR--HLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPR--LQGEEPTVFSQQ

Query:  NKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGV-AWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAA--
        +KL +LYV+    +GLFLP+AY+L GF + D   +++A+PH+FLL+ Q+  E V +    FS P+R  VP+ Y   RIF ++ W ++ +  +     A  
Subjt:  NKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGV-AWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAA--

Query:  -----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYY
              + GR LA+ N+  +  NL  FL+P +LP+AF++Y+
Subjt:  -----LFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYY

AT5G23920.1 unknown protein6.0e-4440.95Show/hide
Query:  KEQELHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMY-FLSRVAFPRLQGEEPTVFSQ-QNKLLQLYVLF
        +E++L +    P Q   +   + RK+  VFLSF        M++ +A GLV   ++AFV+L  +Y+Y FLSR AFPR Q E+    S  +NKL Q Y L 
Subjt:  KEQELHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMY-FLSRVAFPRLQGEEPTVFSQ-QNKLLQLYVLF

Query:  AAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKK--PAALFVGRALAVIN
         A++GL  P+ YI  G ++ D  G  AA+PH+FLL+ Q F E + ++D++S PI +  PVFYN+RRIF L++W++ EF+   +   P  L+ GR +A +N
Subjt:  AAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKK--PAALFVGRALAVIN

Query:  MALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD
          +W +NLFG LLPV+LP++ + Y+S     D
Subjt:  MALWSFNLFGFLLPVYLPKAFKRYYSLSKSKD

AT5G52420.1 unknown protein2.5e-5846.75Show/hide
Query:  MSGGV-----DMSLPKEQELHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFP--RLQGEEPT
        MSGGV     D++LPKE+E   +T         G      KPA F SFR LN+LA++++ SASGLV  +D  F +L+++Y +FLS++ FP       +  
Subjt:  MSGGV-----DMSLPKEQELHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFP--RLQGEEPT

Query:  VFSQQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKED---
        + S  NK+ ++YV  A +VGL +PI YI  G  +DD+ G+ AA+PHVFLLASQ+FMEG+A    FS P R+ VP+ YN+RR+ TLVEW+  EF++ED   
Subjt:  VFSQQNKLLQLYVLFAAVVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKED---

Query:  -KKPAALFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSK
              ++ G+ LA  N+ +WSFNLFG L+PVYLP+AFKRYY   K
Subjt:  -KKPAALFVGRALAVINMALWSFNLFGFLLPVYLPKAFKRYYSLSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCGGCGGAGTGGACATGAGCCTCCCAAAGGAGCAAGAACTCCACGAAGAAACATGCGACCCAAAGCAGCAGCAGCATCGGGGGATGAAATTGGGGCGGAAGAAGCC
GGCGGTGTTTCTGAGTTTCCGCCACCTGAATATGCTGGCGTTGATGGTGATCTTCTCTGCCAGCGGATTGGTGTGTGCAGAGGACTTGGCGTTTGTGGTGCTTTCGATTA
TGTACATGTACTTCCTTTCCAGAGTGGCGTTTCCAAGGCTGCAGGGCGAGGAGCCGACGGTGTTCAGCCAGCAGAACAAGCTGCTCCAGCTGTACGTGTTGTTCGCCGCC
GTGGTGGGGCTGTTCCTCCCCATAGCCTACATTTTAGTAGGATTCTTTCAAGATGATCAAGAGGGCATCAAAGCCGCTTCTCCCCATGTCTTTCTTCTCGCTAGCCAGGT
TTTCATGGAAGGCGTGGCATGGAACGACAGATTCTCGACGCCAATCCGCGTGTTTGTGCCAGTTTTCTACAACTCAAGGAGGATCTTCACCCTCGTTGAGTGGCTGCGGG
AAGAGTTCGCCAAAGAAGATAAGAAACCAGCGGCTCTGTTCGTCGGAAGAGCACTTGCTGTAATCAACATGGCGCTTTGGAGCTTCAATCTGTTTGGGTTCTTGTTGCCT
GTGTATCTGCCAAAAGCCTTCAAGAGATATTACTCTCTTTCGAAATCCAAAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCGGCGGAGTGGACATGAGCCTCCCAAAGGAGCAAGAACTCCACGAAGAAACATGCGACCCAAAGCAGCAGCAGCATCGGGGGATGAAATTGGGGCGGAAGAAGCC
GGCGGTGTTTCTGAGTTTCCGCCACCTGAATATGCTGGCGTTGATGGTGATCTTCTCTGCCAGCGGATTGGTGTGTGCAGAGGACTTGGCGTTTGTGGTGCTTTCGATTA
TGTACATGTACTTCCTTTCCAGAGTGGCGTTTCCAAGGCTGCAGGGCGAGGAGCCGACGGTGTTCAGCCAGCAGAACAAGCTGCTCCAGCTGTACGTGTTGTTCGCCGCC
GTGGTGGGGCTGTTCCTCCCCATAGCCTACATTTTAGTAGGATTCTTTCAAGATGATCAAGAGGGCATCAAAGCCGCTTCTCCCCATGTCTTTCTTCTCGCTAGCCAGGT
TTTCATGGAAGGCGTGGCATGGAACGACAGATTCTCGACGCCAATCCGCGTGTTTGTGCCAGTTTTCTACAACTCAAGGAGGATCTTCACCCTCGTTGAGTGGCTGCGGG
AAGAGTTCGCCAAAGAAGATAAGAAACCAGCGGCTCTGTTCGTCGGAAGAGCACTTGCTGTAATCAACATGGCGCTTTGGAGCTTCAATCTGTTTGGGTTCTTGTTGCCT
GTGTATCTGCCAAAAGCCTTCAAGAGATATTACTCTCTTTCGAAATCCAAAGATTGA
Protein sequenceShow/hide protein sequence
MSGGVDMSLPKEQELHEETCDPKQQQHRGMKLGRKKPAVFLSFRHLNMLALMVIFSASGLVCAEDLAFVVLSIMYMYFLSRVAFPRLQGEEPTVFSQQNKLLQLYVLFAA
VVGLFLPIAYILVGFFQDDQEGIKAASPHVFLLASQVFMEGVAWNDRFSTPIRVFVPVFYNSRRIFTLVEWLREEFAKEDKKPAALFVGRALAVINMALWSFNLFGFLLP
VYLPKAFKRYYSLSKSKD