; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh11G015460 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh11G015460
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPhloem filament protein
Genome locationCmo_Chr11:11068916..11070342
RNA-Seq ExpressionCmoCh11G015460
SyntenyCmoCh11G015460
Gene Ontology termsGO:0004869 - cysteine-type endopeptidase inhibitor activity (molecular function)
InterPro domainsIPR009994 - Phloem filament PP1
IPR027214 - Cystatin


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588836.1 hypothetical protein SDJN03_17401, partial [Cucurbita argyrosperma subsp. sororia]1.6e-11760.55Show/hide
Query:  PPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIVAPE
        PP V KWI+I NL+   +Q V KF L+   +K GDSLK+D IY                                   +ERRDRLRILKLVSIIIIVAPE
Subjt:  PPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIVAPE

Query:  PPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLVPG
        PPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGE LKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGP VPG
Subjt:  PPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLVPG

Query:  EWIPIPNLKEPGFQVF------------------------------------------------------------------------------------
        EWIPIPNLKEPGFQV                                                                                     
Subjt:  EWIPIPNLKEPGFQVF------------------------------------------------------------------------------------

Query:  -------------SFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC
                     SFVQEVSKFALDDFNVKSGDSLKYDG+YDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC
Subjt:  -------------SFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC

XP_022927812.1 uncharacterized protein LOC111434592 [Cucurbita moschata]4.9e-10347.09Show/hide
Query:  PVDPPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIV
        P+ PP   +WI+I NL+   +Q V KF L+   +K GDSLK+D IY                                   +ERRDRLRILKLVSIIIIV
Subjt:  PVDPPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIV

Query:  APEPPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLV-----------------------------------------------------------
        APEPPEKKWIKIPVLQAPLVQELAKFAVDEYSS+GEGLKLV                                                           
Subjt:  APEPPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLV-----------------------------------------------------------

Query:  --------------------------------------------------------EIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFS
                                                                EIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFS
Subjt:  --------------------------------------------------------EIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFS

Query:  KRIKILESFKLLGPLVPGEWIPIPNLKEPGFQV-------------------------------------------------------------------
        KRIKILESFKLLGPLVPGEWIPIPNLKEPGFQV                                                                   
Subjt:  KRIKILESFKLLGPLVPGEWIPIPNLKEPGFQV-------------------------------------------------------------------

Query:  ------------------------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLG
                                      FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLG
Subjt:  ------------------------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLG

Query:  QRIKIVESFKLIERKC
        QRIKIVESFKLIERKC
Subjt:  QRIKIVESFKLIERKC

XP_022927869.1 uncharacterized protein LOC111434636 isoform X1 [Cucurbita moschata]5.8e-12061.56Show/hide
Query:  PPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIVAPE
        PP V KWI+I NL+   +Q V KF L+   +K GDSLK+D IY                                   +ERRDRLRILKLVSIIIIVAPE
Subjt:  PPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIVAPE

Query:  PPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLVPG
        PPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLVPG
Subjt:  PPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLVPG

Query:  EWIPIPNLKEPGFQV-------------------------------------------------------------------------------------
        EWIPIPNLKEPGFQV                                                                                     
Subjt:  EWIPIPNLKEPGFQV-------------------------------------------------------------------------------------

Query:  ------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC
                    FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC
Subjt:  ------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC

XP_022927870.1 uncharacterized protein LOC111434636 isoform X2 [Cucurbita moschata]6.4e-10346.91Show/hide
Query:  VGPVDPPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIII
        +GP  P    +WI+I NL+   +Q V KF L+   +K GDSLK+D IY                                   +ERRDRLRILKLVSIII
Subjt:  VGPVDPPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIII

Query:  IVAPEPPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLV---------------------------------------------------------
        IVAPEPPEKKWIKIPVLQAPLVQELAKFAVDEYSS+GEGLKLV                                                         
Subjt:  IVAPEPPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLV---------------------------------------------------------

Query:  ----------------------------------------------------------EIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHF
                                                                  EIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHF
Subjt:  ----------------------------------------------------------EIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHF

Query:  FSKRIKILESFKLLGPLVPGEWIPIPNLKEPGFQV-----------------------------------------------------------------
        FSKRIKILESFKLLGPLVPGEWIPIPNLKEPGFQV                                                                 
Subjt:  FSKRIKILESFKLLGPLVPGEWIPIPNLKEPGFQV-----------------------------------------------------------------

Query:  --------------------------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNF
                                        FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNF
Subjt:  --------------------------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNF

Query:  LGQRIKIVESFKLIERKC
        LGQRIKIVESFKLIERKC
Subjt:  LGQRIKIVESFKLIERKC

XP_023531643.1 uncharacterized protein LOC111793824 [Cucurbita pepo subsp. pepo]1.6e-10957.75Show/hide
Query:  PPKVEK--WIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIVA
        PP   K  WI+I NL+   +Q V KF L+   +K GDSLK+D IY                                   +ERRDRLRILKLVSII+IV+
Subjt:  PPKVEK--WIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIVA

Query:  PEPPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLV
        PEPPEKKWIKIP+LQAPLVQELAKFAVDEYSS   GLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLL P V
Subjt:  PEPPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLV

Query:  PGEWIPIPNLKEPGFQV-----------------------------------------------------------------------------------
         G+WIPIPNLKE G Q                                                                                    
Subjt:  PGEWIPIPNLKEPGFQV-----------------------------------------------------------------------------------

Query:  --------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC
                      FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC
Subjt:  --------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC

TrEMBL top hitse value%identityAlignment
A0A6J1EI90 uncharacterized protein LOC1114345922.4e-10347.09Show/hide
Query:  PVDPPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIV
        P+ PP   +WI+I NL+   +Q V KF L+   +K GDSLK+D IY                                   +ERRDRLRILKLVSIIIIV
Subjt:  PVDPPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIV

Query:  APEPPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLV-----------------------------------------------------------
        APEPPEKKWIKIPVLQAPLVQELAKFAVDEYSS+GEGLKLV                                                           
Subjt:  APEPPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLV-----------------------------------------------------------

Query:  --------------------------------------------------------EIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFS
                                                                EIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFS
Subjt:  --------------------------------------------------------EIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFS

Query:  KRIKILESFKLLGPLVPGEWIPIPNLKEPGFQV-------------------------------------------------------------------
        KRIKILESFKLLGPLVPGEWIPIPNLKEPGFQV                                                                   
Subjt:  KRIKILESFKLLGPLVPGEWIPIPNLKEPGFQV-------------------------------------------------------------------

Query:  ------------------------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLG
                                      FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLG
Subjt:  ------------------------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLG

Query:  QRIKIVESFKLIERKC
        QRIKIVESFKLIERKC
Subjt:  QRIKIVESFKLIERKC

A0A6J1EJ75 uncharacterized protein LOC111434636 isoform X12.8e-12061.56Show/hide
Query:  PPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIVAPE
        PP V KWI+I NL+   +Q V KF L+   +K GDSLK+D IY                                   +ERRDRLRILKLVSIIIIVAPE
Subjt:  PPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIVAPE

Query:  PPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLVPG
        PPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLVPG
Subjt:  PPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLVPG

Query:  EWIPIPNLKEPGFQV-------------------------------------------------------------------------------------
        EWIPIPNLKEPGFQV                                                                                     
Subjt:  EWIPIPNLKEPGFQV-------------------------------------------------------------------------------------

Query:  ------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC
                    FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC
Subjt:  ------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC

A0A6J1EQ39 uncharacterized protein LOC111434636 isoform X23.1e-10346.91Show/hide
Query:  VGPVDPPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIII
        +GP  P    +WI+I NL+   +Q V KF L+   +K GDSLK+D IY                                   +ERRDRLRILKLVSIII
Subjt:  VGPVDPPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIII

Query:  IVAPEPPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLV---------------------------------------------------------
        IVAPEPPEKKWIKIPVLQAPLVQELAKFAVDEYSS+GEGLKLV                                                         
Subjt:  IVAPEPPEKKWIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLV---------------------------------------------------------

Query:  ----------------------------------------------------------EIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHF
                                                                  EIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHF
Subjt:  ----------------------------------------------------------EIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHF

Query:  FSKRIKILESFKLLGPLVPGEWIPIPNLKEPGFQV-----------------------------------------------------------------
        FSKRIKILESFKLLGPLVPGEWIPIPNLKEPGFQV                                                                 
Subjt:  FSKRIKILESFKLLGPLVPGEWIPIPNLKEPGFQV-----------------------------------------------------------------

Query:  --------------------------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNF
                                        FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNF
Subjt:  --------------------------------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNF

Query:  LGQRIKIVESFKLIERKC
        LGQRIKIVESFKLIERKC
Subjt:  LGQRIKIVESFKLIERKC

A0A6J1JG70 uncharacterized protein LOC111486626 isoform X21.8e-9551.27Show/hide
Query:  KWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIVAPEPPEKK
        KWI+I NL+   +Q V KF ++   +K GDSLK++ +Y                                   +E RD LRI KL SII IV+PEP EKK
Subjt:  KWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIVAPEPPEKK

Query:  WIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLV-PGEWIP
        WIKIP +Q PLVQELAKFAVDE++  G+GLK +E+Y+GWFMDLG DNIKFRLHLKAKDWLGRIRNYEAVVLVEHF+SKRIKILESFK  GPLV   +WI 
Subjt:  WIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLV-PGEWIP

Query:  IPNLKEPGFQV-----------------------------------------------------------------------------------------
        IPNLKEPGFQV                                                                                         
Subjt:  IPNLKEPGFQV-----------------------------------------------------------------------------------------

Query:  --------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC
                F+F+QEVSKFALDDFNVKSGDSL++DGIYDGWYMEMGQDNIKFRIHL+AKDCLSRVHHYEA+V+VK FL +RIKIVESFKLIERKC
Subjt:  --------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC

A0A6J1JKF7 uncharacterized protein LOC111486626 isoform X11.8e-9551.27Show/hide
Query:  KWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIVAPEPPEKK
        KWI+I NL+   +Q V KF ++   +K GDSLK++ +Y                                   +E RD LRI KL SII IV+PEP EKK
Subjt:  KWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIY-----------------------------------DERRDRLRILKLVSIIIIVAPEPPEKK

Query:  WIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLV-PGEWIP
        WIKIP +Q PLVQELAKFAVDE++  G+GLK +E+Y+GWFMDLG DNIKFRLHLKAKDWLGRIRNYEAVVLVEHF+SKRIKILESFK  GPLV   +WI 
Subjt:  WIKIPVLQAPLVQELAKFAVDEYSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLV-PGEWIP

Query:  IPNLKEPGFQV-----------------------------------------------------------------------------------------
        IPNLKEPGFQV                                                                                         
Subjt:  IPNLKEPGFQV-----------------------------------------------------------------------------------------

Query:  --------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC
                F+F+QEVSKFALDDFNVKSGDSL++DGIYDGWYMEMGQDNIKFRIHL+AKDCLSRVHHYEA+V+VK FL +RIKIVESFKLIERKC
Subjt:  --------FSFVQEVSKFALDDFNVKSGDSLKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G16500.1 Cystatin/monellin superfamily protein1.5e-0437.04Show/hide
Query:  PLVQELAKFAVDEYSSKG-EGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLL
        P V  +AK+A++E++ +  E L  V++ +G    +     K+ L + AKD  G+I+NYEAVV+ + +     K LESFK L
Subjt:  PLVQELAKFAVDEYSSKG-EGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATCTATCATCGTCATATCGTCTCCTGGACACTGCGTTGGGCCTGTGGATCCACCCAAGGTCGAGAAGTGGATTAAAATCTGTAATCTTCAGTTTTCGTTCGTGCA
AGAGGTATCAAAGTTTGCGTTGGATGACTTCAACGTTAAATCTGGAGATAGCCTCAAATACGATGGCATTTACGATGAGAGACGTGACCGTCTAAGAATCTTGAAGCTGG
TCTCTATCATTATCATAGTAGCTCCTGAACCTCCTGAGAAGAAGTGGATAAAAATTCCTGTTCTTCAGGCGCCATTAGTGCAAGAGTTAGCAAAGTTTGCCGTGGATGAA
TATAGTTCAAAAGGAGAAGGCCTAAAATTAGTTGAAATCTATGACGGCTGGTTTATGGACCTGGGTCAAGATAACATAAAGTTTCGTCTTCATCTTAAGGCGAAAGATTG
GTTGGGACGCATCCGCAACTATGAGGCTGTTGTGCTTGTTGAGCACTTTTTCTCCAAGAGAATCAAGATTCTCGAATCTTTCAAGCTTCTCGGTCCACTGGTTCCAGGCG
AGTGGATTCCAATACCTAATCTCAAGGAGCCTGGCTTTCAAGTGTTTTCGTTCGTGCAAGAGGTATCAAAGTTTGCGTTGGATGACTTCAACGTTAAATCTGGAGATAGC
CTCAAATACGATGGCATTTACGATGGTTGGTATATGGAGATGGGTCAAGACAACATAAAGTTTCGTATTCATTTAAAGGCAAAAGACTGTCTCAGTCGTGTGCACCACTA
TGAAGCTCATGTGTTTGTAAAGAACTTTCTCGGTCAAAGAATTAAGATCGTCGAATCTTTCAAGCTTATCGAAAGGAAGTGTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAATCTATCATCGTCATATCGTCTCCTGGACACTGCGTTGGGCCTGTGGATCCACCCAAGGTCGAGAAGTGGATTAAAATCTGTAATCTTCAGTTTTCGTTCGTGCA
AGAGGTATCAAAGTTTGCGTTGGATGACTTCAACGTTAAATCTGGAGATAGCCTCAAATACGATGGCATTTACGATGAGAGACGTGACCGTCTAAGAATCTTGAAGCTGG
TCTCTATCATTATCATAGTAGCTCCTGAACCTCCTGAGAAGAAGTGGATAAAAATTCCTGTTCTTCAGGCGCCATTAGTGCAAGAGTTAGCAAAGTTTGCCGTGGATGAA
TATAGTTCAAAAGGAGAAGGCCTAAAATTAGTTGAAATCTATGACGGCTGGTTTATGGACCTGGGTCAAGATAACATAAAGTTTCGTCTTCATCTTAAGGCGAAAGATTG
GTTGGGACGCATCCGCAACTATGAGGCTGTTGTGCTTGTTGAGCACTTTTTCTCCAAGAGAATCAAGATTCTCGAATCTTTCAAGCTTCTCGGTCCACTGGTTCCAGGCG
AGTGGATTCCAATACCTAATCTCAAGGAGCCTGGCTTTCAAGTGTTTTCGTTCGTGCAAGAGGTATCAAAGTTTGCGTTGGATGACTTCAACGTTAAATCTGGAGATAGC
CTCAAATACGATGGCATTTACGATGGTTGGTATATGGAGATGGGTCAAGACAACATAAAGTTTCGTATTCATTTAAAGGCAAAAGACTGTCTCAGTCGTGTGCACCACTA
TGAAGCTCATGTGTTTGTAAAGAACTTTCTCGGTCAAAGAATTAAGATCGTCGAATCTTTCAAGCTTATCGAAAGGAAGTGTTAG
Protein sequenceShow/hide protein sequence
MESIIVISSPGHCVGPVDPPKVEKWIKICNLQFSFVQEVSKFALDDFNVKSGDSLKYDGIYDERRDRLRILKLVSIIIIVAPEPPEKKWIKIPVLQAPLVQELAKFAVDE
YSSKGEGLKLVEIYDGWFMDLGQDNIKFRLHLKAKDWLGRIRNYEAVVLVEHFFSKRIKILESFKLLGPLVPGEWIPIPNLKEPGFQVFSFVQEVSKFALDDFNVKSGDS
LKYDGIYDGWYMEMGQDNIKFRIHLKAKDCLSRVHHYEAHVFVKNFLGQRIKIVESFKLIERKC