; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh17G008800 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh17G008800
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionUnknown protein
Genome locationCmo_Chr17:8014204..8018005
RNA-Seq ExpressionCmoCh17G008800
SyntenyCmoCh17G008800
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575639.1 hypothetical protein SDJN03_26278, partial [Cucurbita argyrosperma subsp. sororia]1.0e-156100Show/hide
Query:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
        MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
Subjt:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS

Query:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
        RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
Subjt:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV

Query:  GARHPELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
        GARHPELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
Subjt:  GARHPELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI

KAG7014193.1 hypothetical protein SDJN02_24367 [Cucurbita argyrosperma subsp. argyrosperma]9.7e-15293.02Show/hide
Query:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
        MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
Subjt:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS

Query:  RWN-DLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFML
        RWN DLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFML
Subjt:  RWN-DLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFML

Query:  VGARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKV
        VGARHP                    ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKV
Subjt:  VGARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKV

Query:  I
        I
Subjt:  I

XP_022954230.1 uncharacterized protein LOC111456547 isoform X1 [Cucurbita moschata]3.9e-15393.33Show/hide
Query:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
        MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
Subjt:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS

Query:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
        RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
Subjt:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV

Query:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
        GARHP                    ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
Subjt:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI

XP_022991363.1 uncharacterized protein LOC111488021 isoform X1 [Cucurbita maxima]3.3e-15292.67Show/hide
Query:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
        MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTP+S
Subjt:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS

Query:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
        RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
Subjt:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV

Query:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
        GARHP                    ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWL ELKVI
Subjt:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI

XP_023547746.1 uncharacterized protein LOC111806603 isoform X1 [Cucurbita pepo subsp. pepo]4.3e-15292.67Show/hide
Query:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
        MAAAISFTISSSSLWSRLSKPASKFRD RLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
Subjt:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS

Query:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
        RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
Subjt:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV

Query:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
        GARHP                    ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIA GKPDPCPLHTAWLAELKVI
Subjt:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI

TrEMBL top hitse value%identityAlignment
A0A1S3CDP5 uncharacterized protein LOC103499856 isoform X11.2e-14287.67Show/hide
Query:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
        MAAAISFTIS+ SL S+LSKP S+F  RRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGED LVKRGESVPLDF GPILVCTRNDDLEAVLE+TPRS
Subjt:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS

Query:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
        RWNDLVFFQNGMLDPWYESKGL D NQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVL KEAFEKQMLEKLIWISAFMLV
Subjt:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV

Query:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
        GARHP                    ELA AAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWL ELKV+
Subjt:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI

A0A5D3CGL0 Uncharacterized protein1.2e-14287.67Show/hide
Query:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
        MAAAISFTIS+ SL S+LSKP S+F  RRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGED LVKRGESVPLDF GPILVCTRNDDLEAVLE+TPRS
Subjt:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS

Query:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
        RWNDLVFFQNGMLDPWYESKGL D NQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVL KEAFEKQMLEKLIWISAFMLV
Subjt:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV

Query:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
        GARHP                    ELA AAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWL ELKV+
Subjt:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI

A0A6J1DKB2 uncharacterized protein LOC111021267 isoform X13.0e-14388Show/hide
Query:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
        MAAAISFTISS SL  +LSKP SKFR RRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDF GPILVCTRNDDLEAVLE+TPRS
Subjt:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS

Query:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
        RWNDLVFFQNGMLDPWYESKGL D NQ+LAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVA RLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
Subjt:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV

Query:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
        GARHP                    ELA AAAAERQLVFE+GIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWL ELKV+
Subjt:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI

A0A6J1GRW1 uncharacterized protein LOC111456547 isoform X11.9e-15393.33Show/hide
Query:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
        MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
Subjt:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS

Query:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
        RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
Subjt:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV

Query:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
        GARHP                    ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
Subjt:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI

A0A6J1JW05 uncharacterized protein LOC111488021 isoform X11.6e-15292.67Show/hide
Query:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS
        MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTP+S
Subjt:  MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRS

Query:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
        RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV
Subjt:  RWNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLV

Query:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
        GARHP                    ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWL ELKVI
Subjt:  GARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G16080.1 unknown protein5.3e-11670.23Show/hide
Query:  AAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSR
        +A  SF   S SL   +S+ A       +   +AAT  K+ PAVIVG GRVGRAL +MGNGED LVKRGE+VP+DF GPILVCTRNDDL+AVLE+TP+SR
Subjt:  AAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSR

Query:  WNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLVG
        W DLVFFQNGM++PW+ESKGL D +QVLAYFA+SKLGE PVDG TDTNPEGLTAAYGKWAS +A RL + GLSCKVLDKEAF+KQMLEKLIWI AFMLVG
Subjt:  WNDLVFFQNGMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLVG

Query:  ARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI
        ARHP                    ELA+AAAAE+ L FEE + ERLCAYSRAV+HFPTAVKEFKWRNGWFYSLSEKAIA G+PDPCPLHT WL ELKVI
Subjt:  ARHP--------------------ELASAAAAERQLVFEEGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGCCGCAATTTCCTTCACAATCTCTAGCTCTTCTCTCTGGTCTCGACTTTCTAAACCCGCTTCTAAATTTCGCGATCGCCGTCTGAGAATCATGGCCGCGGCTAC
CGGAACTAAGGTGGTACCCGCTGTGATAGTCGGAAGTGGGCGCGTTGGGAGGGCGCTGCTGGACATGGGAAATGGTGAAGATTTTTTGGTGAAGAGAGGCGAGTCCGTGC
CCCTTGATTTTCCGGGTCCTATCCTTGTTTGTACCAGAAATGATGATCTTGAAGCTGTTCTTGAATCCACTCCTCGATCGAGATGGAACGATTTGGTTTTTTTCCAGAAT
GGAATGCTGGACCCTTGGTATGAAAGCAAAGGTCTAAATGATGTGAATCAAGTGTTAGCATATTTCGCTATCTCAAAGCTGGGAGAGGCTCCTGTGGATGGAATAACGGA
TACCAATCCTGAAGGACTGACAGCAGCGTATGGAAAATGGGCATCTGCCGTAGCTGGAAGGCTCAATGCTGCAGGCCTCTCCTGCAAGGTTCTCGATAAGGAAGCATTTG
AGAAACAAATGTTGGAGAAGCTGATTTGGATTTCTGCATTTATGCTCGTTGGAGCACGTCATCCAGAACTTGCATCTGCAGCCGCAGCTGAAAGGCAGTTGGTGTTTGAA
GAAGGTATAGAGGAAAGATTATGTGCATATTCTCGGGCTGTGGCTCACTTCCCCACGGCAGTAAAAGAGTTCAAATGGCGAAACGGGTGGTTCTATTCTCTCTCCGAGAA
AGCCATTGCTGCTGGAAAACCCGACCCCTGCCCTCTGCATACCGCTTGGCTTGCGGAGCTGAAAGTTATCTAG
mRNA sequenceShow/hide mRNA sequence
CACAGATCGAGTAGCTTATCTTCTTCTCCGCCACTCAAAGCCGCTCTCTGAAAACCCACACTGACCCTCTCTGCTTATACAAACCACACAACAATGTCACTTCAAGCTCA
CCTGGTTCATTTCCTACTCAGCCATGGCCGCCGCAATTTCCTTCACAATCTCTAGCTCTTCTCTCTGGTCTCGACTTTCTAAACCCGCTTCTAAATTTCGCGATCGCCGT
CTGAGAATCATGGCCGCGGCTACCGGAACTAAGGTGGTACCCGCTGTGATAGTCGGAAGTGGGCGCGTTGGGAGGGCGCTGCTGGACATGGGAAATGGTGAAGATTTTTT
GGTGAAGAGAGGCGAGTCCGTGCCCCTTGATTTTCCGGGTCCTATCCTTGTTTGTACCAGAAATGATGATCTTGAAGCTGTTCTTGAATCCACTCCTCGATCGAGATGGA
ACGATTTGGTTTTTTTCCAGAATGGAATGCTGGACCCTTGGTATGAAAGCAAAGGTCTAAATGATGTGAATCAAGTGTTAGCATATTTCGCTATCTCAAAGCTGGGAGAG
GCTCCTGTGGATGGAATAACGGATACCAATCCTGAAGGACTGACAGCAGCGTATGGAAAATGGGCATCTGCCGTAGCTGGAAGGCTCAATGCTGCAGGCCTCTCCTGCAA
GGTTCTCGATAAGGAAGCATTTGAGAAACAAATGTTGGAGAAGCTGATTTGGATTTCTGCATTTATGCTCGTTGGAGCACGTCATCCAGAACTTGCATCTGCAGCCGCAG
CTGAAAGGCAGTTGGTGTTTGAAGAAGGTATAGAGGAAAGATTATGTGCATATTCTCGGGCTGTGGCTCACTTCCCCACGGCAGTAAAAGAGTTCAAATGGCGAAACGGG
TGGTTCTATTCTCTCTCCGAGAAAGCCATTGCTGCTGGAAAACCCGACCCCTGCCCTCTGCATACCGCTTGGCTTGCGGAGCTGAAAGTTATCTAGGAATGACCCTGTAT
TGCTTGCTATCAAATGGATTTCCTGTTGCACAACGGTTGGCTTTAATCCAAATGAAGAACTCTCTTTTTGTTTTAAAAAGTAGATTTTGTACCTGTGTTCGTCATCTACA
AATCACCTCCAGTTACTTGTTGAAACTGGGCCCAGCCTCTTTCAGTTGGCGATAGAGAAGGATTCATCATCTGTACCATCCTGAACTTCTGAACAGAAAAATCCATCAAT
TATGTTAAGAGAGATTTAATCTAACTTCTACAGAAGAAGACTCCTTGCGAACAGAAAAAAAAAAACATGAAATTTAACAATGCCAACATTAGAGCAATGTGGAAGGAAAT
TGCAACACTGTGGTATGGAGATTGTTCAATAGAGATATCACAAATGAAGACATTATAACATGCTTCCTCCTGTTCAAAGACATCACGATTGAAGTTCCTTTATATTCGAG
ACCAAGATTGAGAGTCCTATAGTTTTGGTTTTTTTTGGTTTACTGACTGATTTATCTTCCCTAGAAGGCAC
Protein sequenceShow/hide protein sequence
MAAAISFTISSSSLWSRLSKPASKFRDRRLRIMAAATGTKVVPAVIVGSGRVGRALLDMGNGEDFLVKRGESVPLDFPGPILVCTRNDDLEAVLESTPRSRWNDLVFFQN
GMLDPWYESKGLNDVNQVLAYFAISKLGEAPVDGITDTNPEGLTAAYGKWASAVAGRLNAAGLSCKVLDKEAFEKQMLEKLIWISAFMLVGARHPELASAAAAERQLVFE
EGIEERLCAYSRAVAHFPTAVKEFKWRNGWFYSLSEKAIAAGKPDPCPLHTAWLAELKVI