CuGenDBv2

Sgr019434 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr019434
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: AB hydrolase-1 domain-containing protein
Genome location: tig00153347:582273..586294
RNA-Seq Expression: Sgr019434
Synteny: Sgr019434
Gene Ontology terms: NA
InterPro domains: IPR029058 - Alpha/Beta hydrolase fold
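
The genome location field packs a contig name and a coordinate range into one string. As a minimal sketch, it can be split as follows; the helper name `parse_location` is hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re


def parse_location(location):
    """Split 'contig:start..end' into its parts and compute the span length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    contig = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return contig, start, end, end - start + 1


print(parse_location("tig00153347:582273..586294"))
# ('tig00153347', 582273, 586294, 4022)
```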


Homology
GenBank top hits
KAG6583891.1 hypothetical protein SDJN03_19823, partial [Cucurbita argyrosperma subsp. sororia]; e value: 9.6e-106; % identity: 64.94
Query:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW
        MA LASIHPI++LRPK  TP+ T R NA+L R  KMRVPFKLK++Q+RIFHELPSGLQMEVI+QKG AKSAE+ AANVERPPL FVHGSYHAAW+WAEHW
Subjt:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW

Query:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLY-----NITYQ----------TSNLTIFQIQIDCSQG
        LPFFSASGFDCYAISLLGQGESDAPSA VAGTLQTHASDIADFIH   SI     G    GL+      N  Y           T  + +  +    + G
Subjt:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLY-----NITYQ----------TSNLTIFQIQIDCSQG

Query:  LLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLN
        L++ +L    +    V  S  A +F   L               L+ RYQELMKESSR PLFDLR+LNASLPVP  PKSC+EVLVLGASDDFIVDAEGLN
Subjt:  LLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLN

Query:  ETGRFYSVTPICVPGVAHDMMLDCSWQK
        ETGRFYSVTPIC+ GVAHDMMLDCSWQK
Subjt:  ETGRFYSVTPICVPGVAHDMMLDCSWQK

XP_022139824.1 uncharacterized protein LOC111010645 isoform X2 [Momordica charantia]; e value: 4.3e-106; % identity: 66.57
Query:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW
        MA +AS HPIS+LRPKF TP+    ANAEL  A KMRVPFKLKEEQSRIFHELPSGLQMEVILQKG AKSAETRA NVERPPL FVHGSYHAAW+WAEHW
Subjt:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW

Query:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLL--YNITYQ--------------TSNLTIFQIQIDCSQ
        LPFFSASGFDCYAISLLGQGESDAPSA VAGTLQTHASDIADFIH+  S      G    GL+  Y I                 T  + +  +    + 
Subjt:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLL--YNITYQ--------------TSNLTIFQIQIDCSQ

Query:  GLLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGL
        GLL  +L    +    V  S  A +F   L               L+LRYQELMKESSR PLFDLR+LNASLPVP  PKSCIEVLVLGASDDFIVDAEGL
Subjt:  GLLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGL

Query:  NETGRFYSVTPICVPGVAHDMMLDCSWQK
        NETGRFYSVTPICV GVAHD+MLDCSWQ+
Subjt:  NETGRFYSVTPICVPGVAHDMMLDCSWQK

XP_022139825.1 uncharacterized protein LOC111010645 isoform X3 [Momordica charantia]; e value: 1.5e-114; % identity: 74.07
Query:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW
        MA +AS HPIS+LRPKF TP+    ANAEL  A KMRVPFKLKEEQSRIFHELPSGLQMEVILQKG AKSAETRA NVERPPL FVHGSYHAAW+WAEHW
Subjt:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW

Query:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLYNITYQTSNLTIFQIQIDCSQGLLELFLCVLYLPPAT
        LPFFSASGFDCYAISLLGQGESDAPSA VAGTLQTHASDIADFIH+  S      G    GL+  + Y  +N                           +
Subjt:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLYNITYQTSNLTIFQIQIDCSQGLLELFLCVLYLPPAT

Query:  VDSFGAISFPNPLLLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLNETGRFYSVTPICVPGVAHDMMLDCSWQK
         DS+G ISFPNPLLLLRYQELMKESSR PLFDLR+LNASLPVP  PKSCIEVLVLGASDDFIVDAEGLNETGRFYSVTPICV GVAHD+MLDCSWQ+
Subjt:  VDSFGAISFPNPLLLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLNETGRFYSVTPICVPGVAHDMMLDCSWQK

XP_023001441.1 uncharacterized protein LOC111495573 isoform X1 [Cucurbita maxima]; e value: 1.2e-105; % identity: 64.85
Query:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW
        MA LASIHPIS+LRPK  TP+ T R NA+L R  KMR PFKLK+EQ+RIFHELPSGLQMEVI+QKG AKSAE+RAANVERPPL FVHGSYHAAW+WAEHW
Subjt:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW

Query:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLY-----NITYQ------------TSNLTIFQIQIDCS
        LPFFSASGFDCYAISLLGQGESDAPSA VAGTLQTHASDIADFIH   SI     G    GL+      N  Y             T  + +  +    +
Subjt:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLY-----NITYQ------------TSNLTIFQIQIDCS

Query:  QGLLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEG
         GL++ +L    +    V  S    +F   L               L+ RYQELMKESSR PLFDLR+LNASLPVP  PKSC+EVLVLGASDDFIVDAEG
Subjt:  QGLLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEG

Query:  LNETGRFYSVTPICVPGVAHDMMLDCSWQK
        LNETGRFYSVTPIC+ GVAHDMMLDCSWQK
Subjt:  LNETGRFYSVTPICVPGVAHDMMLDCSWQK

XP_023001442.1 uncharacterized protein LOC111495573 isoform X2 [Cucurbita maxima]; e value: 7.3e-106; % identity: 65.24
Query:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW
        MA LASIHPIS+LRPK  TP+ T R NA+L R  KMR PFKLK+EQ+RIFHELPSGLQMEVI+QKG AKSAE+RAANVERPPL FVHGSYHAAW+WAEHW
Subjt:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW

Query:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLY-----NITYQ----------TSNLTIFQIQIDCSQG
        LPFFSASGFDCYAISLLGQGESDAPSA VAGTLQTHASDIADFIH   SI     G    GL+      N  Y           T  + +  +    + G
Subjt:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLY-----NITYQ----------TSNLTIFQIQIDCSQG

Query:  LLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLN
        L++ +L    +    V  S    +F   L               L+ RYQELMKESSR PLFDLR+LNASLPVP  PKSC+EVLVLGASDDFIVDAEGLN
Subjt:  LLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLN

Query:  ETGRFYSVTPICVPGVAHDMMLDCSWQK
        ETGRFYSVTPIC+ GVAHDMMLDCSWQK
Subjt:  ETGRFYSVTPICVPGVAHDMMLDCSWQK

TrEMBL top hits
A0A6J1CDD4 uncharacterized protein LOC111010645 isoform X3; e value: 7.1e-115; % identity: 74.07
Query:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW
        MA +AS HPIS+LRPKF TP+    ANAEL  A KMRVPFKLKEEQSRIFHELPSGLQMEVILQKG AKSAETRA NVERPPL FVHGSYHAAW+WAEHW
Subjt:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW

Query:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLYNITYQTSNLTIFQIQIDCSQGLLELFLCVLYLPPAT
        LPFFSASGFDCYAISLLGQGESDAPSA VAGTLQTHASDIADFIH+  S      G    GL+  + Y  +N                           +
Subjt:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLYNITYQTSNLTIFQIQIDCSQGLLELFLCVLYLPPAT

Query:  VDSFGAISFPNPLLLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLNETGRFYSVTPICVPGVAHDMMLDCSWQK
         DS+G ISFPNPLLLLRYQELMKESSR PLFDLR+LNASLPVP  PKSCIEVLVLGASDDFIVDAEGLNETGRFYSVTPICV GVAHD+MLDCSWQ+
Subjt:  VDSFGAISFPNPLLLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLNETGRFYSVTPICVPGVAHDMMLDCSWQK

A0A6J1CE22 uncharacterized protein LOC111010645 isoform X2; e value: 2.1e-106; % identity: 66.57
Query:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW
        MA +AS HPIS+LRPKF TP+    ANAEL  A KMRVPFKLKEEQSRIFHELPSGLQMEVILQKG AKSAETRA NVERPPL FVHGSYHAAW+WAEHW
Subjt:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW

Query:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLL--YNITYQ--------------TSNLTIFQIQIDCSQ
        LPFFSASGFDCYAISLLGQGESDAPSA VAGTLQTHASDIADFIH+  S      G    GL+  Y I                 T  + +  +    + 
Subjt:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLL--YNITYQ--------------TSNLTIFQIQIDCSQ

Query:  GLLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGL
        GLL  +L    +    V  S  A +F   L               L+LRYQELMKESSR PLFDLR+LNASLPVP  PKSCIEVLVLGASDDFIVDAEGL
Subjt:  GLLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGL

Query:  NETGRFYSVTPICVPGVAHDMMLDCSWQK
        NETGRFYSVTPICV GVAHD+MLDCSWQ+
Subjt:  NETGRFYSVTPICVPGVAHDMMLDCSWQK

A0A6J1EGL6 uncharacterized protein LOC111434144 isoform X2; e value: 2.3e-105; % identity: 64.63
Query:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW
        MA LASIHPI++LRPK  TP+ T R NA+L R  KMRVPFKLK++Q+RIFHELPSGLQMEVI+QKG AKSAE+ AANVERPPL FVHGSYHAAW+WAEHW
Subjt:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW

Query:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLY-----NITYQ----------TSNLTIFQIQIDCSQG
        LPFFSASGFDCYAISLLGQGESDAPSA VAGTLQTHASDIADFIH   SI     G    GL+      N  Y           T  + +  +    + G
Subjt:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLY-----NITYQ----------TSNLTIFQIQIDCSQG

Query:  LLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLN
        L++ +L    +    V  S  A +F   L               L+ RYQELMKESSR PLFDLR+LNASLPVP  PKSC+EVLVLGASDDFIVD EGLN
Subjt:  LLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLN

Query:  ETGRFYSVTPICVPGVAHDMMLDCSWQK
        ETGRFYSVTPIC+ GVAHDMMLDCSWQK
Subjt:  ETGRFYSVTPICVPGVAHDMMLDCSWQK

A0A6J1KGJ4 uncharacterized protein LOC111495573 isoform X2; e value: 3.5e-106; % identity: 65.24
Query:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW
        MA LASIHPIS+LRPK  TP+ T R NA+L R  KMR PFKLK+EQ+RIFHELPSGLQMEVI+QKG AKSAE+RAANVERPPL FVHGSYHAAW+WAEHW
Subjt:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW

Query:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLY-----NITYQ----------TSNLTIFQIQIDCSQG
        LPFFSASGFDCYAISLLGQGESDAPSA VAGTLQTHASDIADFIH   SI     G    GL+      N  Y           T  + +  +    + G
Subjt:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLY-----NITYQ----------TSNLTIFQIQIDCSQG

Query:  LLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLN
        L++ +L    +    V  S    +F   L               L+ RYQELMKESSR PLFDLR+LNASLPVP  PKSC+EVLVLGASDDFIVDAEGLN
Subjt:  LLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLN

Query:  ETGRFYSVTPICVPGVAHDMMLDCSWQK
        ETGRFYSVTPIC+ GVAHDMMLDCSWQK
Subjt:  ETGRFYSVTPICVPGVAHDMMLDCSWQK

A0A6J1KIM1 uncharacterized protein LOC111495573 isoform X1; e value: 6.0e-106; % identity: 64.85
Query:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW
        MA LASIHPIS+LRPK  TP+ T R NA+L R  KMR PFKLK+EQ+RIFHELPSGLQMEVI+QKG AKSAE+RAANVERPPL FVHGSYHAAW+WAEHW
Subjt:  MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHW

Query:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLY-----NITYQ------------TSNLTIFQIQIDCS
        LPFFSASGFDCYAISLLGQGESDAPSA VAGTLQTHASDIADFIH   SI     G    GL+      N  Y             T  + +  +    +
Subjt:  LPFFSASGFDCYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLY-----NITYQ------------TSNLTIFQIQIDCS

Query:  QGLLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEG
         GL++ +L    +    V  S    +F   L               L+ RYQELMKESSR PLFDLR+LNASLPVP  PKSC+EVLVLGASDDFIVDAEG
Subjt:  QGLLELFLCVLYLPPATVD-SFGAISFPNPL---------------LLLRYQELMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEG

Query:  LNETGRFYSVTPICVPGVAHDMMLDCSWQK
        LNETGRFYSVTPIC+ GVAHDMMLDCSWQK
Subjt:  LNETGRFYSVTPICVPGVAHDMMLDCSWQK

SwissProt top hits
No hits found

Arabidopsis top hits
AT5G38360.1 alpha/beta-Hydrolases superfamily protein; e value: 2.6e-37; % identity: 54.67
Query:  AELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHWLPFFSASGFDCYAISLLGQGESDAPSA
        A L+ + +  +P+ LK+ Q+R+ H+LPSGL+MEVI Q+   KS   R    E PPL FVHGSYHAAW WAE+WLPFFS+SGFD YA+SLLGQGESD P  
Subjt:  AELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHWLPFFSASGFDCYAISLLGQGESDAPSA

Query:  PVAGTLQTHASDIADFIHKRLS------INQCCSGIHLEGLLYNITYQTS
         VAGTLQTHASDIADFI   L       +     G+ ++  L NI  + S
Subjt:  PVAGTLQTHASDIADFIHKRLS------INQCCSGIHLEGLLYNITYQTS
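
The % identity values appear to be the number of identical alignment columns divided by the alignment length. In the middle line of each block a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. For the Arabidopsis hit above, counting the letters in the two middle lines gives 82 over 150 columns, which agrees with the reported 54.67. The sketch below shows that counting rule on a short toy block; the function name `percent_identity` is hypothetical, and the rule is inferred from the displayed alignments rather than documented on this page.

```python
def percent_identity(query_line, mid_line):
    """Percent identity for one alignment block.

    Assumptions: letters in the middle line mark identical columns,
    '+' and spaces do not count, and the alignment length is the
    length of the query line (gap characters included). The query
    line supplies the length because trailing spaces in the middle
    line are often lost in copied text.
    """
    identical = sum(1 for ch in mid_line if ch.isalpha())
    return 100.0 * identical / len(query_line)


# Toy block: 9 identical columns out of 10.
print(percent_identity("MATLASIHPI", "MA LASIHPI"))  # 90.0
```

For a hit that spans several blocks, the identical counts and the lengths would be summed across blocks before dividing.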


Sequences
CDS sequence
ATGGCTACTCTTGCCTCTATACATCCCATCTCCATCCTTCGACCCAAGTTTCGCACTCCAGAAGGCACCTTGAGGGCCAATGCCGAACTCAGTAGAGCCCAGAAAATGCG
AGTGCCATTCAAGCTGAAGGAGGAACAGAGCCGTATTTTCCACGAACTCCCCTCTGGTCTTCAAATGGAGGTGATTCTGCAGAAGGGTAGGGCGAAATCAGCCGAAACAA
GGGCTGCAAATGTGGAGCGGCCTCCTCTTTTCTTTGTTCATGGAAGCTACCACGCTGCTTGGACTTGGGCAGAGCACTGGCTGCCATTCTTTTCGGCTTCTGGGTTTGAT
TGCTATGCTATCAGCTTGTTGGGTCAGGGTGAAAGTGATGCACCATCTGCACCTGTTGCTGGTACCCTCCAGACACATGCAAGTGATATTGCTGACTTCATTCATAAAAG
GCTTAGTATAAACCAGTGTTGCTCGGGCATTCATTTGGAGGGCTTATTGTACAATATTACATATCAAACATCAAACCTGACCATTTTTCAGATACAGATAGATTGTTCCC
AAGGCTTACTGGAGCTGTTCTTGTGTGTTCTGTACCTCCCTCCGGCAACAGTGGACTCGTTTGGCGCTATTTCTTTTCCAAACCCATTGTTGCTGTTAAGATATCAAGAG
CTGATGAAAGAAAGCTCAAGGACGCCATTATTTGATCTGAGGGAGTTGAATGCATCTCTTCCAGTACCGCCCCCGCCCAAGTCTTGCATTGAAGTACTAGTGCTGGGTGC
AAGTGATGATTTCATTGTGGATGCAGAAGGGTTGAATGAAACAGGCAGGTTTTACAGTGTGACACCAATCTGTGTTCCAGGGGTTGCTCATGACATGATGTTGGATTGTT
CTTGGCAGAAAGATCTTTTCATGGAGAGCCCAGAACCAGATCCAGCTCCGGGATTACACTGTTCCGGCGACGAGGTTCTAATTTTTGTCGACGACTTGTCGTTGTCGACC
TCTCCTCCAC
mRNA sequence
ATGGCTACTCTTGCCTCTATACATCCCATCTCCATCCTTCGACCCAAGTTTCGCACTCCAGAAGGCACCTTGAGGGCCAATGCCGAACTCAGTAGAGCCCAGAAAATGCG
AGTGCCATTCAAGCTGAAGGAGGAACAGAGCCGTATTTTCCACGAACTCCCCTCTGGTCTTCAAATGGAGGTGATTCTGCAGAAGGGTAGGGCGAAATCAGCCGAAACAA
GGGCTGCAAATGTGGAGCGGCCTCCTCTTTTCTTTGTTCATGGAAGCTACCACGCTGCTTGGACTTGGGCAGAGCACTGGCTGCCATTCTTTTCGGCTTCTGGGTTTGAT
TGCTATGCTATCAGCTTGTTGGGTCAGGGTGAAAGTGATGCACCATCTGCACCTGTTGCTGGTACCCTCCAGACACATGCAAGTGATATTGCTGACTTCATTCATAAAAG
GCTTAGTATAAACCAGTGTTGCTCGGGCATTCATTTGGAGGGCTTATTGTACAATATTACATATCAAACATCAAACCTGACCATTTTTCAGATACAGATAGATTGTTCCC
AAGGCTTACTGGAGCTGTTCTTGTGTGTTCTGTACCTCCCTCCGGCAACAGTGGACTCGTTTGGCGCTATTTCTTTTCCAAACCCATTGTTGCTGTTAAGATATCAAGAG
CTGATGAAAGAAAGCTCAAGGACGCCATTATTTGATCTGAGGGAGTTGAATGCATCTCTTCCAGTACCGCCCCCGCCCAAGTCTTGCATTGAAGTACTAGTGCTGGGTGC
AAGTGATGATTTCATTGTGGATGCAGAAGGGTTGAATGAAACAGGCAGGTTTTACAGTGTGACACCAATCTGTGTTCCAGGGGTTGCTCATGACATGATGTTGGATTGTT
CTTGGCAGAAAGATCTTTTCATGGAGAGCCCAGAACCAGATCCAGCTCCGGGATTACACTGTTCCGGCGACGAGGTTCTAATTTTTGTCGACGACTTGTCGTTGTCGACC
TCTCCTCCAC
Protein sequence
MATLASIHPISILRPKFRTPEGTLRANAELSRAQKMRVPFKLKEEQSRIFHELPSGLQMEVILQKGRAKSAETRAANVERPPLFFVHGSYHAAWTWAEHWLPFFSASGFD
CYAISLLGQGESDAPSAPVAGTLQTHASDIADFIHKRLSINQCCSGIHLEGLLYNITYQTSNLTIFQIQIDCSQGLLELFLCVLYLPPATVDSFGAISFPNPLLLLRYQE
LMKESSRTPLFDLRELNASLPVPPPPKSCIEVLVLGASDDFIVDAEGLNETGRFYSVTPICVPGVAHDMMLDCSWQKDLFMESPEPDPAPGLHCSGDEVLIFVDDLSLST
SPPX
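
The protein sequence can be checked against the CDS by translating it codon by codon. The following is a minimal sketch using the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical. Writing `X` for a trailing incomplete codon is an assumption, chosen because it reproduces the final `X` of the protein above from the last CDS line (`TCTCCTCCAC`).

```python
# Standard genetic code (NCBI translation table 1), with codons ordered
# by first, second, then third base over T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a nucleotide string in frame 1.

    Codons that are incomplete or contain a non-ACGT character are
    written as 'X' (an assumption about the page's convention).
    """
    cds = cds.upper()
    residues = []
    for pos in range(0, len(cds), 3):
        codon = cds[pos:pos + 3]
        residues.append(CODON_TABLE.get(codon, "X"))
    return "".join(residues)


print(translate("ATGGCTACTCTTGCC"))  # MATLA (start of the CDS)
print(translate("TCTCCTCCAC"))       # SPPX  (last CDS line)
```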