CuGenDBv2

Sgr029566 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr029566
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Polyketide_cyc domain-containing protein
Genome location: tig00153403:2020367..2027535
RNA-Seq Expression: Sgr029566
Synteny: Sgr029566
Gene Ontology terms: NA
InterPro domains:
  IPR005031 - Coenzyme Q-binding protein COQ10, START domain
  IPR023393 - START-like domain superfamily
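
The genome location in this record follows a contig:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span could be parsed in Python as follows. `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)

contig, start, end = parse_location("tig00153403:2020367..2027535")
# Assumption: 1-based, inclusive coordinates, so length = end - start + 1.
span_length = end - start + 1
print(contig, start, end, span_length)
```

Under that assumption the gene region spans 7169 bp on contig tig00153403.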


Homology

GenBank top hits (e value, % identity, alignment)
XP_008461396.1 PREDICTED: uncharacterized protein LOC103499992 isoform X1 [Cucumis melo] (e value: 6.0e-98; % identity: 85.65)
Query:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD
        MSIA +T NS PNS+ L HRSI RRNG+LFMA+PT RSINSR  SLP+LVF+IPR S KRSRNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAY CYSD
Subjt:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD

Query:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPSLSRWSLKY AFGQDIEFSWLARNLQP  NQKIHWRSLEGL NRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKREKQTTTT
        PLLERLL+R  ++  T
Subjt:  PLLERLLKREKQTTTT

XP_022148300.1 uncharacterized protein LOC111016984 isoform X1 [Momordica charantia] (e value: 1.5e-101; % identity: 88.43)
Query:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD
        MS AAVT NSNPNS NLSH SI RRNGVL MA+PT RSINS+ PSLPELVFRIPRGSLKRS NP FPRVKFVSPVMEWQNCTAKMEVDIPASVAY CYSD
Subjt:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD

Query:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNPSLSRWSLKY AFGQDIEFSWLA+NLQP LNQKIHWRSLEGL NRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKREKQTTTT
        PLLERLLKR  ++  T
Subjt:  PLLERLLKREKQTTTT

XP_022960550.1 uncharacterized protein LOC111461253 [Cucurbita moschata] (e value: 1.1e-99; % identity: 86.57)
Query:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD
        MS+AA+T NSNPN++NLSHRSI R NGVLFMA+PT RSINSR  SLP+L+FRIPR S K  RNPI PRVKFVSPVMEWQNCTAKMEVDIPASVAY CYSD
Subjt:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD

Query:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKY AFGQDIEFSWLARNLQP LNQKIHWRSLEGL NRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKREKQTTTT
        PLLERLLKR  ++  T
Subjt:  PLLERLLKREKQTTTT

XP_023004647.1 uncharacterized protein LOC111497878 [Cucurbita maxima] (e value: 5.5e-99; % identity: 86.11)
Query:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD
        MS+AA+T NSNPN++NLSHRSI R NGVLFMA+PT RSIN R  SLP+L+FRIPR S K  RNPI PRVKFVSPVMEWQNCTAKMEVDIPASVAY CYSD
Subjt:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD

Query:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKY AFGQDIEFSWLARNLQP LNQKIHWRSLEGL NRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKREKQTTTT
        PLLERLLKR  ++  T
Subjt:  PLLERLLKREKQTTTT

XP_023514604.1 uncharacterized protein LOC111778852 [Cucurbita pepo subsp. pepo] (e value: 4.9e-100; % identity: 86.57)
Query:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD
        MS+AA+T NSNPN++NLSHRSI R NGVLFMA+PT R+INSR  SLP+L+FRIPRGS K  RNPI PRVKFVSPVMEWQNCTAKMEVDIPASVAY CYSD
Subjt:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD

Query:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKY AFGQDIEFSWLARNLQP LNQKIHWRSLEGL NRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKREKQTTTT
        PLLERLLKR  ++  T
Subjt:  PLLERLLKREKQTTTT

TrEMBL top hits (e value, % identity, alignment)
A0A0A0K6F1 Polyketide_cyc domain-containing protein (e value: 1.5e-94; % identity: 83.33)
Query:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD
        MSIA +T NS+PNS+ L HRSI RRNG++FMA+PT RSINSR  SLP+LVF+IPR S KRSRNP+    KFVSPVMEWQNCTAKMEVDIPASVAY CYSD
Subjt:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD

Query:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKY AFGQDIEFSWLARNLQP  NQKIHWRSLEGL NRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKREKQTTTT
        PLLERLL+R  ++  T
Subjt:  PLLERLLKREKQTTTT

A0A1S3CEM7 uncharacterized protein LOC103499992 isoform X1 (e value: 2.9e-98; % identity: 85.65)
Query:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD
        MSIA +T NS PNS+ L HRSI RRNG+LFMA+PT RSINSR  SLP+LVF+IPR S KRSRNPI PR+KFVSPVMEWQNCTAKMEVDIPASVAY CYSD
Subjt:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD

Query:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNPSLSRWSLKY AFGQDIEFSWLARNLQP  NQKIHWRSLEGL NRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKREKQTTTT
        PLLERLL+R  ++  T
Subjt:  PLLERLLKREKQTTTT

A0A6J1D2J2 uncharacterized protein LOC111016984 isoform X1 (e value: 7.4e-102; % identity: 88.43)
Query:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD
        MS AAVT NSNPNS NLSH SI RRNGVL MA+PT RSINS+ PSLPELVFRIPRGSLKRS NP FPRVKFVSPVMEWQNCTAKMEVDIPASVAY CYSD
Subjt:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD

Query:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKV+EDNPSLSRWSLKY AFGQDIEFSWLA+NLQP LNQKIHWRSLEGL NRGVVRFYPKG SSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKREKQTTTT
        PLLERLLKR  ++  T
Subjt:  PLLERLLKREKQTTTT

A0A6J1H9D8 uncharacterized protein LOC111461253 (e value: 5.3e-100; % identity: 86.57)
Query:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD
        MS+AA+T NSNPN++NLSHRSI R NGVLFMA+PT RSINSR  SLP+L+FRIPR S K  RNPI PRVKFVSPVMEWQNCTAKMEVDIPASVAY CYSD
Subjt:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD

Query:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKY AFGQDIEFSWLARNLQP LNQKIHWRSLEGL NRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKREKQTTTT
        PLLERLLKR  ++  T
Subjt:  PLLERLLKREKQTTTT

A0A6J1KR05 uncharacterized protein LOC111497878 (e value: 2.6e-99; % identity: 86.11)
Query:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD
        MS+AA+T NSNPN++NLSHRSI R NGVLFMA+PT RSIN R  SLP+L+FRIPR S K  RNPI PRVKFVSPVMEWQNCTAKMEVDIPASVAY CYSD
Subjt:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSD

Query:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
        REAIPKWMPFISSVKVLEDNP+LSRWSLKY AFGQDIEFSWLARNLQP LNQKIHWRSLEGL NRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ
Subjt:  REAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQ

Query:  PLLERLLKREKQTTTT
        PLLERLLKR  ++  T
Subjt:  PLLERLLKREKQTTTT

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G02470.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein (e value: 1.5e-46; % identity: 60)
Query:  PVMEWQNCTAKMEVDIPASVAYNCYSDREAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPK
        PVM+WQ+ T KM VD PASVAY  Y+DRE  PKWMPF+SSV+ +E +P LSR+ +K  +FGQ+IE+ +LA+NLQPI ++KIHWRS+EG  NRG VRF+P+
Subjt:  PVMEWQNCTAKMEVDIPASVAYNCYSDREAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPK

Query:  GPSSCLVELTVSYEVPPLLSPVASALQPLLERLLK
        GPSSCLVE++ SYEVP   +PVA A++P +E++++
Subjt:  GPSSCLVELTVSYEVPPLLSPVASALQPLLERLLK

AT1G02470.2 Polyketide cyclase/dehydrase and lipid transport superfamily protein (e value: 3.7e-45; % identity: 59.56)
Query:  PVMEWQNCT-AKMEVDIPASVAYNCYSDREAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYP
        PVM+WQ+ T  KM VD PASVAY  Y+DRE  PKWMPF+SSV+ +E +P LSR+ +K  +FGQ+IE+ +LA+NLQPI ++KIHWRS+EG  NRG VRF+P
Subjt:  PVMEWQNCT-AKMEVDIPASVAYNCYSDREAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYP

Query:  KGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLK
        +GPSSCLVE++ SYEVP   +PVA A++P +E++++
Subjt:  KGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLK

AT1G02475.1 Polyketide cyclase/dehydrase and lipid transport superfamily protein (e value: 5.3e-60; % identity: 56.19)
Query:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELV--FRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCY
        MS+ A+  N+  N ++ + ++ + ++ + F   P   S+   +P  P+ +  F     S+ R    I P+ +  S  MEWQ+C+ KMEVD+P SVAYN Y
Subjt:  MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELV--FRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCY

Query:  SDREAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASA
         DRE+ PKWMPFISSV+VL+D P LSRWSLKY AFGQDI++SWLARNLQP  NQKIHWRSLEGL N+G VRF+PKGPSSC+VELTVSYEVP LL+PVAS 
Subjt:  SDREAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASA

Query:  LQPLLERLLK
        L+P +E LL+
Subjt:  LQPLLERLLK

AT4G01883.1 Polyketide cyclase / dehydrase and lipid transport protein (e value: 2.6e-51; % identity: 48.47)
Query:  MSIAAVTYNSNPNSV--NLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFR---IPRGSLKRS-RNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVA
        MS  A+   +NP  +     +R+   R+     A P+ R   S    +  L       P  S  RS ++ +F R      +MEWQ C  KM+V++P SVA
Subjt:  MSIAAVTYNSNPNSV--NLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFR---IPRGSLKRS-RNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVA

Query:  YNCYSDREAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSP
        Y  YS+RE+IPKWM FISSVKVL+D P LSRW+LKY AFGQ++E++WLA+NLQP+ NQKIHW SLEGL N+G VRF+P GPSSC VELT +YEVP LL P
Subjt:  YNCYSDREAIPKWMPFISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSP

Query:  VASALQPLLERLLKREKQTTTTVRLSNES
         A+ALQPL++ L+K   +    +  S ++
Subjt:  VASALQPLLERLLKREKQTTTTVRLSNES


Sequences

CDS sequence
ATGTCTATTGCAGCAGTCACATATAATTCGAACCCCAACTCTGTGAATCTCAGTCATCGATCGATCAGTAGGAGAAATGGCGTCCTGTTCATGGCTGTTCCCACTTGCAG
AAGCATCAATTCAAGGTTACCGTCTCTACCAGAGCTTGTGTTCAGAATCCCTCGCGGGTCTTTGAAGCGAAGCAGAAACCCCATTTTCCCTCGAGTTAAATTCGTCTCCC
CTGTGATGGAATGGCAGAATTGCACGGCTAAGATGGAAGTCGACATACCTGCCTCGGTCGCCTATAACTGCTACTCAGATCGTGAAGCCATCCCCAAATGGATGCCCTTT
ATTTCGTCTGTGAAGGTATTGGAAGATAATCCTTCTTTATCACGGTGGTCATTAAAATATACGGCTTTTGGTCAAGATATTGAGTTCTCTTGGCTTGCTCGAAACTTGCA
GCCGATCCTAAATCAAAAAATCCATTGGCGGTCTCTCGAAGGTCTTCGAAACAGAGGTGTTGTACGATTTTATCCAAAAGGCCCCTCATCTTGCCTTGTAGAACTGACAG
TCTCTTACGAAGTTCCACCTCTTTTATCTCCAGTGGCATCTGCACTGCAACCTCTGCTTGAAAGATTACTTAAACGAGAGAAGCAGACGACGACGACAGTGAGATTATCA
AATGAGTCTAATCGCAAAGCCTGCAGAACGAGGTCTCGAGCACATCGCTCAGGGTCATCATGTCGCTTAAGTCCCTGTCGAACTACATTAACTGCTTGCTGGCTTGACAT
CACGTCCCAGATCCCGTCGCAGGCCATGATCAAGAACTCGTCCTCCTCTGTTAAAACCACCTGCCTGCATTCTGGTTCTGCAATGAGAGTGCCACTCTACAAGGTGGCTG
GAGATGATGATTTGAACACAGTGGAAAGGAGGAACAAGACAGCAAAATCAGAAGGGAGGAAAAAGATACCTTCCAAGTATAAGAGCAGTCAGTGCCGTCGTCCCAGAAGA
ACTACTGACATCAGAATCATCTGCCAGAGCTTGATCAGCCAGAAGAAATGCTTTTCGGAGAAAATTTTCGATCTCCTCCGGCAAGACTTCATCAACATCTGGCATTTGAG
GAAAACTAACATCCTCAAAAAACAGTCTAAGAACATTCTTTCTGATATAAGCTGCTGCTTCAGGTCCAATATCAGCAAAGCTGCCCGACCGAAGGCTCGGAAATTCCTCA
TCGCTCGACGAATCAATGGCAGAATCTGGAATTTTCTTCATGGGACCCCTAATGAACCGACTTCAGAACTGAAATTAGCCAAGATTGAATTCCAGATGAACAAAAAACCG
AAACGGACAAAAACCCAGAAAGAAGAACTCACCGATTGGGAGCTGGACAACTCGGCCGAGACCGATTCGGAAGCTCGAACTCGGTTGAGAGTCGGAGCAATCGCCTCATC
GATCGACTCAATTTCATCACCAAGCCGACTTTCTTTACCGAAATACGGCACCTCCAAAACCGGCAGGCTCTTCGAGCAAACAACAACCTCAGCTTCTGCTACCATTTTTG
TCCTCACGCTATCATTTACCGGAACGGAAAACACCGATAAGTTCTTCTCAATCAGATTTCCCAAAGAATCGAATCGTCTAATCAGAGTGGGCCGGCTACCGAGTCCGGTG
GTTGACGACTCCGATTTGGCCGACGTTTCGATGGTTTCGGACGCGGTTCGAATAAAATTGTGGGTCCACCGCTACGTGTCGGTAGATGAGTGGCTCCACGGCGTACAGTC
CCCTGGAGCGAGATTTATCATCGACGGCAATTTATGGAATAGTGTATTGCCAATTATTGAGCAATTAAAGTGTAAGAAATGA
mRNA sequence
ATGTCTATTGCAGCAGTCACATATAATTCGAACCCCAACTCTGTGAATCTCAGTCATCGATCGATCAGTAGGAGAAATGGCGTCCTGTTCATGGCTGTTCCCACTTGCAG
AAGCATCAATTCAAGGTTACCGTCTCTACCAGAGCTTGTGTTCAGAATCCCTCGCGGGTCTTTGAAGCGAAGCAGAAACCCCATTTTCCCTCGAGTTAAATTCGTCTCCC
CTGTGATGGAATGGCAGAATTGCACGGCTAAGATGGAAGTCGACATACCTGCCTCGGTCGCCTATAACTGCTACTCAGATCGTGAAGCCATCCCCAAATGGATGCCCTTT
ATTTCGTCTGTGAAGGTATTGGAAGATAATCCTTCTTTATCACGGTGGTCATTAAAATATACGGCTTTTGGTCAAGATATTGAGTTCTCTTGGCTTGCTCGAAACTTGCA
GCCGATCCTAAATCAAAAAATCCATTGGCGGTCTCTCGAAGGTCTTCGAAACAGAGGTGTTGTACGATTTTATCCAAAAGGCCCCTCATCTTGCCTTGTAGAACTGACAG
TCTCTTACGAAGTTCCACCTCTTTTATCTCCAGTGGCATCTGCACTGCAACCTCTGCTTGAAAGATTACTTAAACGAGAGAAGCAGACGACGACGACAGTGAGATTATCA
AATGAGTCTAATCGCAAAGCCTGCAGAACGAGGTCTCGAGCACATCGCTCAGGGTCATCATGTCGCTTAAGTCCCTGTCGAACTACATTAACTGCTTGCTGGCTTGACAT
CACGTCCCAGATCCCGTCGCAGGCCATGATCAAGAACTCGTCCTCCTCTGTTAAAACCACCTGCCTGCATTCTGGTTCTGCAATGAGAGTGCCACTCTACAAGGTGGCTG
GAGATGATGATTTGAACACAGTGGAAAGGAGGAACAAGACAGCAAAATCAGAAGGGAGGAAAAAGATACCTTCCAAGTATAAGAGCAGTCAGTGCCGTCGTCCCAGAAGA
ACTACTGACATCAGAATCATCTGCCAGAGCTTGATCAGCCAGAAGAAATGCTTTTCGGAGAAAATTTTCGATCTCCTCCGGCAAGACTTCATCAACATCTGGCATTTGAG
GAAAACTAACATCCTCAAAAAACAGTCTAAGAACATTCTTTCTGATATAAGCTGCTGCTTCAGGTCCAATATCAGCAAAGCTGCCCGACCGAAGGCTCGGAAATTCCTCA
TCGCTCGACGAATCAATGGCAGAATCTGGAATTTTCTTCATGGGACCCCTAATGAACCGACTTCAGAACTGAAATTAGCCAAGATTGAATTCCAGATGAACAAAAAACCG
AAACGGACAAAAACCCAGAAAGAAGAACTCACCGATTGGGAGCTGGACAACTCGGCCGAGACCGATTCGGAAGCTCGAACTCGGTTGAGAGTCGGAGCAATCGCCTCATC
GATCGACTCAATTTCATCACCAAGCCGACTTTCTTTACCGAAATACGGCACCTCCAAAACCGGCAGGCTCTTCGAGCAAACAACAACCTCAGCTTCTGCTACCATTTTTG
TCCTCACGCTATCATTTACCGGAACGGAAAACACCGATAAGTTCTTCTCAATCAGATTTCCCAAAGAATCGAATCGTCTAATCAGAGTGGGCCGGCTACCGAGTCCGGTG
GTTGACGACTCCGATTTGGCCGACGTTTCGATGGTTTCGGACGCGGTTCGAATAAAATTGTGGGTCCACCGCTACGTGTCGGTAGATGAGTGGCTCCACGGCGTACAGTC
CCCTGGAGCGAGATTTATCATCGACGGCAATTTATGGAATAGTGTATTGCCAATTATTGAGCAATTAAAGTGTAAGAAATGA
Protein sequence
MSIAAVTYNSNPNSVNLSHRSISRRNGVLFMAVPTCRSINSRLPSLPELVFRIPRGSLKRSRNPIFPRVKFVSPVMEWQNCTAKMEVDIPASVAYNCYSDREAIPKWMPF
ISSVKVLEDNPSLSRWSLKYTAFGQDIEFSWLARNLQPILNQKIHWRSLEGLRNRGVVRFYPKGPSSCLVELTVSYEVPPLLSPVASALQPLLERLLKREKQTTTTVRLS
NESNRKACRTRSRAHRSGSSCRLSPCRTTLTACWLDITSQIPSQAMIKNSSSSVKTTCLHSGSAMRVPLYKVAGDDDLNTVERRNKTAKSEGRKKIPSKYKSSQCRRPRR
TTDIRIICQSLISQKKCFSEKIFDLLRQDFINIWHLRKTNILKKQSKNILSDISCCFRSNISKAARPKARKFLIARRINGRIWNFLHGTPNEPTSELKLAKIEFQMNKKP
KRTKTQKEELTDWELDNSAETDSEARTRLRVGAIASSIDSISSPSRLSLPKYGTSKTGRLFEQTTTSASATIFVLTLSFTGTENTDKFFSIRFPKESNRLIRVGRLPSPV
VDDSDLADVSMVSDAVRIKLWVHRYVSVDEWLHGVQSPGARFIIDGNLWNSVLPIIEQLKCKK
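
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly. `translate` is a hypothetical helper written for illustration and is not taken from any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 12 codons of the CDS listed in this record.
prefix = translate("ATGTCTATTGCAGCAGTCACATATAATTCGAACCCC")
print(prefix)
```

The same function accepts the full multi-line CDS, since whitespace is removed before translation.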