CuGenDBv2

Sgr015567 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr015567
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: formin-E isoform X3
Genome location: tig00004835:202858..205516
RNA-Seq Expression: Sgr015567
Synteny: Sgr015567
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits
KAG6576625.1 hypothetical protein SDJN03_24199, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 3.4e-98; identity: 65.35%)
Query:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----
        MLCS+ AGKA PNWLDRLRSNKGFPI DNL LDHFL+NQNLDNPSP   P       L          +  R   ++S +PI    PS   +   +    
Subjt:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----

Query:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG
                     G +   KQS PKICS+PS+ NA+  DDKNLCC  AQKEDNILSSNSDNSSKGG N GSD   K QNAR  VEEE+VEDEKGEKEL G
Subjt:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG

Query:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC
        YSKSEVTVIDTS DVWKSDKLIFRRKNVWKVK+KKGKLRSYGRKKRKQ SE N + +M AS SKKTK+WGSEERFH NA QIRGKE+LK LNKGYN Q+C
Subjt:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC

Query:  SGPEISANAPDGGNDKMENVYTFSSENGS
        SGPEIS +APD  NDK ENVY+ S ENGS
Subjt:  SGPEISANAPDGGNDKMENVYTFSSENGS

KAG7014678.1 hypothetical protein SDJN02_22307 [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 5.8e-98; identity: 65.35%)
Query:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----
        MLCS+ AGKA PNWLDRLRSNKGFPI DNL LDHFL+NQNLDNPSP   P       L          +  R   ++S +PI    PS   +   +    
Subjt:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----

Query:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG
                     G +   KQS PKICS+PS+ NA+  DDKNLCC  AQKEDNILSSNSDNSSKGG N GSD   K QNAR  VEEE+VEDEKGEKEL G
Subjt:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG

Query:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC
        YSKSEVTVIDTS DVWKSDKLIFRRKNVWKVK+KKGKLRSYGRKKRKQ SE N + +M AS SKKTK WGSEERFH NA QIRGKE+LK LNKGYN Q+C
Subjt:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC

Query:  SGPEISANAPDGGNDKMENVYTFSSENGS
        SGPEIS +APD  NDK ENVY+ S ENGS
Subjt:  SGPEISANAPDGGNDKMENVYTFSSENGS

XP_022922604.1 uncharacterized protein LOC111430562 isoform X1 [Cucurbita moschata] (e-value: 1.2e-95; identity: 64.44%)
Query:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----
        MLCS+ AGKA PNWLDRLRSNKGFPI DNL LDHFL+NQNLDNPSP   P       L          +  R   ++S +PI    PS   +   +    
Subjt:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----

Query:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG
                     G +   KQS PKICS+PS+ NA+  DDKNLCC  AQKEDNILSSNSDNSSKGG N GSD   K QNAR  VEEE+VEDEKGEKEL G
Subjt:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG

Query:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC
        YSKSEVTVIDTS DVWKSDKLIFRRKNVWKVK+KKGKLRSYGRKKRKQ SE N + +M AS SK TK WGSEERFH NA QIRGKE+LK LNKGYN Q+C
Subjt:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC

Query:  SGPEISANAPDGGNDKMENVYTFSSENGS
        S PEIS +APD  NDK E+VY+ S ENGS
Subjt:  SGPEISANAPDGGNDKMENVYTFSSENGS

XP_022984364.1 uncharacterized protein LOC111482689 [Cucurbita maxima] (e-value: 5.5e-96; identity: 64.44%)
Query:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----
        MLCS+ AGKA PNWLDRLRSNKGFPI DNL LDHFL+NQNLDNPSP   P       L          +  R   ++S +PI    PS   +   +    
Subjt:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----

Query:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG
                     G +   KQS PKICS+PS+ NA+  DDKNLCC  AQKEDNILSSNSDNSSKGG N GSD   K QNAR  VEEE+VEDEKGEKEL G
Subjt:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG

Query:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC
        YSKSEVTVIDTS DVWKSDKLIFRRKNVWKVK+KKGKLRSYGRKKRKQ SE N + +M AS SKK K WGSEERFH NA QIRGKE+LK LNKGYN  +C
Subjt:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC

Query:  SGPEISANAPDGGNDKMENVYTFSSENGS
        SGPEIS +APD  NDK  NVY+ S ENGS
Subjt:  SGPEISANAPDGGNDKMENVYTFSSENGS

XP_023551713.1 uncharacterized protein LOC111809606 isoform X1 [Cucurbita pepo subsp. pepo] (e-value: 6.5e-97; identity: 65.05%)
Query:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----
        MLCS+ AGKA PNWLDRLRSNKGFPI DNL LDHFL+NQNLDNPSP   P       L          +  R   ++S +PI    PS   +   +    
Subjt:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----

Query:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG
                     G +   KQS PKICS+PS+ NA+  DDKNLCC  AQKEDNILSSNSDNSSKGG N GSD   K QNAR  VEEE+VEDEKGEKEL G
Subjt:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG

Query:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC
        YSKSEVTVIDTS DVWKSDKLIFRRKNVWKVK+KKGKLRSYGRKKRKQ SE N + +M AS SKKTK WGSEERFH NA QIRGKE+LK LNKGYN Q+C
Subjt:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC

Query:  SGPEISANAPDGGNDKMENVYTFSSENGS
        SGPEIS +APD  NDK ENV + S ENGS
Subjt:  SGPEISANAPDGGNDKMENVYTFSSENGS

TrEMBL top hits
A0A0A0KN29 Uncharacterized protein (e-value: 5.7e-83; identity: 60.73%)
Query:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRI-------LTPTRLRPTTSLTPILCAGPSPPALPP-------KM
        MLCS+ AGKAGPNWLDRLRSNKGFPI DNL LDHFL++QNLDNP  L        R        L       ++S +PI    PS   +          M
Subjt:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRI-------LTPTRLRPTTSLTPILCAGPSPPALPP-------KM

Query:  GTEDH----------MKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIGYSK
        G               KQS PKI SIPSV N +  D KNLCC   QKEDNILSSNSDNSSKG  + GSD     QN   KV EE+V DEK EKEL GYSK
Subjt:  GTEDH----------MKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIGYSK

Query:  SEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYCSGP
        SEVTVIDTS DVWKSDKLIFRRK+VWKVK+KK KLRSYGRKKRKQSSETND+P+ I S SKKTKVWGSEERFHLN QQI GKESLKPLNK +N Q+C GP
Subjt:  SEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYCSGP

Query:  EISANAPDGGNDKMENVYTFSSENGSCNPEK
        E    APD  N+K EN  T S +NG  +P++
Subjt:  EISANAPDGGNDKMENVYTFSSENGSCNPEK

A0A1S4DS96 uncharacterized protein LOC103482569 (e-value: 4.0e-84; identity: 60.73%)
Query:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRI-------LTPTRLRPTTSLTPILCAGPSPPALPP-------KM
        MLCS+ AGKAGPNWLDRLRSNKGFPI DNL LDHFL++QNLDNP  L        R        L       ++S +PI    PS   +          M
Subjt:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRI-------LTPTRLRPTTSLTPILCAGPSPPALPP-------KM

Query:  GTEDH----------MKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKG-GNGGSDKELKGQNARGKVEEEDVEDEKGEKELIGYSK
        G               KQS PKICSIPS+ N +  D KNLCC   QKEDNILSSNSDNSSKG  + GSD     QN   KV EE+V DEK EKEL GYSK
Subjt:  GTEDH----------MKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKG-GNGGSDKELKGQNARGKVEEEDVEDEKGEKELIGYSK

Query:  SEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYCSGP
        SEVTVIDTS DVWKSDKLIFRRK+VWKVK+KK KLRSYGRKKRKQSSE ND+P+ I S SKKTKVWGSEERFHLN QQI GKESLKPLNK +N Q+C GP
Subjt:  SEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYCSGP

Query:  EISANAPDGGNDKMENVYTFSSENGSCNPEK
        EI   APD  N+K EN  T S +NG  +P++
Subjt:  EISANAPDGGNDKMENVYTFSSENGSCNPEK

A0A5A7TPJ9 Uncharacterized protein (e-value: 4.0e-84; identity: 60.73%)
Query:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRI-------LTPTRLRPTTSLTPILCAGPSPPALPP-------KM
        MLCS+ AGKAGPNWLDRLRSNKGFPI DNL LDHFL++QNLDNP  L        R        L       ++S +PI    PS   +          M
Subjt:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRI-------LTPTRLRPTTSLTPILCAGPSPPALPP-------KM

Query:  GTEDH----------MKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKG-GNGGSDKELKGQNARGKVEEEDVEDEKGEKELIGYSK
        G               KQS PKICSIPS+ N +  D KNLCC   QKEDNILSSNSDNSSKG  + GSD     QN   KV EE+V DEK EKEL GYSK
Subjt:  GTEDH----------MKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKG-GNGGSDKELKGQNARGKVEEEDVEDEKGEKELIGYSK

Query:  SEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYCSGP
        SEVTVIDTS DVWKSDKLIFRRK+VWKVK+KK KLRSYGRKKRKQSSE ND+P+ I S SKKTKVWGSEERFHLN QQI GKESLKPLNK +N Q+C GP
Subjt:  SEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYCSGP

Query:  EISANAPDGGNDKMENVYTFSSENGSCNPEK
        EI   APD  N+K EN  T S +NG  +P++
Subjt:  EISANAPDGGNDKMENVYTFSSENGSCNPEK

A0A6J1E997 uncharacterized protein LOC111430562 isoform X1 (e-value: 5.9e-96; identity: 64.44%)
Query:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----
        MLCS+ AGKA PNWLDRLRSNKGFPI DNL LDHFL+NQNLDNPSP   P       L          +  R   ++S +PI    PS   +   +    
Subjt:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----

Query:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG
                     G +   KQS PKICS+PS+ NA+  DDKNLCC  AQKEDNILSSNSDNSSKGG N GSD   K QNAR  VEEE+VEDEKGEKEL G
Subjt:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG

Query:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC
        YSKSEVTVIDTS DVWKSDKLIFRRKNVWKVK+KKGKLRSYGRKKRKQ SE N + +M AS SK TK WGSEERFH NA QIRGKE+LK LNKGYN Q+C
Subjt:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC

Query:  SGPEISANAPDGGNDKMENVYTFSSENGS
        S PEIS +APD  NDK E+VY+ S ENGS
Subjt:  SGPEISANAPDGGNDKMENVYTFSSENGS

A0A6J1J8F5 uncharacterized protein LOC111482689 (e-value: 2.6e-96; identity: 64.44%)
Query:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----
        MLCS+ AGKA PNWLDRLRSNKGFPI DNL LDHFL+NQNLDNPSP   P       L          +  R   ++S +PI    PS   +   +    
Subjt:  MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRIL----------TPTRLRPTTSLTPILCAGPSPPALPPKM----

Query:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG
                     G +   KQS PKICS+PS+ NA+  DDKNLCC  AQKEDNILSSNSDNSSKGG N GSD   K QNAR  VEEE+VEDEKGEKEL G
Subjt:  -------------GTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGG-NGGSDKELKGQNARGKVEEEDVEDEKGEKELIG

Query:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC
        YSKSEVTVIDTS DVWKSDKLIFRRKNVWKVK+KKGKLRSYGRKKRKQ SE N + +M AS SKK K WGSEERFH NA QIRGKE+LK LNKGYN  +C
Subjt:  YSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQSSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYC

Query:  SGPEISANAPDGGNDKMENVYTFSSENGS
        SGPEIS +APD  NDK  NVY+ S ENGS
Subjt:  SGPEISANAPDGGNDKMENVYTFSSENGS

SwissProt top hits
No hits found

Arabidopsis top hits
AT5G24500.1 unknown protein (e-value: 2.1e-16; identity: 39.2%)
Query:  PSPPALPPKMGTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGGNGGSDKELKGQNARGKVEEE--DVEDEKGEKELIGYS
        P    LP K     H     P+   +P VN    +DD N  C  + +E    SS S  + K        E++ +  R  VE +  D E+EKGEK+L+G+S
Subjt:  PSPPALPPKMGTEDHMKQSKPKICSIPSVNNANCTDDKNLCCAQAQKEDNILSSNSDNSSKGGNGGSDKELKGQNARGKVEEE--DVEDEKGEKELIGYS

Query:  KSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKG------KLRSYGRKKRKQSSETNDVPE----MIASGSKKTKV
        +SEVTVIDTS  +WKS+KL+FRR+NVWKV+EKKG      KL+   +KK+K+  + +DV +    +    SKK K+
Subjt:  KSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKG------KLRSYGRKKRKQSSETNDVPE----MIASGSKKTKV


Sequences
CDS sequence
ATGCTTTGTTCGCTATCAGCCGGGAAGGCTGGTCCCAACTGGCTGGACCGACTCCGTTCAAACAAGGGTTTTCCGATCGGCGACAACCTTGGCCTTGATCACTTCCTCAG
TAACCAAAACCTCGATAATCCCTCCCCTCTTCGGTTCCCACTTCTGCTGCTTTTCCGCATTCTGACCCCGACTCGACTCCGACCAACCACCAGTCTGACCCCAATTCTCT
GCGCCGGGCCGAGTCCTCCAGCTCTCCCACCGAAAATGGGAACCGAGGATCATATGAAACAGTCCAAGCCCAAGATTTGCTCCATTCCTTCTGTTAATAATGCCAACTGT
ACGGATGACAAGAATCTATGTTGCGCCCAGGCCCAGAAAGAAGATAACATCCTCTCATCAAACTCTGATAATAGTTCGAAAGGCGGCAATGGTGGGTCGGATAAAGAACT
CAAAGGACAGAATGCGCGCGGTAAGGTTGAAGAGGAAGACGTGGAGGATGAGAAGGGCGAGAAAGAACTTATAGGATACTCGAAGAGCGAGGTCACGGTCATAGATACTA
GCTGCGACGTGTGGAAGTCCGACAAACTGATTTTCAGAAGAAAGAACGTATGGAAGGTCAAGGAGAAGAAGGGTAAGCTGAGGAGCTATGGGAGGAAGAAGCGGAAGCAG
TCGTCTGAAACTAATGACGTTCCGGAAATGATTGCTTCCGGAAGTAAGAAAACCAAAGTGTGGGGTTCAGAGGAGCGCTTCCATTTGAATGCACAACAAATCCGTGGGAA
GGAATCTCTAAAACCATTGAACAAGGGATATAATCCCCAGTATTGTTCTGGCCCAGAAATTAGTGCAAATGCACCAGATGGCGGTAACGACAAGATGGAAAATGTTTACA
CTTTCTCATCTGAAAATGGCAGCTGCAATCCTGAAAAAAGTGGCAAATCACCCAAGGATCTAGTAAGTGAAGCTAAGCAGTCTCTTGATTATAATGACGTCCCATTTTGT
AGTTGCTGCAATTGCACCGGTGGACTCTATCAGCCTGTTGTCCGCCTCGCTAATTAA
mRNA sequence
ATGCTTTGTTCGCTATCAGCCGGGAAGGCTGGTCCCAACTGGCTGGACCGACTCCGTTCAAACAAGGGTTTTCCGATCGGCGACAACCTTGGCCTTGATCACTTCCTCAG
TAACCAAAACCTCGATAATCCCTCCCCTCTTCGGTTCCCACTTCTGCTGCTTTTCCGCATTCTGACCCCGACTCGACTCCGACCAACCACCAGTCTGACCCCAATTCTCT
GCGCCGGGCCGAGTCCTCCAGCTCTCCCACCGAAAATGGGAACCGAGGATCATATGAAACAGTCCAAGCCCAAGATTTGCTCCATTCCTTCTGTTAATAATGCCAACTGT
ACGGATGACAAGAATCTATGTTGCGCCCAGGCCCAGAAAGAAGATAACATCCTCTCATCAAACTCTGATAATAGTTCGAAAGGCGGCAATGGTGGGTCGGATAAAGAACT
CAAAGGACAGAATGCGCGCGGTAAGGTTGAAGAGGAAGACGTGGAGGATGAGAAGGGCGAGAAAGAACTTATAGGATACTCGAAGAGCGAGGTCACGGTCATAGATACTA
GCTGCGACGTGTGGAAGTCCGACAAACTGATTTTCAGAAGAAAGAACGTATGGAAGGTCAAGGAGAAGAAGGGTAAGCTGAGGAGCTATGGGAGGAAGAAGCGGAAGCAG
TCGTCTGAAACTAATGACGTTCCGGAAATGATTGCTTCCGGAAGTAAGAAAACCAAAGTGTGGGGTTCAGAGGAGCGCTTCCATTTGAATGCACAACAAATCCGTGGGAA
GGAATCTCTAAAACCATTGAACAAGGGATATAATCCCCAGTATTGTTCTGGCCCAGAAATTAGTGCAAATGCACCAGATGGCGGTAACGACAAGATGGAAAATGTTTACA
CTTTCTCATCTGAAAATGGCAGCTGCAATCCTGAAAAAAGTGGCAAATCACCCAAGGATCTAGTAAGTGAAGCTAAGCAGTCTCTTGATTATAATGACGTCCCATTTTGT
AGTTGCTGCAATTGCACCGGTGGACTCTATCAGCCTGTTGTCCGCCTCGCTAATTAA
Protein sequence
MLCSLSAGKAGPNWLDRLRSNKGFPIGDNLGLDHFLSNQNLDNPSPLRFPLLLLFRILTPTRLRPTTSLTPILCAGPSPPALPPKMGTEDHMKQSKPKICSIPSVNNANC
TDDKNLCCAQAQKEDNILSSNSDNSSKGGNGGSDKELKGQNARGKVEEEDVEDEKGEKELIGYSKSEVTVIDTSCDVWKSDKLIFRRKNVWKVKEKKGKLRSYGRKKRKQ
SSETNDVPEMIASGSKKTKVWGSEERFHLNAQQIRGKESLKPLNKGYNPQYCSGPEISANAPDGGNDKMENVYTFSSENGSCNPEKSGKSPKDLVSEAKQSLDYNDVPFC
SCCNCTGGLYQPVVRLAN