; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016216 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016216
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUnknown protein
Genome locationtig00007724:792304..799077
RNA-Seq ExpressionSgr016216
SyntenySgr016216
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022138526.1 uncharacterized protein LOC111009671 [Momordica charantia]9.5e-12788.17Show/hide
Query:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCS-RSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGD
        MTDRLTLQFAHRSLSLG C SL L IRCRFQ+Q+ F      A SPLCS  SGLVFRSNLRQ+SPI  SL RGMSNSMVNEAEP+ELRDESDFEA+C  D
Subjt:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCS-RSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGD

Query:  DYISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVL
        +YISICGFGSLLSERSARSTFPEL+NFRVARL+GFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIII+VFEIKKSEIPAFIHREIEFRFLAVL
Subjt:  DYISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVL

Query:  PETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH
        PE LDG LYDKPAVLCSRSTDEEFFQVRCKGN+DIYFQHYGRHNIDKIWRDDILPCRVYLRH
Subjt:  PETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH

XP_022959219.1 uncharacterized protein LOC111460273 isoform X1 [Cucurbita moschata]1.1e-12586.21Show/hide
Query:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDD
        MTDRL LQFAHRSL LGH ASLR+ +R +FQVQ+A P SF  A+SPLCS S LV RS LR+ISPI+ SL  GMSNS+VNEAEP+ELRDESDFEAI SG D
Subjt:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDD

Query:  YISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLP
        +IS+CGFGSLLSERSARSTFPELINFRVARLNGFRRVF NVAP+FFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFI REIEFRFLAVLP
Subjt:  YISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLP

Query:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH
        ETLDGKLYDKPAVLC RSTDEEFFQVRCKGNEDI+ QHYGRHNIDKIWRDDILPCRVYLRH
Subjt:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH

XP_022959221.1 uncharacterized protein LOC111460273 isoform X3 [Cucurbita moschata]1.1e-12586.21Show/hide
Query:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDD
        MTDRL LQFAHRSL LGH ASLR+ +R +FQVQ+A P SF  A+SPLCS S LV RS LR+ISPI+ SL  GMSNS+VNEAEP+ELRDESDFEAI SG D
Subjt:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDD

Query:  YISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLP
        +IS+CGFGSLLSERSARSTFPELINFRVARLNGFRRVF NVAP+FFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFI REIEFRFLAVLP
Subjt:  YISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLP

Query:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH
        ETLDGKLYDKPAVLC RSTDEEFFQVRCKGNEDI+ QHYGRHNIDKIWRDDILPCRVYLRH
Subjt:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH

XP_023548382.1 uncharacterized protein LOC111807043 isoform X1 [Cucurbita pepo subsp. pepo]1.6e-12686.97Show/hide
Query:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDD
        MTDRL LQFAHRSL LGH ASLR+SIR +FQVQ+A P SF  A+SPLCS S LV RS LR+ISPI+ SL  GMSNSMVNEAEP+ELRDESDFEAI SG D
Subjt:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDD

Query:  YISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLP
        + S+CGFGSLLSERSARSTFPELINFRVARLNGFRRVF NVAP+FFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFI REIEFRFLAVLP
Subjt:  YISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLP

Query:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH
        ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGN+DI+ QHYGRHNIDKIWRDDILPCRVYLRH
Subjt:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH

XP_023548384.1 uncharacterized protein LOC111807043 isoform X3 [Cucurbita pepo subsp. pepo]1.6e-12686.97Show/hide
Query:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDD
        MTDRL LQFAHRSL LGH ASLR+SIR +FQVQ+A P SF  A+SPLCS S LV RS LR+ISPI+ SL  GMSNSMVNEAEP+ELRDESDFEAI SG D
Subjt:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDD

Query:  YISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLP
        + S+CGFGSLLSERSARSTFPELINFRVARLNGFRRVF NVAP+FFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFI REIEFRFLAVLP
Subjt:  YISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLP

Query:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH
        ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGN+DI+ QHYGRHNIDKIWRDDILPCRVYLRH
Subjt:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH

TrEMBL top hitse value%identityAlignment
A0A6J1C9Z2 uncharacterized protein LOC1110096714.6e-12788.17Show/hide
Query:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCS-RSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGD
        MTDRLTLQFAHRSLSLG C SL L IRCRFQ+Q+ F      A SPLCS  SGLVFRSNLRQ+SPI  SL RGMSNSMVNEAEP+ELRDESDFEA+C  D
Subjt:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCS-RSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGD

Query:  DYISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVL
        +YISICGFGSLLSERSARSTFPEL+NFRVARL+GFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIII+VFEIKKSEIPAFIHREIEFRFLAVL
Subjt:  DYISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVL

Query:  PETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH
        PE LDG LYDKPAVLCSRSTDEEFFQVRCKGN+DIYFQHYGRHNIDKIWRDDILPCRVYLRH
Subjt:  PETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH

A0A6J1EWR6 uncharacterized protein LOC1114368423.4e-11480.15Show/hide
Query:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSG------LVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEA
        M + L+LQFAHRSL L   AS    IR RFQV+    +SF I +S LCS SG      LVF SNLR I PI+ SL RGMSNS+ N+ EP+ELRDESDFE+
Subjt:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSG------LVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEA

Query:  ICSGDDYISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFR
        + S D YIS+CGFGSLLSERSARSTFPELINFRVARLNGFRR+F NVAPVFFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFI REIEFR
Subjt:  ICSGDDYISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFR

Query:  FLAVLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH
        FLAVLPETLDGKLYDKPAVLCSRSTDEEFF+VRCKGNED+YFQHYGRHNIDKIWRDDILPCRVYLRH
Subjt:  FLAVLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH

A0A6J1H494 uncharacterized protein LOC111460273 isoform X35.1e-12686.21Show/hide
Query:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDD
        MTDRL LQFAHRSL LGH ASLR+ +R +FQVQ+A P SF  A+SPLCS S LV RS LR+ISPI+ SL  GMSNS+VNEAEP+ELRDESDFEAI SG D
Subjt:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDD

Query:  YISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLP
        +IS+CGFGSLLSERSARSTFPELINFRVARLNGFRRVF NVAP+FFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFI REIEFRFLAVLP
Subjt:  YISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLP

Query:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH
        ETLDGKLYDKPAVLC RSTDEEFFQVRCKGNEDI+ QHYGRHNIDKIWRDDILPCRVYLRH
Subjt:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH

A0A6J1H7C8 uncharacterized protein LOC111460273 isoform X15.1e-12686.21Show/hide
Query:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDD
        MTDRL LQFAHRSL LGH ASLR+ +R +FQVQ+A P SF  A+SPLCS S LV RS LR+ISPI+ SL  GMSNS+VNEAEP+ELRDESDFEAI SG D
Subjt:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDD

Query:  YISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLP
        +IS+CGFGSLLSERSARSTFPELINFRVARLNGFRRVF NVAP+FFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFI REIEFRFLAVLP
Subjt:  YISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLP

Query:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH
        ETLDGKLYDKPAVLC RSTDEEFFQVRCKGNEDI+ QHYGRHNIDKIWRDDILPCRVYLRH
Subjt:  ETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH

A0A6J1KHJ0 uncharacterized protein LOC1114944342.0e-11481.13Show/hide
Query:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSG----LVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAIC
        M + L+LQFAHRSL L   AS    IR RFQV+  FP SF I +S LCS SG    LVF SNLR + PI+ SL RGMSNS+VN+  P+ELRDESDFE++ 
Subjt:  MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSG----LVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAIC

Query:  SGDDYISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFL
        S D YIS+CGFGSLLSERSARSTFPELINFRVARLNGFRR+F NVAPVFFERGIAKPETKEISSLCAEPCEGETII+TVFEIKKSEIPAFI REIEFRFL
Subjt:  SGDDYISICGFGSLLSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFL

Query:  AVLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH
        AVLPETLDGKLYDKPAVLCSRSTDEEFF+VRCKGNED+YFQHYGRHNIDKIWRDDILPCRVYLRH
Subjt:  AVLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G16060.1 unknown protein3.0e-7864.47Show/hide
Query:  AFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGR-GMSNSMVNEAEPRELRDESDFEAICSGDDYISICGFGSLLSERSARSTFPELINFRVARLNG
        + P SF+ + +P  SR+     S  R    +  S  R     S+   ++  EL DESDFE + S D+ ISI GFGSLLSERSARSTFP+L NFR+A+L G
Subjt:  AFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGR-GMSNSMVNEAEPRELRDESDFEAICSGDDYISICGFGSLLSERSARSTFPELINFRVARLNG

Query:  FRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNED
        FRRVFA+ AP+FFERGIA PETKEISSL  EPCEGE++++TVFEIK SEIPAFI RE+EFRFLAV+PETL+GK Y   AVLC R +DEEFFQ+RCKGN+ 
Subjt:  FRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLPETLDGKLYDKPAVLCSRSTDEEFFQVRCKGNED

Query:  IYFQHYGRHNIDKIWRDDILPCRVYLRH
        IYFQHYGR  IDKIWRDDILPCR+YLRH
Subjt:  IYFQHYGRHNIDKIWRDDILPCRVYLRH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGATCGTCTCACTCTCCAATTCGCTCATCGTTCTCTCTCTCTAGGGCATTGCGCTTCGCTTCGCTTGTCAATCCGCTGTAGGTTTCAGGTTCAGCTTGCGTTCCC
TGTCTCTTTCTCTATAGCGTACTCTCCTCTCTGCTCCCGCTCCGGACTGGTTTTTCGAAGTAACCTCCGGCAAATATCTCCGATCGACGTATCCTTAGGCCGCGGAATGT
CAAATTCGATGGTGAACGAAGCAGAACCGAGAGAATTGCGAGACGAGTCCGATTTTGAAGCCATCTGCTCCGGCGATGATTACATTTCCATCTGTGGCTTTGGTTCTCTT
CTATCTGAGAGGAGCGCGCGAAGTACCTTTCCTGAACTGATCAACTTTAGAGTCGCGAGATTGAACGGCTTCAGGCGCGTTTTCGCAAATGTAGCTCCCGTATTCTTTGA
GCGCGGCATTGCAAAACCTGAAACCAAGGAGATTTCAAGCTTGTGTGCGGAGCCTTGCGAAGGAGAAACTATCATCATTACGGTTTTTGAGATTAAGAAGTCTGAGATTC
CAGCTTTTATACACAGAGAGATTGAGTTTCGATTTCTTGCTGTTCTTCCTGAAACATTAGATGGAAAGCTATATGATAAACCAGCGGTGCTTTGTTCTCGATCCACTGAT
GAGGAATTCTTCCAAGTGAGATGCAAAGGAAATGAGGACATTTATTTTCAGCATTATGGTCGTCATAATATTGATAAGATCTGGAGAGATGATATCTTACCCTGTCGCGT
CTATCTTCGACACTGGCATTATGGAAGAGGAGCCTCCAGAATCCCTCAAGTTCCGATATGGCGGTTGATATTACAGAACAAAAAAACTGCAGAGAAGGAAGCCTTGAGAG
ATATTAGAGGAGATTTGGATGAGTCTTCTGCCATGGAGTTGAGTGACATGAATGTTGGCTTTCTGCACCCAGACAGGGCGTCTCATATATACCAACTCTTTCCTGCACTA
TCTCCTCCTCGTCGTCGTCATGAACTACATTATCAGAATCCTGTGTGGGGACAGAAACAATATGCATGGTTTTGCCGGGAGGATAGAAACGTATGTTTTCCGAAGCATCC
AGAGAAGGCATTGCCTTTTGGCTGCTTCGCTCAACAACTGTGGCGCTCCTCGTCTGTGTCGTGTTCTGAACTATCATCACCTGAAGAACCAGAACCAGACTCTAGCTTCT
TCTCAATTGCATTGATCGTAACTTGTTCAGCTATAAGTGATTCGTGCTTTCTTTCAGATATCAGAGGAGCTTCAGGCAACTCCTCCGTGGGATTAGATAAAACGCTTCCA
TTACGTCTGCGAGCACCCATGCAAGACCAGGAAGATAGAGAGGAGCGAGTCCTGACAACCGCCCGAGCTACATTTTGGGCACACTGGGCGGAGAAGGGCGCCCGCACCGG
CGACCTTTGCTTTTGCAGTGGCTATAGATGGAAGGCGTCGTCTATAGAAGCAGCTGAGAAGCTGGGCACCAGATCTGAGCCATTAATAATTGTAGTGATGAACTGCTTAC
CTGATTCTGCCAACTCCCATGTCATACAGGCAGCTACACAAATGGAAGTAATTAATACACGTCACGAAAAGATTATGATGACAACAAACGAAAAGAATGAGGATAAGAAA
TCTCGAAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGACTGATCGTCTCACTCTCCAATTCGCTCATCGTTCTCTCTCTCTAGGGCATTGCGCTTCGCTTCGCTTGTCAATCCGCTGTAGGTTTCAGGTTCAGCTTGCGTTCCC
TGTCTCTTTCTCTATAGCGTACTCTCCTCTCTGCTCCCGCTCCGGACTGGTTTTTCGAAGTAACCTCCGGCAAATATCTCCGATCGACGTATCCTTAGGCCGCGGAATGT
CAAATTCGATGGTGAACGAAGCAGAACCGAGAGAATTGCGAGACGAGTCCGATTTTGAAGCCATCTGCTCCGGCGATGATTACATTTCCATCTGTGGCTTTGGTTCTCTT
CTATCTGAGAGGAGCGCGCGAAGTACCTTTCCTGAACTGATCAACTTTAGAGTCGCGAGATTGAACGGCTTCAGGCGCGTTTTCGCAAATGTAGCTCCCGTATTCTTTGA
GCGCGGCATTGCAAAACCTGAAACCAAGGAGATTTCAAGCTTGTGTGCGGAGCCTTGCGAAGGAGAAACTATCATCATTACGGTTTTTGAGATTAAGAAGTCTGAGATTC
CAGCTTTTATACACAGAGAGATTGAGTTTCGATTTCTTGCTGTTCTTCCTGAAACATTAGATGGAAAGCTATATGATAAACCAGCGGTGCTTTGTTCTCGATCCACTGAT
GAGGAATTCTTCCAAGTGAGATGCAAAGGAAATGAGGACATTTATTTTCAGCATTATGGTCGTCATAATATTGATAAGATCTGGAGAGATGATATCTTACCCTGTCGCGT
CTATCTTCGACACTGGCATTATGGAAGAGGAGCCTCCAGAATCCCTCAAGTTCCGATATGGCGGTTGATATTACAGAACAAAAAAACTGCAGAGAAGGAAGCCTTGAGAG
ATATTAGAGGAGATTTGGATGAGTCTTCTGCCATGGAGTTGAGTGACATGAATGTTGGCTTTCTGCACCCAGACAGGGCGTCTCATATATACCAACTCTTTCCTGCACTA
TCTCCTCCTCGTCGTCGTCATGAACTACATTATCAGAATCCTGTGTGGGGACAGAAACAATATGCATGGTTTTGCCGGGAGGATAGAAACGTATGTTTTCCGAAGCATCC
AGAGAAGGCATTGCCTTTTGGCTGCTTCGCTCAACAACTGTGGCGCTCCTCGTCTGTGTCGTGTTCTGAACTATCATCACCTGAAGAACCAGAACCAGACTCTAGCTTCT
TCTCAATTGCATTGATCGTAACTTGTTCAGCTATAAGTGATTCGTGCTTTCTTTCAGATATCAGAGGAGCTTCAGGCAACTCCTCCGTGGGATTAGATAAAACGCTTCCA
TTACGTCTGCGAGCACCCATGCAAGACCAGGAAGATAGAGAGGAGCGAGTCCTGACAACCGCCCGAGCTACATTTTGGGCACACTGGGCGGAGAAGGGCGCCCGCACCGG
CGACCTTTGCTTTTGCAGTGGCTATAGATGGAAGGCGTCGTCTATAGAAGCAGCTGAGAAGCTGGGCACCAGATCTGAGCCATTAATAATTGTAGTGATGAACTGCTTAC
CTGATTCTGCCAACTCCCATGTCATACAGGCAGCTACACAAATGGAAGTAATTAATACACGTCACGAAAAGATTATGATGACAACAAACGAAAAGAATGAGGATAAGAAA
TCTCGAAGATAA
Protein sequenceShow/hide protein sequence
MTDRLTLQFAHRSLSLGHCASLRLSIRCRFQVQLAFPVSFSIAYSPLCSRSGLVFRSNLRQISPIDVSLGRGMSNSMVNEAEPRELRDESDFEAICSGDDYISICGFGSL
LSERSARSTFPELINFRVARLNGFRRVFANVAPVFFERGIAKPETKEISSLCAEPCEGETIIITVFEIKKSEIPAFIHREIEFRFLAVLPETLDGKLYDKPAVLCSRSTD
EEFFQVRCKGNEDIYFQHYGRHNIDKIWRDDILPCRVYLRHWHYGRGASRIPQVPIWRLILQNKKTAEKEALRDIRGDLDESSAMELSDMNVGFLHPDRASHIYQLFPAL
SPPRRRHELHYQNPVWGQKQYAWFCREDRNVCFPKHPEKALPFGCFAQQLWRSSSVSCSELSSPEEPEPDSSFFSIALIVTCSAISDSCFLSDIRGASGNSSVGLDKTLP
LRLRAPMQDQEDREERVLTTARATFWAHWAEKGARTGDLCFCSGYRWKASSIEAAEKLGTRSEPLIIVVMNCLPDSANSHVIQAATQMEVINTRHEKIMMTTNEKNEDKK
SRR