; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027573 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027573
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein-serine/threonine phosphatase
Genome locationtig00153054:2691686..2703096
RNA-Seq ExpressionSgr027573
SyntenySgr027573
Gene Ontology termsGO:0006369 - termination of RNA polymerase II transcription (biological process)
GO:0006378 - mRNA polyadenylation (biological process)
GO:0070940 - dephosphorylation of RNA polymerase II C-terminal domain (biological process)
GO:0005847 - mRNA cleavage and polyadenylation specificity factor complex (cellular component)
GO:0008420 - RNA polymerase II CTD heptapeptide repeat phosphatase activity (molecular function)
InterPro domainsIPR006811 - RNA polymerase II subunit A


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137380.1 RNA polymerase II subunit A C-terminal domain phosphatase SSU72 [Cucumis sativus]8.8e-9795.21Show/hide
Query:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA
        MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGIL MLKRNA VK APQRWQDNAA
Subjt:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA

Query:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        DG FDVVFTFEEKVFDMVIEDLNTRDH +MKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWED+IDDIVI FEKQ RRKLLY
Subjt:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

XP_008446784.1 PREDICTED: RNA polymerase II subunit A C-terminal domain phosphatase SSU72 [Cucumis melo]3.0e-9795.74Show/hide
Query:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA
        MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGIL MLKRNA VK APQRWQDNAA
Subjt:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA

Query:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        DG FDVVFTFEEKVFDMVIEDLNTRDH LMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWED+IDDIVI FEKQ RRKLLY
Subjt:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

XP_022137335.1 RNA polymerase II subunit A C-terminal domain phosphatase SSU72 [Momordica charantia]2.1e-9896.81Show/hide
Query:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA
        MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGIL MLKRNATVK APQRWQDNAA
Subjt:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA

Query:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        DG FDVVF+FEEKVFDMVIEDLNTRDH LMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWE TIDDIVIAFEKQHRRKLLY
Subjt:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

XP_022948646.1 RNA polymerase II subunit A C-terminal domain phosphatase SSU72 [Cucurbita moschata]9.4e-9996.81Show/hide
Query:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA
        MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGIL MLKRNA VK APQRWQDNAA
Subjt:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA

Query:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIER ESWEDTIDDIVIAFEK  RRKLLY
Subjt:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

XP_022997876.1 RNA polymerase II subunit A C-terminal domain phosphatase SSU72 [Cucurbita maxima]1.0e-9795.74Show/hide
Query:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA
        MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGIL MLKRNA VK APQRWQDNAA
Subjt:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA

Query:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        DGFFDVVFTFEEKVFDMVIEDLNTRDH LMKTVLIVNLEVKDNHEEAAIGARLTF+LCQEIER ESWEDTIDDIVIAFEK  RRKLLY
Subjt:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

TrEMBL top hitse value%identityAlignment
A0A1S3BFX9 Protein-serine/threonine phosphatase1.5e-9795.74Show/hide
Query:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA
        MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGIL MLKRNA VK APQRWQDNAA
Subjt:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA

Query:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        DG FDVVFTFEEKVFDMVIEDLNTRDH LMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWED+IDDIVI FEKQ RRKLLY
Subjt:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

A0A2P5FAG2 Protein-serine/threonine phosphatase4.4e-9485.35Show/hide
Query:  WSEGTAGKFAMKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKP
        W E +     MKYRYAMVCSSNQNRSMEAHS+LK +GF+VSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGIL MLKRN+TVK 
Subjt:  WSEGTAGKFAMKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKP

Query:  APQRWQDNAADGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        APQRWQDNAADG FDVVFTFEEKVFDMVIEDL+TR+HVLMK+VL++NLEVKDNHEEAA+GARL  DLCQEIE TE WED+IDD+V AFEKQHRRKLLY
Subjt:  APQRWQDNAADGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

A0A6J1C6C8 Protein-serine/threonine phosphatase1.0e-9896.81Show/hide
Query:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA
        MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGIL MLKRNATVK APQRWQDNAA
Subjt:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA

Query:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        DG FDVVF+FEEKVFDMVIEDLNTRDH LMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWE TIDDIVIAFEKQHRRKLLY
Subjt:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

A0A6J1G9U0 Protein-serine/threonine phosphatase4.5e-9996.81Show/hide
Query:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA
        MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGIL MLKRNA VK APQRWQDNAA
Subjt:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA

Query:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIER ESWEDTIDDIVIAFEK  RRKLLY
Subjt:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

A0A6J1K8Q5 Protein-serine/threonine phosphatase5.0e-9895.74Show/hide
Query:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA
        MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGIL MLKRNA VK APQRWQDNAA
Subjt:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA

Query:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        DGFFDVVFTFEEKVFDMVIEDLNTRDH LMKTVLIVNLEVKDNHEEAAIGARLTF+LCQEIER ESWEDTIDDIVIAFEK  RRKLLY
Subjt:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

SwissProt top hitse value%identityAlignment
Q17QI2 RNA polymerase II subunit A C-terminal domain phosphatase SSU721.2e-5154.05Show/hide
Query:  RYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAADGF
        R A+VCSSNQNRSMEAH+IL  +GF+V S+GTG HVKLPGP+  +PNVYDF T Y  M++DL RKD ELY +NGIL ML RN  +KP P+R+Q N  D  
Subjt:  RYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAADGF

Query:  FDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        FD++ T EE+V+D V+EDLN+R+    + V ++N++++DNHEEA +GA L  +LCQ I+ TE  E+ ID+++  FE++  R  L+
Subjt:  FDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

Q4KLK9 RNA polymerase II subunit A C-terminal domain phosphatase SSU728.9e-5254.59Show/hide
Query:  RYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAADGF
        R A+VCSSNQNRSMEAH+IL  +GF+V S+GTG HVKLPGP+  +PNVYDF T Y  M++DL RKD ELY +NGIL ML RN  +KP P+R+Q N  D  
Subjt:  RYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAADGF

Query:  FDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        FD++ T EE+V+D V+EDLN+R+    + V +VN++++DNHEEA +GA L  +LCQ I+ TE  E+ ID+++  FE++  R  L+
Subjt:  FDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

Q5ZJQ7 RNA polymerase II subunit A C-terminal domain phosphatase SSU721.5e-5154.05Show/hide
Query:  RYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAADGF
        R A+VCSSNQNRSMEAH+IL  +GF+V S+GTG HVKLPGP+  +PNVYDF T Y  M++DL RKD ELY +NGIL ML RN  +KP P+R+Q N  D  
Subjt:  RYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAADGF

Query:  FDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        FD++ T EE+V+D V+EDLN+R+    + V ++N++++DNHEEA +GA L  +LCQ I+ TE  E+ ID+++  FE++  R  L+
Subjt:  FDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

Q9CY97 RNA polymerase II subunit A C-terminal domain phosphatase SSU726.9e-5254.59Show/hide
Query:  RYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAADGF
        R A+VCSSNQNRSMEAH+IL  +GF+V S+GTG HVKLPGP+  +PNVYDF T Y  M++DL RKD ELY +NGIL ML RN  +KP P+R+Q N  D  
Subjt:  RYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAADGF

Query:  FDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        FD++ T EE+V+D V+EDLN+R+    + V +VN++++DNHEEA +GA L  +LCQ I+ TE  E+ ID+++  FE++  R  L+
Subjt:  FDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

Q9NP77 RNA polymerase II subunit A C-terminal domain phosphatase SSU728.9e-5254.59Show/hide
Query:  RYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAADGF
        R A+VCSSNQNRSMEAH+IL  +GF+V S+GTG HVKLPGP+  +PNVYDF T Y  M++DL RKD ELY +NGIL ML RN  +KP P+R+Q N  D  
Subjt:  RYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAADGF

Query:  FDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        FD++ T EE+V+D V+EDLN+R+    + V +VN++++DNHEEA +GA L  +LCQ I+ TE  E+ ID+++  FE++  R  L+
Subjt:  FDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY

Arabidopsis top hitse value%identityAlignment
AT1G73820.1 Ssu72-like family protein4.4e-8678.72Show/hide
Query:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA
        M++RYAMVCSSNQNRSMEAH++LK +G +V+SYGTG+HVKLPGPSLREPNVYDFGTPYK MFD+LRRKDPELYKRNGIL M+KRN +VK APQRWQDNA 
Subjt:  MKYRYAMVCSSNQNRSMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAA

Query:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY
        DG FDVV TFEEKVFD V+EDLN R+  L KT+L++NLEVKDNHEEAAIG RL  +LCQEIE  E+WEDTIDDIV  FEKQHRRKL+Y
Subjt:  DGFFDVVFTFEEKVFDMVIEDLNTRDHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATACCTGGAGTCCACATAACTCTCAACATCCCACAGGAAAGAACCGAAAGTCTCTTCAACCCACCAAAGCTGATTTTGATAAGCTCGTCTTTGGAGGTATCTACTGT
TCCTTGCGGGTTGATCACAATCTGCTGTCTTCGAAACAACGCCTCCGTGTGTCTCCCAATGATGCTAACGGCCTCTTTTGACCTTATTGTTGCTAATCTCTGCAGAGTCT
CCCCATCAAATGACATTAAGTGTGTTCTCAAGCGGGAGGGCTTGATCCCCAGACCCAACTCCCCAGGACTAACATCTCCTGGTCCTGCCATAGGAAGAGATTTCAGGATG
CACTGGTGCGTTGGAGGAGGAGGAATCCCATCTCACATCCCCAACTTCTCCTCCTTCATCAGTTGCATCTGGAAGTATCCGTACATCACAGACGCTGCATACACCTGCCC
CACTCTCAGTTTGCTTATCTGCGCTAGTGAGGCAAACCGCTCCAGCTTTTCATCGTCTTGTGGCCATGTGTCAACTCGGTCATATGGGTCCGTAGACGATGGGCCAATAG
CTGGTATTAGAGGAAAATTGGCGTCCATGAATCGTTGCACCACCATTGCATATAAAATCTCTTCTAAAGCCCTTCTCCTTTCATTTGCCTTGACCTCTTCAATTCTTCTA
AACATGTCAAATGCTAAGCTACGTGAGTTGTCCAGAAAAAGCAGCTTGATGCTTTGGTCACGCATGACGTTGGGTTTGCAGCTGTTTAAGCTGCTGGTCGACGGCTGCCG
GAAGAAGGTGAGGATGGGTAGTCAGGATTTGAGACAGAAACTGGCGAATTGGAGACTGGAGCTGAAGTGGAGCAATTTGGGCAGCAGAACCAGCGGGGTCAGAAGGAAAG
GAAAAGAAAATGAAACAAACCTGGAAAGAGGAAACTGGAGTGAAGGTACAGCAGGGAAGTTCGCAATGAAGTATCGTTATGCAATGGTTTGCTCATCCAACCAGAATCGA
AGCATGGAGGCTCACTCTATCCTCAAAAGCAAGGGCTTCAATGTTTCATCCTATGGTACTGGGGCTCACGTCAAGCTTCCTGGACCTTCTCTTCGAGAACCCAATGTATA
CGACTTTGGTACCCCCTACAAGCACATGTTTGATGATCTCCGTCGAAAAGATCCTGAACTATACAAGCGTAATGGCATTCTGTCCATGCTTAAAAGAAATGCAACAGTTA
AACCGGCTCCTCAACGTTGGCAAGATAATGCCGCGGACGGTTTCTTTGACGTGGTATTTACTTTTGAAGAAAAGGTTTTTGATATGGTTATTGAAGATCTCAACACCCGT
GATCATGTACTGATGAAAACCGTACTGATTGTAAATTTGGAGGTAAAAGATAACCACGAGGAGGCAGCTATAGGAGCACGGCTTACTTTTGATCTTTGCCAGGAGATTGA
ACGGACCGAATCGTGGGAAGATACAATAGATGACATCGTGATTGCCTTCGAGAAACAGCACAGAAGAAAGCTGTTGTATATGTCTGATTATTGTTGTGTGGCTTCACAGG
GCAAAGTCCTGGAACCACTCACAAGTCGACAATGGAACCTGGTTTATCTTTTCCTCAAATGCATCATATTTGTTCAACAGCAGCACAAAAGGGGTATTCCTAAAACAAGG
GTGCCTCACCAAACGCTCAAACAAATTCCTACTAGTCAGCATTTTATTCTGCAAAGGACCCTTACTATGGGACCACATCTGGTCATAGTCACTCAAGGAAATACAAAATA
TCACGGCCCTCACAAGGCTCGTATTCATTGCTTGATATCTCTATCACCTGCAGAAACATCCATTGAAGATAACGAATTGGAGATTTGCAAATTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGATACCTGGAGTCCACATAACTCTCAACATCCCACAGGAAAGAACCGAAAGTCTCTTCAACCCACCAAAGCTGATTTTGATAAGCTCGTCTTTGGAGGTATCTACTGT
TCCTTGCGGGTTGATCACAATCTGCTGTCTTCGAAACAACGCCTCCGTGTGTCTCCCAATGATGCTAACGGCCTCTTTTGACCTTATTGTTGCTAATCTCTGCAGAGTCT
CCCCATCAAATGACATTAAGTGTGTTCTCAAGCGGGAGGGCTTGATCCCCAGACCCAACTCCCCAGGACTAACATCTCCTGGTCCTGCCATAGGAAGAGATTTCAGGATG
CACTGGTGCGTTGGAGGAGGAGGAATCCCATCTCACATCCCCAACTTCTCCTCCTTCATCAGTTGCATCTGGAAGTATCCGTACATCACAGACGCTGCATACACCTGCCC
CACTCTCAGTTTGCTTATCTGCGCTAGTGAGGCAAACCGCTCCAGCTTTTCATCGTCTTGTGGCCATGTGTCAACTCGGTCATATGGGTCCGTAGACGATGGGCCAATAG
CTGGTATTAGAGGAAAATTGGCGTCCATGAATCGTTGCACCACCATTGCATATAAAATCTCTTCTAAAGCCCTTCTCCTTTCATTTGCCTTGACCTCTTCAATTCTTCTA
AACATGTCAAATGCTAAGCTACGTGAGTTGTCCAGAAAAAGCAGCTTGATGCTTTGGTCACGCATGACGTTGGGTTTGCAGCTGTTTAAGCTGCTGGTCGACGGCTGCCG
GAAGAAGGTGAGGATGGGTAGTCAGGATTTGAGACAGAAACTGGCGAATTGGAGACTGGAGCTGAAGTGGAGCAATTTGGGCAGCAGAACCAGCGGGGTCAGAAGGAAAG
GAAAAGAAAATGAAACAAACCTGGAAAGAGGAAACTGGAGTGAAGGTACAGCAGGGAAGTTCGCAATGAAGTATCGTTATGCAATGGTTTGCTCATCCAACCAGAATCGA
AGCATGGAGGCTCACTCTATCCTCAAAAGCAAGGGCTTCAATGTTTCATCCTATGGTACTGGGGCTCACGTCAAGCTTCCTGGACCTTCTCTTCGAGAACCCAATGTATA
CGACTTTGGTACCCCCTACAAGCACATGTTTGATGATCTCCGTCGAAAAGATCCTGAACTATACAAGCGTAATGGCATTCTGTCCATGCTTAAAAGAAATGCAACAGTTA
AACCGGCTCCTCAACGTTGGCAAGATAATGCCGCGGACGGTTTCTTTGACGTGGTATTTACTTTTGAAGAAAAGGTTTTTGATATGGTTATTGAAGATCTCAACACCCGT
GATCATGTACTGATGAAAACCGTACTGATTGTAAATTTGGAGGTAAAAGATAACCACGAGGAGGCAGCTATAGGAGCACGGCTTACTTTTGATCTTTGCCAGGAGATTGA
ACGGACCGAATCGTGGGAAGATACAATAGATGACATCGTGATTGCCTTCGAGAAACAGCACAGAAGAAAGCTGTTGTATATGTCTGATTATTGTTGTGTGGCTTCACAGG
GCAAAGTCCTGGAACCACTCACAAGTCGACAATGGAACCTGGTTTATCTTTTCCTCAAATGCATCATATTTGTTCAACAGCAGCACAAAAGGGGTATTCCTAAAACAAGG
GTGCCTCACCAAACGCTCAAACAAATTCCTACTAGTCAGCATTTTATTCTGCAAAGGACCCTTACTATGGGACCACATCTGGTCATAGTCACTCAAGGAAATACAAAATA
TCACGGCCCTCACAAGGCTCGTATTCATTGCTTGATATCTCTATCACCTGCAGAAACATCCATTGAAGATAACGAATTGGAGATTTGCAAATTGTAA
Protein sequenceShow/hide protein sequence
MIPGVHITLNIPQERTESLFNPPKLILISSSLEVSTVPCGLITICCLRNNASVCLPMMLTASFDLIVANLCRVSPSNDIKCVLKREGLIPRPNSPGLTSPGPAIGRDFRM
HWCVGGGGIPSHIPNFSSFISCIWKYPYITDAAYTCPTLSLLICASEANRSSFSSSCGHVSTRSYGSVDDGPIAGIRGKLASMNRCTTIAYKISSKALLLSFALTSSILL
NMSNAKLRELSRKSSLMLWSRMTLGLQLFKLLVDGCRKKVRMGSQDLRQKLANWRLELKWSNLGSRTSGVRRKGKENETNLERGNWSEGTAGKFAMKYRYAMVCSSNQNR
SMEAHSILKSKGFNVSSYGTGAHVKLPGPSLREPNVYDFGTPYKHMFDDLRRKDPELYKRNGILSMLKRNATVKPAPQRWQDNAADGFFDVVFTFEEKVFDMVIEDLNTR
DHVLMKTVLIVNLEVKDNHEEAAIGARLTFDLCQEIERTESWEDTIDDIVIAFEKQHRRKLLYMSDYCCVASQGKVLEPLTSRQWNLVYLFLKCIIFVQQQHKRGIPKTR
VPHQTLKQIPTSQHFILQRTLTMGPHLVIVTQGNTKYHGPHKARIHCLISLSPAETSIEDNELEICKL