CuGenDBv2

Sgr027857 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr027857
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Nuclear nucleic acid-binding protein C1D
Genome location: tig00153055:3279739..3284367
RNA-Seq Expression: Sgr027857
Synteny: Sgr027857
Gene Ontology terms:
  GO:0000460 - maturation of 5.8S rRNA (biological process)
  GO:0010468 - regulation of gene expression (biological process)
  GO:0000178 - exosome (RNase complex) (cellular component)
  GO:0005730 - nucleolus (cellular component)
  GO:0005737 - cytoplasm (cellular component)
  GO:0003677 - DNA binding (molecular function)
  GO:0003723 - RNA binding (molecular function)
InterPro domains: IPR011082 - Exosome-associated factor Rrp47/DNA strand repair C1D
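
The genome location string above follows a `seqid:start..end` pattern. As a minimal sketch, it can be split into its parts as shown below. The helper names (`Location`, `parse_location`) are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
        return self.end - self.start + 1


# seqid, a colon, then two integers separated by ".."
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a string such as 'tig00153055:3279739..3284367'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Location(match["seqid"], start, end)


if __name__ == "__main__":
    loc = parse_location("tig00153055:3279739..3284367")
    print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span computed this way covers the whole gene region on the contig, which can be longer than the CDS listed further down the page.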


Homology
GenBank top hits (e value, % identity, alignment)
KAG7037658.1 Nuclear nucleic acid-binding protein C1D [Cucurbita argyrosperma subsp. argyrosperma]; e value: 1.3e-83; % identity: 89.95
Query:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG
        GVRESAVVPE VMD+VK TLDNVE+V+THLISFLSIA P VLAQM+PLQRAQSMLLL+R TTTLF+LKLRCSGVHPDDHPIKSELERLSLYQ+KLERFIG
Subjt:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG

Query:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG
        LSKAPL+ STTLNYQAATRFIEHSLPDLTQEQKQ MRDISRGKGPKMKQ+ER VQKKRKYQSSEKQSVQ AAKEFLEKAARELLG+NNG
Subjt:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG

XP_022142600.1 nuclear nucleic acid-binding protein C1D [Momordica charantia]; e value: 8.5e-88; % identity: 92.59
Query:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG
        GVRESAVVPES MD+VKKTLDNVEEVRTHLISFLSI+EP+VLAQMQPLQRAQSMLLLAR+TTTLFSLK+RCSGVHPDDHP+KSELERLSLYQ+KLERFIG
Subjt:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG

Query:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG
        LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQK+SMRDISRGKGPK+K LERN QKKRKYQSSEKQSVQTAAK+FL KAARELLGENNG
Subjt:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG

XP_022939979.1 nuclear nucleic acid-binding protein C1D-like [Cucurbita moschata]; e value: 2.0e-84; % identity: 90.48
Query:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG
        GVRESAVVPE VMD+VK TLDNVE+V+THLISFLSIAEP VLAQM+PLQRAQSMLLL+R TTTLF+LKLRCSGVHPDDHPIKSELERLSLYQ+KLERFIG
Subjt:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG

Query:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG
        LSKAPL+ STTLNYQAATRFIEHSLPDLTQEQKQ MRDISRGKGPKMKQ+ER VQKKRKYQSSEKQSVQ AAKEFLEKAARELLG+NNG
Subjt:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG

XP_022981849.1 nuclear nucleic acid-binding protein C1D-like [Cucurbita maxima]; e value: 9.7e-84; % identity: 89.95
Query:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG
        GVRESAVVPE  MD+VK TLD+VE+V+THLISFLSIAEP VLAQM+PLQRAQSMLLL+R TTTLF+LKLRCSGVHPDDHPIKSELERLSLYQ+KLERFIG
Subjt:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG

Query:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG
        LSKAPL+ STTLNYQAATRFIEHSLPDLTQEQKQ MRDISRGKGPKMKQLER VQKKRKYQSSEKQSVQ AAKEFLEKAARELLG+NNG
Subjt:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG

XP_023524382.1 nuclear nucleic acid-binding protein C1D-like [Cucurbita pepo subsp. pepo]; e value: 4.4e-84; % identity: 89.95
Query:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG
        GVRESAVVPE VMD+VK TLDNVE+V+THLISFLSIAEP VLAQM+PLQRAQSMLLL+R TTTLF+LKLRCSGVHPDDHPIKSELERLSLYQ+KLERFIG
Subjt:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG

Query:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG
        LSKAPL+ STTLNYQAATRFIEHSLPDL+QEQKQ MRDISRGKGPKMKQ+ER VQKKRKYQSSEKQSVQ AAKEFLEKAARELLG+NNG
Subjt:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L118 Nuclear nucleic acid-binding protein C1D; e value: 1.0e-83; % identity: 91.53
Query:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG
        GVR+ AVVPE VMD+VK TLDNVE+V+THLISFLSIAEP VLAQMQPLQRAQSMLLLARVTTTLF+LKLRCSGVH DDHPIKSELERLSLYQDKLERFIG
Subjt:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG

Query:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG
        LSKAPL+ STTLNYQAATRFIEHSLPDLTQEQK SMRDISRGKG KMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLG+ NG
Subjt:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG

A0A5D3D8Y3 Nuclear nucleic acid-binding protein C1D; e value: 5.2e-83; % identity: 89.95
Query:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG
        GVR+ AVVPE VMD+VK TLDNVE V+THLISFLSIAEP+VLAQMQPLQRAQSMLLLARVTTTLF+LKLRC+GVH DDHPIKSELERLSLY+DKLERFIG
Subjt:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG

Query:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG
        LSK PL+ STTLNYQAATRFIEHSLPDLTQEQK SMRDISRGKG KMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLG++NG
Subjt:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG

A0A6J1CLD8 Nuclear nucleic acid-binding protein C1D; e value: 4.1e-88; % identity: 92.59
Query:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG
        GVRESAVVPES MD+VKKTLDNVEEVRTHLISFLSI+EP+VLAQMQPLQRAQSMLLLAR+TTTLFSLK+RCSGVHPDDHP+KSELERLSLYQ+KLERFIG
Subjt:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG

Query:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG
        LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQK+SMRDISRGKGPK+K LERN QKKRKYQSSEKQSVQTAAK+FL KAARELLGENNG
Subjt:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG

A0A6J1FIR0 Nuclear nucleic acid-binding protein C1D; e value: 9.5e-85; % identity: 90.48
Query:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG
        GVRESAVVPE VMD+VK TLDNVE+V+THLISFLSIAEP VLAQM+PLQRAQSMLLL+R TTTLF+LKLRCSGVHPDDHPIKSELERLSLYQ+KLERFIG
Subjt:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG

Query:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG
        LSKAPL+ STTLNYQAATRFIEHSLPDLTQEQKQ MRDISRGKGPKMKQ+ER VQKKRKYQSSEKQSVQ AAKEFLEKAARELLG+NNG
Subjt:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG

A0A6J1J0U0 Nuclear nucleic acid-binding protein C1D; e value: 4.7e-84; % identity: 89.95
Query:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG
        GVRESAVVPE  MD+VK TLD+VE+V+THLISFLSIAEP VLAQM+PLQRAQSMLLL+R TTTLF+LKLRCSGVHPDDHPIKSELERLSLYQ+KLERFIG
Subjt:  GVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIG

Query:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG
        LSKAPL+ STTLNYQAATRFIEHSLPDLTQEQKQ MRDISRGKGPKMKQLER VQKKRKYQSSEKQSVQ AAKEFLEKAARELLG+NNG
Subjt:  LSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNG

SwissProt top hits (e value, % identity, alignment)
Q32PE4 Nuclear nucleic acid-binding protein C1D; e value: 1.3e-06; % identity: 25.69
Query:  EGVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFI
        EG+ E    P  + + +    +++  V   L + +S++  ++L ++ PL++A+  L+ A    ++F + L   GV+P +HP+K ELER+ +Y ++++   
Subjt:  EGVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFI

Query:  GLSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGK
           KA       L+  AA+RF++++L +   + K + +  ++GK
Subjt:  GLSKAPLRPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGK

Q3KPR1 Nuclear nucleic acid-binding protein C1D; e value: 1.1e-08; % identity: 29.84
Query:  PESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIGLSKAPLRP
        P  + + +    ++V  V   L   +S++  ++L +++PL++A+  L+ A    +LF + L   G++P +HP+K ELER+  Y ++++      KA    
Subjt:  PESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIGLSKAPLRP

Query:  STTLNYQAATRFIEHSLPDLTQEQ
           L+  AA RFI+H+L D T E+
Subjt:  STTLNYQAATRFIEHSLPDLTQEQ

Q5XJ97 Nuclear nucleic acid-binding protein C1D; e value: 3.4e-07; % identity: 27.34
Query:  PESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIGLSKAPLRP
        P  + D +     ++  V+  + + +S++    L ++ PL++A+  L+ A    ++F + L   GV+P DHPIK ELER+  Y +K++      KA    
Subjt:  PESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIGLSKAPLRP

Query:  STTLNYQAATRFIEHSLPDL-TQEQKQSMRDISRGKGPK
           ++ +AA+RF+ ++L D   +++K       +GK  K
Subjt:  STTLNYQAATRFIEHSLPDL-TQEQKQSMRDISRGKGPK

Q5ZHS3 Nuclear nucleic acid-binding protein C1D; e value: 5.3e-08; % identity: 29.41
Query:  KKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIGLSKAPLRPSTTLNYQA
        +K+L +V+E+   L + +S++  ++L +++PL++A+  L+ A    ++F + L   G++P +HP+K ELER+  Y +K++      KA     + L+  A
Subjt:  KKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIGLSKAPLRPSTTLNYQA

Query:  ATRFIEHSLPDLTQEQKQS
        A+RF+ ++L +   E  Q+
Subjt:  ATRFIEHSLPDLTQEQKQS

Q7TSU0 Nuclear nucleic acid-binding protein C1D; e value: 5.8e-07; % identity: 27.48
Query:  MDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIGLSKAPLRPSTTL
        + A++ +L  V+++   L + +S++  ++L ++ PL++A+  L+ A    ++F + L   GV+P +HP+K ELER+ +Y ++++      KA       L
Subjt:  MDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIGLSKAPLRPSTTL

Query:  NYQAATRFIEHSLPDLTQEQKQSMRDISRGK
        +  AA+RF++++L +   +QK +    ++GK
Subjt:  NYQAATRFIEHSLPDLTQEQKQSMRDISRGK

Arabidopsis top hits (e value, % identity, alignment)
AT4G31830.1 unknown protein; e value: 7.0e-32; % identity: 70
Query:  MGSNEDWRKNADTHKMRLEDVKAAGVEASKRPPGHHPGTVLHQRRSLPYSITTMTVAGLVIIGAIGYLTLYALKKPEASAKDVAKVATNVAEPEDTKPRK
        M   EDWRK ADT KM  E VKAAGVE+SKRPPG +PG VLHQRR+LPYS TTM +AGL I GAI Y  +YA KKPEA+A DVAK AT  A+PEDT PRK
Subjt:  MGSNEDWRKNADTHKMRLEDVKAAGVEASKRPPGHHPGTVLHQRRSLPYSITTMTVAGLVIIGAIGYLTLYALKKPEASAKDVAKVATNVAEPEDTKPRK

AT5G25080.1 Sas10/Utp3/C1D family; e value: 8.5e-54; % identity: 59.67
Query:  VVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIGLSKAPL
        VVPES ++AV +TL  ++E++  L   L++AEP+VLA MQPLQRA++M LLA  TTTL+ L+LRC+GV PDDH +KSE+ER+++Y++K ++ +  SK PL
Subjt:  VVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIGLSKAPL

Query:  RPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGEN
        RP+T LN QAATRFIEHSLPDLT  QKQS+RD+S+G+  +++  E +  +KRKYQS+EKQSVQ+AAK+FLEKAARE++G N
Subjt:  RPSTTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGEN


Sequences
CDS sequence
ATGGAGGGTGTTCGAGAATCCGCAGTGGTGCCCGAGTCAGTGATGGATGCAGTGAAGAAGACGTTGGACAATGTAGAAGAAGTGCGAACTCACCTCATTTCCTTCCTGTC
CATCGCCGAACCCCAGGTCCTTGCTCAAATGCAGCCTCTCCAACGAGCTCAGTCTATGCTTTTACTTGCTAGAGTCACTACCACTCTCTTCTCATTGAAGTTGAGGTGTA
GCGGAGTTCACCCTGATGATCATCCTATCAAGTCAGAGCTTGAGAGATTAAGCTTGTATCAAGACAAACTAGAACGGTTCATAGGGTTGAGTAAAGCACCATTAAGGCCT
TCTACTACCTTGAACTATCAAGCAGCTACTCGCTTTATTGAACATTCTCTGCCTGATCTCACTCAAGAGCAAAAGCAGAGTATGAGGGATATTAGTAGGGGAAAGGGGCC
AAAAATGAAGCAATTAGAGAGGAACGTACAAAAGAAGAGGAAGTATCAGTCTTCTGAAAAACAATCTGTTCAAACTGCTGCCAAGGAATTTCTTGAGAAAGCTGCACGTG
AGCTTCTTGGTGAAAATAATGGAGCCACCGCCACCATCTCATTAGAATTTAAAACTACGGCACAGCTGCAGACTAGCGGAGCCACCGCTATCGGATCGGAGCGCGGACTT
CGCCAGAGACGGCGCTGGAGAAATATCATCTCTGTTTCTTGTGCTGCTCACAATGACTCCGCCCCACTGATGAAAGCTCAAGAATTTGAACGAACAATTCTCCAGCAGCT
GAGGAATTTAGAGAGAGAGAGAGAGAGAGAGATGGGAAGCAATGAGGACTGGAGGAAAAATGCAGATACTCACAAAATGAGGCTGGAGGACGTAAAAGCAGCGGGAGTGG
AAGCGTCCAAGAGGCCTCCGGGGCACCACCCAGGCACAGTCTTACACCAGCGGCGGAGCCTCCCCTACAGCATCACCACCATGACCGTCGCCGGCCTCGTCATCATCGGA
GCCATCGGATACCTCACCTTATATGCCTTGAAAAAACCCGAAGCCTCCGCCAAAGATGTCGCCAAGGTCGCTACCAACGTCGCAGAACCTGAAGACACCAAGCCCAGGAA
AGACCAGCTTCGTGATTTCAAGCTTCGTGAAGAAGTAGAAGGTGGAATAGAGGAACAGGTAATCTTCGCTGCACAACTGGAAGTAGCAGAGCACGATGGTGATTTCGGCG
CAGGTGATGAGGAGGATGATGAAAACCAAGAAAAGGAAACCAAAAATGTAGTAGAACTGCTGCCCACGAAAACAAGGGGGACTGAGATTCCAAACCATAAGAAGACTAGA
GCAAACATTGTCCCAAAGGGCACGGCTCCGGATGATTTCTGTCCCCAAATTAGAGCATTCAAAACGAAGAATATCGCGAATACGGTGGCAGGAAACGTGACTGCAGTGTT
CAGGGCAATCCTCTTCCATTCTGTACCCTTGAACATTTTGTACAGACGGGTCGAGGCAAAACCAGCAAAAAGACCCATGAAGACCCAGAGCAAGAGCATGGCCGTCATAA
GTCCACCTCTGTTTGAAGGGGAGAGGAACCCAAGGATGGCGAACATCATTGTTACAACTACCATTCCCAAGAACTGA
mRNA sequence
ATGGAGGGTGTTCGAGAATCCGCAGTGGTGCCCGAGTCAGTGATGGATGCAGTGAAGAAGACGTTGGACAATGTAGAAGAAGTGCGAACTCACCTCATTTCCTTCCTGTC
CATCGCCGAACCCCAGGTCCTTGCTCAAATGCAGCCTCTCCAACGAGCTCAGTCTATGCTTTTACTTGCTAGAGTCACTACCACTCTCTTCTCATTGAAGTTGAGGTGTA
GCGGAGTTCACCCTGATGATCATCCTATCAAGTCAGAGCTTGAGAGATTAAGCTTGTATCAAGACAAACTAGAACGGTTCATAGGGTTGAGTAAAGCACCATTAAGGCCT
TCTACTACCTTGAACTATCAAGCAGCTACTCGCTTTATTGAACATTCTCTGCCTGATCTCACTCAAGAGCAAAAGCAGAGTATGAGGGATATTAGTAGGGGAAAGGGGCC
AAAAATGAAGCAATTAGAGAGGAACGTACAAAAGAAGAGGAAGTATCAGTCTTCTGAAAAACAATCTGTTCAAACTGCTGCCAAGGAATTTCTTGAGAAAGCTGCACGTG
AGCTTCTTGGTGAAAATAATGGAGCCACCGCCACCATCTCATTAGAATTTAAAACTACGGCACAGCTGCAGACTAGCGGAGCCACCGCTATCGGATCGGAGCGCGGACTT
CGCCAGAGACGGCGCTGGAGAAATATCATCTCTGTTTCTTGTGCTGCTCACAATGACTCCGCCCCACTGATGAAAGCTCAAGAATTTGAACGAACAATTCTCCAGCAGCT
GAGGAATTTAGAGAGAGAGAGAGAGAGAGAGATGGGAAGCAATGAGGACTGGAGGAAAAATGCAGATACTCACAAAATGAGGCTGGAGGACGTAAAAGCAGCGGGAGTGG
AAGCGTCCAAGAGGCCTCCGGGGCACCACCCAGGCACAGTCTTACACCAGCGGCGGAGCCTCCCCTACAGCATCACCACCATGACCGTCGCCGGCCTCGTCATCATCGGA
GCCATCGGATACCTCACCTTATATGCCTTGAAAAAACCCGAAGCCTCCGCCAAAGATGTCGCCAAGGTCGCTACCAACGTCGCAGAACCTGAAGACACCAAGCCCAGGAA
AGACCAGCTTCGTGATTTCAAGCTTCGTGAAGAAGTAGAAGGTGGAATAGAGGAACAGGTAATCTTCGCTGCACAACTGGAAGTAGCAGAGCACGATGGTGATTTCGGCG
CAGGTGATGAGGAGGATGATGAAAACCAAGAAAAGGAAACCAAAAATGTAGTAGAACTGCTGCCCACGAAAACAAGGGGGACTGAGATTCCAAACCATAAGAAGACTAGA
GCAAACATTGTCCCAAAGGGCACGGCTCCGGATGATTTCTGTCCCCAAATTAGAGCATTCAAAACGAAGAATATCGCGAATACGGTGGCAGGAAACGTGACTGCAGTGTT
CAGGGCAATCCTCTTCCATTCTGTACCCTTGAACATTTTGTACAGACGGGTCGAGGCAAAACCAGCAAAAAGACCCATGAAGACCCAGAGCAAGAGCATGGCCGTCATAA
GTCCACCTCTGTTTGAAGGGGAGAGGAACCCAAGGATGGCGAACATCATTGTTACAACTACCATTCCCAAGAACTGA
Protein sequence
MEGVRESAVVPESVMDAVKKTLDNVEEVRTHLISFLSIAEPQVLAQMQPLQRAQSMLLLARVTTTLFSLKLRCSGVHPDDHPIKSELERLSLYQDKLERFIGLSKAPLRP
STTLNYQAATRFIEHSLPDLTQEQKQSMRDISRGKGPKMKQLERNVQKKRKYQSSEKQSVQTAAKEFLEKAARELLGENNGATATISLEFKTTAQLQTSGATAIGSERGL
RQRRRWRNIISVSCAAHNDSAPLMKAQEFERTILQQLRNLEREREREMGSNEDWRKNADTHKMRLEDVKAAGVEASKRPPGHHPGTVLHQRRSLPYSITTMTVAGLVIIG
AIGYLTLYALKKPEASAKDVAKVATNVAEPEDTKPRKDQLRDFKLREEVEGGIEEQVIFAAQLEVAEHDGDFGAGDEEDDENQEKETKNVVELLPTKTRGTEIPNHKKTR
ANIVPKGTAPDDFCPQIRAFKTKNIANTVAGNVTAVFRAILFHSVPLNILYRRVEAKPAKRPMKTQSKSMAVISPPLFEGERNPRMANIIVTTTIPKN
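
The relationship between the CDS and protein sequences above can be checked with a short translation routine. This is a minimal sketch that assumes the standard genetic code and a CDS that starts in frame; `translate` is a hypothetical helper, not a function provided by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order ('*' marks a stop).
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the multi-line sequence
    can be pasted in as-is.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if codon not in CODON_TABLE:
            raise ValueError(f"unrecognised codon: {codon}")
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First seven codons of the CDS sequence listed above.
    print(translate("ATGGAGGGTGTTCGAGAATCC"))
```

Applied to the first seven codons of the CDS, this yields `MEGVRES`, matching the start of the protein sequence. The full CDS can be passed the same way for a complete comparison.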