; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0007884 (gene) of Chayote v1 genome

Gene IDSed0007884
OrganismSechium edule (Chayote v1)
DescriptionRibonuclease H
Genome locationLG05:11993923..11995859
RNA-Seq ExpressionSed0007884
SyntenySed0007884
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]9.6e-2931.21Show/hide
Query:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIW-NLLNIGVDA
        SG+K++M  +  A+ A+ + +   WN I KL VPTK+KIF+WR  H  I     L  R +   P    CG   ES+ H FF+CKRAR+IW  L       
Subjt:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIW-NLLNIGVDA

Query:  KILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNI
             ISF  +           ++   AIT W IWNDRN +  G+ +  V+ K +W+  +L+    A +       +     ++     + YW P     
Subjt:  KILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNI

Query:  LKLNVDVKVPSC-GKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKN
        LKLN D    +C G     G I+R+       A  +     +  L AE   + EGLK A   +   +E+ESDS + +  I+N
Subjt:  LKLNVDVKVPSC-GKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKN

XP_030483481.1 uncharacterized protein LOC115700065 [Cannabis sativa]2.4e-2728.72Show/hide
Query:  WWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAKILSKISFPFVLDDFARVKSTAEI
        WW    KL++P K+KIF WR++H  +     L +RH+        C +  ES+ H  F CK A+ +W       D     +++    L   + + S  E+
Subjt:  WWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAKILSKISFPFVLDDFARVKSTAEI

Query:  ETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLK-KPLNGVDECLELENVEGASL--SYWIPPPPNILKLNVDVKVPSCGKKLGGGAI
        E+I   LW+IW +RN++  G+   S  +   +   YL   H A  K +P     E    +N   A +  S W PP P  LK+NVD  + +   ++G GA+
Subjt:  ETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLK-KPLNGVDECLELENVEGASL--SYWIPPPPNILKLNVDVKVPSCGKKLGGGAI

Query:  LRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLKLF
        +RN  G    A       N      EA A+F  L  A  + +   +IE+D+ ++ +A+  R    S F   + ++  LL  F
Subjt:  LRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLKLF

XP_030495196.1 uncharacterized protein LOC115710989 [Cannabis sativa]3.1e-2726.82Show/hide
Query:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAK
        SG+ +      +   ++   +  WW     L++P K+KIF WR+ H  +     L +R V        C +  ES+ H FF C  A+ +W L++   D K
Subjt:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAK

Query:  ILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNIL
            +     L   A   + AE E I  TLW IW++RN+V   +   S      +   YL+    A LK          ++ +   A  + W PPP   L
Subjt:  ILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNIL

Query:  KLNVDVKVPSCGKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLK
        KLN+D        K+G GA++RN++G    A  +    N      EA A+F G+       ++   +E+D+ ++ +A+K+       F   + +I  LL 
Subjt:  KLNVDVKVPSCGKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLK

Query:  LF
         F
Subjt:  LF

XP_030502765.1 uncharacterized protein LOC115717936 [Cannabis sativa]1.3e-2825.9Show/hide
Query:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAK
        SG+++ +    +   A+      WW+   K+K+P KV+IF+W+++H  +     L +RH+   P+   C    ES++H  F+C RA+ +W+L  + +D  
Subjt:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAK

Query:  ILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSY---WIPPPP
         L + +   +L   + V +T+E E   +  W  W++RN +  G S+ S      +   YL E   A  K+           +    +  ++   W  PP 
Subjt:  ILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSY---WIPPPP

Query:  NILKLNVDVKVPSCGKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYS
          LKLN +  +     K+G GA LRN  G    A       N      EA  L   L    + +++   IE+DS ++V  + + +   S F   L  I  
Subjt:  NILKLNVDVKVPSCGKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYS

Query:  LLKLF
        L+  F
Subjt:  LLKLF

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]3.1e-2728.37Show/hide
Query:  WWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAKILSKISFPFVLDDFARVKSTAEI
        WW+   KLK+P KV+IF+W+++H  +     L +RH+   P+   C    E++ H  F+C RA+ +W L N  +D + + + S    L   +   S++E+
Subjt:  WWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAKILSKISFPFVLDDFARVKSTAEI

Query:  ETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLK--KPLNGVDECL-ELENVEGASLSYWIPPPPNILKLNVDVKVPSCGKKLGGGAI
        E   +  W+IW++RN +  G S+ +      +   YL E   A  K  KP+           + E      W  PP   LKLN D  +      +G GA+
Subjt:  ETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLK--KPLNGVDECL-ELENVEGASLSYWIPPPPNILKLNVDVKVPSCGKKLGGGAI

Query:  LRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLKLF
        LRN  G+   A       N      EA  L   L    + +++   IE+DS ++V  +K      S F   L  I  L+  F
Subjt:  LRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLKLF

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248744.6e-2931.21Show/hide
Query:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIW-NLLNIGVDA
        SG+K++M  +  A+ A+ + +   WN I KL VPTK+KIF+WR  H  I     L  R +   P    CG   ES+ H FF+CKRAR+IW  L       
Subjt:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIW-NLLNIGVDA

Query:  KILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNI
             ISF  +           ++   AIT W IWNDRN +  G+ +  V+ K +W+  +L+    A +       +     ++     + YW P     
Subjt:  KILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNI

Query:  LKLNVDVKVPSC-GKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKN
        LKLN D    +C G     G I+R+       A  +     +  L AE   + EGLK A   +   +E+ESDS + +  I+N
Subjt:  LKLNVDVKVPSC-GKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKN

A0A803P5E3 Uncharacterized protein1.0e-2828.01Show/hide
Query:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAK
        SG+K+ +    +    +      WW     L +P+KV+IFLWR     +     L  RH++       C ++ +++ H  F+CKR R+ W L N  +D+ 
Subjt:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAK

Query:  ILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNIL
        + + +S   ++   + + ST ++E  A  LW++W  RNK   G     +++ +     YLEE H A + K L    +   L++ + +    W+ PP   L
Subjt:  ILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNIL

Query:  KLNVDVKVPSCGKKLGGGAILRNFLGLCCGAKC-----VFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEI
        KLN +  V    +  G GAIL+N  G C  A        F+ + I+VL     AL   L+   ++ +    IESDS V+++ ++++    S F   L +I
Subjt:  KLNVDVKVPSCGKKLGGGAILRNFLGLCCGAKC-----VFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEI

Query:  YSLLKLF
          LL +F
Subjt:  YSLLKLF

A0A803P623 Uncharacterized protein1.5e-2726.82Show/hide
Query:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAK
        SG+ +      +   ++   +  WW     L++P K+KIF WR+ H  +     L +R V        C +  ES+ H FF C  A+ +W L++   D K
Subjt:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAK

Query:  ILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNIL
            +     L   A   + AE E I  TLW IW++RN+V   +   S      +   YL+    A LK          ++ +   A  + W PPP   L
Subjt:  ILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNIL

Query:  KLNVDVKVPSCGKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLK
        KLN+D        K+G GA++RN++G    A  +    N      EA A+F G+       ++   +E+D+ ++ +A+K+       F   + +I  LL 
Subjt:  KLNVDVKVPSCGKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLK

Query:  LF
         F
Subjt:  LF

A0A803P9P5 Uncharacterized protein6.1e-2929.14Show/hide
Query:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAK
        SG+ +      +   A+ +    WW     L +P+K+KIFLWR  H  +    IL  RH++       C +  ES +H  F CKR R+IW L +  +   
Subjt:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAK

Query:  ILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNIL
        +   +S   VL + +++ S+ +IE  A  LW+IWN+RNK   G      ++ + +   Y+EE   A L       D               W+ PP   L
Subjt:  ILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNIL

Query:  KLNVDVKVPSCGKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLK
        KLN D  V +     G GAILRN  G    A  +           EA AL   L+   +  +    IE+DS ++V  ++  + H S F   L  I  L+ 
Subjt:  KLNVDVKVPSCGKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLK

Query:  LF
         F
Subjt:  LF

A0A803Q2K8 Uncharacterized protein1.5e-2726.47Show/hide
Query:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAK
        SG+++ +    +   A+      WW+   K+K+P KV+IF+W+++H  +     L +RH+   P    C    ES++H  F+C RA+ +W L ++ +D  
Subjt:  SGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAK

Query:  ILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKK----PLNGVDECLELENVEGASLSYWIPPP
         L + +   +L   +   ST+E E   +  W  W++RN +  G ++ S      +   YL E   A  K+    P++ +       + E      W  PP
Subjt:  ILSKISFPFVLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKK----PLNGVDECLELENVEGASLSYWIPPP

Query:  PNILKLNVDVKVPSCGKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIY
           LKLN D  +     K+G GA LRN  G    A       N      EA  L   L    + +++   IE+DS ++V  + + + + S F   L  I 
Subjt:  PNILKLNVDVKVPSCGKKLGGGAILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIY

Query:  SLLKLF
         L+  F
Subjt:  SLLKLF

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657502.2e-1224.91Show/hide
Query:  RDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIW-NLLNIGVDAKILSKISFPFVLDDFAR
        R     ++N + K++VP +VK FLW + +  ++      +RH++       C    ES+ H   +C     IW  ++         SK  F ++ D+   
Subjt:  RDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIW-NLLNIGVDAKILSKISFPFVLDDFAR

Query:  VKSTAEI---ETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNILKLNVDVKVPSCGK
             +I      A+ +W  W  R     GE+    D +V+++  +  E++ A     L G+ +   +E + G     W+ P    +K+N D        
Subjt:  VKSTAEI---ETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNILKLNVDVKVPSCGK

Query:  KLGGGAILRNFLGLCCGAKCVFRAQNIDVLS---AEAWALFEGLKLACNMSVATIEIESDSKVLVDAIK
            G +LR+    C GA C   + NI   S   AE W ++ GL  A    V  +E+E DS+V+V  +K
Subjt:  KLGGGAILRNFLGLCCGAKCVFRAQNIDVLS---AEAWALFEGLKLACNMSVATIEIESDSKVLVDAIK

Arabidopsis top hitse value%identityAlignment
AT1G10000.1 Ribonuclease H-like superfamily protein1.1e-0922.6Show/hide
Query:  KVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAKILSKISFPFVLDDFARVKSTAEIETIAI-------
        K+K+FLW+   G +     L +RH++       CG   E+ +H  F+C  A ++WNL  + +       I  P +L+    +K T  +  + I       
Subjt:  KVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAKILSKISFPFVLDDFARVKSTAEIETIAI-------

Query:  -TLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGA--SLSYWIPPPPNILKLN------VDVKVPSCGKKLGGGA
           W IW  RN++                    +  H +V++     V + L  ++ + A   +    P P +   L       VD          G G 
Subjt:  -TLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGA--SLSYWIPPPPNILKLN------VDVKVPSCGKKLGGGA

Query:  IL-------RNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLKLFKA
        +        +       G +     +    L+AEAWA+   +  A  +  + + + SDSK +VDA+ +    N IFG+ L EI S+   F++
Subjt:  IL-------RNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLKLFKA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGTCATAAGGTGTTTATGGACAGAAGAGTTGAAGCTTCTGGAGCAAATAGAGATGAAAAGTGTCTTTGGTGGAATATTATTTGCAAGCTTAAAGTGCCTACTAA
GGTGAAAATTTTCCTTTGGAGATTGTATCATGGTTTTATTCTAGCCAATTTTATTCTCCAGCAGAGACATGTTAATGTGCAGCCATGGTCTGTGCATTGTGGTAAAGACC
ATGAATCTTTAAGTCATACCTTTTTCAATTGCAAGAGAGCTAGGAGAATCTGGAATCTGTTGAATATTGGAGTTGATGCTAAAATCCTTTCAAAGATTAGTTTCCCCTTT
GTTCTTGATGATTTCGCTCGTGTGAAATCCACTGCAGAGATTGAAACTATTGCGATTACTTTATGGGCAATTTGGAATGACAGAAACAAGGTTAATTGTGGAGAATCGAT
TCCATCTGTTGATATTAAAGTCCAATGGATTTGGGGTTACCTGGAGGAATTGCATTGTGCAGTTTTGAAGAAGCCTCTTAATGGTGTAGATGAATGTCTAGAGCTCGAAA
ATGTTGAGGGAGCTTCTTTATCCTATTGGATCCCTCCTCCCCCAAATATTTTGAAGCTTAATGTTGATGTTAAGGTGCCAAGTTGTGGTAAGAAGCTCGGAGGTGGGGCT
ATTTTGCGGAACTTTCTGGGTTTATGTTGTGGAGCTAAGTGTGTGTTTCGTGCCCAAAATATTGATGTTTTATCTGCTGAAGCTTGGGCTTTGTTTGAAGGTCTAAAGTT
AGCCTGCAATATGAGTGTTGCTACCATTGAAATCGAATCCGATTCAAAGGTTTTGGTGGATGCTATAAAGAATAGAAAGTTGCATAACTCTATTTTTGGAGTTTTCTTGG
AGGAAATTTATTCTTTGTTGAAGCTTTTTAAAGCTCAAGATCAACAACAACCAGACTCAATTTACTTACCCATCTATGGCGCCGATCCAAAGTCTTTTGTTATCACCCTC
GATTCCTTCTCACTCCTTTCCACCATCCACTGTGGACCATTCTTCCCCATTGCAGCCCCAACCTTCCTCGAAGATAATAGCTTCTTCAATCTCCCTCTATTGCTGAGGGT
TAACAATACCTTCGTCACTGCCACGATCAAAACAATAATTAAAGAAACTAACAATAGTAATAACTTAATTAAAAACATCAACTGCCTCCCTAATAATAACAATTCAAAGG
TTGAGAGCATCAATGATGGGGTTGAAGCAAATTTTCTTCGTGAGGAGTTTTCATTTGGTGAATGGTACTTGGAGCCTGGAGGATTTGATGAAAGACATTCCTTCTTCTTT
CCCCTGCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGTCATAAGGTGTTTATGGACAGAAGAGTTGAAGCTTCTGGAGCAAATAGAGATGAAAAGTGTCTTTGGTGGAATATTATTTGCAAGCTTAAAGTGCCTACTAA
GGTGAAAATTTTCCTTTGGAGATTGTATCATGGTTTTATTCTAGCCAATTTTATTCTCCAGCAGAGACATGTTAATGTGCAGCCATGGTCTGTGCATTGTGGTAAAGACC
ATGAATCTTTAAGTCATACCTTTTTCAATTGCAAGAGAGCTAGGAGAATCTGGAATCTGTTGAATATTGGAGTTGATGCTAAAATCCTTTCAAAGATTAGTTTCCCCTTT
GTTCTTGATGATTTCGCTCGTGTGAAATCCACTGCAGAGATTGAAACTATTGCGATTACTTTATGGGCAATTTGGAATGACAGAAACAAGGTTAATTGTGGAGAATCGAT
TCCATCTGTTGATATTAAAGTCCAATGGATTTGGGGTTACCTGGAGGAATTGCATTGTGCAGTTTTGAAGAAGCCTCTTAATGGTGTAGATGAATGTCTAGAGCTCGAAA
ATGTTGAGGGAGCTTCTTTATCCTATTGGATCCCTCCTCCCCCAAATATTTTGAAGCTTAATGTTGATGTTAAGGTGCCAAGTTGTGGTAAGAAGCTCGGAGGTGGGGCT
ATTTTGCGGAACTTTCTGGGTTTATGTTGTGGAGCTAAGTGTGTGTTTCGTGCCCAAAATATTGATGTTTTATCTGCTGAAGCTTGGGCTTTGTTTGAAGGTCTAAAGTT
AGCCTGCAATATGAGTGTTGCTACCATTGAAATCGAATCCGATTCAAAGGTTTTGGTGGATGCTATAAAGAATAGAAAGTTGCATAACTCTATTTTTGGAGTTTTCTTGG
AGGAAATTTATTCTTTGTTGAAGCTTTTTAAAGCTCAAGATCAACAACAACCAGACTCAATTTACTTACCCATCTATGGCGCCGATCCAAAGTCTTTTGTTATCACCCTC
GATTCCTTCTCACTCCTTTCCACCATCCACTGTGGACCATTCTTCCCCATTGCAGCCCCAACCTTCCTCGAAGATAATAGCTTCTTCAATCTCCCTCTATTGCTGAGGGT
TAACAATACCTTCGTCACTGCCACGATCAAAACAATAATTAAAGAAACTAACAATAGTAATAACTTAATTAAAAACATCAACTGCCTCCCTAATAATAACAATTCAAAGG
TTGAGAGCATCAATGATGGGGTTGAAGCAAATTTTCTTCGTGAGGAGTTTTCATTTGGTGAATGGTACTTGGAGCCTGGAGGATTTGATGAAAGACATTCCTTCTTCTTT
CCCCTGCATTAA
Protein sequenceShow/hide protein sequence
MSGHKVFMDRRVEASGANRDEKCLWWNIICKLKVPTKVKIFLWRLYHGFILANFILQQRHVNVQPWSVHCGKDHESLSHTFFNCKRARRIWNLLNIGVDAKILSKISFPF
VLDDFARVKSTAEIETIAITLWAIWNDRNKVNCGESIPSVDIKVQWIWGYLEELHCAVLKKPLNGVDECLELENVEGASLSYWIPPPPNILKLNVDVKVPSCGKKLGGGA
ILRNFLGLCCGAKCVFRAQNIDVLSAEAWALFEGLKLACNMSVATIEIESDSKVLVDAIKNRKLHNSIFGVFLEEIYSLLKLFKAQDQQQPDSIYLPIYGADPKSFVITL
DSFSLLSTIHCGPFFPIAAPTFLEDNSFFNLPLLLRVNNTFVTATIKTIIKETNNSNNLIKNINCLPNNNNSKVESINDGVEANFLREEFSFGEWYLEPGGFDERHSFFF
PLH