; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0010864 (gene) of Snake gourd v1 genome

Gene IDTan0010864
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCCHC-type domain-containing protein
Genome locationLG02:32639726..32643124
RNA-Seq ExpressionTan0010864
SyntenyTan0010864
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5514102.1 hypothetical protein RHGRI_035489 [Rhododendron griersonianum]1.1e-3535.08Show/hide
Query:  MEFKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGME
        +EFKG  +TW   +    +I+ERLDR + N  + +LFPYA+V H   + SDH P+++N+       L++  + F+FE +W+  EEC  +I   WN     
Subjt:  MEFKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGME

Query:  AQLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKV----NEAIQGNG--DVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHR
          ++  L QKL  C + LK W   + GN  +KI   K ++    N+    +G    K+   +LE  L  EE+Y  QRSR++W+K+GD+N  +F+   + R
Subjt:  AQLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKV----NEAIQGNG--DVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHR

Query:  RRVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTNGQQDSETLLE
        R+ N I+KIK   G WL  + DI       F  LF ++GQ+D  ++L+
Subjt:  RRVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTNGQQDSETLLE

KAG5537975.1 hypothetical protein RHGRI_025166 [Rhododendron griersonianum]9.6e-4331.62Show/hide
Query:  MEFKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGME
        +EFKG+ FTW   +    +I ER+DR + N  + + FP A+V H   + SDHCP+++N        L+R    F+FE +W+    CE II   W+   ++
Subjt:  MEFKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGME

Query:  AQLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGN-------GDVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASH
              L QKL NC + LK W +   GN ++K+   K ++   IQ +       G++     ++E LL+ EE+Y+ QRSR++WL++GDRN  +F+     
Subjt:  AQLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGN-------GDVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASH

Query:  RRRVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTNGQQDSETLLEFLNQFRE---FN--------------------QRSKG-----KQKVPSVRA
        RR+ N + ++K +NG W  S+ DI      +F  LF  NG ++ E +L  + Q R    FN                    Q S G     +Q    +  
Subjt:  RRRVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTNGQQDSETLLEFLNQFRE---FN--------------------QRSKG-----KQKVPSVRA

Query:  QPNWKPPGYPFFKINTDVALNKENQKSGIGVVIRNEKGEVMLTLAKSINGIMEIDVVEALAIRDGLYMAKEMGFRQVEVESHSAKVIQL
          +W+ P   + K N DVA+ K   K+ I VV+RN KGEV L    S+         EALA+R    +       +  +ES +  VI+L
Subjt:  QPNWKPPGYPFFKINTDVALNKENQKSGIGVVIRNEKGEVMLTLAKSINGIMEIDVVEALAIRDGLYMAKEMGFRQVEVESHSAKVIQL

XP_030479501.1 uncharacterized protein LOC115696754 [Cannabis sativa]1.1e-3534.45Show/hide
Query:  MEFKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGME
        ME +GHP+TW + R     ++ RLDR L ++ +  LFP A + +L+   SDHCPI +NL K  P  + +K   FR+E  W+ +  C  I++  W    ++
Subjt:  MEFKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGME

Query:  AQLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGNGDVKSASN------QLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHR
             +L  KL +C+  L  WG    GN+ K+I    K + + ++G  D+ S S       QL ++L+++EV+W+QRS+  WL+ GD+N K+F+ CAS R
Subjt:  AQLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGNGDVKSASN------QLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHR

Query:  RRVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTN
        +R N I ++K+++G W++ D  +      ++  LFT++
Subjt:  RRVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTN

XP_040990995.1 uncharacterized protein LOC121238231 [Juglans microcarpa x Juglans regia]1.9e-3536.07Show/hide
Query:  GHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGMEAQLK
        G+ FTW   R+ V   KERLDR  CN  + E FP  +V  L  + SD+CP++I +E G      RK  PFRFE  W L+E+ + ++E  W     E+  K
Subjt:  GHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGMEAQLK

Query:  DNLIQK-LNNCAEKLKYWGSLNIGNYEKKIN-----IAKKKVNEAIQGNGDVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHRRRVN
         N I   LN C +KL  W     GN  ++++     + + +     Q N  +K       KLL EE + W+QR++  WL+ GDRN K+F+ CAS RR+VN
Subjt:  DNLIQK-LNNCAEKLKYWGSLNIGNYEKKIN-----IAKKKVNEAIQGNGDVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHRRRVN

Query:  YIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTNGQQDSETLLE
         I K+  ++G  ++S+E I   F ++++  F+T+  ++  T LE
Subjt:  YIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTNGQQDSETLLE

XP_041025435.1 uncharacterized protein LOC121265832 [Juglans microcarpa x Juglans regia]5.1e-3640.24Show/hide
Query:  FKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGMEAQ
        + G  FTW   R      KERLDRGL N+ + +LFP   V+H   + SDH PIVIN+ K           PFR+E  WAL+E+C ++I+  W KK M A 
Subjt:  FKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGMEAQ

Query:  LKDNLIQK-LNNCAEKLKYWGSLNIGNYEKKIN---IAKKKVNEAIQGNGD--VKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHRRR
         K  L+ K L  C +KLK W   N GN ++ IN       K+ E+ +G  D   KS S ++E LL E+ + WKQR++  WL+ GDRN K+F+ CA+  R 
Subjt:  LKDNLIQK-LNNCAEKLKYWGSLNIGNYEKKIN---IAKKKVNEAIQGNGD--VKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHRRR

Query:  VNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTNGQQDSETLLE
         N I KI ++ G  L + +DI   F +FF  LF+T      E  L+
Subjt:  VNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTNGQQDSETLLE

TrEMBL top hitse value%identityAlignment
A0A7J6F7A3 Uncharacterized protein6.1e-3535.74Show/hide
Query:  FKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGMEAQ
        F+G  FTW   RN    ++ERLDR  CN  + ELFP  +V + DF++SDH PIV  LE    +    K   FRFE  W    EC+ II   W        
Subjt:  FKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGMEAQ

Query:  LKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGNG------DVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHRRR
         +D+LI    +CA++L  W     G+  +++   +K++++ +  +       +VK    +L  L   EE YWK RSR DWL  GDRN K+F+N A+ R++
Subjt:  LKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGNG------DVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHRRR

Query:  VNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTT
         N I +I +E+G  L+++EDI      +F  +F++
Subjt:  VNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTT

A0A7J6HXD3 CCHC-type domain-containing protein8.0e-3535.02Show/hide
Query:  MEFKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGME
        M F+G  FTW   R  V H++ERLDR  CN  +  LFP  +V + DFI+SDH PIV  LE         K   FRFE  W    EC +I+   W    + 
Subjt:  MEFKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGME

Query:  AQLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGNG------DVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHR
           +D+++     CA++L  W     G+  K +   +K++++ +  +       +VK    +L  LL+ EE YW+ RSR DWL  GDRN K+F+N A+ R
Subjt:  AQLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGNG------DVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHR

Query:  RRVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTT
        ++ N I +I +E+G    ++EDI      +F  +F++
Subjt:  RRVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTT

A0A7N2LLK4 Uncharacterized protein2.1e-3537.18Show/hide
Query:  FKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRK-TFPFRFEEVWALQEECENIIEIGWNKKGMEA
        F+G+ +TW   R    + + RLDR      + E FP + VNHL    SDH PI+++++    + LR K    F+FEE W L EEC  ++   W K G EA
Subjt:  FKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRK-TFPFRFEEVWALQEECENIIEIGWNKKGMEA

Query:  QLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKV---NEA---IQGNGDVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHRR
           +   QK+ +CA +L+ WG        ++I + +K+V   N A    +   D  + S +L++LL ++E+YW QRSR+ WLK GD+N+K+F++ AS RR
Subjt:  QLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKV---NEA---IQGNGDVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHRR

Query:  RVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLF
        R N+I  IK+ N  W+   EDI     ++F  LF
Subjt:  RVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLF

A0A7N2MHC9 zf-RVT domain-containing protein7.0e-3935.74Show/hide
Query:  FKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGMEAQ
        F+G+ FTW     +   ++E LD+   N+ +  LFP+A+V H+   YSDH PI+IN+   P +  R+K  P RFEE WA  E CE ++ + W+ K     
Subjt:  FKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGMEAQ

Query:  LKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNE-AIQGN----GDVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHRRRV
            L +K+  C + L  W    +GN++ KI   +  + E A+Q N      +K+  N++  LL+++EVYW+QRSR  WL  GD+N K+F+  AS R R 
Subjt:  LKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNE-AIQGN----GDVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHRRRV

Query:  NYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTNGQQDSETLLEFLNQ
        N+I+ I S+ G W + D+ I      +F+ LFT++   D   +L+ ++Q
Subjt:  NYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTNGQQDSETLLEFLNQ

A0A803NN67 Uncharacterized protein5.5e-3634.45Show/hide
Query:  MEFKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGME
        ME +GHP+TW + R     ++ RLDR L ++ +  LFP A + +L+   SDHCPI +NL K  P  + +K   FR+E  W+ +  C  I++  W    ++
Subjt:  MEFKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGME

Query:  AQLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGNGDVKSASN------QLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHR
             +L  KL +C+  L  WG    GN+ K+I    K + + ++G  D+ S S       QL ++L+++EV+W+QRS+  WL+ GD+N K+F+ CAS R
Subjt:  AQLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGNGDVKSASN------QLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHR

Query:  RRVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTN
        +R N I ++K+++G W++ D  +      ++  LFT++
Subjt:  RRVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein5.0e-0521.21Show/hide
Query:  KGHPFTW--YQTRNKVVHIKERLDRGLCNNLFWELFPYA-EVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGME
        +G  +TW  +Q  N ++    +LDR + N  ++  FP A  V  L  + SDH P +I LE  P    +R    FR+    +        + + W ++   
Subjt:  KGHPFTW--YQTRNKVVHIKERLDRGLCNNLFWELFPYA-EVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGME

Query:  AQLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGNGDVKSASNQLEKLLNEEEV--------------YWKQRSRMDWLKWGDRNLKW
             +L + L    +  K       GN + K   A   + E+IQ     +  +N  + L   E V              +++Q+SR+ WL+ GD N ++
Subjt:  AQLKDNLIQKLNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGNGDVKSASNQLEKLLNEEEV--------------YWKQRSRMDWLKWGDRNLKW

Query:  FYNCASHRRRVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTNGQ---QDSETLLEFLNQFR-------EFNQRSKGKQKVPSVRAQPNWKPPG
        F+      +  N I  ++ ++   + +   ++     ++  L  ++      DS   ++ ++ FR         +     K+   +V A P  K PG
Subjt:  FYNCASHRRRVNYIAKIKSENGSWLNSDEDIQIAFTEFFQVLFTTNGQ---QDSETLLEFLNQFR-------EFNQRSKGKQKVPSVRAQPNWKPPG

AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.4e-0726.28Show/hide
Query:  QVLFTTNGQQDSETLLEFLNQFREFNQRS--KGKQKVPSV--RAQPNWKPPGYPFFKINTDVALNKENQKSGIGVVIRNEKGEVMLTLAKSINGIMEIDV
        +++F        E L   +  F E++ R   +GK   P V       WK P Y + K NTD     EN + GIG ++RNE G V+   A+++     +  
Subjt:  QVLFTTNGQQDSETLLEFLNQFREFNQRS--KGKQKVPSV--RAQPNWKPPGYPFFKINTDVALNKENQKSGIGVVIRNEKGEVMLTLAKSINGIMEIDV

Query:  VEALAIRDGLYMAKEMGFRQVEVESHSAKVIQLLQQN
         E  A+R  +       ++++  ES +  ++ LL  +
Subjt:  VEALAIRDGLYMAKEMGFRQVEVESHSAKVIQLLQQN

AT4G29090.1 Ribonuclease H-like superfamily protein1.7e-0833.33Show/hide
Query:  WKPPGYPFFKINTDVALNKENQKSGIGVVIRNEKGEVMLTLAKSINGIMEIDVVEALAIRDGLYMAKEMGFRQVEVESHSAKVIQLLQQN
        W+PP + + K NTD   N++N++ GIG V+RNEKGEV    A+++  +  +   E  A+R  +       +  V  ES S  +I++L  +
Subjt:  WKPPGYPFFKINTDVALNKENQKSGIGVVIRNEKGEVMLTLAKSINGIMEIDVVEALAIRDGLYMAKEMGFRQVEVESHSAKVIQLLQQN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTCAAGGGACATCCTTTTACATGGTATCAAACCAGAAATAAGGTGGTCCATATAAAAGAAAGGTTAGACAGAGGCCTGTGTAATAACCTATTTTGGGAACTCTT
CCCTTATGCAGAAGTTAATCACCTTGATTTCATATATTCGGATCACTGCCCGATTGTGATTAATTTAGAAAAGGGTCCACCAAAAGCTCTAAGGAGGAAAACCTTCCCAT
TCAGATTTGAAGAGGTTTGGGCCTTACAAGAAGAATGCGAAAACATTATTGAGATTGGATGGAATAAAAAAGGCATGGAGGCTCAATTAAAGGATAACTTAATTCAAAAG
TTAAACAACTGCGCTGAGAAATTAAAATATTGGGGAAGTCTCAACATAGGAAATTATGAAAAGAAGATAAACATAGCGAAAAAAAAGGTCAATGAAGCAATTCAGGGCAA
TGGAGATGTGAAATCGGCATCCAATCAACTTGAAAAACTGCTAAATGAAGAAGAAGTTTATTGGAAGCAAAGATCTAGAATGGATTGGCTGAAATGGGGAGATCGAAACT
TAAAGTGGTTTTATAATTGTGCATCTCATCGACGAAGGGTGAATTACATTGCAAAGATCAAAAGTGAGAATGGAAGTTGGTTGAATTCAGATGAGGACATTCAGATTGCA
TTTACTGAATTTTTCCAAGTCTTGTTTACCACTAACGGTCAGCAAGATAGCGAAACACTTCTGGAATTCTTGAATCAGTTCAGGGAGTTTAATCAAAGAAGCAAAGGAAA
ACAAAAGGTACCCTCAGTTCGGGCACAACCAAATTGGAAACCTCCTGGGTACCCGTTCTTCAAAATTAACACAGATGTTGCATTAAACAAAGAAAATCAGAAATCTGGGA
TAGGCGTGGTAATCAGAAATGAAAAGGGAGAGGTGATGCTTACTCTAGCTAAATCGATCAATGGAATCATGGAAATTGATGTCGTCGAAGCTCTGGCAATCCGAGATGGG
TTGTATATGGCAAAAGAAATGGGATTCCGACAGGTCGAAGTTGAGTCACATTCGGCCAAAGTCATCCAACTCCTACAACAAAATTGTCAAAACTTATCAGATTTGGGTTA
G
mRNA sequenceShow/hide mRNA sequence
ATGGAATTCAAGGGACATCCTTTTACATGGTATCAAACCAGAAATAAGGTGGTCCATATAAAAGAAAGGTTAGACAGAGGCCTGTGTAATAACCTATTTTGGGAACTCTT
CCCTTATGCAGAAGTTAATCACCTTGATTTCATATATTCGGATCACTGCCCGATTGTGATTAATTTAGAAAAGGGTCCACCAAAAGCTCTAAGGAGGAAAACCTTCCCAT
TCAGATTTGAAGAGGTTTGGGCCTTACAAGAAGAATGCGAAAACATTATTGAGATTGGATGGAATAAAAAAGGCATGGAGGCTCAATTAAAGGATAACTTAATTCAAAAG
TTAAACAACTGCGCTGAGAAATTAAAATATTGGGGAAGTCTCAACATAGGAAATTATGAAAAGAAGATAAACATAGCGAAAAAAAAGGTCAATGAAGCAATTCAGGGCAA
TGGAGATGTGAAATCGGCATCCAATCAACTTGAAAAACTGCTAAATGAAGAAGAAGTTTATTGGAAGCAAAGATCTAGAATGGATTGGCTGAAATGGGGAGATCGAAACT
TAAAGTGGTTTTATAATTGTGCATCTCATCGACGAAGGGTGAATTACATTGCAAAGATCAAAAGTGAGAATGGAAGTTGGTTGAATTCAGATGAGGACATTCAGATTGCA
TTTACTGAATTTTTCCAAGTCTTGTTTACCACTAACGGTCAGCAAGATAGCGAAACACTTCTGGAATTCTTGAATCAGTTCAGGGAGTTTAATCAAAGAAGCAAAGGAAA
ACAAAAGGTACCCTCAGTTCGGGCACAACCAAATTGGAAACCTCCTGGGTACCCGTTCTTCAAAATTAACACAGATGTTGCATTAAACAAAGAAAATCAGAAATCTGGGA
TAGGCGTGGTAATCAGAAATGAAAAGGGAGAGGTGATGCTTACTCTAGCTAAATCGATCAATGGAATCATGGAAATTGATGTCGTCGAAGCTCTGGCAATCCGAGATGGG
TTGTATATGGCAAAAGAAATGGGATTCCGACAGGTCGAAGTTGAGTCACATTCGGCCAAAGTCATCCAACTCCTACAACAAAATTGTCAAAACTTATCAGATTTGGGTTA
G
Protein sequenceShow/hide protein sequence
MEFKGHPFTWYQTRNKVVHIKERLDRGLCNNLFWELFPYAEVNHLDFIYSDHCPIVINLEKGPPKALRRKTFPFRFEEVWALQEECENIIEIGWNKKGMEAQLKDNLIQK
LNNCAEKLKYWGSLNIGNYEKKINIAKKKVNEAIQGNGDVKSASNQLEKLLNEEEVYWKQRSRMDWLKWGDRNLKWFYNCASHRRRVNYIAKIKSENGSWLNSDEDIQIA
FTEFFQVLFTTNGQQDSETLLEFLNQFREFNQRSKGKQKVPSVRAQPNWKPPGYPFFKINTDVALNKENQKSGIGVVIRNEKGEVMLTLAKSINGIMEIDVVEALAIRDG
LYMAKEMGFRQVEVESHSAKVIQLLQQNCQNLSDLG