; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0026194 (gene) of Chayote v1 genome

Gene IDSed0026194
OrganismSechium edule (Chayote v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG02:32365663..32370277
RNA-Seq ExpressionSed0026194
SyntenySed0026194
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4381998.1 hypothetical protein G4B88_006630 [Cannabis sativa]5.5e-3130.45Show/hide
Query:  RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRAL
        R TGFYG+P ++ R  SW LL RL  +++ PW+  GDF EIL  +EK GGS +S + M +F+ A+  C L D+GF+   +TW         + ERLDR  
Subjt:  RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRAL

Query:  CNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVK-SNWEETLGNLNEAADLIG-WFFEK-
        CN  + DL+ F  V + DFL SDH PI  ++    R     K + FRFE  W    +C  I  ++  W       +N +  +      AD +G W   K 
Subjt:  CNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVK-SNWEETLGNLNEAADLIG-WFFEK-

Query:  ---------------------LPITKLEEFL---------FMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFS
                              P+ ++EE L          + W VW  RN  + G K   +N  +      E+  +   EF+  +V +  G     G S
Subjt:  ---------------------LPITKLEEFL---------FMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFS

Query:  LSQNF---WSPP
                W PP
Subjt:  LSQNF---WSPP

KAG6624235.1 hypothetical protein CIPAW_16G012000 [Carya illinoinensis]1.2e-3325.63Show/hide
Query:  LLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDF
        +L+ +    N  W+I GDF EIL   EKWGG  + ++QME FRE ++   L D+G++   YTW  +        ERLDRA+ N  + DL+    V+ +  
Subjt:  LLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDF

Query:  LGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIW---------------QCKKVKSNW------------------------EET
          SDH PI + ++         K   F++E  W + EDC     L  IW               Q ++V  +W                        EE 
Subjt:  LGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIW---------------QCKKVKSNW------------------------EET

Query:  LGNL-------------------------------NEAADLIGWFFEKLPITKLEEFLFMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEF
          N+                               N+   + G    ++P   LEE   +   VW +RN  +   +      EI    +     Q   EF
Subjt:  LGNL-------------------------------NEAADLIGWFFEKLPITKLEEFLFMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEF

Query:  RQTKVSLCDGVTNHSGFSLSQNFWSPPRSDNYKLNTDAFINLKEGRSGYGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQRIEVE
        +  + +    V   SG + S   W  P  D  K+N DA   +K+ R G G +I++  GE + ++    E ++D  + E +A+R+ VE+   L  ++   E
Subjt:  RQTKVSLCDGVTNHSGFSLSQNFWSPPRSDNYKLNTDAFINLKEGRSGYGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQRIEVE

Query:  FDALRVINLLQKHCKNQTEVGRIIEEMLQMARNFKFISFKWCNRETNILAHKLAHMASIDNQEGRWMEKCPDILNG
         DA  V+  +Q+  ++    G I+E++    RN    S ++ +R+ N +AH LA  A    +E  W+E  P  + G
Subjt:  FDALRVINLLQKHCKNQTEVGRIIEEMLQMARNFKFISFKWCNRETNILAHKLAHMASIDNQEGRWMEKCPDILNG

RYR02999.1 hypothetical protein Ahy_B06g081839 [Arachis hypogaea]2.1e-3526.85Show/hide
Query:  YGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAF
        YG+P   KRR  W+ L   +     P    GDF +IL + EK G   + +N +E FR+ +    L D+  K S YTW+  +    V  ERLDR L N  +
Subjt:  YGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAF

Query:  WDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSN-WEETLGNLNEA-ADLIGWFFEKLPITKL
          +Y   +++    + SDHC + +     PR   G+  K F+FE  W  HE+C  +  +   WQ  +   N W + +   N    +LI W  +    T  
Subjt:  WDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSN-WEETLGNLNEA-ADLIGWFFEKLPITKL

Query:  EEFLFM------CWCVWKKRNREV-----VGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFSLSQN--FWSPPRSDNYKLNTDAFIN
        E+   +      CWC+WK RN+ +     +  K   INAE           Q   ++  T  S     T+ +  S  +    W PP  +  K NTDA  +
Subjt:  EEFLFM------CWCVWKKRNREV-----VGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFSLSQN--FWSPPRSDNYKLNTDAFIN

Query:  LKEGRSGYGAIIQNYKGEVMFSMSQPVEWIVDPEI-MEALAIREGVEMAFELGFQRIEVEFDALRVINLLQKHCKNQTEVGRIIEEMLQMARNFKFISFK
         + G +    ++++++G+++   +    +I +  I  EA A RE + +   L      +E D L ++  ++       E   II ++ Q+      +   
Subjt:  LKEGRSGYGAIIQNYKGEVMFSMSQPVEWIVDPEI-MEALAIREGVEMAFELGFQRIEVEFDALRVINLLQKHCKNQTEVGRIIEEMLQMARNFKFISFK

Query:  WCNRETNILAHKLAHMASIDNQEGRWMEKCPD
        W  RE N +AH+LA MA+ +    +W+   P+
Subjt:  WCNRETNILAHKLAHMASIDNQEGRWMEKCPD

TXG69190.1 hypothetical protein EZV62_004125 [Acer yangbiense]5.6e-5230.97Show/hide
Query:  RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRAL
        RLTGFYG P  T+R   W LL+RL+GM   PW + GDF EI+   EK GG+ + +  M +F+EA++ C L D+GF    +TW    + +  + ERLDR +
Subjt:  RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRAL

Query:  CNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEETLGNLNEAADLIGWFFEKLPI
         N+ + DL+    ++HLDF  SDH PI + +S      GG+    F +++ W   +D               V+ ++ +         D + +   KL I
Subjt:  CNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEETLGNLNEAADLIGWFFEKLPI

Query:  TKLEEFLFMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFSLSQNF---WSPPRSDNYKLNTDAFINLKEGRSG
           E    + W VW +RN+ V        +  +H     ++   F  +F+  K       T  +G  + Q     W P  S +YK+NTDA ++ +   +G
Subjt:  TKLEEFLFMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFSLSQNF---WSPPRSDNYKLNTDAFINLKEGRSG

Query:  YGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQRIEVEFDALRVINLLQKHCKNQTEVGRIIEEMLQMARNFKFISFKWCNRETNI
         G +I++  G VM S+ Q    ++ P+ +EA+A+  G  +A E G     +E D+L V+NL+      + E+G ++ ++L M  N  F S  +  R TN 
Subjt:  YGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQRIEVEFDALRVINLLQKHCKNQTEVGRIIEEMLQMARNFKFISFKWCNRETNI

Query:  LAHKLAHMASIDNQEGRWMEKCP
        +AH LA ++     E  W+E CP
Subjt:  LAHKLAHMASIDNQEGRWMEKCP

XP_042942839.1 uncharacterized protein LOC122277021 [Carya illinoinensis]5.8e-3340.21Show/hide
Query:  LTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALC
        LTGFYG+P +TKRR SW+LLQ L       W+  GDF E+L   +K  G ++  NQ+E FR A +SC LFDMG+  + +TW    + Q  + ER+DRALC
Subjt:  LTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALC

Query:  NSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIW-QCKKVKSNWEETLGNLNEA-ADLIGW
        N  + +L+ FS V +L  L SDHCPI +SV ++  +   +K++ FR+E  W + ED Y +  L   W + ++V     + +  LN+   +L+ W
Subjt:  NSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIW-QCKKVKSNWEETLGNLNEA-ADLIGW

TrEMBL top hitse value%identityAlignment
A0A1U8M810 uncharacterized protein LOC1079339863.5e-3137.31Show/hide
Query:  GMAKVWN-------WKHKNELLDKNSSQSTKRSKKRLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAI
        G++  WN       + +    +D   +    ++K R TGFYG+P    ++ SW LL++L  MY+ PW + GDF EI+Y +EK GG  K + QM++FR  +
Subjt:  GMAKVWN-------WKHKNELLDKNSSQSTKRSKKRLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAI

Query:  QSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDC
        + C+L DMGF    +TW R       + ERLDR L N  + +L+    VQ+L    SDHCPI I ++   ++FG   N  FRFE  W     C
Subjt:  QSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDC

A0A5C7I4W9 RNase H domain-containing protein7.7e-3127.54Show/hide
Query:  RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRAL
        R +G YGDP  + R ++W L++RL  + N PWV  GDF E+L   EK GGS+K+   +  FR+ +  CE  D+GF    +TW      +  + ERLDR L
Subjt:  RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRAL

Query:  CNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEETLGNL--------NEAADLIG
         ++ + ++Y    ++HL F  SDH P+   +  + RN     N+    +K             +  +   + +K +  E + N+        N   D++ 
Subjt:  CNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEETLGNL--------NEAADLIG

Query:  WFFEKLPITKLEEFLFMCWCVWKKRNREVVGFKGGGI-------NAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFSLSQNFWSPPRSDNYKLNT
         FF    + +L  F  + W +W+ RN  +V   G G+        AE+ L+           EF QT +S    V   S   L    W  P     KLN+
Subjt:  WFFEKLPITKLEEFLFMCWCVWKKRNREVVGFKGGGI-------NAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFSLSQNFWSPPRSDNYKLNT

Query:  DAFINLKEGRSGYGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQRIEVEFDALRVINLL
            N     +  G++I + KG+++ + ++    +   E    LA+ EG+ +A  LG      E D L V+++L
Subjt:  DAFINLKEGRSGYGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQRIEVEFDALRVINLL

A0A5C7IIT4 Uncharacterized protein2.7e-5230.97Show/hide
Query:  RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRAL
        RLTGFYG P  T+R   W LL+RL+GM   PW + GDF EI+   EK GG+ + +  M +F+EA++ C L D+GF    +TW    + +  + ERLDR +
Subjt:  RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRAL

Query:  CNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEETLGNLNEAADLIGWFFEKLPI
         N+ + DL+    ++HLDF  SDH PI + +S      GG+    F +++ W   +D               V+ ++ +         D + +   KL I
Subjt:  CNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEETLGNLNEAADLIGWFFEKLPI

Query:  TKLEEFLFMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFSLSQNF---WSPPRSDNYKLNTDAFINLKEGRSG
           E    + W VW +RN+ V        +  +H     ++   F  +F+  K       T  +G  + Q     W P  S +YK+NTDA ++ +   +G
Subjt:  TKLEEFLFMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFSLSQNF---WSPPRSDNYKLNTDAFINLKEGRSG

Query:  YGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQRIEVEFDALRVINLLQKHCKNQTEVGRIIEEMLQMARNFKFISFKWCNRETNI
         G +I++  G VM S+ Q    ++ P+ +EA+A+  G  +A E G     +E D+L V+NL+      + E+G ++ ++L M  N  F S  +  R TN 
Subjt:  YGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFELGFQRIEVEFDALRVINLLQKHCKNQTEVGRIIEEMLQMARNFKFISFKWCNRETNI

Query:  LAHKLAHMASIDNQEGRWMEKCP
        +AH LA ++     E  W+E CP
Subjt:  LAHKLAHMASIDNQEGRWMEKCP

A0A7J6GGL8 CCHC-type domain-containing protein2.6e-3130.45Show/hide
Query:  RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRAL
        R TGFYG+P ++ R  SW LL RL  +++ PW+  GDF EIL  +EK GGS +S + M +F+ A+  C L D+GF+   +TW         + ERLDR  
Subjt:  RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRAL

Query:  CNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVK-SNWEETLGNLNEAADLIG-WFFEK-
        CN  + DL+ F  V + DFL SDH PI  ++    R     K + FRFE  W    +C  I  ++  W       +N +  +      AD +G W   K 
Subjt:  CNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVK-SNWEETLGNLNEAADLIG-WFFEK-

Query:  ---------------------LPITKLEEFL---------FMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFS
                              P+ ++EE L          + W VW  RN  + G K   +N  +      E+  +   EF+  +V +  G     G S
Subjt:  ---------------------LPITKLEEFL---------FMCWCVWKKRNREVVGFKGGGINAEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFS

Query:  LSQNF---WSPP
                W PP
Subjt:  LSQNF---WSPP

A0A803QD63 Uncharacterized protein5.9e-3139.68Show/hide
Query:  RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRAL
        R TGFYGDP  T+R  SW+LL+RLS MY GPW + G+F EIL + EK GGS K    + +FR+A+ SC+L D+GF+ S YTW     KQ ++ ERLD+  
Subjt:  RLTGFYGDPCSTKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRAL

Query:  CNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRN---FGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEETLGNLNE
         NS +++ +  ++V+HLD + SDHCP+ ++  K P +      +    F FE  W   E+C  I  + ++W   +   + +E    L +
Subjt:  CNSAFWDLYLFSVVQHLDFLGSDHCPIKISVSKYPRN---FGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEETLGNLNE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCATTTTCGACAATTTTGCTCCAGTGATGGCAAAGCCGCGGTCAATTAGGGGTGAGGTGGAAGTCACCGAAGATGGGAAGATGGTTGGAAAAATTAAAACGGCCGA
TTTAAATGGGGTAGGAGGGTTATTGAAGGAGATGGAAATTAACATAAATGAGGTAATGAGGGTTATGGAAAGGTTGGAAGAAGGAGAGGATATTACTTATCCTTTTGGGA
TGGCAAAAGTTTGGAATTGGAAACATAAAAATGAGCTATTGGACAAGAATTCTTCCCAATCAACAAAGAGAAGCAAAAAGCGACTAACTGGTTTTTATGGAGACCCTTGT
TCTACTAAACGGAGGTCTTCATGGGAACTATTACAACGTCTTAGTGGAATGTATAATGGACCATGGGTAATAGCAGGTGATTTCATTGAAATATTGTACGAACATGAGAA
ATGGGGAGGTTCAAAAAAATCACAAAATCAGATGGAAGATTTTCGAGAGGCAATTCAGTCTTGTGAATTGTTCGATATGGGTTTTAAGAGATCTTATTACACCTGGTATA
GAACTGTGAATAAGCAAGTTGTTTTGATGGAACGCTTGGATAGAGCGTTATGTAACTCAGCCTTTTGGGATTTGTACCTTTTTTCGGTTGTGCAACATCTTGATTTTTTG
GGGTCAGATCATTGTCCAATTAAGATTAGTGTTTCTAAATACCCTAGAAATTTTGGTGGTAAGAAGAATAAAATTTTTCGATTTGAAAAGGTTTGGACTATGCATGAAGA
TTGTTATTCCATTACTACTTTACATGCAATCTGGCAATGTAAAAAAGTGAAGTCTAATTGGGAGGAAACCCTTGGTAATTTAAATGAGGCTGCTGATTTGATTGGATGGT
TCTTTGAAAAATTGCCAATTACTAAATTGGAGGAGTTCTTATTTATGTGTTGGTGTGTGTGGAAAAAAAGAAATAGGGAGGTGGTTGGTTTCAAGGGTGGAGGCATCAAT
GCAGAAATACACTTAAATTTTAATTGGGAGTATTGTTGTCAGTTTTTTGCTGAGTTCAGGCAAACAAAAGTGAGTTTGTGTGACGGAGTTACAAATCACAGTGGGTTTTC
TCTAAGTCAGAATTTTTGGTCTCCTCCAAGAAGTGATAACTATAAATTAAATACTGATGCCTTTATAAATTTAAAGGAAGGAAGAAGTGGATATGGGGCTATTATTCAAA
ATTACAAGGGTGAGGTAATGTTCTCAATGTCTCAACCAGTCGAGTGGATTGTTGATCCAGAAATTATGGAAGCATTAGCTATTAGAGAAGGAGTGGAAATGGCCTTTGAA
CTTGGTTTCCAACGTATTGAGGTGGAGTTCGATGCCTTACGTGTGATTAATCTGTTACAAAAACATTGCAAGAATCAAACAGAGGTTGGAAGAATCATTGAAGAAATGCT
GCAAATGGCAAGGAATTTCAAGTTTATCTCTTTCAAATGGTGTAATCGGGAGACAAATATTCTTGCCCATAAACTAGCACACATGGCTAGTATTGACAATCAAGAAGGAA
GATGGATGGAAAAATGTCCAGATATTCTAAATGGCCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCATTTTCGACAATTTTGCTCCAGTGATGGCAAAGCCGCGGTCAATTAGGGGTGAGGTGGAAGTCACCGAAGATGGGAAGATGGTTGGAAAAATTAAAACGGCCGA
TTTAAATGGGGTAGGAGGGTTATTGAAGGAGATGGAAATTAACATAAATGAGGTAATGAGGGTTATGGAAAGGTTGGAAGAAGGAGAGGATATTACTTATCCTTTTGGGA
TGGCAAAAGTTTGGAATTGGAAACATAAAAATGAGCTATTGGACAAGAATTCTTCCCAATCAACAAAGAGAAGCAAAAAGCGACTAACTGGTTTTTATGGAGACCCTTGT
TCTACTAAACGGAGGTCTTCATGGGAACTATTACAACGTCTTAGTGGAATGTATAATGGACCATGGGTAATAGCAGGTGATTTCATTGAAATATTGTACGAACATGAGAA
ATGGGGAGGTTCAAAAAAATCACAAAATCAGATGGAAGATTTTCGAGAGGCAATTCAGTCTTGTGAATTGTTCGATATGGGTTTTAAGAGATCTTATTACACCTGGTATA
GAACTGTGAATAAGCAAGTTGTTTTGATGGAACGCTTGGATAGAGCGTTATGTAACTCAGCCTTTTGGGATTTGTACCTTTTTTCGGTTGTGCAACATCTTGATTTTTTG
GGGTCAGATCATTGTCCAATTAAGATTAGTGTTTCTAAATACCCTAGAAATTTTGGTGGTAAGAAGAATAAAATTTTTCGATTTGAAAAGGTTTGGACTATGCATGAAGA
TTGTTATTCCATTACTACTTTACATGCAATCTGGCAATGTAAAAAAGTGAAGTCTAATTGGGAGGAAACCCTTGGTAATTTAAATGAGGCTGCTGATTTGATTGGATGGT
TCTTTGAAAAATTGCCAATTACTAAATTGGAGGAGTTCTTATTTATGTGTTGGTGTGTGTGGAAAAAAAGAAATAGGGAGGTGGTTGGTTTCAAGGGTGGAGGCATCAAT
GCAGAAATACACTTAAATTTTAATTGGGAGTATTGTTGTCAGTTTTTTGCTGAGTTCAGGCAAACAAAAGTGAGTTTGTGTGACGGAGTTACAAATCACAGTGGGTTTTC
TCTAAGTCAGAATTTTTGGTCTCCTCCAAGAAGTGATAACTATAAATTAAATACTGATGCCTTTATAAATTTAAAGGAAGGAAGAAGTGGATATGGGGCTATTATTCAAA
ATTACAAGGGTGAGGTAATGTTCTCAATGTCTCAACCAGTCGAGTGGATTGTTGATCCAGAAATTATGGAAGCATTAGCTATTAGAGAAGGAGTGGAAATGGCCTTTGAA
CTTGGTTTCCAACGTATTGAGGTGGAGTTCGATGCCTTACGTGTGATTAATCTGTTACAAAAACATTGCAAGAATCAAACAGAGGTTGGAAGAATCATTGAAGAAATGCT
GCAAATGGCAAGGAATTTCAAGTTTATCTCTTTCAAATGGTGTAATCGGGAGACAAATATTCTTGCCCATAAACTAGCACACATGGCTAGTATTGACAATCAAGAAGGAA
GATGGATGGAAAAATGTCCAGATATTCTAAATGGCCTTTAG
Protein sequenceShow/hide protein sequence
MAIFDNFAPVMAKPRSIRGEVEVTEDGKMVGKIKTADLNGVGGLLKEMEININEVMRVMERLEEGEDITYPFGMAKVWNWKHKNELLDKNSSQSTKRSKKRLTGFYGDPC
STKRRSSWELLQRLSGMYNGPWVIAGDFIEILYEHEKWGGSKKSQNQMEDFREAIQSCELFDMGFKRSYYTWYRTVNKQVVLMERLDRALCNSAFWDLYLFSVVQHLDFL
GSDHCPIKISVSKYPRNFGGKKNKIFRFEKVWTMHEDCYSITTLHAIWQCKKVKSNWEETLGNLNEAADLIGWFFEKLPITKLEEFLFMCWCVWKKRNREVVGFKGGGIN
AEIHLNFNWEYCCQFFAEFRQTKVSLCDGVTNHSGFSLSQNFWSPPRSDNYKLNTDAFINLKEGRSGYGAIIQNYKGEVMFSMSQPVEWIVDPEIMEALAIREGVEMAFE
LGFQRIEVEFDALRVINLLQKHCKNQTEVGRIIEEMLQMARNFKFISFKWCNRETNILAHKLAHMASIDNQEGRWMEKCPDILNGL