; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0013911 (gene) of Snake gourd v1 genome

Gene IDTan0013911
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG05:75264910..75266104
RNA-Seq ExpressionTan0013911
SyntenyTan0013911
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_018818082.1 uncharacterized protein LOC108989062 [Juglans regia]5.6e-1526.76Show/hide
Query:  GSSDTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWH
        G  D  +W  +  GVF V+SAY L +  N       SSN+S ++ +WKA+W  +   K+KI AW++ KD +P+  N++ K ++ + +C +   +  +  H
Subjt:  GSSDTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWH

Query:  WMVKNGNGADTKKAILLIWAFGR--TEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWL-LNSDASWNAKERKGGLGWVVRDSSGSP
         +V      ++ K +L I    +   ++V T   ++   +++ T  +  ++ +  N + +G   +PPP  +L LN D S    ++K G+G V+RD  G  
Subjt:  WMVKNGNGADTKKAILLIWAFGR--TEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWL-LNSDASWNAKERKGGLGWVVRDSSGSP

Query:  IYAGFK------STTKDWLIKFVRAI-------ISRNKDLSEIRLIVEDID----NLRSKLNSVVFEKCPRSGNTVAHALARIA
        I+A  K            L+  VR +       I++ +  S+ +L+++D+        S+L ++   +    GN+ AH LAR+A
Subjt:  IYAGFK------STTKDWLIKFVRAI-------ISRNKDLSEIRLIVEDID----NLRSKLNSVVFEKCPRSGNTVAHALARIA

XP_034218997.1 uncharacterized protein LOC117630370 [Prunus dulcis]3.9e-1626.91Show/hide
Query:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLL-------------
        D ++W  D KG+FTVKSAYH+A +L+SS+  +SSSN+ +++  W  LW      +V+   W V    +P+KAN+  K +  ++ C+L             
Subjt:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLL-------------

Query:  EWWKPTNFWHWMVKNGNGADTKKAILLIWAFGRTEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVV
        +W      WH    + +  +        W F   E +        T+L +           +  P +  +W +       ++    + A   +GG+G VV
Subjt:  EWWKPTNFWHWMVKNGNGADTKKAILLIWAFGRTEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVV

Query:  RDSSGSPIYAGFKSTTKDWLIKFVRAIISRNKDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALARIAM
        RDS+G  +       T  +    V A+ +R  D S +  +VED  +L +++    F    R+ N VAH LAR A+
Subjt:  RDSSGSPIYAGFKSTTKDWLIKFVRAIISRNKDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALARIAM

XP_042950144.1 uncharacterized protein LOC122282259 [Carya illinoinensis]6.6e-1627.08Show/hide
Query:  GSSDTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSISIWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRC-------------
        G  D  +W  +  G F+V+SAY L       ++  SS+++S  S+WKA+W  +   K++I AWK+ KD +P   N+I K +D + +C             
Subjt:  GSSDTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSISIWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRC-------------

Query:  ------LLEWW-----------KPTNFWHWMVKNGNGADTKKA---ILLIWAF------GRTEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPES---HG
              L+E+W           KP  F   +    N   T++     L+ W F         ++V+  IK +  +  LS + S ++  + KNP+    +G
Subjt:  ------LLEWW-----------KPTNFWHWMVKNGNGADTKKA---ILLIWAF------GRTEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPES---HG

Query:  IWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSPIYAGFK------STTKDWLIKFVRAI-------ISRNKDLSEIRLIVE-----------DID
        +W+ PP     LN D +    ++K G+G V+RD  G  I A  K            L+  +R +       IS+    S+ +L+VE            + 
Subjt:  IWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSPIYAGFK------STTKDWLIKFVRAI-------ISRNKDLSEIRLIVE-----------DID

Query:  NLRSKLNSVV--FEKC-----PRSGNTVAHALARIA
        NL S++  ++  FE C      R GN  AH LAR+A
Subjt:  NLRSKLNSVV--FEKC-----PRSGNTVAHALARIA

XP_042965942.1 uncharacterized protein LOC122299620 [Carya illinoinensis]5.1e-1626.79Show/hide
Query:  GSSDTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSISIWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRC-------------
        G  D  +W  +  G F+V+SAY L       ++  SS+++S  S+WKA+W  +   K++I AWK+ KD +P+  N+I + +D + +C             
Subjt:  GSSDTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSISIWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRC-------------

Query:  ------LLEWW-----------KPTNFWHWMVKNGNGADTKKA---ILLIWAFGRT------EIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPES---HG
              L+E+W           KP  F   +    N   T++     L+ W+F         ++V+  IK +  +  LS + S ++  + KNP+    +G
Subjt:  ------LLEWW-----------KPTNFWHWMVKNGNGADTKKA---ILLIWAFGRT------EIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPES---HG

Query:  IWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSPIYAGFK------STTKDWLIKFVRAI-------ISRNKDLSEIRLIVE-----------DID
        +W+ PP     LN D +    ++K G+G V+RD  G  I A  K            L+  +R +       IS+    S+ +L+VE            + 
Subjt:  IWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSPIYAGFK------STTKDWLIKFVRAI-------ISRNKDLSEIRLIVE-----------DID

Query:  NLRSKLNSV--VFEKC-----PRSGNTVAHALARIA
        NL S++  +  +FE C      R GN  AH LAR+A
Subjt:  NLRSKLNSV--VFEKC-----PRSGNTVAHALARIA

XP_042988708.1 uncharacterized protein LOC122316242 [Carya illinoinensis]1.3e-1425.65Show/hide
Query:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNN-SSISIWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWHWMV
        D +IW  +P G FTVKSAY +      +   S  S       +WK LW+ +   KVKI AWK  K+S+P+K N++ K + + D C +   +  +  H + 
Subjt:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNN-SSISIWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWHWMV

Query:  KNGNGADTKKAILLI----WAFGRTEIVLTKIKLLQTYLNLSTEDSHLTK----DRLKNPESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSG
           +  +  +   L+    W          K+  ++  +  +    H+ K      LK    H  W  PPP  + LN D +    +   G+G ++RD+ G
Subjt:  KNGNGADTKKAILLI----WAFGRTEIVLTKIKLLQTYLNLSTEDSHLTK----DRLKNPESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSG

Query:  SPIYAGFKSTTKDWLIKFVRAIISRNKDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALARIA
          + A      K   +  V+ +      ++ +  ++ D   L S  ++       RS N  AH LAR A
Subjt:  SPIYAGFKSTTKDWLIKFVRAIISRNKDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALARIA

TrEMBL top hitse value%identityAlignment
A0A2I4EFB1 uncharacterized protein LOC1089890622.7e-1526.76Show/hide
Query:  GSSDTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWH
        G  D  +W  +  GVF V+SAY L +  N       SSN+S ++ +WKA+W  +   K+KI AW++ KD +P+  N++ K ++ + +C +   +  +  H
Subjt:  GSSDTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWH

Query:  WMVKNGNGADTKKAILLIWAFGR--TEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWL-LNSDASWNAKERKGGLGWVVRDSSGSP
         +V      ++ K +L I    +   ++V T   ++   +++ T  +  ++ +  N + +G   +PPP  +L LN D S    ++K G+G V+RD  G  
Subjt:  WMVKNGNGADTKKAILLIWAFGR--TEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWL-LNSDASWNAKERKGGLGWVVRDSSGSP

Query:  IYAGFK------STTKDWLIKFVRAI-------ISRNKDLSEIRLIVEDID----NLRSKLNSVVFEKCPRSGNTVAHALARIA
        I+A  K            L+  VR +       I++ +  S+ +L+++D+        S+L ++   +    GN+ AH LAR+A
Subjt:  IYAGFK------STTKDWLIKFVRAI-------ISRNKDLSEIRLIVEDID----NLRSKLNSVVFEKCPRSGNTVAHALARIA

M5VU98 Reverse transcriptase domain-containing protein1.9e-2129.19Show/hide
Query:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWHWMV
        D I+W  D  G+FTVKSAY +A+ + S     SSS+NS    +W+ +WN     K+KI AW+V  D +P+KAN+I KG+D  D C+       +  H + 
Subjt:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWHWMV

Query:  KNGNGADTKKAILLIWAFGR------TEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSP
               T    LL     +       E+V    + +  ++  +   S +T DR+++P     W  PP      N D +++    +G +G V RD+ G  
Subjt:  KNGNGADTKKAILLIWAFGR------TEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSP

Query:  IYAGFKST-------------------------TKDWLIK-----FVRAIISRNKDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALARIAM
        + A  KS                          T   + +      V AI    +D S I  IVED+ +L+ +  S +F+  PR  N VAH LAR  +
Subjt:  IYAGFKST-------------------------TKDWLIK-----FVRAIISRNKDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALARIAM

M5XHI9 Reverse transcriptase domain-containing protein5.7e-2127.85Show/hide
Query:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWHWMV
        D I+W  D  G+FTVKSAY +A+ + S     SSS+NS    +W+ +WN     K+KI AW+V  D +P+KAN+I KG+D  D C+       +  H + 
Subjt:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWHWMV

Query:  KNGNGADTKKAILLIWAFGR------TEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSP
               T    LL     +       E+V    + +  ++  +   S +T DR+++P     W  PP      N D +++    +  +G V RD+ G  
Subjt:  KNGNGADTKKAILLIWAFGR------TEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSP

Query:  IYAGFKSTTKDWLIKFVRAIISR------------------------------NKDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALARIAM
        + A  KS  +    +    +++R                               +D S I  IVED+ +L+ +  S +F+  PR  N VAH LAR  +
Subjt:  IYAGFKSTTKDWLIKFVRAIISR------------------------------NKDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALARIAM

M5XK32 Reverse transcriptase domain-containing protein (Fragment)3.7e-2027.52Show/hide
Query:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWHWMV
        D I+W  D  G+FTVKSAY +A+ + S     SSS+NS  S +W+ +WN     K+KI AW+V  D +P+KAN+I KG+D  D C+       +  H + 
Subjt:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWHWMV

Query:  KNGNGADTKKAILLIWAFGR------TEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSP
               T    LL     +       ++V    + +  ++  +   S +T DR+++P     W  P       N D +++    +G +G V RD+ G  
Subjt:  KNGNGADTKKAILLIWAFGR------TEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSP

Query:  IYAGFKSTTKDWLIKFVRAIISR------------------------------NKDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALARIAM
        + A  KS  +    +    + +R                               +D S I  IVED+ +L+ +  S +F+  PR  N V H LAR  +
Subjt:  IYAGFKSTTKDWLIKFVRAIISR------------------------------NKDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALARIAM

M5XSK0 Reverse transcriptase domain-containing protein6.5e-1726.17Show/hide
Query:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWHWM-
        D ++W  D KG+FTVKSAYH+A +L+SS+  +SSSN+ +++  W  LW      +VK   W+V    +P+KAN+  K +  ++ C+L      +  H + 
Subjt:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSIS-IWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWHWM-

Query:  -VKNGNGADTKK-----------------AILLIWAFGRTEIVL------TKIKLLQTYLNLSTEDSHLTKDRLKNPESHG----IWTHPPPNCWLLNSD
             NGA + K                  +++ WA       L      ++ + +  + +L   D     + L +    G    +W  P  N   +N D
Subjt:  -VKNGNGADTKK-----------------AILLIWAFGRTEIVL------TKIKLLQTYLNLSTEDSHLTKDRLKNPESHG----IWTHPPPNCWLLNSD

Query:  ASWNAKERKGGLGWVVRDSSG---------------SPIYAGFKSTTKDWL---------------IKFVRAIISRNKDLSEIRLIVEDIDNLRSKLNSV
         +W     +GG+G VVRDS+G               +P      + T   L               ++ V A+ + + D S I  +VED  +L +++   
Subjt:  ASWNAKERKGGLGWVVRDSSG---------------SPIYAGFKSTTKDWL---------------IKFVRAIISRNKDLSEIRLIVEDIDNLRSKLNSV

Query:  VFEKCPRSGNTVAHALARIAM
         F    R+ N VAH LAR A+
Subjt:  VFEKCPRSGNTVAHALARIAM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G34320.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.5e-0526.72Show/hide
Query:  WTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSPIYAGFKS--TTKD----------WLIKFV-----RAIISRNKDLSEIRLI------------V
        W  PP      N+DA+W  +  + G+GW++R+ SG  ++ G ++   TK+          W +  +     + II  +   + + L+            +
Subjt:  WTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSPIYAGFKS--TTKD----------WLIKFV-----RAIISRNKDLSEIRLI------------V

Query:  EDIDNLRSKLNSVVFEKCPRSGNTVAHALAR
        EDI  L      V FE  PR GN VA  +AR
Subjt:  EDIDNLRSKLNSVVFEKCPRSGNTVAHALAR

AT3G09510.1 Ribonuclease H-like superfamily protein9.3e-0821.6Show/hide
Query:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSISIWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRC-------------LLE
        D IIW  +  G +TV+S Y L  +  S+++P+ +  + SI +   +WN    PK+K   W+    ++ +   +  +G+  +  C             L  
Subjt:  DTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSISIWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRC-------------LLE

Query:  WWKPTNFWHW------------------------MVKNGNGADTKK--AILLIWAF--GRTEIVLTKIKLLQTYLNLSTE-DSH----LTKDRLKNP---
            T  W                           V++   +D  K   + LIW     R  +V  K +   +   LS + ++H     T+   K P   
Subjt:  WWKPTNFWHW------------------------MVKNGNGADTKK--AILLIWAF--GRTEIVLTKIKLLQTYLNLSTE-DSH----LTKDRLKNP---

Query:  ----ESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSPIYAG
            E+   W +PP      N DA ++ ++ +   GW++R+  G+PI  G
Subjt:  ----ESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSPIYAG

AT4G29090.1 Ribonuclease H-like superfamily protein4.6e-1522.29Show/hide
Query:  DTIIWGVDPKGVFTVKSAYHLAIN-LNSSSLPSSSSNNSSISIWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCL--------------
        D+  W     G +TVKS Y +    +N  S P   S  S   I++ +W  +T PK++   WK   +S+P    +  + +     C+              
Subjt:  DTIIWGVDPKGVFTVKSAYHLAIN-LNSSSLPSSSSNNSSISIWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCL--------------

Query:  ------LEW------------WKPT----NFWHWMVKNGNGADTKKAILLIWAF-----GRTEIVLTKIKL-LQTYLNLSTEDSHLTKDRLK--------
              L W            W  +     +W + + NGN    K + L+ W        R E+V    +   Q  L  + +D    + R +        
Subjt:  ------LEW------------WKPT----NFWHWMVKNGNGADTKKAILLIWAF-----GRTEIVLTKIKL-LQTYLNLSTEDSHLTKDRLK--------

Query:  --NPESHGIWTHPPPNCWL-LNSDASWNAKERKGGLGWVVRDSSGSPIYAGFKSTTK------------DWLI-----------------KFVRAIISRN
          N  S G W  PPP+ W+  N+DA+WN    + G+GWV+R+  G   + G ++  K             W +                 + +  I++ +
Subjt:  --NPESHGIWTHPPPNCWL-LNSDASWNAKERKGGLGWVVRDSSGSPIYAGFKSTTK------------DWLI-----------------KFVRAIISRN

Query:  KDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALAR
        +    ++  ++D+  L S+   V F   PR GNT+A  +AR
Subjt:  KDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALAR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGCTGGCCGCACAGGCTCTTCAGACACTATTATATGGGGTGTCGATCCTAAGGGGGTCTTCACTGTTAAATCGGCCTACCATCTAGCAATCAACTTAAACTCAAG
TTCCTTACCTTCTAGCTCGAGCAATAACTCTTCCATCTCTATCTGGAAAGCTCTTTGGAACCAAGAAACGTGCCCTAAAGTAAAGATTTGCGCTTGGAAGGTCTTCAAGG
ATAGTATTCCCTCGAAAGCTAATATTATTGGCAAAGGAATTGACACTAATGATCGATGTCTTTTGGAATGGTGGAAGCCTACTAACTTTTGGCATTGGATGGTGAAGAAT
GGAAATGGAGCTGATACGAAGAAGGCAATCCTGCTAATTTGGGCATTTGGACGTACAGAAATCGTATTAACAAAGATAAAGCTACTCCAGACATATCTGAATTTATCAAC
TGAGGATTCGCATCTGACAAAGGACAGATTGAAGAACCCTGAAAGTCATGGAATTTGGACCCATCCGCCCCCAAATTGTTGGTTGCTCAATTCTGATGCTTCTTGGAACG
CCAAGGAGAGAAAAGGAGGTCTCGGTTGGGTGGTCCGTGACTCTTCTGGATCTCCGATTTATGCGGGTTTCAAGTCTACCACGAAGGATTGGCTGATTAAATTTGTAAGA
GCGATTATCTCTAGAAATAAAGACCTTTCGGAGATCCGTCTGATTGTGGAGGATATTGACAACCTTCGTTCGAAGCTTAACTCGGTGGTTTTTGAGAAATGCCCAAGGTC
TGGTAACACGGTGGCTCATGCCCTCGCCAGAATAGCCATGGAAGCTACTCCCCCTTCGACATGCGTTGAGAACGCGAGAGGCTCTGGATCAAAGGAAGGCGTCCTTATGT
CTTCTGTTTTTCCTTTTTGGCTTGTTAATCTCATTAATGAGGACGCTGGCAATTGTGCTTTTAGTGTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGCTGGCCGCACAGGCTCTTCAGACACTATTATATGGGGTGTCGATCCTAAGGGGGTCTTCACTGTTAAATCGGCCTACCATCTAGCAATCAACTTAAACTCAAG
TTCCTTACCTTCTAGCTCGAGCAATAACTCTTCCATCTCTATCTGGAAAGCTCTTTGGAACCAAGAAACGTGCCCTAAAGTAAAGATTTGCGCTTGGAAGGTCTTCAAGG
ATAGTATTCCCTCGAAAGCTAATATTATTGGCAAAGGAATTGACACTAATGATCGATGTCTTTTGGAATGGTGGAAGCCTACTAACTTTTGGCATTGGATGGTGAAGAAT
GGAAATGGAGCTGATACGAAGAAGGCAATCCTGCTAATTTGGGCATTTGGACGTACAGAAATCGTATTAACAAAGATAAAGCTACTCCAGACATATCTGAATTTATCAAC
TGAGGATTCGCATCTGACAAAGGACAGATTGAAGAACCCTGAAAGTCATGGAATTTGGACCCATCCGCCCCCAAATTGTTGGTTGCTCAATTCTGATGCTTCTTGGAACG
CCAAGGAGAGAAAAGGAGGTCTCGGTTGGGTGGTCCGTGACTCTTCTGGATCTCCGATTTATGCGGGTTTCAAGTCTACCACGAAGGATTGGCTGATTAAATTTGTAAGA
GCGATTATCTCTAGAAATAAAGACCTTTCGGAGATCCGTCTGATTGTGGAGGATATTGACAACCTTCGTTCGAAGCTTAACTCGGTGGTTTTTGAGAAATGCCCAAGGTC
TGGTAACACGGTGGCTCATGCCCTCGCCAGAATAGCCATGGAAGCTACTCCCCCTTCGACATGCGTTGAGAACGCGAGAGGCTCTGGATCAAAGGAAGGCGTCCTTATGT
CTTCTGTTTTTCCTTTTTGGCTTGTTAATCTCATTAATGAGGACGCTGGCAATTGTGCTTTTAGTGTTTAA
Protein sequenceShow/hide protein sequence
MQAGRTGSSDTIIWGVDPKGVFTVKSAYHLAINLNSSSLPSSSSNNSSISIWKALWNQETCPKVKICAWKVFKDSIPSKANIIGKGIDTNDRCLLEWWKPTNFWHWMVKN
GNGADTKKAILLIWAFGRTEIVLTKIKLLQTYLNLSTEDSHLTKDRLKNPESHGIWTHPPPNCWLLNSDASWNAKERKGGLGWVVRDSSGSPIYAGFKSTTKDWLIKFVR
AIISRNKDLSEIRLIVEDIDNLRSKLNSVVFEKCPRSGNTVAHALARIAMEATPPSTCVENARGSGSKEGVLMSSVFPFWLVNLINEDAGNCAFSV