; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg018349 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg018349
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold3:11066120..11072459
RNA-Seq ExpressionSpg018349
SyntenySpg018349
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PNX96793.1 ribonuclease H, partial [Trifolium pratense]2.4e-2629.1Show/hide
Query:  RVDSILDEK-GRWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSL-AKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKIS
        +V  ++D     W +  + + F++ DA +IL  P   K + D + W   K G++SVKS YH ++   + S   ++++S+ P +IWK  WK K  PK    
Subjt:  RVDSILDEK-GRWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSL-AKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKIS

Query:  VWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPM-DYWAWLSDHLPKEELEKAITILWSLWDY
        +W+I++++IP K N+ KKG+  DP+   C +  ES+ H+  EC+W K++W    PLT    +LN        D++ +++++  K+ +EK + I++ +W+ 
Subjt:  VWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPM-DYWAWLSDHLPKEELEKAITILWSLWDY

Query:  RNQIDQTKINPDQSQLLRVIMRKLEDTEFQPIDYLSKPLRSPKS
        RN +   + N    ++     +KL + +        K   SP S
Subjt:  RNQIDQTKINPDQSQLLRVIMRKLEDTEFQPIDYLSKPLRSPKS

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]3.7e-2727Show/hide
Query:  VDSILDEKGRWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWK
        V  ++DEK +W++  ++++F   DA+ I+  P   + K D++IW  DK+G +SVKS Y + + +   ++PS SN      +W+  WKL    K KI +W+
Subjt:  VDSILDEKGRWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWK

Query:  IIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRNQ-
          HD +PT  N+ KK +  +P+   C    E++SH + EC   ++IW+ +  L   +  + R   + M  + W   H   E  E A  +LW++W  RN+ 
Subjt:  IIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRNQ-

Query:  -IDQTKINPDQSQLLRVIMRKLEDTEFQPIDYLSKPLRSP------KSRAKNLPRFKRIKRSWKVKMLEMKVIIE--------------------GLKSL
          +  K NP     LRV+         + I    K +R P      K  A+   ++      W+   ++  V +E                     +KSL
Subjt:  -IDQTKINPDQSQLLRVIMRKLEDTEFQPIDYLSKPLRSP------KSRAKNLPRFKRIKRSWKVKMLEMKVIIE--------------------GLKSL

Query:  L--GNEAIKE------GLKTRAV------VVESDASTVIKILNEEEEDHSEISFFADEINSLKDQFKEISFVFCPRDQNVAADCLASMASSSLDLFPFLD
           G+ A+ E      GLK          + ESD+  VI ++N++    +EI +   +I      F+       PRD N AA  LA +A    +   +LD
Subjt:  L--GNEAIKE------GLKTRAV------VVESDASTVIKILNEEEEDHSEISFFADEINSLKDQFKEISFVFCPRDQNVAADCLASMASSSLDLFPFLD

XP_024043202.1 uncharacterized protein LOC112099905 [Citrus clementina]3.3e-2827Show/hide
Query:  VDSILDEKGRWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWK
        V  ++D K  WK   + +N    DA+ IL+ P   +   DE+IW  DKRG + VKS Y + + L     P+ SNSS  N  W   WKL    K KI +W+
Subjt:  VDSILDEKGRWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWK

Query:  IIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRNQI
         + + +PT  N+ K+ I  +     C ++ E+  H +  CK  K++W+   PL +    +  G    +     +   L + ++E  +TILW +W  RN++
Subjt:  IIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRNQI

Query:  --DQTKINPDQSQL-LRVIMRKLEDTEF---------QPIDYLSKPLRSPKSRAKNLPRFKRIKRSWKVKMLEMKVIIEGLKSLLGNEAIKEGLKTRAVV
          +  K++P  S      I      T+F         Q +  +   +R    +       K +K S  V ++E +    G++       I + +  ++++
Subjt:  --DQTKINPDQSQL-LRVIMRKLEDTEF---------QPIDYLSKPLRSPKSRAKNLPRFKRIKRSWKVKMLEMKVIIEGLKSLLGNEAIKEGLKTRAVV

Query:  VESDASTVIKILNEEEEDHSEISFFADEINSLKDQFKEISFVFCPRDQNVAADCLASMASSSL
        +E+D+  V  ++N+ E + +EI +   EI SLK  F      + PR  N++A  LA +A + L
Subjt:  VESDASTVIKILNEEEEDHSEISFFADEINSLKDQFKEISFVFCPRDQNVAADCLASMASSSL

XP_030483481.1 uncharacterized protein LOC115700065 [Cannabis sativa]1.6e-2735.64Show/hide
Query:  VDSILDEKGRWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWK
        V +++ E  +W  + + E FS+ID D IL  P      +D +IW     G++ V S YH + SL  S   S+SNSS     WK FWKL+   K KI  W+
Subjt:  VDSILDEKGRWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWK

Query:  IIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWK--NFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRN
        + HD++P  ++++++ I TD    +C+   ES+ H ++ CK+ K +W+   F    N    +N+G     DY  +LS    K E+E  I ILWS+W  RN
Subjt:  IIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWK--NFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRN

Query:  QI
        +I
Subjt:  QI

XP_030509046.1 uncharacterized protein LOC115723707 [Cannabis sativa]1.8e-2635.45Show/hide
Query:  WKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWKIIHDSIPTKS
        W   ++ ++F+ ID D IL  P      SD +IW  +  G +SVKS +HL  SL++  + SSS+  D    WK FWKL   PK KI  WK+I +++P  +
Subjt:  WKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWKIIHDSIPTKS

Query:  NIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRNQ
         + K+ +    +   CKS  ES+ H ++ CK+ K IWKN       I + +  G    DY   L+  + KE  E  I ++WS+W+ RN+
Subjt:  NIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRNQ

TrEMBL top hitse value%identityAlignment
A0A2K3N168 Ribonuclease H (Fragment)1.2e-2629.1Show/hide
Query:  RVDSILDEK-GRWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSL-AKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKIS
        +V  ++D     W +  + + F++ DA +IL  P   K + D + W   K G++SVKS YH ++   + S   ++++S+ P +IWK  WK K  PK    
Subjt:  RVDSILDEK-GRWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSL-AKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKIS

Query:  VWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPM-DYWAWLSDHLPKEELEKAITILWSLWDY
        +W+I++++IP K N+ KKG+  DP+   C +  ES+ H+  EC+W K++W    PLT    +LN        D++ +++++  K+ +EK + I++ +W+ 
Subjt:  VWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPM-DYWAWLSDHLPKEELEKAITILWSLWDY

Query:  RNQIDQTKINPDQSQLLRVIMRKLEDTEFQPIDYLSKPLRSPKS
        RN +   + N    ++     +KL + +        K   SP S
Subjt:  RNQIDQTKINPDQSQLLRVIMRKLEDTEFQPIDYLSKPLRSPKS

A0A2N9EMZ0 Reverse transcriptase domain-containing protein1.2e-2631.05Show/hide
Query:  PELSRKRVDSILDEKGR-WKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIP
        P +S   V  ++D + R WK + V   F   +A VIL  P   +   D ++W   K G ++V+S YHL+++     EPSSS+++   ++W A W L   P
Subjt:  PELSRKRVDSILDEKGR-WKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIP

Query:  KAKISVWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWS
        K +  +W+  H+S+PT+SN+  + I  DP    C ++ ES  H +W+CK IK +W++  P    +  ++  G+I + Y  + +  L   EL+      W 
Subjt:  KAKISVWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWS

Query:  LWDYRNQIDQTKINPDQSQLLRVIMRKLED-TEFQPIDYLSKPLRSPK
        +W  RN++   ++      L ++I R L+   EFQ     S P  SPK
Subjt:  LWDYRNQIDQTKINPDQSQLLRVIMRKLED-TEFQPIDYLSKPLRSPK

A0A2N9H3I8 Uncharacterized protein2.7e-2835Show/hide
Query:  VDSILDEKG-RWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVW
        VD ++++    WK + + E F   D D+I+  P   +   D +IW   KRG+FSVKS YHL +SL  SQE ++S+ +  + IW + W +K  PK K+ VW
Subjt:  VDSILDEKG-RWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVW

Query:  KIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRNQ
        K  HD +PT++ + +KGI      L C  + E+  H++W C++  ++WK+     +  +SL        D+   L   LP   LE A T  W+LW  RN+
Subjt:  KIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRNQ

A0A803PIN0 Uncharacterized protein8.8e-2736.79Show/hide
Query:  RWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWKIIHDSIPTK
        +W  + + E FS+ID D IL  P      +D +IW     G+++V S YH V SL  S   S+SNS  P   WK FWKL+   K KI  W++ HD++P  
Subjt:  RWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWKIIHDSIPTK

Query:  SNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWK--NFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRNQI
        ++++++ I TD    +C+   ES+ H ++ CK+ K +W+   F    N    +N+G     DY   LS    K E+E  I ILWS+W  RN+I
Subjt:  SNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWK--NFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRNQI

A0A803Q6Y1 Uncharacterized protein8.8e-2735.45Show/hide
Query:  WKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWKIIHDSIPTKS
        W   ++ ++F+ ID D IL  P      SD +IW  +  G +SVKS +HL  SL++  + SSS+  D    WK FWKL   PK KI  WK+I +++P  +
Subjt:  WKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWKIIHDSIPTKS

Query:  NIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRNQ
         + K+ +    +   CKS  ES+ H ++ CK+ K IWKN       I + +  G    DY   L+  + KE  E  I ++WS+W+ RN+
Subjt:  NIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLWDYRNQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.2e-0525.23Show/hide
Query:  DEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWE
        D  IW  D     ++ ST    ++L         N   P   +KA W    +PK     W +  + + T+  +   G+    + L+C S  ES +HL +E
Subjt:  DEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKAFWKLKAIPKAKISVWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWE

Query:  CKWIKEIWKNF
        C +   +W+ F
Subjt:  CKWIKEIWKNF

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.9e-0530.43Show/hide
Query:  WKAFWKLKAIPKAKISVWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNF
        +KA W    IPK     W  +   + TK  +I  G    P+ L C +  E+  HL ++C++ +E+W  F
Subjt:  WKAFWKLKAIPKAKISVWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNF

AT3G25270.1 Ribonuclease H-like superfamily protein3.1e-0827.17Show/hide
Query:  WKLKAIPKAKISVWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMD------YWAWLSDHLP
        WKLK  PK K  +WK++  ++ T  N+ ++ I   P    C  + E+  HL ++C + +++W+       P   L   G I M+        + L++  P
Subjt:  WKLKAIPKAKISVWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMD------YWAWLSDHLP

Query:  KEELEKAITILWSLWDYRNQI--DQTKIN-PDQSQLLRVIMRKLEDTEFQPIDYLSKPLRSPKSRAKNLPRFK
         +    AI ILW LW  RNQ+   Q  I+  +  Q  R  +++ EDT    +  L++ + S + +   + R K
Subjt:  KEELEKAITILWSLWDYRNQI--DQTKIN-PDQSQLLRVIMRKLEDTEFQPIDYLSKPLRSPKSRAKNLPRFK

AT4G29090.1 Ribonuclease H-like superfamily protein1.1e-1623.06Show/hide
Query:  RVDSILDEKGRWKDKEVVEN-FSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLA-KSQEPSSSNSSDPNKIWKAFWKLKAIPKAKIS
        +V  ++DE GR   K+V+E  F  ++  +I       +   D   W     G ++VKS Y ++  +  K   P   +    N I++  WK +  PK +  
Subjt:  RVDSILDEKGRWKDKEVVEN-FSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLA-KSQEPSSSNSSDPNKIWKAFWKLKAIPKAKIS

Query:  VWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMD----YWAW-LSDHLPKEELEKAIT--IL
        +WK + +S+P    +  + +  +   + C S  E+++HL+++C + +  W     +  P+     G W        YW + L +  P+ E    +   +L
Subjt:  VWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMD----YWAW-LSDHLPKEELEKAIT--IL

Query:  WSLWDYRNQIDQTKINPDQSQLLRVIMRKLED----TEFQPIDYLSKPLRSPKSRAKNLPR--------------FKRIKRSWKVKMLEMKVIIEGLKSL
        W LW  RN++       +  ++LR     LE+    TE +      +  RS   R +  P                +R    W ++  + +V   G ++L
Subjt:  WSLWDYRNQIDQTKINPDQSQLLRVIMRKLED----TEFQPIDYLSKPLRSPKSRAKNLPR--------------FKRIKRSWKVKMLEMKVIIEGLKSL

Query:  LGNEAIKEG--------------LKTRAVVVESDASTVIKILNEEEEDHSEISFFADEINSLKDQFKEISFVFCPRDQNVAADCLASMASSSLDLFPFL
           +++ E                +   V+ ESD+  +I+ILN +E   S +     ++  L  QF E+ FVF PR+ N  A+ +A  + S L+  P L
Subjt:  LGNEAIKEG--------------LKTRAVVVESDASTVIKILNEEEEDHSEISFFADEINSLKDQFKEISFVFCPRDQNVAADCLASMASSSLDLFPFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACTCTACAAAAGCCCCGGGGGCAAACGGAAGTGAAGTCCTTGTCCTTACCAAGCCCGAGCTTTCAAGGAAAAGAGTGGACTCTATCCTTGATGAAAAGGGAAGGTG
GAAAGACAAAGAAGTGGTAGAAAATTTCTCGGCAATCGATGCTGACGTCATCCTCAACACGCCATCTTATGCAAAAGGAAAAAGTGACGAGATTATATGGAGTCGAGACA
AAAGGGGAATGTTCTCAGTAAAGAGCACCTACCATTTGGTCGTTTCCTTAGCAAAATCCCAGGAACCTTCTTCTTCGAACTCCTCGGATCCAAACAAAATTTGGAAGGCG
TTTTGGAAGCTAAAGGCCATCCCAAAGGCCAAAATCAGCGTGTGGAAAATAATTCATGACTCTATCCCTACTAAATCTAACATTATTAAAAAGGGGATCCCTACTGACCC
GATTTTCTTGATGTGCAAGTCAAAAGGAGAGTCAATGAGCCATCTTATGTGGGAATGTAAGTGGATCAAGGAGATTTGGAAGAATTTCTTTCCCCTAACGAATCCTATAT
TTTCTTTGAACAGGGGAGGATGGATTCCTATGGATTACTGGGCGTGGTTATCGGATCATCTTCCTAAGGAGGAGTTGGAGAAAGCTATTACTATTCTATGGAGCCTATGG
GACTACCGAAACCAGATAGATCAGACAAAAATCAATCCAGACCAATCCCAATTGCTCAGAGTCATAATGCGAAAGCTTGAAGATACAGAATTCCAGCCAATCGATTACCT
GTCCAAGCCACTTAGAAGCCCCAAATCAAGGGCGAAGAACCTCCCGAGATTCAAGCGGATTAAGAGAAGCTGGAAAGTGAAGATGTTAGAAATGAAAGTCATCATTGAGG
GGCTGAAAAGCTTACTTGGTAACGAGGCGATCAAAGAAGGTCTGAAGACAAGAGCAGTTGTGGTCGAATCGGATGCTTCAACCGTGATCAAGATTCTTAACGAAGAAGAA
GAAGATCACTCCGAAATTTCTTTCTTCGCAGATGAAATCAACTCGCTGAAGGATCAATTCAAGGAAATTTCTTTTGTTTTTTGCCCGAGAGATCAAAATGTTGCTGCTGA
CTGTTTGGCGAGCATGGCGAGCTCCTCCCTCGATCTGTTTCCTTTTTTGGATCCCTCTTCCAACGAGGAAGAAGGTGAGGGGTTTTGGGCCTCCCCCTATTGTTATTATT
AA
mRNA sequenceShow/hide mRNA sequence
ATGAACTCTACAAAAGCCCCGGGGGCAAACGGAAGTGAAGTCCTTGTCCTTACCAAGCCCGAGCTTTCAAGGAAAAGAGTGGACTCTATCCTTGATGAAAAGGGAAGGTG
GAAAGACAAAGAAGTGGTAGAAAATTTCTCGGCAATCGATGCTGACGTCATCCTCAACACGCCATCTTATGCAAAAGGAAAAAGTGACGAGATTATATGGAGTCGAGACA
AAAGGGGAATGTTCTCAGTAAAGAGCACCTACCATTTGGTCGTTTCCTTAGCAAAATCCCAGGAACCTTCTTCTTCGAACTCCTCGGATCCAAACAAAATTTGGAAGGCG
TTTTGGAAGCTAAAGGCCATCCCAAAGGCCAAAATCAGCGTGTGGAAAATAATTCATGACTCTATCCCTACTAAATCTAACATTATTAAAAAGGGGATCCCTACTGACCC
GATTTTCTTGATGTGCAAGTCAAAAGGAGAGTCAATGAGCCATCTTATGTGGGAATGTAAGTGGATCAAGGAGATTTGGAAGAATTTCTTTCCCCTAACGAATCCTATAT
TTTCTTTGAACAGGGGAGGATGGATTCCTATGGATTACTGGGCGTGGTTATCGGATCATCTTCCTAAGGAGGAGTTGGAGAAAGCTATTACTATTCTATGGAGCCTATGG
GACTACCGAAACCAGATAGATCAGACAAAAATCAATCCAGACCAATCCCAATTGCTCAGAGTCATAATGCGAAAGCTTGAAGATACAGAATTCCAGCCAATCGATTACCT
GTCCAAGCCACTTAGAAGCCCCAAATCAAGGGCGAAGAACCTCCCGAGATTCAAGCGGATTAAGAGAAGCTGGAAAGTGAAGATGTTAGAAATGAAAGTCATCATTGAGG
GGCTGAAAAGCTTACTTGGTAACGAGGCGATCAAAGAAGGTCTGAAGACAAGAGCAGTTGTGGTCGAATCGGATGCTTCAACCGTGATCAAGATTCTTAACGAAGAAGAA
GAAGATCACTCCGAAATTTCTTTCTTCGCAGATGAAATCAACTCGCTGAAGGATCAATTCAAGGAAATTTCTTTTGTTTTTTGCCCGAGAGATCAAAATGTTGCTGCTGA
CTGTTTGGCGAGCATGGCGAGCTCCTCCCTCGATCTGTTTCCTTTTTTGGATCCCTCTTCCAACGAGGAAGAAGGTGAGGGGTTTTGGGCCTCCCCCTATTGTTATTATT
AA
Protein sequenceShow/hide protein sequence
MNSTKAPGANGSEVLVLTKPELSRKRVDSILDEKGRWKDKEVVENFSAIDADVILNTPSYAKGKSDEIIWSRDKRGMFSVKSTYHLVVSLAKSQEPSSSNSSDPNKIWKA
FWKLKAIPKAKISVWKIIHDSIPTKSNIIKKGIPTDPIFLMCKSKGESMSHLMWECKWIKEIWKNFFPLTNPIFSLNRGGWIPMDYWAWLSDHLPKEELEKAITILWSLW
DYRNQIDQTKINPDQSQLLRVIMRKLEDTEFQPIDYLSKPLRSPKSRAKNLPRFKRIKRSWKVKMLEMKVIIEGLKSLLGNEAIKEGLKTRAVVVESDASTVIKILNEEE
EDHSEISFFADEINSLKDQFKEISFVFCPRDQNVAADCLASMASSSLDLFPFLDPSSNEEEGEGFWASPYCYY