; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036245 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036245
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold5:46022499..46027726
RNA-Seq ExpressionSpg036245
SyntenySpg036245
Gene Ontology termsNA
InterPro domainsIPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]3.3e-10738.18Show/hide
Query:  VLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNW
        +LI   DR D + ++  +  +  S    L  +L ++ +        R K      IA  WG     LP +YLG+PLGG P ++ FW   ++KIQ+++ NW
Subjt:  VLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNW

Query:  RFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKS
        ++  LSKGGR+TLI S L S+P+Y +SVFK P  I  ++E     FLW+G S+    +L+RW  + SPK +GGLGIH + STN ALL KW+W+F TE+  
Subjt:  RFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKS

Query:  LWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFR
        LW++ I +KY  +   SFPS  +FSS+ SPW A+++  S F+ N  W+V +G+ I FW DNW+   PL     RL+ LS+NK  +V+E W      W+  
Subjt:  LWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFR

Query:  PRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF-YSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQK
          RPL D E   W+ +   LP P   RG     W  + +  F T   +  +  AP  P  + P   +   LWK + PKK K FIW+L H  +NT DRLQK
Subjt:  PRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF-YSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQK

Query:  IFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSG
               +P+ C +C  S E I+HLFIHC     L +K  +AL   +  P  +QSL +++ +    +Q+ ++  N +   LW +W ERNNRIF+ + ++ 
Subjt:  IFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSG

Query:  IQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM
          LWED ++    W+ +SK FS+Y   +IALN  +F+
Subjt:  IQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.9e-0844.58Show/hide
Query:  GFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLW
        G+SFM RLK LA  +K W        +  K+  I EID+ID LE+ G   +I    R +LKADL Q  L EA+ W Q+CK++W
Subjt:  GFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLW

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]6.2e-10640.59Show/hide
Query:  WGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNL
        WG     LP  YLG+PLGG P ++ FW   ++KIQ+++ +W++  LSKGGR+TLI S L S+P+Y LSVFK P  I  ++E     FLW+G S+    +L
Subjt:  WGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNL

Query:  VRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWH
        +RW  V SPK +GGLGIH + STN ALL KW+W+F TE++ LW++ I +KY  +    FPS  ++SS+ SPW A++   S F+ N  W+V +G+ I FW 
Subjt:  VRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWH

Query:  DNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF
        DNW+   PL  V  RL+ LS+NK  +V++ W    + WN    RPL D E   W+ +   LP P   RG     W  + +  F T   +  L  A   P 
Subjt:  DNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF

Query:  -YSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLCED
         + P  ++   LWK D PKK K FIW+L H  +NT DRLQK       +P+ C +C  S E I+HLFIHC     L +K  QAL   +  P  ++SL ++
Subjt:  -YSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLCED

Query:  LLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM
        + +    +Q+ ++  N     LW +W ERNNRIF+ + +    LWED+++    W+ +SK FS+Y   +IALN  +F+
Subjt:  LLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.0e-0739.81Show/hide
Query:  KRIIKSLWSSISVNWIALDALGSSGGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYW
        KR I+  W + S    A        G+SFM RLK LA K+K W        +  K+  I EI+ ID LE+ G   +I    R +LKADL Q  L EA+ W
Subjt:  KRIIKSLWSSISVNWIALDALGSSGGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYW

Query:  NQRCKKLW
         Q+CK++W
Subjt:  NQRCKKLW

KAA0041397.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.0e-10540.38Show/hide
Query:  WGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNL
        WG     LP  YLG+PLGG P ++ FW   ++KIQ+++ +W++  LSKGGR+TLI S L S+P+Y LSVFK P  I  ++E     FLW+G S+    +L
Subjt:  WGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNL

Query:  VRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWH
        +RW  V SPK +GGLGIH + STN ALL KW+W+F TE++ LW++ I +KY  +    FPS  ++SS+ SPW A++   S F+ N  W+V +G+ I FW 
Subjt:  VRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWH

Query:  DNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF
        DNW+   PL     RL+ LS+NK  +V++ W    + WN    RPL D E   W+ +   LP P   RG     W  + +  F T   +  L  A   P 
Subjt:  DNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF

Query:  -YSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLCED
         + P  ++   LWK D PKK K FIW+L H  +NT DRLQK       +P+ C +C  S E I+HLFIHC     L +K  QAL   +  P  ++SL ++
Subjt:  -YSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLCED

Query:  LLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM
        + +    +Q+ ++  N     LW +W ERNNRIF+ + +    LWED+++    W+ +SK FS+Y   +IALN  +F+
Subjt:  LLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM

KAA0056839.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]8.9e-10540.75Show/hide
Query:  TTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNS
        T  IA F+G  +  LP+ YLGVPLGG P++  FW  TIE I ++++ W++  +SKGGRLTL+++ L+S+P Y LS FKAPVS+   +E+    FLW G+ 
Subjt:  TTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNS

Query:  HSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNG
            ++L+ W I +SPK  GGLGI K+K TN+ALL KW+WR+  E  SLW+K I AKY+ +H    P   R SS+ SPW AI K +  + +   W   +G
Subjt:  HSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNG

Query:  KSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLL
         S+ FWH  W    PL     RLY LS+ +S TV+E W      WN  PRRPL +RE Q+W+ +   LP   + RG     W  S+   ++   A+ +  
Subjt:  KSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLL

Query:  VAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPAT-
             P  +  E  L +LW++ IP+K K FIW++ H+ +NT D++QK       NPS C+ CR S+E ++HLFI C    F RN           P AT 
Subjt:  VAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPAT-

Query:  -IQSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALN
         ++ LC  L      + + I+  N +IA+LW +W  RNN IF DK  S +  WED+ +L   W+++SK   +YS +TIALN
Subjt:  -IQSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALN

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.3e-10340.5Show/hide
Query:  TTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNS
        T  IA F+G  +  LP+ YLGVPLGG P++  FW  TIE I ++++ W++  +SKGGRLTL+++ L+S+P Y LS FKAPVS+   +E+    FLW G+ 
Subjt:  TTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNS

Query:  HSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNG
            ++L+ W I +SPK  GGLGI K+K TN+ALL KW+WR+  E  SLW+K I AKY+ +H    P   R SS+ SPW AI K +  + +   W   +G
Subjt:  HSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNG

Query:  KSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLL
         S+ FWH  W    PL     RLY LS+ +S TV+E W      WN +PRRPL +RE Q+W+ +   LP   + RG     W  S+   ++   A+ +  
Subjt:  KSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLL

Query:  VAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATI
             P  +  E  L +LW++ IP+K K FIW++ H+ +NT D +QK       NPS C+ CR S+E ++HLFI C     L N      G   V    +
Subjt:  VAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATI

Query:  QSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALN
        + LC  L      + + I+  N +IA+LW +W  RNN IF DK  S +  WED+ +L   W+++SK   +YS +TIALN
Subjt:  QSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALN

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein1.6e-10738.18Show/hide
Query:  VLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNW
        +LI   DR D + ++  +  +  S    L  +L ++ +        R K      IA  WG     LP +YLG+PLGG P ++ FW   ++KIQ+++ NW
Subjt:  VLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNW

Query:  RFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKS
        ++  LSKGGR+TLI S L S+P+Y +SVFK P  I  ++E     FLW+G S+    +L+RW  + SPK +GGLGIH + STN ALL KW+W+F TE+  
Subjt:  RFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKS

Query:  LWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFR
        LW++ I +KY  +   SFPS  +FSS+ SPW A+++  S F+ N  W+V +G+ I FW DNW+   PL     RL+ LS+NK  +V+E W      W+  
Subjt:  LWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFR

Query:  PRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF-YSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQK
          RPL D E   W+ +   LP P   RG     W  + +  F T   +  +  AP  P  + P   +   LWK + PKK K FIW+L H  +NT DRLQK
Subjt:  PRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF-YSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQK

Query:  IFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSG
               +P+ C +C  S E I+HLFIHC     L +K  +AL   +  P  +QSL +++ +    +Q+ ++  N +   LW +W ERNNRIF+ + ++ 
Subjt:  IFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSG

Query:  IQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM
          LWED ++    W+ +SK FS+Y   +IALN  +F+
Subjt:  IQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein2.9e-0844.58Show/hide
Query:  GFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLW
        G+SFM RLK LA  +K W        +  K+  I EID+ID LE+ G   +I    R +LKADL Q  L EA+ W Q+CK++W
Subjt:  GFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLW

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein3.0e-10640.59Show/hide
Query:  WGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNL
        WG     LP  YLG+PLGG P ++ FW   ++KIQ+++ +W++  LSKGGR+TLI S L S+P+Y LSVFK P  I  ++E     FLW+G S+    +L
Subjt:  WGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNL

Query:  VRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWH
        +RW  V SPK +GGLGIH + STN ALL KW+W+F TE++ LW++ I +KY  +    FPS  ++SS+ SPW A++   S F+ N  W+V +G+ I FW 
Subjt:  VRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWH

Query:  DNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF
        DNW+   PL  V  RL+ LS+NK  +V++ W    + WN    RPL D E   W+ +   LP P   RG     W  + +  F T   +  L  A   P 
Subjt:  DNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF

Query:  -YSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLCED
         + P  ++   LWK D PKK K FIW+L H  +NT DRLQK       +P+ C +C  S E I+HLFIHC     L +K  QAL   +  P  ++SL ++
Subjt:  -YSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLCED

Query:  LLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM
        + +    +Q+ ++  N     LW +W ERNNRIF+ + +    LWED+++    W+ +SK FS+Y   +IALN  +F+
Subjt:  LLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein4.9e-0839.81Show/hide
Query:  KRIIKSLWSSISVNWIALDALGSSGGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYW
        KR I+  W + S    A        G+SFM RLK LA K+K W        +  K+  I EI+ ID LE+ G   +I    R +LKADL Q  L EA+ W
Subjt:  KRIIKSLWSSISVNWIALDALGSSGGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMGYLDDIASSLRKSLKADLQQTALLEARYW

Query:  NQRCKKLW
         Q+CK++W
Subjt:  NQRCKKLW

A0A5A7TIB8 LINE-1 retrotransposable element ORF2 protein1.9e-10540.38Show/hide
Query:  WGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNL
        WG     LP  YLG+PLGG P ++ FW   ++KIQ+++ +W++  LSKGGR+TLI S L S+P+Y LSVFK P  I  ++E     FLW+G S+    +L
Subjt:  WGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNL

Query:  VRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWH
        +RW  V SPK +GGLGIH + STN ALL KW+W+F TE++ LW++ I +KY  +    FPS  ++SS+ SPW A++   S F+ N  W+V +G+ I FW 
Subjt:  VRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWH

Query:  DNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF
        DNW+   PL     RL+ LS+NK  +V++ W    + WN    RPL D E   W+ +   LP P   RG     W  + +  F T   +  L  A   P 
Subjt:  DNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLVAPPRPF

Query:  -YSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLCED
         + P  ++   LWK D PKK K FIW+L H  +NT DRLQK       +P+ C +C  S E I+HLFIHC     L +K  QAL   +  P  ++SL ++
Subjt:  -YSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLCED

Query:  LLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM
        + +    +Q+ ++  N     LW +W ERNNRIF+ + +    LWED+++    W+ +SK FS+Y   +IALN  +F+
Subjt:  LLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein6.2e-10440.5Show/hide
Query:  TTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNS
        T  IA F+G  +  LP+ YLGVPLGG P++  FW  TIE I ++++ W++  +SKGGRLTL+++ L+S+P Y LS FKAPVS+   +E+    FLW G+ 
Subjt:  TTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNS

Query:  HSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNG
            ++L+ W I +SPK  GGLGI K+K TN+ALL KW+WR+  E  SLW+K I AKY+ +H    P   R SS+ SPW AI K +  + +   W   +G
Subjt:  HSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNG

Query:  KSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLL
         S+ FWH  W    PL     RLY LS+ +S TV+E W      WN +PRRPL +RE Q+W+ +   LP   + RG     W  S+   ++   A+ +  
Subjt:  KSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLL

Query:  VAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATI
             P  +  E  L +LW++ IP+K K FIW++ H+ +NT D +QK       NPS C+ CR S+E ++HLFI C     L N      G   V    +
Subjt:  VAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATI

Query:  QSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALN
        + LC  L      + + I+  N +IA+LW +W  RNN IF DK  S +  WED+ +L   W+++SK   +YS +TIALN
Subjt:  QSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALN

A0A5A7UTI6 LINE-1 retrotransposable element ORF2 protein4.3e-10540.75Show/hide
Query:  TTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNS
        T  IA F+G  +  LP+ YLGVPLGG P++  FW  TIE I ++++ W++  +SKGGRLTL+++ L+S+P Y LS FKAPVS+   +E+    FLW G+ 
Subjt:  TTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNS

Query:  HSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNG
            ++L+ W I +SPK  GGLGI K+K TN+ALL KW+WR+  E  SLW+K I AKY+ +H    P   R SS+ SPW AI K +  + +   W   +G
Subjt:  HSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNG

Query:  KSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLL
         S+ FWH  W    PL     RLY LS+ +S TV+E W      WN  PRRPL +RE Q+W+ +   LP   + RG     W  S+   ++   A+ +  
Subjt:  KSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLL

Query:  VAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPAT-
             P  +  E  L +LW++ IP+K K FIW++ H+ +NT D++QK       NPS C+ CR S+E ++HLFI C    F RN           P AT 
Subjt:  VAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPAT-

Query:  -IQSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALN
         ++ LC  L      + + I+  N +IA+LW +W  RNN IF DK  S +  WED+ +L   W+++SK   +YS +TIALN
Subjt:  -IQSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALN

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657501.1e-3631.3Show/hide
Query:  IEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLK
        +E++  R+  WR  +LS  GRLTL ++VL+SMP++ +S    P SI NR++Q+   FLW   +     +LV+W  V SPK EGGLG+   KS N AL+ K
Subjt:  IEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLK

Query:  WIWRFFTEEKSLWRKFISAKYS----SDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLT
          WR   E+ SLW   +  KY      D     P  S  S+ RS   AI  L+        W   +G+ I FW D W    PL  + D   + +   ++ 
Subjt:  WIWRFFTEEKSLWRKFISAKYS----SDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLT

Query:  VEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGS-DVHRWLASEDGSFSTKVARSVLLV-APPRPFYSPGETILNNLWKADIPKKIKVFI
         ++ W+   R W+F    P      +    +     + D   G+ D   W  S+DG FS + A  +L V   PRP  +   +  N LWK  +P+++K F+
Subjt:  VEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGS-DVHRWLASEDGSFSTKVARSVLLV-APPRPFYSPGETILNNLWKADIPKKIKVFI

Query:  WSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHC
        W + +++V T +      R      ++C +C+   ES+ H+   C
Subjt:  WSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHC

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.2e-2926.17Show/hide
Query:  DIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHS
        DI   +   S +LP+ YLG+PL      T  + P +EKI+ RI  W    LS  GRL LI SV++S+  + +S F+ P +    ++ I   FLW G   +
Subjt:  DIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHS

Query:  GPSNLVRWEIVSSPKAEGGLGIHKIKSTNE---------ALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANF
             V W  V +PK EGGLGI  +K  N+           L  W+W+   + ++L   F+                                       
Subjt:  GPSNLVRWEIVSSPKAEGGLGIHKIKSTNE---------ALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISKLQSPFFANF

Query:  RWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASED---GSF
        + ++ NG +  FW DNWS +G L  V      +    +L    A    + V N RPRR   D  ++   ++   +       G D  RW  + D     F
Subjt:  RWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASED---GSF

Query:  STKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHC
        +TK   +       +  +  G      +W +    K  V  W      + T DR+     G+    S C+LC    E+ DHLF  C
Subjt:  STKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHC

AT3G25270.1 Ribonuclease H-like superfamily protein1.8e-1031.45Show/hide
Query:  PGET-ILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGV----LSVPPATIQSLC
        PG+  I   +WK     KIK F+W L   ++ T D L+   R    N   C  C    E+  HLF  C    F   +V++A G+    L     T+++  
Subjt:  PGET-ILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGV----LSVPPATIQSLC

Query:  EDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDK----------VRSGIQLWED
        E LL     + RQ  L N++I  LW +W  RN  +FQ K           R+ +Q WED
Subjt:  EDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDK----------VRSGIQLWED

AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.4e-0732.74Show/hide
Query:  VSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHH---------NSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSI
        V  PKAEGGLG+      N  L LK +WR F+   SLW       +   HH         + F +S    S    W  + +L+       R  + NG + 
Subjt:  VSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHH---------NSFPSSSRFSSSRSPWFAISKLQSPFFANFRWEVRNGKSI

Query:  LFWHDNWSVLGPL
         FW DNW+  GPL
Subjt:  LFWHDNWSVLGPL

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.7e-0826.79Show/hide
Query:  EVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSL-TVEEAWLNLDRVW--NFRPRRPLFDREVQSWNEMTRLLPI--PDSFR-GSDVHRWLASEDGS
        EV +G +  FWHDNW  LGPL  V   L   +    +  V    L     W  + R R P+  +      E   LL     DSF   +D+H    +    
Subjt:  EVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSL-TVEEAWLNLDRVW--NFRPRRPLFDREVQSWNEMTRLLPI--PDSFR-GSDVHRWLASEDGS

Query:  FSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQA
        FS     S L    P+    P    +   +K  +PK   +  W +    ++T DRLQ         P+ CLLC    +S  HLF   C  S +  + + A
Subjt:  FSTKVARSVLLVAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQA

Query:  LGVLSVPPATIQSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDV
           L+ PPA +      LL+        +++R    + ++ +W ERN R+     RS   + +D+
Subjt:  LGVLSVPPATIQSLCEDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDV

AT4G29090.1 Ribonuclease H-like superfamily protein4.7e-1925.16Show/hide
Query:  SMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFP
        ++P Y ++ F  P ++C ++  +L  F W     +   +   W+ +S  KAEGG+G   I++ N ALL K +WR  +  +SL  K   ++Y    H S P
Subjt:  SMPLYVLSVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFP

Query:  SSSRFSSSRS-PWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGP------LKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQS
         ++   S  S  W +I   Q       R  V NG+ I+ W   W    P      ++ V  + Y  S +  L V +      R W       LF    + 
Subjt:  SSSRFSSSRS-PWFAISKLQSPFFANFRWEVRNGKSILFWHDNWSVLGP------LKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQS

Query:  WNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLV-----APPRPFYSPG-ETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYF
             R    P   R  D + W  +  G ++ K    VL       + P+    P    I   +WK+    KI+ F+W     S+     L   +R    
Subjt:  WNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKVARSVLLV-----APPRPFYSPG-ETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYF

Query:  NPSICLLCRMSSESIDHLFIHC
          S C+ C    E+++HL   C
Subjt:  NPSICLLCRMSSESIDHLFIHC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCCATTGAAACCATTTGATGAAGCCAATAATGCAAATTCCTCAGCTTCTTCTTCAATAGTCCGATCAAAATCCGCAGAAAGACGATGGGTTGTTTACATTACCAA
CATCATCGAAAAAGTACACGACGCTAAAACGGACATCTCTTCTTCGATCTTCATCATCCGAAAATTCCTCCAACATTCTCAACAATTCTTAGCTCTAGGCCCTTATCACC
ATTTTCTCATGGAAATTTCTTATGGAAATCTCGAGGGTGTTGTTGTGGATGATTTGTTTGGTTCTGTTTTGGTGGGTTTCTGCATGGAAATTTCTCCTTTGAATTATGAT
CTAATCGAGGTCCAAGAAGGCTATTATTCTAACGTGGATCGTGTTGTTGAATGTTCTCATATTTTGGACCTTTTGTACCGAATGATTTTGCCGGAGAAATTGGAATGCTC
CGACGGCGTTCGAAACGAGCCCACGGCGACCCAAAATTCTTCTAGTGGTCTCGTTCATTCCGGCGGCTCTTTGAAGGGTTTCTCGAAATCGATATTGGATAGCGTAAAAT
TGTTGAGGAATTTTAAGAAAGTTTATGAATTTATTGAACGTTTGATAGAGCTGCTTCCACTTGTTAGGGACTTGGTTTCATCCATGAAAGAATATTCCGACGACGATAGA
AATGGAATCGAAGTTGCAATTGAAGACAAAACGCCATTAATGGAAGCTGGCATCAAGACTATATCCTTCAATACCACTTATGCTCCAGCGAGGAAATTAACTCCAATGGG
TCGAAATTCTGAAAAGCCCAAATCCGGCAAAGGTAAGGAGATATATCGACCCAAATCCACTACTGGAATAAAAATATCTGAGCCCGCCCAAGAGAAAGTCAATCTCGTTG
CACCTTTTGGCCCAGTTGAGCCCGCCCACAAGAAAGTCAATCTCGTTGCTCCTTCTGGCCCAGTTTACCAAGATATTAATGGTGAAAAGTTTACTTTAAGTGTTGATTTG
GGCTCATTGTCCCCTATATCTGATGCACCCATTTCAAGTCCAGAAAATACTCCATCTCCAAAAGCCCACACTGTGATTGAACCTCCTTCAGCAATTATTAATGAAAGTCT
CAAATTTTTGGTTTCTCCGGACAAAATGGATAGTACGGGTGAAGACAGTTTGAATGGCACTCCTAGATTCAAGAACATTGAAGCAGTCATTGATGATAATAGTCCTCGGA
AGGATCCTCAAGAAGTTATAGAACATGGCAAGCCGAATGACGAAAGTTTCAAGAAGAAGCTCAACGACTGGCTGACTGAAAACGATTTTTGTCTTGTTCCTACTAAATCT
GTTTCGGGTTTATTTTGTAATACTTCTACTTCTGATGATCATGTAACAAGCCGTCTGGCTTTGGGGAATCTTGATTTTGTCATTCTTACTGAAACTAAATTGACTAACGT
CAGCAAGCGTATTATCAAATCGTTATGGAGTTCTATTAGTGTCAATTGGATTGCTCTGGACGCGCTTGGTTCTTCAGGAGGTTTCTCTTTTATGGGAAGACTTAAGCTTC
TAGCTCGGAAAGTCAAAGATTGGAAGTCTTCCAATTCCGAATCTTTCAAAGAAAAGAAAAGGGTCTTAATAACAGAAATAGATCGCATTGATTCGCTGGAATCCATGGGT
TATTTGGATGATATTGCTAGCTCTCTCAGAAAATCGCTAAAAGCTGATCTCCAGCAAACTGCTCTTTTAGAAGCTCGCTATTGGAATCAGCGTTGCAAAAAGCTCTGGAC
TACTGATATTGCTCGCTTTTGGGGTTGTTGCTCTCATTCTCTGCCAATTGCTTATCTTGGTGTCCCTTTAGGCGGCATTCCAAAAAATACTCAGTTTTGGGTGCCCACGA
TTGAGAAGATTCAGAGACGAATTCACAATTGGCGGTTTGTTTCTCTTTCTAAGGGAGGTCGTCTTACTCTTATTCAATCGGTTCTTAACAGTATGCCTCTATATGTTCTC
TCTGTGTTCAAAGCGCCGGTTTCTATATGCAACAGAGTCGAACAAATCCTTCATAAATTTCTTTGGGATGGAAATTCTCATTCAGGGCCCTCAAATTTAGTGAGATGGGA
AATCGTATCATCCCCAAAGGCAGAAGGCGGTTTGGGCATTCACAAAATCAAAAGCACGAATGAAGCTCTCCTCCTTAAATGGATATGGCGTTTTTTCACCGAGGAAAAAT
CTCTTTGGAGGAAATTCATAAGTGCCAAATATTCCAGCGATCATCACAATAGTTTTCCCTCTAGTAGCAGATTCTCTAGCTCCAGATCTCCGTGGTTTGCTATTTCAAAG
CTTCAGTCTCCTTTCTTCGCAAATTTCAGATGGGAGGTGCGCAACGGTAAATCCATTCTCTTTTGGCATGATAACTGGTCTGTTCTTGGTCCTTTGAAATATGTTAATGA
TCGCCTCTATCAGTTATCTTCAAACAAAAGTCTCACAGTTGAGGAAGCTTGGTTGAATTTGGATAGAGTATGGAATTTTCGTCCTCGTCGGCCTCTTTTTGATAGAGAGG
TTCAAAGTTGGAATGAGATGACTAGGCTTTTACCCATTCCAGATTCTTTTCGTGGTTCTGATGTTCATCGTTGGCTAGCTTCCGAAGACGGTTCCTTCTCCACAAAAGTT
GCTCGATCCGTCCTTTTGGTTGCTCCTCCTAGACCTTTTTATAGCCCTGGAGAAACAATTCTCAACAACCTTTGGAAAGCTGATATCCCTAAAAAAATAAAGGTTTTCAT
CTGGTCCCTTTTTCATAGAAGTGTCAACACGACTGACAGGCTTCAGAAAATTTTTCGAGGCTCGTACTTCAATCCTTCCATCTGTCTCCTTTGCAGAATGAGTTCCGAAA
GTATCGATCATCTATTCATTCATTGTTGCTGTGTGTCCTTTCTTCGAAACAAGGTTTACCAAGCTTTGGGTGTTCTTTCTGTTCCTCCGGCAACGATTCAATCCCTTTGC
GAAGATTTGCTTGCTTTCAAAGCTTCATCCCAAAGACAAATTCTTCTCCGAAATATTTCTATTGCCTCCCTTTGGATCGTGTGGAATGAACGCAACAATCGTATTTTTCA
GGACAAGGTTCGCAGCGGGATTCAACTTTGGGAGGATGTCATTTCTCTTGCCGCTTTTTGGGCTACAAGATCAAAAGCTTTCTCTGATTATTCTGCTTCTACTATTGCTT
TAAATTGGGAATCCTTTATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGCCATTGAAACCATTTGATGAAGCCAATAATGCAAATTCCTCAGCTTCTTCTTCAATAGTCCGATCAAAATCCGCAGAAAGACGATGGGTTGTTTACATTACCAA
CATCATCGAAAAAGTACACGACGCTAAAACGGACATCTCTTCTTCGATCTTCATCATCCGAAAATTCCTCCAACATTCTCAACAATTCTTAGCTCTAGGCCCTTATCACC
ATTTTCTCATGGAAATTTCTTATGGAAATCTCGAGGGTGTTGTTGTGGATGATTTGTTTGGTTCTGTTTTGGTGGGTTTCTGCATGGAAATTTCTCCTTTGAATTATGAT
CTAATCGAGGTCCAAGAAGGCTATTATTCTAACGTGGATCGTGTTGTTGAATGTTCTCATATTTTGGACCTTTTGTACCGAATGATTTTGCCGGAGAAATTGGAATGCTC
CGACGGCGTTCGAAACGAGCCCACGGCGACCCAAAATTCTTCTAGTGGTCTCGTTCATTCCGGCGGCTCTTTGAAGGGTTTCTCGAAATCGATATTGGATAGCGTAAAAT
TGTTGAGGAATTTTAAGAAAGTTTATGAATTTATTGAACGTTTGATAGAGCTGCTTCCACTTGTTAGGGACTTGGTTTCATCCATGAAAGAATATTCCGACGACGATAGA
AATGGAATCGAAGTTGCAATTGAAGACAAAACGCCATTAATGGAAGCTGGCATCAAGACTATATCCTTCAATACCACTTATGCTCCAGCGAGGAAATTAACTCCAATGGG
TCGAAATTCTGAAAAGCCCAAATCCGGCAAAGGTAAGGAGATATATCGACCCAAATCCACTACTGGAATAAAAATATCTGAGCCCGCCCAAGAGAAAGTCAATCTCGTTG
CACCTTTTGGCCCAGTTGAGCCCGCCCACAAGAAAGTCAATCTCGTTGCTCCTTCTGGCCCAGTTTACCAAGATATTAATGGTGAAAAGTTTACTTTAAGTGTTGATTTG
GGCTCATTGTCCCCTATATCTGATGCACCCATTTCAAGTCCAGAAAATACTCCATCTCCAAAAGCCCACACTGTGATTGAACCTCCTTCAGCAATTATTAATGAAAGTCT
CAAATTTTTGGTTTCTCCGGACAAAATGGATAGTACGGGTGAAGACAGTTTGAATGGCACTCCTAGATTCAAGAACATTGAAGCAGTCATTGATGATAATAGTCCTCGGA
AGGATCCTCAAGAAGTTATAGAACATGGCAAGCCGAATGACGAAAGTTTCAAGAAGAAGCTCAACGACTGGCTGACTGAAAACGATTTTTGTCTTGTTCCTACTAAATCT
GTTTCGGGTTTATTTTGTAATACTTCTACTTCTGATGATCATGTAACAAGCCGTCTGGCTTTGGGGAATCTTGATTTTGTCATTCTTACTGAAACTAAATTGACTAACGT
CAGCAAGCGTATTATCAAATCGTTATGGAGTTCTATTAGTGTCAATTGGATTGCTCTGGACGCGCTTGGTTCTTCAGGAGGTTTCTCTTTTATGGGAAGACTTAAGCTTC
TAGCTCGGAAAGTCAAAGATTGGAAGTCTTCCAATTCCGAATCTTTCAAAGAAAAGAAAAGGGTCTTAATAACAGAAATAGATCGCATTGATTCGCTGGAATCCATGGGT
TATTTGGATGATATTGCTAGCTCTCTCAGAAAATCGCTAAAAGCTGATCTCCAGCAAACTGCTCTTTTAGAAGCTCGCTATTGGAATCAGCGTTGCAAAAAGCTCTGGAC
TACTGATATTGCTCGCTTTTGGGGTTGTTGCTCTCATTCTCTGCCAATTGCTTATCTTGGTGTCCCTTTAGGCGGCATTCCAAAAAATACTCAGTTTTGGGTGCCCACGA
TTGAGAAGATTCAGAGACGAATTCACAATTGGCGGTTTGTTTCTCTTTCTAAGGGAGGTCGTCTTACTCTTATTCAATCGGTTCTTAACAGTATGCCTCTATATGTTCTC
TCTGTGTTCAAAGCGCCGGTTTCTATATGCAACAGAGTCGAACAAATCCTTCATAAATTTCTTTGGGATGGAAATTCTCATTCAGGGCCCTCAAATTTAGTGAGATGGGA
AATCGTATCATCCCCAAAGGCAGAAGGCGGTTTGGGCATTCACAAAATCAAAAGCACGAATGAAGCTCTCCTCCTTAAATGGATATGGCGTTTTTTCACCGAGGAAAAAT
CTCTTTGGAGGAAATTCATAAGTGCCAAATATTCCAGCGATCATCACAATAGTTTTCCCTCTAGTAGCAGATTCTCTAGCTCCAGATCTCCGTGGTTTGCTATTTCAAAG
CTTCAGTCTCCTTTCTTCGCAAATTTCAGATGGGAGGTGCGCAACGGTAAATCCATTCTCTTTTGGCATGATAACTGGTCTGTTCTTGGTCCTTTGAAATATGTTAATGA
TCGCCTCTATCAGTTATCTTCAAACAAAAGTCTCACAGTTGAGGAAGCTTGGTTGAATTTGGATAGAGTATGGAATTTTCGTCCTCGTCGGCCTCTTTTTGATAGAGAGG
TTCAAAGTTGGAATGAGATGACTAGGCTTTTACCCATTCCAGATTCTTTTCGTGGTTCTGATGTTCATCGTTGGCTAGCTTCCGAAGACGGTTCCTTCTCCACAAAAGTT
GCTCGATCCGTCCTTTTGGTTGCTCCTCCTAGACCTTTTTATAGCCCTGGAGAAACAATTCTCAACAACCTTTGGAAAGCTGATATCCCTAAAAAAATAAAGGTTTTCAT
CTGGTCCCTTTTTCATAGAAGTGTCAACACGACTGACAGGCTTCAGAAAATTTTTCGAGGCTCGTACTTCAATCCTTCCATCTGTCTCCTTTGCAGAATGAGTTCCGAAA
GTATCGATCATCTATTCATTCATTGTTGCTGTGTGTCCTTTCTTCGAAACAAGGTTTACCAAGCTTTGGGTGTTCTTTCTGTTCCTCCGGCAACGATTCAATCCCTTTGC
GAAGATTTGCTTGCTTTCAAAGCTTCATCCCAAAGACAAATTCTTCTCCGAAATATTTCTATTGCCTCCCTTTGGATCGTGTGGAATGAACGCAACAATCGTATTTTTCA
GGACAAGGTTCGCAGCGGGATTCAACTTTGGGAGGATGTCATTTCTCTTGCCGCTTTTTGGGCTACAAGATCAAAAGCTTTCTCTGATTATTCTGCTTCTACTATTGCTT
TAAATTGGGAATCCTTTATGTAG
Protein sequenceShow/hide protein sequence
MEPLKPFDEANNANSSASSSIVRSKSAERRWVVYITNIIEKVHDAKTDISSSIFIIRKFLQHSQQFLALGPYHHFLMEISYGNLEGVVVDDLFGSVLVGFCMEISPLNYD
LIEVQEGYYSNVDRVVECSHILDLLYRMILPEKLECSDGVRNEPTATQNSSSGLVHSGGSLKGFSKSILDSVKLLRNFKKVYEFIERLIELLPLVRDLVSSMKEYSDDDR
NGIEVAIEDKTPLMEAGIKTISFNTTYAPARKLTPMGRNSEKPKSGKGKEIYRPKSTTGIKISEPAQEKVNLVAPFGPVEPAHKKVNLVAPSGPVYQDINGEKFTLSVDL
GSLSPISDAPISSPENTPSPKAHTVIEPPSAIINESLKFLVSPDKMDSTGEDSLNGTPRFKNIEAVIDDNSPRKDPQEVIEHGKPNDESFKKKLNDWLTENDFCLVPTKS
VSGLFCNTSTSDDHVTSRLALGNLDFVILTETKLTNVSKRIIKSLWSSISVNWIALDALGSSGGFSFMGRLKLLARKVKDWKSSNSESFKEKKRVLITEIDRIDSLESMG
YLDDIASSLRKSLKADLQQTALLEARYWNQRCKKLWTTDIARFWGCCSHSLPIAYLGVPLGGIPKNTQFWVPTIEKIQRRIHNWRFVSLSKGGRLTLIQSVLNSMPLYVL
SVFKAPVSICNRVEQILHKFLWDGNSHSGPSNLVRWEIVSSPKAEGGLGIHKIKSTNEALLLKWIWRFFTEEKSLWRKFISAKYSSDHHNSFPSSSRFSSSRSPWFAISK
LQSPFFANFRWEVRNGKSILFWHDNWSVLGPLKYVNDRLYQLSSNKSLTVEEAWLNLDRVWNFRPRRPLFDREVQSWNEMTRLLPIPDSFRGSDVHRWLASEDGSFSTKV
ARSVLLVAPPRPFYSPGETILNNLWKADIPKKIKVFIWSLFHRSVNTTDRLQKIFRGSYFNPSICLLCRMSSESIDHLFIHCCCVSFLRNKVYQALGVLSVPPATIQSLC
EDLLAFKASSQRQILLRNISIASLWIVWNERNNRIFQDKVRSGIQLWEDVISLAAFWATRSKAFSDYSASTIALNWESFM