; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0011905 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0011905
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr10:7529229..7532369
RNA-Seq ExpressionPI0011905
SyntenyPI0011905
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR025558 - Domain of unknown function DUF4283
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039309.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.6e-19842.44Show/hide
Query:  KIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACG
        KI QNL+KQ E +F+   FH EK +++      ANLLC N   KGW+TVG + V+FEKW+  +H S KL  SYGGW +FRG+PLHLWN  TF+QIG ACG
Subjt:  KIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACG

Query:  GFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFFE
        G + VA++T   ++L+EAK+K++YNY+GF+PA + I D  G  FVVQ V  +E KWL ERNV +HGTFKRQA ASF+++N +SE + F G  A++PD   
Subjt:  GFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFFE

Query:  LSKSEALNLEVTTPEA-------------NLTNLTKTKTPNPSHYGKTDKTTKK------TEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKT
               ++    P A             + T L +    + S +   +K+  K       +  LDKGKQ +       S    +  KRKVSF SP NKT
Subjt:  LSKSEALNLEVTTPEA-------------NLTNLTKTKTPNPSHYGKTDKTTKK------TEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKT

Query:  FFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRI-KNKFNQAPDTCKEKA----AKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGD
         FFNP++APA H     S EK      E S  +K   ++   R  + K N      +  A    A K+G  LTVDLG L  L   ++ +  +S +  +  
Subjt:  FFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRI-KNKFNQAPDTCKEKA----AKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGD

Query:  SSPEETNTLQAQESQ--AQTELIKDSSDPMMT--SDEDECRITRVKGKCEEEE-----ENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSY
           + TNT    E+     T+  K +S P +     +   R      K E++E     E FK QL+ WLKEN LKL+   + +  ++S            
Subjt:  SSPEETNTLQAQESQ--AQTELIKDSSDPMMT--SDEDECRITRVKGKCEEEE-----ENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSY

Query:  SPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNL
                            N+L+S               GGILI+W+ Q H LL+  EG F++SAN   S  N+WW+TGLYG  KR++R  +W +L NL
Subjt:  SPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNL

Query:  HNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNN----------LIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPL
        H+L S+ W+IGGD NVVR   E+T +    H +N          LIDPPLTNNR+TWSNLR+ PT SRLDRFLY S WE+ F  H TRTL R TSDHFPL
Subjt:  HNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNN----------LIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPL

Query:  VLE--ASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSN
        V E   S +RWGP+PFRLN+ +L+DP+F RN+  WWE S   GHPGF F+QRLK+L+  +K WQ          K+ II+++D+IDK E    L+LE+SN
Subjt:  VLE--ASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSN

Query:  RRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKPI
        RR A K++L  +  K++Q W QR KK WL EGDEN+ +FH++CS+ Q+ N I EIQDE G    ++++I+    NHFS IY    K+  + IENL W PI
Subjt:  RRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKPI

Query:  DSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK
        D    + LCAPF E E+   I S    KAPGPDG+ + F+K YW ++KE+++ IF DF +KG+INK
Subjt:  DSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.1e-18740.93Show/hide
Query:  HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNAC
        ++I  +L+KQ E+ FS KPF  +KAIL L + + A LLCSN G+ GWSTVGN+ VKFE W S  H+   +  SYGGW  FRG+PLHLWNYNTF+ IG+AC
Subjt:  HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNAC

Query:  GGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFF
        GGF+ VAK+TM+   L++AKIKV+YNY GF+PA+I ITD+ GE F+V TV   E++WL ERNV +HG+F+ +A   F+++N  +E Y + G  A+ P+  
Subjt:  GGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFF

Query:  ELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTTKKTEKRLDKGKQLMVDDDDTD---SQQNKFSSKRKVSFTSPKN-KTFFFNPE-NAPAKHL
              +++    + + +++  T+ K  N S         + +++R +KGK +++ +D      S+++K  S RKVSF SP   ++   N E N   K L
Subjt:  ELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTTKKTEKRLDKGKQLMVDDDDTD---SQQNKFSSKRKVSFTSPKN-KTFFFNPE-NAPAKHL

Query:  CIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPD----TCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPE-ETNTLQAQE
         I +    N       S  QK K  K  YRIK    ++ +    + KE     +  +L+VD+G +SPL +   Q   N   +T  + +P+  + +  + E
Subjt:  CIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPD----TCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPE-ETNTLQAQE

Query:  SQAQTELIKDSSDPMMTSDEDECRITRVKGKCEEE---EENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKV
        ++  T  +K+ +D   ++            K   E   +  FK++L+IWLKEN LKL+P    ++PS           SSY P  VI+S+          
Subjt:  SQAQTELIKDSSDPMMTSDEDECRITRVKGKCEEE---EENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKV

Query:  INSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRW
               +++        G  GGIL++W+D   ++ +   G+++IS NI ++ GN WW+T +YG  K   R KLW EL+ L +LC  NWLI GDFN+VRW
Subjt:  INSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRW

Query:  NNETTTLNPGKHK----------NNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNN
          ET   +  K            N LIDPP  NN FTWSNLR  PT SRLDRFL +  WE  F  H +RTL R+ SDHFP++LE+  I+WGP PFRLNN+
Subjt:  NNETTTLNPGKHK----------NNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNN

Query:  SLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWA
        SL D +F +N   WW  SK  G PG+AF+Q L +LS+ +K WQ + +N  +  KK ++++ID IDKLE Q  ++     +R + KS L SI+  QAQ W 
Subjt:  SLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWA

Query:  QRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN
        QR +++W   GDEN +YFH++C+ NQR N I  I D  G +  S D I+    +HF +IY+++    +LI+NL+W PI  L  +ELC PFDE E+   I 
Subjt:  QRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN

Query:  SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIIN
        S S++KAPGPDGYT+ FYKK+WP +K++++ +F DFHK GI+N
Subjt:  SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIIN

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.0e-20244.02Show/hide
Query:  HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNAC
        HKI QNL+KQ E +F+   FH EKA+++      ANLLC N   KGWSTVG + V+FEKWS   H + KL  SYGGW +FRG+PLHLWN  TF+QIG AC
Subjt:  HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNAC

Query:  GGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFF
         G + VA++T   K+L+EA+IKV+YNY+GF+PAN+ I D+ G  F VQ V   E KWL ERNV +HGTFKRQA ASF+++N ESE + F G+ A++PDF 
Subjt:  GGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFF

Query:  ELS----------KSEALNLEVTTPEANLT--NLTKTKTPNPSHYGKTDKTTK-------KTEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNK
          S          +  AL   +  P+ N T  +    +  N S+   T   +K         +  LDKGKQ +      +S  N   SKRKVSF SP NK
Subjt:  ELS----------KSEALNLEVTTPEANLT--NLTKTKTPNPSHYGKTDKTTK-------KTEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNK

Query:  TFFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVK---KYYRIKNKFNQAP--DTCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKG
        T  FNP++APA H    +S EK      E S  +K    +   K  + K  F   P      ++ A K+G  LTVDLG L  L  D N+ + +       
Subjt:  TFFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVK---KYYRIKNKFNQAP--DTCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKG

Query:  DSSPEETNTLQAQESQAQTELIKDSSDPMMTSDEDECRITRVK----GKCEEEE-----ENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISS
        +   + TNT    E+      + ++S+    ++  + +    +     K EE+E     E FKKQL+ WLK+N LKL+   +    SS  +    ++++ 
Subjt:  DSSPEETNTLQAQESQAQTELIKDSSDPMMTSDEDECRITRVK----GKCEEEE-----ENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISS

Query:  YSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQN
         +    I        TNK++I SLW   SI W+  N+ G +GGILI+W+ Q H LL+  EG F++SAN   +  ++WW+TGLYG  KR++R   W EL N
Subjt:  YSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQN

Query:  LHNLCSTNWLIGGDFNVVRWNNETTTLNPGKH----------KNNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFP
        L +L S  W++GGD NV+R   E+T++    H           N LIDPPLTNNRFTWSNLR+ PT SR+DRFLY SSWE  F  H TRTL RSTSDHFP
Subjt:  LHNLCSTNWLIGGDFNVVRWNNETTTLNPGKH----------KNNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFP

Query:  LVLEASN--IRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDS
        LV E SN  + WGP PFRLN+ +LSDP+F RN+  WWE S   G+PGF+F+QRLK+L+  +K WQ   L+     K+ II+++D+IDK E    LT E+S
Subjt:  LVLEASN--IRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDS

Query:  NRRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKP
        NRR A K+ L  +  K++Q W QR KK WL EGDEN+++FH++CS+ Q+ + I EIQDE G    +++SI+      FS IY S  K   + IENL+W P
Subjt:  NRRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKP

Query:  IDSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYW
        I S   + LCAPF E E+   INS   KK PGPDG+ + F+K +W
Subjt:  IDSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYW

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.6e-19842.44Show/hide
Query:  KIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACG
        KI QNL+KQ E +F+   FH EK +++      ANLLC N   KGW+TVG + V+FEKW+  +H S KL  SYGGW +FRG+PLHLWN  TF+QIG ACG
Subjt:  KIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACG

Query:  GFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFFE
        G + VA++T   ++L+EAK+K++YNY+GF+PA + I D  G  FVVQ V  +E KWL ERNV +HGTFKRQA ASF+++N +SE + F G  A++PD   
Subjt:  GFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFFE

Query:  LSKSEALNLEVTTPEA-------------NLTNLTKTKTPNPSHYGKTDKTTKK------TEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKT
               ++    P A             + T L +    + S +   +K+  K       +  LDKGKQ +       S    +  KRKVSF SP NKT
Subjt:  LSKSEALNLEVTTPEA-------------NLTNLTKTKTPNPSHYGKTDKTTKK------TEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKT

Query:  FFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRI-KNKFNQAPDTCKEKA----AKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGD
         FFNP++APA H     S EK      E S  +K   ++   R  + K N      +  A    A K+G  LTVDLG L  L   ++ +  +S +  +  
Subjt:  FFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRI-KNKFNQAPDTCKEKA----AKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGD

Query:  SSPEETNTLQAQESQ--AQTELIKDSSDPMMT--SDEDECRITRVKGKCEEEE-----ENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSY
           + TNT    E+     T+  K +S P +     +   R      K E++E     E FK QL+ WLKEN LKL+   + +  ++S            
Subjt:  SSPEETNTLQAQESQ--AQTELIKDSSDPMMT--SDEDECRITRVKGKCEEEE-----ENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSY

Query:  SPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNL
                            N+L+S               GGILI+W+ Q H LL+  EG F++SAN   S  N+WW+TGLYG  KR++R  +W +L NL
Subjt:  SPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNL

Query:  HNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNN----------LIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPL
        H+L S+ W+IGGD NVVR   E+T +    H +N          LIDPPLTNNR+TWSNLR+ PT SRLDRFLY S WE+ F  H TRTL R TSDHFPL
Subjt:  HNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNN----------LIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPL

Query:  VLE--ASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSN
        V E   S +RWGP+PFRLN+ +L+DP+F RN+  WWE S   GHPGF F+QRLK+L+  +K WQ          K+ II+++D+IDK E    L+LE+SN
Subjt:  VLE--ASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSN

Query:  RRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKPI
        RR A K++L  +  K++Q W QR KK WL EGDEN+ +FH++CS+ Q+ N I EIQDE G    ++++I+    NHFS IY    K+  + IENL W PI
Subjt:  RRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKPI

Query:  DSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK
        D    + LCAPF E E+   I S    KAPGPDG+ + F+K YW ++KE+++ IF DF +KG+INK
Subjt:  DSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.1e-18740.93Show/hide
Query:  HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNAC
        ++I  +L+KQ E+ FS KPF  +KAIL L + + A LLCSN G+ GWSTVGN+ VKFE W S  H+   +  SYGGW  FRG+PLHLWNYNTF+ IG+AC
Subjt:  HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNAC

Query:  GGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFF
        GGF+ VAK+TM+   L++AKIKV+YNY GF+PA+I ITD+ GE F+V TV   E++WL ERNV +HG+F+ +A   F+++N  +E Y + G  A+ P+  
Subjt:  GGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFF

Query:  ELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTTKKTEKRLDKGKQLMVDDDDTD---SQQNKFSSKRKVSFTSPKN-KTFFFNPE-NAPAKHL
              +++    + + +++  T+ K  N S         + +++R +KGK +++ +D      S+++K  S RKVSF SP   ++   N E N   K L
Subjt:  ELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTTKKTEKRLDKGKQLMVDDDDTD---SQQNKFSSKRKVSFTSPKN-KTFFFNPE-NAPAKHL

Query:  CIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPD----TCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPE-ETNTLQAQE
         I +    N       S  QK K  K  YRIK    ++ +    + KE     +  +L+VD+G +SPL +   Q   N   +T  + +P+  + +  + E
Subjt:  CIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPD----TCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPE-ETNTLQAQE

Query:  SQAQTELIKDSSDPMMTSDEDECRITRVKGKCEEE---EENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKV
        ++  T  +K+ +D   ++            K   E   +  FK++L+IWLKEN LKL+P    ++PS           SSY P  VI+S+          
Subjt:  SQAQTELIKDSSDPMMTSDEDECRITRVKGKCEEE---EENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKV

Query:  INSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRW
               +++        G  GGIL++W+D   ++ +   G+++IS NI ++ GN WW+T +YG  K   R KLW EL+ L +LC  NWLI GDFN+VRW
Subjt:  INSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRW

Query:  NNETTTLNPGKHK----------NNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNN
          ET   +  K            N LIDPP  NN FTWSNLR  PT SRLDRFL +  WE  F  H +RTL R+ SDHFP++LE+  I+WGP PFRLNN+
Subjt:  NNETTTLNPGKHK----------NNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNN

Query:  SLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWA
        SL D +F +N   WW  SK  G PG+AF+Q L +LS+ +K WQ + +N  +  KK ++++ID IDKLE Q  ++     +R + KS L SI+  QAQ W 
Subjt:  SLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWA

Query:  QRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN
        QR +++W   GDEN +YFH++C+ NQR N I  I D  G +  S D I+    +HF +IY+++    +LI+NL+W PI  L  +ELC PFDE E+   I 
Subjt:  QRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN

Query:  SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIIN
        S S++KAPGPDGYT+ FYKK+WP +K++++ +F DFHK GI+N
Subjt:  SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIIN

TrEMBL top hitse value%identityAlignment
A0A5A7TDG1 LINE-1 retrotransposable element ORF2 protein7.5e-19942.44Show/hide
Query:  KIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACG
        KI QNL+KQ E +F+   FH EK +++      ANLLC N   KGW+TVG + V+FEKW+  +H S KL  SYGGW +FRG+PLHLWN  TF+QIG ACG
Subjt:  KIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACG

Query:  GFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFFE
        G + VA++T   ++L+EAK+K++YNY+GF+PA + I D  G  FVVQ V  +E KWL ERNV +HGTFKRQA ASF+++N +SE + F G  A++PD   
Subjt:  GFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFFE

Query:  LSKSEALNLEVTTPEA-------------NLTNLTKTKTPNPSHYGKTDKTTKK------TEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKT
               ++    P A             + T L +    + S +   +K+  K       +  LDKGKQ +       S    +  KRKVSF SP NKT
Subjt:  LSKSEALNLEVTTPEA-------------NLTNLTKTKTPNPSHYGKTDKTTKK------TEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKT

Query:  FFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRI-KNKFNQAPDTCKEKA----AKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGD
         FFNP++APA H     S EK      E S  +K   ++   R  + K N      +  A    A K+G  LTVDLG L  L   ++ +  +S +  +  
Subjt:  FFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRI-KNKFNQAPDTCKEKA----AKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGD

Query:  SSPEETNTLQAQESQ--AQTELIKDSSDPMMT--SDEDECRITRVKGKCEEEE-----ENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSY
           + TNT    E+     T+  K +S P +     +   R      K E++E     E FK QL+ WLKEN LKL+   + +  ++S            
Subjt:  SSPEETNTLQAQESQ--AQTELIKDSSDPMMT--SDEDECRITRVKGKCEEEE-----ENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSY

Query:  SPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNL
                            N+L+S               GGILI+W+ Q H LL+  EG F++SAN   S  N+WW+TGLYG  KR++R  +W +L NL
Subjt:  SPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNL

Query:  HNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNN----------LIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPL
        H+L S+ W+IGGD NVVR   E+T +    H +N          LIDPPLTNNR+TWSNLR+ PT SRLDRFLY S WE+ F  H TRTL R TSDHFPL
Subjt:  HNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNN----------LIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPL

Query:  VLE--ASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSN
        V E   S +RWGP+PFRLN+ +L+DP+F RN+  WWE S   GHPGF F+QRLK+L+  +K WQ          K+ II+++D+IDK E    L+LE+SN
Subjt:  VLE--ASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSN

Query:  RRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKPI
        RR A K++L  +  K++Q W QR KK WL EGDEN+ +FH++CS+ Q+ N I EIQDE G    ++++I+    NHFS IY    K+  + IENL W PI
Subjt:  RRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKPI

Query:  DSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK
        D    + LCAPF E E+   I S    KAPGPDG+ + F+K YW ++KE+++ IF DF +KG+INK
Subjt:  DSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein1.0e-18740.93Show/hide
Query:  HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNAC
        ++I  +L+KQ E+ FS KPF  +KAIL L + + A LLCSN G+ GWSTVGN+ VKFE W S  H+   +  SYGGW  FRG+PLHLWNYNTF+ IG+AC
Subjt:  HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNAC

Query:  GGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFF
        GGF+ VAK+TM+   L++AKIKV+YNY GF+PA+I ITD+ GE F+V TV   E++WL ERNV +HG+F+ +A   F+++N  +E Y + G  A+ P+  
Subjt:  GGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFF

Query:  ELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTTKKTEKRLDKGKQLMVDDDDTD---SQQNKFSSKRKVSFTSPKN-KTFFFNPE-NAPAKHL
              +++    + + +++  T+ K  N S         + +++R +KGK +++ +D      S+++K  S RKVSF SP   ++   N E N   K L
Subjt:  ELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTTKKTEKRLDKGKQLMVDDDDTD---SQQNKFSSKRKVSFTSPKN-KTFFFNPE-NAPAKHL

Query:  CIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPD----TCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPE-ETNTLQAQE
         I +    N       S  QK K  K  YRIK    ++ +    + KE     +  +L+VD+G +SPL +   Q   N   +T  + +P+  + +  + E
Subjt:  CIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPD----TCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPE-ETNTLQAQE

Query:  SQAQTELIKDSSDPMMTSDEDECRITRVKGKCEEE---EENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKV
        ++  T  +K+ +D   ++            K   E   +  FK++L+IWLKEN LKL+P    ++PS           SSY P  VI+S+          
Subjt:  SQAQTELIKDSSDPMMTSDEDECRITRVKGKCEEE---EENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKV

Query:  INSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRW
               +++        G  GGIL++W+D   ++ +   G+++IS NI ++ GN WW+T +YG  K   R KLW EL+ L +LC  NWLI GDFN+VRW
Subjt:  INSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRW

Query:  NNETTTLNPGKHK----------NNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNN
          ET   +  K            N LIDPP  NN FTWSNLR  PT SRLDRFL +  WE  F  H +RTL R+ SDHFP++LE+  I+WGP PFRLNN+
Subjt:  NNETTTLNPGKHK----------NNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNN

Query:  SLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWA
        SL D +F +N   WW  SK  G PG+AF+Q L +LS+ +K WQ + +N  +  KK ++++ID IDKLE Q  ++     +R + KS L SI+  QAQ W 
Subjt:  SLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWA

Query:  QRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN
        QR +++W   GDEN +YFH++C+ NQR N I  I D  G +  S D I+    +HF +IY+++    +LI+NL+W PI  L  +ELC PFDE E+   I 
Subjt:  QRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN

Query:  SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIIN
        S S++KAPGPDGYT+ FYKK+WP +K++++ +F DFHK GI+N
Subjt:  SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIIN

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein7.5e-19942.44Show/hide
Query:  KIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACG
        KI QNL+KQ E +F+   FH EK +++      ANLLC N   KGW+TVG + V+FEKW+  +H S KL  SYGGW +FRG+PLHLWN  TF+QIG ACG
Subjt:  KIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACG

Query:  GFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFFE
        G + VA++T   ++L+EAK+K++YNY+GF+PA + I D  G  FVVQ V  +E KWL ERNV +HGTFKRQA ASF+++N +SE + F G  A++PD   
Subjt:  GFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFFE

Query:  LSKSEALNLEVTTPEA-------------NLTNLTKTKTPNPSHYGKTDKTTKK------TEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKT
               ++    P A             + T L +    + S +   +K+  K       +  LDKGKQ +       S    +  KRKVSF SP NKT
Subjt:  LSKSEALNLEVTTPEA-------------NLTNLTKTKTPNPSHYGKTDKTTKK------TEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKT

Query:  FFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRI-KNKFNQAPDTCKEKA----AKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGD
         FFNP++APA H     S EK      E S  +K   ++   R  + K N      +  A    A K+G  LTVDLG L  L   ++ +  +S +  +  
Subjt:  FFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRI-KNKFNQAPDTCKEKA----AKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGD

Query:  SSPEETNTLQAQESQ--AQTELIKDSSDPMMT--SDEDECRITRVKGKCEEEE-----ENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSY
           + TNT    E+     T+  K +S P +     +   R      K E++E     E FK QL+ WLKEN LKL+   + +  ++S            
Subjt:  SSPEETNTLQAQESQ--AQTELIKDSSDPMMT--SDEDECRITRVKGKCEEEE-----ENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSY

Query:  SPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNL
                            N+L+S               GGILI+W+ Q H LL+  EG F++SAN   S  N+WW+TGLYG  KR++R  +W +L NL
Subjt:  SPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNL

Query:  HNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNN----------LIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPL
        H+L S+ W+IGGD NVVR   E+T +    H +N          LIDPPLTNNR+TWSNLR+ PT SRLDRFLY S WE+ F  H TRTL R TSDHFPL
Subjt:  HNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNN----------LIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPL

Query:  VLE--ASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSN
        V E   S +RWGP+PFRLN+ +L+DP+F RN+  WWE S   GHPGF F+QRLK+L+  +K WQ          K+ II+++D+IDK E    L+LE+SN
Subjt:  VLE--ASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSN

Query:  RRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKPI
        RR A K++L  +  K++Q W QR KK WL EGDEN+ +FH++CS+ Q+ N I EIQDE G    ++++I+    NHFS IY    K+  + IENL W PI
Subjt:  RRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKPI

Query:  DSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK
        D    + LCAPF E E+   I S    KAPGPDG+ + F+K YW ++KE+++ IF DF +KG+INK
Subjt:  DSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein5.0e-20344.02Show/hide
Query:  HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNAC
        HKI QNL+KQ E +F+   FH EKA+++      ANLLC N   KGWSTVG + V+FEKWS   H + KL  SYGGW +FRG+PLHLWN  TF+QIG AC
Subjt:  HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNAC

Query:  GGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFF
         G + VA++T   K+L+EA+IKV+YNY+GF+PAN+ I D+ G  F VQ V   E KWL ERNV +HGTFKRQA ASF+++N ESE + F G+ A++PDF 
Subjt:  GGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFF

Query:  ELS----------KSEALNLEVTTPEANLT--NLTKTKTPNPSHYGKTDKTTK-------KTEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNK
          S          +  AL   +  P+ N T  +    +  N S+   T   +K         +  LDKGKQ +      +S  N   SKRKVSF SP NK
Subjt:  ELS----------KSEALNLEVTTPEANLT--NLTKTKTPNPSHYGKTDKTTK-------KTEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNK

Query:  TFFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVK---KYYRIKNKFNQAP--DTCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKG
        T  FNP++APA H    +S EK      E S  +K    +   K  + K  F   P      ++ A K+G  LTVDLG L  L  D N+ + +       
Subjt:  TFFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVK---KYYRIKNKFNQAP--DTCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKG

Query:  DSSPEETNTLQAQESQAQTELIKDSSDPMMTSDEDECRITRVK----GKCEEEE-----ENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISS
        +   + TNT    E+      + ++S+    ++  + +    +     K EE+E     E FKKQL+ WLK+N LKL+   +    SS  +    ++++ 
Subjt:  DSSPEETNTLQAQESQAQTELIKDSSDPMMTSDEDECRITRVK----GKCEEEE-----ENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISS

Query:  YSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQN
         +    I        TNK++I SLW   SI W+  N+ G +GGILI+W+ Q H LL+  EG F++SAN   +  ++WW+TGLYG  KR++R   W EL N
Subjt:  YSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQN

Query:  LHNLCSTNWLIGGDFNVVRWNNETTTLNPGKH----------KNNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFP
        L +L S  W++GGD NV+R   E+T++    H           N LIDPPLTNNRFTWSNLR+ PT SR+DRFLY SSWE  F  H TRTL RSTSDHFP
Subjt:  LHNLCSTNWLIGGDFNVVRWNNETTTLNPGKH----------KNNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFP

Query:  LVLEASN--IRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDS
        LV E SN  + WGP PFRLN+ +LSDP+F RN+  WWE S   G+PGF+F+QRLK+L+  +K WQ   L+     K+ II+++D+IDK E    LT E+S
Subjt:  LVLEASN--IRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDS

Query:  NRRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKP
        NRR A K+ L  +  K++Q W QR KK WL EGDEN+++FH++CS+ Q+ + I EIQDE G    +++SI+      FS IY S  K   + IENL+W P
Subjt:  NRRTAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIY-SEDKRGTMLIENLNWKP

Query:  IDSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYW
        I S   + LCAPF E E+   INS   KK PGPDG+ + F+K +W
Subjt:  IDSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYW

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein1.0e-18740.93Show/hide
Query:  HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNAC
        ++I  +L+KQ E+ FS KPF  +KAIL L + + A LLCSN G+ GWSTVGN+ VKFE W S  H+   +  SYGGW  FRG+PLHLWNYNTF+ IG+AC
Subjt:  HKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNAC

Query:  GGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFF
        GGF+ VAK+TM+   L++AKIKV+YNY GF+PA+I ITD+ GE F+V TV   E++WL ERNV +HG+F+ +A   F+++N  +E Y + G  A+ P+  
Subjt:  GGFVAVAKDTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFF

Query:  ELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTTKKTEKRLDKGKQLMVDDDDTD---SQQNKFSSKRKVSFTSPKN-KTFFFNPE-NAPAKHL
              +++    + + +++  T+ K  N S         + +++R +KGK +++ +D      S+++K  S RKVSF SP   ++   N E N   K L
Subjt:  ELSKSEALNLEVTTPEANLTNLTKTKTPNPSHYGKTDKTTKKTEKRLDKGKQLMVDDDDTD---SQQNKFSSKRKVSFTSPKN-KTFFFNPE-NAPAKHL

Query:  CIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPD----TCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPE-ETNTLQAQE
         I +    N       S  QK K  K  YRIK    ++ +    + KE     +  +L+VD+G +SPL +   Q   N   +T  + +P+  + +  + E
Subjt:  CIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKNKFNQAPD----TCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPE-ETNTLQAQE

Query:  SQAQTELIKDSSDPMMTSDEDECRITRVKGKCEEE---EENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKV
        ++  T  +K+ +D   ++            K   E   +  FK++L+IWLKEN LKL+P    ++PS           SSY P  VI+S+          
Subjt:  SQAQTELIKDSSDPMMTSDEDECRITRVKGKCEEE---EENFKKQLIIWLKENNLKLAPSLNQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKV

Query:  INSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRW
               +++        G  GGIL++W+D   ++ +   G+++IS NI ++ GN WW+T +YG  K   R KLW EL+ L +LC  NWLI GDFN+VRW
Subjt:  INSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITGLYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRW

Query:  NNETTTLNPGKHK----------NNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNN
          ET   +  K            N LIDPP  NN FTWSNLR  PT SRLDRFL +  WE  F  H +RTL R+ SDHFP++LE+  I+WGP PFRLNN+
Subjt:  NNETTTLNPGKHK----------NNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASNIRWGPSPFRLNNN

Query:  SLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWA
        SL D +F +N   WW  SK  G PG+AF+Q L +LS+ +K WQ + +N  +  KK ++++ID IDKLE Q  ++     +R + KS L SI+  QAQ W 
Subjt:  SLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQAQCWA

Query:  QRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN
        QR +++W   GDEN +YFH++C+ NQR N I  I D  G +  S D I+    +HF +IY+++    +LI+NL+W PI  L  +ELC PFDE E+   I 
Subjt:  QRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAIN

Query:  SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIIN
        S S++KAPGPDGYT+ FYKK+WP +K++++ +F DFHK GI+N
Subjt:  SISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIIN

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein9.4e-1323.2Show/hide
Query:  LQNLHNLCSTNWLIGGDFNV----------VRWNNETTTLNPGKHKNNLIDPPLT-NNRFTWSNLRSQP--TCSRLDRFLYTSSWELCFKEHYTRTLSRS
        L +L     ++ LI GDFN            + N +T  LN   H+ +LID   T + + T     S P  T S++D  +   S  L  K   T  ++  
Subjt:  LQNLHNLCSTNWLIGGDFNV----------VRWNNETTTLNPGKHKNNLIDPPLT-NNRFTWSNLRSQP--TCSRLDRFLYTSSWELCFKEHYTRTLSRS

Query:  TSDHFPLVLE--ASNIRWGPS-PFRLNNNSLSD----PDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLN--KQEDEKKRIIQQIDNID
         SDH  + LE    N+    S  ++LNN  L+D     +    I+ ++E +++         Q L    + +   +F  LN  K++ E+ +I      + 
Subjt:  TSDHFPLVLE--ASNIRWGPS-PFRLNNNSLSD----PDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLN--KQEDEKKRIIQQIDNID

Query:  KLEKQNLLTLEDSNRR--TAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSED
        +LEKQ     + S R+  T  +++L  I+ ++       ++  +    ++      ++    +  N I  I+++ G        I   +  ++ H+Y+  
Subjt:  KLEKQNLLTLEDSNRR--TAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSED

Query:  KRG---------TMLIENLNWKPIDSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGII
                    T  +  LN + ++SL+      P    E++  INS+  KK+PGPDG+T +FY++Y   +   ++++F    K+GI+
Subjt:  KRG---------TMLIENLNWKPIDSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGII

P08548 LINE-1 reverse transcriptase homolog1.4e-0519.86Show/hide
Query:  SDHFPLVLEAS---NIRWGPSPFRLNNNSLSD----PDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNK--QEDEKKRIIQQIDNIDK
        SDH  + +E +   N+      ++LNN  L D     +  + I  + E + +         Q L   ++ +   +F  L    ++ E++ +   + ++ +
Subjt:  SDHFPLVLEAS---NIRWGPSPFRLNNNSLSD----PDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNK--QEDEKKRIIQQIDNIDK

Query:  LEKQNLLTLEDSNRR--TAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNH----FSHIY
        LEK+     + S R+  T  +++L  I+ K+      ++K  +  + ++       +    +  + IS I++ N         I  +L  +    +SH Y
Subjt:  LEKQNLLTLEDSNRR--TAFKSKLCSIDFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNH----FSHIY

Query:  SEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGII
           K     +E  +   +       L  P    E+   I ++  KK+PGPDG+T +FY+ +   +   ++ +F +  K+GI+
Subjt:  SEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAINSISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGII

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein6.2e-2024.02Show/hide
Query:  NNLIDPPLTNNRFTWSNLR-SQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASNI-RWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHL
        ++L+D P     +TWSN +   P   +LDR +    W   F            SDH P ++   N+ +     FR  +   + P F  ++   WE    +
Subjt:  NNLIDPPLTNNRFTWSNLR-SQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPLVLEASNI-RWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHL

Query:  GHPGFAFVQRLKTLSRTLKNWQFSLLNKQ--EDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQA--QCWAQRTKKQWLNEGDENTTY
        G   F+  + LK   +  K     LLN+Q   + + +  + +D+++ ++ Q L    DS  R    ++     F  A    + Q+++ +WL +GD NT +
Subjt:  GHPGFAFVQRLKTLSRTLKNWQFSLLNKQ--EDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSIDFKQA--QCWAQRTKKQWLNEGDENTTY

Query:  FHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSEDK-----RGTMLIENLN-WKPIDSLHHNELCAPFDEIEVLQAINSISDKKAPGPD
        FHKV  ANQ  N I  ++ ++ +   +   +  ++  +++H+   D           I++++ ++  D+L       P D+ E+  A+ ++   KAPGPD
Subjt:  FHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSEDK-----RGTMLIENLN-WKPIDSLHHNELCAPFDEIEVLQAINSISDKKAPGPD

Query:  GYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK
         +T +F+ + W +VK+  +    +F + G + K
Subjt:  GYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGCATAAAATTAAGCAGAATCTCAAAAAACAGATTGAAGTCAACTTCTCTGTTAAACCATTCCACCCAGAAAAAGCCATACTAAACCTTCAGGATACAGAGCAAGC
AAATCTCCTATGTAGCAACAATGGAAGCAAAGGATGGTCCACAGTGGGAAATTTTTTTGTTAAGTTCGAAAAATGGTCATCAACAAACCATACGTCTCAGAAACTCTTCC
TTAGCTATGGTGGATGGAATTCCTTTAGAGGAGTTCCCCTTCATCTTTGGAATTACAATACTTTTAAGCAGATTGGAAATGCATGTGGGGGTTTTGTCGCTGTTGCTAAA
GATACAATGGAGAAAAAGGACCTGATGGAAGCAAAAATCAAAGTCAAGTATAACTATACAGGATTCATTCCAGCAAACATTGGAATAACTGACGATAATGGAGAACTTTT
TGTGGTCCAAACTGTTTGCAAGACGGAGAGTAAGTGGCTCAAAGAGAGAAATGTTGACATGCATGGAACTTTCAAAAGGCAAGCGACTGCTAGCTTCAATGAATACAATT
CAGAATCGGAAATGTATCACTTCACCGGAAATGTTGCAGTTACACCGGATTTTTTTGAACTTTCAAAATCCGAAGCATTAAATTTGGAAGTAACTACACCTGAAGCAAAT
CTCACAAATCTCACAAAAACCAAGACTCCAAATCCTAGCCATTATGGAAAGACTGACAAAACCACAAAAAAAACAGAAAAACGGCTGGACAAAGGGAAACAGTTAATGGT
GGATGATGATGATACTGACAGCCAGCAAAACAAATTCAGCTCGAAAAGAAAGGTATCATTTACCTCACCAAAAAACAAAACTTTTTTTTTCAATCCTGAAAATGCTCCAG
CTAAGCACCTCTGCATTAAAAGTTCGTTGGAAAAAAATAGCAGTGGGCCCTCTGAAGATTCTTCAATGCAAAAAAGAAAATTTGTGAAAAAATACTACAGAATCAAAAAC
AAATTTAACCAAGCTCCTGACACATGTAAAGAAAAGGCAGCAAAGAAAGAAGGCTATCATCTAACAGTGGACTTAGGACAGCTGTCTCCTTTAAAGAAGGATCAAAATCA
ACAAATTGTTAACAGTCAGGAAGAAACAAAAGGAGATTCAAGTCCTGAAGAAACAAATACTTTGCAAGCACAAGAATCACAAGCTCAAACAGAATTGATCAAGGACAGCT
CTGACCCTATGATGACAAGTGATGAAGATGAATGCAGGATCACAAGAGTAAAAGGAAAATGTGAGGAAGAGGAGGAAAACTTCAAAAAACAGTTGATAATCTGGCTTAAA
GAAAATAACTTAAAGTTAGCTCCATCTCTAAACCAAAACTTGCCAAGCTCATCAAACAGTGAACAAGTGAGGATCATTATTTCCTCTTACTCGCCGGATTTTGTTATTTT
ATCTGAAACGAAACGCATTTCAACAAACAAAAAAGTAATCAATTCTCTATGGTCTCTCAAAAGCATAAAGTGGTTAAATGTCAATTCAAGAGGAAGAACTGGAGGCATTC
TAATTATGTGGAACGACCAAAGACATAGGCTGTTAAATAGTTTTGAAGGAGATTTCACAATTTCTGCAAATATACAAGACTCCTTAGGCAACACTTGGTGGATCACAGGC
TTATATGGTCATGCTAAAAGAAAACAGAGAAACAAATTATGGATTGAACTTCAAAATCTTCACAATCTTTGTTCTACAAACTGGCTGATTGGAGGAGACTTTAATGTGGT
GAGATGGAACAATGAGACTACAACACTGAACCCAGGGAAACACAAAAATAATTTGATTGATCCTCCTCTCACAAACAACAGATTCACTTGGTCAAATCTTAGAAGTCAAC
CAACTTGTTCTAGGCTGGATAGATTCCTTTATACCAGCTCTTGGGAATTATGCTTTAAAGAACATTATACAAGGACTCTATCAAGATCCACTTCAGATCACTTTCCTCTT
GTCCTGGAAGCTTCAAACATTCGTTGGGGCCCATCACCCTTCAGGCTAAACAATAACTCCCTCTCTGATCCAGATTTCAATAGAAATATAAGGGGTTGGTGGGAAGGATC
AAAACATCTTGGTCATCCTGGTTTTGCTTTTGTACAAAGACTAAAGACTCTCTCCAGAACTTTAAAGAATTGGCAGTTCAGTCTTTTAAATAAACAGGAGGATGAAAAAA
AGAGAATTATTCAGCAGATTGACAACATTGACAAGCTAGAAAAGCAGAATCTTTTAACTTTGGAGGACAGCAACAGAAGAACGGCTTTCAAATCTAAGCTTTGTTCTATT
GATTTCAAACAAGCTCAATGTTGGGCCCAACGAACAAAGAAGCAATGGCTCAACGAAGGGGATGAAAATACAACTTATTTTCATAAGGTGTGCTCAGCAAATCAAAGAAC
AAATTGTATATCAGAGATTCAAGATGAAAATGGTTTGACTCATTGTTCAAGTGATTCCATTGCTGGAGTCTTAACCAATCATTTCAGCCATATTTACTCTGAAGACAAGA
GAGGCACAATGTTGATTGAAAACCTGAACTGGAAACCAATTGACTCATTGCATCACAATGAGTTATGTGCTCCCTTTGATGAAATTGAAGTATTACAAGCCATCAACTCT
ATTAGTGACAAAAAGGCCCCTGGACCTGATGGTTACACAGTGAAATTCTACAAAAAATATTGGCCCATGGTCAAAGAGGAGGTGATGCAAATTTTCAACGACTTTCATAA
AAAAGGCATCATCAACAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGCATAAAATTAAGCAGAATCTCAAAAAACAGATTGAAGTCAACTTCTCTGTTAAACCATTCCACCCAGAAAAAGCCATACTAAACCTTCAGGATACAGAGCAAGC
AAATCTCCTATGTAGCAACAATGGAAGCAAAGGATGGTCCACAGTGGGAAATTTTTTTGTTAAGTTCGAAAAATGGTCATCAACAAACCATACGTCTCAGAAACTCTTCC
TTAGCTATGGTGGATGGAATTCCTTTAGAGGAGTTCCCCTTCATCTTTGGAATTACAATACTTTTAAGCAGATTGGAAATGCATGTGGGGGTTTTGTCGCTGTTGCTAAA
GATACAATGGAGAAAAAGGACCTGATGGAAGCAAAAATCAAAGTCAAGTATAACTATACAGGATTCATTCCAGCAAACATTGGAATAACTGACGATAATGGAGAACTTTT
TGTGGTCCAAACTGTTTGCAAGACGGAGAGTAAGTGGCTCAAAGAGAGAAATGTTGACATGCATGGAACTTTCAAAAGGCAAGCGACTGCTAGCTTCAATGAATACAATT
CAGAATCGGAAATGTATCACTTCACCGGAAATGTTGCAGTTACACCGGATTTTTTTGAACTTTCAAAATCCGAAGCATTAAATTTGGAAGTAACTACACCTGAAGCAAAT
CTCACAAATCTCACAAAAACCAAGACTCCAAATCCTAGCCATTATGGAAAGACTGACAAAACCACAAAAAAAACAGAAAAACGGCTGGACAAAGGGAAACAGTTAATGGT
GGATGATGATGATACTGACAGCCAGCAAAACAAATTCAGCTCGAAAAGAAAGGTATCATTTACCTCACCAAAAAACAAAACTTTTTTTTTCAATCCTGAAAATGCTCCAG
CTAAGCACCTCTGCATTAAAAGTTCGTTGGAAAAAAATAGCAGTGGGCCCTCTGAAGATTCTTCAATGCAAAAAAGAAAATTTGTGAAAAAATACTACAGAATCAAAAAC
AAATTTAACCAAGCTCCTGACACATGTAAAGAAAAGGCAGCAAAGAAAGAAGGCTATCATCTAACAGTGGACTTAGGACAGCTGTCTCCTTTAAAGAAGGATCAAAATCA
ACAAATTGTTAACAGTCAGGAAGAAACAAAAGGAGATTCAAGTCCTGAAGAAACAAATACTTTGCAAGCACAAGAATCACAAGCTCAAACAGAATTGATCAAGGACAGCT
CTGACCCTATGATGACAAGTGATGAAGATGAATGCAGGATCACAAGAGTAAAAGGAAAATGTGAGGAAGAGGAGGAAAACTTCAAAAAACAGTTGATAATCTGGCTTAAA
GAAAATAACTTAAAGTTAGCTCCATCTCTAAACCAAAACTTGCCAAGCTCATCAAACAGTGAACAAGTGAGGATCATTATTTCCTCTTACTCGCCGGATTTTGTTATTTT
ATCTGAAACGAAACGCATTTCAACAAACAAAAAAGTAATCAATTCTCTATGGTCTCTCAAAAGCATAAAGTGGTTAAATGTCAATTCAAGAGGAAGAACTGGAGGCATTC
TAATTATGTGGAACGACCAAAGACATAGGCTGTTAAATAGTTTTGAAGGAGATTTCACAATTTCTGCAAATATACAAGACTCCTTAGGCAACACTTGGTGGATCACAGGC
TTATATGGTCATGCTAAAAGAAAACAGAGAAACAAATTATGGATTGAACTTCAAAATCTTCACAATCTTTGTTCTACAAACTGGCTGATTGGAGGAGACTTTAATGTGGT
GAGATGGAACAATGAGACTACAACACTGAACCCAGGGAAACACAAAAATAATTTGATTGATCCTCCTCTCACAAACAACAGATTCACTTGGTCAAATCTTAGAAGTCAAC
CAACTTGTTCTAGGCTGGATAGATTCCTTTATACCAGCTCTTGGGAATTATGCTTTAAAGAACATTATACAAGGACTCTATCAAGATCCACTTCAGATCACTTTCCTCTT
GTCCTGGAAGCTTCAAACATTCGTTGGGGCCCATCACCCTTCAGGCTAAACAATAACTCCCTCTCTGATCCAGATTTCAATAGAAATATAAGGGGTTGGTGGGAAGGATC
AAAACATCTTGGTCATCCTGGTTTTGCTTTTGTACAAAGACTAAAGACTCTCTCCAGAACTTTAAAGAATTGGCAGTTCAGTCTTTTAAATAAACAGGAGGATGAAAAAA
AGAGAATTATTCAGCAGATTGACAACATTGACAAGCTAGAAAAGCAGAATCTTTTAACTTTGGAGGACAGCAACAGAAGAACGGCTTTCAAATCTAAGCTTTGTTCTATT
GATTTCAAACAAGCTCAATGTTGGGCCCAACGAACAAAGAAGCAATGGCTCAACGAAGGGGATGAAAATACAACTTATTTTCATAAGGTGTGCTCAGCAAATCAAAGAAC
AAATTGTATATCAGAGATTCAAGATGAAAATGGTTTGACTCATTGTTCAAGTGATTCCATTGCTGGAGTCTTAACCAATCATTTCAGCCATATTTACTCTGAAGACAAGA
GAGGCACAATGTTGATTGAAAACCTGAACTGGAAACCAATTGACTCATTGCATCACAATGAGTTATGTGCTCCCTTTGATGAAATTGAAGTATTACAAGCCATCAACTCT
ATTAGTGACAAAAAGGCCCCTGGACCTGATGGTTACACAGTGAAATTCTACAAAAAATATTGGCCCATGGTCAAAGAGGAGGTGATGCAAATTTTCAACGACTTTCATAA
AAAAGGCATCATCAACAAATGA
Protein sequenceShow/hide protein sequence
MEHKIKQNLKKQIEVNFSVKPFHPEKAILNLQDTEQANLLCSNNGSKGWSTVGNFFVKFEKWSSTNHTSQKLFLSYGGWNSFRGVPLHLWNYNTFKQIGNACGGFVAVAK
DTMEKKDLMEAKIKVKYNYTGFIPANIGITDDNGELFVVQTVCKTESKWLKERNVDMHGTFKRQATASFNEYNSESEMYHFTGNVAVTPDFFELSKSEALNLEVTTPEAN
LTNLTKTKTPNPSHYGKTDKTTKKTEKRLDKGKQLMVDDDDTDSQQNKFSSKRKVSFTSPKNKTFFFNPENAPAKHLCIKSSLEKNSSGPSEDSSMQKRKFVKKYYRIKN
KFNQAPDTCKEKAAKKEGYHLTVDLGQLSPLKKDQNQQIVNSQEETKGDSSPEETNTLQAQESQAQTELIKDSSDPMMTSDEDECRITRVKGKCEEEEENFKKQLIIWLK
ENNLKLAPSLNQNLPSSSNSEQVRIIISSYSPDFVILSETKRISTNKKVINSLWSLKSIKWLNVNSRGRTGGILIMWNDQRHRLLNSFEGDFTISANIQDSLGNTWWITG
LYGHAKRKQRNKLWIELQNLHNLCSTNWLIGGDFNVVRWNNETTTLNPGKHKNNLIDPPLTNNRFTWSNLRSQPTCSRLDRFLYTSSWELCFKEHYTRTLSRSTSDHFPL
VLEASNIRWGPSPFRLNNNSLSDPDFNRNIRGWWEGSKHLGHPGFAFVQRLKTLSRTLKNWQFSLLNKQEDEKKRIIQQIDNIDKLEKQNLLTLEDSNRRTAFKSKLCSI
DFKQAQCWAQRTKKQWLNEGDENTTYFHKVCSANQRTNCISEIQDENGLTHCSSDSIAGVLTNHFSHIYSEDKRGTMLIENLNWKPIDSLHHNELCAPFDEIEVLQAINS
ISDKKAPGPDGYTVKFYKKYWPMVKEEVMQIFNDFHKKGIINK