; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G02120 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G02120
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr7:1847543..1849412
RNA-Seq ExpressionCSPI07G02120
SyntenyCSPI07G02120
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.1e-12940.42Show/hide
Query:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN
        +L  KN+ +KWR  I +CIS+V YSILING+P GRI+ ++GIRQGD +SPFIFVL MDYLSRLL++L     I GV F+ N+ LTH+LFADDI +FVED 
Subjt:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN

Query:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSK-------
        ++Y++NLKM L LFE ASGLNINLSKS I PIN+   RA+ +AD W I   HL   YLG+PLGG P  S+FWD V  KI KKL+NW YSQLSK       
Subjt:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSK-------

Query:  ---------------------AENNYSSDKRWAW-ALKNQY--------------------------TNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI
                             A+   +S + + W    N +                          TNF+LL KWLW+F  E + LW+++II+KY    
Subjt:  ---------------------AENNYSSDKRWAW-ALKNQY--------------------------TNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI

Query:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN
        +G  P+  KF +  +PW+++ + +  F  NI+W++N+GE+ISFW D W+    L+ A  RL+ALS +K   VKE WN     W L  + PL D E  LW+
Subjt:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN

Query:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMC
         I   LP P  ++G  KP WNL +N +F TASVK+AI  +P +  N   N ++K LWK   PKKCKFF+W+++H  IN  D+LQ+R  N  L+PNWC MC
Subjt:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMC

Query:  HSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDLKAIVTSILHQNY----------------------------------------DSCNYIGQWCS
        +   E  +HL ++C  +  LW   +A    + +  D+++++ +I   N                                         D+   IG W  
Subjt:  HSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDLKAIVTSILHQNY----------------------------------------DSCNYIGQWCS

Query:  RLHLLKDYSSSALALNFTAFL
        +  L  +Y   ++ALN +AF+
Subjt:  RLHLLKDYSSSALALNFTAFL

KAA0046762.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.0e-12940.58Show/hide
Query:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN
        +L  KNFP  WR WI  CISNV YSI++NG+P GRI+AN+G+RQGD +SPF+FV+ MDYLSRLLSHLE   +IKGV+F++N  ++H+LFADDI LF+EDN
Subjt:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN

Query:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------
        + +L NL+MAL+LFE+ASGL INL KS + P+N+S  RA+  A  W I +  L + YLGVPLGGNPK   FW  V+ KI KKL NW Y+Q+SK       
Subjt:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------

Query:  ----------------------------------ENNYSSDK----RWAWALKNQ-----------YTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI
                                          + N SS+      W    K++            TN +LL+KWLWR+ +EP ALWR++I  KYK   
Subjt:  ----------------------------------ENNYSSDK----RWAWALKNQ-----------YTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI

Query:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN
         G IP+     T+KAPWRSI+ ++D F +N +W++NNG+ ISFW+  WS  G L  AY RL+ALS  K   VK+ WN+I+ +W ++    LNDRE   W 
Subjt:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN

Query:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMC
        +I   LP P  + G  KP W  ++   F+ AS K  I       +    +++ + +WKS IP K KFF+W ++   IN M+ +Q R  N CL P+WCV+C
Subjt:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMC

Query:  HSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDL----------------KAIVTSIL--------------------HQN----YDSCN-YIGQWC
          + E+  HL ++C A   LW  LQ    L  S +DL                K +   I+                    H+     ++ C   IG WC
Subjt:  HSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDL----------------KAIVTSIL--------------------HQN----YDSCN-YIGQWC

Query:  SRLHLLKDYSSSALALNFTAF
        SR    ++YS++ +ALN + F
Subjt:  SRLHLLKDYSSSALALNFTAF

KAA0056839.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.7e-12941.94Show/hide
Query:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN
        ML  K+FP KWR WI+ACISNV YSIL+NG P GRI+A +GIRQGD +SPFIFVL MDYLSRLLSHLE   +IKGV+FNN   ++HLLFADD+ +FVEDN
Subjt:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN

Query:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------
        E YL NL+MALTLFEKASGL  N SKS ISPINIS GR   +A  +   T  L + YLGVPLGGNP+  SFW +    I+KKL  W YSQ+SK       
Subjt:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------

Query:  -------------------------ENNY-------SSDKRWA----WAL-------------KNQYTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI
                                 E ++       S DK+ A    W +             K + TN +LL KWLWR+HNE N+LW+K I AKY  + 
Subjt:  -------------------------ENNY-------SSDKRWA----WAL-------------KNQYTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI

Query:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN
         G IP   +  +A +PW +I K  D +E+ I+W  N+G ++SFWH +W     L+  + RLYALS  +   VKE+W+     W+++P  PLN+RE   W+
Subjt:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN

Query:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKK-AIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVM
         I   LP  + ++G  KP+WN  ++K +T AS K  A + S   +    + ++ K LW+S IP+KCKFF+W+++H+ +N MDK+Q+R  +  LNP+WC+ 
Subjt:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKK-AIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVM

Query:  CHSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDLK----------------------AIVT------------------SILHQNYDSCNYIGQWC
        C S  E  +HL + C  A  LW    ++T    +  ++K                      AI T                  S L+   D C   G W 
Subjt:  CHSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDLK----------------------AIVT------------------SILHQNYDSCNYIGQWC

Query:  SRLHLLKDYSSSALALNFTA
        S+   LK+YS + +ALN  A
Subjt:  SRLHLLKDYSSSALALNFTA

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.7e-12842.26Show/hide
Query:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN
        ML  K+FP KWR WI+ACISNV YSIL+NG P GRI+A +GIRQGD +SPFIFVL MDYLSRLLSHLE   +IKGV+FNN   ++HLLFADD+ +FVEDN
Subjt:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN

Query:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------
        E YL NL+MALTLFEKASGL  N SKS ISPINIS GR   +A  +   T  L + YLGVPLGGNP+  SFWD+    I+KKL  W YSQ+SK       
Subjt:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------

Query:  -------------------------ENNY-------SSDKRWA----WAL-------------KNQYTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI
                                 E ++       S DK+ A    W +             K + TN +LL KWLWR+HNE N+LW+K I AKY  + 
Subjt:  -------------------------ENNY-------SSDKRWA----WAL-------------KNQYTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI

Query:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN
         G IP   +  +A +PW +I K  D +E+ I+W  N+G ++SFWH +W     L+    RLYALS  +   VKE+W+     W++KP  PLN+RE   W+
Subjt:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN

Query:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKK-AIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVM
         I   LP  + ++G  KP WN  ++K +T AS K  A + S   +    + ++ K LW+S IP+KCKFF+W+++H+ +N MD +Q+R  +  LNP+WC+ 
Subjt:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKK-AIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVM

Query:  CHSECETSDHLLVNCRAASFLWETLQAKT---RLHHSFNDL-------------------KAIVT------------------SILHQNYDSCNYIGQWC
        C S  E  +HL + C  A  LW    ++T    ++ +  DL                    AI T                  S L+   D C   G W 
Subjt:  CHSECETSDHLLVNCRAASFLWETLQAKT---RLHHSFNDL-------------------KAIVT------------------SILHQNYDSCNYIGQWC

Query:  SRLHLLKDYSSSALALNFTA
        S+   LK+YS + +ALN  A
Subjt:  SRLHLLKDYSSSALALNFTA

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.3e-12842.26Show/hide
Query:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN
        ML  K+FP KWR WI+ACISNV YSIL+NG P GRI+A +GIRQGD +SPFIFVL MDYLSRLLSHLE   +IKGV+FNN   ++HLLFADD+ +FVEDN
Subjt:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN

Query:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------
        E YL NL+MALTLFEKASGL  N SKS ISPINIS GR   +A  +   T  L + YLGVPLGGNP+  SFWD+    I+KKL  W YSQ+SK       
Subjt:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------

Query:  -------------------------ENNY-------SSDKRWA----WAL-------------KNQYTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI
                                 E ++       S DK+ A    W +             K + TN +LL KWLWR+HNE N+LW+K I AKY  + 
Subjt:  -------------------------ENNY-------SSDKRWA----WAL-------------KNQYTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI

Query:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN
         G IP   +  +A +PW +I K  D +E+ I+W  N+G ++SFWH +W     L+    RLYALS  +   VKE+W+     W++KP  PLN+RE   W+
Subjt:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN

Query:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKK-AIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVM
         I   LP  + ++G  KP WN  ++K +T AS K  A + S   +    + ++ K LW+S IP+KCKFF+W+++H+ +N MD +Q+R  +  LNP+WC+ 
Subjt:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKK-AIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVM

Query:  CHSECETSDHLLVNCRAASFLWETLQAKT---RLHHSFNDL-------------------KAIVT------------------SILHQNYDSCNYIGQWC
        C S  E  +HL + C  A  LW    ++T    ++ +  DL                    AI T                  S L+   D C   G W 
Subjt:  CHSECETSDHLLVNCRAASFLWETLQAKT---RLHHSFNDL-------------------KAIVT------------------SILHQNYDSCNYIGQWC

Query:  SRLHLLKDYSSSALALNFTA
        S+   LK+YS + +ALN  A
Subjt:  SRLHLLKDYSSSALALNFTA

TrEMBL top hitse value%identityAlignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein5.5e-13040.42Show/hide
Query:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN
        +L  KN+ +KWR  I +CIS+V YSILING+P GRI+ ++GIRQGD +SPFIFVL MDYLSRLL++L     I GV F+ N+ LTH+LFADDI +FVED 
Subjt:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN

Query:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSK-------
        ++Y++NLKM L LFE ASGLNINLSKS I PIN+   RA+ +AD W I   HL   YLG+PLGG P  S+FWD V  KI KKL+NW YSQLSK       
Subjt:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSK-------

Query:  ---------------------AENNYSSDKRWAW-ALKNQY--------------------------TNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI
                             A+   +S + + W    N +                          TNF+LL KWLW+F  E + LW+++II+KY    
Subjt:  ---------------------AENNYSSDKRWAW-ALKNQY--------------------------TNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI

Query:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN
        +G  P+  KF +  +PW+++ + +  F  NI+W++N+GE+ISFW D W+    L+ A  RL+ALS +K   VKE WN     W L  + PL D E  LW+
Subjt:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN

Query:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMC
         I   LP P  ++G  KP WNL +N +F TASVK+AI  +P +  N   N ++K LWK   PKKCKFF+W+++H  IN  D+LQ+R  N  L+PNWC MC
Subjt:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMC

Query:  HSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDLKAIVTSILHQNY----------------------------------------DSCNYIGQWCS
        +   E  +HL ++C  +  LW   +A    + +  D+++++ +I   N                                         D+   IG W  
Subjt:  HSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDLKAIVTSILHQNY----------------------------------------DSCNYIGQWCS

Query:  RLHLLKDYSSSALALNFTAFL
        +  L  +Y   ++ALN +AF+
Subjt:  RLHLLKDYSSSALALNFTAFL

A0A5A7TTK1 LINE-1 retrotransposable element ORF2 protein9.5e-13040.58Show/hide
Query:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN
        +L  KNFP  WR WI  CISNV YSI++NG+P GRI+AN+G+RQGD +SPF+FV+ MDYLSRLLSHLE   +IKGV+F++N  ++H+LFADDI LF+EDN
Subjt:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN

Query:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------
        + +L NL+MAL+LFE+ASGL INL KS + P+N+S  RA+  A  W I +  L + YLGVPLGGNPK   FW  V+ KI KKL NW Y+Q+SK       
Subjt:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------

Query:  ----------------------------------ENNYSSDK----RWAWALKNQ-----------YTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI
                                          + N SS+      W    K++            TN +LL+KWLWR+ +EP ALWR++I  KYK   
Subjt:  ----------------------------------ENNYSSDK----RWAWALKNQ-----------YTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI

Query:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN
         G IP+     T+KAPWRSI+ ++D F +N +W++NNG+ ISFW+  WS  G L  AY RL+ALS  K   VK+ WN+I+ +W ++    LNDRE   W 
Subjt:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN

Query:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMC
        +I   LP P  + G  KP W  ++   F+ AS K  I       +    +++ + +WKS IP K KFF+W ++   IN M+ +Q R  N CL P+WCV+C
Subjt:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMC

Query:  HSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDL----------------KAIVTSIL--------------------HQN----YDSCN-YIGQWC
          + E+  HL ++C A   LW  LQ    L  S +DL                K +   I+                    H+     ++ C   IG WC
Subjt:  HSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDL----------------KAIVTSIL--------------------HQN----YDSCN-YIGQWC

Query:  SRLHLLKDYSSSALALNFTAF
        SR    ++YS++ +ALN + F
Subjt:  SRLHLLKDYSSSALALNFTAF

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein8.0e-12942.26Show/hide
Query:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN
        ML  K+FP KWR WI+ACISNV YSIL+NG P GRI+A +GIRQGD +SPFIFVL MDYLSRLLSHLE   +IKGV+FNN   ++HLLFADD+ +FVEDN
Subjt:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN

Query:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------
        E YL NL+MALTLFEKASGL  N SKS ISPINIS GR   +A  +   T  L + YLGVPLGGNP+  SFWD+    I+KKL  W YSQ+SK       
Subjt:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------

Query:  -------------------------ENNY-------SSDKRWA----WAL-------------KNQYTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI
                                 E ++       S DK+ A    W +             K + TN +LL KWLWR+HNE N+LW+K I AKY  + 
Subjt:  -------------------------ENNY-------SSDKRWA----WAL-------------KNQYTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI

Query:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN
         G IP   +  +A +PW +I K  D +E+ I+W  N+G ++SFWH +W     L+    RLYALS  +   VKE+W+     W++KP  PLN+RE   W+
Subjt:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN

Query:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKK-AIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVM
         I   LP  + ++G  KP WN  ++K +T AS K  A + S   +    + ++ K LW+S IP+KCKFF+W+++H+ +N MD +Q+R  +  LNP+WC+ 
Subjt:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKK-AIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVM

Query:  CHSECETSDHLLVNCRAASFLWETLQAKT---RLHHSFNDL-------------------KAIVT------------------SILHQNYDSCNYIGQWC
        C S  E  +HL + C  A  LW    ++T    ++ +  DL                    AI T                  S L+   D C   G W 
Subjt:  CHSECETSDHLLVNCRAASFLWETLQAKT---RLHHSFNDL-------------------KAIVT------------------SILHQNYDSCNYIGQWC

Query:  SRLHLLKDYSSSALALNFTA
        S+   LK+YS + +ALN  A
Subjt:  SRLHLLKDYSSSALALNFTA

A0A5A7UTI6 LINE-1 retrotransposable element ORF2 protein2.8e-12941.94Show/hide
Query:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN
        ML  K+FP KWR WI+ACISNV YSIL+NG P GRI+A +GIRQGD +SPFIFVL MDYLSRLLSHLE   +IKGV+FNN   ++HLLFADD+ +FVEDN
Subjt:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN

Query:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------
        E YL NL+MALTLFEKASGL  N SKS ISPINIS GR   +A  +   T  L + YLGVPLGGNP+  SFW +    I+KKL  W YSQ+SK       
Subjt:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------

Query:  -------------------------ENNY-------SSDKRWA----WAL-------------KNQYTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI
                                 E ++       S DK+ A    W +             K + TN +LL KWLWR+HNE N+LW+K I AKY  + 
Subjt:  -------------------------ENNY-------SSDKRWA----WAL-------------KNQYTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI

Query:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN
         G IP   +  +A +PW +I K  D +E+ I+W  N+G ++SFWH +W     L+  + RLYALS  +   VKE+W+     W+++P  PLN+RE   W+
Subjt:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN

Query:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKK-AIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVM
         I   LP  + ++G  KP+WN  ++K +T AS K  A + S   +    + ++ K LW+S IP+KCKFF+W+++H+ +N MDK+Q+R  +  LNP+WC+ 
Subjt:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKK-AIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVM

Query:  CHSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDLK----------------------AIVT------------------SILHQNYDSCNYIGQWC
        C S  E  +HL + C  A  LW    ++T    +  ++K                      AI T                  S L+   D C   G W 
Subjt:  CHSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDLK----------------------AIVT------------------SILHQNYDSCNYIGQWC

Query:  SRLHLLKDYSSSALALNFTA
        S+   LK+YS + +ALN  A
Subjt:  SRLHLLKDYSSSALALNFTA

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein6.1e-12942.26Show/hide
Query:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN
        ML  K+FP KWR WI+ACISNV YSIL+NG P GRI+A +GIRQGD +SPFIFVL MDYLSRLLSHLE   +IKGV+FNN   ++HLLFADD+ +FVEDN
Subjt:  MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDN

Query:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------
        E YL NL+MALTLFEKASGL  N SKS ISPINIS GR   +A  +   T  L + YLGVPLGGNP+  SFWD+    I+KKL  W YSQ+SK       
Subjt:  ENYLANLKMALTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKA------

Query:  -------------------------ENNY-------SSDKRWA----WAL-------------KNQYTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI
                                 E ++       S DK+ A    W +             K + TN +LL KWLWR+HNE N+LW+K I AKY  + 
Subjt:  -------------------------ENNY-------SSDKRWA----WAL-------------KNQYTNFSLLSKWLWRFHNEPNALWRKIIIAKYKASI

Query:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN
         G IP   +  +A +PW +I K  D +E+ I+W  N+G ++SFWH +W     L+    RLYALS  +   VKE+W+     W++KP  PLN+RE   W+
Subjt:  IGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWN

Query:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKK-AIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVM
         I   LP  + ++G  KP WN  ++K +T AS K  A + S   +    + ++ K LW+S IP+KCKFF+W+++H+ +N MD +Q+R  +  LNP+WC+ 
Subjt:  QISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKK-AIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVM

Query:  CHSECETSDHLLVNCRAASFLWETLQAKT---RLHHSFNDL-------------------KAIVT------------------SILHQNYDSCNYIGQWC
        C S  E  +HL + C  A  LW    ++T    ++ +  DL                    AI T                  S L+   D C   G W 
Subjt:  CHSECETSDHLLVNCRAASFLWETLQAKT---RLHHSFNDL-------------------KAIVT------------------SILHQNYDSCNYIGQWC

Query:  SRLHLLKDYSSSALALNFTA
        S+   LK+YS + +ALN  A
Subjt:  SRLHLLKDYSSSALALNFTA

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.5e-0724.31Show/hide
Query:  IEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDNENYLANLKMALTLF
        I A       +I++NG+         G RQG  +SP +F +V++ L+R    + +   IKG+       +   LFADD+ +++E+      NL   ++ F
Subjt:  IEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDNENYLANLKMALTLF

Query:  EKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQL---SKAENNYSSDKRWAWALKNQ
         K SG  IN+ KS     N +      +  +         I YLG+ L               +  K L   NY  L    K + N   +   +W  +  
Subjt:  EKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQL---SKAENNYSSDKRWAWALKNQ

Query:  YTNFSLLSKWLWRFHNEP
            ++L K ++RF+  P
Subjt:  YTNFSLLSKWLWRFHNEP

P08548 LINE-1 reverse transcriptase homolog8.8e-0826.15Show/hide
Query:  IEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDNENYLANLKMALTLF
        IEA  S    +I++NG          G RQG  +SP +F +VM+ L+     + +  +IKG++  +   +   LFADD+ +++E+  +    L   +  +
Subjt:  IEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDNENYLANLKMALTLF

Query:  EKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSK--AEN-NYSSDKRWAWALKNQ
           SG  IN  KS       +N   + V D      +   + YLGV L               K  K L   NY  L K  AE+ N   +   +W  +  
Subjt:  EKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSK--AEN-NYSSDKRWAWALKNQ

Query:  YTNFSLLSKWLWRFHNEP
            S+L K ++ F+  P
Subjt:  YTNFSLLSKWLWRFHNEP

P0C2F6 Putative ribonuclease H protein At1g657502.8e-1424.1Show/hide
Query:  NFSLLSKWLWRFHNEPNALWRKIIIAKYKASIIGKIPTFSKFCTAKAPWRSIVKSL-----DLFETNITWEINNGENISFWHDRWSRFGALANAYARLYA
        N +L+SK  WR   E N+LW  ++  KY    +G+I   S++   K  W S  +S+     D+    + W   +G+ I FW DRW            L  
Subjt:  NFSLLSKWLWRFHNEPNALWRKIIIAKYKASIIGKIPTFSKFCTAKAPWRSIVKSL-----DLFETNITWEINNGENISFWHDRWSRFGALANAYARLYA

Query:  LSQSKICE---VKEMWNSIEKKWDLKPHIPLNDRECYLWNQISFDLPIPNKD---KGRGKPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALW
          +   C+    K++W    + WD     P      Y  N    +L     D     R + +W    +  F+  S  + + +      N      F  LW
Subjt:  LSQSKICE---VKEMWNSIEKKWDLKPHIPLNDRECYLWNQISFDLPIPNKD---KGRGKPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALW

Query:  KSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMCHSECETSDHLLVNCRAASFLWETLQAKTRLHHSFN
        K  +P++ K FLW + ++ +   ++  RR L+     N C +C    E+  H+L +C A   +W  +  + R    F+
Subjt:  KSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMCHSECETSDHLLVNCRAASFLWETLQAKTRLHHSFN

P11369 LINE-1 retrotransposable element ORF2 protein1.0e-0824.19Show/hide
Query:  IEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDNENYLANLKMALTLF
        I+A  S  V +I +NG+    I    G RQG  +SP++F +V++ L+R    + +   IKG+          LL ADD+ +++ D +N    L   +  F
Subjt:  IEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDNENYLANLKMALTLF

Query:  EKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKAENNYSSDKRWAWALKNQYTN
         +  G  IN +KS       +    + + +      +  +I YLGV L    ++   +DK    + K++         K +     D   +W  +     
Subjt:  EKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKAENNYSSDKRWAWALKNQYTN

Query:  FSLLSKWLWRFHNEP
         ++L K ++RF+  P
Subjt:  FSLLSKWLWRFHNEP

P92555 Uncharacterized mitochondrial protein AtMg012502.3e-0840.3Show/hide
Query:  LINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNI-CLTHLLFADD
        +ING P G +  ++G+RQGD +SP++F+L  + LS L    ++   + G+  +NN   + HLLFADD
Subjt:  LINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNI-CLTHLLFADD

Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.7e-0720.93Show/hide
Query:  WRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWNQISFDLPIPNKDKGRG
        WR + K  ++    +  ++ +G    FWHD W+  G L +    L                ++    D    I     + ++W     DL  P       
Subjt:  WRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWNQISFDLPIPNKDKGRG

Query:  KPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALW-KSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMCHSECETSDHLLVNCR
                + +F+TA  K ++ + P N         +KA+W K+ +PK   F  W +    ++  D+L+   L+    P  C++C+S  E+  HL   C 
Subjt:  KPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALW-KSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMCHSECETSDHLLVNCR

Query:  AASFLWETLQAKTRL
            +W     +  L
Subjt:  AASFLWETLQAKTRL

AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-1224.55Show/hide
Query:  NFSLLSKWLWRFHNEPNALWRKIIIAKY------KASIIGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLY
        N +LL K +WR  + P +L  K+  ++Y        + +G  P+F         W+SI  S ++        + NGE+I  W  +W      A+A  R+ 
Subjt:  NFSLLSKWLWRFHNEPNALWRKIIIAKY------KASIIGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLY

Query:  ALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWNQISFDLPIPNKDK---GRGKP---------NWNLENNKVFTTASVKKAI------QISPTNE
         +   +   V    +SI K  DL   I  + RE   W +   ++  P  ++   G  +P          W+  ++  +T  S    +      + SP   
Subjt:  ALSQSKICEVKEMWNSIEKKWDLKPHIPLNDRECYLWNQISFDLPIPNKDK---GRGKP---------NWNLENNKVFTTASVKKAI------QISPTNE

Query:  NNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMCHSECETSDHLLVNCRAASFLW
        +    N I++ +WKS    K + FLW  L   +     L  R L+     + C+ C S  ET +HLL  C  A   W
Subjt:  NNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPNWCVMCHSECETSDHLLVNCRAASFLW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.7e-0940.3Show/hide
Query:  LINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNI-CLTHLLFADD
        +ING P G +  ++G+RQGD +SP++F+L  + LS L    ++   + G+  +NN   + HLLFADD
Subjt:  LINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNI-CLTHLLFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTTATGTTAAAAACTTTCCTCGCAAATGGAGAAGTTGGATTGAAGCTTGTATTAGCAATGTGGTTTACTCCATTCTCATAAATGGCAAGCCCCATGGTAGAATCCA
AGCCAATAAGGGAATTCGTCAAGGTGATACCATTTCTCCCTTCATTTTCGTTCTTGTCATGGACTATCTAAGCAGATTATTATCTCATCTTGAGAAAAATAACTCGATAA
AGGGAGTCAACTTCAACAACAATATTTGTTTAACCCATCTCCTTTTTGCAGACGATATTTTCCTATTTGTCGAAGACAATGAAAATTACCTCGCAAACCTGAAGATGGCT
TTAACTCTTTTTGAAAAAGCTTCCGGGTTAAACATAAATCTATCTAAGTCCCCCATAAGCCCCATAAATATCTCAAATGGCCGTGCTAGAATTGTCGCGGACAAGTGGGA
TATTCCTACTATTCATCTGTCGATTCTTTATCTTGGAGTCCCCTTAGGAGGCAATCCAAAATTGAGTTCTTTTTGGGACAAGGTTGATGTCAAAATCAACAAGAAGCTCA
CTAATTGGAATTACTCTCAGCTTTCAAAAGCAGAAAATAACTACTCCTCTGACAAAAGGTGGGCTTGGGCTCTCAAAAATCAATATACAAATTTCTCCCTTCTATCAAAA
TGGCTGTGGAGATTTCATAATGAACCTAATGCTCTATGGAGGAAAATCATTATAGCCAAATACAAAGCCTCCATAATAGGTAAAATCCCAACTTTTAGTAAGTTTTGTAC
TGCCAAAGCCCCTTGGAGGAGTATTGTCAAAAGTTTGGATTTGTTTGAAACAAATATTACTTGGGAAATTAATAATGGTGAAAACATCTCTTTCTGGCATGATAGGTGGA
GTAGGTTTGGTGCTCTTGCCAATGCATACGCAAGGCTTTACGCTCTCTCCCAATCGAAAATTTGCGAGGTTAAAGAAATGTGGAACTCTATTGAGAAGAAATGGGATTTA
AAGCCTCACATACCACTAAACGACAGAGAATGTTATCTTTGGAATCAGATTTCGTTTGATCTCCCTATTCCGAATAAAGACAAAGGTAGAGGCAAGCCAAATTGGAACCT
AGAAAATAACAAAGTCTTCACCACAGCTTCGGTAAAAAAGGCTATTCAAATCTCCCCAACCAACGAGAATAACGGAGTCGACAACCAGATTTTCAAAGCCCTTTGGAAAT
CACCAATTCCAAAGAAATGTAAATTTTTCCTCTGGTCCATTCTCCATGAAGGTATCAACGCAATGGATAAGCTTCAAAGAAGACAGCTGAATACATGCCTTAATCCTAAT
TGGTGCGTCATGTGCCACTCTGAATGTGAAACAAGTGACCACTTATTGGTAAACTGTAGAGCAGCTAGCTTTCTCTGGGAAACATTGCAAGCTAAAACTAGATTGCATCA
TTCGTTCAATGATCTAAAAGCGATAGTTACTTCCATTTTGCACCAGAATTATGATAGCTGTAACTATATCGGTCAATGGTGTAGCCGCCTTCACCTTCTTAAAGACTACT
CTTCTAGTGCGTTAGCTCTTAACTTCACTGCTTTTCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTTATGTTAAAAACTTTCCTCGCAAATGGAGAAGTTGGATTGAAGCTTGTATTAGCAATGTGGTTTACTCCATTCTCATAAATGGCAAGCCCCATGGTAGAATCCA
AGCCAATAAGGGAATTCGTCAAGGTGATACCATTTCTCCCTTCATTTTCGTTCTTGTCATGGACTATCTAAGCAGATTATTATCTCATCTTGAGAAAAATAACTCGATAA
AGGGAGTCAACTTCAACAACAATATTTGTTTAACCCATCTCCTTTTTGCAGACGATATTTTCCTATTTGTCGAAGACAATGAAAATTACCTCGCAAACCTGAAGATGGCT
TTAACTCTTTTTGAAAAAGCTTCCGGGTTAAACATAAATCTATCTAAGTCCCCCATAAGCCCCATAAATATCTCAAATGGCCGTGCTAGAATTGTCGCGGACAAGTGGGA
TATTCCTACTATTCATCTGTCGATTCTTTATCTTGGAGTCCCCTTAGGAGGCAATCCAAAATTGAGTTCTTTTTGGGACAAGGTTGATGTCAAAATCAACAAGAAGCTCA
CTAATTGGAATTACTCTCAGCTTTCAAAAGCAGAAAATAACTACTCCTCTGACAAAAGGTGGGCTTGGGCTCTCAAAAATCAATATACAAATTTCTCCCTTCTATCAAAA
TGGCTGTGGAGATTTCATAATGAACCTAATGCTCTATGGAGGAAAATCATTATAGCCAAATACAAAGCCTCCATAATAGGTAAAATCCCAACTTTTAGTAAGTTTTGTAC
TGCCAAAGCCCCTTGGAGGAGTATTGTCAAAAGTTTGGATTTGTTTGAAACAAATATTACTTGGGAAATTAATAATGGTGAAAACATCTCTTTCTGGCATGATAGGTGGA
GTAGGTTTGGTGCTCTTGCCAATGCATACGCAAGGCTTTACGCTCTCTCCCAATCGAAAATTTGCGAGGTTAAAGAAATGTGGAACTCTATTGAGAAGAAATGGGATTTA
AAGCCTCACATACCACTAAACGACAGAGAATGTTATCTTTGGAATCAGATTTCGTTTGATCTCCCTATTCCGAATAAAGACAAAGGTAGAGGCAAGCCAAATTGGAACCT
AGAAAATAACAAAGTCTTCACCACAGCTTCGGTAAAAAAGGCTATTCAAATCTCCCCAACCAACGAGAATAACGGAGTCGACAACCAGATTTTCAAAGCCCTTTGGAAAT
CACCAATTCCAAAGAAATGTAAATTTTTCCTCTGGTCCATTCTCCATGAAGGTATCAACGCAATGGATAAGCTTCAAAGAAGACAGCTGAATACATGCCTTAATCCTAAT
TGGTGCGTCATGTGCCACTCTGAATGTGAAACAAGTGACCACTTATTGGTAAACTGTAGAGCAGCTAGCTTTCTCTGGGAAACATTGCAAGCTAAAACTAGATTGCATCA
TTCGTTCAATGATCTAAAAGCGATAGTTACTTCCATTTTGCACCAGAATTATGATAGCTGTAACTATATCGGTCAATGGTGTAGCCGCCTTCACCTTCTTAAAGACTACT
CTTCTAGTGCGTTAGCTCTTAACTTCACTGCTTTTCTTTAA
Protein sequenceShow/hide protein sequence
MLYVKNFPRKWRSWIEACISNVVYSILINGKPHGRIQANKGIRQGDTISPFIFVLVMDYLSRLLSHLEKNNSIKGVNFNNNICLTHLLFADDIFLFVEDNENYLANLKMA
LTLFEKASGLNINLSKSPISPINISNGRARIVADKWDIPTIHLSILYLGVPLGGNPKLSSFWDKVDVKINKKLTNWNYSQLSKAENNYSSDKRWAWALKNQYTNFSLLSK
WLWRFHNEPNALWRKIIIAKYKASIIGKIPTFSKFCTAKAPWRSIVKSLDLFETNITWEINNGENISFWHDRWSRFGALANAYARLYALSQSKICEVKEMWNSIEKKWDL
KPHIPLNDRECYLWNQISFDLPIPNKDKGRGKPNWNLENNKVFTTASVKKAIQISPTNENNGVDNQIFKALWKSPIPKKCKFFLWSILHEGINAMDKLQRRQLNTCLNPN
WCVMCHSECETSDHLLVNCRAASFLWETLQAKTRLHHSFNDLKAIVTSILHQNYDSCNYIGQWCSRLHLLKDYSSSALALNFTAFL