; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0021237 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0021237
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:5806036..5811008
RNA-Seq ExpressionLag0021237
SyntenyLag0021237
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]2.7e-13741.34Show/hide
Query:  WKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQ---------------FPSAGNFE-------KVLQH
        W+QRSR  WLK GD+NT +FH RAS R KRN + G+ D    WQTE  +I   F  YFK +F+S                  SA N +       + L+H
Subjt:  WKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQ---------------FPSAGNFE-------KVLQH

Query:  IPNRVTP--APGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDD
           ++ P  APG DG PALF+QKYW +VG K    CL ILN E S++++N+T I LIPKV  P +V+++RPISLC   YK++ K IANRLK VL  +I +
Subjt:  IPNRVTP--APGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDD

Query:  CQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVYQ
         QSAF+P+R I DN++   E ++ +    KGR    ALKLDM+KAYDRVEW F  A+M KLGFSA W+  +                        +G+ Q
Subjt:  CQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVYQ

Query:  R-PLSLSLLMGRRRGLSAML-GHVRSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDT----
          PLS  L +    G S +L G  R   + G  +   A  ++HL FA DS++F+KA+ +     + +   YE  +GQ IN  KS +  S N  R      
Subjt:  R-PLSLSLLMGRRRGLSAML-GHVRSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDT----

Query:  ------------------------------QYYLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQR
                                      Q+    + +  SGWK    S AGKEILIK+V+QAI TY+MSCFR+PKGL  +++ + ARFWW     K+ 
Subjt:  ------------------------------QYYLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQR

Query:  IHWRRWKELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIM
        IHW +W+ LCK K  GGL FRD E FNQAL+AKQ WRIL  PES VAR+ + +Y PS   LEAEV +NPS+ WRS  WG +LL  GLR ++G G S+ + 
Subjt:  IHWRRWKELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIM

Query:  NDPWIPRPYLFKVIGSKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPIS
         D W+P P  FK++         +V D  T++GQWNVP L+ + +D+++  I+ +P++
Subjt:  NDPWIPRPYLFKVIGSKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPIS

ONI09819.1 hypothetical protein PRUPE_4G011200 [Prunus persica]1.2e-13737.55Show/hide
Query:  WSKIIYNNWN-----QGLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNT
        ++K+I   W      + + + L  C + LK W      N    +     +L  +  +          ++E  +  L  ++E+ W+QRSR  WLK GD+NT
Subjt:  WSKIIYNNWN-----QGLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNT

Query:  KWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRV--------------------------TPAPGPDG
         +FH RAS R KRN + G+ D    WQTE  +I   F  YFK +F+S        E++L  +   +                          T APG DG
Subjt:  KWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRV--------------------------TPAPGPDG

Query:  FPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNM
         PALF+QKYW +VG K    CL ILN E S++++N+T I LIPKV  P  V+++RPISLC   YK++ K IANRLK VL  +I + QSAF+P+R I DN+
Subjt:  FPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNM

Query:  IVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVYQR-PLSLSLLMGRRRG
        +   E ++ +    K R    ALKLDM+KAYDRVEW F  A+M KLGFSA W+  +                        +G+ Q  PLS  L +    G
Subjt:  IVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVYQR-PLSLSLLMGRRRG

Query:  LSAML-GHVRSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDT-------------------
         S +L G  R   + G  +   A  ++HL FA DS++F+KA+ +A    + +   YE  +GQ IN  KS +  S N  R                     
Subjt:  LSAML-GHVRSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDT-------------------

Query:  ---------------QYYLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQ
                       Q+    + +  SGWK    S AGKEILIK+V+QAI TY+MSCF++PKGL  +++ + ARFWW     K+ IHW +W+ LCK K  
Subjt:  ---------------QYYLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQ

Query:  GGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWIPRPYLFKVIG
        GGL FRD E FNQAL+AKQ WRIL  PES VAR+ + +Y PS   LEAEV +NPS+ W S  WG +LL  G+R ++G G S+ +  D W+P P  FK++ 
Subjt:  GGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWIPRPYLFKVIG

Query:  SKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPIS
                +V D  T++GQWNVP L+ + +D+++  I+ +P++
Subjt:  SKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPIS

XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]7.4e-14339.92Show/hide
Query:  SALCASGWSKIIYNNWNQGLH--HKLYKCGQA-LKDWGFRENKNRWASIRLIKDKLKLVYDKPL-PINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLK
        S +  S W     N+W   +    ++ K   A LK W   E + R      + D+LK+   +PL  I+ +E+ +LE Q+  + ++EEVYWKQRSR +WLK
Subjt:  SALCASGWSKIIYNNWNQGLH--HKLYKCGQA-LKDWGFRENKNRWASIRLIKDKLKLVYDKPL-PINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLK

Query:  WGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQ------------HIPNRVTP--------------
         GD+NTK+FH +AS RR++N I GVED +G W  +   I   F  +F+ +F S  PS     + L+            H+    TP              
Subjt:  WGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQ------------HIPNRVTP--------------

Query:  APGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSR
        APGPDG PA F+QK+W +VG+     CL ILN + ++   N+T I LIPKV  PR V ++RPISLCNV Y+IV K IANRLK +L+ II   QSAFIP+R
Subjt:  APGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSR

Query:  SISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVYQR-PLSLSLL
         I+DN+I+G+E LH +      R G  ALKLD+SKAYDRVEW+F    M  LGFSA WI +I                        +G+ Q  PLS  L 
Subjt:  SISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVYQR-PLSLSLL

Query:  MGRRRGLSAMLGHV-RSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDTQYYLSSIVQMQ--
        +      S +L    R   I G       + I+HL FA DSL+F KAS     + KGI   Y +ASGQ  N +KS + FS     +    + SI Q++  
Subjt:  MGRRRGLSAMLGHV-RSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDTQYYLSSIVQMQ--

Query:  --------------------------------SGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKEL
                                        S W    FS  GKEILIK+V QA+  YAMS F+LPKGL + I    ARFWWG+   K  IHW RW  +
Subjt:  --------------------------------SGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKEL

Query:  CKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWIPRPY
         K K +GGL FRD   FNQAL+AKQ WR++  P S +ARV+K +Y+ +S    A+V SNPS+ WRS +WG  +++ G+R +IGDG+ V +  D WIPRP 
Subjt:  CKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWIPRPY

Query:  LFKVIGSKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIV
         F+ I  K    +  V+D I +  +W V +L+Q    EDI+ I+
Subjt:  LFKVIGSKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIV

XP_018816058.1 uncharacterized protein LOC108987582 [Juglans regia]1.5e-13839.21Show/hide
Query:  ARLSLSALCASGWSKIIYNNWNQ---------GLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYW
        A  +L A C    S II   WN+         G++ +L KC  ALK W   +N     +I     +L+ + D       + + +L+++L+   +E+++ W
Subjt:  ARLSLSALCASGWSKIIYNNWNQ---------GLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYW

Query:  KQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRVTP----------------
        KQR++++WL+ GD+NTK+FH  A+ RRK N I  V D      TE  +I   F ++F D+F S  P   N E  L+ +  +V+P                
Subjt:  KQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRVTP----------------

Query:  ----------APGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIID
                  +PGPDGFPALFYQ +WD++G++     L ILN   S++  N T I LIPKV  P+ V DYRPISLCNV YKIV K+++NR+K VL DII 
Subjt:  ----------APGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIID

Query:  DCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVY
          QSAF+P R+IS+N++V +E LH + +R K + GF ALKLD+SKAYDRVEW F  ++M +LGF   WI ++                        +G+ 
Subjt:  DCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVY

Query:  Q-RPLSLSLLMGRRRGLSAMLGHVRSIG-IAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDTQYY
        Q  PLS  L +     LSAM+     IG I+   M     ++SHLFFA DSLIF K+++  +     I+  YE ASGQ +N +KS I FSKN   + Q  
Subjt:  Q-RPLSLSLLMGRRRGLSAMLGHVRSIG-IAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDTQYY

Query:  LSSIVQMQS----------------------------------GWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQ
        + SI  ++S                                   WK    S AGKEIL+KSV+QAI TYAM  FRLP+ +  ++  LC +FWWGST+ + 
Subjt:  LSSIVQMQS----------------------------------GWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQ

Query:  RIHWRRWKELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSI
        +I W +W  L + K QGGL FR F  FN AL+AKQ W IL  P S  A++LK KYFPSS LLEA+V S PS  WRS + GL+LL+ GL  +IG+G  V I
Subjt:  RIHWRRWKELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSI

Query:  MNDPWIPRPYLFKVIGSKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPISM
            W+P+  L          E +     I    QWN P LQ L   ++I+ I  +PIS+
Subjt:  MNDPWIPRPYLFKVIGSKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPISM

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]2.4e-14138.47Show/hide
Query:  ISARLSLSALCASGWSKIIYNNWNQGLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSREN
        IS  L + A+  S WS  IY              GQ  K    + ++    ++R + + L L           E++RL  +++ L  +EE YW QR++ +
Subjt:  ISARLSLSALCASGWSKIIYNNWNQGLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSREN

Query:  WLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRVT------------------------
        WLK GDRNTK+FH +AS RRK+N+I+G+ D +GRW      I +A  +YF +I++S  PS    E+V + IP +VT                        
Subjt:  WLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRVT------------------------

Query:  --PAPGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFI
           APGPDG  A+F+QKYW +VG    D  L +LN    I + N TNI LIPK  NP+ +TD+RPISLCNV YK+++K++ANRLK +L  II + QSAF 
Subjt:  --PAPGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFI

Query:  PSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVYQ-RPLSL
          R I+DN++V  E +H+L+ +T G++GF A+KLDMSKA+DRVEW F   +M+++GF   W D++                        +G+ Q  PLS 
Subjt:  PSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVYQ-RPLSL

Query:  SLLMGRRRGLSAMLGH-VRSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDTQYYLSSIV-Q
        SL +    GLSA++    R+  I G S+N    K++HLFFA DS++F KA+ E     + I+  YE ASGQ IN DKS I FS N  ++T+  + +I+  
Subjt:  SLLMGRRRGLSAMLGH-VRSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDTQYYLSSIV-Q

Query:  MQ---------------------------------SGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRW
        MQ                                 +GWK    S  GKEILIK+V QAI TY MSCF LP+GL D +  +   FWWG  + + ++ W  W
Subjt:  MQ---------------------------------SGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRW

Query:  KELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWIP
        K +C  KA GGL FR+ + FN A++AKQAWRIL  P S V RVLK +YFP+ +LL A++ S+PSY WRS    L+++R G R ++G+G+ + I  D W+P
Subjt:  KELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWIP

Query:  RPYLFKVIGSKVY-CEDMKVSDCITA-TGQWNVPKLQQLLFDEDIQEIVGLPISMSMSD
         P  +KVI  +++  E   VS  I   T  W V  L+ +    +++ I+ +P+S ++ +
Subjt:  RPYLFKVIGSKVY-CEDMKVSDCITA-TGQWNVPKLQQLLFDEDIQEIVGLPISMSMSD

TrEMBL top hitse value%identityAlignment
A0A2I4E9H8 uncharacterized protein LOC1089875827.0e-13939.21Show/hide
Query:  ARLSLSALCASGWSKIIYNNWNQ---------GLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYW
        A  +L A C    S II   WN+         G++ +L KC  ALK W   +N     +I     +L+ + D       + + +L+++L+   +E+++ W
Subjt:  ARLSLSALCASGWSKIIYNNWNQ---------GLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYW

Query:  KQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRVTP----------------
        KQR++++WL+ GD+NTK+FH  A+ RRK N I  V D      TE  +I   F ++F D+F S  P   N E  L+ +  +V+P                
Subjt:  KQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRVTP----------------

Query:  ----------APGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIID
                  +PGPDGFPALFYQ +WD++G++     L ILN   S++  N T I LIPKV  P+ V DYRPISLCNV YKIV K+++NR+K VL DII 
Subjt:  ----------APGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIID

Query:  DCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVY
          QSAF+P R+IS+N++V +E LH + +R K + GF ALKLD+SKAYDRVEW F  ++M +LGF   WI ++                        +G+ 
Subjt:  DCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVY

Query:  Q-RPLSLSLLMGRRRGLSAMLGHVRSIG-IAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDTQYY
        Q  PLS  L +     LSAM+     IG I+   M     ++SHLFFA DSLIF K+++  +     I+  YE ASGQ +N +KS I FSKN   + Q  
Subjt:  Q-RPLSLSLLMGRRRGLSAMLGHVRSIG-IAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDTQYY

Query:  LSSIVQMQS----------------------------------GWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQ
        + SI  ++S                                   WK    S AGKEIL+KSV+QAI TYAM  FRLP+ +  ++  LC +FWWGST+ + 
Subjt:  LSSIVQMQS----------------------------------GWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQ

Query:  RIHWRRWKELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSI
        +I W +W  L + K QGGL FR F  FN AL+AKQ W IL  P S  A++LK KYFPSS LLEA+V S PS  WRS + GL+LL+ GL  +IG+G  V I
Subjt:  RIHWRRWKELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSI

Query:  MNDPWIPRPYLFKVIGSKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPISM
            W+P+  L          E +     I    QWN P LQ L   ++I+ I  +PIS+
Subjt:  MNDPWIPRPYLFKVIGSKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPISM

A0A2N9GY02 Reverse transcriptase domain-containing protein3.4e-14139.57Show/hide
Query:  WSKIIYNNWNQGLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLV-YDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFH
        W   + +N    L  K+  C   L  W      N    +R   D+L+ V +       F  +  L+ +L+ L   +E++WKQRSR  WL+ GD+NTK+FH
Subjt:  WSKIIYNNWNQGLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLV-YDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFH

Query:  HRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRVTP--------------------------APGPDGFPAL
         +AS RRKRN+I G+ D  G W T+   I +    Y+ +IF S    +   E+ ++ + +RVTP                           PGPDG   L
Subjt:  HRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRVTP--------------------------APGPDGFPAL

Query:  FYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGH
        FYQKYW +VG    D  L+ LN  H +K  N+T+IVLIPKV NP  V+DYRPISLCNVS+K+++KV ANRLKR+L  II D QSAF+P R I+DN +V  
Subjt:  FYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGH

Query:  EFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVYQ-RPLSLSLLMGRRRGLSAM
        E LH +  +  GR G  ALKLDMSKAYDRVEW F   IM+KLGF+A W+D++                        +G+ Q  PLS  L +    GLS++
Subjt:  EFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVYQ-RPLSLSLLMGRRRGLSAM

Query:  LGHV-RSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDTQYYLSSIVQMQ------------
        +    RS  I G   +     ISHLFFA DSL+F +A+      F+ I+  YE ASGQ +N +K+ + FSKN  + TQ  L+ I+ +             
Subjt:  LGHV-RSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDTQYYLSSIVQMQ------------

Query:  ----------------------SGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLD
                               GWK  F S A +EILIK+V QAI TY MSCF+LPK LL ++ S+ +RFWWG    + +IHW  WK+LC  K +GG+ 
Subjt:  ----------------------SGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLD

Query:  FRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWIPRPYLFKVIGSKVY
        FRD + FN AL+AKQ WR++  PES + RVLK KYFP    +EA ++   S  WR  + G  +L  GLR  +G+G  + I  D WIP P  +K+   +  
Subjt:  FRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWIPRPYLFKVIGSKVY

Query:  CEDMK---VSDCI-TATGQWNVPKLQQLLFDEDIQEIVGLPIS
           M    VSD I T +  W    L  L F  D+  I  +PIS
Subjt:  CEDMK---VSDCI-TATGQWNVPKLQQLLFDEDIQEIVGLPIS

A0A803P3X8 Uncharacterized protein3.2e-13937.38Show/hide
Query:  NNWN--QGLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASF
        +NW    GL  ++  CG+ LKDW   +     A  + +K++LK+V       ++Q   ++ER L+ +  +EE+ W+QRSR  WL  GDRNTK+FHH+AS 
Subjt:  NNWN--QGLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASF

Query:  RRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRVT--------------------------PAPGPDGFPALFYQKY
        R+K+N+I G+ D + RW T+ N+I      Y+ D+F+S  P     ++++Q +PNR+                            APG DG P LFYQ +
Subjt:  RRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRVT--------------------------PAPGPDGFPALFYQKY

Query:  WDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGHEFLHF
        W+ VGQ+ +  CL +LN +      N T + LIPK+ NP +V DYRP+SLCNVSYK ++K +ANR+K  +D +I + QSAFI  R I DN I+G E LH 
Subjt:  WDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGHEFLHF

Query:  LNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMIKG-------------------VYQR------PLSLSLLMGRRRGLSAMLGHV-
        L     G     ALKLDMSKAYDRVEW F   +M +LG+   WI  + G                   + QR      PLS  L +    GLS +L    
Subjt:  LNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMIKG-------------------VYQR------PLSLSLLMGRRRGLSAMLGHV-

Query:  RSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVH-----------------RDTQY---------
        R   I G    +  ++++HL FA DSL+FL+A+ E       I+  Y   SGQCIN  KS +C  + +H                   T+Y         
Subjt:  RSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVH-----------------RDTQY---------

Query:  --------YLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLDFRDFE
                  + + +   GWK   FS AG+E+LIKSV+Q I  Y MSCFR+ KGL+ +I +L ARFWWGST  K ++HW  W++LC  K  GG+ FRD E
Subjt:  --------YLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLDFRDFE

Query:  VFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWIPR--PYLFKVIGSKVYCED
         FNQAL+AKQ W+++  P+S +A+ LK  Y+P+++ L A+  +  S  WR  +WG DLL  G+R ++ DG  V I  D W+PR  P+L + +   +   +
Subjt:  VFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWIPR--PYLFKVIGSKVYCED

Query:  MKVSDCITATGQWNVPKLQQLLFDEDIQEIVGL
          V+  +   G+WN   +++ + ++DI  I+G+
Subjt:  MKVSDCITATGQWNVPKLQQLLFDEDIQEIVGL

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)1.9e-13940.8Show/hide
Query:  RLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQ---------------FPSA
        ++E  +  L  ++E+ W+QRSR  WLK GD+NT +FH RAS R KRN + G+ D    WQTE  +I   F  YFK +F+S                  SA
Subjt:  RLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQ---------------FPSA

Query:  GNFE-------KVLQHIPNRVTP--APGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTK
         N +       + L+H   ++ P  APG DG PALF+QKYW +VG K    CL ILN E S++++N+T I LIPKV  P +V+++RPISLC   YK++ K
Subjt:  GNFE-------KVLQHIPNRVTP--APGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTK

Query:  VIANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI-------------
         IANRLK VL  +I + QSAF+P+R I DN++   E ++ +    KGR    ALKLDM+KAYDRVEW F  A+M KLGFSA W+  +             
Subjt:  VIANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI-------------

Query:  -----------KGVYQR-PLSLSLLMGRRRGLSAML-GHVRSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKS
                   +G+ Q  PLS  L +    G S +L G  R   + G  +   A  ++HL FA DS++F+KA+ +     + +   YE  +GQ IN  KS
Subjt:  -----------KGVYQR-PLSLSLLMGRRRGLSAML-GHVRSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKS

Query:  RICFSKNVHRDT----------------------------------QYYLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKIS
         +  S N  R                                    Q+    + +  SGWK    S AGKEILIK+V+QAI TY+MSCFR+PKGL  +++
Subjt:  RICFSKNVHRDT----------------------------------QYYLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKIS

Query:  SLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLR
         + ARFWW     K+ IHW +W+ LCK K  GGL FRD E FNQAL+AKQ WRIL  PES VAR+ + +Y PS   LEAEV +NPS+ WRS  WG +LL 
Subjt:  SLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLR

Query:  SGLRKQIGDGQSVSIMNDPWIPRPYLFKVIGSKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPIS
         GLR ++G G S+ +  D W+P P  FK++         +V D  T++GQWNVP L+ + +D+++  I+ +P++
Subjt:  SGLRKQIGDGQSVSIMNDPWIPRPYLFKVIGSKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPIS

M5WJW2 Reverse transcriptase domain-containing protein6.0e-13837.55Show/hide
Query:  WSKIIYNNWN-----QGLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNT
        ++K+I   W      + + + L  C + LK W      N    +     +L  +  +          ++E  +  L  ++E+ W+QRSR  WLK GD+NT
Subjt:  WSKIIYNNWN-----QGLHHKLYKCGQALKDWGFRENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNT

Query:  KWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRV--------------------------TPAPGPDG
         +FH RAS R KRN + G+ D    WQTE  +I   F  YFK +F+S        E++L  +   +                          T APG DG
Subjt:  KWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVLQHIPNRV--------------------------TPAPGPDG

Query:  FPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNM
         PALF+QKYW +VG K    CL ILN E S++++N+T I LIPKV  P  V+++RPISLC   YK++ K IANRLK VL  +I + QSAF+P+R I DN+
Subjt:  FPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNM

Query:  IVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVYQR-PLSLSLLMGRRRG
        +   E ++ +    K R    ALKLDM+KAYDRVEW F  A+M KLGFSA W+  +                        +G+ Q  PLS  L +    G
Subjt:  IVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI------------------------KGVYQR-PLSLSLLMGRRRG

Query:  LSAML-GHVRSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDT-------------------
         S +L G  R   + G  +   A  ++HL FA DS++F+KA+ +A    + +   YE  +GQ IN  KS +  S N  R                     
Subjt:  LSAML-GHVRSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDT-------------------

Query:  ---------------QYYLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQ
                       Q+    + +  SGWK    S AGKEILIK+V+QAI TY+MSCF++PKGL  +++ + ARFWW     K+ IHW +W+ LCK K  
Subjt:  ---------------QYYLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQ

Query:  GGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWIPRPYLFKVIG
        GGL FRD E FNQAL+AKQ WRIL  PES VAR+ + +Y PS   LEAEV +NPS+ W S  WG +LL  G+R ++G G S+ +  D W+P P  FK++ 
Subjt:  GGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWIPRPYLFKVIG

Query:  SKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPIS
                +V D  T++GQWNVP L+ + +D+++  I+ +P++
Subjt:  SKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPIS

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.9e-2020.57Show/hide
Query:  QEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQF--------------
        QE+ ++  +L ++  ++ +     SR  + +  ++  +        +R++N I  +++ +G   T+  +I      Y+K ++ ++               
Subjt:  QEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQF--------------

Query:  --------------PSAGN-FEKVLQHIPNRVTPAPGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTD-YRPISL
                      P  G+    ++  +P +   +PGPDGF A FYQ+Y + +    +    +I         +   +I+LIPK        + +RPISL
Subjt:  --------------PSAGN-FEKVLQHIPNRVTPAPGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTD-YRPISL

Query:  CNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMIKGV
         N+  KI+ K++ANR+++ +  +I   Q  FIP      N+      +  +N R K +     + +D  KA+D+++  F    ++KLG    ++ +I+ +
Subjt:  CNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMIKGV

Query:  YQRPLSLSLLMGR-----------RRG--LSAMLGHV------RSI----GIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQ
        Y +P +  +L G+           R+G  LS +L ++      R+I     I G  +     K+S   FA D +++L+    +      ++ ++ + SG 
Subjt:  YQRPLSLSLLMGR-----------RRG--LSAMLGHV------RSI----GIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQ

Query:  CINLDKSRICFSKNVHRDTQY------------------------------------YLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSC--F
         IN+ KS+  F  N +R T+                                      L  I +  + WK    S  G+  ++K  +   + Y  +    
Subjt:  CINLDKSRICFSKNVHRDTQY------------------------------------YLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSC--F

Query:  RLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQ-GGLDFRDFEVFNQALIAKQAW
        +LP     ++     +F W    A      R  K +   K + GG+   DF+++ +A + K AW
Subjt:  RLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQ-GGLDFRDFEVFNQALIAKQAW

P08548 LINE-1 reverse transcriptase homolog7.0e-1921.99Show/hide
Query:  IKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFN
        +K   K  +  P P   +E+ ++  +L+++  +  +    +S+  + +  ++  K   +    +R ++ I  + +      T+ ++I K    Y+K +++
Subjt:  IKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFN

Query:  SQFPSAGNFEKVLQ--HIP----------NR---------------VTPAPGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNY-TNIVLIPK
         ++ +    ++ L+  H+P          NR                  +PGPDGF + FYQ + + +    + +    + +E  + +  Y  NI LIPK
Subjt:  SQFPSAGNFEKVLQ--HIP----------NR---------------VTPAPGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNY-TNIVLIPK

Query:  V-PNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIM
           +P    +YRPISL N+  KI+ K++ NR+++ +  II   Q  FIP      N+      +  +N      +    L +D  KA+D ++  F    +
Subjt:  V-PNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIM

Query:  DKLGFSAGWIDMIKGVYQRPL-----------SLSLLMGRRRG--LSAMLGHV----------RSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAF
         K+G    ++ +I+ +Y +P            S  L  G R+G  LS +L ++              I G  + S   K+S   FA D +++L+ + ++ 
Subjt:  DKLGFSAGWIDMIKGVYQRPL-----------SLSLLMGRRRG--LSAMLGHV----------RSIGIAGFSMNSSASKISHLFFAHDSLIFLKASAEAF

Query:  GFFKGIMVDYERASGQCINLDKS
             ++ +Y   SG  IN  KS
Subjt:  GFFKGIMVDYERASGQCINLDKS

P0C2F6 Putative ribonuclease H protein At1g657504.2e-2430.8Show/hide
Query:  KNVHRDT-QYYLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLDFRD
        K +++DT    L  +    SGW+    S AG+  L K+V+ ++  ++MS   LP+ +L+++  L   F WGST  K++ H  +W ++C PK +GGL  R 
Subjt:  KNVHRDT-QYYLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLDFRD

Query:  FEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAE---VRSNPSYFWRSFVWGL-DLLRSGLRKQIGDGQSVSIMNDPWIP-RPYLFKVIGSK
         +  N+ALI+K  WR+L +  S    VL+ KY    E+ ++     + + S  WRS   GL D++  G+    GDGQ +    D W+  +P L    G +
Subjt:  FEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAE---VRSNPSYFWRSFVWGL-DLLRSGLRKQIGDGQSVSIMNDPWIP-RPYLFKVIGSK

Query:  -VYCEDMKVSDCITATGQWNVPKL
           C+ +   D       W+  K+
Subjt:  -VYCEDMKVSDCITATGQWNVPKL

P11369 LINE-1 retrotransposable element ORF2 protein7.5e-2121.62Show/hide
Query:  QEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVL----
        QE+ +L  +++++     +    ++R  + +  ++  K         R +  I  + + +G   T+  +I     +++K +++++  +    +K L    
Subjt:  QEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQFPSAGNFEKVL----

Query:  ---------QHIPNRVTP--------------APGPDGFPALFYQKYWDVVGQKTVDDCLAILNR-EHSIK-------DWNYTNIVLIPK-VPNPRLVTD
                  H+ + ++P              +PGPDGF A FYQ +         +D + IL++  H I+        +    I LIPK   +P  + +
Subjt:  ---------QHIPNRVTP--------------APGPDGFPALFYQKYWDVVGQKTVDDCLAILNR-EHSIK-------DWNYTNIVLIPK-VPNPRLVTD

Query:  YRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWI
        +RPISL N+  KI+ K++ANR++  +  II   Q  FIP      N+      +H++N      +    + LD  KA+D+++  F   ++++ G    ++
Subjt:  YRPISLCNVSYKIVTKVIANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWI

Query:  DMIKGVYQRPL-----------SLSLLMGRRRG--LSAMLGHV------RSI----GIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDY
        +MIK +Y +P+           ++ L  G R+G  LS  L ++      R+I     I G  +     KIS L  A D ++++     +      ++  +
Subjt:  DMIKGVYQRPL-----------SLSLLMGRRRG--LSAMLGHV------RSI----GIAGFSMNSSASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDY

Query:  ERASGQCINLDKS-RICFSKN---------------VHRDTQYYLSSIVQMQSGWKRNFFSNAGKEI----------------LIKSVVQAILTYAMSCF
            G  IN +KS    ++KN               V  + +Y   ++ +         F +  KEI                 I  V  AIL  A+  F
Subjt:  ERASGQCINLDKS-RICFSKN---------------VHRDTQYYLSSIVQMQSGWKRNFFSNAGKEI----------------LIKSVVQAILTYAMSCF

Query:  -----RLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPK-AQGGLDFRDFEVFNQALIAKQAW
             ++P    +++     +F W   + K RI     K L K K   GG+   D +++ +A++ K AW
Subjt:  -----RLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPK-AQGGLDFRDFEVFNQALIAKQAW

P93295 Uncharacterized mitochondrial protein AtMg003102.7e-3447.55Show/hide
Query:  AILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKA-QGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLE
        A+  YAMSCFRL K L  K++S    FWW S + K++I W  W++LCK K   GGL FRD   FNQAL+AKQ++RI+ +P + ++R+L+ +YFP S ++E
Subjt:  AILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKA-QGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLE

Query:  AEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWI
          V + PSY WRS + G +LL  GL + IGDG    +  D WI
Subjt:  AEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein9.1e-1426.48Show/hide
Query:  KCGQALKDWGF----RENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGV
        KC + L   GF     + K    S+  I+ +L      P    F+  H   ++ +      E +++Q+SR  WL+ GD NT++FH      + +N I  +
Subjt:  KCGQALKDWGF----RENKNRWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGV

Query:  EDVEGRWQTEANQIHKAFEAYF-------------------KDIFN-----------SQFPSAGNFEKVLQHIPNRVTPAPGPDGFPALFYQKYWDVVGQ
           +        Q+ +   AY+                   KDI             S  PS       +  +P     APGPD F A F+ + W VV  
Subjt:  EDVEGRWQTEANQIHKAFEAYF-------------------KDIFN-----------SQFPSAGNFEKVLQHIPNRVTPAPGPDGFPALFYQKYWDVVGQ

Query:  KTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVT
         T+          H +K +N T I LIPKV     ++ +RP+S C V YKI+T
Subjt:  KTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVT

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.1e-1032.94Show/hide
Query:  WKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLDFRDFEVFNQ
        W     S AG+  LI SV+ ++  + MS FRLP   + +I S+C+ F W   +   +     W ++C PK +GGL  R  +  N+
Subjt:  WKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLDFRDFEVFNQ

AT4G20520.1 RNA binding;RNA-directed DNA polymerases4.7e-1036.05Show/hide
Query:  IANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI
        +  RLK ++ ++I   Q++FIP R  +DN++   E +H +  R KG +G+  LKLD+ KAYDR+ W +    +   GF   W+  I
Subjt:  IANRLKRVLDDIIDDCQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMI

AT4G29090.1 Ribonuclease H-like superfamily protein3.6e-3435.68Show/hide
Query:  AILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEA
        A+ TY M+CF LPK +  +I S+ A FWW +    + +HW+ W  L   KA+GG+ F+D E FN AL+ KQ WR+L +PES +A+V K +YF  S+ L A
Subjt:  AILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEA

Query:  EVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWI-PRPYLFKVIGSKVYCED-------MKVSDCITATGQWNVPKLQQLLFDEDIQEIVG
         + S PS+ W+S     ++LR G R  +G+G+ + I    W+  +P    +   +V  ++       +KVSD I  +G+     + ++LF E  ++++G
Subjt:  EVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWI-PRPYLFKVIGSKVYCED-------MKVSDCITATGQWNVPKLQQLLFDEDIQEIVG

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.9e-3547.55Show/hide
Query:  AILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKA-QGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLE
        A+  YAMSCFRL K L  K++S    FWW S + K++I W  W++LCK K   GGL FRD   FNQAL+AKQ++RI+ +P + ++R+L+ +YFP S ++E
Subjt:  AILTYAMSCFRLPKGLLDKISSLCARFWWGSTDAKQRIHWRRWKELCKPKA-QGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLE

Query:  AEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWI
          V + PSY WRS + G +LL  GL + IGDG    +  D WI
Subjt:  AEVRSNPSYFWRSFVWGLDLLRSGLRKQIGDGQSVSIMNDPWI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCTAGGGTTTTTTAGAAGTTCGGAGGCGTTTTGGGACAAACCAGGCAAATCCGGGGCGGCTAGAGGCTACAGGGACCAGACGGGGCCCGACGGGCCGAGGCCGAG
CAGGGGAGGCCCGACCCCTTCGGTCTTGGCCCGTCCCGCTCGCCAGTTTTGCCTCTTGGGTCTATCTTTCGGTCTGATTTCTGCTCGGTTGTCCTTGTCAGCTCTTTGTG
CATCGGGGTGGTCCAAAATTATCTATAACAATTGGAATCAAGGTCTTCATCACAAGTTGTATAAGTGTGGCCAAGCTTTGAAGGATTGGGGATTTAGGGAAAACAAGAAT
AGATGGGCTAGCATCCGCCTTATAAAGGATAAACTTAAGTTAGTTTATGATAAGCCTTTGCCAATTAATTTTCAGGAGGTGCATAGATTGGAGCGTCAACTGGACAAATT
ATTTTTGGAGGAAGAGGTGTACTGGAAGCAGCGGTCACGAGAAAACTGGCTTAAGTGGGGAGATCGGAATACAAAGTGGTTTCATCACCGTGCCTCTTTTCGAAGGAAAC
GGAATTCTATTCTAGGCGTGGAGGATGTGGAAGGGCGGTGGCAAACGGAAGCTAATCAAATTCACAAGGCCTTTGAAGCATACTTTAAAGATATTTTCAACTCTCAATTC
CCCTCTGCGGGAAACTTTGAGAAAGTATTACAACATATTCCCAATCGAGTCACGCCAGCCCCGGGGCCGGATGGTTTCCCAGCTTTGTTCTATCAAAAATACTGGGACGT
CGTGGGACAAAAAACAGTGGACGATTGCCTTGCCATCTTGAATCGGGAGCATTCTATCAAGGATTGGAATTACACAAACATCGTTCTCATTCCAAAAGTTCCAAATCCAA
GGTTAGTGACTGACTACCGTCCAATTAGTCTGTGCAATGTTTCTTATAAGATTGTCACTAAAGTGATTGCTAATCGTCTGAAAAGAGTACTTGATGATATAATTGATGAC
TGTCAATCTGCATTCATACCTAGTCGTTCCATTTCGGATAATATGATTGTTGGGCATGAATTCCTTCATTTTCTTAACTCTCGAACGAAAGGTCGTCAAGGTTTTGCTGC
TTTGAAGCTTGATATGAGTAAGGCCTATGATCGTGTCGAGTGGTCATTCTTCTGTGCGATTATGGATAAACTGGGTTTCTCTGCTGGGTGGATTGATATGATCAAGGGTG
TATATCAACGGCCTCTTTCTCTATCCTTATTAATGGGGAGGCGAAGGGGTTTATCTGCGATGCTTGGTCATGTAAGGAGCATTGGTATTGCAGGCTTTTCTATGAACTCG
TCTGCATCGAAAATATCTCATTTATTCTTCGCTCATGATAGCCTCATCTTCCTGAAGGCTTCGGCTGAGGCATTTGGTTTTTTTAAAGGCATCATGGTGGATTATGAACG
TGCATCTGGTCAATGTATTAATTTGGATAAATCTCGGATCTGTTTTTCTAAGAATGTGCATCGCGATACTCAATACTATCTCAGTTCAATAGTCCAAATGCAGTCGGGAT
GGAAGAGGAATTTTTTTTCCAATGCGGGCAAGGAAATTCTTATCAAAAGTGTGGTTCAGGCGATTCTGACATATGCCATGAGCTGTTTTCGTCTTCCCAAAGGCCTCTTG
GACAAGATTTCTTCCTTATGTGCCAGGTTCTGGTGGGGTTCGACAGATGCAAAACAACGTATTCATTGGAGGAGGTGGAAGGAGCTATGTAAACCTAAGGCACAAGGGGG
TCTAGATTTTCGAGATTTTGAGGTATTCAATCAGGCTCTGATTGCAAAACAAGCTTGGAGGATTTTAATCAAGCCCGAGTCTACAGTAGCTCGAGTCCTTAAAGGAAAGT
ATTTTCCTTCCTCAGAACTATTAGAGGCAGAGGTACGGTCCAATCCATCGTATTTCTGGAGAAGTTTTGTTTGGGGGCTGGACCTCTTGAGGTCAGGACTTCGAAAGCAA
ATTGGTGATGGTCAATCTGTGAGTATTATGAATGACCCTTGGATTCCTCGTCCATATTTATTTAAAGTGATTGGTAGCAAAGTATATTGCGAGGATATGAAGGTTTCAGA
TTGCATCACAGCGACGGGACAATGGAATGTTCCTAAACTTCAGCAACTTCTCTTTGATGAGGACATACAGGAAATTGTAGGTCTCCCAATTAGTATGTCGATGTCAGATA
GATGGATGTTGCCTTGTGCAAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCCTAGGGTTTTTTAGAAGTTCGGAGGCGTTTTGGGACAAACCAGGCAAATCCGGGGCGGCTAGAGGCTACAGGGACCAGACGGGGCCCGACGGGCCGAGGCCGAG
CAGGGGAGGCCCGACCCCTTCGGTCTTGGCCCGTCCCGCTCGCCAGTTTTGCCTCTTGGGTCTATCTTTCGGTCTGATTTCTGCTCGGTTGTCCTTGTCAGCTCTTTGTG
CATCGGGGTGGTCCAAAATTATCTATAACAATTGGAATCAAGGTCTTCATCACAAGTTGTATAAGTGTGGCCAAGCTTTGAAGGATTGGGGATTTAGGGAAAACAAGAAT
AGATGGGCTAGCATCCGCCTTATAAAGGATAAACTTAAGTTAGTTTATGATAAGCCTTTGCCAATTAATTTTCAGGAGGTGCATAGATTGGAGCGTCAACTGGACAAATT
ATTTTTGGAGGAAGAGGTGTACTGGAAGCAGCGGTCACGAGAAAACTGGCTTAAGTGGGGAGATCGGAATACAAAGTGGTTTCATCACCGTGCCTCTTTTCGAAGGAAAC
GGAATTCTATTCTAGGCGTGGAGGATGTGGAAGGGCGGTGGCAAACGGAAGCTAATCAAATTCACAAGGCCTTTGAAGCATACTTTAAAGATATTTTCAACTCTCAATTC
CCCTCTGCGGGAAACTTTGAGAAAGTATTACAACATATTCCCAATCGAGTCACGCCAGCCCCGGGGCCGGATGGTTTCCCAGCTTTGTTCTATCAAAAATACTGGGACGT
CGTGGGACAAAAAACAGTGGACGATTGCCTTGCCATCTTGAATCGGGAGCATTCTATCAAGGATTGGAATTACACAAACATCGTTCTCATTCCAAAAGTTCCAAATCCAA
GGTTAGTGACTGACTACCGTCCAATTAGTCTGTGCAATGTTTCTTATAAGATTGTCACTAAAGTGATTGCTAATCGTCTGAAAAGAGTACTTGATGATATAATTGATGAC
TGTCAATCTGCATTCATACCTAGTCGTTCCATTTCGGATAATATGATTGTTGGGCATGAATTCCTTCATTTTCTTAACTCTCGAACGAAAGGTCGTCAAGGTTTTGCTGC
TTTGAAGCTTGATATGAGTAAGGCCTATGATCGTGTCGAGTGGTCATTCTTCTGTGCGATTATGGATAAACTGGGTTTCTCTGCTGGGTGGATTGATATGATCAAGGGTG
TATATCAACGGCCTCTTTCTCTATCCTTATTAATGGGGAGGCGAAGGGGTTTATCTGCGATGCTTGGTCATGTAAGGAGCATTGGTATTGCAGGCTTTTCTATGAACTCG
TCTGCATCGAAAATATCTCATTTATTCTTCGCTCATGATAGCCTCATCTTCCTGAAGGCTTCGGCTGAGGCATTTGGTTTTTTTAAAGGCATCATGGTGGATTATGAACG
TGCATCTGGTCAATGTATTAATTTGGATAAATCTCGGATCTGTTTTTCTAAGAATGTGCATCGCGATACTCAATACTATCTCAGTTCAATAGTCCAAATGCAGTCGGGAT
GGAAGAGGAATTTTTTTTCCAATGCGGGCAAGGAAATTCTTATCAAAAGTGTGGTTCAGGCGATTCTGACATATGCCATGAGCTGTTTTCGTCTTCCCAAAGGCCTCTTG
GACAAGATTTCTTCCTTATGTGCCAGGTTCTGGTGGGGTTCGACAGATGCAAAACAACGTATTCATTGGAGGAGGTGGAAGGAGCTATGTAAACCTAAGGCACAAGGGGG
TCTAGATTTTCGAGATTTTGAGGTATTCAATCAGGCTCTGATTGCAAAACAAGCTTGGAGGATTTTAATCAAGCCCGAGTCTACAGTAGCTCGAGTCCTTAAAGGAAAGT
ATTTTCCTTCCTCAGAACTATTAGAGGCAGAGGTACGGTCCAATCCATCGTATTTCTGGAGAAGTTTTGTTTGGGGGCTGGACCTCTTGAGGTCAGGACTTCGAAAGCAA
ATTGGTGATGGTCAATCTGTGAGTATTATGAATGACCCTTGGATTCCTCGTCCATATTTATTTAAAGTGATTGGTAGCAAAGTATATTGCGAGGATATGAAGGTTTCAGA
TTGCATCACAGCGACGGGACAATGGAATGTTCCTAAACTTCAGCAACTTCTCTTTGATGAGGACATACAGGAAATTGTAGGTCTCCCAATTAGTATGTCGATGTCAGATA
GATGGATGTTGCCTTGTGCAAAGTGA
Protein sequenceShow/hide protein sequence
MLLGFFRSSEAFWDKPGKSGAARGYRDQTGPDGPRPSRGGPTPSVLARPARQFCLLGLSFGLISARLSLSALCASGWSKIIYNNWNQGLHHKLYKCGQALKDWGFRENKN
RWASIRLIKDKLKLVYDKPLPINFQEVHRLERQLDKLFLEEEVYWKQRSRENWLKWGDRNTKWFHHRASFRRKRNSILGVEDVEGRWQTEANQIHKAFEAYFKDIFNSQF
PSAGNFEKVLQHIPNRVTPAPGPDGFPALFYQKYWDVVGQKTVDDCLAILNREHSIKDWNYTNIVLIPKVPNPRLVTDYRPISLCNVSYKIVTKVIANRLKRVLDDIIDD
CQSAFIPSRSISDNMIVGHEFLHFLNSRTKGRQGFAALKLDMSKAYDRVEWSFFCAIMDKLGFSAGWIDMIKGVYQRPLSLSLLMGRRRGLSAMLGHVRSIGIAGFSMNS
SASKISHLFFAHDSLIFLKASAEAFGFFKGIMVDYERASGQCINLDKSRICFSKNVHRDTQYYLSSIVQMQSGWKRNFFSNAGKEILIKSVVQAILTYAMSCFRLPKGLL
DKISSLCARFWWGSTDAKQRIHWRRWKELCKPKAQGGLDFRDFEVFNQALIAKQAWRILIKPESTVARVLKGKYFPSSELLEAEVRSNPSYFWRSFVWGLDLLRSGLRKQ
IGDGQSVSIMNDPWIPRPYLFKVIGSKVYCEDMKVSDCITATGQWNVPKLQQLLFDEDIQEIVGLPISMSMSDRWMLPCAK