; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010203 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010203
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:45388649..45392724
RNA-Seq ExpressionLag0010203
SyntenyLag0010203
Gene Ontology termsGO:0050789 - regulation of biological process (biological process)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera]4.6e-25240.99Show/hide
Query:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGD
        ++ETK    D   + S+W++ +  W++L + GASGGILI+W     + +E + G FSVSI   +    S WL+ +YGP+    R DFW EL D+AGL   
Subjt:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGD

Query:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGD
        RW +GGDFNV R S EK  G   T SM+ F+ +I+  +L+++PL++  FTWS+   N     +DRFL S +    F  S    L R TSDH+P+ L    
Subjt:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGD

Query:  INWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALS-QLPSLITQLKQLDNLE-DNGLTLSQKESRRILREQ
          WGP PFRFEN WLQ  SF+E    WW +    GW GH FM KL+ +K +L+ WNK+    LS +   +++ L   D+LE + GL+      R + + +
Subjt:  INWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALS-QLPSLITQLKQLDNLE-DNGLTLSQKESRRILREQ

Query:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALE
        +E+L  +E I W+Q+ ++KW+KEGD NS+FFH++   R+ +  I E+ +  G+ + ++  I++E + ++  L+T  +   +    +DW PIS   +  LE
Subjt:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALE

Query:  VAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRL
           +EEE+F A+  +   K+PGPDG+T   F+  W  IK+DL+ +  +F+ +GIIN + N ++I L+PK+  S+ + D+RPISLI+  YKIIA+VL+ R+
Subjt:  VAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRL

Query:  KHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKII
        + VL  TI   Q AFV  RQILDA LIANE++D+ + S + GVV K+D EKA+D V WDFLD VL  KGFG  WRKW+RGC+SSV++++++NG  +G + 
Subjt:  KHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKII

Query:  PSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDD
         SRG+RQGDPLSPFLF +V+D LSR+L  +     +    VG +   ++HLQFADDT+ FSS   + +  L  V+ +F   SGL VNL KS I GI+++ 
Subjt:  PSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDD

Query:  TELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEG
          L  +A    CK   WP  YLGLPLGGNPK S FW PVIE+I  +L  W+ A++S GGR TLIQ+ L+ MP YFLSLFK+P+ VA  ++++ RDF W G
Subjt:  TELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEG

Query:  SQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKY---YNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIH
               H VNWD    PK  GGLG G   +RN ALL KW+WR+  E  +LW  +I++ Y    N  DV +    I R   + PW+ I     + +    
Subjt:  SQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKY---YNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIH

Query:  RIIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNSTES-AWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RNMNDSWSWPLEASKLFSV
         ++GNG    FW D W     L   +PRL R+ T+ NA ++ +   T   +WN + RR+L++ EI +   L   L  + +  ++ D  SW L  S LF+V
Subjt:  RIIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNSTES-AWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RNMNDSWSWPLEASKLFSV

Query:  KSLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEA
        KS  + L   S +        +W    P K+K F+W ++   +NT D LQ R P+ +LSP  C +C  + E   HLF+ CS     W+ + ++
Subjt:  KSLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEA

RVW16209.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]4.1e-24540.72Show/hide
Query:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFS-FWLTNIYGPSRREFRVDFWKELHDLAGLGG
        +QETK  + D   + S+W+  +  W +L + GASGGILI+W     + +E + G FSVS+  F  DG    W++ +YGP+    R DFW EL D+ GL  
Subjt:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFS-FWLTNIYGPSRREFRVDFWKELHDLAGLGG

Query:  DRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFG
          W +GGDFNV R S EK  G  +T SMR F+ +I+  +LL+ PL+N  FTWS+  ++P    +DRFL S +    F       L R TSDH+P+ +   
Subjt:  DRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFG

Query:  DINWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALSQLPSLITQLKQLDNLEDNGLTLSQKESRRILREQI
           WGP PFRFEN WLQ  +F+E   +WW+     GW GH FM +L+ +K +L+ WNKS                            S  E +   + ++
Subjt:  DINWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALSQLPSLITQLKQLDNLEDNGLTLSQKESRRILREQI

Query:  EDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALEV
        E+L  +E I W+Q+ K+KW+KEGD NSKF+H++   R+ +  I E+ +  G+ L +A  I +E + ++  L+T      +    +DW PISE  +  LE 
Subjt:  EDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALEV

Query:  AISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRLK
          +EEE+  A+  L   K+PGPDG+T   F+  W+ IK+DL+ +  +F+ +GIIN + N ++I LIPK+  SK + D+RPISLI+  YKIIA+VLS RL+
Subjt:  AISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRLK

Query:  HVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIP
         VL  TI   Q AFV  RQILDA LIANE++D+ + S + GVV K+D EKA+D V WDFLD +L  KGF   WRKW+ GC+SSV+++I++NG  +G +  
Subjt:  HVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIP

Query:  SRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDDT
        SRG+RQGDPLSPFLF LV+D LSR+L  +     +    VG +   ++HLQFADD + FS+   + L  L  ++ +F    GL VNL KS I GI++D  
Subjt:  SRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDDT

Query:  ELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGS
         L  +A    CK   WP  YLGLPLGGNPK+  FW PV+E+I  +L  W+ A++S GGR TLIQ+ L+ +P+YFLSLFK+P+ VA  +++L RDF W G 
Subjt:  ELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGS

Query:  QGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIHRIIG
              H V WD    PK +GGLG+GN   RN ALL KW+WR+  E  +LW  +I++  Y S         + R   + PW+ I     + +     + G
Subjt:  QGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIHRIIG

Query:  NGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWN-STESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLR-NMNDSWSWPLEASKLFSVKSLM
        NG    FW+D W     L   +PRL+R+  + N  ++ V   S    WNL+ RR+L++ EI +   L   L  + L  ++ D+  WPL +S LFSVKS  
Subjt:  NGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWN-STESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLR-NMNDSWSWPLEASKLFSVKSLM

Query:  VDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLE
        + L   S ++ N     +W    P K+K F+W ++   +NT D LQ R P+ +LSP  C++C  + E+  HLF+ CS     W+ + +
Subjt:  VDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLE

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]5.2e-25641.85Show/hide
Query:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGD
        +QETK  + D   + S+W    + W++L + GASGGI+I+W        E + G FSV++     +  SFWLT++YGP    +R DFW EL DL GL   
Subjt:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGD

Query:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGD
        RW +GGDFNV R   EK     +T +MR F+++I    L++ PL+N  FTWS+   +P    +DRFL S +    F  S    L R TSDH P+ L    
Subjt:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGD

Query:  INWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALSQLPSLI-TQLKQLDNLEDNG-LTLSQKESRRILREQ
        + WGP PFRFEN WL    F+E    WW + + +GW GH FM KLK +K++L+ WN      L +   LI T L ++D +E  G L       R + R +
Subjt:  INWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALSQLPSLI-TQLKQLDNLEDNG-LTLSQKESRRILREQ

Query:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALE
        +ED+  +E +QW+Q+ ++KW+KEGD NSKFFHR+   R+ +  I  ++S  G +L +  DI +E V+F+  L++K     +    +DW PIS      L+
Subjt:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALE

Query:  VAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRL
           +EEEV  A+  L   K+PGPDG+T   ++  W+ IK+DLM +  +F+  G+IN + N T+I L+PK+  S  + DYRPISL++  YKIIA+VLS RL
Subjt:  VAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRL

Query:  KHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKII
        + VL  TI+D+Q AFV  R ILDA LIANE++D+ + S + G+V K+D EKA+D VDW FLD VL  KGF   WR WIRGC+SS +++I++NG  +G + 
Subjt:  KHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKII

Query:  PSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDD
         SRG+RQGDPLSPFLF LV+D LSR+L  +   G      VG     ++ LQFADDT+ FS    + L  L  ++ +F   SGL +NL KS I GI+   
Subjt:  PSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDD

Query:  TELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEG
          L  +A+ F C+   WP +YLGLPLGGNPK   FW PV+E+I  +L  WK A++S GGR TLIQ+ LS +P+YFLSLFK+P+ +A  ++K+ R+F W G
Subjt:  TELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEG

Query:  SQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIHRII
        +      H V W+    PK LGGLG G   LRN ALL KW+WRF  E   LW   +I   Y +         + R   + PW+ I     + +  +  ++
Subjt:  SQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIHRII

Query:  GNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSTESAWNLSPRRHLNEFEIIEWANLSYLLSPIR-LRNMNDSWSWPLEASKLFSVKSL
        GNG    FW+D W     L + F  LYR+ +  N  V+ V  NS   AWNL+ RR+L + EI     L   LS +R   ++ DS +W L +S LF+VKS 
Subjt:  GNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSTESAWNLSPRRHLNEFEIIEWANLSYLLSPIR-LRNMNDSWSWPLEASKLFSVKSL

Query:  MVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEAFG
         + L   SN  L      +W    P K+K   W ++ G +NT D+LQ R P+ SL P WC++C  N E+  HLF+ C      WN + +  G
Subjt:  MVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEAFG

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]3.5e-25241.12Show/hide
Query:  QETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGDR
        QETK    D   + S+W++ +  W++L + GASGGILI+W     + +E + G FSVSI   +    S WL+ +YGP+    R D W EL D+AGL   R
Subjt:  QETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGDR

Query:  WILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDI
        W +GGDFNV R S EK  G  +T SM+ F+ +I+  +L+++PL++  FTWS+   NP    +DRFL S +    F  S    L R TSDH+P+ L     
Subjt:  WILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDI

Query:  NWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALS-QLPSLITQLKQLDNLE-DNGLTLSQKESRRILREQI
         WGP PFRFEN WLQ  SF+E    WW +    GW GH FM KL+ +K +L+ WNK+    LS +   +++ L   D+LE + GL+      R I + ++
Subjt:  NWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALS-QLPSLITQLKQLDNLE-DNGLTLSQKESRRILREQI

Query:  EDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALEV
        E+L  +E I W+Q+ ++KW+KEGD NSKFFH++   R+ +  I E+ +  G  + ++  I++E + ++  L+T  +   +    +DW PIS   +  LE 
Subjt:  EDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALEV

Query:  AISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRLK
          +EEE+  A+  +   K+PGPDG+T   F+  W  IK+DL+ +  +F+ +GIIN + N ++I L+PK+  S+ + D+RPISLI+  YKIIA+VL+ R++
Subjt:  AISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRLK

Query:  HVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIP
         VL  TI   Q AFV  RQILDA LIANE++D+ + S + GVV K+D EKA+D V WDFLD V+  KGFG  WRKW+RGC+SSV++++++NG  +G +  
Subjt:  HVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIP

Query:  SRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDDT
        SRG+RQGDPLSPFLF +V+D LSR+L  +     +    VG +   ++HLQFADDT+ FSS   + +  L  V+ +F   SGL VNL KS I GI+++  
Subjt:  SRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDDT

Query:  ELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGS
         L  +A    CK   WP  YLGLPLGGNPK S FW PVIE+I  +L  W+ A++S GGR TLIQ+ L+ MP YFLSLFK+P+ VA  ++++ RDF W G 
Subjt:  ELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGS

Query:  QGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKY---YNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIHR
              H VNWD    PK  GGLG G   +RN ALL KW+WR+  E  +LW  +I++ Y    N  DV +    I R   + PW+ I     + +     
Subjt:  QGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKY---YNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIHR

Query:  IIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNSTES-AWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RNMNDSWSWPLEASKLFSVK
        ++GNG    FW D W     L   +PRL R+ T+ NA ++ +  ST   +WN + RR+L++ EI +   L      + +  ++ D  SW L +S LF+VK
Subjt:  IIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNSTES-AWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RNMNDSWSWPLEASKLFSVK

Query:  SLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEA
        S  + L   S +        +W    P K+K F+W ++   +NT D LQ R P+ +LSP  C +C  + E   HLF+ CS     W+ + ++
Subjt:  SLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEA

RVX11537.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.3e-24340.66Show/hide
Query:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGD
        +QETK  + D   + S+W    + W++L + GA GGI+I+W    F   E + G FSV++     +  S WLT++YGP    +R DFW EL DL GL   
Subjt:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGD

Query:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGD
        RW +GGDFNV R   EK     +T +MR F+++I    LL+ PL+N  FTWS+   +P    +DRFL S +    F  S    L R TSDH P+ L    
Subjt:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGD

Query:  INWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALSQLPSLI-TQLKQLDNLEDNG-LTLSQKESRRILREQ
        + WGP PFRFEN WL    F+E    WW + + +GW GH FM KLK +K++L+ WN      L +   LI T L ++D +E  G L       R + R +
Subjt:  INWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALSQLPSLI-TQLKQLDNLEDNG-LTLSQKESRRILREQ

Query:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALE
        +ED+  +E +QW+Q+ K+KW+KEGD NSKFFHR+                                              +    +DW PIS+     L+
Subjt:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALE

Query:  VAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRL
           +EEEV  A+  L   K+PGPDG+T   ++  W+ IK+DLM +  +F+  G+IN + N T+I L+PK+  S  + DYRPISL++  YKIIA+VLS RL
Subjt:  VAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRL

Query:  KHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKII
        + VL  TI+D+Q AFV  R ILDA LIANE++D+ + S + G+V K+D EKA+D VDW FLD VL  KGF   WR WIRGC+SS +++I++NG  +G + 
Subjt:  KHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKII

Query:  PSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDD
         SRG+RQGDPLSPFLF LV+D LSR+L  +   G      VG     ++ LQFADDT+ FS    + L  L  ++ +F   SGL +N+ KS I GI+   
Subjt:  PSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDD

Query:  TELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEG
          L  +A+ F C+   WP +YLGLPLGGNPK   FW PV+E+I  +L  WK A++S GGR TLIQ+ LS +P+YFLSLFK+P+ +A  ++K+ R+F W  
Subjt:  TELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEG

Query:  SQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIHRII
        +      H V W+    PK LGGLG G   LRN ALL KW+WRF  E   LW   +I   Y +         + R   + PW+ I     + +  +  ++
Subjt:  SQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIHRII

Query:  GNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSTESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLR-NMNDSWSWPLEASKLFSVKSL
        GNG    FW+D W     L + F  LYR+ +  N  V+ V  NS   AWNL+ RR+L + EI     L   LS +R   ++ DS +W L +S LF+VKS 
Subjt:  GNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSTESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLR-NMNDSWSWPLEASKLFSVKSL

Query:  MVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEAFG
         + L   SN  L      +W    P K+K   W ++ G +NT D+LQ R P+ SL P WC++C  N E+  HLF+ C      WN + +  G
Subjt:  MVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEAFG

TrEMBL top hitse value%identityAlignment
A0A438FWU5 LINE-1 retrotransposable element ORF2 protein2.5e-25641.85Show/hide
Query:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGD
        +QETK  + D   + S+W    + W++L + GASGGI+I+W        E + G FSV++     +  SFWLT++YGP    +R DFW EL DL GL   
Subjt:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGD

Query:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGD
        RW +GGDFNV R   EK     +T +MR F+++I    L++ PL+N  FTWS+   +P    +DRFL S +    F  S    L R TSDH P+ L    
Subjt:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGD

Query:  INWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALSQLPSLI-TQLKQLDNLEDNG-LTLSQKESRRILREQ
        + WGP PFRFEN WL    F+E    WW + + +GW GH FM KLK +K++L+ WN      L +   LI T L ++D +E  G L       R + R +
Subjt:  INWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALSQLPSLI-TQLKQLDNLEDNG-LTLSQKESRRILREQ

Query:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALE
        +ED+  +E +QW+Q+ ++KW+KEGD NSKFFHR+   R+ +  I  ++S  G +L +  DI +E V+F+  L++K     +    +DW PIS      L+
Subjt:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALE

Query:  VAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRL
           +EEEV  A+  L   K+PGPDG+T   ++  W+ IK+DLM +  +F+  G+IN + N T+I L+PK+  S  + DYRPISL++  YKIIA+VLS RL
Subjt:  VAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRL

Query:  KHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKII
        + VL  TI+D+Q AFV  R ILDA LIANE++D+ + S + G+V K+D EKA+D VDW FLD VL  KGF   WR WIRGC+SS +++I++NG  +G + 
Subjt:  KHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKII

Query:  PSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDD
         SRG+RQGDPLSPFLF LV+D LSR+L  +   G      VG     ++ LQFADDT+ FS    + L  L  ++ +F   SGL +NL KS I GI+   
Subjt:  PSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDD

Query:  TELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEG
          L  +A+ F C+   WP +YLGLPLGGNPK   FW PV+E+I  +L  WK A++S GGR TLIQ+ LS +P+YFLSLFK+P+ +A  ++K+ R+F W G
Subjt:  TELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEG

Query:  SQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIHRII
        +      H V W+    PK LGGLG G   LRN ALL KW+WRF  E   LW   +I   Y +         + R   + PW+ I     + +  +  ++
Subjt:  SQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIHRII

Query:  GNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSTESAWNLSPRRHLNEFEIIEWANLSYLLSPIR-LRNMNDSWSWPLEASKLFSVKSL
        GNG    FW+D W     L + F  LYR+ +  N  V+ V  NS   AWNL+ RR+L + EI     L   LS +R   ++ DS +W L +S LF+VKS 
Subjt:  GNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEV-WNSTESAWNLSPRRHLNEFEIIEWANLSYLLSPIR-LRNMNDSWSWPLEASKLFSVKSL

Query:  MVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEAFG
         + L   SN  L      +W    P K+K   W ++ G +NT D+LQ R P+ SL P WC++C  N E+  HLF+ C      WN + +  G
Subjt:  MVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEAFG

A0A438GDE7 LINE-1 retrotransposable element ORF2 protein1.7e-25241.12Show/hide
Query:  QETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGDR
        QETK    D   + S+W++ +  W++L + GASGGILI+W     + +E + G FSVSI   +    S WL+ +YGP+    R D W EL D+AGL   R
Subjt:  QETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGDR

Query:  WILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDI
        W +GGDFNV R S EK  G  +T SM+ F+ +I+  +L+++PL++  FTWS+   NP    +DRFL S +    F  S    L R TSDH+P+ L     
Subjt:  WILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDI

Query:  NWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALS-QLPSLITQLKQLDNLE-DNGLTLSQKESRRILREQI
         WGP PFRFEN WLQ  SF+E    WW +    GW GH FM KL+ +K +L+ WNK+    LS +   +++ L   D+LE + GL+      R I + ++
Subjt:  NWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALS-QLPSLITQLKQLDNLE-DNGLTLSQKESRRILREQI

Query:  EDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALEV
        E+L  +E I W+Q+ ++KW+KEGD NSKFFH++   R+ +  I E+ +  G  + ++  I++E + ++  L+T  +   +    +DW PIS   +  LE 
Subjt:  EDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALEV

Query:  AISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRLK
          +EEE+  A+  +   K+PGPDG+T   F+  W  IK+DL+ +  +F+ +GIIN + N ++I L+PK+  S+ + D+RPISLI+  YKIIA+VL+ R++
Subjt:  AISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRLK

Query:  HVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIP
         VL  TI   Q AFV  RQILDA LIANE++D+ + S + GVV K+D EKA+D V WDFLD V+  KGFG  WRKW+RGC+SSV++++++NG  +G +  
Subjt:  HVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIP

Query:  SRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDDT
        SRG+RQGDPLSPFLF +V+D LSR+L  +     +    VG +   ++HLQFADDT+ FSS   + +  L  V+ +F   SGL VNL KS I GI+++  
Subjt:  SRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDDT

Query:  ELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGS
         L  +A    CK   WP  YLGLPLGGNPK S FW PVIE+I  +L  W+ A++S GGR TLIQ+ L+ MP YFLSLFK+P+ VA  ++++ RDF W G 
Subjt:  ELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGS

Query:  QGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKY---YNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIHR
              H VNWD    PK  GGLG G   +RN ALL KW+WR+  E  +LW  +I++ Y    N  DV +    I R   + PW+ I     + +     
Subjt:  QGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKY---YNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIHR

Query:  IIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNSTES-AWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RNMNDSWSWPLEASKLFSVK
        ++GNG    FW D W     L   +PRL R+ T+ NA ++ +  ST   +WN + RR+L++ EI +   L      + +  ++ D  SW L +S LF+VK
Subjt:  IIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNSTES-AWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RNMNDSWSWPLEASKLFSVK

Query:  SLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEA
        S  + L   S +        +W    P K+K F+W ++   +NT D LQ R P+ +LSP  C +C  + E   HLF+ CS     W+ + ++
Subjt:  SLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEA

A5BCI7 Reverse transcriptase domain-containing protein2.2e-25240.99Show/hide
Query:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGD
        ++ETK    D   + S+W++ +  W++L + GASGGILI+W     + +E + G FSVSI   +    S WL+ +YGP+    R DFW EL D+AGL   
Subjt:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGD

Query:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGD
        RW +GGDFNV R S EK  G   T SM+ F+ +I+  +L+++PL++  FTWS+   N     +DRFL S +    F  S    L R TSDH+P+ L    
Subjt:  RWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGD

Query:  INWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALS-QLPSLITQLKQLDNLE-DNGLTLSQKESRRILREQ
          WGP PFRFEN WLQ  SF+E    WW +    GW GH FM KL+ +K +L+ WNK+    LS +   +++ L   D+LE + GL+      R + + +
Subjt:  INWGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALS-QLPSLITQLKQLDNLE-DNGLTLSQKESRRILREQ

Query:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALE
        +E+L  +E I W+Q+ ++KW+KEGD NS+FFH++   R+ +  I E+ +  G+ + ++  I++E + ++  L+T  +   +    +DW PIS   +  LE
Subjt:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALE

Query:  VAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRL
           +EEE+F A+  +   K+PGPDG+T   F+  W  IK+DL+ +  +F+ +GIIN + N ++I L+PK+  S+ + D+RPISLI+  YKIIA+VL+ R+
Subjt:  VAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRL

Query:  KHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKII
        + VL  TI   Q AFV  RQILDA LIANE++D+ + S + GVV K+D EKA+D V WDFLD VL  KGFG  WRKW+RGC+SSV++++++NG  +G + 
Subjt:  KHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKII

Query:  PSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDD
         SRG+RQGDPLSPFLF +V+D LSR+L  +     +    VG +   ++HLQFADDT+ FSS   + +  L  V+ +F   SGL VNL KS I GI+++ 
Subjt:  PSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDD

Query:  TELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEG
          L  +A    CK   WP  YLGLPLGGNPK S FW PVIE+I  +L  W+ A++S GGR TLIQ+ L+ MP YFLSLFK+P+ VA  ++++ RDF W G
Subjt:  TELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEG

Query:  SQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKY---YNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIH
               H VNWD    PK  GGLG G   +RN ALL KW+WR+  E  +LW  +I++ Y    N  DV +    I R   + PW+ I     + +    
Subjt:  SQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKY---YNSEDVRHWPFPIQRGYFKSPWRFICTTIDQITSRIH

Query:  RIIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNSTES-AWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RNMNDSWSWPLEASKLFSV
         ++GNG    FW D W     L   +PRL R+ T+ NA ++ +   T   +WN + RR+L++ EI +   L   L  + +  ++ D  SW L  S LF+V
Subjt:  RIIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNSTES-AWNLSPRRHLNEFEIIEWANLSYLLSPIRL-RNMNDSWSWPLEASKLFSV

Query:  KSLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEA
        KS  + L   S +        +W    P K+K F+W ++   +NT D LQ R P+ +LSP  C +C  + E   HLF+ CS     W+ + ++
Subjt:  KSLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEA

M5VS59 Reverse transcriptase domain-containing protein (Fragment)2.4e-25942.83Show/hide
Query:  ETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGDRW
        ETK  ++D  L+  +W S    W    S+G SGGI ++W+    ++ +++ G FSVSI +    G  +WL+ IYGP R+  R  FW+EL DL G  GD+W
Subjt:  ETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGDRW

Query:  ILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDIN
         LGGDFNV R+S EKS+   +T+SMR FN +I    L +  L N  FTWS+  +N     +DRFL+S    + F       L RITSDH P+ L    + 
Subjt:  ILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDIN

Query:  WGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSH----RLALSQLPSLITQLKQLDNLEDNGLTLSQKESRRILREQ
        WGP PFRFEN WL    F   +  WW ++   GW G+ FM +LK LK++L+ W+K         L +  + +  L Q +  E  GL    +  R  L  +
Subjt:  WGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSH----RLALSQLPSLITQLKQLDNLEDNGLTLSQKESRRILREQ

Query:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEV-LSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKAL
        I DL  +E ++W+QR K+KW +EGD N+KFFHR+    +++N I ++ +   GV  V AN IE+E + F++ L++ + +  +    ++WCPIS+ ++  L
Subjt:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEV-LSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKAL

Query:  EVAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNR
        E     EEV  A+   G  KSPGPDG++  FF+  W  +K DLM ++QDF+ +GI+N   NET+ICLIPK+ +S  V D RPISL++  YK+I++VL++R
Subjt:  EVAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNR

Query:  LKHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKI
        L+ VL +TI+ +Q AFV  RQILDA L+ANE++++ +   ++G+V K+D EKA+D V+W+F+D VL  KGFG  WR WI GC+ SVN+SI+INGKPRGK 
Subjt:  LKHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKI

Query:  IPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHID
          SRG+RQGDPLSPFLF LVSD LSR++  +  +  +     G     ++HLQFADDT+       +    L +++ +F   SG+ +N AKS ILGI+  
Subjt:  IPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHID

Query:  DTELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWE
           L +MA  +GC+ G WP  YLGLPLGGNP+   FW PV++K++ +L  WK A +SKGGR TLIQA LSS+P+Y++SLFK+P  VA  +++L+R+F WE
Subjt:  DTELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWE

Query:  GSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPF-PIQRGYFKSPWRFICTTIDQITSRIHR
        G +     H V W++    K  GGLGIG+ + RN AL AKW+WRF  E +SLW  +I +KY    D   W    I +   ++PWR I    +        
Subjt:  GSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPF-PIQRGYFKSPWRFICTTIDQITSRIHR

Query:  IIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--TESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRNMN-DSWSWPLEASKLFSV
         +GNG    FW+D WL   IL +LFPRL  L+   N  +A   N+      W+   RR+L+E EI E   L  +L  +RL     D  SW +E    FS 
Subjt:  IIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--TESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRNMN-DSWSWPLEASKLFSV

Query:  KSLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEAFG
        KS    LL  +       +S IWK   P KI+ F+W  + G INT D +QRR P   LSPSWCV+C  N EN  HLF+ CS++ + W  ML A G
Subjt:  KSLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEAFG

M5WPQ5 Reverse transcriptase domain-containing protein3.4e-25342.28Show/hide
Query:  ETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGDRW
        ETK   +D  L+  +W S    W    S+G SGGI ++W+    ++ +++ G FSVSI +    G  +WL+ IYGP R+  R  FW+EL DL G  GD+W
Subjt:  ETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLGGDRW

Query:  ILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDIN
         LGGDFNV R+S EKS+   +T+SMR FN +I    L +  L N  FTWS+  +N     +DRFL+S      F       L RITSDH P+ L    + 
Subjt:  ILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDIN

Query:  WGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSH----RLALSQLPSLITQLKQLDNLEDNGLTLSQKESRRILREQ
        WGP PFRFEN WL    F+  +  WW ++   GW G+ FM +LK LK++L+ W+K         L +  + +  L Q +  E  GL    +  R  L  +
Subjt:  WGPGPFRFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSH----RLALSQLPSLITQLKQLDNLEDNGLTLSQKESRRILREQ

Query:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEV-LSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKAL
        I DL  +E ++W+QR K+KW +EGD N+KFFHR+    +++N I ++ +   GV  V AN IE+E + F++ L++++ +  +    ++WCPIS+ ++  L
Subjt:  IEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEV-LSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKAL

Query:  EVAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNR
        E     EEV  A+   G  KSPGPDG++  FF+  W  +K DLM ++QDF+ +GI+N   NET+ICLIPK+ +S  V DYRPISL++  YK+I++VL++R
Subjt:  EVAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNR

Query:  LKHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKI
        L+ VL +TI+ +Q AFV  RQILDA L+ANE++++ +   ++G+V K+D EKA+D V+W+F+D V+  KGFG  WR WI GC+ SVN+SI+INGKPRGK 
Subjt:  LKHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKI

Query:  IPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHID
          SRG+RQGDPLSPFLF LV +                          ++HLQFADDT+       +    L +++ +F   SG+ +N AKS ILGI+  
Subjt:  IPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHID

Query:  DTELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWE
           L +MA  +GC+ G WP  YLGLPLGGNP+   FW PV+EK++ +L  WK A +SKGGR TLIQA LSS+P+Y++SLFK+P  VA  +++L+R+F WE
Subjt:  DTELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWE

Query:  GSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPFP-IQRGYFKSPWRFICTTIDQITSRIHR
        G +     H V W++    K  GGLGIG+ + RN AL AKW+WRF  E +SLW  +I +KY    D   W    I +   ++PWR I    +        
Subjt:  GSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPFP-IQRGYFKSPWRFICTTIDQITSRIHR

Query:  IIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--TESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRNMN-DSWSWPLEASKLFSV
         +GNG    FW+D WL   IL +LFPRL  L+   N  +A   N+      W+   RR+L+E E+ E   L  +L  +RL     D  SW +E    FS 
Subjt:  IIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNS--TESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRNMN-DSWSWPLEASKLFSV

Query:  KSLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEAFG
        KS    LL  +       +S IWK   P KI+ F+W  + G INT D +QRR P   LSPSWCV+C  N EN  HLF+ CS++ + W  ML A G
Subjt:  KSLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEAFG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein8.3e-4722.96Show/hide
Query:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGA----SGGILIMWSEPDF---TIKETIQGLFSVSIHVFMADGF----SFWLTNIYGPSRREFRVDFWK
        +QET  +  D H +K        GW  +         +G  +++  + DF    IK   +G      H  M  G        + NIY P+    R  F K
Subjt:  VQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGA----SGGILIMWSEPDF---TIKETIQGLFSVSIHVFMADGF----SFWLTNIYGPSRREFRVDFWK

Query:  E-LHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNI----PLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRL
        + L DL        ++ GDFN      ++S  + + +  +  N  +    L++I      ++  +T+ S   + Y S ID  + SK  L+K   + +  +
Subjt:  E-LHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNI----PLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRL

Query:  DRITSDHYPLSLTFGDIN--------WGPGPFRFENSWL------QIASFREVLDNWWNQNSFQG-WPGHAFMMKLKGLKNELRNWNKSHRLALSQLPSL
            SDH  + L     N        W        + W+      +I  F E  +N     ++Q  W   AF    +G    L  + +      S++ +L
Subjt:  DRITSDHYPLSLTFGDIN--------WGPGPFRFENSWL------QIASFREVLDNWWNQNSFQG-WPGHAFMMKLKGLKNELRNWNKSHRLALSQLPSL

Query:  ITQLKQLDNLEDNGLTLSQKESRRILREQIEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRL
         +QLK+L+  E      S+++    +R +++++  Q+ +Q     +  + +  ++  +   R++  ++ KN I  + +  G       +I+    ++Y+ 
Subjt:  ITQLKQLDNLEDNGLTLSQKESRRILREQIEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRL

Query:  LF-TKDNHTRFLPTNVDWCP---ISEAQSKALEVAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLI
        L+  K  +   + T +D      +++ + ++L   I+  E+   +NSL + KSPGPDG+TAEF++     +   L+ + Q     GI+  +  E  I LI
Subjt:  LF-TKDNHTRFLPTNVDWCP---ISEAQSKALEVAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLI

Query:  PK-RLDSKSVVDYRPISLISCAYKIIARVLSNRLKHVLSSTIADNQMAFVANRQ-ILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVL
        PK   D+    ++RPISL++   KI+ ++L+NR++  +   I  +Q+ F+   Q   +     N +    +   K  V++ +D EKAFDK+   F+   L
Subjt:  PK-RLDSKSVVDYRPISLISCAYKIIARVLSNRLKHVLSSTIADNQMAFVANRQ-ILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVL

Query:  HAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDS
        +  G   ++ K IR        +II+NG+         G RQG PLSP LF +V + L+R +     +  I    +G     L+   FADD +++     
Subjt:  HAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDS

Query:  DALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELE-----DMAAKFGCKRGFWPSTYLGLPLGGNPKN--SVFWQPVIEKIQHKLHSWKYAFISKG
         +   L ++I+ F   SG  +N+ KS+   ++ ++ + E     ++      KR      YLG+ L  + K+     ++P++++I+   + WK    S  
Subjt:  DALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELE-----DMAAKFGCKRGFWPSTYLGLPLGGNPKN--SVFWQPVIEKIQHKLHSWKYAFISKG

Query:  GRHTLIQATLSSMPTYFLSL--FKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEED
        GR  +++  +     Y  +    KLP      L+K    F W   +       +           GG+ + +F+L   A + K  W +    D
Subjt:  GRHTLIQATLSSMPTYFLSL--FKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEED

P08548 LINE-1 reverse transcriptase homolog3.5e-4523.57Show/hide
Query:  LTNIYGPSRREFRVDFWKE-LHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNI-----PLQNGCFTWSSFGDNPYLSLIDR
        + NIY P+       F +E L D++ L     I+ GDFN      ++S  + +++ +   N  I    L +I     P +   +T+ S     Y S ID 
Subjt:  LTNIYGPSRREFRVDFWKE-LHDLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNI-----PLQNGCFTWSSFGDNPYLSLIDR

Query:  FLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGD--------INWGPGPFRFENSWLQIASFREVLDNWWNQNSFQG------WPGHAFMMKLKGLKNE
         L  K  L+KF    ++    I SDH+ + +   +          W       +++W+ I   ++ +  +  QN+ Q       W        L+G    
Subjt:  FLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGD--------INWGPGPFRFENSWLQIASFREVLDNWWNQNSFQG------WPGHAFMMKLKGLKNE

Query:  LRNWNKSHRLALSQLPSLITQLKQLDNLEDNGLTLSQKESRRILREQIEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGV
        L+ + K  +    ++ +L+  LKQL+  E +    S+++    +R ++ ++  +  IQ   + K  + ++ ++  K    +   ++ K+ I+ + +    
Subjt:  LRNWNKSHRLALSQLPSLITQLKQLDNLEDNGLTLSQKESRRILREQIEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGV

Query:  SLVSANDIEKEFVDFYRLLFT-KDNHTRFLPTNVDWCPISEAQSKALEV---AISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDF
             ++I+K   ++Y+ L++ K  + + +   ++ C +     K +E+    IS  E+ + + +L   KSPGPDG+T+EF++     +   L+++ Q+ 
Subjt:  SLVSANDIEKEFVDFYRLLFT-KDNHTRFLPTNVDWCPISEAQSKALEV---AISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDF

Query:  YNTGIINVALNETYICLIPK-RLDSKSVVDYRPISLISCAYKIIARVLSNRLKHVLSSTIADNQMAFVANRQ-ILDASLIANELIDDWQISSKRGVVLKV
           GI+     E  I LIPK   D     +YRPISL++   KI+ ++L+NR++  +   I  +Q+ F+   Q   +     N +    ++ +K  ++L +
Subjt:  YNTGIINVALNETYICLIPK-RLDSKSVVDYRPISLISCAYKIIARVLSNRLKHVLSSTIADNQMAFVANRQ-ILDASLIANELIDDWQISSKRGVVLKV

Query:  DLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFS
        D EKAFD +   F+   L   G    + K I    S    +II+NG          G RQG PLSP LF +V + L+  +    ++  I    +G+    
Subjt:  DLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFS

Query:  LNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEDMAAKFGCKRGFWPS--TYLGLPLGGNPKN--SVFWQPVIEKI
        L+   FADD +++     D+  KL EVI  +   SG  +N  KS +  I+ ++ + E    K        P    YLG+ L  + K+     ++ + ++I
Subjt:  LNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEDMAAKFGCKRGFWPS--TYLGLPLGGNPKN--SVFWQPVIEKI

Query:  QHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSL--FKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLL-------GGLGIGNFQLRNT
           ++ WK    S  GR  +++ ++     Y  +    K P    K L+K++  F W            N  K Q+ K L       GG+ + + +L   
Subjt:  QHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSL--FKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLL-------GGLGIGNFQLRNT

Query:  ALLAK--WIWRFLHEEDSLWRNL
        +++ K  W W    E D +W  +
Subjt:  ALLAK--WIWRFLHEEDSLWRNL

P0C2F6 Putative ribonuclease H protein At1g657502.1e-3428.98Show/hide
Query:  VIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLA
        ++E++  ++  W+   +S  GR TL +A LSSMP + +S   LP  +   LD+L R F W  +      H V W K   PK  GGLG+   +  N AL++
Subjt:  VIEKIQHKLHSWKYAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLA

Query:  KWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPFPIQRGYFKSPWRFICTTI-DQITSRIHRIIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARV
        K  WR L E++SLW  L++ K Y+  ++R   + I +G + S WR I   + D ++  +  I G+G    FW D W++G  L  L     R T       
Subjt:  KWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWPFPIQRGYFKSPWRFICTTI-DQITSRIHRIIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARV

Query:  AEVWNSTESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL---RNMNDSWSWPLEASKLFSVKS----LMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFL
         ++W      W+ +      + +     N    L  + L       D  SW       FSV+S    L VD +   N  + + ++ +WK   P+++K FL
Subjt:  AEVWNSTESAWNLSPRRHLNEFEIIEWANLSYLLSPIRL---RNMNDSWSWPLEASKLFSVKS----LMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFL

Query:  WELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYW
        W +   A+ T +   RR  H S S + C +C   +E+  H+   C      W
Subjt:  WELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYW

P11369 LINE-1 retrotransposable element ORF2 protein1.9e-4324.06Show/hide
Query:  VQETKTSSIDNHLIKSLWSSSHIGWSSL---DSVGASGGILIMWSEP-DF---TIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKE-LH
        +QET     D H ++        GW ++   + +    G+ I+ S+  DF    IK+  +G F +     + +  S  + NIY P+ R     F ++ L 
Subjt:  VQETKTSSIDNHLIKSLWSSSHIGWSSL---DSVGASGGILIMWSEP-DF---TIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKE-LH

Query:  DLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNI-----PLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRI
         L        I+ GDFN    S ++S  + + R      + +    L +I     P   G +T+ S     + S ID  +  K  LN++   ++  +  I
Subjt:  DLAGLGGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNI-----PLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRI

Query:  TSDHYPLSLTF-GDINWGPGPF--RFENSWLQIASFREVLDN------WWNQNSFQGWPGHAFMMK--LKGLKNELRNWNKSHRLALSQLPSLITQLKQL
         SDH+ L L F  +IN G   F  +  N+ L     +E +         +N+N    +P     MK  L+G    L    K    A     SL T LK L
Subjt:  TSDHYPLSLTF-GDINWGPGPF--RFENSWLQIASFREVLDN------WWNQNSFQGWPGHAFMMK--LKGLKNELRNWNKSHRLALSQLPSLITQLKQL

Query:  DNLEDNGLTLSQKESRRILREQIEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTK---
        +  E N    S+++    LR +I  +  +  IQ   + +  + ++ ++  K   R+    + K  I ++ +  G       +I+     FY+ L++    
Subjt:  DNLEDNGLTLSQKESRRILREQIEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTK---

Query:  --DNHTRFLPTNVDWCPISEAQSKALEVAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYN----TGIINVALNETYICLIPK
          D   +FL        +++ Q   L   IS +E+   +NSL + KSPGPDG++AEF++    T K+DL+ ++   ++     G +  +  E  I LIPK
Subjt:  --DNHTRFLPTNVDWCPISEAQSKALEVAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYN----TGIINVALNETYICLIPK

Query:  -RLDSKSVVDYRPISLISCAYKIIARVLSNRLKHVLSSTIADNQMAFVANRQ---ILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVL
         + D   + ++RPISL++   KI+ ++L+NR++  + + I  +Q+ F+   Q    +  S+     I+  ++  K  +++ +D EKAFDK+   F+  VL
Subjt:  -RLDSKSVVDYRPISLISCAYKIIARVLSNRLKHVLSSTIADNQMAFVANRQ---ILDASLIANELIDDWQISSKRGVVLKVDLEKAFDKVDWDFLDAVL

Query:  HAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDS
           G    +   I+   S    +I +NG+    I    G RQG PLSP+LF +V + L+R +     +  I    +G     ++ L  ADD +++ S   
Subjt:  HAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDTLLFSSFDS

Query:  DALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEDMAAKFGCKRGFWPSTYLGLPLGGNPKN--SVFWQPVIEKIQHKLHSWKYAFISKGGRHTL
        ++  +L  +IN F    G  +N  KS       +    +++              YLG+ L    K+     ++ + ++I+  L  WK    S  GR  +
Subjt:  DALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEDMAAKFGCKRGFWPSTYLGLPLGGNPKN--SVFWQPVIEKIQHKLHSWKYAFISKGGRHTL

Query:  IQATLSSMPTYFLSL--FKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIW
        ++  +     Y  +    K+P++    L+  +  F W   +       +   +T      GG+ + + +L   A++ K  W
Subjt:  IQATLSSMPTYFLSL--FKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIW

P14381 Transposon TX1 uncharacterized 149 kDa protein1.1e-5125.49Show/hide
Query:  GFSFWLTNIYGPSRREFRVDFWKELHDLAGL--GGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNG----CFTWSSFGD-NPY
        G ++ L N+Y P+    R  F++ L          +  I+GGDFN T  + +++  +    S  +  + IA + L+++  +       FT+    D +  
Subjt:  GFSFWLTNIYGPSRREFRVDFWKELHDLAGL--GGDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNG----CFTWSSFGD-NPY

Query:  LSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDINWGP--GPFRFENSWLQIASF-REVLDNWWNQNSFQG--------WPGHAFMMKLKGL
         S IDR  IS   +++   S  +RL    SDH  +SL        P    + F NS L+   F + V D W    +FQ         W      +KL   
Subjt:  LSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDINWGP--GPFRFENSWLQIASF-REVLDNWWNQNSFQG--------WPGHAFMMKLKGL

Query:  KNELRNWNKSHRLALSQLPSLITQLKQ-LDNLEDNGLTLSQKESRRILREQIEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLS
        +   ++ +      +  L   +  L+Q L   ED  L     E +  LR   +       +    R +++ L + D  S+FF+ +   +  +  IT + +
Subjt:  KNELRNWNKSHRLALSQLPSLITQLKQ-LDNLEDNGLTLSQKESRRILREQIEDLTAQEHIQWQQRCKLKWLKEGDENSKFFHRILAARKRKNSITEVLS

Query:  RGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCP--------ISEAQSKALEVAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQD
          G  L     I      FY+ LF+ D      P + D C         +SE + + LE  I+ +E+  AL  +  +KSPG DG T EFF+F W+T+  D
Subjt:  RGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCP--------ISEAQSKALEVAISEEEVFNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQD

Query:  LMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRLKHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKR
           ++ + +  G + ++     + L+PK+ D + + ++RP+SL+S  YKI+A+ +S RLK VL+  I  +Q   V  R I D   +  +L+   + +   
Subjt:  LMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRLKHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKR

Query:  GVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPV
           L +D EKAFD+VD  +L   L A  FG  +  +++   +S    + IN      +   RG+RQG PLS  L+ L  +    LL    + G ++  P 
Subjt:  GVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPV

Query:  GASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEDMAAKFGCKRGFWPS---TYLGLPLGGN--PKNSVFW
             S     +ADD +L +  D   L +  E   ++  AS   +N +KS   G+     +++ +   F  +   W S    YLG+ L     P +  F 
Subjt:  GASSFSLNHLQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEDMAAKFGCKRGFWPS---TYLGLPLGGN--PKNSVFW

Query:  QPVIEKIQHKLHSWK--YAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNT
        + + E +  +L  WK     +S  GR  +I   ++S   Y L       +    + + + DF W G       H V+   + LP   GG G+   + +  
Subjt:  QPVIEKIQHKLHSWK--YAFISKGGRHTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNT

Query:  ALLAKWIWRFLHEEDS
            + I R+L+ + S
Subjt:  ALLAKWIWRFLHEEDS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.9e-2626.3Show/hide
Query:  RSMRIFNQWIASYKLLNIPLQNGCFTWSSF-GDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDI-NWGPGPFRFENSWLQIASFRE
        R +  F   +    L++IP +   +TWS+   DNP +  +DR + + D  + F ++  +      SDH P  +   ++       FR+ +      +F  
Subjt:  RSMRIFNQWIASYKLLNIPLQNGCFTWSSF-GDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDI-NWGPGPFRFENSWLQIASFRE

Query:  VLDNWWNQNSFQGWPGHAFMM--------KLKGLKNELRNWNKSHRL--ALSQLPSLITQLKQLDNLEDNGLTLSQKESRRILREQIEDLTAQEHIQWQQ
         L   W +    G   H F +        K   L N     N  H+   AL  L S+ +QL  L N  D     S      + R++     A     ++Q
Subjt:  VLDNWWNQNSFQGWPGHAFMM--------KLKGLKNELRNWNKSHRL--ALSQLPSLITQLKQLDNLEDNGLTLSQKESRRILREQIEDLTAQEHIQWQQ

Query:  RCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNV----DWCPI--SEAQSKALEVAISEEEV
        + ++KWL++GD N++FFH+++ A + KN I  +     V + +   +++  V +Y  L   D+     P +V    D  P   ++  +  L    S++E+
Subjt:  RCKLKWLKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNV----DWCPI--SEAQSKALEVAISEEEV

Query:  FNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKII
          A+ ++  +K+PGPD +TAEFF  SW  +K   ++ +++F+ TG +    N T I LIPK      +  +RP+S  +  YKII
Subjt:  FNALNSLGSSKSPGPDGYTAEFFKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases3.3e-0635Show/hide
Query:  RLKHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGV----VLKVDLEKAFDKVDWDFLDAVLHAKGFGSVW
        RLK ++++ I   Q +F+  R   D  +   E +    +  K+GV    +LK+DLEKA+D++ WD+L+  L + GF  VW
Subjt:  RLKHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSKRGV----VLKVDLEKAFDKVDWDFLDAVLHAKGFGSVW

AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-2926.46Show/hide
Query:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHW
        ++PTY ++ F LP  V K +  ++ DF+W   Q   GMH   WD     K  GG+G  + +  N ALL K +WR L   +SL   +  ++Y++  D  + 
Subjt:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHW

Query:  PFPIQRGYFKSPWRFICTTIDQITSRIHRIIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNSTESAWNLSPRRHLNEFEIIEWANLSY
        P   +  +    W+ I  + + +      ++GNG     W+  WL+    S    R+ R+     A V+ +   ++           +  E++       
Subjt:  PFPIQRGYFKSPWRFICTTIDQITSRIHRIIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNSTESAWNLSPRRHLNEFEIIEWANLSY

Query:  LLSPIRL--RNMNDSWSWPLEASKLFSVKS---LMVDLLGG-------SNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSW
        L+  +R   R + DS++W   +S  ++VKS   ++  ++         S  +LN +Y  IWK     KI+ FLW+    ++  A  L  R  H S   S 
Subjt:  LLSPIRL--RNMNDSWSWPLEASKLFSVKS---LMVDLLGG-------SNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSW

Query:  CVMCSSNMENTGHLFVTCSFATKYW
        C+ C S  E   HL   C+FA   W
Subjt:  CVMCSSNMENTGHLFVTCSFATKYW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein9.8e-1127.4Show/hide
Query:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKL-LGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRH
        ++P Y +S F+L   + K L   + +F+W   +    +  V W K    K   GGLG  +    N ALLAK  +R +H+  +L   L+ ++Y+    +  
Subjt:  SMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKL-LGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRH

Query:  WPFPIQRGYFKSPWRFICTTIDQITSRIHRIIGNGCSTFFWKDAWL
             +  Y    WR I    + ++  + R IG+G  T  W D W+
Subjt:  WPFPIQRGYFKSPWRFICTTIDQITSRIHRIIGNGCSTFFWKDAWL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.4e-1252.94Show/hide
Query:  IINGKPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDT
        IING P+G + PSRG+RQGDPLSP+LFIL ++ LS L   +   G++    V  +S  +NHL FADDT
Subjt:  IINGKPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNHLQFADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAACAGCCACCATTTAATGAAGATTCAATCTCGGTTGATTTTCCTAATTTAACAGATTCCACAAAATCCTCCAAAGGAAAAGAGCCATTAATAATGGAAGAGCCATC
GGTGGAATTAAAAAATAATATATCCCTTCCTGTGGGCCCTACGAATTTGAAAATTGGTCAAAAGGGCTCAACTTCTGGTCTTAGCCCAAAATTGGTTATTGTAGGCTCTG
ATACTGAAGCTTATTTATCCAGCCCATCTCCAAACAATTCACCTCATAAGATTAACTTGGACCCATCCCCCACACACGATTTTGATCTCACCATTTTTAACCCTGACCAA
TATCATCTTCCTTTAGCCATTGTGCCGCCAGAATCAACAAATGCTGGCCAATCAGCCATAAACAACAAAGTGATTGCTCCAGCCAATGTTTTAACCAACCATCCCCAACC
AGAAACACCTCCCCAACCAGAAACACCTCCCATCAATCATCAGCCAATGTTTGCCCTCCCAGAATATCTCCGTCATATAGCTCCAATTCTTAGTGAGCATGGATTGTGTA
TCATGGCCATCCCTCAATTTCTACCACCTAAAAGGAAGACAGTTACTACTACCGGGAGGAAAAAAAAAAAACTCCAAAGAGAGCTTGATAACCTAAAAACTACAGTGCAT
TATGATAAAACTGCTTCATTGGCCTTAACGGAGGGAGTCCAGGAAACAAAAACGTCTTCTATAGACAATCATCTGATTAAATCCTTATGGAGTTCATCTCATATTGGTTG
GTCTTCTCTCGATTCAGTTGGAGCATCAGGGGGCATCCTTATAATGTGGAGTGAGCCAGACTTCACTATCAAAGAGACAATTCAAGGTCTTTTCTCTGTCTCTATTCATG
TTTTTATGGCTGATGGTTTTTCTTTTTGGCTTACAAATATTTATGGTCCTTCTCGACGAGAATTTCGTGTTGACTTTTGGAAAGAATTACATGATCTGGCGGGTCTGGGA
GGTGATCGTTGGATCCTTGGAGGAGATTTTAATGTTACCCGATGGTCATGGGAGAAATCTCATGGACGTCACATCACTCGGAGTATGCGTATTTTCAACCAATGGATTGC
ATCTTACAAGCTTCTGAATATTCCATTACAGAATGGTTGTTTCACCTGGTCCAGTTTTGGTGACAATCCGTATCTCTCCTTAATAGACAGATTTTTGATTTCTAAAGATT
GTCTGAATAAATTCGGGGCTTCTCATCTTCTTCGGCTTGACAGAATTACTTCAGATCACTACCCTCTTTCCCTTACTTTTGGTGATATTAATTGGGGTCCTGGGCCTTTC
CGATTTGAAAATTCCTGGCTGCAAATTGCATCATTCCGTGAGGTGTTGGATAATTGGTGGAATCAAAATTCTTTCCAAGGTTGGCCGGGTCATGCTTTTATGATGAAATT
AAAAGGGCTGAAAAATGAGCTTCGAAATTGGAATAAATCCCACAGGTTGGCATTATCACAACTACCCTCTCTGATTACTCAGTTAAAACAATTAGATAACCTTGAGGACA
ATGGTCTAACTTTGTCTCAAAAGGAATCTAGGCGTATTCTTCGTGAACAGATTGAGGATTTAACCGCCCAGGAGCATATCCAATGGCAACAACGTTGTAAACTTAAATGG
CTTAAAGAGGGTGATGAAAATAGTAAATTCTTCCATCGCATCTTGGCAGCCCGTAAAAGGAAGAATTCGATTACTGAGGTGTTATCTCGAGGTGGGGTCAGTTTGGTTTC
AGCCAATGATATCGAAAAGGAATTCGTTGATTTTTATCGGCTTTTGTTCACCAAAGATAATCATACTCGTTTTCTGCCAACCAATGTTGATTGGTGCCCAATTAGTGAAG
CTCAATCAAAAGCCTTGGAAGTTGCTATTTCTGAAGAAGAAGTTTTTAATGCTCTGAACTCTCTTGGTTCTAGTAAGTCTCCAGGTCCGGATGGTTATACAGCTGAATTC
TTTAAATTCTCTTGGAACACTATCAAACAGGATCTAATGTCTATGATTCAAGATTTTTATAACACTGGAATTATTAACGTGGCTCTAAATGAAACTTATATTTGCTTGAT
TCCAAAGAGGTTAGACTCCAAATCTGTTGTTGATTATCGTCCCATTAGCCTAATCTCATGTGCTTATAAGATCATTGCTCGAGTTTTATCTAATCGTTTGAAGCATGTAT
TGTCGTCTACCATAGCAGATAATCAAATGGCTTTTGTTGCTAACAGACAGATTTTGGATGCTTCTTTAATTGCTAATGAATTAATAGATGATTGGCAAATTTCCTCGAAG
AGAGGGGTGGTTCTTAAAGTAGATTTGGAAAAGGCATTTGATAAAGTAGACTGGGATTTTCTGGATGCAGTCCTACATGCAAAGGGCTTTGGTTCAGTTTGGAGAAAATG
GATTCGTGGTTGTATCTCTAGTGTGAATTATTCAATCATCATTAACGGGAAACCTAGAGGTAAAATTATCCCTTCTCGTGGCATTCGTCAAGGTGATCCTCTGTCTCCTT
TCTTATTTATTCTGGTTTCAGATTGTTTGAGTAGACTCCTTTCCCACAGCGCTAGTTTGGGTAAAATTATGGCTCATCCAGTTGGTGCTTCATCTTTTAGTCTGAATCAT
TTACAATTTGCAGACGATACACTTTTATTCTCTTCTTTTGATTCAGATGCTTTGAACAAGCTTTTTGAAGTTATCAATATATTTGAATTGGCTTCTGGCCTAAATGTCAA
CCTTGCCAAGAGTGAAATCTTAGGGATCCATATCGATGATACAGAGTTGGAAGATATGGCTGCTAAATTTGGTTGTAAGCGTGGTTTTTGGCCTAGCACTTATCTTGGAC
TTCCTTTGGGCGGTAACCCGAAAAACTCTGTTTTCTGGCAACCAGTTATTGAGAAGATTCAGCATAAATTACATAGTTGGAAATATGCCTTTATATCCAAAGGAGGAAGG
CATACACTCATCCAAGCCACTCTTTCGAGTATGCCGACGTATTTTCTGTCTTTGTTTAAACTTCCAAGTAAGGTTGCTAAATCTCTTGACAAGCTAGTACGAGATTTTTT
CTGGGAAGGTTCTCAAGGGGATGGTGGTATGCATAATGTTAATTGGGATAAGACTCAGCTTCCGAAATTACTGGGAGGTCTTGGCATTGGCAATTTTCAGCTTCGAAATA
CAGCCTTGTTAGCTAAATGGATTTGGAGGTTTTTGCACGAAGAAGATTCTCTTTGGCGTAATCTCATTATTGCTAAATATTACAACTCGGAGGATGTTAGACATTGGCCT
TTTCCCATTCAAAGGGGATATTTCAAATCTCCTTGGCGCTTTATTTGTACTACTATCGATCAAATTACTAGTCGTATTCATCGAATTATTGGTAATGGTTGTAGCACATT
TTTTTGGAAGGATGCATGGCTGAATGGAGTGATTCTCTCAAATCTCTTCCCTCGCCTTTATCGGTTAACTACCAATCCAAACGCCAGGGTTGCAGAAGTATGGAACTCTA
CAGAATCAGCATGGAATCTGAGTCCTCGTCGTCACCTTAATGAGTTTGAGATTATTGAATGGGCAAATTTATCCTATCTTCTGTCTCCTATTCGACTTCGGAATATGAAT
GACTCTTGGTCTTGGCCTCTTGAAGCATCCAAATTATTCTCTGTTAAATCCTTGATGGTTGATCTTTTGGGTGGTTCTAATACTACTTTGAATAATTTATATTCGGTGAT
ATGGAAAGATAATTATCCTAAAAAGATAAAAATCTTTCTATGGGAGCTTAGTCTTGGGGCTATCAATACAGCGGATCGTCTTCAACGTAGAATGCCTCATTTTTCCCTTT
CGCCATCTTGGTGTGTTATGTGCTCATCAAATATGGAGAATACGGGTCATCTATTTGTCACTTGTTCTTTTGCTACCAAGTATTGGAATTTGATGCTTGAAGCTTTCGGA
TAG
mRNA sequenceShow/hide mRNA sequence
ATGGAACAGCCACCATTTAATGAAGATTCAATCTCGGTTGATTTTCCTAATTTAACAGATTCCACAAAATCCTCCAAAGGAAAAGAGCCATTAATAATGGAAGAGCCATC
GGTGGAATTAAAAAATAATATATCCCTTCCTGTGGGCCCTACGAATTTGAAAATTGGTCAAAAGGGCTCAACTTCTGGTCTTAGCCCAAAATTGGTTATTGTAGGCTCTG
ATACTGAAGCTTATTTATCCAGCCCATCTCCAAACAATTCACCTCATAAGATTAACTTGGACCCATCCCCCACACACGATTTTGATCTCACCATTTTTAACCCTGACCAA
TATCATCTTCCTTTAGCCATTGTGCCGCCAGAATCAACAAATGCTGGCCAATCAGCCATAAACAACAAAGTGATTGCTCCAGCCAATGTTTTAACCAACCATCCCCAACC
AGAAACACCTCCCCAACCAGAAACACCTCCCATCAATCATCAGCCAATGTTTGCCCTCCCAGAATATCTCCGTCATATAGCTCCAATTCTTAGTGAGCATGGATTGTGTA
TCATGGCCATCCCTCAATTTCTACCACCTAAAAGGAAGACAGTTACTACTACCGGGAGGAAAAAAAAAAAACTCCAAAGAGAGCTTGATAACCTAAAAACTACAGTGCAT
TATGATAAAACTGCTTCATTGGCCTTAACGGAGGGAGTCCAGGAAACAAAAACGTCTTCTATAGACAATCATCTGATTAAATCCTTATGGAGTTCATCTCATATTGGTTG
GTCTTCTCTCGATTCAGTTGGAGCATCAGGGGGCATCCTTATAATGTGGAGTGAGCCAGACTTCACTATCAAAGAGACAATTCAAGGTCTTTTCTCTGTCTCTATTCATG
TTTTTATGGCTGATGGTTTTTCTTTTTGGCTTACAAATATTTATGGTCCTTCTCGACGAGAATTTCGTGTTGACTTTTGGAAAGAATTACATGATCTGGCGGGTCTGGGA
GGTGATCGTTGGATCCTTGGAGGAGATTTTAATGTTACCCGATGGTCATGGGAGAAATCTCATGGACGTCACATCACTCGGAGTATGCGTATTTTCAACCAATGGATTGC
ATCTTACAAGCTTCTGAATATTCCATTACAGAATGGTTGTTTCACCTGGTCCAGTTTTGGTGACAATCCGTATCTCTCCTTAATAGACAGATTTTTGATTTCTAAAGATT
GTCTGAATAAATTCGGGGCTTCTCATCTTCTTCGGCTTGACAGAATTACTTCAGATCACTACCCTCTTTCCCTTACTTTTGGTGATATTAATTGGGGTCCTGGGCCTTTC
CGATTTGAAAATTCCTGGCTGCAAATTGCATCATTCCGTGAGGTGTTGGATAATTGGTGGAATCAAAATTCTTTCCAAGGTTGGCCGGGTCATGCTTTTATGATGAAATT
AAAAGGGCTGAAAAATGAGCTTCGAAATTGGAATAAATCCCACAGGTTGGCATTATCACAACTACCCTCTCTGATTACTCAGTTAAAACAATTAGATAACCTTGAGGACA
ATGGTCTAACTTTGTCTCAAAAGGAATCTAGGCGTATTCTTCGTGAACAGATTGAGGATTTAACCGCCCAGGAGCATATCCAATGGCAACAACGTTGTAAACTTAAATGG
CTTAAAGAGGGTGATGAAAATAGTAAATTCTTCCATCGCATCTTGGCAGCCCGTAAAAGGAAGAATTCGATTACTGAGGTGTTATCTCGAGGTGGGGTCAGTTTGGTTTC
AGCCAATGATATCGAAAAGGAATTCGTTGATTTTTATCGGCTTTTGTTCACCAAAGATAATCATACTCGTTTTCTGCCAACCAATGTTGATTGGTGCCCAATTAGTGAAG
CTCAATCAAAAGCCTTGGAAGTTGCTATTTCTGAAGAAGAAGTTTTTAATGCTCTGAACTCTCTTGGTTCTAGTAAGTCTCCAGGTCCGGATGGTTATACAGCTGAATTC
TTTAAATTCTCTTGGAACACTATCAAACAGGATCTAATGTCTATGATTCAAGATTTTTATAACACTGGAATTATTAACGTGGCTCTAAATGAAACTTATATTTGCTTGAT
TCCAAAGAGGTTAGACTCCAAATCTGTTGTTGATTATCGTCCCATTAGCCTAATCTCATGTGCTTATAAGATCATTGCTCGAGTTTTATCTAATCGTTTGAAGCATGTAT
TGTCGTCTACCATAGCAGATAATCAAATGGCTTTTGTTGCTAACAGACAGATTTTGGATGCTTCTTTAATTGCTAATGAATTAATAGATGATTGGCAAATTTCCTCGAAG
AGAGGGGTGGTTCTTAAAGTAGATTTGGAAAAGGCATTTGATAAAGTAGACTGGGATTTTCTGGATGCAGTCCTACATGCAAAGGGCTTTGGTTCAGTTTGGAGAAAATG
GATTCGTGGTTGTATCTCTAGTGTGAATTATTCAATCATCATTAACGGGAAACCTAGAGGTAAAATTATCCCTTCTCGTGGCATTCGTCAAGGTGATCCTCTGTCTCCTT
TCTTATTTATTCTGGTTTCAGATTGTTTGAGTAGACTCCTTTCCCACAGCGCTAGTTTGGGTAAAATTATGGCTCATCCAGTTGGTGCTTCATCTTTTAGTCTGAATCAT
TTACAATTTGCAGACGATACACTTTTATTCTCTTCTTTTGATTCAGATGCTTTGAACAAGCTTTTTGAAGTTATCAATATATTTGAATTGGCTTCTGGCCTAAATGTCAA
CCTTGCCAAGAGTGAAATCTTAGGGATCCATATCGATGATACAGAGTTGGAAGATATGGCTGCTAAATTTGGTTGTAAGCGTGGTTTTTGGCCTAGCACTTATCTTGGAC
TTCCTTTGGGCGGTAACCCGAAAAACTCTGTTTTCTGGCAACCAGTTATTGAGAAGATTCAGCATAAATTACATAGTTGGAAATATGCCTTTATATCCAAAGGAGGAAGG
CATACACTCATCCAAGCCACTCTTTCGAGTATGCCGACGTATTTTCTGTCTTTGTTTAAACTTCCAAGTAAGGTTGCTAAATCTCTTGACAAGCTAGTACGAGATTTTTT
CTGGGAAGGTTCTCAAGGGGATGGTGGTATGCATAATGTTAATTGGGATAAGACTCAGCTTCCGAAATTACTGGGAGGTCTTGGCATTGGCAATTTTCAGCTTCGAAATA
CAGCCTTGTTAGCTAAATGGATTTGGAGGTTTTTGCACGAAGAAGATTCTCTTTGGCGTAATCTCATTATTGCTAAATATTACAACTCGGAGGATGTTAGACATTGGCCT
TTTCCCATTCAAAGGGGATATTTCAAATCTCCTTGGCGCTTTATTTGTACTACTATCGATCAAATTACTAGTCGTATTCATCGAATTATTGGTAATGGTTGTAGCACATT
TTTTTGGAAGGATGCATGGCTGAATGGAGTGATTCTCTCAAATCTCTTCCCTCGCCTTTATCGGTTAACTACCAATCCAAACGCCAGGGTTGCAGAAGTATGGAACTCTA
CAGAATCAGCATGGAATCTGAGTCCTCGTCGTCACCTTAATGAGTTTGAGATTATTGAATGGGCAAATTTATCCTATCTTCTGTCTCCTATTCGACTTCGGAATATGAAT
GACTCTTGGTCTTGGCCTCTTGAAGCATCCAAATTATTCTCTGTTAAATCCTTGATGGTTGATCTTTTGGGTGGTTCTAATACTACTTTGAATAATTTATATTCGGTGAT
ATGGAAAGATAATTATCCTAAAAAGATAAAAATCTTTCTATGGGAGCTTAGTCTTGGGGCTATCAATACAGCGGATCGTCTTCAACGTAGAATGCCTCATTTTTCCCTTT
CGCCATCTTGGTGTGTTATGTGCTCATCAAATATGGAGAATACGGGTCATCTATTTGTCACTTGTTCTTTTGCTACCAAGTATTGGAATTTGATGCTTGAAGCTTTCGGA
TAG
Protein sequenceShow/hide protein sequence
MEQPPFNEDSISVDFPNLTDSTKSSKGKEPLIMEEPSVELKNNISLPVGPTNLKIGQKGSTSGLSPKLVIVGSDTEAYLSSPSPNNSPHKINLDPSPTHDFDLTIFNPDQ
YHLPLAIVPPESTNAGQSAINNKVIAPANVLTNHPQPETPPQPETPPINHQPMFALPEYLRHIAPILSEHGLCIMAIPQFLPPKRKTVTTTGRKKKKLQRELDNLKTTVH
YDKTASLALTEGVQETKTSSIDNHLIKSLWSSSHIGWSSLDSVGASGGILIMWSEPDFTIKETIQGLFSVSIHVFMADGFSFWLTNIYGPSRREFRVDFWKELHDLAGLG
GDRWILGGDFNVTRWSWEKSHGRHITRSMRIFNQWIASYKLLNIPLQNGCFTWSSFGDNPYLSLIDRFLISKDCLNKFGASHLLRLDRITSDHYPLSLTFGDINWGPGPF
RFENSWLQIASFREVLDNWWNQNSFQGWPGHAFMMKLKGLKNELRNWNKSHRLALSQLPSLITQLKQLDNLEDNGLTLSQKESRRILREQIEDLTAQEHIQWQQRCKLKW
LKEGDENSKFFHRILAARKRKNSITEVLSRGGVSLVSANDIEKEFVDFYRLLFTKDNHTRFLPTNVDWCPISEAQSKALEVAISEEEVFNALNSLGSSKSPGPDGYTAEF
FKFSWNTIKQDLMSMIQDFYNTGIINVALNETYICLIPKRLDSKSVVDYRPISLISCAYKIIARVLSNRLKHVLSSTIADNQMAFVANRQILDASLIANELIDDWQISSK
RGVVLKVDLEKAFDKVDWDFLDAVLHAKGFGSVWRKWIRGCISSVNYSIIINGKPRGKIIPSRGIRQGDPLSPFLFILVSDCLSRLLSHSASLGKIMAHPVGASSFSLNH
LQFADDTLLFSSFDSDALNKLFEVINIFELASGLNVNLAKSEILGIHIDDTELEDMAAKFGCKRGFWPSTYLGLPLGGNPKNSVFWQPVIEKIQHKLHSWKYAFISKGGR
HTLIQATLSSMPTYFLSLFKLPSKVAKSLDKLVRDFFWEGSQGDGGMHNVNWDKTQLPKLLGGLGIGNFQLRNTALLAKWIWRFLHEEDSLWRNLIIAKYYNSEDVRHWP
FPIQRGYFKSPWRFICTTIDQITSRIHRIIGNGCSTFFWKDAWLNGVILSNLFPRLYRLTTNPNARVAEVWNSTESAWNLSPRRHLNEFEIIEWANLSYLLSPIRLRNMN
DSWSWPLEASKLFSVKSLMVDLLGGSNTTLNNLYSVIWKDNYPKKIKIFLWELSLGAINTADRLQRRMPHFSLSPSWCVMCSSNMENTGHLFVTCSFATKYWNLMLEAFG