; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0002125 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0002125
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:39572762..39576193
RNA-Seq ExpressionLag0002125
SyntenyLag0002125
Gene Ontology termsGO:0006281 - DNA repair (biological process)
GO:0050789 - regulation of biological process (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0004519 - endonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR020847 - AP endonuclease 1, binding site
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN68165.1 hypothetical protein VITISV_008538 [Vitis vinifera]3.7e-28244.39Show/hide
Query:  KLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQI
        K+ISWN RGLGSR KR +VKDFL  E PD+V++QETK    DRR V SVW++R+  W  L A G++GGIL++W    +   + VLG++S+S +    G  
Subjt:  KLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQI

Query:  QGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFLI
        Q W++ VYGP S++ RK F  EL+D+ GL    WCV GDFN++R   E+L   R T SM+  + FI   +LID P+    FTWS + E     RLDRFL 
Subjt:  QGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFLI

Query:  THQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKKK
        +++W   F +   + L R TSDH+P+ L     +WGP PFRFENMWL HP F++   +WW EF   GW G +FM KL+ LK +LKEWNK  FG+L E+KK
Subjt:  THQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKKK

Query:  TILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIGF
         IL  I   D  E++G + P  + +R   K  L E+ + ++    QK ++KW++EGD NS  FH+     +++ FI  LE+E G  L     I++EI+ +
Subjt:  TILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIGF

Query:  FTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIP
        F  LY+      + ++G +W+P+  + ++RLE PF E+EI++AI  +    +PGPDG T   F++ W+++K DLV VF EF R+GIIN+ TN ++I L+P
Subjt:  FTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIP

Query:  KKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLK
        KK  A KIS++RPISL+TSLYKI+AKVLA RL+ IL  TI   Q AFVQGR ILD +L+A+E+V+E +   E+GV+FK+DFEKAYD VSW+FLD +++ K
Subjt:  KKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLK

Query:  GFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVM
        GF    RKWIR CL++ +F+I++NG  +G V   RGLRQGDPLSPFLFTIV D  S       E+ V +G  VGR++T VS LQ+ADDTI FS+  E  +
Subjt:  GFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVM

Query:  LNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVL
        L    +L +    SGL +NL K++I GIN   + + R A    CK    PI  LG PLGGN ++ +FWDP++++  ++L+GW+K  LS GGR+TL QS L
Subjt:  LNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVL

Query:  NSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGWR
          +P Y+ SL K P  V  ++E+L RDF+W+G       +LV W+        GG+G+G +  RN+ALL KWLWR+  E   LW +VI SIYG  S GW 
Subjt:  NSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGWR

Query:  TMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVALI
                 R  W  I + +  F KFT+F V  G RIRFW D W   Q L V FP +  +   K   ++         +WN   RR L D E+    +L+
Subjt:  TMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVALI

Query:  GKLDNIQMGNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSP
          LD+I +   + D+ SW L  SGLF+ KS F A    S   ++     +W  + P K+K F+W VA++ +NT+D +Q +    ALSP
Subjt:  GKLDNIQMGNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSP

CAN68838.1 hypothetical protein VITISV_030956 [Vitis vinifera]1.5e-28045.19Show/hide
Query:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQ
        MK+ISWN RGLGS+ KR +VKDFL  E PD+V+ QETK +  DRR V SVW++R+  W +L A G++GGIL++W    +   + +LG++S+S + T +G 
Subjt:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQ

Query:  IQGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFL
           W++ VYGP +S+ RK    EL+D+AGL    WCV GDFN++R   E+L  +R T SM+ F+ FIS  +LID+P+    FTWS +       RLDRFL
Subjt:  IQGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFL

Query:  ITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKK
         +++W  TF +     L R TSDH+P+ L     +WGP PFRFENMWL HP F+++  +WW EF   GW G +FM KL+ +K +LK WNK  FG L+++K
Subjt:  ITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKK

Query:  KTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIG
        + IL  +   D  E++G +    +A+R   K  L E+ + ++    QK ++KW++EGD NS FFH+     +++ FI  LE+E G+ ++    I++EI+ 
Subjt:  KTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIG

Query:  FFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLI
        +F  LY+      + ++G +W+P+  + + RLE PF E+EI +AI  +   K+PGPDG T   F++ W ++K DLV+VF EF R+GIIN+ TN ++I L+
Subjt:  FFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLI

Query:  PKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKL
        PKK  + +ISDFRPISL+TSLYKI+AKVLA R++++L  TI   Q AFVQGR ILD +L+A+E+V+E R   E+GV+FK+DFEKAYD VSW+FLD ++++
Subjt:  PKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKL

Query:  KGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESV
        KGFG  WRKW+RGCL++ +F++++NG  +G V ASRGLRQGDPLSPFLFTIV D LSR      E+ VL+G  VGR++T VS LQ+ADDTI FS+  E  
Subjt:  KGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESV

Query:  MLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSV
        M+    +L++    SGL +NL K++I GIN     ++R A    CK    PI  LG PLGGN +T  FWDP++++   +L+GW+K  LS GGR+TL QS 
Subjt:  MLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSV

Query:  LNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGW
        L  +P Y+ SL K P  V  K+E++ RDF+W+G       +LV W+    P S GG+G G +  RN ALL KWLWR+  E   LW +VI SIYG  S GW
Subjt:  LNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGW

Query:  RTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVAL
            I     R  W  I   Y  F KFT+F V  G RIRFW D W   QPL V +P +  +   K A ++         +WN   RR L D E+     L
Subjt:  RTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVAL

Query:  IGKLDNIQMGNEM-DRISWKLEGS--GLFSTKSLFRAAVG
        +   D + + + + D+ SW L  S    F  K + R  VG
Subjt:  IGKLDNIQMGNEM-DRISWKLEGS--GLFSTKSLFRAAVG

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.5e-28845.14Show/hide
Query:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQ
        MK++SWN RGLGS+ KR +V+ FLS +NPD+V+LQETK +  DRR V SVW  + V W +L A G++GGI+++W  + +E  + VLG++S++ +     +
Subjt:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQ

Query:  IQGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFL
           W+T VYGP +   RK F  EL D+ GL    WCV GDFN++R + E+L  TR T +MR F+ FI    LID P+ +  FTWS +       RLDRFL
Subjt:  IQGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFL

Query:  ITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKK
         + +W   F +   + L R TSDH P+ L    ++WGP PFRFENMWL HP+F++    WW E T  GW G +FM KLK +K +LKEWN   FG+L E+K
Subjt:  ITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKK

Query:  KTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIG
        K IL  +  +D  E++G++    + ER   +  L ++ + ++ +  QK ++KW++EGD NS FFHR     +S+ FI +L SE GE L+   DI +EI+ 
Subjt:  KTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIG

Query:  FFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLI
        FF NLYSK     + ++G +W P+  +    L+ PF E+E+ RA+  L   K+PGPDG T   ++  W+++K DL+ VF EF  NG+IN+ TN T+I L+
Subjt:  FFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLI

Query:  PKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKL
        PKK  + KISD+RPISLVTSLYKI+AKVL+ RL+ +L  TISD Q AFV+GR ILD +L+A+EVV+E R   E+G++FK+DFEKAYD V W FLD +L+ 
Subjt:  PKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKL

Query:  KGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESV
        KGF   WR WIRGCL++S+F+I++NG  +G V ASRGLRQGDPLSPFLFT+V D LSR      E  + +G  VGRD+T VS+LQ+ADDTI FS      
Subjt:  KGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESV

Query:  MLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSV
        + N   IL++  + SGL +NL K++I GINT  E ++  A+ F C+V   P+  LG PLGGN +T  FWDP+V++   +L+GWKK  LS GGR+TL QS 
Subjt:  MLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSV

Query:  LNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGW
        L+ +P Y+ SL K P  +  K+EK+ R+F+W+G       +LV+WE  + P   GG+G G +  RN ALL KWLWRF  E+ GLW +VIGSIYG    GW
Subjt:  LNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGW

Query:  RTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDEN-NQTWNLGLRRGLFDRELSSWVAL
           ++     R  W  I + +  F  F +  V  G RIRFW D W   Q L   F ++Y +  +K  +V+     +    WNL  RR L D E+     L
Subjt:  RTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDEN-NQTWNLGLRRGLFDRELSSWVAL

Query:  IGKLDNIQMGNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSPR
        +  L +++    + D  +W L  SGLF+ KS F A    S  I    A  +W  K P KVK   W VA+  +NT+DK+Q +    +L P+
Subjt:  IGKLDNIQMGNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSPR

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.5e-29144.01Show/hide
Query:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQ
        MK+ISWN RGLGS+ KR +VKDFL  E PD+V+ QETK +  DRR V SVW++R+  W +L A G++GGIL++W    +   + +LG++S+S + T +G 
Subjt:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQ

Query:  IQGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFL
           W++ VYGP +S+ RK    EL+D+AGL    WCV GDFN++R   E+L  +R T SM+ F+ FIS  +LID+P+    FTWS +       RLDRFL
Subjt:  IQGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFL

Query:  ITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKK
         +++W  TF +     L R TSDH+P+ L     +WGP PFRFENMWL HP F+++  +WW EF   GW G +FM KL+ +K +LK WNK  FG L+++K
Subjt:  ITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKK

Query:  KTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIG
        + IL  +   D  E++G +    +A+R   K  L E+ + ++    QK ++KW++EGD NS FFH+     +++ FI  LE+E G+ ++    I++EI+ 
Subjt:  KTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIG

Query:  FFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLI
        +F  LY+      + ++G +W+P+  + + RLE PF E+EI +AI  +   K+PGPDG T   F++ W ++K DLV+VF EF R+GIIN+ TN ++I L+
Subjt:  FFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLI

Query:  PKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKL
        PKK  + +ISDFRPISL+TSLYKI+AKVLA R++++L  TI   Q AFVQGR ILD +L+A+E+V+E R   E+GV+FK+DFEKAYD VSW+FLD ++++
Subjt:  PKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKL

Query:  KGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESV
        KGFG  WRKW+RGCL++ +F++++NG  +G V ASRGLRQGDPLSPFLFTIV D LSR      E+ VL+G  VGR++T VS LQ+ADDTI FS+  E  
Subjt:  KGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESV

Query:  MLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSV
        M+    +L++    SGL +NL K++I GIN     ++R A    CK    PI  LG PLGGN +T  FWDP++++   +L+GW+K  LS GGR+TL QS 
Subjt:  MLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSV

Query:  LNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGW
        L  +P Y+ SL K P  V  K+E++ RDF+W+G       +LV W+    P S GG+G G +  RN ALL KWLWR+  E   LW +VI SIYG  S GW
Subjt:  LNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGW

Query:  RTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVAL
            I     R  W  I   Y  F KFT+F V  G RIRFW D W   QPL V +P +  +   K A ++         +WN   RR L D E+     L
Subjt:  RTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVAL

Query:  IGKLDNIQMGNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSPRDVDYVLKR--
        +   D + + + + D+ SW L  SGLF+ KS F A    S    +     +W  + P KVK F+W VA++ +NT+D +Q +    ALSP      +K   
Subjt:  IGKLDNIQMGNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSPRDVDYVLKR--

Query:  --VRILITSSFIVSMLARLGTSLQVCW
            + +  S  + +  RL  S ++ W
Subjt:  --VRILITSSFIVSMLARLGTSLQVCW

RVW99790.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]2.6e-28044.07Show/hide
Query:  GLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQIQGWVTGVY
        GLGS+ KR +VK+FLS E PD+V++QETK +  DRR+V SVWS R+  W +L A G++GGIL++W    +   + VLG++S+S +    G    W++ VY
Subjt:  GLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQIQGWVTGVY

Query:  GPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFLITHQWSNTF
        GP +S+ RK F  EL+D+AGL H  WCV GDFN++R   E+L  +R +  M+ F+ FI   +LID P+    +TWS + E     RLDRFL +++W   F
Subjt:  GPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFLITHQWSNTF

Query:  KELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDV
         +     L R TSDH+P+ L     +WGP PF+FENMWL H  F+++  +WW EF   GW G +FM KL+ +K +LKEWNK  FG L++KKK IL  +  
Subjt:  KELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDV

Query:  LDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYSKD
         D  E++G +    + +R   K  L E+ + ++    QK ++KW++EGD NS FFH+     +++ FI  LE+E G  L+    I++EI+ +F  LY   
Subjt:  LDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYSKD

Query:  DSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKKRGASKI
            + ++G +W+P+D + ++RLE PF E+EI++AI  +   K+PGPD  T   F++ W+++K DLV VF EF R+GIIN+ TN ++I LIPKK  + +I
Subjt:  DSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKKRGASKI

Query:  SDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRK
        SDFRPISL+TSLY+I+AKVLA RL+ +L  TI   Q AFVQGR ILD +L+A+E+V+E R   E+GV+FK+DFEKAYD VSW+FLD +L++KGF   WRK
Subjt:  SDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRK

Query:  WIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVMLNWWEILM
        W+RGCL++ ++++++NG  +G V ASRGLRQGDPLSPFLFTIV D LSR      E+ VL+G  VGR++T VS LQ+ADDTI FS+  E  ++    +L+
Subjt:  WIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVMLNWWEILM

Query:  LVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYF
        +    SGL +NL K++I GIN     ++R A    CK    PI  LG PLGGN +   FWDP++++   +L+ W+K  LS GGR+TL QS L  +P Y+ 
Subjt:  LVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYF

Query:  SLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGWRTMVITHIK
        SL K P  V  K+E++ R+F+W+G       +LV W+    P S GG+G G +  RN ALL KWLWR+  E   LW +VI SIYG  S GW         
Subjt:  SLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGWRTMVITHIK

Query:  GRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVALIGKLDNIQM
         R  W  I   +  F KFT+F V  G RIRFW D W   Q L   +P +  +   K A ++     ++  +WN   RR L D E+    +L+  LD + +
Subjt:  GRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVALIGKLDNIQM

Query:  GNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSP
           + D+ SW +  SGLF+ KS F A    S    +     +W  + P KVK F+W VA++ LNT+D +Q +  + ALSP
Subjt:  GNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSP

TrEMBL top hitse value%identityAlignment
A0A438FWU5 LINE-1 retrotransposable element ORF2 protein7.4e-28945.14Show/hide
Query:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQ
        MK++SWN RGLGS+ KR +V+ FLS +NPD+V+LQETK +  DRR V SVW  + V W +L A G++GGI+++W  + +E  + VLG++S++ +     +
Subjt:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQ

Query:  IQGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFL
           W+T VYGP +   RK F  EL D+ GL    WCV GDFN++R + E+L  TR T +MR F+ FI    LID P+ +  FTWS +       RLDRFL
Subjt:  IQGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFL

Query:  ITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKK
         + +W   F +   + L R TSDH P+ L    ++WGP PFRFENMWL HP+F++    WW E T  GW G +FM KLK +K +LKEWN   FG+L E+K
Subjt:  ITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKK

Query:  KTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIG
        K IL  +  +D  E++G++    + ER   +  L ++ + ++ +  QK ++KW++EGD NS FFHR     +S+ FI +L SE GE L+   DI +EI+ 
Subjt:  KTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIG

Query:  FFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLI
        FF NLYSK     + ++G +W P+  +    L+ PF E+E+ RA+  L   K+PGPDG T   ++  W+++K DL+ VF EF  NG+IN+ TN T+I L+
Subjt:  FFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLI

Query:  PKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKL
        PKK  + KISD+RPISLVTSLYKI+AKVL+ RL+ +L  TISD Q AFV+GR ILD +L+A+EVV+E R   E+G++FK+DFEKAYD V W FLD +L+ 
Subjt:  PKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKL

Query:  KGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESV
        KGF   WR WIRGCL++S+F+I++NG  +G V ASRGLRQGDPLSPFLFT+V D LSR      E  + +G  VGRD+T VS+LQ+ADDTI FS      
Subjt:  KGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESV

Query:  MLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSV
        + N   IL++  + SGL +NL K++I GINT  E ++  A+ F C+V   P+  LG PLGGN +T  FWDP+V++   +L+GWKK  LS GGR+TL QS 
Subjt:  MLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSV

Query:  LNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGW
        L+ +P Y+ SL K P  +  K+EK+ R+F+W+G       +LV+WE  + P   GG+G G +  RN ALL KWLWRF  E+ GLW +VIGSIYG    GW
Subjt:  LNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGW

Query:  RTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDEN-NQTWNLGLRRGLFDRELSSWVAL
           ++     R  W  I + +  F  F +  V  G RIRFW D W   Q L   F ++Y +  +K  +V+     +    WNL  RR L D E+     L
Subjt:  RTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDEN-NQTWNLGLRRGLFDRELSSWVAL

Query:  IGKLDNIQMGNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSPR
        +  L +++    + D  +W L  SGLF+ KS F A    S  I    A  +W  K P KVK   W VA+  +NT+DK+Q +    +L P+
Subjt:  IGKLDNIQMGNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSPR

A0A438GDE7 LINE-1 retrotransposable element ORF2 protein7.2e-29244.01Show/hide
Query:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQ
        MK+ISWN RGLGS+ KR +VKDFL  E PD+V+ QETK +  DRR V SVW++R+  W +L A G++GGIL++W    +   + +LG++S+S + T +G 
Subjt:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQ

Query:  IQGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFL
           W++ VYGP +S+ RK    EL+D+AGL    WCV GDFN++R   E+L  +R T SM+ F+ FIS  +LID+P+    FTWS +       RLDRFL
Subjt:  IQGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFL

Query:  ITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKK
         +++W  TF +     L R TSDH+P+ L     +WGP PFRFENMWL HP F+++  +WW EF   GW G +FM KL+ +K +LK WNK  FG L+++K
Subjt:  ITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKK

Query:  KTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIG
        + IL  +   D  E++G +    +A+R   K  L E+ + ++    QK ++KW++EGD NS FFH+     +++ FI  LE+E G+ ++    I++EI+ 
Subjt:  KTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIG

Query:  FFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLI
        +F  LY+      + ++G +W+P+  + + RLE PF E+EI +AI  +   K+PGPDG T   F++ W ++K DLV+VF EF R+GIIN+ TN ++I L+
Subjt:  FFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLI

Query:  PKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKL
        PKK  + +ISDFRPISL+TSLYKI+AKVLA R++++L  TI   Q AFVQGR ILD +L+A+E+V+E R   E+GV+FK+DFEKAYD VSW+FLD ++++
Subjt:  PKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKL

Query:  KGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESV
        KGFG  WRKW+RGCL++ +F++++NG  +G V ASRGLRQGDPLSPFLFTIV D LSR      E+ VL+G  VGR++T VS LQ+ADDTI FS+  E  
Subjt:  KGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESV

Query:  MLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSV
        M+    +L++    SGL +NL K++I GIN     ++R A    CK    PI  LG PLGGN +T  FWDP++++   +L+GW+K  LS GGR+TL QS 
Subjt:  MLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSV

Query:  LNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGW
        L  +P Y+ SL K P  V  K+E++ RDF+W+G       +LV W+    P S GG+G G +  RN ALL KWLWR+  E   LW +VI SIYG  S GW
Subjt:  LNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGW

Query:  RTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVAL
            I     R  W  I   Y  F KFT+F V  G RIRFW D W   QPL V +P +  +   K A ++         +WN   RR L D E+     L
Subjt:  RTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVAL

Query:  IGKLDNIQMGNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSPRDVDYVLKR--
        +   D + + + + D+ SW L  SGLF+ KS F A    S    +     +W  + P KVK F+W VA++ +NT+D +Q +    ALSP      +K   
Subjt:  IGKLDNIQMGNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSPRDVDYVLKR--

Query:  --VRILITSSFIVSMLARLGTSLQVCW
            + +  S  + +  RL  S ++ W
Subjt:  --VRILITSSFIVSMLARLGTSLQVCW

A5BPI6 Uncharacterized protein4.0e-28244.39Show/hide
Query:  KLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQI
        K+ISWN RGLGSR KR +VKDFL  E PD+V++QETK    DRR V SVW++R+  W  L A G++GGIL++W    +   + VLG++S+S +    G  
Subjt:  KLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQI

Query:  QGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFLI
        Q W + VYGP S++ RK F  EL+D+ GL    WCV GDFN++R   E+L   R T SM+  + FI   +LID P+    FTWS + E     RLDRFL 
Subjt:  QGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFLI

Query:  THQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKKK
        +++W   F +   + L R TSDH+P+ L     +WGP PFRFENMWL HP F++   +WW EF   GW G +FM KL+ LK +LKEWNK  FG+L E+KK
Subjt:  THQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKKK

Query:  TILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIGF
         IL  I   D  E++G + P  + +R   K  L E+ + ++    QK ++KW++EGD NS  FH+     +++ FI  LE+E G  L     I++EI+ +
Subjt:  TILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIGF

Query:  FTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIP
        F  LY+      + ++G +W+P+  + ++RLE PF E+EI++AI  +    +PGPDG T   F++ W+++K DLV VF EF R+GIIN+ TN ++I L+P
Subjt:  FTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIP

Query:  KKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLK
        KK  A KIS++RPISL+TSLYKI+AKVLA RL+ IL  TI   Q AFVQGR ILD +L+A+E+V+E +   E+GV+FK+DFEKAYD VSW+FLD +++ K
Subjt:  KKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLK

Query:  GFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVM
        GF    RKWIR CL++ +F+I++NG  +G V   RGLRQGDPLSPFLFTIV D  S       E+ V +G  VGR++T VS LQ+ADDTI FS+  E  +
Subjt:  GFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVM

Query:  LNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVL
        L    +L +    SGL +NL K++I GIN   + + R A    CK    PI  LG PLGGN ++ +FWDP++++  ++L+GW+K  LS GGR+TL QS L
Subjt:  LNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVL

Query:  NSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGWR
          +P Y+ SL K P  V  ++E+L RDF+W+G       +LV W+        GG+G+G +  RN+ALL KWLWR+  E   LW +VI SIYG  S GW 
Subjt:  NSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGWR

Query:  TMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVALI
                 R  W  I + +  F KFT+F V  G RIRFW D W   Q L V FP +  +   K   ++         +WN   RR L D E+    +L+
Subjt:  TMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVALI

Query:  GKLDNIQMGNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSP
          LD+I +   + D+ SW L  SGLF+ KS F A    S   ++     +W  + P K+K F+W VA++ +NT+D +Q +    ALSP
Subjt:  GKLDNIQMGNEM-DRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSP

A5CAA2 Reverse transcriptase domain-containing protein7.5e-28145.19Show/hide
Query:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQ
        MK+ISWN RGLGS+ KR +VKDFL  E PD+V+ QETK +  DRR V SVW++R+  W +L A G++GGIL++W    +   + +LG++S+S + T +G 
Subjt:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQ

Query:  IQGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFL
           W++ VYGP +S+ RK    EL+D+AGL    WCV GDFN++R   E+L  +R T SM+ F+ FIS  +LID+P+    FTWS +       RLDRFL
Subjt:  IQGWVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFL

Query:  ITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKK
         +++W  TF +     L R TSDH+P+ L     +WGP PFRFENMWL HP F+++  +WW EF   GW G +FM KL+ +K +LK WNK  FG L+++K
Subjt:  ITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKK

Query:  KTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIG
        + IL  +   D  E++G +    +A+R   K  L E+ + ++    QK ++KW++EGD NS FFH+     +++ FI  LE+E G+ ++    I++EI+ 
Subjt:  KTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIG

Query:  FFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLI
        +F  LY+      + ++G +W+P+  + + RLE PF E+EI +AI  +   K+PGPDG T   F++ W ++K DLV+VF EF R+GIIN+ TN ++I L+
Subjt:  FFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLI

Query:  PKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKL
        PKK  + +ISDFRPISL+TSLYKI+AKVLA R++++L  TI   Q AFVQGR ILD +L+A+E+V+E R   E+GV+FK+DFEKAYD VSW+FLD ++++
Subjt:  PKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKL

Query:  KGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESV
        KGFG  WRKW+RGCL++ +F++++NG  +G V ASRGLRQGDPLSPFLFTIV D LSR      E+ VL+G  VGR++T VS LQ+ADDTI FS+  E  
Subjt:  KGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESV

Query:  MLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSV
        M+    +L++    SGL +NL K++I GIN     ++R A    CK    PI  LG PLGGN +T  FWDP++++   +L+GW+K  LS GGR+TL QS 
Subjt:  MLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSV

Query:  LNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGW
        L  +P Y+ SL K P  V  K+E++ RDF+W+G       +LV W+    P S GG+G G +  RN ALL KWLWR+  E   LW +VI SIYG  S GW
Subjt:  LNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGW

Query:  RTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVAL
            I     R  W  I   Y  F KFT+F V  G RIRFW D W   QPL V +P +  +   K A ++         +WN   RR L D E+     L
Subjt:  RTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQ-TWNLGLRRGLFDRELSSWVAL

Query:  IGKLDNIQMGNEM-DRISWKLEGS--GLFSTKSLFRAAVG
        +   D + + + + D+ SW L  S    F  K + R  VG
Subjt:  IGKLDNIQMGNEM-DRISWKLEGS--GLFSTKSLFRAAVG

M5VS59 Reverse transcriptase domain-containing protein (Fragment)5.2e-28243.44Show/hide
Query:  KRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQIQGWVTGVYGPCSSS
        KR+LVK+ L +  PD+VIL ETK + +DR++V  VW SR   W+   ++G +GGI ++W    V V DS++G +S+S     +     W++G+YGPC   
Subjt:  KRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQIQGWVTGVYGPCSSS

Query:  DRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFLITHQWSNTFKELRLD
        +R SF +ELAD+ G C   WC+ GDFN+VR+  E+ N  R TKSMR FN FI   +L D  + +  FTWS + E A   RLDRFL++  W + F   R  
Subjt:  DRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFLITHQWSNTFKELRLD

Query:  RLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQFEE
         L R TSDH P+ L    ++WGP PFRFENMWL+HPDF + ++ WW E    GW G++FM++LK LK +LK W+KE FG++    +    ++ VLDQ E 
Subjt:  RLQRPTSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQFEE

Query:  DGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYSKDDSSRFV
           +     +ER  L   + ++   ++ +  Q+ K+KW REGD N+ FFHR     + +++I  LE E+   +  + +IE+E+I FF  LYS + +  + 
Subjt:  DGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYSKDDSSRFV

Query:  LDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKKRGASKISDFRPI
        ++G NW P+    +  LE PF  +E+ +A+   G  KSPGPDG +  FF++ W ++K DL++V Q+FF++GI+N  TNET+ICLIPKK  + K++D RPI
Subjt:  LDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKKRGASKISDFRPI

Query:  SLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCL
        SLVTSLYK+++KVLA RL+++L  TIS  Q AFVQ R ILD +LVA+EVVEE R +  KG++FK+DFEKAYD V W F+D +L  KGFG  WR WI GCL
Subjt:  SLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCL

Query:  TNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVMLNWWEILMLVLKGS
         + NFSIMINGKPRGK  ASRGLRQGDPLSPFLFT+V D LSR      +  ++ G++ G DQ EVS LQ+ADDTI     +E   LN  ++L L    S
Subjt:  TNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVMLNWWEILMLVLKGS

Query:  GLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSLLKAP
        G+ +N AK+ I+GIN  +E +   A  +GC+V   P+  LG PLGGN R   FW+P++DK E +L+ WK+  LSKGGR+TL Q+VL+S+P YY SL K P
Subjt:  GLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSLLKAP

Query:  TMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGWRTMVITHIKGRRLWP
          V  K+E+L R+F+W G +    C+LV+WE        GG+G+G+L  RN AL  KWLWRF  E   LW R+I S YG+DS GW T  I  +  R  W 
Subjt:  TMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGWRTMVITHIKGRRLWP

Query:  NIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENN---QTWNLGLRRGLFDRELSSWVALIGKLDNIQM-GNE
         I + Y+ F +  +F V  G +IRFW D W     L+  FP +  +S  K  S+A C+  N+     W+   RR L + E++  V L+  L N+++ G+ 
Subjt:  NIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENN---QTWNLGLRRGLFDRELSSWVALIGKLDNIQM-GNE

Query:  MDRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSP----------RDVDYVLKRVRILI
         DR SW++E  G FS KS FR+ +  +T+        IWK K+P K++ F+W  A   +NT D +QR+     LSP           ++D++   +    
Subjt:  MDRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSP----------RDVDYVLKRVRILI

Query:  TSSFIVSMLARLG--------TSLQVCWGYLSAYQ
        +      ML  LG        + +++C G+L +YQ
Subjt:  TSSFIVSMLARLG--------TSLQVCWGYLSAYQ

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein5.9e-5723.95Show/hide
Query:  LISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSL---DAIGSAGGILLMWKENC----VEVHDSVLGAYSISAEC
        +++ NV GL S  KR  +  ++  ++P +  +QET L   D   +K        GW  +   +      G+ ++  +       ++     G Y +    
Subjt:  LISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSL---DAIGSAGGILLMWKENC----VEVHDSVLGAYSISAEC

Query:  TFHGQIQG---WVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIP------------MNHG
           G IQ     +  +Y P + + R    Q L+D+         + GDFN    + +R    +  K  ++ N  +   DLIDI              +  
Subjt:  TFHGQIQG---WVTGVYGPCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIP------------MNHG

Query:  RFTWSR----VGERAAASRLDRF-LITHQWSNTFK---ELRLDRLQRPTSDHFPLALSVGAMRW--GPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWA
          T+S+    VG +A  S+  R  +IT+  S+      ELR+  L +  S  + L   +    W    M    +  +  + +   + +  WD F      
Subjt:  RFTWSR----VGERAAASRLDRF-LITHQWSNTFK---ELRLDRLQRPTSDHFPLALSVGAMRW--GPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWA

Query:  GFRFMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGS
         F  ++  K  +E+              K  T+  Q+  L++ E+  S +     E  K++A L EI      + + + +  +    ++      R +  
Subjt:  GFRFMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGS

Query:  MKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYSKD----DSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKN
         + K+ I  +++++G+  +   +I+  I  ++ +LY+      +     LD      L+ +    L  P    EI   I SL   KSPGPDG T EF++ 
Subjt:  MKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYSKD----DSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKN

Query:  FWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKK-RGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVV
        +   L P L+++FQ   + GI+     E  I LIPK  R  +K  +FRPISL+    KI+ K+LA R++  +   I   Q  F+ G      I  +  V+
Subjt:  FWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKK-RGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVV

Query:  EEY-RCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCL
        +   R K++  V+  +D EKA+DK+   F+   L   G    + K IR        +I++NG+         G RQG PLSP LF IV + L+R+     
Subjt:  EEY-RCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCL

Query:  EKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVMLNWWEILMLV---LKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGG
        ++K +KG+ +G+++ ++S+  +ADD I+   Y E+ +++   +L L+    K SG  +N+ K+     N   +  ++   +    + +  IK LG  L  
Subjt:  EKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVMLNWWEILMLV---LKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGG

Query:  NHRT--RTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSL--LKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGI
        + +   +  + P++ + +     WK +  S  GR+ + +  +    IY F+   +K P     +LEK T  FIWN  + + A +++  +  A     GGI
Subjt:  NHRT--RTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSL--LKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGI

Query:  GVGALGSRNNALLTKWLWRFAHEKE-GLWRR
         +        A +TK  W +   ++   W R
Subjt:  GVGALGSRNNALLTKWLWRFAHEKE-GLWRR

P08548 LINE-1 reverse transcriptase homolog3.2e-5524.65Show/hide
Query:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDR-RIVKSVWSSRHVGWLSLDAIGSAGGILLMWKE----NCVEVHDSVLGAYSISAEC
        + + S NV GL    KR  + D++ K  PD+  +QE+ L   D+ R+    WSS        +      GI +++ +       ++     G +      
Subjt:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDR-RIVKSVWSSRHVGWLSLDAIGSAGGILLMWKE----NCVEVHDSVLGAYSISAEC

Query:  TFHGQIQGWVTGVYGPCSSSDRKSFLQE-LADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDI----PMNHGRFTWSRVGER
        T + +I   +  +Y P  + +   F++E L D++ L      V GDFN    V +R +  + +K +   N  I   DL DI      N   +T+      
Subjt:  TFHGQIQGWVTGVYGPCSSSDRKSFLQE-LADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDI----PMNHGRFTWSRVGER

Query:  AAASRLDRFLITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMR--------WGPMPFRFENMWLDHPDFRKSVEKW----------WDEFTPTGWAGF
           S++D  ++ H+ SN  K  +++ +    SDH  + + +   R        W       ++ W+   + +K + K+          +     T  A  
Subjt:  AAASRLDRFLITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMR--------WGPMPFRFENMWLDHPDFRKSVEKW----------WDEFTPTGWAGF

Query:  RFMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQ---KCKIKWLREGDENSAFFHRWVG
        R   K   L+  LK+  +E   NL    K +        + EE  + +P    E  K++A L EI   + +R++Q   K K  +  + ++          
Subjt:  RFMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQ---KCKIKWLREGDENSAFFHRWVG

Query:  SMKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYS-KDDSSRFV---LDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFK
          + KS I+++ +   E  +   +I+K +  ++  LYS K ++ + +   L+  +   L  +    L  P    EI   I++L   KSPGPDG T EF++
Subjt:  SMKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYS-KDDSSRFV---LDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFK

Query:  NFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKK-RGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEV
         F   L P L+ +FQ   + GI+     E  I LIPK  +  ++  ++RPISL+    KI+ K+L  R++  +   I   Q  F+ G      I  +  V
Subjt:  NFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKK-RGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEV

Query:  VEEY-RCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYC
        ++   + KN+  ++  +D EKA+D +   F+   LK  G   T+ K I    +    +I++NG          G RQG PLSP LF IV + L+ +    
Subjt:  VEEY-RCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYC

Query:  LEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVMLNWWEILMLVLKGSGLSLNLAKT--SIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGG
         E+K +KG+ +G ++ ++S+  +ADD I++            E++      SG  +N  K+   I   N  +E   + +  F     T+  K + Y   G
Subjt:  LEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVMLNWWEILMLVLKGSGLSLNLAKT--SIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGG

Query:  NHRTRTFWDPIVDKYEA-------KLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSL--LKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPT
         + T+   D   + YE         +  WK +  S  GR+ + +  +    IY F+   +KAP    K LEK+   FIWN  + + A  L+  +  A   
Subjt:  NHRTRTFWDPIVDKYEA-------KLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSL--LKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPT

Query:  SHGGIGVGALGSRNNALLTKWLWRFAHEKE-GLWRRV
          GGI +  L     +++ K  W +   +E  +W R+
Subjt:  SHGGIGVGALGSRNNALLTKWLWRFAHEKE-GLWRRV

P0C2F6 Putative ribonuclease H protein At1g657501.2e-3027.53Show/hide
Query:  IVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLT
        I+++  +++ GW++  LS  GR+TL ++VL+S+P++  S +  P  ++ +L++L+R F+W     K   +LVKW     P   GG+GV A  S N AL++
Subjt:  IVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLT

Query:  KWLWRFAHEKEGLWRRVIGSIYGVDSLGWRTMVITHIKGRRLWPNIQRNY-DCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVA
        K  WR   EK  LW  V+   Y V  +     +I        W +I     D       +    G++IRFW D W   +PL        E+ + +  +  
Subjt:  KWLWRFAHEKEGLWRRVIGSIYGVDSLGWRTMVITHIKGRRLWPNIQRNY-DCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVA

Query:  ECWDENNQTWNLGLRRGLFDRELSSWVALIGKLDNIQMGNEM-----DRISWKLEGSGLFSTKSLFR-AAVGKSTKINL-SLAGKIWKHKSPKKVKIFLW
        +     +  W  G  RG    ++  +     +L+   +  ++     DR+SWK    G FS +S +    V +  + N+ S    +WK + P++VK FLW
Subjt:  ECWDENNQTWNLGLRRGLFDRELSSWVALIGKLDNIQMGNEM-----DRISWKLEGSGLFSTKSLFR-AAVGKSTKINL-SLAGKIWKHKSPKKVKIFLW

Query:  SVAYRSLNTDDKVQRK
         V  +++ T+++  R+
Subjt:  SVAYRSLNTDDKVQRK

P11369 LINE-1 retrotransposable element ORF2 protein4.7e-5423.71Show/hide
Query:  LISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSL---DAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHG
        LIS N+ GL S  KR  + D+L K++P    LQET L+  DR  +      R  GW ++   + +    G+ ++  +  ++    V+            G
Subjt:  LISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSL---DAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHG

Query:  QI---QGWVTGVYGPCSSSDRKSFLQE-LADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDI-----PMNHGRFTWSRVGER
        +I   +  +  +Y P  ++   +F+++ L  +         + GDFN      +R    +  +   K    +   DL DI     P   G   +S     
Subjt:  QI---QGWVTGVYGPCSSSDRKSFLQE-LADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDI-----PMNHGRFTWSRVGER

Query:  AAASRLDRFLITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMP---FRFENMWLDHPDFRKSVEKWWDEFT----------PTGWAGFR--FM
           S++D  +      N +K + +  +    SDH  L L          P   ++  N  L+    ++ ++K   +F           P  W   +    
Subjt:  AAASRLDRFLITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAMRWGPMP---FRFENMWLDHPDFRKSVEKWWDEFT----------PTGWAGFR--FM

Query:  SKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKS
         KL  L    K+       +L    K +        + +E  S +     E +KL+  + ++      + + + +  +  + ++      R     + K 
Subjt:  SKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQFEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKS

Query:  FIAALESEEGEFLSAEPDIEKEIIGFFTNLYSK-----DDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNI
         I  + +E+G+  +   +I+  I  F+  LYS      D+  +F LD      L+      L  P    EI   I SL   KSPGPDG + EF++ F   
Subjt:  FIAALESEEGEFLSAEPDIEKEIIGFFTNLYSK-----DDSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNI

Query:  LKPDLVEVFQEFFRNGIINKKTNETYICLIPK-KRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEY-
        L P L ++F +    G +     E  I LIPK ++  +KI +FRPISL+    KI+ K+LA R+++ +   I   Q  F+ G      I  +  V+    
Subjt:  LKPDLVEVFQEFFRNGIINKKTNETYICLIPK-KRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEY-

Query:  RCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKV
        + K++  ++  LD EKA+DK+   F+  +L+  G    +   I+   +    +I +NG+    +    G RQG PLSP+LF IV + L+R+     ++K 
Subjt:  RCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKV

Query:  LKGMLVGRDQTEVSMLQYADDTIIFSAYEESVMLNWWEILMLV-----LKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNH
        +KG+ +G+++ ++S+L  ADD I++ +  ++   +  E+L L+     + G  ++ N +   +   N  +E   R    F   + T  IK LG  L    
Subjt:  LKGMLVGRDQTEVSMLQYADDTIIFSAYEESVMLNWWEILMLV-----LKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNH

Query:  RTRTFWD----PIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSL--LKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGI
          +  +D     +  + +  L  WK L  S  GR+ + +  +    IY F+   +K PT    +LE     F+WN  + + A +L+K + T+     GGI
Subjt:  RTRTFWD----PIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSL--LKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGI

Query:  GVGALGSRNNALLTKWLWRFAHEKE-GLWRRV
         +  L     A++ K  W +  +++   W R+
Subjt:  GVGALGSRNNALLTKWLWRFAHEKE-GLWRRV

P14381 Transposon TX1 uncharacterized 149 kDa protein3.1e-5824.87Show/hide
Query:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSR----HVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECT
        + + + N  G  +  +   V  FL +    +  LQET             W  R    H+ W S        G++ ++ ++       VL A S+     
Subjt:  MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSR----HVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECT

Query:  FHGQIQGW-----VTGVYGPCSSSDRKSFLQELADVAGLCHG--VWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHG----RFTWS
         H +++       +  VY P +  +R  F + L+             + GDFN      +R    +   S       I+ F L+D+          FT+ 
Subjt:  FHGQIQGW-----VTGVYGPCSSSDRKSFLQELADVAGLCHG--VWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHG----RFTWS

Query:  RVGE-RAAASRLDRFLITHQWSNTFKELRLDRLQRPTSDH--FPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFR---------
        RV +   + SR+DR  I+    +  +   + RL  P SDH    L +S+         + F N  L+   F KSV   W      GW  F+         
Subjt:  RVGE-RAAASRLDRFLITHQWSNTFKELRLDRLQRPTSDH--FPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFR---------

Query:  -FMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQ---FEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVG
          + K+  LK   +E+ K V G  N + + +  ++  L+Q     ED ++Q     E L+ K +L  +     R    + +++ L + D  S FF+    
Subjt:  -FMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQ---FEEDGSIQPHHIAERLKLKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVG

Query:  SMKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYSKD----DSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFK
           ++  I  L +E+G  L     I      F+ NL+S D    D+   + DG     +  +   RLE P   DE+ +A++ +   KSPG DG+T EFF+
Subjt:  SMKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYSKD----DSSRFVLDGPNWAPLDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFK

Query:  NFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVV
         FW+ L PD   V  E F+ G +        + L+PKK     I ++RP+SL+++ YKIVAK ++ RLK +L   I   Q+  V GR I D + +  +++
Subjt:  NFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQGRLILDPILVASEVV

Query:  EEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLE
           R          LD EKA+D+V  ++L   L+   FG  +  +++    ++   + IN      +   RG+RQG PLS  L+     +L+     CL 
Subjt:  EEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLE

Query:  KKVLKGMLVGRDQTEVSMLQYADDTIIFS----------------AYEESVMLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVET
        +K L G+++      V +  YADD I+ +                A   S  +NW        K SGL     K   +    P+     W +K       
Subjt:  KKVLKGMLVGRDQTEVSMLQYADDTIIFS----------------AYEESVMLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVET

Query:  LPIKNLG-------YPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSLLKAPTM-VIKKLEKLTRDFIWNGGQFKPACN
          IK LG       YP+  N       +  V     K +G+ K+L  +G  + + Q V +   I+Y  +  +PT   I K+++   DF+W G  +     
Subjt:  LPIKNLG-------YPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSLLKAPTM-VIKKLEKLTRDFIWNGGQFKPACN

Query:  LVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAH-EKEGLWRRVIGSIY
         V    ++LP   GG GV  + S+ +    + + R+ + +    W  +  S Y
Subjt:  LVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAH-EKEGLWRRVIGSIY

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.9e-4228.11Show/hide
Query:  GDFNMVRWVDER---LNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWS-RVGERAAASRLDRFLITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAM
        GDF+ +    +    L  + P + + +F   +   DL+DIP     +TWS    +     +LDR +    W ++F            SDH P  + +  +
Subjt:  GDFNMVRWVDER---LNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWS-RVGERAAASRLDRFLITHQWSNTFKELRLDRLQRPTSDHFPLALSVGAM

Query:  -RWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQ----FEEDGSIQPHHIAERLK
         +     FR+ +    HP F  S+   W+E  P G   F     LK  K+  K  N++ FGN+  K K  LD ++ +         D   +  H+A +  
Subjt:  -RWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQ----FEEDGSIQPHHIAERLK

Query:  LKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYSKD------DSSRFVLDGPNWAP
         K +     +    R  QK +IKWL++GD N+ FFH+ + + ++K+ I  L  ++   +     +++ I+ ++T+L   D      DS + + D   +  
Subjt:  LKASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYSKD------DSSRFVLDGPNWAP

Query:  LDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKKRGASKISDFRPISLVTSLYK
         D  +++RL     + EI  A+ ++   K+PGPD  T EFF   W ++K   +   +EFFR G + K+ N T I LIPK  G  ++S FRP+S  T +YK
Subjt:  LDAQISARLEMPFKEDEIFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKKRGASKISDFRPISLVTSLYK

Query:  IV
        I+
Subjt:  IV

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.2e-1924.02Show/hide
Query:  LPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTA
        LP++ LG PL     T + + P+V+K   ++  W    LS  GR+ L  SV++SL  ++ S  + P+  IK+++ +   F+W+G +       V W    
Subjt:  LPIKNLGYPLGGNHRTRTFWDPIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTA

Query:  LPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLG-WRTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDV
         P   GG+G+ +L   N              K   W     SI G  +LG W            +W  I ++      F K  +  G    FW D W  +
Subjt:  LPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLG-WRTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDV

Query:  -QPLQVT-FPEIYEISHMKLASVAECWDENNQTWNLGLRRGLFDRELSSWVALIGKLDNIQMGNEMDRISWKLEG---SGLFSTKSLFRAAVGKSTKINL
         + + VT      ++     ASVAE         N   RR   D  L     +I ++ +  + +  D + WK  G      F+TK  + A   +  K+ +
Subjt:  -QPLQVT-FPEIYEISHMKLASVAECWDENNQTWNLGLRRGLFDRELSSWVALIGKLDNIQMGNEMDRISWKLEG---SGLFSTKSLFRAAVGKSTKINL

Query:  SLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKV
        +    +W   +  K  +  W      L T D++
Subjt:  SLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKV

AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.5e-1028.57Show/hide
Query:  LAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGV----LFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCLTNSNFSIMI
        + ERLK ++   I   QA+F+ GR+  D I+   E V   R K  KGV    L KLD EKAYD++ W++L+  L   GF   W   I      S F    
Subjt:  LAERLKDILPTTISDCQAAFVQGRLILDPILVASEVVEEYRCKNEKGV----LFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCLTNSNFSIMI

Query:  NGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSM-LQYADDTIIFSAYEESV-MLNWWEILMLVLKGSGLSLNLA
             G+  AS+  R  D    F +  +    + ++  C E  +L+   +GR     SM L   ++ +   A  +S+  +  W+ L  + +    SL++ 
Subjt:  NGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSM-LQYADDTIIFSAYEESV-MLNWWEILMLVLKGSGLSLNLA

Query:  KTS
         T+
Subjt:  KTS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.6e-0726.49Show/hide
Query:  SLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWE-WTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGWR
        +LP+Y  S  +   ++ KKL     +F W+  + K   + V W+         GG+G   LG  N ALL K  +R  H+   L  R++ S Y   S    
Subjt:  SLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWE-WTALPTSHGGIGVGALGSRNNALLTKWLWRFAHEKEGLWRRVIGSIYGVDSLGWR

Query:  TMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAW-CDVQPL
          V T  +    W +I    +   +     +  G   + W D W  D  PL
Subjt:  TMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAW-CDVQPL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.2e-0945.59Show/hide
Query:  MINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDT
        +ING P+G V  SRGLRQGDPLSP+LF +  + LS       E+  L G+ V  +   ++ L +ADDT
Subjt:  MINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSAHYCLEKKVLKGMLVGRDQTEVSMLQYADDT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCTGATCTCCTGGAATGTTAGGGGCCTGGGAAGCCGGTCTAAACGGATGTTAGTTAAAGATTTCCTTAGTAAAGAGAATCCGGACTTAGTGATCTTGCAAGAGAC
TAAACTTCAGCACATTGATAGGCGGATAGTCAAATCAGTTTGGAGCTCCAGGCATGTTGGGTGGCTCAGTCTAGATGCGATCGGGTCGGCTGGTGGTATCCTTTTAATGT
GGAAAGAAAATTGCGTTGAGGTTCACGATTCGGTCTTAGGGGCTTACTCCATTTCTGCCGAGTGTACCTTCCATGGTCAGATTCAAGGGTGGGTGACTGGAGTTTACGGC
CCTTGTTCGTCTTCTGATAGAAAGAGCTTTCTGCAGGAATTGGCAGATGTGGCTGGCCTGTGCCATGGCGTTTGGTGTGTCTCTGGCGACTTCAATATGGTGAGATGGGT
GGATGAAAGGCTTAACGCCACTAGGCCTACAAAAAGCATGAGGAAGTTTAATCGCTTCATATCTTCTTTTGATCTTATTGATATTCCGATGAATCATGGCAGGTTTACTT
GGTCTAGAGTGGGCGAAAGAGCTGCAGCCTCGAGGCTTGACAGATTTTTAATCACCCATCAGTGGTCTAATACGTTCAAGGAGCTCAGACTGGACAGACTTCAGCGTCCA
ACTTCTGATCACTTTCCCCTAGCTTTATCCGTAGGTGCCATGAGATGGGGCCCTATGCCTTTTAGATTCGAGAATATGTGGCTTGATCATCCGGACTTCAGAAAGTCAGT
GGAGAAATGGTGGGATGAGTTCACCCCGACCGGTTGGGCTGGCTTCCGTTTTATGAGTAAGTTGAAGGGCCTGAAGGAGCAGCTTAAAGAGTGGAACAAAGAAGTTTTTG
GGAACCTGAATGAGAAAAAGAAGACCATTCTTGATCAAATTGATGTCTTAGATCAGTTCGAGGAGGATGGGAGTATCCAACCACATCACATTGCTGAGAGGCTTAAGCTG
AAAGCTTCCCTCCTTGAAATTACGGTGAGTGACCAAAGAAGATTACTGCAGAAATGCAAAATTAAATGGTTGAGGGAAGGTGACGAGAACTCTGCGTTCTTTCATAGATG
GGTTGGTTCCATGAAGAGCAAGTCCTTCATTGCCGCTCTTGAAAGTGAAGAGGGAGAATTCCTTTCAGCCGAGCCCGACATTGAGAAGGAGATCATTGGTTTTTTCACCA
ACTTGTATTCCAAAGATGATAGCTCTCGGTTTGTCTTAGATGGTCCGAATTGGGCCCCCTTAGATGCTCAGATCAGTGCTCGACTAGAGATGCCTTTCAAAGAAGATGAG
ATCTTTAGAGCTATTAAAAGCTTAGGCCCTATGAAGTCCCCGGGCCCCGACGGCATGACTGGAGAGTTTTTTAAAAACTTTTGGAACATTTTGAAGCCAGATTTAGTAGA
GGTGTTCCAGGAGTTTTTTAGAAACGGCATCATAAACAAGAAAACTAACGAGACCTACATTTGCTTGATCCCGAAGAAAAGGGGTGCCTCCAAAATCAGTGATTTCAGAC
CTATCAGCTTAGTTACTTCCCTCTACAAAATTGTGGCCAAAGTCCTTGCTGAGAGACTAAAGGATATCCTCCCCACTACTATAAGTGATTGTCAGGCCGCATTTGTGCAA
GGGCGGCTAATTCTTGACCCAATTCTGGTGGCTTCTGAGGTGGTAGAAGAGTATAGATGCAAGAATGAGAAAGGAGTGCTCTTCAAGCTTGATTTTGAAAAAGCCTATGA
TAAGGTGAGCTGGGAGTTTCTTGATGCCATTCTTAAGCTTAAAGGATTCGGGACCACTTGGAGGAAATGGATTAGAGGTTGCCTGACGAACTCTAACTTCTCCATTATGA
TAAATGGCAAGCCGAGGGGGAAAGTTTATGCTTCTAGAGGGCTAAGACAAGGGGACCCCCTATCTCCTTTCCTTTTTACCATTGTAGGTGATGCCCTCAGTAGATCGGCC
CATTACTGCTTGGAAAAGAAAGTCCTTAAAGGCATGCTCGTGGGCAGGGACCAAACTGAGGTGTCCATGTTGCAATACGCTGATGACACAATCATTTTTAGTGCATATGA
GGAGTCTGTTATGTTGAACTGGTGGGAGATTTTGATGCTTGTTTTGAAGGGATCGGGGCTGTCTTTGAACCTCGCCAAAACCTCGATTATTGGGATCAACACTCCTTCGG
AGGACATGACTAGGTGGGCCAACAAATTTGGGTGCAAGGTTGAGACTCTCCCAATAAAGAATCTAGGTTATCCCCTGGGTGGGAACCATCGCACTAGAACGTTTTGGGAC
CCCATTGTTGATAAATATGAAGCTAAACTTGAAGGTTGGAAGAAGTTGCTGCTCTCTAAAGGGGGAAGGGTCACTTTGGCCCAGTCGGTCCTTAATAGCCTCCCCATATA
CTACTTCTCCCTCCTTAAAGCCCCGACGATGGTGATTAAGAAACTAGAAAAGCTTACGAGGGACTTTATTTGGAATGGTGGGCAGTTCAAACCCGCTTGCAATCTTGTCA
AATGGGAATGGACAGCTTTGCCTACTTCCCATGGAGGCATTGGGGTTGGTGCTTTAGGTTCGCGGAACAACGCCCTTCTTACCAAGTGGCTTTGGAGATTTGCCCATGAG
AAGGAAGGCCTATGGAGGAGGGTCATTGGCAGTATCTATGGGGTGGATAGTCTTGGCTGGAGAACTATGGTGATTACCCATATAAAAGGTAGAAGATTATGGCCCAATAT
TCAGCGGAATTATGACTGTTTTGAAAAGTTTACTAAATTCCAGGTGAGCTGTGGGAGAAGAATAAGGTTTTGGGGTGATGCTTGGTGTGATGTGCAACCTCTTCAAGTTA
CTTTCCCGGAAATTTATGAGATTTCCCATATGAAACTTGCTTCCGTGGCAGAGTGTTGGGATGAGAACAACCAAACGTGGAACCTTGGCCTGCGAAGAGGGCTGTTTGAC
CGTGAGCTGAGCAGTTGGGTGGCCCTCATTGGCAAGTTAGACAACATCCAGATGGGTAATGAGATGGACAGAATATCTTGGAAATTGGAAGGATCGGGGCTTTTCTCTAC
CAAATCCTTATTTCGTGCAGCTGTTGGAAAATCTACCAAGATTAACCTGTCTCTTGCGGGAAAGATTTGGAAGCACAAGTCCCCTAAGAAAGTGAAAATCTTCCTTTGGA
GTGTGGCTTACAGAAGTCTAAATACGGATGACAAAGTTCAAAGGAAACTAAAAAATTGGGCTCTATCCCCTCGGGATGTAGACTATGTTTTAAAGAGAGTGAGGATATTG
ATCACATCCTCCTTCATTGTGAGTATGCTAGCAAGACTTGGAACTTCATTGCAGGTCTGTTGGGGATATCTTTCTGCTTACCAAAGAGGGTGGAAGACTGGCTTGAAGAA
GGGCTGCAAGCTTGGAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCTGATCTCCTGGAATGTTAGGGGCCTGGGAAGCCGGTCTAAACGGATGTTAGTTAAAGATTTCCTTAGTAAAGAGAATCCGGACTTAGTGATCTTGCAAGAGAC
TAAACTTCAGCACATTGATAGGCGGATAGTCAAATCAGTTTGGAGCTCCAGGCATGTTGGGTGGCTCAGTCTAGATGCGATCGGGTCGGCTGGTGGTATCCTTTTAATGT
GGAAAGAAAATTGCGTTGAGGTTCACGATTCGGTCTTAGGGGCTTACTCCATTTCTGCCGAGTGTACCTTCCATGGTCAGATTCAAGGGTGGGTGACTGGAGTTTACGGC
CCTTGTTCGTCTTCTGATAGAAAGAGCTTTCTGCAGGAATTGGCAGATGTGGCTGGCCTGTGCCATGGCGTTTGGTGTGTCTCTGGCGACTTCAATATGGTGAGATGGGT
GGATGAAAGGCTTAACGCCACTAGGCCTACAAAAAGCATGAGGAAGTTTAATCGCTTCATATCTTCTTTTGATCTTATTGATATTCCGATGAATCATGGCAGGTTTACTT
GGTCTAGAGTGGGCGAAAGAGCTGCAGCCTCGAGGCTTGACAGATTTTTAATCACCCATCAGTGGTCTAATACGTTCAAGGAGCTCAGACTGGACAGACTTCAGCGTCCA
ACTTCTGATCACTTTCCCCTAGCTTTATCCGTAGGTGCCATGAGATGGGGCCCTATGCCTTTTAGATTCGAGAATATGTGGCTTGATCATCCGGACTTCAGAAAGTCAGT
GGAGAAATGGTGGGATGAGTTCACCCCGACCGGTTGGGCTGGCTTCCGTTTTATGAGTAAGTTGAAGGGCCTGAAGGAGCAGCTTAAAGAGTGGAACAAAGAAGTTTTTG
GGAACCTGAATGAGAAAAAGAAGACCATTCTTGATCAAATTGATGTCTTAGATCAGTTCGAGGAGGATGGGAGTATCCAACCACATCACATTGCTGAGAGGCTTAAGCTG
AAAGCTTCCCTCCTTGAAATTACGGTGAGTGACCAAAGAAGATTACTGCAGAAATGCAAAATTAAATGGTTGAGGGAAGGTGACGAGAACTCTGCGTTCTTTCATAGATG
GGTTGGTTCCATGAAGAGCAAGTCCTTCATTGCCGCTCTTGAAAGTGAAGAGGGAGAATTCCTTTCAGCCGAGCCCGACATTGAGAAGGAGATCATTGGTTTTTTCACCA
ACTTGTATTCCAAAGATGATAGCTCTCGGTTTGTCTTAGATGGTCCGAATTGGGCCCCCTTAGATGCTCAGATCAGTGCTCGACTAGAGATGCCTTTCAAAGAAGATGAG
ATCTTTAGAGCTATTAAAAGCTTAGGCCCTATGAAGTCCCCGGGCCCCGACGGCATGACTGGAGAGTTTTTTAAAAACTTTTGGAACATTTTGAAGCCAGATTTAGTAGA
GGTGTTCCAGGAGTTTTTTAGAAACGGCATCATAAACAAGAAAACTAACGAGACCTACATTTGCTTGATCCCGAAGAAAAGGGGTGCCTCCAAAATCAGTGATTTCAGAC
CTATCAGCTTAGTTACTTCCCTCTACAAAATTGTGGCCAAAGTCCTTGCTGAGAGACTAAAGGATATCCTCCCCACTACTATAAGTGATTGTCAGGCCGCATTTGTGCAA
GGGCGGCTAATTCTTGACCCAATTCTGGTGGCTTCTGAGGTGGTAGAAGAGTATAGATGCAAGAATGAGAAAGGAGTGCTCTTCAAGCTTGATTTTGAAAAAGCCTATGA
TAAGGTGAGCTGGGAGTTTCTTGATGCCATTCTTAAGCTTAAAGGATTCGGGACCACTTGGAGGAAATGGATTAGAGGTTGCCTGACGAACTCTAACTTCTCCATTATGA
TAAATGGCAAGCCGAGGGGGAAAGTTTATGCTTCTAGAGGGCTAAGACAAGGGGACCCCCTATCTCCTTTCCTTTTTACCATTGTAGGTGATGCCCTCAGTAGATCGGCC
CATTACTGCTTGGAAAAGAAAGTCCTTAAAGGCATGCTCGTGGGCAGGGACCAAACTGAGGTGTCCATGTTGCAATACGCTGATGACACAATCATTTTTAGTGCATATGA
GGAGTCTGTTATGTTGAACTGGTGGGAGATTTTGATGCTTGTTTTGAAGGGATCGGGGCTGTCTTTGAACCTCGCCAAAACCTCGATTATTGGGATCAACACTCCTTCGG
AGGACATGACTAGGTGGGCCAACAAATTTGGGTGCAAGGTTGAGACTCTCCCAATAAAGAATCTAGGTTATCCCCTGGGTGGGAACCATCGCACTAGAACGTTTTGGGAC
CCCATTGTTGATAAATATGAAGCTAAACTTGAAGGTTGGAAGAAGTTGCTGCTCTCTAAAGGGGGAAGGGTCACTTTGGCCCAGTCGGTCCTTAATAGCCTCCCCATATA
CTACTTCTCCCTCCTTAAAGCCCCGACGATGGTGATTAAGAAACTAGAAAAGCTTACGAGGGACTTTATTTGGAATGGTGGGCAGTTCAAACCCGCTTGCAATCTTGTCA
AATGGGAATGGACAGCTTTGCCTACTTCCCATGGAGGCATTGGGGTTGGTGCTTTAGGTTCGCGGAACAACGCCCTTCTTACCAAGTGGCTTTGGAGATTTGCCCATGAG
AAGGAAGGCCTATGGAGGAGGGTCATTGGCAGTATCTATGGGGTGGATAGTCTTGGCTGGAGAACTATGGTGATTACCCATATAAAAGGTAGAAGATTATGGCCCAATAT
TCAGCGGAATTATGACTGTTTTGAAAAGTTTACTAAATTCCAGGTGAGCTGTGGGAGAAGAATAAGGTTTTGGGGTGATGCTTGGTGTGATGTGCAACCTCTTCAAGTTA
CTTTCCCGGAAATTTATGAGATTTCCCATATGAAACTTGCTTCCGTGGCAGAGTGTTGGGATGAGAACAACCAAACGTGGAACCTTGGCCTGCGAAGAGGGCTGTTTGAC
CGTGAGCTGAGCAGTTGGGTGGCCCTCATTGGCAAGTTAGACAACATCCAGATGGGTAATGAGATGGACAGAATATCTTGGAAATTGGAAGGATCGGGGCTTTTCTCTAC
CAAATCCTTATTTCGTGCAGCTGTTGGAAAATCTACCAAGATTAACCTGTCTCTTGCGGGAAAGATTTGGAAGCACAAGTCCCCTAAGAAAGTGAAAATCTTCCTTTGGA
GTGTGGCTTACAGAAGTCTAAATACGGATGACAAAGTTCAAAGGAAACTAAAAAATTGGGCTCTATCCCCTCGGGATGTAGACTATGTTTTAAAGAGAGTGAGGATATTG
ATCACATCCTCCTTCATTGTGAGTATGCTAGCAAGACTTGGAACTTCATTGCAGGTCTGTTGGGGATATCTTTCTGCTTACCAAAGAGGGTGGAAGACTGGCTTGAAGAA
GGGCTGCAAGCTTGGAATTTGA
Protein sequenceShow/hide protein sequence
MKLISWNVRGLGSRSKRMLVKDFLSKENPDLVILQETKLQHIDRRIVKSVWSSRHVGWLSLDAIGSAGGILLMWKENCVEVHDSVLGAYSISAECTFHGQIQGWVTGVYG
PCSSSDRKSFLQELADVAGLCHGVWCVSGDFNMVRWVDERLNATRPTKSMRKFNRFISSFDLIDIPMNHGRFTWSRVGERAAASRLDRFLITHQWSNTFKELRLDRLQRP
TSDHFPLALSVGAMRWGPMPFRFENMWLDHPDFRKSVEKWWDEFTPTGWAGFRFMSKLKGLKEQLKEWNKEVFGNLNEKKKTILDQIDVLDQFEEDGSIQPHHIAERLKL
KASLLEITVSDQRRLLQKCKIKWLREGDENSAFFHRWVGSMKSKSFIAALESEEGEFLSAEPDIEKEIIGFFTNLYSKDDSSRFVLDGPNWAPLDAQISARLEMPFKEDE
IFRAIKSLGPMKSPGPDGMTGEFFKNFWNILKPDLVEVFQEFFRNGIINKKTNETYICLIPKKRGASKISDFRPISLVTSLYKIVAKVLAERLKDILPTTISDCQAAFVQ
GRLILDPILVASEVVEEYRCKNEKGVLFKLDFEKAYDKVSWEFLDAILKLKGFGTTWRKWIRGCLTNSNFSIMINGKPRGKVYASRGLRQGDPLSPFLFTIVGDALSRSA
HYCLEKKVLKGMLVGRDQTEVSMLQYADDTIIFSAYEESVMLNWWEILMLVLKGSGLSLNLAKTSIIGINTPSEDMTRWANKFGCKVETLPIKNLGYPLGGNHRTRTFWD
PIVDKYEAKLEGWKKLLLSKGGRVTLAQSVLNSLPIYYFSLLKAPTMVIKKLEKLTRDFIWNGGQFKPACNLVKWEWTALPTSHGGIGVGALGSRNNALLTKWLWRFAHE
KEGLWRRVIGSIYGVDSLGWRTMVITHIKGRRLWPNIQRNYDCFEKFTKFQVSCGRRIRFWGDAWCDVQPLQVTFPEIYEISHMKLASVAECWDENNQTWNLGLRRGLFD
RELSSWVALIGKLDNIQMGNEMDRISWKLEGSGLFSTKSLFRAAVGKSTKINLSLAGKIWKHKSPKKVKIFLWSVAYRSLNTDDKVQRKLKNWALSPRDVDYVLKRVRIL
ITSSFIVSMLARLGTSLQVCWGYLSAYQRGWKTGLKKGCKLGI