; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0031879 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0031879
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr11:17626953..17636185
RNA-Seq ExpressionLag0031879
SyntenyLag0031879
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBN69274.1 TatD related DNase [Prunus dulcis]1.0e-10331.03Show/hide
Query:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV
        +ILD+ L+ANEVVEE R + +KG + K+D EKA+D V W F++ V+  KGF  KW  WI G +    FSI ING+PRG+  ASR LRQGDPLSPFLF LV
Subjt:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV

Query:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD
        S+VL+ +I                           +R +  NL    +S                      D +  +H  +A                 D
Subjt:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD

Query:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ
         +   L+                             G  ++          W  L +   L+C V                  G+ +      I  +   
Subjt:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ

Query:  IESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSI
        IE+L       G  +  W   ++G                LP+            GG  R + FW+ V++ +  +L  W    +SKGGR TLI+AVLSSI
Subjt:  IESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSI

Query:  PTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAF
        P+YYM++F++PIGV  ++E+  R FLW+G +  K  HLVRW +VT  ++ GGLGI ++R +N AL  KWLWRF  E +SLW ++IKSKYG          
Subjt:  PTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAF

Query:  KKWKCCSNPWKDIAK-LASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND----VAWDLKLRRGLFDREMNEWMALSS
             C NPW++I+K   S L+  R + +G G K  FW+D WL +  LK  FP ++ LS  K+  +    N+    + WD   RR L + E+ E + L  
Subjt:  KKWKCCSNPWKDIAK-LASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND----VAWDLKLRRGLFDREMNEWMALSS

Query:  LLNNCRLNNS-DDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETA
        +L N RL  S  D   W+++  G FS KS      +  +V+     + IW+ K P +++FF+W  A+  INT D +Q+R P + +SP+ C+ C  + E  
Subjt:  LLNNCRLNNS-DDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETA

Query:  SHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVF-SDKHMHFAFFCDLVQHTASNWSALDSI
         H+ +HC ++  +W    D    +   P      L+  L   G   KA +L      AI W++W ERN R+F     +      D ++  AS W+++   
Subjt:  SHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVF-SDKHMHFAFFCDLVQHTASNWSALDSI

Query:  FCNYSAALI
        F +Y  + I
Subjt:  FCNYSAALI

KAA0046762.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]7.4e-10229.93Show/hide
Query:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV
        +I D+ L+ANE V+ ++ KK KG+ILKLD+EKAFD ++WDF++FVL+ K F   W +WI G + +  +SI +NGRP+GR+ A+R LRQGDPLSPFLF++ 
Subjt:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV

Query:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD
         + L+ L+  + E  G  K +SF                  N+S    +D                                         + L   D D
Subjt:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD

Query:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ
            +L +   +     G     ++S  +    + +  ++ +  A              S W                     G+S +            
Subjt:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ

Query:  IESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSI
                                               SLP++ +     GV +GG  ++  FW +V E I+ KL  W    ISKGGR TLIK+ LSS+
Subjt:  IESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSI

Query:  PTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAF
        PTY +++F+ P    K IEK +R FLWKG +S +G HL+ W KVT  +  GGLGI  +   N ALL KWLWR+ +EP +LWR++I+ KY   +     + 
Subjt:  PTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAF

Query:  KKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWN--DVAWDLKLRRGLFDREMNEWMALSSLLN
                PW+ I       + N+ + +  G++  FW   W  +  L   +P +F LS+ KD TV+D WN  D  W ++ RR L DRE N+W  +  +L 
Subjt:  KKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWN--DVAWDLKLRRGLFDREMNEWMALSSLLN

Query:  NCRLNNSDDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQ------IWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKE
          R N+      W  D    FS+ S+   F   R + + S   Q      IW+   P ++KFF+W +  + INTM+ VQ R P++ + P+ C+LC +  E
Subjt:  NCRLNNSDDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQ------IWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKE

Query:  TASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGG---LKGKAKVLWSNFSRAIMWHLWKERNARVFS--DKHMHFAFFCDLVQHTASNW
        + +H+ LHC+  K +W++  + FS+     +     L +V             KV++     A+ W +W ERN R+F     H   A   +  +    NW
Subjt:  TASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGG---LKGKAKVLWSNFSRAIMWHLWKERNARVFS--DKHMHFAFFCDLVQHTASNW

Query:  SALDSIFCNYSAALINLQWKAF
         + DS F NYSAA I L    F
Subjt:  SALDSIFCNYSAALINLQWKAF

KAA0056839.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.7e-10129.15Show/hide
Query:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV
        +I D+ LIANE ++ ++ +K KG++LKLDLEKAFD++ W F++F+L  K F  KW +WI   + + ++SI +NG P+GR+ A R +RQGDPLSPF+F+L 
Subjt:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV

Query:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD
         + L+ L+  +  KG   K +SF                  N+S    +D               +   + D  R+ + L          ++AL  +++ 
Subjt:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD

Query:  SSAT--HLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVW
        S  T  + + T    N+  GR T Q+ S                                                     FG                 
Subjt:  SSAT--HLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVW

Query:  NQIESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLS
                                             FQ   LP   VN L  GV +GG  R+  FW   +E I  KL  W    ISKGGR TL+KA LS
Subjt:  NQIESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLS

Query:  SIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHL
        S+PTY ++ F+ P+ V KEIEK +R FLW G +  + +HL+ W+  T+P++LGGLGI  ++  N ALL KWLWR+ NE +SLW++ I +KY +       
Subjt:  SIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHL

Query:  AFKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWNDVA--WDLKLRRGLFDREMNEWMALSSL
           +    ++PW  I K     +    +    G+   FW   W  + PL +QFP ++ LS  + ATV+++W+  +  W+++ RR L +RE   W ++   
Subjt:  AFKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWNDVA--WDLKLRRGLFDREMNEWMALSSL

Query:  LNNCRLNNSDDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDS----LSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKE
        L     N       W    S K++V S+  +     ++ +++        +W    P + KFF+WT+ H+ +NTMDK+QKR PS+S++P+ C+ C  S E
Subjt:  LNNCRLNNSDDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDS----LSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKE

Query:  TASHVLLHCDFAKDVWNYF----GDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSDKHMHFAFFCDLVQHTASNWS
          +H+ + C FA+++WN +    G P +    K  C+       L     +    ++  N + A +W +W  RN  +F+DK   +    + +     +WS
Subjt:  TASHVLLHCDFAKDVWNYF----GDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSDKHMHFAFFCDLVQHTASNWS

Query:  ALDSIFCNYSAALINLQWKA
        +      NYS A I L  KA
Subjt:  ALDSIFCNYSAALINLQWKA

RVW24937.1 putative ribonuclease H protein [Vitis vinifera]1.7e-10130.5Show/hide
Query:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV
        +ILD+ LIANE+V+E R   ++G + K++ EKA+D V WDFL+ VL+ KGF  +W +W++G +    ++I +NG  +G V ASR LRQGDPLSPFLF LV
Subjt:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV

Query:  SEVLNALIQVIHEK--------GGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRV
        ++VL+ +I    E+        G    ++S  Q   D I         F+ SR     T     + FG                                
Subjt:  SEVLNALIQVIHEK--------GGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRV

Query:  ALRAYDQDSSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWI
                       I+    NL +  +              GI    + + ALL+       R  + L CK                        S W 
Subjt:  ALRAYDQDSSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWI

Query:  SICKVWNQIESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTL
                                                         PI     L+ G+ +GG  +  GFWD VVE I ++L  W    +S GGR TL
Subjt:  SICKVWNQIESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTL

Query:  IKAVLSSIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQ
         ++ L+ +P+Y++++F+IP  V  +IE+  R FLW G    K  HLVRWD V  P+ +GGLG+ NI  +N ALL KWLWR+  E  +LW QVI S YG  
Subjt:  IKAVLSSIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQ

Query:  ---FIRHHLAFKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLW---NDVAWDLKLRRGLFDRE
           +  + L     +C   PWK IA++     +  +Y +G G +  FW+D W  D PL IQ+P +FR+ + K+ ++  +        W+L  RR L D E
Subjt:  ---FIRHHLAFKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLW---NDVAWDLKLRRGLFDRE

Query:  MNEWMALSSLLNNCRLNNS-DDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICM
        + +   L   +++  L++S  D   W L  SG FSVKS            ++  S  +W  + P +VK F+W VAHK +NT D +Q R P  ++SP+IC+
Subjt:  MNEWMALSSLLNNCRLNNS-DDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICM

Query:  LCHRSKETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSDKHMHFAFFCDLVQHTAS
        LC +  E+A H+ LHC     +W+       +    P  +   +     G G   +  VLW   S A++  +W ERNAR+F DK  +  F  D +   AS
Subjt:  LCHRSKETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSDKHMHFAFFCDLVQHTAS

Query:  NWSALDSIFCNYSAALINLQWKA
         W+     F      +I L WKA
Subjt:  NWSALDSIFCNYSAALINLQWKA

VVA39726.1 Hypothetical predicted protein, partial [Prunus dulcis]9.7e-10231.02Show/hide
Query:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV
        +ILD+ALIANEVVEE R   K G + K+DLEKA+D V W F++ VL  KGF  +W  WI G +    FS+ INGRPRG+  ASR LRQGDPLSPFLF LV
Subjt:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV

Query:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD
         +VL+ +++                                   +A+ +D F                                 G  P           
Subjt:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD

Query:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIH-----GQNRFNWHTFGKVGLSLRSPWIS-I
                                      NG + I  L+  +  +       ++ + +  W  +++ +         + N      VG++L    ++ +
Subjt:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIH-----GQNRFNWHTFGKVGLSLRSPWIS-I

Query:  CKVWNQIESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIK
           W             G  +  W              P L+               G+ +GG  R I FWD VVE + N+L  W    +SKGGR T+I+
Subjt:  CKVWNQIESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIK

Query:  AVLSSIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFI
        AVL SIP YYM++FRIPIGV   IEK  R FLW+G D  K +H V W+ V   +  GGLG+ ++R++++ L  KWLWRF NEP +LW +VI+S YG    
Subjt:  AVLSSIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFI

Query:  RHHLAFKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND----VAWDLKLRRGLFDREMNEW
                   C +PW+DI+   ++      + +G G +  FW+D W     LK  FP +F LS  ++  +    +     ++WD   RR L + E+ E 
Subjt:  RHHLAFKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND----VAWDLKLRRGLFDREMNEW

Query:  MALSSLLNNCRLNNSD-DVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHR
          L  LL   RL  S  D   WKLD SG F+  S     Q           TQIW+ K P +VK F+W      +NT D +Q+R P L ISP+ C LC++
Subjt:  MALSSLLNNCRLNNSD-DVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHR

Query:  SKETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSD-KHMHFAFFCDLVQHTASNWS
        + ++  H+LLHC F+  +W       +     P       +  +   G   KAK+LW +  +A++W+LW ERN R+F D K +  A   D V+  A+ W+
Subjt:  SKETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSD-KHMHFAFFCDLVQHTASNWS

Query:  ALDSIF
        +    F
Subjt:  ALDSIF

TrEMBL top hitse value%identityAlignment
A0A5H2XQW2 TatD related DNase5.0e-10431.03Show/hide
Query:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV
        +ILD+ L+ANEVVEE R + +KG + K+D EKA+D V W F++ V+  KGF  KW  WI G +    FSI ING+PRG+  ASR LRQGDPLSPFLF LV
Subjt:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV

Query:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD
        S+VL+ +I                           +R +  NL    +S                      D +  +H  +A                 D
Subjt:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD

Query:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ
         +   L+                             G  ++          W  L +   L+C V                  G+ +      I  +   
Subjt:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ

Query:  IESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSI
        IE+L       G  +  W   ++G                LP+            GG  R + FW+ V++ +  +L  W    +SKGGR TLI+AVLSSI
Subjt:  IESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSI

Query:  PTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAF
        P+YYM++F++PIGV  ++E+  R FLW+G +  K  HLVRW +VT  ++ GGLGI ++R +N AL  KWLWRF  E +SLW ++IKSKYG          
Subjt:  PTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAF

Query:  KKWKCCSNPWKDIAK-LASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND----VAWDLKLRRGLFDREMNEWMALSS
             C NPW++I+K   S L+  R + +G G K  FW+D WL +  LK  FP ++ LS  K+  +    N+    + WD   RR L + E+ E + L  
Subjt:  KKWKCCSNPWKDIAK-LASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND----VAWDLKLRRGLFDREMNEWMALSS

Query:  LLNNCRLNNS-DDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETA
        +L N RL  S  D   W+++  G FS KS      +  +V+     + IW+ K P +++FF+W  A+  INT D +Q+R P + +SP+ C+ C  + E  
Subjt:  LLNNCRLNNS-DDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETA

Query:  SHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVF-SDKHMHFAFFCDLVQHTASNWSALDSI
         H+ +HC ++  +W    D    +   P      L+  L   G   KA +L      AI W++W ERN R+F     +      D ++  AS W+++   
Subjt:  SHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVF-SDKHMHFAFFCDLVQHTASNWSALDSI

Query:  FCNYSAALI
        F +Y  + I
Subjt:  FCNYSAALI

A0A803P8A0 Uncharacterized protein1.2e-10231.22Show/hide
Query:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV
        +ILDS L+ANE VE+YR++ KKG++LK+D EKA+DRV W FL+ VL+ KGF  +W +WI G V    FSIF+NGR RG+   SR LRQGDPLSPFLF LV
Subjt:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV

Query:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD
        ++VL  ++    +K    +  S +Q   D I+          LS  + +D                                                  
Subjt:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD

Query:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ
                    D L                                      ++++ DSL  K+VK +     F   +  KV L+ +S  + IC     
Subjt:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ

Query:  IESLAIFKLGNGSRIVFWHDFWIGDLPF-YLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSS
           L+   +  G+ ++      +G  P  YL                     G+ +GG  R   FW+ V++    ++  W    +S+GGR TLI++VLSS
Subjt:  IESLAIFKLGNGSRIVFWHDFWIGDLPF-YLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSS

Query:  IPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLA
        +P YY+++F++P  V KE+EK  R F W+GGD   G HLV WD+V  PR  GGL I  +  +N  LL+KWLWRF  E +SLW +VIKS+YG+        
Subjt:  IPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLA

Query:  FKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND--------VAWDLKLRRGLFDREMNEWM
                 PW DIA L        K+K+G G    FW+D W+    L+ QFP +  LS AK+A++Q++  D         +WD K RR + DRE+    
Subjt:  FKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND--------VAWDLKLRRGLFDREMNEWM

Query:  ALSSLLNNCR-LNNSDDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRS
         L   L + R L+  DD   W  D  G FS KS+   F A +   E S +  +W+ + P++VK F W VA   +N   ++QK+ P L ISP  C+ C  S
Subjt:  ALSSLLNNCR-LNNSDDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRS

Query:  KETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSDKHMHFAFFCDLVQHTASNWSAL
         E  +H+ L C  A+ +W    + F IQ   P  V   L   +   G   ++  LW     +++W +W ERN R F           + ++   + W   
Subjt:  KETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSDKHMHFAFFCDLVQHTASNWSAL

Query:  DSIFCNYSAALINLQWKAFV
           F N S   +   W++ +
Subjt:  DSIFCNYSAALINLQWKAFV

A0A803QEA6 Uncharacterized protein3.3e-10330.75Show/hide
Query:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV
        +ILDS L+ANE VE+YR++ +KG++LK+D EKA+DRV W FL+ VL+ KGF  +W +WI G V    FSIFINGR RG+   SR LRQGDPLSPFLF ++
Subjt:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV

Query:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD
        ++VL  ++    +K    + ++ +Q   D I+          LS  + +D        F ++  +  Q L                              
Subjt:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD

Query:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ
                                                                       KVVK+  G +        KV L+ +S  + IC     
Subjt:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ

Query:  IESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSI
        +   AI     G  +  W   ++                            G+++GG  R   FW+ V++    ++  W    +S+GGR TLI++VLSS+
Subjt:  IESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSI

Query:  PTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAF
        P YY+++F+ P  V KE+EK  R F W+GGD   G HLV WD+V  PR  GGL I  +  +N  LL+KWLWRF  EP+SLW +VIKS+YG+       A 
Subjt:  PTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAF

Query:  KKWKCCS-------NPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND--------VAWDLKLRRGLFDR
          W            PWKDI+ L        K+K+G G +  FW+D W+  + L+ QFP +  +S AK+ ++Q++  D         +WDL  RR + DR
Subjt:  KKWKCCS-------NPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND--------VAWDLKLRRGLFDR

Query:  EMNEWMALSSLLNNCR-LNNSDDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNIC
        E+     L   L + R LN  +D   W  D  G FS KS+   F +     E   +  +W+ + P++VK F W VA   +N   ++QK+ P +SISP  C
Subjt:  EMNEWMALSSLLNNCR-LNNSDDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNIC

Query:  MLCHRSKETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSDKHMHFAFFCDLVQHTA
        + C  S E  +H+ L C  A  +W    + F IQ   PS V   L  V   GG K ++  LW     +++W +W ERN R+F           D ++   
Subjt:  MLCHRSKETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSDKHMHFAFFCDLVQHTA

Query:  SNWSALDSIFCNYSAALINLQWKAFV
        ++W      F N S   +   W+  +
Subjt:  SNWSALDSIFCNYSAALINLQWKAFV

M5WKV4 Reverse transcriptase domain-containing protein (Fragment)1.5e-10331.26Show/hide
Query:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV
        +ILD+ L+ANEVVEE R +K+KG + K+D EKA+D V W+F++ VL  KGF  KW  WI G +    FSI ING+PRG+  ASR LRQGDPLSPFLF LV
Subjt:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV

Query:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD
        S+VL+ +I                           +R +  NL    +S                                               +DQ 
Subjt:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD

Query:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ
           +HL+       L  G+                                W  L +   L+C+V                  G+ +      I  +   
Subjt:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ

Query:  IESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSI
         ++L       G  +  W   ++G                LP+            GG  R + FW+ V++ +  +L  W    +SKGGR TLI+AVLSSI
Subjt:  IESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSI

Query:  PTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAF
        P+YYM++F++PIGV  ++E+  R FLW+G +  K  HLVRW++VT  ++ GGLGI ++R +N AL  KWLWRF  EP+SLW ++IKSKYG          
Subjt:  PTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAF

Query:  KKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND----VAWDLKLRRGLFDREMNEWMALSSL
             C NPW++I+K  +      ++ +G G K  FW+D WL +  LK  FP +  LS  K+ ++    N+    + WD   RR L + E+ E + L  +
Subjt:  KKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND----VAWDLKLRRGLFDREMNEWMALSSL

Query:  LNNCRLNNS-DDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETAS
        L N RL  S  D   W++   G FS KS      +    +    S+ IW+ K P +++FF+W  A+  INT D +Q+R P + +SP+ C+LC  + E   
Subjt:  LNNCRLNNS-DDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETAS

Query:  HVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVF
        H+ +HC ++  +W        ++   P      L+  L   G   +A +L      AI W++W ERN R+F
Subjt:  HVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVF

M5X4S0 Reverse transcriptase domain-containing protein (Fragment)1.1e-10331.57Show/hide
Query:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV
        +ILD+ L+ANEVVEE R +K+KG + K+D EKA+D V W+F++ V+  KGF  KW  WI G +    FSI ING+PRG+  ASR LRQGDPLSPFLF LV
Subjt:  KILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLV

Query:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD
        S+VL+ LI                           +R +  NL    +S                                               +DQ 
Subjt:  SEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALISQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQD

Query:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ
           +HL+       L  G+                                W  L +   L+C V                  G+ +      I  +   
Subjt:  SSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFNWHTFGKVGLSLRSPWISICKVWNQ

Query:  IESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSI
         E L       G  +  W   ++G                LP+            GG  R + FW+ V+E +  +L  W    +SKGGR TLI+AVLSSI
Subjt:  IESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSI

Query:  PTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAF
        P+YYM++F++PIGV  ++E+  R FLW+G +  K  HLVRW++VT  ++ GGLGI ++R +N AL  KWLWRF  E +SLW ++IKSKYG          
Subjt:  PTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAF

Query:  KKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND----VAWDLKLRRGLFDREMNEWMALSSL
             C NPW++I+K  +      ++ +G G K  FW+D WL +  LK  FP +  LS  K+ ++    N+    + WD   RR L + E+ E + L  +
Subjt:  KKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWND----VAWDLKLRRGLFDREMNEWMALSSL

Query:  LNNCRLNNS-DDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETAS
        L N RL  S  D   W+++  G FS KS      +    +    S+ IW+ K P +++FF+W  A+  INT D +Q+R P + +SP+ C+LC  + E   
Subjt:  LNNCRLNNS-DDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETAS

Query:  HVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSD
        H+ LHC ++  +W        ++   P      L+  L   G   +A +L      AI W++W ERN R+F D
Subjt:  HVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSD

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.9e-0736.56Show/hide
Query:  RAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLVSEVLNALIQ
        RAK K   I+ +D EKAFD++   F+   L   G  G +++ I      P  +I +NG+           RQG PLSP LF +V EVL   I+
Subjt:  RAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLVSEVLNALIQ

P08548 LINE-1 reverse transcriptase homolog9.4e-0736.56Show/hide
Query:  RAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLVSEVLNALIQ
        + K K   IL +D EKAFD +   F+   LK  G  G +++ I      P  +I +NG            RQG PLSP LF +V EVL   I+
Subjt:  RAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLVSEVLNALIQ

P0C2F6 Putative ribonuclease H protein At1g657502.5e-4429.06Show/hide
Query:  VVESIRNKLATWNGFPISKGGRTTLIKAVLSSIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLL
        ++E + ++++ W    +S  GR TL KAVLSS+P + M+   +P  +   +++  R FLW      K  HLV+W KV +P+  GGLG+   +S N AL+ 
Subjt:  VVESIRNKLATWNGFPISKGGRTTLIKAVLSSIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLL

Query:  KWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAFKKWKCCSNPWKDIA-KLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQ
        K  WR   E +SLW  V++ KY    IR           S+ W+ IA  L  V+     +  G G +  FW D W++  PL ++     R +       +
Subjt:  KWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAFKKWKCCSNPWKDIA-KLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQ

Query:  DLW-NDVAWDL-KLRRGLFDREMNEWMALSSLLNNCRLNNSDDVGWWKLDRSGKFSVKSSLQVFQARRAVLED--SLSTQIWEGKAPNRVKFFLWTVAHK
        DLW     WD  K+     +    E  A+   L    +  + D   WK  + G+FSV+S+ ++         +  S    +W+ + P RVK FLW V ++
Subjt:  DLW-NDVAWDL-KLRRGLFDREMNEWMALSSLLNNCRLNNSDDVGWWKLDRSGKFSVKSSLQVFQARRAVLED--SLSTQIWEGKAPNRVKFFLWTVAHK

Query:  SINTMDKVQKRYPSLSISPNICMLCHRSKETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERN
        ++ T ++  +R+ S S   N+C +C    E+  HVL  C     +W         QG     +  WL + L  G   G   + WS     I+W  WK R 
Subjt:  SINTMDKVQKRYPSLSISPNICMLCHRSKETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERN

Query:  ARVFSD
          +F +
Subjt:  ARVFSD

P11369 LINE-1 retrotransposable element ORF2 protein2.2e-0836.56Show/hide
Query:  RAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLVSEVLNALIQ
        + K K   I+ LD EKAFD++   F+  VL+  G  G ++  I      P  +I +NG     +      RQG PLSP+LF +V EVL   I+
Subjt:  RAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWVEWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLVSEVLNALIQ

P92555 Uncharacterized mitochondrial protein AtMg012503.2e-0757.45Show/hide
Query:  INGRPRGRVCASRSLRQGDPLSPFLFLLVSEVLNALIQVIHEKGGYP
        ING P+G V  SR LRQGDPLSP+LF+L +EVL+ L +   E+G  P
Subjt:  INGRPRGRVCASRSLRQGDPLSPFLFLLVSEVLNALIQVIHEKGGYP

Arabidopsis top hitse value%identityAlignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.6e-2023.03Show/hide
Query:  GVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLG
        G+ +  K  T   +  +VE IR ++  W    +S  GR  LI +V+ S+  ++M+ FR+P    KEI+     FLW G + +     V W  V TP+D G
Subjt:  GVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISKGGRTTLIKAVLSSIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLG

Query:  GLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAFKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQF
        GLGI +++  N                S W            I  +     W      WK I K  ++     K+ I  G+   FW D W     L I  
Subjt:  GLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHLAFKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQF

Query:  PGMFRLSMAKDATVQDLWNDVAWDLKLRRGLFDREMNEWMALSSLLNNCRLNNSDDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNR
         G  R  +    T+     +   + + RR   D  +     ++ + +    +  D V W       K    +       R   L+ +    +W   A  +
Subjt:  PGMFRLSMAKDATVQDLWNDVAWDLKLRRGLFDREMNEWMALSSLLNNCRLNNSDDVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNR

Query:  VKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETASHVLLHCDFAKDV
             W      + T D++         S   C+LCH   ET  H+   C ++ +V
Subjt:  VKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETASHVLLHCDFAKDV

AT3G25270.1 Ribonuclease H-like superfamily protein2.4e-1026.17Show/hide
Query:  LSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGL
        +  +IW+ K   ++K FLW +   ++ T D +++R+  +   P  C  C +  ET+ H+   C +A+ VW   G P   +       +    E+LL   L
Subjt:  LSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEVLLGGGL

Query:  KGKAKVLWSNFSRAIMWHLWKERNARVFSDKHMHFAFFCDLVQHTASNW
          +   L+ N +  I+W LWK RN  VF  K + +       ++    W
Subjt:  KGKAKVLWSNFSRAIMWHLWKERNARVFSDKHMHFAFFCDLVQHTASNW

AT4G29090.1 Ribonuclease H-like superfamily protein2.1e-3026.32Show/hide
Query:  SIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHL
        ++PTY MA F +P  V K+I      F W+     KG H   WD ++  +  GG+G  +I + N ALL K +WR  + P+SL  +V KS+Y  +     L
Subjt:  SIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHHL

Query:  AFKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNP----LKIQ-FPGMFRLSMAKDATVQDLWNDVA--WDLKLRRGLFDREMNEWM
                S  WK I     +L+   +  +G G   + W   WL   P    L++Q  P     S++    V DL ++    W   +   LF     +  
Subjt:  AFKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNP----LKIQ-FPGMFRLSMAKDATVQDLWNDVA--WDLKLRRGLFDREMNEWM

Query:  ALSSLLNNCRLNNSD--DVGWWKLDRSGKFSVKSSLQVF-------QARRAVLEDSLS---TQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSI
            L+   R       D   W    SG ++VKS   V         + + V E SL+    +IW+ +   +++ FLW     S+     +  R+ S   
Subjt:  ALSSLLNNCRLNNSD--DVGWWKLDRSGKFSVKSSLQVF-------QARRAVLEDSLS---TQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSI

Query:  SPNICMLCHRSKETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEV---LLGGGLKGKAKVLWSNFSRAI---MWHLWKERNARVFSDKHMH
          + C+ C   KET +H+L  C FA+  W     P  + G        W + +   L      G     W   S+ +   +W LWK RN  VF  +  +
Subjt:  SPNICMLCHRSKETASHVLLHCDFAKDVWNYFGDPFSIQGCKPSCVVSWLNEV---LLGGGLKGKAKVLWSNFSRAI---MWHLWKERNARVFSDKHMH

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.0e-0826Show/hide
Query:  SIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKV-TTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHH
        ++P Y M+ FR+   + K++  A   F W   ++ +    V W K+  +  D GGLG  ++   N ALL K  +R  ++P +L  ++++S+Y       H
Subjt:  SIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKV-TTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRHH

Query:  LAFKKWKCCSNP---WKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTD
         +  +    + P   W+ I     +L       IG G     W D W+ D
Subjt:  LAFKKWKCCSNP---WKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTD

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)2.3e-0857.45Show/hide
Query:  INGRPRGRVCASRSLRQGDPLSPFLFLLVSEVLNALIQVIHEKGGYP
        ING P+G V  SR LRQGDPLSP+LF+L +EVL+ L +   E+G  P
Subjt:  INGRPRGRVCASRSLRQGDPLSPFLFLLVSEVLNALIQVIHEKGGYP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTGGGGGCCTCTGTCACTTTGAACATCGTGCAGAAGGCGAGCACCTTATTGAAGCATCATAACAACCTTTGTGAAGAAGATATAGCTGGGGACTCTGTTCTTGA
TGACTTAAAAACTCTTTTCCAGACACCTAACAAGTTTGACTACTCCATTTTTAAGATTCTTGATTCAGCTCTCATTGCCAATGAAGTAGTGGAGGAATATAGAGCTAAGA
AAAAAAAAGGCTGGATTTTGAAGTTGGATCTTGAGAAAGCCTTTGATAGGGTGCATTGGGATTTCCTTGAGTTTGTTCTCAAGCTTAAAGGCTTTTGTGGTAAATGGGTT
GAATGGATAAATGGGTTTGTCCGGGATCCGAAATTTTCTATCTTCATTAATGGTCGGCCAAGAGGGAGAGTTTGTGCTTCTAGAAGCCTTAGACAAGGAGATCCTCTTTC
TCCTTTCCTATTTCTTCTAGTAAGCGAAGTGTTGAATGCCCTTATTCAAGTTATTCATGAGAAAGGTGGTTATCCGAAAAAGATCTCCTTCTGGCAGCCGGTTGTTGATA
AGATTCAAAAGAAGCTTGATAGATGGAGGCGCTTTAATTTGTCTAGAGCAAGGTTGTCAGATACATTTCCTAGAAGTTTCATAACATTTGGTATCAGAGCTGCTTTGATT
TCACAATGTCTTTGGGACCCACTTCGACACACACATAGGCTTTGGGCCGTGTGTCGAGGTGCTTTGCCCTTTAGGGTTGCTCTGCGCGCGTATGACCAGGATAGCTCTGC
CACACATCTTGAGATTACTCGATGTGTGGATAATCTGGGGCGGGGGCGTGTCACACAACAAGTTCGTTCTAAACCTCTTTCAAATGGAGGTCTCGGTATTGGTAGCTTGA
AACATAGGAATTTGGCTCTTCTTGCTAAGTGGGGTTGGAGATATTTGAGGGAACCGGATTCGTTGTGGTGCAAAGTTGTAAAGAGTATTCATGGGCAGAATCGTTTTAAT
TGGCATACTTTCGGTAAGGTTGGCTTGAGTCTTCGAAGTCCCTGGATTAGTATATGTAAGGTATGGAATCAAATTGAGAGTTTGGCTATTTTTAAACTCGGAAATGGCTC
TCGAATAGTTTTTTGGCATGACTTTTGGATTGGAGATCTTCCTTTTTATTTAAAATTTCCAAGATTATTTCAAATTGCTTCTCTTCCAATTGCTTCCGTTAATGATCTTT
GGGATGGGGTCACTGTGGGGGGAAAGACAAGGACAATTGGCTTTTGGGATTCAGTGGTTGAAAGCATTAGAAATAAATTGGCAACTTGGAATGGTTTTCCTATCTCCAAG
GGTGGTAGAACGACCCTTATCAAAGCGGTACTATCCAGCATTCCGACGTACTATATGGCTATCTTCAGAATTCCTATTGGGGTGAATAAGGAGATCGAAAAGGCTTTTAG
AAGATTCCTATGGAAGGGTGGAGACAGTGACAAAGGTTCTCATCTAGTTAGATGGGATAAGGTGACCACTCCAAGAGACCTTGGGGGGCTTGGTATTGACAATATTCGAT
CAAAAAACTCAGCCCTTTTGTTGAAATGGCTATGGCGATTTGAAAACGAGCCTGATTCTTTGTGGAGACAGGTGATAAAGAGCAAGTATGGGGAGCAATTCATTAGACAT
CACTTAGCTTTTAAGAAATGGAAGTGTTGTAGCAACCCATGGAAGGATATTGCGAAGCTGGCCAGCGTCCTTAAGGTGAACAGAAAGTATAAAATTGGCAGGGGAAATAA
GGATCTCTTTTGGGATGATTGCTGGCTCACTGACAACCCTCTTAAAATCCAATTTCCTGGTATGTTCAGACTCTCGATGGCAAAAGATGCTACTGTCCAGGACTTATGGA
ATGATGTTGCGTGGGATTTGAAATTGAGAAGGGGGCTTTTTGACAGAGAAATGAATGAATGGATGGCCTTGTCCTCCCTCCTTAACAATTGCAGATTAAACAACAGTGAT
GATGTGGGTTGGTGGAAACTAGATAGGTCGGGGAAGTTCTCAGTTAAATCCTCATTACAAGTCTTTCAAGCTAGGAGGGCAGTGCTAGAAGACAGCTTATCAACCCAAAT
ATGGGAAGGAAAAGCCCCAAATAGGGTGAAATTCTTCTTGTGGACGGTGGCTCATAAAAGCATCAACACGATGGACAAGGTGCAAAAGAGATATCCTAGTCTCTCTATCT
CCCCAAATATATGTATGTTATGCCATAGAAGCAAGGAGACAGCCTCCCACGTGCTGCTTCATTGTGATTTCGCTAAAGATGTGTGGAATTATTTTGGGGATCCCTTCAGT
ATTCAGGGGTGCAAGCCTAGTTGTGTGGTGAGTTGGTTAAATGAAGTGTTATTAGGAGGAGGGCTTAAAGGAAAAGCCAAAGTCCTATGGAGCAATTTCTCTAGAGCAAT
TATGTGGCATCTTTGGAAGGAAAGAAATGCAAGAGTGTTCTCTGATAAGCATATGCATTTTGCTTTTTTTTGTGATCTTGTACAGCATACGGCCTCGAATTGGAGCGCCT
TAGATAGTATTTTTTGTAATTACTCTGCTGCTTTGATTAACCTTCAATGGAAGGCTTTTGTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTGGGGGCCTCTGTCACTTTGAACATCGTGCAGAAGGCGAGCACCTTATTGAAGCATCATAACAACCTTTGTGAAGAAGATATAGCTGGGGACTCTGTTCTTGA
TGACTTAAAAACTCTTTTCCAGACACCTAACAAGTTTGACTACTCCATTTTTAAGATTCTTGATTCAGCTCTCATTGCCAATGAAGTAGTGGAGGAATATAGAGCTAAGA
AAAAAAAAGGCTGGATTTTGAAGTTGGATCTTGAGAAAGCCTTTGATAGGGTGCATTGGGATTTCCTTGAGTTTGTTCTCAAGCTTAAAGGCTTTTGTGGTAAATGGGTT
GAATGGATAAATGGGTTTGTCCGGGATCCGAAATTTTCTATCTTCATTAATGGTCGGCCAAGAGGGAGAGTTTGTGCTTCTAGAAGCCTTAGACAAGGAGATCCTCTTTC
TCCTTTCCTATTTCTTCTAGTAAGCGAAGTGTTGAATGCCCTTATTCAAGTTATTCATGAGAAAGGTGGTTATCCGAAAAAGATCTCCTTCTGGCAGCCGGTTGTTGATA
AGATTCAAAAGAAGCTTGATAGATGGAGGCGCTTTAATTTGTCTAGAGCAAGGTTGTCAGATACATTTCCTAGAAGTTTCATAACATTTGGTATCAGAGCTGCTTTGATT
TCACAATGTCTTTGGGACCCACTTCGACACACACATAGGCTTTGGGCCGTGTGTCGAGGTGCTTTGCCCTTTAGGGTTGCTCTGCGCGCGTATGACCAGGATAGCTCTGC
CACACATCTTGAGATTACTCGATGTGTGGATAATCTGGGGCGGGGGCGTGTCACACAACAAGTTCGTTCTAAACCTCTTTCAAATGGAGGTCTCGGTATTGGTAGCTTGA
AACATAGGAATTTGGCTCTTCTTGCTAAGTGGGGTTGGAGATATTTGAGGGAACCGGATTCGTTGTGGTGCAAAGTTGTAAAGAGTATTCATGGGCAGAATCGTTTTAAT
TGGCATACTTTCGGTAAGGTTGGCTTGAGTCTTCGAAGTCCCTGGATTAGTATATGTAAGGTATGGAATCAAATTGAGAGTTTGGCTATTTTTAAACTCGGAAATGGCTC
TCGAATAGTTTTTTGGCATGACTTTTGGATTGGAGATCTTCCTTTTTATTTAAAATTTCCAAGATTATTTCAAATTGCTTCTCTTCCAATTGCTTCCGTTAATGATCTTT
GGGATGGGGTCACTGTGGGGGGAAAGACAAGGACAATTGGCTTTTGGGATTCAGTGGTTGAAAGCATTAGAAATAAATTGGCAACTTGGAATGGTTTTCCTATCTCCAAG
GGTGGTAGAACGACCCTTATCAAAGCGGTACTATCCAGCATTCCGACGTACTATATGGCTATCTTCAGAATTCCTATTGGGGTGAATAAGGAGATCGAAAAGGCTTTTAG
AAGATTCCTATGGAAGGGTGGAGACAGTGACAAAGGTTCTCATCTAGTTAGATGGGATAAGGTGACCACTCCAAGAGACCTTGGGGGGCTTGGTATTGACAATATTCGAT
CAAAAAACTCAGCCCTTTTGTTGAAATGGCTATGGCGATTTGAAAACGAGCCTGATTCTTTGTGGAGACAGGTGATAAAGAGCAAGTATGGGGAGCAATTCATTAGACAT
CACTTAGCTTTTAAGAAATGGAAGTGTTGTAGCAACCCATGGAAGGATATTGCGAAGCTGGCCAGCGTCCTTAAGGTGAACAGAAAGTATAAAATTGGCAGGGGAAATAA
GGATCTCTTTTGGGATGATTGCTGGCTCACTGACAACCCTCTTAAAATCCAATTTCCTGGTATGTTCAGACTCTCGATGGCAAAAGATGCTACTGTCCAGGACTTATGGA
ATGATGTTGCGTGGGATTTGAAATTGAGAAGGGGGCTTTTTGACAGAGAAATGAATGAATGGATGGCCTTGTCCTCCCTCCTTAACAATTGCAGATTAAACAACAGTGAT
GATGTGGGTTGGTGGAAACTAGATAGGTCGGGGAAGTTCTCAGTTAAATCCTCATTACAAGTCTTTCAAGCTAGGAGGGCAGTGCTAGAAGACAGCTTATCAACCCAAAT
ATGGGAAGGAAAAGCCCCAAATAGGGTGAAATTCTTCTTGTGGACGGTGGCTCATAAAAGCATCAACACGATGGACAAGGTGCAAAAGAGATATCCTAGTCTCTCTATCT
CCCCAAATATATGTATGTTATGCCATAGAAGCAAGGAGACAGCCTCCCACGTGCTGCTTCATTGTGATTTCGCTAAAGATGTGTGGAATTATTTTGGGGATCCCTTCAGT
ATTCAGGGGTGCAAGCCTAGTTGTGTGGTGAGTTGGTTAAATGAAGTGTTATTAGGAGGAGGGCTTAAAGGAAAAGCCAAAGTCCTATGGAGCAATTTCTCTAGAGCAAT
TATGTGGCATCTTTGGAAGGAAAGAAATGCAAGAGTGTTCTCTGATAAGCATATGCATTTTGCTTTTTTTTGTGATCTTGTACAGCATACGGCCTCGAATTGGAGCGCCT
TAGATAGTATTTTTTGTAATTACTCTGCTGCTTTGATTAACCTTCAATGGAAGGCTTTTGTGTAG
Protein sequenceShow/hide protein sequence
MKVGASVTLNIVQKASTLLKHHNNLCEEDIAGDSVLDDLKTLFQTPNKFDYSIFKILDSALIANEVVEEYRAKKKKGWILKLDLEKAFDRVHWDFLEFVLKLKGFCGKWV
EWINGFVRDPKFSIFINGRPRGRVCASRSLRQGDPLSPFLFLLVSEVLNALIQVIHEKGGYPKKISFWQPVVDKIQKKLDRWRRFNLSRARLSDTFPRSFITFGIRAALI
SQCLWDPLRHTHRLWAVCRGALPFRVALRAYDQDSSATHLEITRCVDNLGRGRVTQQVRSKPLSNGGLGIGSLKHRNLALLAKWGWRYLREPDSLWCKVVKSIHGQNRFN
WHTFGKVGLSLRSPWISICKVWNQIESLAIFKLGNGSRIVFWHDFWIGDLPFYLKFPRLFQIASLPIASVNDLWDGVTVGGKTRTIGFWDSVVESIRNKLATWNGFPISK
GGRTTLIKAVLSSIPTYYMAIFRIPIGVNKEIEKAFRRFLWKGGDSDKGSHLVRWDKVTTPRDLGGLGIDNIRSKNSALLLKWLWRFENEPDSLWRQVIKSKYGEQFIRH
HLAFKKWKCCSNPWKDIAKLASVLKVNRKYKIGRGNKDLFWDDCWLTDNPLKIQFPGMFRLSMAKDATVQDLWNDVAWDLKLRRGLFDREMNEWMALSSLLNNCRLNNSD
DVGWWKLDRSGKFSVKSSLQVFQARRAVLEDSLSTQIWEGKAPNRVKFFLWTVAHKSINTMDKVQKRYPSLSISPNICMLCHRSKETASHVLLHCDFAKDVWNYFGDPFS
IQGCKPSCVVSWLNEVLLGGGLKGKAKVLWSNFSRAIMWHLWKERNARVFSDKHMHFAFFCDLVQHTASNWSALDSIFCNYSAALINLQWKAFV