CuGenDBv2

Lag0002503 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0002503
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr4:43400823..43405019
RNA-Seq Expression: Lag0002503
Synteny: Lag0002503
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR026960 - Reverse transcriptase zinc-binding domain
    IPR036397 - Ribonuclease H superfamily
    IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
    IPR043502 - DNA/RNA polymerase superfamily
    IPR044730 - Ribonuclease H-like domain, plant type
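
The genome location listed for this gene uses a `chrom:start..end` form. The following is a minimal sketch of how such a string might be parsed and the span length computed. It assumes the coordinates are 1-based and inclusive, which this page does not state, and the helper names `parse_location` and `span_length` are hypothetical, not part of any CuGenDB tool.

```python
import re

# Hypothetical helper pattern for a "chrom:start..end" location string.
_LOCATION = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location such as 'chr4:43400823..43405019' into its parts."""
    match = _LOCATION.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr4:43400823..43405019")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the interval for Lag0002503 covers 4,197 bp; with a half-open convention the figure would be one less.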


Homology
GenBank top hits (e-value, % identity, alignment)
XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber] (e-value: 6.7e-294, identity: 39.96%)
Query:  ERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDS--DGSWRFTGIYGNPQRDKHHETWTLLDRLRDNSGMPWVIGGDFNEITS
        ER++N + F  GLIVP  G+SGG+ LLW +EI++ ++SY++ HIDA+IS++  D  WR TG YG+P+  K +++W LL  L     +PW+  GDFNEI S
Subjt:  ERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDS--DGSWRFTGIYGNPQRDKHHETWTLLDRLRDNSGMPWVIGGDFNEITS

Query:  NSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHRPLLAVWLKDQKSQRKKGM
         +EK GG  R++  M  FR  ++ C  HD G+ GP+YTW N      RI  RLDR L   +        KV HL  +  DH  LL     +    R +  
Subjt:  NSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHRPLLAVWLKDQKSQRKKGM

Query:  RYPRRFEEGWVKYDDCRKIIDQSWK-EMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQELSSYK-GHSVMREIEHKEKELENLLEDD
        R+   FE  W K +DC+ II+ SW   +D    + + E  ++C   LS+WS + Y G I   I  K   +  L+  +    +  EI    +E+  LL+D+
Subjt:  RYPRRFEEGWVKYDDCRKIIDQSWK-EMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQELSSYK-GHSVMREIEHKEKELENLLEDD

Query:  EVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQEQNRRLLKKFTR
        E YW QRA+  W+  GDRNTK+FH +A+ RRK N I G+ D+ G W +++E + + A  YF N+++SS+P+   IE + +AIP  +T+E N  L+++FT+
Subjt:  EVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQEQNRRLLKKFTR

Query:  EDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIAKTLANRMRTVLD
        E+V   LK +HP+KAPGPDG+ A+F+QKYW IVGN++ D+ L VLN   PI +LNKT I+LIPKT +P  M DFRPISLC+V+YK+I+K LANR++ +L 
Subjt:  EDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIAKTLANRMRTVLD

Query:  TIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLNGIPRASFTPN
         IIS +QSAF   RLI+DN ++ FE +H +  K  GKEG  A+KLDMSKA+DRVEW +I KVM +MGF N W D +M C+ SVS+ IL+NG+   +  P+
Subjt:  TIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLNGIPRASFTPN

Query:  RGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSSFMVSPNTKAS
        RGLRQGDPLSP LFL+CAEGLS+ +N++ + +  +G+ IN  CP ++HLF+ADDS+LF KA  ++C  +++IL +YE+ASGQ IN +KSS   SPNT   
Subjt:  RGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSSFMVSPNTKAS

Query:  HVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNELNAICARFWWGAT
          D I  +LG        +YLGLPS +GRSK ++F  +K++V   L GWKGK  S  GKEILIK+VAQAIP Y MSCF  P  LC+++  +   FWWG  
Subjt:  HVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNELNAICARFWWGAT

Query:  DKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRDLFKKGYRWRIGNGL
        ++  K+ W SW R+C+ K  GG+GFR+LK FN A+LAKQ+WRI+  P SL+ RVL+ RYF TG  LNA LG++PSY+WRSI    ++ ++G RWR+GNG 
Subjt:  DKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRDLFKKGYRWRIGNGL

Query:  QVEASKEPWIPKEGSCKPILIHPDVQTF---TVAQFID-DQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSAYRLGFHLQDM
        Q+   ++ W+P   + K  +I P +  F    V+  ID D   W  + +++ F+  + +TIL IP++  + ED++IW  + KG FSVKSAY +   + D 
Subjt:  QVEASKEPWIPKEGSCKPILIHPDVQTF---TVAQFID-DQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSAYRLGFHLQDM

Query:  TE-ASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDYLCFGGRGDGN
         E    S       LWK  W   +P KIK+  WR   D LPT  N++ RG+  + TC +C    E   H    C+    +W F+    DY       +G+
Subjt:  TE-ASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDYLCFGGRGDGN

Query:  QQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYGPVESSDRGVA
          D+      +  S+  +        ++ W IW++RN+++H+       ++       + +FK                                    A
Subjt:  QQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYGPVESSDRGVA

Query:  RGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISVLP-PKLVIELDSVQVV
          L    PR S    RW  P    + +N D   + Q     IG I+R  +G  V A  K +   +    +EALAL +G+     L   ++++E D++ V+
Subjt:  RGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISVLP-PKLVIELDSVQVV

Query:  HLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSW
          L +      ELG  +   + +  +++     H+ R  N +AH+LA+ A    SS  W
Subjt:  HLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSW

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa] (e-value: 4.1e-275, identity: 38.61%)
Query:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDG-SWRFTGIYGNPQRDKHHETWTLLDRLRDNSGM-PWV
        METKL      R +N L F  GL V   G  GGLMLLWQ  +DV + S +  + D  I   DG  W F+ +YG P+      TW L+ RL D S + PW+
Subjt:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDG-SWRFTGIYGNPQRDKHHETWTLLDRLRDNSGM-PWV

Query:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDF---KVFHLPFTASDHRPLLAV
        + GD NEI SN  K GG  R +  MQ FR+++D C L +   +G E+TW  N      + ERLD   IN    +W  +F   K+ HL +  SDHR LLA 
Subjt:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDF---KVFHLPFTASDHRPLLAV

Query:  --WLKDQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLG-GKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQEL--SSYKGHSVMR
          +     +Q  +  R+  RFE+ W+K  +C +II  SW    +      +    +VC + L +W   K+ G ++  I   +K +  L  S+        
Subjt:  --WLKDQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLG-GKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQEL--SSYKGHSVMR

Query:  EIEHKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPT
        ++   E  L+ LL ++E YW+QR+R +W+  GDRNTK+FH +A+ R   NRI+ L DD GN +   EG+ +V + YFQ LFT+SN +  A+  +L  IPT
Subjt:  EIEHKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPT

Query:  SITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIY
        +I+ EQN  L + FT  +V   LK +   K+PG DG+ A+FY   W+IVG  +  + L VLN     +  NKT I LIPK K P  MKDFRPISLC+V Y
Subjt:  SITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIY

Query:  KIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVS
        KII+K LA R + VL ++IS +QSAF+  RLI+DN ++ FE +H++K + +G +G AALKLDMSKA+DRVEW ++  VMGKMGF       IM+C+ + S
Subjt:  KIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVS

Query:  FQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTI
        F  L+NG    S  P RGLRQGDPLSPYLFLIC+EGLS  L   EQ     GL ++ + PSI+HL +ADDSLLF +A ++ C SIK  L+ Y +ASGQ +
Subjt:  FQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTI

Query:  NFEKSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISL
        N +KS    SPNT  +   + +++LG+        YLGLP+   R K ++FNNIK+R+WK +  W  K FS  GKE+L+K+V QAIP YAMSCFR     
Subjt:  NFEKSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISL

Query:  CNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWG
        C ++  + ARFWWG++   +KIHW++W  LC+ K  GG+GFR    FNQA LAKQ+WRI + P SLL+RVL+GRY+    ++ A +    S TW+ IVWG
Subjt:  CNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWG

Query:  RDLFKKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVK
        R+L  KG   +IG+G  V  + + WIP     KP+          VA +I D   W+ +L+   F   D   IL IP++     D   W  D  G ++VK
Subjt:  RDLFKKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVK

Query:  SAYRLGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVD
        S Y L   L++   +SSST +  E+ W+ FW   +P K+++ GWRV N  LP   NL HR + T+ TC LC    E+  H  + C   K +W      +D
Subjt:  SAYRLGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVD

Query:  YLCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGY
        +       DG+    L        S       + K     W IW  RN  IH K      +LK+ +        I    E+YL     V   ++P  +  
Subjt:  YLCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGY

Query:  GPVESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGL---KSISVLP
                            A     +W+ P+     +N DA  ++  N  GIG I+R   G  + A  K +  N++   +EA A+  GL   K + + P
Subjt:  GPVESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGL---KSISVLP

Query:  PKLVIELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFPSWLLEL
            +E D + +VH L+ K+  L+     V++  + LS++ +  ISH+ R  N  AH LA++A  L++   W +  PS +  +
Subjt:  PKLVIELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFPSWLLEL

XP_030498122.1 uncharacterized protein LOC115713779 [Cannabis sativa] (e-value: 3.1e-275, identity: 38.19%)
Query:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDG-SWRFTGIYGNPQRDKHHETWTLLDRLRDNS-GMPWV
        METKL+  N ++ R  L F  G+ VP QGQ GGLMLLW+  + ++I +YS  HID  +   DG S  FTG YG+P   + H TWTLL R  D +   PW 
Subjt:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDG-SWRFTGIYGNPQRDKHHETWTLLDRLRDNS-GMPWV

Query:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDF---KVFHLPFTASDHRPLLAV
        + GDFNEI S+ +K+GG  R++  ++ FR+ +D+C+L    F G   TW N    +  + ERLD   IN +   W   F    + HL F  SDHR L A 
Subjt:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDF---KVFHLPFTASDHRPLLAV

Query:  WLKDQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQEL--SSYKGHSVMREIE
                     +   RFE+ W++ + C  II  +W   D      +      C S L +W  S + G++R  I    K +  L  SS         ++
Subjt:  WLKDQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQEL--SSYKGHSVMREIE

Query:  HKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSIT
        + E+ L++LL  +E YW QRAR  W+  GD NTK+FH RA +R   NRI+ LKD  GN    +  +  + S YFQ++FTS   +  AI  IL+ +PT + 
Subjt:  HKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSIT

Query:  QEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKII
        +     +   FT  +V+  L +M   K+PG DG+  +F+  YW+IVG+ + +  L+VLN        N T I LIPK K P  +  +RPISLC+V+YK++
Subjt:  QEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKII

Query:  AKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQI
        +K++  R++  L  +IS  QSAF+  RLI+DN +I FE LH++K++++G +G AA+KLDMSKA+DRVEW YI ++M KMGF     + I+ C++SVS+  
Subjt:  AKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQI

Query:  LLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFE
        LLNG       P+RG+RQGDPLSPYLFLICAEGLS  L   E   +  GLR++   PS+SHLF+ADDS+LF +A  +  RSI+ +L+ Y +ASGQ +N +
Subjt:  LLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFE

Query:  KSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNE
        K     SPNT+  H +  +++L +  Q    +YLGLPS  GR K  +F+ I D++WK L  WK + FSA GKE+L+K+V QAIP YAMSCFR P +LC++
Subjt:  KSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNE

Query:  LNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRDL
        + ++ A FWWG+T  G  IHW++W  LC  K QGG+GFR+   FNQALLAKQ+WR++  P SLL+R+L  RYF  GS L+A LGN PS TWRSIVWG++L
Subjt:  LNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRDL

Query:  FKKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSAY
          KG RWR+G G Q+    +PW+P      P        +  VA  I+    W+   V A+F + D   IL+IP++   K D +IW+    G +SVKS Y
Subjt:  FKKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSAY

Query:  RLGFHLQDMTE-ASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDYL
             L +  +  SS+TY+H    W  FWK  +P K+++  W+V++++LP  + LN R +  +P C LC+ + ET  H  + C   K +W     S   L
Subjt:  RLGFHLQDMTE-ASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDYL

Query:  CFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYGP
         F      + ++ L      T +  F+     + L +CW IW  RN   H K       +K+   QY+ +++    + +             P  A   P
Subjt:  CFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYGP

Query:  VESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSI-SVLPPKLV
          +S              A+ ++  W  P    + LN+DA ++      GIG +LR   G    A  K I  +++   +EA ALV  L+ + S+      
Subjt:  VESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSI-SVLPPKLV

Query:  IELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDS
        IE DS+ VV  L+      +     +++  +L+S +    +SH+ R  N  AH LA+ A ++++  S
Subjt:  IELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDS

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa] (e-value: 3.6e-279, identity: 37.61%)
Query:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDG-SWRFTGIYGNPQRDKHHETWTLLDRLRDNSGM-PWV
        +E+KLK  + ++VR+ L+F  G+ VP QG  GGLMLLW+  + V+I ++S  HID  I  +DG S+ FTG YG+P   + H TWT+L R  D + + PW+
Subjt:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDG-SWRFTGIYGNPQRDKHHETWTLLDRLRDNSGM-PWV

Query:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHRPL--LAVW
        + GDFNEI S+ +K+GG  R    ++ FR+++  C L+   F G   TW +    +  + ERLD   +N   +   Q   V HL +  SDHR +  L   
Subjt:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHRPL--LAVW

Query:  LKDQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQELSSYKGHSV--MREIEH
        L +Q        R+  RFE+ W++ D C  +I  +W          V        S+L  W  S + G ++  I   +K +  L + + +    ++ ++ 
Subjt:  LKDQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQELSSYKGHSV--MREIEH

Query:  KEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQ
         E+ L+ LL  +E YW QR+R  W+  GD NTK+FH  AT+RRK N+IR L D +GN   +   +  + S Y+ +LFTS   + E++++ILD+IP+++  
Subjt:  KEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQ

Query:  EQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIA
             +   FT  DV+  LK M   K+PG DG+  +FY  YW IVG  +    L VLN        N T + LIPK K P  +  +RPISLC+V+YK+++
Subjt:  EQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIA

Query:  KTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQIL
        K +  R++  L  +IS  QSAF+  RLI+DN ++ FE LH++K++++G +G AA+KLDMSKA+DRVEW ++ +VM KMGF     + I+ C++SVS+  L
Subjt:  KTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQIL

Query:  LNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEK
        LNG  +    P+RG+RQGDPLSPYLFLICAEGLS  L   E   +  GL+I+   PS+SHLF+ADDS+LF +A ++  R+I   L  Y +ASGQ IN EK
Subjt:  LNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEK

Query:  SSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNEL
             S NT+       K++LG+  Q    QYLGLPS  G++KK++F  I D++WK L  WK   FSA GKE+L+K+V QAIP YAMSCFR P++LC+++
Subjt:  SSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNEL

Query:  NAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRDLF
         ++ ARFWWG+T  G+ IHW++W  LC  K QGG+GFR+   FNQALLAKQ+WRI+ +P SLL+ +LR RYF  G+YL A LG+NPS TWRS+VWG++L 
Subjt:  NAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRDLF

Query:  KKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSAYR
         KG RWR+G+G ++    + W+P   + KP           VA  I +  +W+   ++ +F + D   +L+IP++P   +D +IW+    G+++VKS Y 
Subjt:  KKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSAYR

Query:  LGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDYLCF
            L +  +++ S    +E  W +FWK  +PPK+++  W+V++  LP    L  R +  +P C +C +  ET  H  + C   K +W     S+D+   
Subjt:  LGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDYLCF

Query:  GGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYGPVE
              +   LL        S    S  +   L++CW IWH RN + H         +      Y+ EF+               A+ + P TA      
Subjt:  GGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYGPVE

Query:  SSDRGVARGLTEEDPRAS-TTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISVLPPKL---
            G A   T   P +      +W  P      LN+DA  + + N  GIG +LR  DG  V A  K  + N++   +EAL L   L  +  L   L   
Subjt:  SSDRGVARGLTEEDPRAS-TTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISVLPPKL---

Query:  VIELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFPSWLLEL
         IE DS+ VV  L+     L+     ++   +L+S +    I H+ R  N  AH LA+ A ++++   W   FPS L+ L
Subjt:  VIELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFPSWLLEL

XP_030509050.1 uncharacterized protein LOC115723712 [Cannabis sativa] (e-value: 1.7e-273, identity: 38.08%)
Query:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSD-GSWRFTGIYGNPQRDKHHETWTLLDRLRDNS-GMPWV
        METKL+  + ++ RN L F  G+ VP  G  GGLMLLW+ E DV I ++S  HID  I   D  S+ FTG YG+PQ  +   TWTLL R  D +   PW+
Subjt:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSD-GSWRFTGIYGNPQRDKHHETWTLLDRLRDNS-GMPWV

Query:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFK---VFHLPFTASDHRPL-LA
        + GDFNE+ + ++K+GG  R    +Q F+ ++D C L    F G   TW N       + ERLD    N     WC  F    + HL +  SDHR L L 
Subjt:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFK---VFHLPFTASDHRPL-LA

Query:  VWLKD-QKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAI--ARKEKEIQELSSYKGHSVMRE
        V L   Q ++     R+  RFE+ W++  +C  II + W               + C S+L  W   K+ G ++  I  A K  E+ + SS         
Subjt:  VWLKD-QKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAI--ARKEKEIQELSSYKGHSVMRE

Query:  IEHKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTS
        ++H EK L+ LL  +E YW QR+R  W+  GD NTK+FH RA  R   N+I+ L  DDG        +      YF ++FTS   + +AI+ +LD IP  
Subjt:  IEHKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTS

Query:  ITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYK
        I++E    L   FT  +V+  LK+M    +PG DG+  +FY  YW IVG+ +    L VLN  G     N+T I LIPK K P  +  +RPISLC+V+YK
Subjt:  ITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYK

Query:  IIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSF
        +++KT+  R++  +  +IS  QSAF+  RLI+DN ++ FE LH++K++++G++G AA+KLDMSKA+DRVEW ++ +VM K+GF     + I+ C++SVS+
Subjt:  IIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSF

Query:  QILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTIN
          LLNG  + S TP RG+RQGDPLSPYLFLIC+EG S  L   E      GL+++   P I+HL +ADDS+LF +A     R+I   L+ Y +ASGQ +N
Subjt:  QILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTIN

Query:  FEKSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLC
         EKS    SPNT+ +     + +L +  Q    QYLGLPS  GR K ++F+ I D++WK +  W+ + FS  GKE+L+K+V QAIP YAMSCF+ P+ LC
Subjt:  FEKSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLC

Query:  NELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGR
        N++  + +RFWWG +  G  IHW++W  LC  K  GGMGFR+   FNQALLAKQ+WRI+  P SL+ARVL+ RYF TG +L A+ G  PS TW+SIVWG+
Subjt:  NELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGR

Query:  DLFKKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKS
        +L  KG RWRIG+G  V    +PWIP   + KP+L     +   VA FI     W+   ++  F   D   IL+IP++    ED ++W     G ++VKS
Subjt:  DLFKKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKS

Query:  AYRLGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDY
         Y+L   + D    SS T    E+ W+ FW   +P KI++  WR Y++ LPT   L +R + ++P C LC+   ET  H F+ C   K +W  +  S+++
Subjt:  AYRLGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDY

Query:  LCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYG
                 +  D L        S +  + ++   L   W IW  RN   H K    + ++      Y+ EF+  + + +  G    V+  +S  +A   
Subjt:  LCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYG

Query:  PVESSDRGVARGLTEE----DPRA--STTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISV
          + S    A     +    DP      T  +WL P +    +N+DA  N      G+  ILR   G+ + A  K +K   +   +EALA+  GLK +  
Subjt:  PVESSDRGVARGLTEE----DPRA--STTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISV

Query:  LPPKL-VIELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFPSWLL
        L   +  IE DS+ VV+ L+     L+   + +++   LLSN+    ISH+ R  NN AH LA+ A +++S  +W +  P  L+
Subjt:  LPPKL-VIELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFPSWLL

TrEMBL top hits (e-value, % identity, alignment)
A0A2N9F6L9 Reverse transcriptase domain-containing protein (e-value: 2.1e-277, identity: 38.53%)
Query:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISD-SDGSWRFTGIYGNPQRDKHHETWTLLDRLRDNSGMPWVI
        MET   ++  E +R +L F   L+V S  + GGL L W  +IDV+I+SYS  HIDA+I+D    +WR TG+YG P+ +  H+TW L+ RL   S + W  
Subjt:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISD-SDGSWRFTGIYGNPQRDKHHETWTLLDRLRDNSGMPWVI

Query:  GGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDF---KVFHLPFTASDHRPLLAVW
         GDFNEI   +E  G   R +R MQ FR+ +D C L D GF G  +TWCNN       W RLDR+++N E   W + F   +V HL    SDH+    +W
Subjt:  GGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDF---KVFHLPFTASDHRPLLAVW

Query:  LKDQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGK--PVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQ--ELSSYKGHSVMREI
        L  +     +  R P RFEE W+    C + I ++W   D+ G     V  K + C  +L  WSR  + G++   I   + E++  E  + +GHS    +
Subjt:  LKDQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGK--PVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQ--ELSSYKGHSVMREI

Query:  EHKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSI
        +     L +L E +E  WRQR+R  W+  GDRNTK+FH RAT R + NRI GLKD+ G   +  EGM  +   Y+ +LFT+  P  + IE ++  +   +
Subjt:  EHKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSI

Query:  TQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKI
        T++ N+ L+++FT  +V   LK M P+KAPGPDG+  +FYQK+W +VG+D+    L  LN    +  +N T+I LIPKTK+P  + +FRPISLC+VIYK+
Subjt:  TQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKI

Query:  IAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQ
        I+K LANR++ +L  I+S SQSAFVPGRLI+DN ++ FE LH +   + G++G  ALKLDMSKAYDRVEW ++ K+M K+GF+  W   +  C+ +VS+ 
Subjt:  IAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQ

Query:  ILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINF
        IL+NG P     P+RGLRQGDPLSPYLFL+CAEGL S + K+       G+ +    P I+HLF+ADDSLLF KA  + C  I++IL++YEKASGQ +N 
Subjt:  ILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINF

Query:  EKSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCN
        +K++   S  T  +  + IK  L V       +YLGLPS VGR++ E F+ IK+RVW+ L+GWK K  S AG+EILIK+VAQAIP Y+MSCFR P  LCN
Subjt:  EKSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCN

Query:  ELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRD
        +L A+  RFWW    + RKIHW SW +LC  K +GG+GFRDL+ FN ALLAKQ WR+I    SL  RV + ++F  GS ++       SY W+SI+  R+
Subjt:  ELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRD

Query:  LFKKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTF-TVAQ-FIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVK
        +  +G  WR+GNG  +    + W+ ++   K +   P++    TV Q  I  Q  W+H L+   F   DA+ I +IP++     D++IW  +  G++SV+
Subjt:  LFKKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTF-TVAQ-FIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVK

Query:  SAYRLGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVD
        S YRL    ++M+    ST   L++ W+S W   IP K ++  W+   + LPT  NL  R +  +PTC +C    E  +H  W CK  + +W      V 
Subjt:  SAYRLGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVD

Query:  YLCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKS---LIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTT
                 G+  DLL        +K    GR  +    ++ICW +W  RN++  H+      ++  K + Y+         E YL         S P  
Subjt:  YLCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKS---LIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTT

Query:  AGYGPVESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLK-SISVL
                             P  S     W+ P +    +N D    +Q N  GIG I+R   G  + +  + ++    V  +EA A    ++ ++ + 
Subjt:  AGYGPVESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLK-SISVL

Query:  PPKLVIELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFP
          +   E DS  VV  L +    L   G+ + +AK +    Q    +H+ R+ N +AH LA KA   NS + W +  P
Subjt:  PPKLVIELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFP

A0A2N9J7Z5 Reverse transcriptase domain-containing protein (e-value: 1.5e-278, identity: 37.98%)
Query:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDGS-WRFTGIYGNPQRDKHHETWTLLDRLRDNSGMPWVI
        +ET   +   E +R  LKF+  L+V ++G+ GGL L WQ E++++IRS+S  HIDAII++ D + WRFTG YG P  +   E+W LL  L     +PW+ 
Subjt:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDGS-WRFTGIYGNPQRDKHHETWTLLDRLRDNSGMPWVI

Query:  GGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHRPLLAVWLKD
         GDFNEIT N EK G LPR E+ M+ FR ++D CEL D G+ G  YTWCNN L T  +W RLDR + +++     Q  +V HL   +SDH PLL  +   
Subjt:  GGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHRPLLAVWLKD

Query:  QKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGKP-VWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQ--ELSSYKGHSVMREIEHKE
             KK    P RFE+ W     C + ++++WK+     G P + +K K+C   LS WS++++ GS+R  +  K  +++  EL S +G     + +  +
Subjt:  QKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGKP-VWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQ--ELSSYKGHSVMREIEHKE

Query:  KELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQEQ
        KE+  L++ DE  W+QR+R +W+  GD+N+++FH +AT R++ NRI  ++D  G    D + +  +   YF NLF +SNPN    E +L+ +   IT+E 
Subjt:  KELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQEQ

Query:  NRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIAKT
        N  LL  FT  +VH  +  M P KAPGPDG+  +FY +YW+ +G ++ +  L  L+       +N T++ LIPK K P  + +FRPISLC+VIYKII+K 
Subjt:  NRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIAKT

Query:  LANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLN
        + NR++ +L  IIS +QSAFVPGRLI+DN ++ FE LH +K+ + G+    ALKLDMSKAYDRVEW ++ K+M KMGFNN W D +M CV +VS+ +L+N
Subjt:  LANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLN

Query:  GIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSS
        G P     P+RGLRQGDPLSPYLFLICAEGL + + ++ Q     G+ +    P I+HLF+ADDSLLF KA  ++C  I+NIL+ YE+ASGQ +N  K++
Subjt:  GIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSS

Query:  FMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNELNA
           S NT     + +K +LGV       +YLGLPS VGRSKK  F +IK+RVW+ LQGWK K  S AGKEILIK+V QA+P Y+M CF+ P SLC ++ A
Subjt:  FMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNELNA

Query:  ICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRDLFKK
        +  +F+WG T   R+IHW  W RLC  K  GG+GFRD++ FN A+LAKQ WR++ +  +LL +V + ++F T S L A      S+ W+SI   R + + 
Subjt:  ICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRDLFKK

Query:  GYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPD---VQTFTVAQFIDD-QGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSA
        G  WRIG+G QV     PW+P   + +  +I P      T  V+  ID    +WN  ++   F++ D   I  IP++ R+  D +IW   P G +SV+SA
Subjt:  GYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPD---VQTFTVAQFIDD-QGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSA

Query:  YRLGFHLQDMTEA-SSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDY
        Y +    QD  +  + S+     +LW   W   +P K+K   WR  N+ LP  TNL  R +     C  C    ET +H  W C   + +W     + D 
Subjt:  YRLGFHLQDMTEA-SSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDY

Query:  LCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYG
        L F      N    ++ +W A       +  + K  ++ W +W  RN +    +    +++ ++    + EF+  + +                      
Subjt:  LCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYG

Query:  PVESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISVLP-PKL
                         P  +  L +W  P +E +  N D    +  +  GIG I+R   G P+    +LI     V  +EA A  E ++  + L   K+
Subjt:  PVESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISVLP-PKL

Query:  VIELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFPSWLLELNNVDI
        + E DS  +++ L+  +  L   G  ++++K  ++ ++    SH  R+ N++AH LARKA  L+  + W    PS ++   +VDI
Subjt:  VIELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFPSWLLELNNVDI

A0A7N2LIH6 Uncharacterized protein | e value: 7.7e-288 | % identity: 39.29
Query:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAII--SDSDGSWRFTGIYGNPQRDKHHETWTLLDRLRDNSGMPWV
        +ETK      +  +N L F  G+IVPS G+SGGL LLW++  D+  +S S  HID ++  + S G WR TG YG+P   K + +W LL+ L     MPW+
Subjt:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAII--SDSDGSWRFTGIYGNPQRDKHHETWTLLDRLRDNSGMPWV

Query:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHRPLLAVWLK
        + GDFNEI    EK+G   R    M  FR  +  C L D GF+GP +TWCN     +R   RLDRM+ N    +   + KV H+  +ASDH  LLA++L 
Subjt:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHRPLLAVWLK

Query:  DQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQELSSYK-GHSVMREIEHKEK
           +QR+   R+   FEE W + ++C++I++ +W    +    PV E+ + C   L +W+++ + G++   I +K+  +Q+L S    H    EI+  +K
Subjt:  DQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQELSSYK-GHSVMREIEHKEK

Query:  ELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQEQN
        E+  L   +EV W+QR+R  W+ +GD+N+K+FH  A+ RR+ NRI GL DD G W ED E  EK+   YF+++++S+ P   + +V L+A+   +T E N
Subjt:  ELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQEQN

Query:  RRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLN-GVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIAKT
          L K+F   +V   L+ MHP+KAPGPDG+  IFYQKYWDIVG+ + +  L+ LN GV P D +NKTYI LIPKTK+P  + +FRPISLC+VIYKII+K 
Subjt:  RRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLN-GVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIAKT

Query:  LANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLN
        LANR++ VL  +I  +QSAFVPGR+I+DN ++ FE +H+I  +RKGKEG+ A+KLDMSKAYDRVEW Y+  +M KMGF + W   IM CV SVSF +L+N
Subjt:  LANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLN

Query:  GIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSS
        G P+ SFTP+RGLRQGDP+SPYLFL+C EGLS+ + K E+     G+      P ISHLF+ADDS++F +A   +C  +  +L  YE+ SGQ +N +K+S
Subjt:  GIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSS

Query:  FMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNELNA
           S NTK    +  K + G Q  +   +YLGLP  +GR+KK+ FN IKD+V + + GWKGK  S AG+E+LIK+VAQA P Y M+ F+ P SLC ELN+
Subjt:  FMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNELNA

Query:  ICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRDLFKK
        +   FWWG   + +K+ W SW  LC  K  GGMGF+DLK FN ALLAKQ WR+ + P SL  RVL+ +YF   S++ A LG  PSY WRSI+  +++ K+
Subjt:  ICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRDLFKK

Query:  GYRWRIGNGLQVEASKEPWIPKEGSCKPILIHP-DVQTFTVAQFI-DDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSAYR
        G RW +G+G  +E     W+P   S K +      VQ   VA  I  ++G W   LV+ +F+  +A+ IL+IP++     D ++W   P G F+VKSAYR
Subjt:  GYRWRIGNGLQVEASKEPWIPKEGSCKPILIHP-DVQTFTVAQFI-DDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSAYR

Query:  LGFHL-----QDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSV
          F       +       S    + ++WK+ W    P KIK   WR    ILPT   L HR +  +  C  C  + ET+ H  W C + K  W     ++
Subjt:  LGFHL-----QDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSV

Query:  DYLCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAG
        D          +  + L+  W   +S+           I+ W +W++RN V H  ++   + +  + ++Y  E +            +L A+G       
Subjt:  DYLCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAG

Query:  YGPVESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISVLPPK
                         + P+     +RW  P  + + +N DA    +    GIG ++R   G  + A  K +    R    EA A   G+     L  K
Subjt:  YGPVESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISVLPPK

Query:  -LVIELDSVQVVHLLEEKEVDL-TELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFP
         +V+E D+  V+  L  K VD  T +   ++ A+  L  +      H  RR+N  AH LAR++ ++N    W +  P
Subjt:  -LVIELDSVQVVHLLEEKEVDL-TELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFP

A0A803PIB6 Uncharacterized protein | e value: 2.0e-275 | % identity: 38.61
Query:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDG-SWRFTGIYGNPQRDKHHETWTLLDRLRDNSGM-PWV
        METKL      R +N L F  GL V   G  GGLMLLWQ  +DV + S +  + D  I   DG  W F+ +YG P+      TW L+ RL D S + PW+
Subjt:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDG-SWRFTGIYGNPQRDKHHETWTLLDRLRDNSGM-PWV

Query:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDF---KVFHLPFTASDHRPLLAV
        + GD NEI SN  K GG  R +  MQ FR+++D C L +   +G E+TW  N      + ERLD   IN    +W  +F   K+ HL +  SDHR LLA 
Subjt:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDF---KVFHLPFTASDHRPLLAV

Query:  --WLKDQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLG-GKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQEL--SSYKGHSVMR
          +     +Q  +  R+  RFE+ W+K  +C +II  SW    +      +    +VC + L +W   K+ G ++  I   +K +  L  S+        
Subjt:  --WLKDQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLG-GKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQEL--SSYKGHSVMR

Query:  EIEHKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPT
        ++   E  L+ LL ++E YW+QR+R +W+  GDRNTK+FH +A+ R   NRI+ L DD GN +   EG+ +V + YFQ LFT+SN +  A+  +L  IPT
Subjt:  EIEHKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPT

Query:  SITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIY
        +I+ EQN  L + FT  +V   LK +   K+PG DG+ A+FY   W+IVG  +  + L VLN     +  NKT I LIPK K P  MKDFRPISLC+V Y
Subjt:  SITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIY

Query:  KIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVS
        KII+K LA R + VL ++IS +QSAF+  RLI+DN ++ FE +H++K + +G +G AALKLDMSKA+DRVEW ++  VMGKMGF       IM+C+ + S
Subjt:  KIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVS

Query:  FQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTI
        F  L+NG    S  P RGLRQGDPLSPYLFLIC+EGLS  L   EQ     GL ++ + PSI+HL +ADDSLLF +A ++ C SIK  L+ Y +ASGQ +
Subjt:  FQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTI

Query:  NFEKSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISL
        N +KS    SPNT  +   + +++LG+        YLGLP+   R K ++FNNIK+R+WK +  W  K FS  GKE+L+K+V QAIP YAMSCFR     
Subjt:  NFEKSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISL

Query:  CNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWG
        C ++  + ARFWWG++   +KIHW++W  LC+ K  GG+GFR    FNQA LAKQ+WRI + P SLL+RVL+GRY+    ++ A +    S TW+ IVWG
Subjt:  CNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWG

Query:  RDLFKKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVK
        R+L  KG   +IG+G  V  + + WIP     KP+          VA +I D   W+ +L+   F   D   IL IP++     D   W  D  G ++VK
Subjt:  RDLFKKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVK

Query:  SAYRLGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVD
        S Y L   L++   +SSST +  E+ W+ FW   +P K+++ GWRV N  LP   NL HR + T+ TC LC    E+  H  + C   K +W      +D
Subjt:  SAYRLGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVD

Query:  YLCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGY
        +       DG+    L        S       + K     W IW  RN  IH K      +LK+ +        I    E+YL     V   ++P  +  
Subjt:  YLCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGY

Query:  GPVESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGL---KSISVLP
                            A     +W+ P+     +N DA  ++  N  GIG I+R   G  + A  K +  N++   +EA A+  GL   K + + P
Subjt:  GPVESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGL---KSISVLP

Query:  PKLVIELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFPSWLLEL
            +E D + +VH L+ K+  L+     V++  + LS++ +  ISH+ R  N  AH LA++A  L++   W +  PS +  +
Subjt:  PKLVIELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFPSWLLEL

A0A803QH07 Uncharacterized protein | e value: 1.5e-278 | % identity: 38.64
Query:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDGS-WRFTGIYGNPQRDKHHETWTLLDRLRDNSGM-PWV
        METKL  N+  R RN L F +GL VP  G SGGLMLLW  E  V + +++    D  +   +G    FT  YG P       +WTLL RL+D + M PW+
Subjt:  METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDGS-WRFTGIYGNPQRDKHHETWTLLDRLRDNSGM-PWV

Query:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHRPLLAVWLK
        I GDFNEI  N  K GG  R E  M++FRS +D C L +  F G  +TW  N  Q E I ERLD    N       Q     HL F +SDHR +    L 
Subjt:  IGGDFNEITSNSEKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHRPLLAVWLK

Query:  DQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQELSSY--KGHSVMREIEHKE
            Q++   +   RFE+ W+K ++   +I  +WK +              C   L  W   K+ G ++  I++ +K + +L++   +  + + +++  E
Subjt:  DQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQELSSY--KGHSVMREIEHKE

Query:  KELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQEQ
        K L+ LL ++E YW QR+R +W+  GD+NTK+FH +A++R+  N I+ L +D G  +   E + +V   Y++ LF S + + ++++ +++AIP++I    
Subjt:  KELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQEQ

Query:  NRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIAKT
        N+ L   F+  +V+  L++M P K+PG DG+ A+FYQ YWDIVG+ +  L L +LN    + QLN + I LIPK  +P  M D+RPISLC+VIYK+I+KT
Subjt:  NRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIAKT

Query:  LANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLN
        +  R + VL  +IS +QSAF+  RLI+DN ++ FE +H ++ K +G+ G +ALKLDMSKA+DRVEW Y+  VM KMGF   W   IMSC+ + SF   LN
Subjt:  LANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLN

Query:  GIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSS
        G       P RGLRQGDPLSPYLFLIC+EGLS  L   E      GLR+    PS+SHL +ADDSLLF +A E+   ++K  L  Y KASGQ +N +KS 
Subjt:  GIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSS

Query:  FMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNELNA
           SPNT  +        L +   +   +YLGLPS  GR K+E+F+NIK++VWK L  W  K FS  GKE+L+K+V Q+IP YAMSCF+     CN+L +
Subjt:  FMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNELNA

Query:  ICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRDLFKK
        + A FWWG    G KIHW+ W  LC  K +GGMGFR    FNQALLAKQ+WRI   P+SLL+R+L+ RYF T S+L+AS+G++PSYTW+SI WGR+L  K
Subjt:  ICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNPSYTWRSIVWGRDLFKK

Query:  GYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSAYRLG
        G R+++GNG  ++ SK+PWIP   S +P+      Q   V+  I+D   WN  L+   F   D + IL+IP++    +D +IW     G ++VKS + L 
Subjt:  GYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSAYRLG

Query:  FHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDYLCFGG
         HL+D  ++SSS        WK FW   +PPKI++  W+V   ILP    L  R +  +  C LC +  E+  H  + C   K +W      +D+     
Subjt:  FHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDYLCFGG

Query:  RGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYGPVESS
          +G+    L   +   D + F        + + W IW  RN V H        ++ + I  Y   F        +L         SS +TA    + S+
Subjt:  RGDGNQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYGPVESS

Query:  DRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISVLPPKLV-IELD
         +          P+       W  P+   + LN DA  N+     GIG I+R  DG  V A  K+++ ++R   +EA AL   L  +S     +  IE D
Subjt:  DRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISVLPPKLV-IELD

Query:  SVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFP
        +++V   L   + DL+     + + + LLS +   L+SH  R  N  AH LAR A  L+   SW    P
Subjt:  SVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFP

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 5.5e-41 | % identity: 24.62
Query:  RRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIP-TSITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQK
        +R+ N+I  +K+D G+   D   ++    +Y+++L+ +   N+E ++  LD      + QE+   L +  T  ++  ++ ++   K+PGPDG  A FYQ+
Subjt:  RRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIP-TSITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQK

Query:  YWDIVGNDICDLCLKVLNGVGPIDQLNKTY----IALIPKT-KDPGCMKDFRPISLCSVIYKIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIG
        Y +    ++    LK+   +     L  ++    I LIPK  +D    ++FRPISL ++  KI+ K LANR++  +  +I   Q  F+PG     N    
Subjt:  YWDIVGNDICDLCLKVLNGVGPIDQLNKTY----IALIPKT-KDPGCMKDFRPISLCSVIYKIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIG

Query:  FECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSS
           +  I ++ K K  +  + +D  KA+D+++  ++ K + K+G +  +   I +  +  +  I+LNG    +F    G RQG PLSP LF I  E L+ 
Subjt:  FECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSS

Query:  TLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGL
         +    Q +   G+++      +S   +ADD +++ +      +++  +++ + K SG  IN +KS   +  N + +    + E+      + + +YLG+
Subjt:  TLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGL

Query:  PSQVGRSKKEIFNNIKDRVWKALQ----GWKGKFFSAAGKEILIKS--VAQAIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSH
          Q+ R  K++F      + K ++     WK    S  G+  ++K   + + I  +     + P++   EL     +F W    K  +I   + + L   
Subjt:  PSQVGRSKKEIFNNIKDRVWKALQ----GWKGKFFSAAGKEILIKS--VAQAIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSH

Query:  KTQGGMGFRDLKIFNQALLAKQSW
           GG+   D K++ +A + K +W
Subjt:  KTQGGMGFRDLKIFNQALLAKQSW

P08548 LINE-1 reverse transcriptase homolog | e value: 3.6e-40 | % identity: 24.01
Query:  KEKEIQELSSYKGHSVMREIEH-------KEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRAT---------TRRK--ANRIRGLKDDDGNWIE
        K+ E +E+++  GH    E E        + KE+  +  +      +R  ++     +++  WF  +           TR+K   + I  +++ +     
Subjt:  KEKEIQELSSYKGHSVMREIEH-------KEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFHMRAT---------TRRK--ANRIRGLKDDDGNWIE

Query:  DDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIP-TSITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNG
        D   ++K+ ++Y++ L++    N++ I+  L+A     ++Q++   L +  +  ++   ++N+   K+PGPDG  + FYQ + +    ++  + L +   
Subjt:  DDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIP-TSITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVGNDICDLCLKVLNG

Query:  VGPIDQLNKTY----IALIPKT-KDPGCMKDFRPISLCSVIYKIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAA
        +     L  T+    I LIPK  KDP   +++RPISL ++  KI+ K L NR++  +  II   Q  F+PG     N       +  I +K K K+ +  
Subjt:  VGPIDQLNKTY----IALIPKT-KDPGCMKDFRPISLCSVIYKIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAA

Query:  LKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNY
        L +D  KA+D ++  ++ + + K+G   T+   I +     +  I+LNG+   SF    G RQG PLSP LF I  E L+  +    + +   G+ I + 
Subjt:  LKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNY

Query:  CPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKS-SFMVSPNTKASHVDTIKEVLGVQHQESLGQYLG--LPSQVGRSKKEIFNNIK
           I    +ADD +++ +        +  ++ +Y   SG  IN  KS +F+ + N +A    T+K+ +         +YLG  L   V    KE +  ++
Subjt:  CPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKS-SFMVSPNTKASHVDTIKEVLGVQHQESLGQYLG--LPSQVGRSKKEIFNNIK

Query:  DRVWKALQGWKGKFFSAAGKEILIKS--VAQAIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLA
          + + +  WK    S  G+  ++K   + +AI N+     + P+S   +L  I   F W      +K    + T L +    GG+   DL+++ ++++ 
Subjt:  DRVWKALQGWKGKFFSAAGKEILIKS--VAQAIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLA

Query:  KQSW
        K +W
Subjt:  KQSW

P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 9.8e-38 | % identity: 27.69
Query:  LPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGG
        +P    R  K+ F  I +RV   + GW+ K  S AG+  L K+V  ++P ++MS    P S+ N L+ +   F WG+T + +K H   W+++CS K +GG
Subjt:  LPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGG

Query:  MGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRY----FKTGSYLNASLGNNPSYTWRSIVWG-RDLFKKGYRWRIGNGLQVEASKEPWIPKEGSCK
        +G R  K  N+AL++K  WR+++   SL   VL+ +Y     +   +L      + S TWRSI  G RD+   G  W  G+G Q+    + W+    S K
Subjt:  MGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRY----FKTGSYLNASLGNNPSYTWRSIVWG-RDLFKKGYRWRIGNGLQVEASKEPWIPKEGSCK

Query:  PILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRI--------KEDEIIWDLDPKGLFSVKSAYRLGFHLQDMTEASSSTYKHLES
        P+L   + +  T    +  +  W        F + D  T  N  +  R           D + W     G FSV+SAY       +M         ++ S
Subjt:  PILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRI--------KEDEIIWDLDPKGLFSVKSAYRLGFHLQDMTEASSSTYKHLES

Query:  LWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFP
         +   WK  +P ++K   W V N  + T    + R +  +  C +C+   E+ +H+  +C     +W    P
Subjt:  LWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFP

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 8.2e-45 | % identity: 27.08
Query:  IRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPT-SITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVG
        I  ++++ G+   D E ++     +++ L+++   N++ ++  LD      + Q+Q   L    + +++  V+ ++   K+PGPDG  A FYQ + +   
Subjt:  IRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPT-SITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVG

Query:  NDICDLCLKVLNGVGPIDQLNKTY----IALIPK-TKDPGCMKDFRPISLCSVIYKIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHA
         D+  +  K+ + +     L  ++    I LIPK  KDP  +++FRPISL ++  KI+ K LANR++  +  II P Q  F+PG     N       +H 
Subjt:  NDICDLCLKVLNGVGPIDQLNKTY----IALIPK-TKDPGCMKDFRPISLCSVIYKIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHA

Query:  IKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSE
        I +K K K  +  + LD  KA+D+++  ++ KV+ + G    + + I +        I +NG    +     G RQG PLSPYLF I  E L+  +    
Subjt:  IKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSE

Query:  QTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKS-SFMVSPNTKASHVDTIKEVLGVQHQESLGQYLG--LPSQ
        Q +   G++I      IS L  ADD +++    +   R + N++N + +  G  IN  KS +F+ + N +A     I+E        +  +YLG  L  +
Subjt:  QTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKS-SFMVSPNTKASHVDTIKEVLGVQHQESLGQYLG--LPSQ

Query:  VGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKS--VAQAIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMG
        V     + F ++K  + + L+ WK    S  G+  ++K   + +AI  +     + P    NEL     +F W   +K  +I   + + L   +T GG+ 
Subjt:  VGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKS--VAQAIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMG

Query:  FRDLKIFNQALLAKQSW
          DLK++ +A++ K +W
Subjt:  FRDLKIFNQALLAKQSW

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 8.2e-45 | % identity: 25.23
Query:  ISDSDGSWRFTGIYG---NPQRDKHHETWTLLDRLRDNSGMPWVIGGDFNEITSNSEKLGGLPRAE-------RDMQDFRSSIDSCELHDPGFIGPEYTW
        + +S  ++    +Y     P+R +  E+ +      D S    +IGGDFN      ++   +P+         R++    S +D     +P  +   Y  
Subjt:  ISDSDGSWRFTGIYG---NPQRDKHHETWTLLDRLRDNSGMPWVIGGDFNEITSNSEKLGGLPRAE-------RDMQDFRSSIDSCELHDPGFIGPEYTW

Query:  C-NNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHR------------PLLAVWLKDQKSQRKKGM-RYPRRFEEGWVKYDDCRKIIDQSWK
          + H+   RI    DR+ I++ +    Q   +   PF  SDH             P  A W  +      +G  +  R    GW  + D    ++Q W 
Subjt:  C-NNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHR------------PLLAVWLKDQKSQRKKGM-RYPRRFEEGWVKYDDCRKIIDQSWK

Query:  EMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEI----QELSSYKGHSVMREIEHKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWF
                  W+  KV L  L +       G     I     E+    Q LS  +  ++  E   +++ L N+ +        R+R + +   DR +++F
Subjt:  EMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEI----QELSSYKGHSVMREIEHKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWF

Query:  HMRATTRRKANR--IRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGI
        +  A  ++K NR  I  L  +DG  +ED E +   A  ++QNLF+    + +A E + D +P  +++ +  RL    T +++   L+ M  +K+PG DG+
Subjt:  HMRATTRRKANR--IRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGI

Query:  QAIFYQKYWDIVGNDICDLCLKVL-NGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNT
           F+Q +WD +G D   +  +    G  P+    +  ++L+PK  D   +K++RP+SL S  YKI+AK ++ R+++VL  +I P QS  VPGR I DN 
Subjt:  QAIFYQKYWDIVGNDICDLCLKVL-NGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNT

Query:  VIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEG
         +  + LH   ++R G   +A L LD  KA+DRV+  Y+   +    F   +   + +   S    + +N    A     RG+RQG PLS  L+ +  E 
Subjt:  VIGFECLHAIKSKRKGKEGIAALKLDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEG

Query:  LSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSSFMVSPNTKASHVDTIKEVL-GVQHQESLGQ
            L K       +GL +      +    YADD +L  + +  D    +     Y  AS   IN+ KSS ++  + K   VD +      +  +  + +
Subjt:  LSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDSLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSSFMVSPNTKASHVDTIKEVL-GVQHQESLGQ

Query:  YLGL-PSQVGRSKKEIFNNIKDRVWKALQGWKG--KFFSAAGKEILIKSVAQAIPNYAMSC
        YLG+  S       + F  +++ V   L  WKG  K  S  G+ ++I  +  +   Y + C
Subjt:  YLGL-PSQVGRSKKEIFNNIKDRVWKALQGWKG--KFFSAAGKEILIKSVAQAIPNYAMSC

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein | e value: 5.2e-26 | % identity: 24.03
Query:  SYSKGHIDAIISDSDGSWRFTGIYGNPQRDKHHETW--TLLDRLRDNSGMPWVIGGDFNEITSNSEKLGGLPRA--ERDMQDFRSSIDSCELHDPGFIGP
        S S+ +  AI+  +  SWR    Y   +  +    W  ++   +   +    ++ GDF++I + S+    L  +   R +++F++ +   +L D    G 
Subjt:  SYSKGHIDAIISDSDGSWRFTGIYGNPQRDKHHETW--TLLDRLRDNSGMPWVIGGDFNEITSNSEKLGGLPRA--ERDMQDFRSSIDSCELHDPGFIGP

Query:  EYTWCNNHLQTERIWERLDRMLINTEMQIWCQDF----KVFHLPFTASDHRPLLAVWLKDQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLG
         YTW +NH     I  +LDR + N +   W   F     VF L    SDH P + + L++   + KK  RY          +      +  +W+E   +G
Subjt:  EYTWCNNHLQTERIWERLDRMLINTEMQIWCQDF----KVFHLPFTASDHRPLLAVWLKDQKSQRKKGMRYPRRFEEGWVKYDDCRKIIDQSWKEMDQLG

Query:  GKPV-----WEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQELSSYKGHSVMREIEHKEKELENLLEDD-EVYWRQRAREEWITWGDRNTKWFHMRA
                  +  K C   L+       +   + A+   E    +L +    S+ R +EH  ++  N      E ++RQ++R +W+  GD NT++FH   
Subjt:  GKPV-----WEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQELSSYKGHSVMREIEHKEKELENLLEDD-EVYWRQRAREEWITWGDRNTKWFHMRA

Query:  TTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNM--EAIEVILDAIPTSITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIF
           +  N I+ L+ DD   +E+   ++++   Y+ +L  S +  +  ++++ I D  P         RL    + +++   +  M  +KAPGPD   A F
Subjt:  TTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNM--EAIEVILDAIPTSITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIF

Query:  YQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKII
        + + W +V +       +       + + N T I LIPK      +  FRP+S C+V+YKII
Subjt:  YQKYWDIVGNDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKII

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 1.5e-17 | % identity: 23.46
Query:  QYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSHK
        +YLGLP    +     +  + +++   +  W  +  S AG+  LI SV  ++ N+ MS FR P +   E+++IC+ F W   +   K    +W+ +C+ K
Subjt:  QYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIKSVAQAIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSHK

Query:  TQGGMGFRDLKIFNQALLAKQSWRI---IRYPESLLARVLRGRYFKTGSYL-NASLGNNPSY---TWRSIVWGRDLFKKGYRWRIGNGLQVEASKEPWIP
         +GG+G R LK  N+       W I         +  ++L+ R   +G    +   G+N S+    W  I  GR +   G+R  I  G+ + AS    + 
Subjt:  TQGGMGFRDLKIFNQALLAKQSWRI---IRYPESLLARVLRGRYFKTGSYL-NASLGNNPSY---TWRSIVWGRDLFKKGYRWRIGNGLQVEASKEPWIP

Query:  KEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSAYRLGFHLQDMTEASSSTYKHLESLW
             +P     D    T+ +  D      HQ + +                    ED + W    KG   +   ++  F+ ++ T A++   K   + +
Subjt:  KEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVKSAYRLGFHLQDMTEASSSTYKHLESLW

Query:  KSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWEC
        K  W +   PK  +  W    + L T   +       + +C LC +  ET  HLF+ C
Subjt:  KSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWEC

AT4G29090.1 Ribonuclease H-like superfamily protein | e value: 1.6e-64 | % identity: 26.73
Query:  AIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNA
        A+P Y M+CF  P ++C ++ ++ A FWW    + + +HW++W  L  +K +GG+GF+D++ FN ALL KQ WR++  PESL+A+V + RYF     LNA
Subjt:  AIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNA

Query:  SLGNNPSYTWRSIVWGRDLFKKGYRWRIGNGLQVEASKEPWI---PKEGSCKPILIHPD-----VQTFTVAQFIDDQG-SWNHQLVKASFMECDAQTILN
         LG+ PS+ W+SI   +++ ++G R  +GNG  +   +  W+   P   + +   + P           V+  ID+ G  W   +++  F E + + I  
Subjt:  SLGNNPSYTWRSIVWGRDLFKKGYRWRIGNGLQVEASKEPWI---PKEGSCKPILIHPD-----VQTFTVAQFIDDQG-SWNHQLVKASFMECDAQTILN

Query:  IPINPRIKEDEIIWDLDPKGLFSVKSAY-RLGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNK
        +    R   D   WD    G ++VKS Y  L   +   +     +   L  +++  WK+   PKI+   W+  ++ LP    L +R +     C  C + 
Subjt:  IPINPRIKEDEIIWDLDPKGLFSVKSAY-RLGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNK

Query:  PETTVHLFWECKMTKCLWTFFFPSVDYLCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLI--ICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVE
         ET  HL ++C   +  W     S+     G   D    +L   F     + +++       L+  + W++W +RNE++      +++++  + +  + E
Subjt:  PETTVHLFWECKMTKCLWTFFFPSVDYLCFGGRGDGNQQDLLERFWKATDSKRFDSGRIGKSLI--ICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVE

Query:  FKIQEGEESYLGGGSLVAEGSSPTTAGYGPVESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLI
        ++I+   ES          G+ P       V  S  G                 RW  P  +    N+DATWN      GIGW+LR + G     G + +
Subjt:  FKIQEGEESYLGGGSLVAEGSSPTTAGYGPVESSDRGVARGLTEEDPRASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLI

Query:  KKNWRVSWLEALALVEGLKSISVLPPKLVI-ELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACS-LNSSDSWS
         K   V   E  A+   + S+S      VI E DS  ++ +L   E+    L   + + + LLS + +     IPR  N +A ++AR++ S LN      
Subjt:  KKNWRVSWLEALALVEGLKSISVLPPKLVI-ELDSVQVVHLLEEKEVDLTELGIFVDEAKHLLSNYQDHLISHIPRRHNNMAHQLARKACS-LNSSDSWS

Query:  DFFPSW
           PSW
Subjt:  DFFPSW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | e value: 2.9e-37 | % identity: 48.03
Query:  AIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLC-SHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLN
        A+P YAMSCFR    LC +L +    FWW + +  RKI W +W +LC S +  GG+GFRDL  FNQALLAKQS+RII  P +LL+R+LR RYF   S + 
Subjt:  AIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLC-SHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLN

Query:  ASLGNNPSYTWRSIVWGRDLFKKGYRWRIGNGLQVEASKEPWIPKEGSCKPI
         S+G  PSY WRSI+ GR+L  +G    IG+G+  +   + WI  E    P+
Subjt:  ASLGNNPSYTWRSIVWGRDLFKKGYRWRIGNGLQVEASKEPWIPKEGSCKPI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | e value: 3.1e-15 | % identity: 48.53
Query:  LLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDS
        ++NG P+   TP+RGLRQGDPLSPYLF++C E LS    ++++     G+R++N  P I+HL +ADD+
Subjt:  LLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDS
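The % identity reported for each hit can be checked against the unlabeled middle line of its alignment. This is a minimal sketch, assuming the usual BLAST convention that a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `percent_identity` is hypothetical and not part of the database. The example strings are the ATMG01250.1 alignment listed above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent identity over the aligned columns of one alignment block.

    Assumes BLAST-style output: letters on the midline are identical
    residues, '+' are conservative substitutions, spaces are mismatches.
    """
    # Text dumps often strip trailing spaces from the midline, so pad it
    # back out to the width of the query line before comparing.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query line")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# Query and middle lines of the ATMG01250.1 hit.
query = "LLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADDS"
midline = "++NG P+   TP+RGLRQGDPLSPYLF++C E LS    ++++     G+R++N  P I+HL +ADD+"

print(f"{percent_identity(query, midline):.2f}")  # 48.53 (33 identical of 68 columns)
```

For alignments that span several blocks, the identical counts and column counts would need to be summed across blocks before dividing; the sketch handles a single block only.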


Sequences
CDS sequence
ATGGAGACCAAACTAAAGCAAAATAATTGTGAAAGAGTTCGTAACCATCTGAAGTTTGATTATGGCTTAATAGTTCCAAGCCAGGGTCAAAGTGGAGGTTTGATGTTACT
TTGGCAAAAGGAAATAGATGTCAATATTCGATCCTATTCGAAAGGGCACATAGATGCCATCATATCAGATTCTGATGGGTCTTGGCGGTTCACAGGTATCTATGGTAACC
CTCAAAGGGACAAGCATCATGAGACTTGGACCCTGCTGGATAGGCTGAGGGATAATTCGGGAATGCCTTGGGTTATTGGGGGGGATTTTAATGAGATCACTAGCAACTCT
GAGAAATTGGGAGGACTTCCGAGAGCCGAAAGAGACATGCAAGACTTTAGAAGCAGCATTGACTCATGTGAGCTCCATGATCCAGGTTTCATTGGGCCGGAATACACGTG
GTGCAACAATCATCTCCAAACAGAGAGGATATGGGAACGCCTGGACAGAATGCTTATAAACACTGAAATGCAAATTTGGTGTCAGGACTTTAAGGTGTTCCACCTTCCTT
TTACCGCCTCAGATCACCGCCCTTTGTTGGCAGTTTGGCTAAAAGACCAGAAAAGCCAGAGAAAGAAAGGCATGCGATATCCAAGGAGGTTTGAGGAAGGATGGGTCAAA
TATGATGATTGTAGGAAGATTATTGACCAGTCGTGGAAGGAAATGGACCAGTTGGGAGGTAAACCAGTTTGGGAGAAGACCAAGGTTTGCCTAAGCCGACTATCAGAATG
GAGTAGGAGTAAATATGAGGGGTCTATCAGGGGAGCCATTGCTAGAAAGGAAAAGGAAATTCAAGAGTTGAGCTCCTACAAAGGCCATAGTGTAATGAGAGAAATTGAGC
ACAAAGAGAAAGAGCTAGAGAATTTACTGGAGGATGATGAAGTATATTGGAGACAGAGAGCCCGGGAAGAGTGGATCACATGGGGTGATCGAAACACTAAATGGTTTCAT
ATGAGGGCAACAACTAGACGCAAGGCGAACCGTATTCGAGGGCTAAAGGACGACGATGGGAACTGGATTGAGGATGATGAAGGGATGGAAAAGGTAGCCAGTCAATACTT
CCAAAATTTATTCACTTCCTCCAACCCGAACATGGAGGCTATTGAAGTTATCCTTGATGCAATTCCTACCAGCATTACACAGGAGCAAAACAGAAGATTGTTAAAGAAGT
TCACTAGAGAAGATGTGCATGGGGTGCTAAAAAATATGCACCCTTCAAAGGCCCCAGGACCTGATGGGATCCAAGCCATCTTCTATCAGAAATACTGGGACATCGTGGGA
AATGATATTTGTGATCTATGCTTGAAGGTCTTAAATGGGGTTGGACCTATAGATCAGTTGAACAAAACATATATAGCCTTAATTCCGAAAACAAAGGATCCTGGATGTAT
GAAGGATTTTCGCCCTATTAGTCTATGCTCGGTTATCTACAAGATCATTGCTAAAACTCTAGCCAACAGAATGAGGACGGTTCTCGACACGATCATCTCGCCGAGTCAAT
CAGCTTTTGTGCCAGGACGACTCATATCAGATAACACCGTTATAGGGTTTGAATGCCTCCATGCGATTAAGAGCAAAAGAAAAGGGAAGGAAGGAATTGCAGCGCTTAAG
CTAGATATGAGCAAAGCGTACGACAGGGTGGAATGGTGCTACATCAGGAAAGTCATGGGTAAGATGGGCTTTAATAACACATGGACGGACAAAATAATGAGCTGCGTGGA
GTCAGTGAGCTTCCAAATTCTTCTTAATGGAATTCCTCGAGCTAGCTTCACCCCAAATCGGGGGCTGAGACAAGGAGATCCTCTCTCTCCATATTTGTTTCTGATCTGTG
CAGAGGGTTTGTCTAGTACCCTCAACAAATCAGAACAAACACGAACGTTTTCAGGTTTGCGTATCAATAATTATTGCCCTTCTATATCTCATCTCTTTTACGCTGATGAT
AGTCTCCTGTTTTTCAAAGCTATGGAAAAAGATTGCAGGTCCATCAAGAATATCCTCAACAAATACGAGAAGGCCTCGGGCCAAACCATAAATTTTGAGAAATCATCATT
TATGGTAAGCCCAAACACGAAGGCATCCCATGTGGACACAATAAAGGAGGTGTTAGGAGTTCAACATCAGGAAAGTCTAGGGCAATATCTAGGTCTTCCCTCTCAAGTTG
GCAGAAGCAAGAAGGAGATTTTTAACAACATCAAGGATAGAGTTTGGAAGGCGTTACAGGGATGGAAAGGGAAGTTTTTCTCAGCTGCTGGGAAAGAAATTCTTATTAAA
TCTGTTGCACAAGCAATCCCAAACTATGCGATGAGCTGTTTTCGATTTCCTATTTCCCTGTGTAACGAGTTAAATGCTATCTGTGCTAGGTTTTGGTGGGGAGCGACGGA
CAAAGGGAGGAAGATCCATTGGAGAAGTTGGACAAGACTCTGCAGTCATAAAACTCAGGGAGGCATGGGCTTTCGAGACCTGAAGATTTTCAACCAAGCATTGCTTGCAA
AACAGAGTTGGAGAATCATTCGTTACCCAGAGAGTTTGCTAGCTAGAGTTCTTAGGGGAAGATACTTCAAAACCGGCTCTTATCTGAATGCCTCGTTAGGAAACAATCCA
TCCTATACTTGGCGAAGCATAGTGTGGGGTCGTGATCTATTCAAAAAGGGTTATCGGTGGAGAATTGGGAATGGGCTCCAGGTGGAAGCTAGCAAAGAGCCTTGGATTCC
CAAAGAAGGATCTTGCAAGCCTATTCTAATCCACCCTGATGTTCAGACATTCACAGTGGCCCAATTCATCGATGATCAAGGAAGTTGGAACCATCAGTTAGTGAAAGCTT
CATTCATGGAGTGTGATGCCCAGACTATCCTGAATATCCCTATCAATCCTCGGATCAAGGAAGATGAGATTATTTGGGACTTAGACCCTAAAGGGCTCTTTTCTGTAAAG
AGTGCTTATAGGTTGGGTTTCCATCTACAGGACATGACGGAAGCATCTTCTTCGACCTACAAGCATTTGGAATCACTCTGGAAAAGCTTTTGGAAAGCTCCAATCCCTCC
AAAAATTAAATTATGTGGATGGAGAGTCTATAATGATATTCTCCCTACTCTTACTAATTTAAACCATAGAGGGATGGATACTAATCCTACATGTTATTTATGCAGGAATA
AACCGGAGACCACAGTGCATCTCTTTTGGGAATGCAAAATGACCAAATGCCTGTGGACTTTTTTCTTTCCATCTGTTGATTATCTATGTTTTGGTGGCAGGGGCGATGGG
AATCAACAAGATTTGCTGGAAAGATTTTGGAAAGCAACTGATTCAAAGAGGTTCGATAGCGGAAGAATAGGGAAAAGCCTAATCATTTGTTGGCAGATTTGGCATCATAG
AAATGAAGTTATTCATCACAAGCTCAACACAGACTCTGAGAAATTGAAGAATAAAATACAACAATATATGGTTGAATTCAAGATCCAAGAGGGAGAAGAATCGTACCTTG
GGGGAGGGTCCTTAGTGGCAGAGGGCTCCTCTCCAACGACAGCAGGTTACGGTCCAGTGGAGTCTTCTGATCGAGGCGTCGCCCGAGGCTTGACGGAGGAGGATCCGCGA
GCTTCAACCACCCTGCAAAGATGGTTGAAGCCTTCGACTGAATGTTGGACATTAAATAGCGATGCAACATGGAACGCGCAAATGAATTGTGGCGGTATTGGATGGATTCT
TAGGCGGCAAGATGGAAATCCTGTTACGGCTGGATTCAAATTAATCAAGAAAAACTGGAGAGTTAGTTGGTTGGAAGCCCTAGCTTTGGTGGAAGGGCTGAAATCGATTT
CGGTTTTGCCTCCCAAGTTGGTTATCGAGCTTGACTCAGTGCAAGTGGTACACCTGCTTGAGGAGAAAGAAGTTGATCTCACTGAACTCGGCATTTTTGTCGATGAAGCA
AAGCATCTGCTTTCCAACTACCAAGATCACTTGATTTCCCACATTCCCAGGAGGCACAATAACATGGCCCATCAACTGGCCCGCAAAGCTTGTTCTCTTAATTCTTCGGA
TAGTTGGAGTGATTTTTTCCCTTCGTGGCTTTTAGAATTAAACAATGTAGACATTGGTGTTGATTCCATTTTTGGGGGTGCCTGTCCCACAAATGGCCACCCAATGGGAC
CGGTTGCTCTTGTTTAA
mRNA sequence
ATGGAGACCAAACTAAAGCAAAATAATTGTGAAAGAGTTCGTAACCATCTGAAGTTTGATTATGGCTTAATAGTTCCAAGCCAGGGTCAAAGTGGAGGTTTGATGTTACT
TTGGCAAAAGGAAATAGATGTCAATATTCGATCCTATTCGAAAGGGCACATAGATGCCATCATATCAGATTCTGATGGGTCTTGGCGGTTCACAGGTATCTATGGTAACC
CTCAAAGGGACAAGCATCATGAGACTTGGACCCTGCTGGATAGGCTGAGGGATAATTCGGGAATGCCTTGGGTTATTGGGGGGGATTTTAATGAGATCACTAGCAACTCT
GAGAAATTGGGAGGACTTCCGAGAGCCGAAAGAGACATGCAAGACTTTAGAAGCAGCATTGACTCATGTGAGCTCCATGATCCAGGTTTCATTGGGCCGGAATACACGTG
GTGCAACAATCATCTCCAAACAGAGAGGATATGGGAACGCCTGGACAGAATGCTTATAAACACTGAAATGCAAATTTGGTGTCAGGACTTTAAGGTGTTCCACCTTCCTT
TTACCGCCTCAGATCACCGCCCTTTGTTGGCAGTTTGGCTAAAAGACCAGAAAAGCCAGAGAAAGAAAGGCATGCGATATCCAAGGAGGTTTGAGGAAGGATGGGTCAAA
TATGATGATTGTAGGAAGATTATTGACCAGTCGTGGAAGGAAATGGACCAGTTGGGAGGTAAACCAGTTTGGGAGAAGACCAAGGTTTGCCTAAGCCGACTATCAGAATG
GAGTAGGAGTAAATATGAGGGGTCTATCAGGGGAGCCATTGCTAGAAAGGAAAAGGAAATTCAAGAGTTGAGCTCCTACAAAGGCCATAGTGTAATGAGAGAAATTGAGC
ACAAAGAGAAAGAGCTAGAGAATTTACTGGAGGATGATGAAGTATATTGGAGACAGAGAGCCCGGGAAGAGTGGATCACATGGGGTGATCGAAACACTAAATGGTTTCAT
ATGAGGGCAACAACTAGACGCAAGGCGAACCGTATTCGAGGGCTAAAGGACGACGATGGGAACTGGATTGAGGATGATGAAGGGATGGAAAAGGTAGCCAGTCAATACTT
CCAAAATTTATTCACTTCCTCCAACCCGAACATGGAGGCTATTGAAGTTATCCTTGATGCAATTCCTACCAGCATTACACAGGAGCAAAACAGAAGATTGTTAAAGAAGT
TCACTAGAGAAGATGTGCATGGGGTGCTAAAAAATATGCACCCTTCAAAGGCCCCAGGACCTGATGGGATCCAAGCCATCTTCTATCAGAAATACTGGGACATCGTGGGA
AATGATATTTGTGATCTATGCTTGAAGGTCTTAAATGGGGTTGGACCTATAGATCAGTTGAACAAAACATATATAGCCTTAATTCCGAAAACAAAGGATCCTGGATGTAT
GAAGGATTTTCGCCCTATTAGTCTATGCTCGGTTATCTACAAGATCATTGCTAAAACTCTAGCCAACAGAATGAGGACGGTTCTCGACACGATCATCTCGCCGAGTCAAT
CAGCTTTTGTGCCAGGACGACTCATATCAGATAACACCGTTATAGGGTTTGAATGCCTCCATGCGATTAAGAGCAAAAGAAAAGGGAAGGAAGGAATTGCAGCGCTTAAG
CTAGATATGAGCAAAGCGTACGACAGGGTGGAATGGTGCTACATCAGGAAAGTCATGGGTAAGATGGGCTTTAATAACACATGGACGGACAAAATAATGAGCTGCGTGGA
GTCAGTGAGCTTCCAAATTCTTCTTAATGGAATTCCTCGAGCTAGCTTCACCCCAAATCGGGGGCTGAGACAAGGAGATCCTCTCTCTCCATATTTGTTTCTGATCTGTG
CAGAGGGTTTGTCTAGTACCCTCAACAAATCAGAACAAACACGAACGTTTTCAGGTTTGCGTATCAATAATTATTGCCCTTCTATATCTCATCTCTTTTACGCTGATGAT
AGTCTCCTGTTTTTCAAAGCTATGGAAAAAGATTGCAGGTCCATCAAGAATATCCTCAACAAATACGAGAAGGCCTCGGGCCAAACCATAAATTTTGAGAAATCATCATT
TATGGTAAGCCCAAACACGAAGGCATCCCATGTGGACACAATAAAGGAGGTGTTAGGAGTTCAACATCAGGAAAGTCTAGGGCAATATCTAGGTCTTCCCTCTCAAGTTG
GCAGAAGCAAGAAGGAGATTTTTAACAACATCAAGGATAGAGTTTGGAAGGCGTTACAGGGATGGAAAGGGAAGTTTTTCTCAGCTGCTGGGAAAGAAATTCTTATTAAA
TCTGTTGCACAAGCAATCCCAAACTATGCGATGAGCTGTTTTCGATTTCCTATTTCCCTGTGTAACGAGTTAAATGCTATCTGTGCTAGGTTTTGGTGGGGAGCGACGGA
CAAAGGGAGGAAGATCCATTGGAGAAGTTGGACAAGACTCTGCAGTCATAAAACTCAGGGAGGCATGGGCTTTCGAGACCTGAAGATTTTCAACCAAGCATTGCTTGCAA
AACAGAGTTGGAGAATCATTCGTTACCCAGAGAGTTTGCTAGCTAGAGTTCTTAGGGGAAGATACTTCAAAACCGGCTCTTATCTGAATGCCTCGTTAGGAAACAATCCA
TCCTATACTTGGCGAAGCATAGTGTGGGGTCGTGATCTATTCAAAAAGGGTTATCGGTGGAGAATTGGGAATGGGCTCCAGGTGGAAGCTAGCAAAGAGCCTTGGATTCC
CAAAGAAGGATCTTGCAAGCCTATTCTAATCCACCCTGATGTTCAGACATTCACAGTGGCCCAATTCATCGATGATCAAGGAAGTTGGAACCATCAGTTAGTGAAAGCTT
CATTCATGGAGTGTGATGCCCAGACTATCCTGAATATCCCTATCAATCCTCGGATCAAGGAAGATGAGATTATTTGGGACTTAGACCCTAAAGGGCTCTTTTCTGTAAAG
AGTGCTTATAGGTTGGGTTTCCATCTACAGGACATGACGGAAGCATCTTCTTCGACCTACAAGCATTTGGAATCACTCTGGAAAAGCTTTTGGAAAGCTCCAATCCCTCC
AAAAATTAAATTATGTGGATGGAGAGTCTATAATGATATTCTCCCTACTCTTACTAATTTAAACCATAGAGGGATGGATACTAATCCTACATGTTATTTATGCAGGAATA
AACCGGAGACCACAGTGCATCTCTTTTGGGAATGCAAAATGACCAAATGCCTGTGGACTTTTTTCTTTCCATCTGTTGATTATCTATGTTTTGGTGGCAGGGGCGATGGG
AATCAACAAGATTTGCTGGAAAGATTTTGGAAAGCAACTGATTCAAAGAGGTTCGATAGCGGAAGAATAGGGAAAAGCCTAATCATTTGTTGGCAGATTTGGCATCATAG
AAATGAAGTTATTCATCACAAGCTCAACACAGACTCTGAGAAATTGAAGAATAAAATACAACAATATATGGTTGAATTCAAGATCCAAGAGGGAGAAGAATCGTACCTTG
GGGGAGGGTCCTTAGTGGCAGAGGGCTCCTCTCCAACGACAGCAGGTTACGGTCCAGTGGAGTCTTCTGATCGAGGCGTCGCCCGAGGCTTGACGGAGGAGGATCCGCGA
GCTTCAACCACCCTGCAAAGATGGTTGAAGCCTTCGACTGAATGTTGGACATTAAATAGCGATGCAACATGGAACGCGCAAATGAATTGTGGCGGTATTGGATGGATTCT
TAGGCGGCAAGATGGAAATCCTGTTACGGCTGGATTCAAATTAATCAAGAAAAACTGGAGAGTTAGTTGGTTGGAAGCCCTAGCTTTGGTGGAAGGGCTGAAATCGATTT
CGGTTTTGCCTCCCAAGTTGGTTATCGAGCTTGACTCAGTGCAAGTGGTACACCTGCTTGAGGAGAAAGAAGTTGATCTCACTGAACTCGGCATTTTTGTCGATGAAGCA
AAGCATCTGCTTTCCAACTACCAAGATCACTTGATTTCCCACATTCCCAGGAGGCACAATAACATGGCCCATCAACTGGCCCGCAAAGCTTGTTCTCTTAATTCTTCGGA
TAGTTGGAGTGATTTTTTCCCTTCGTGGCTTTTAGAATTAAACAATGTAGACATTGGTGTTGATTCCATTTTTGGGGGTGCCTGTCCCACAAATGGCCACCCAATGGGAC
CGGTTGCTCTTGTTTAA
Protein sequence
METKLKQNNCERVRNHLKFDYGLIVPSQGQSGGLMLLWQKEIDVNIRSYSKGHIDAIISDSDGSWRFTGIYGNPQRDKHHETWTLLDRLRDNSGMPWVIGGDFNEITSNS
EKLGGLPRAERDMQDFRSSIDSCELHDPGFIGPEYTWCNNHLQTERIWERLDRMLINTEMQIWCQDFKVFHLPFTASDHRPLLAVWLKDQKSQRKKGMRYPRRFEEGWVK
YDDCRKIIDQSWKEMDQLGGKPVWEKTKVCLSRLSEWSRSKYEGSIRGAIARKEKEIQELSSYKGHSVMREIEHKEKELENLLEDDEVYWRQRAREEWITWGDRNTKWFH
MRATTRRKANRIRGLKDDDGNWIEDDEGMEKVASQYFQNLFTSSNPNMEAIEVILDAIPTSITQEQNRRLLKKFTREDVHGVLKNMHPSKAPGPDGIQAIFYQKYWDIVG
NDICDLCLKVLNGVGPIDQLNKTYIALIPKTKDPGCMKDFRPISLCSVIYKIIAKTLANRMRTVLDTIISPSQSAFVPGRLISDNTVIGFECLHAIKSKRKGKEGIAALK
LDMSKAYDRVEWCYIRKVMGKMGFNNTWTDKIMSCVESVSFQILLNGIPRASFTPNRGLRQGDPLSPYLFLICAEGLSSTLNKSEQTRTFSGLRINNYCPSISHLFYADD
SLLFFKAMEKDCRSIKNILNKYEKASGQTINFEKSSFMVSPNTKASHVDTIKEVLGVQHQESLGQYLGLPSQVGRSKKEIFNNIKDRVWKALQGWKGKFFSAAGKEILIK
SVAQAIPNYAMSCFRFPISLCNELNAICARFWWGATDKGRKIHWRSWTRLCSHKTQGGMGFRDLKIFNQALLAKQSWRIIRYPESLLARVLRGRYFKTGSYLNASLGNNP
SYTWRSIVWGRDLFKKGYRWRIGNGLQVEASKEPWIPKEGSCKPILIHPDVQTFTVAQFIDDQGSWNHQLVKASFMECDAQTILNIPINPRIKEDEIIWDLDPKGLFSVK
SAYRLGFHLQDMTEASSSTYKHLESLWKSFWKAPIPPKIKLCGWRVYNDILPTLTNLNHRGMDTNPTCYLCRNKPETTVHLFWECKMTKCLWTFFFPSVDYLCFGGRGDG
NQQDLLERFWKATDSKRFDSGRIGKSLIICWQIWHHRNEVIHHKLNTDSEKLKNKIQQYMVEFKIQEGEESYLGGGSLVAEGSSPTTAGYGPVESSDRGVARGLTEEDPR
ASTTLQRWLKPSTECWTLNSDATWNAQMNCGGIGWILRRQDGNPVTAGFKLIKKNWRVSWLEALALVEGLKSISVLPPKLVIELDSVQVVHLLEEKEVDLTELGIFVDEA
KHLLSNYQDHLISHIPRRHNNMAHQLARKACSLNSSDSWSDFFPSWLLELNNVDIGVDSIFGGACPTNGHPMGPVALV