CuGenDBv2

Spg033450 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg033450
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: scaffold5:4817049..4823425
RNA-Seq Expression: Spg033450
Synteny: Spg033450
Gene Ontology terms: NA
InterPro domains: IPR026960 - Reverse transcriptase zinc-binding domain
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 6.3e-133; % identity: 27.44
Query:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL
        MW++   +I    KG FSVS+ V  ++G  WWLS IYGPA RK R  FW EL  L  +C   W+LGGDFNV R   ET++ NPA LSM +FN FIS+ +L
Subjt:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL

Query:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF
        +DPPL N  YTW+NLR++  +SRLDRFLF++ W   F  H SK L+R TSDHFPI+L+ S+ +WGP PFRF N  L +  +  N+E WW +T   G+ G+
Subjt:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF

Query:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR
        SF+RRLK LA  +K W        +  KKA   EID+ID LE+ GS  ++ ++ R +LKADL +  L EA+ W Q+CK++W+ + DENS+FFHKICTAR+
Subjt:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR

Query:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSA-------------------------------------------
        ++  I ++    G + ++D  +    I HF DIY  N+NS+  I NL+W PI+++++                                           
Subjt:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSA-------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------RGSTN-----LVRWEIVSSSKAEGGLGIHKIKETNDALLLKW
                                                                   G++N     L+RW  + S K +GGLGIH +  TN ALL KW
Subjt:  ----------------------------------------------------------RGSTN-----LVRWEIVSSSKAEGGLGIHKIKETNDALLLKW

Query:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW
        +W+F  E++ LW+  I +KY       FPS  K SS+ SPW A+++    F+ N  W + +G    FW DNW+   PL     RL+ LS+NK  S+ E W
Subjt:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW

Query:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRP--FHSPGEKILNNLWTADIPKKIKVFIWSLFH
        + S   W+    RPL D +   W      LPTP P RG     W ++ + +F T S +  ++  P  P  FH     +   LW  + PKK K FIW+L H
Subjt:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRP--FHSPGEKILNNLWTADIPKKIKVFIWSLFH

Query:  RSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERN
          INT+DRLQ    N   +P+ C +C K+ EDI+HLFIHC  +    +K    L  +   P  + S   ++ +    +Q+ L+  N     LW +W ERN
Subjt:  RSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERN

Query:  RRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSFL
         RIF+ + +    LWED ++   LW+ KSK+FS+Y    IALN  +F+
Subjt:  RRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSFL

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 1.2e-110; % identity: 27.77
Query:  SDSLFIETSDSIL---CETKLCNTNKRIIKSLWSSISVNWIALDAYGSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRK
        +DS    TS ++L     + L  TNKRIIKSLW S S+NWIA +A GSSGGI+I+WD    ++    +GLFS+S    L++  SWWL+G+YGP  R++R 
Subjt:  SDSLFIETSDSIL---CETKLCNTNKRIIKSLWSSISVNWIALDAYGSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRK

Query:  SFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLS
         FW EL++L  L    W+LGGD NV R   E++S   +  +    N FIS+  L+DPPL N  +TW+NLR+ P  SR+DRFL+++SW   F+ H ++ L 
Subjt:  SFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLS

Query:  RCTSDHFPILLDDSSS--TWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESS
        R TSDHFP++ +DS+   +WGP PFR ++  L +  F  N+  WW ++   G+PGFSFI+RLK LA  +K W+     S    K+AI  E+D ID  E  
Subjt:  RCTSDHFPILLDDSSS--TWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESS

Query:  GSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWII
          L       R +LKADL E +L E+++W QR KKLWL + DENS+FFH+IC++R++R+ IHE+  +EG    ++  +    I  F+ IY  +  S+ + 
Subjt:  GSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWII

Query:  V-NLNWEPINS-----------------------------------------------------------------------------------------
        + NL+W PI S                                                                                         
Subjt:  V-NLNWEPINS-----------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------VSARGSTN
                                                                                                    VS  G+ N
Subjt:  --------------------------------------------------------------------------------------------VSARGSTN

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------LVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNE
                                                                     L+ W  VS SK EGGLGI ++  TN ALL KW+WR+ +E
Subjt:  -------------------------------------------------------------LVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNE

Query:  ENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMW
         N LWR  I  KY        PS+   S+S++PW +I    D F +N  WD+ NG    FW+ NWS  G L     RL+ L+ +K +S+ + W++ D  W
Subjt:  ENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMW

Query:  NFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRL
        N   RR L DR+   W +  E+LPTP   RGS    W+   +  F+  SA+ ++S    +    P  K+L  +W + IP KIK F+W L  R INT + +
Subjt:  NFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRL

Query:  QAIFQNSLHNPS
        Q    N+L  P+
Subjt:  QAIFQNSLHNPS

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 2.3e-119; % identity: 27.83
Query:  SSISVNWIALDAYGSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETS
        ++ S N +      S+GGI+I+WD    ++    +G FS+S   + S   SWWL+G+YGP  R++R + W +L++LH L    W++GGD NV R   E++
Subjt:  SSISVNWIALDAYGSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETS

Query:  SNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSST--WGPCPFRFDNYLLD
        +   +  S +  N FIS+  L+DPPL N  YTW+NLR+ P  SRLDRFL+++ W I FN H ++ L R TSDHFP++ +DS+ST  WGP PFR ++  L+
Subjt:  SNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSST--WGPCPFRFDNYLLD

Query:  NKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRC
        +  F  N+E WW  +   G PGF FI+RLK LA  +K W+     S    K+ I  E+D ID  E    L       R +LKA+L + +L E+++W QR 
Subjt:  NKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRC

Query:  KKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEG----------ISIVSDF-----------------------------------------------
        KKLWL + DENSAFFH+IC++R++RN IHE+  +EG          ++ V+ F                                               
Subjt:  KKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEG----------ISIVSDF-----------------------------------------------

Query:  ---------------------LLEKEVIDHFADIYD-----YNQNSEWIIV-------------------------------------------------
                             LL+++++D F D ++      N N+ +I +                                                 
Subjt:  ---------------------LLEKEVIDHFADIYD-----YNQNSEWIIV-------------------------------------------------

Query:  ---------------------------------------NLNWE--------------------------------------------------------
                                               NLNW                                                         
Subjt:  ---------------------------------------NLNWE--------------------------------------------------------

Query:  ------------------------------------------------------------------------------------PINSVSA---------
                                                                                            PI  +S          
Subjt:  ------------------------------------------------------------------------------------PINSVSA---------

Query:  -----------RGS-----TNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKL
                   +GS     ++L+ W IV+  K EGGLGI +++ TN ALL KW+WR+++E N+LWR  I+ KY   H    PS+   SSS++PW +I   
Subjt:  -----------RGS-----TNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKL

Query:  QDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVS
         D F +N  WD+ NG    FW+ NWS  G L     RL+ LS +K  SI +VW+S++  W    RR L DR+L  W +  E LP     RG     W+  
Subjt:  QDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVS

Query:  GDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNK
            F+  SA++ +S  P R   +P  K+LN +W   +P KIK F+W L  R +NT +        +L  P+ C+LC K SE   HLF+HC       + 
Subjt:  GDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNK

Query:  INLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVF---IATLWLLWNERNRRIFEDKARTRN--QLWEDIVSLAALWATKSKVFSDYSASHIALNW
        ++ +L  + +     D F A      +++Q     + VF   IA LW +W ERN RIF+  +  ++   LWED   L   W ++   F +YSA+ IALN 
Subjt:  INLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVF---IATLWLLWNERNRRIFEDKARTRN--QLWEDIVSLAALWATKSKVFSDYSASHIALNW

Query:  KSF
         +F
Subjt:  KSF

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 2.0e-131; % identity: 28.39
Query:  MWDELCCNITDHVKGLFSVSVLVTLSDGFS---WWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISD
        MWD+L  N+TD ++G FS+S+ +   DG S   WWLS IYGP+  + RKSFW EL DL   C   WLL GDFNV R  SETS+ NP+K SM  FNKFI+D
Subjt:  MWDELCCNITDHVKGLFSVSVLVTLSDGFS---WWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISD

Query:  TDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGF
        ++L+DPPL N  +TW+NLR  PV+SR+DRFL++ +W   F  H+SK LSR TSDHFPI+L+ S  +WGP PF+  N  L    F  N+ +WW +   EG 
Subjt:  TDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGF

Query:  PGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICT
        PGFSF+R+LK L+  ++N +  N     E K A   EID ID LE+ G+L +     R  LKAD+  +   EA+ W Q+ K+LW+++ DEN++FFHKIC+
Subjt:  PGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICT

Query:  ARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINS------------------------------------------
        AR+RR+ I  + + +G+   ++  + K  +DHF DIY    + S W+I NLNW PI++                                          
Subjt:  ARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINS------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------VSARGSTN------------------
                                                                                  V   G+ N                  
Subjt:  --------------------------------------------------------------------------VSARGSTN------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------LVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHF
                                                   LV W  ++SSK +GGLGI ++K+TN ALL KW+WR+ +E++ LW+  IN KY SL  
Subjt:  -------------------------------------------LVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHF

Query:  ECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSE
           P     SSSRSPW +I K  + F  +  W I+NGRS  FWH +W    PL     RLY LS+NK  SI ++W+++   W+  PRR L + +L  W+E
Subjt:  ECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSE

Query:  FAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCW
            +     + G D   W ++ +GL+T  S +  L            +    NLW   IPKK   FIW+L + S+NT+++L          PS C++C 
Subjt:  FAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCW

Query:  KNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTRNQLWEDIVSLAALWAT
        +N ED  HLFI C  A      I+  L  S V   +    C  + + K  +++ ++  N + + LW +W ERN RIF  K +T  ++WEDI +LA LW +
Subjt:  KNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTRNQLWEDIVSLAALWAT

Query:  KSKVFSDYSASHIALNWKSF
        +S +FS+Y AS IALN  +F
Subjt:  KSKVFSDYSASHIALNWKSF

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]; e value: 2.0e-110; % identity: 25.35
Query:  GSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFN
        G  GGI+++WD+    + D   G +S+S+ +  ++G +WWL+ +YGP     R   W EL  L  LC   WL+ GDFN+ R   ET++ +  K +M+ FN
Subjt:  GSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFN

Query:  KFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDT
         FIS  +L+DPP +N  +TW+NLR  P  SRLDRFL S  W   F  H S+ L R  SDHFPILL+     WGPCPFR +N  L +K F  N  +WW  +
Subjt:  KFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDT

Query:  FCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFF
           GFPG++FI+ L  L++ +K W+ +  + +   KKA+  EID ID LE  G +     Q R SLK+DL      +A+ W+QR ++ W    DEN+++F
Subjt:  FCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFF

Query:  HKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPIN---------------------------------------
        H+ICT  +R+N I  +    G S+ S   + +  I HF +IY      E +I NL+W PI+                                       
Subjt:  HKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPIN---------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------SVSA--------------------------------
                                                                        ++SA                                
Subjt:  ----------------------------------------------------------------SVSA--------------------------------

Query:  -----------------------------------------------------------------------RGSTNLVRWEIVSSSKAEGGLGIHKIKET
                                                                               + + +L+ W I +S K  GGLGI K+K+T
Subjt:  -----------------------------------------------------------------------RGSTNLVRWEIVSSSKAEGGLGIHKIKET

Query:  NDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNK
        N ALL KW+WR+ NE N+LW+  I+ KY+  H    P   + SS+ SPW+AI K +D + +   W   +G S  FWH  W +  PL     RLY LS+ +
Subjt:  NDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNK

Query:  NLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVF
        + ++ E+W      WN +PRRPL +R+ Q W      LP  +  RG     W  S    +T  SA+ I     S P  +  EK L +LW + IP+K K F
Subjt:  NLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVF

Query:  IWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWL
        IW++ H+ +NT D +Q    +   NPS CI C  ++ED++HLFI C  A    N  +   G  MV    +   C  L      + + ++  N  IATLW 
Subjt:  IWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWL

Query:  LWNERNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKS
        +W  RN  IF DK  +    WEDI +L   W++KSK   +YS + IALN K+
Subjt:  LWNERNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKS

TrEMBL top hits (e value, % identity, alignment)
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein; e value: 3.0e-133; % identity: 27.44
Query:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL
        MW++   +I    KG FSVS+ V  ++G  WWLS IYGPA RK R  FW EL  L  +C   W+LGGDFNV R   ET++ NPA LSM +FN FIS+ +L
Subjt:  MWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDL

Query:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF
        +DPPL N  YTW+NLR++  +SRLDRFLF++ W   F  H SK L+R TSDHFPI+L+ S+ +WGP PFRF N  L +  +  N+E WW +T   G+ G+
Subjt:  LDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGF

Query:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR
        SF+RRLK LA  +K W        +  KKA   EID+ID LE+ GS  ++ ++ R +LKADL +  L EA+ W Q+CK++W+ + DENS+FFHKICTAR+
Subjt:  SFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARR

Query:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSA-------------------------------------------
        ++  I ++    G + ++D  +    I HF DIY  N+NS+  I NL+W PI+++++                                           
Subjt:  RRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSA-------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------RGSTN-----LVRWEIVSSSKAEGGLGIHKIKETNDALLLKW
                                                                   G++N     L+RW  + S K +GGLGIH +  TN ALL KW
Subjt:  ----------------------------------------------------------RGSTN-----LVRWEIVSSSKAEGGLGIHKIKETNDALLLKW

Query:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW
        +W+F  E++ LW+  I +KY       FPS  K SS+ SPW A+++    F+ N  W + +G    FW DNW+   PL     RL+ LS+NK  S+ E W
Subjt:  IWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVW

Query:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRP--FHSPGEKILNNLWTADIPKKIKVFIWSLFH
        + S   W+    RPL D +   W      LPTP P RG     W ++ + +F T S +  ++  P  P  FH     +   LW  + PKK K FIW+L H
Subjt:  SSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRP--FHSPGEKILNNLWTADIPKKIKVFIWSLFH

Query:  RSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERN
          INT+DRLQ    N   +P+ C +C K+ EDI+HLFIHC  +    +K    L  +   P  + S   ++ +    +Q+ L+  N     LW +W ERN
Subjt:  RSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERN

Query:  RRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSFL
         RIF+ + +    LWED ++   LW+ KSK+FS+Y    IALN  +F+
Subjt:  RRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSFL

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein; e value: 9.5e-111; % identity: 25.35
Query:  GSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFN
        G  GGI+++WD+    + D   G +S+S+ +  ++G +WWL+ +YGP     R   W EL  L  LC   WL+ GDFN+ R   ET++ +  K +M+ FN
Subjt:  GSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFN

Query:  KFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDT
         FIS  +L+DPP +N  +TW+NLR  P  SRLDRFL S  W   F  H S+ L R  SDHFPILL+     WGPCPFR +N  L +K F  N  +WW  +
Subjt:  KFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDT

Query:  FCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFF
           GFPG++FI+ L  L++ +K W+ +  + +   KKA+  EID ID LE  G +     Q R SLK+DL      +A+ W+QR ++ W    DEN+++F
Subjt:  FCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFF

Query:  HKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPIN---------------------------------------
        H+ICT  +R+N I  +    G S+ S   + +  I HF +IY      E +I NL+W PI+                                       
Subjt:  HKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPIN---------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------SVSA--------------------------------
                                                                        ++SA                                
Subjt:  ----------------------------------------------------------------SVSA--------------------------------

Query:  -----------------------------------------------------------------------RGSTNLVRWEIVSSSKAEGGLGIHKIKET
                                                                               + + +L+ W I +S K  GGLGI K+K+T
Subjt:  -----------------------------------------------------------------------RGSTNLVRWEIVSSSKAEGGLGIHKIKET

Query:  NDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNK
        N ALL KW+WR+ NE N+LW+  I+ KY+  H    P   + SS+ SPW+AI K +D + +   W   +G S  FWH  W +  PL     RLY LS+ +
Subjt:  NDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNK

Query:  NLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVF
        + ++ E+W      WN +PRRPL +R+ Q W      LP  +  RG     W  S    +T  SA+ I     S P  +  EK L +LW + IP+K K F
Subjt:  NLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVF

Query:  IWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWL
        IW++ H+ +NT D +Q    +   NPS CI C  ++ED++HLFI C  A    N  +   G  MV    +   C  L      + + ++  N  IATLW 
Subjt:  IWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWL

Query:  LWNERNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKS
        +W  RN  IF DK  +    WEDI +L   W++KSK   +YS + IALN K+
Subjt:  LWNERNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKS

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein; e value: 1.1e-119; % identity: 27.83
Query:  SSISVNWIALDAYGSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETS
        ++ S N +      S+GGI+I+WD    ++    +G FS+S   + S   SWWL+G+YGP  R++R + W +L++LH L    W++GGD NV R   E++
Subjt:  SSISVNWIALDAYGSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETS

Query:  SNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSST--WGPCPFRFDNYLLD
        +   +  S +  N FIS+  L+DPPL N  YTW+NLR+ P  SRLDRFL+++ W I FN H ++ L R TSDHFP++ +DS+ST  WGP PFR ++  L+
Subjt:  SNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSST--WGPCPFRFDNYLLD

Query:  NKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRC
        +  F  N+E WW  +   G PGF FI+RLK LA  +K W+     S    K+ I  E+D ID  E    L       R +LKA+L + +L E+++W QR 
Subjt:  NKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRC

Query:  KKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEG----------ISIVSDF-----------------------------------------------
        KKLWL + DENSAFFH+IC++R++RN IHE+  +EG          ++ V+ F                                               
Subjt:  KKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEG----------ISIVSDF-----------------------------------------------

Query:  ---------------------LLEKEVIDHFADIYD-----YNQNSEWIIV-------------------------------------------------
                             LL+++++D F D ++      N N+ +I +                                                 
Subjt:  ---------------------LLEKEVIDHFADIYD-----YNQNSEWIIV-------------------------------------------------

Query:  ---------------------------------------NLNWE--------------------------------------------------------
                                               NLNW                                                         
Subjt:  ---------------------------------------NLNWE--------------------------------------------------------

Query:  ------------------------------------------------------------------------------------PINSVSA---------
                                                                                            PI  +S          
Subjt:  ------------------------------------------------------------------------------------PINSVSA---------

Query:  -----------RGS-----TNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKL
                   +GS     ++L+ W IV+  K EGGLGI +++ TN ALL KW+WR+++E N+LWR  I+ KY   H    PS+   SSS++PW +I   
Subjt:  -----------RGS-----TNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKL

Query:  QDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVS
         D F +N  WD+ NG    FW+ NWS  G L     RL+ LS +K  SI +VW+S++  W    RR L DR+L  W +  E LP     RG     W+  
Subjt:  QDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVS

Query:  GDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNK
            F+  SA++ +S  P R   +P  K+LN +W   +P KIK F+W L  R +NT +        +L  P+ C+LC K SE   HLF+HC       + 
Subjt:  GDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNK

Query:  INLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVF---IATLWLLWNERNRRIFEDKARTRN--QLWEDIVSLAALWATKSKVFSDYSASHIALNW
        ++ +L  + +     D F A      +++Q     + VF   IA LW +W ERN RIF+  +  ++   LWED   L   W ++   F +YSA+ IALN 
Subjt:  INLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVF---IATLWLLWNERNRRIFEDKARTRN--QLWEDIVSLAALWATKSKVFSDYSASHIALNW

Query:  KSF
         +F
Subjt:  KSF

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein; e value: 5.6e-111; % identity: 27.77
Query:  SDSLFIETSDSIL---CETKLCNTNKRIIKSLWSSISVNWIALDAYGSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRK
        +DS    TS ++L     + L  TNKRIIKSLW S S+NWIA +A GSSGGI+I+WD    ++    +GLFS+S    L++  SWWL+G+YGP  R++R 
Subjt:  SDSLFIETSDSIL---CETKLCNTNKRIIKSLWSSISVNWIALDAYGSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRK

Query:  SFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLS
         FW EL++L  L    W+LGGD NV R   E++S   +  +    N FIS+  L+DPPL N  +TW+NLR+ P  SR+DRFL+++SW   F+ H ++ L 
Subjt:  SFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLS

Query:  RCTSDHFPILLDDSSS--TWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESS
        R TSDHFP++ +DS+   +WGP PFR ++  L +  F  N+  WW ++   G+PGFSFI+RLK LA  +K W+     S    K+AI  E+D ID  E  
Subjt:  RCTSDHFPILLDDSSS--TWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESS

Query:  GSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWII
          L       R +LKADL E +L E+++W QR KKLWL + DENS+FFH+IC++R++R+ IHE+  +EG    ++  +    I  F+ IY  +  S+ + 
Subjt:  GSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWII

Query:  V-NLNWEPINS-----------------------------------------------------------------------------------------
        + NL+W PI S                                                                                         
Subjt:  V-NLNWEPINS-----------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------------------------VSARGSTN
                                                                                                    VS  G+ N
Subjt:  --------------------------------------------------------------------------------------------VSARGSTN

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------------------------LVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNE
                                                                     L+ W  VS SK EGGLGI ++  TN ALL KW+WR+ +E
Subjt:  -------------------------------------------------------------LVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNE

Query:  ENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMW
         N LWR  I  KY        PS+   S+S++PW +I    D F +N  WD+ NG    FW+ NWS  G L     RL+ L+ +K +S+ + W++ D  W
Subjt:  ENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMW

Query:  NFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRL
        N   RR L DR+   W +  E+LPTP   RGS    W+   +  F+  SA+ ++S    +    P  K+L  +W + IP KIK F+W L  R INT + +
Subjt:  NFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRL

Query:  QAIFQNSLHNPS
        Q    N+L  P+
Subjt:  QAIFQNSLHNPS

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein (e value: 9.8e-132; % identity: 28.39)
Query:  MWDELCCNITDHVKGLFSVSVLVTLSDGFS---WWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISD
        MWD+L  N+TD ++G FS+S+ +   DG S   WWLS IYGP+  + RKSFW EL DL   C   WLL GDFNV R  SETS+ NP+K SM  FNKFI+D
Subjt:  MWDELCCNITDHVKGLFSVSVLVTLSDGFS---WWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSMSKFNKFISD

Query:  TDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGF
        ++L+DPPL N  +TW+NLR  PV+SR+DRFL++ +W   F  H+SK LSR TSDHFPI+L+ S  +WGP PF+  N  L    F  N+ +WW +   EG 
Subjt:  TDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGF

Query:  PGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICT
        PGFSF+R+LK L+  ++N +  N     E K A   EID ID LE+ G+L +     R  LKAD+  +   EA+ W Q+ K+LW+++ DEN++FFHKIC+
Subjt:  PGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICT

Query:  ARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINS------------------------------------------
        AR+RR+ I  + + +G+   ++  + K  +DHF DIY    + S W+I NLNW PI++                                          
Subjt:  ARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADIY-DYNQNSEWIIVNLNWEPINS------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------------------------VSARGSTN------------------
                                                                                  V   G+ N                  
Subjt:  --------------------------------------------------------------------------VSARGSTN------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------------------------LVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHF
                                                   LV W  ++SSK +GGLGI ++K+TN ALL KW+WR+ +E++ LW+  IN KY SL  
Subjt:  -------------------------------------------LVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHF

Query:  ECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSE
           P     SSSRSPW +I K  + F  +  W I+NGRS  FWH +W    PL     RLY LS+NK  SI ++W+++   W+  PRR L + +L  W+E
Subjt:  ECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSE

Query:  FAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCW
            +     + G D   W ++ +GL+T  S +  L            +    NLW   IPKK   FIW+L + S+NT+++L          PS C++C 
Subjt:  FAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCW

Query:  KNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTRNQLWEDIVSLAALWAT
        +N ED  HLFI C  A      I+  L  S V   +    C  + + K  +++ ++  N + + LW +W ERN RIF  K +T  ++WEDI +LA LW +
Subjt:  KNSEDIDHLFIHCSRASFFRNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTRNQLWEDIVSLAALWAT

Query:  KSKVFSDYSASHIALNWKSF
        +S +FS+Y AS IALN  +F
Subjt:  KSKVFSDYSASHIALNWKSF

SwissProt top hits (e value, % identity, alignment)
P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 2.2e-19; % identity: 25.07)
Query:  SVSARGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAIS-KLQDIFFANFRWD
        S + +   +LV+W  V S K EGGLG+   K  N AL+ K  WR   E+N+LW   +  KY               S  S W +I+  L+D+      W 
Subjt:  SVSARGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAIS-KLQDIFFANFRWD

Query:  IRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSAR
          +G+   FW D W S  PL  + N   +  ++ +  +A+      R W+F    P +  +  R    A +L      R  D   W  S DG F+ +SA 
Subjt:  IRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSAR

Query:  AILSV--LPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSM
         +L+V  +P RP  +      N LW   +P+++K F+W + ++++ T +      +  L   +VC +C    E + H+   C           L + + +
Subjt:  AILSV--LPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFFRNKINLALGLSM

Query:  VPPATIDSFCA-DLF------TSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTRNQL
        VP      F +  LF               +    +F   +W  W  R   IF +  + R+++
Subjt:  VPPATIDSFCA-DLF------TSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTRNQL

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 1.4e-13; % identity: 24.37)
Query:  LLGGDFNVFRHSSETSSNNPAKLSM---SKFNKFISDTDLLDPPLINGPYTWTNLRSE-PVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFP--ILL
        +L GDF+    +S+  S     + M    +F   + D+DL+D P     YTW+N + + P++ +LDR + +  W   F    +       SDH P  I+L
Subjt:  LLGGDFNVFRHSSETSSNNPAKLSM---SKFNKFISDTDLLDPPLINGPYTWTNLRSE-PVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFP--ILL

Query:  DDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALES---SGSLDDMAKQL
        ++       C FR+ ++L  + +F+ ++   W +    G   FS    LK  A K K  KL N   F   +      +D +++++S   +   D + +  
Subjt:  DDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFPGFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALES---SGSLDDMAKQL

Query:  RKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADI
          + K      A LE+ ++ Q+ +  WL D D N+ FFHK+  A + +N I  L   + + + +   +++ ++ ++  +
Subjt:  RKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQIHELFTKEGISIVSDFLLEKEVIDHFADI

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 2.0e-12; % identity: 25.63)
Query:  VRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWH
        V W  V + K EGGLGI  +KE N              + + W                  S   +     W  I K + +     + DI NG +T FW 
Subjt:  VRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWH

Query:  DNWSSFGPLKFVCNRLYQLSSNK---NLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGD---GLFTTKSARAILSV
        DNWS  G       RL  ++ ++   ++ I    S ++ + N +PRR   D  L+     AE +       G D  RW  +GD     F TK   A    
Subjt:  DNWSSFGPLKFVCNRLYQLSSNK---NLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGD---GLFTTKSARAILSV

Query:  LPSRPFHSPGEKI--LNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHC
                P  K+     +W +    K  V  W      + T DR+ +    +    S C+LC    E  DHLF  C
Subjt:  LPSRPFHSPGEKI--LNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHC

AT3G25720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 5.5e-10; % identity: 34.91)
Query:  KAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLH------FECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNW
        KAEGGLG+    E N  L LK +WR F+   +LW ++  ++Y  L          F +S ++ S    W  + +L+ +     R +I NG +  FW DNW
Subjt:  KAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLH------FECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNW

Query:  SSFGPL
        + FGPL
Subjt:  SSFGPL

AT4G29090.1 Ribonuclease H-like superfamily protein (e value: 6.1e-17; % identity: 24.36)
Query:  WEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRS-PWHAISKLQDIFFANFRWDIRNGRSTLFWHD
        W+ +S  KAEGG+G   I+  N ALL K +WR  +   +L      ++Y     +  P ++ + S  S  W +I   Q+I     R  + NG   + W  
Subjt:  WEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLHFECFPSSSKVSSSRS-PWHAISKLQDIFFANFRWDIRNGRSTLFWHD

Query:  NWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSD------RMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVL
         W    P      R+ ++   +  S++ +   SD      R W       LF  +++R     EL   P  +R  D   W  +  G +T KS   +L+ +
Subjt:  NWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSD------RMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFTTKSARAILSVL

Query:  PSRPFHSPGE-------KILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHC--SRASFFRNKINLALGLS
         ++   SP E        I   +W +    KI+ F+W     S+  +    A+    L   S CI C    E ++HL   C  +R ++  + I + LG  
Subjt:  PSRPFHSPGE-------KILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHC--SRASFFRNKINLALGLS

Query:  MVPPATIDSFCADLF--TSKAISQRQLLRRNVFIA-TLWLLWNERNRRIFEDK
               DS   +L+   +      Q  + +  +   LW LW  RN  +F  +
Subjt:  MVPPATIDSFCADLF--TSKAISQRQLLRRNVFIA-TLWLLWNERNRRIFEDK

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein (e value: 1.2e-12; % identity: 23.57)
Query:  RWDIRNGRSTLFWHDNWSSFGPLKFVCNRL--YQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFT
        R D+ NG S  FW+D W+ FG L          QL   ++  + E   + D  W     R     + Q +     + P P+  RG D   W  +      
Subjt:  RWDIRNGRSTLFWHDNWSSFGPLKFVCNRL--YQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPNPQRGSDIRRWMVSGDGLFT

Query:  TKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRA----SFFRNKIN
        + S+R     +     HSP       +W  +   +  +  W  F   + T DRL+    N    PS  +LC    E   HLF  CS +     FF +K  
Subjt:  TKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRA----SFFRNKIN

Query:  LALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTRNQL
         +      PP  + +  + +      S    + + +  + ++ +W ERN RIF   + + + L
Subjt:  LALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTRNQL


Sequences
CDS sequence
ATGGAAGATAAAGGAATCTCAAAAAACCTGTTGTCGACGTCGAACGACAAAGGGAGCACCGAAGAGAACCCTGCCGCAACTCGTGCTAAGAAGTGCGCTACCTTGTTGCT
TGCCCTAGGGCATTTGGAAAAGGATACCACCTTTGCTTTCTCCGCTAAAGAGACGATTTCATTCAGCACCACTCTCCCCGCTTCGACAAGAGATCTTGAGGAATCACGGA
TGATCCAACCTACTCCACTGGTCTCTGATTCTTTGTTCATCGAGACATCAGATTCCATATTATGTGAAACTAAATTGTGCAACACAAATAAGCGCATCATTAAATCTTTG
TGGAGTTCCATTAGTGTTAATTGGATTGCTCTTGATGCCTATGGATCCTCTGGAGGTATTATTATTATGTGGGATGAATTATGTTGCAACATCACTGATCATGTCAAAGG
TCTGTTCTCTGTTTCTGTTCTCGTTACTCTCTCAGATGGCTTCTCTTGGTGGTTGTCCGGTATTTATGGCCCTGCGAGTAGGAAAAAACGTAAGTCTTTTTGGAGAGAAC
TTTATGATCTTCACGGCTTATGTGGTGACTGTTGGCTTCTTGGTGGTGATTTTAATGTTTTCAGGCATTCTTCGGAAACATCCTCCAACAATCCTGCCAAATTAAGTATG
TCGAAGTTTAACAAATTCATTTCGGACACTGACCTCCTCGACCCGCCGCTTATTAATGGTCCATATACTTGGACAAATCTTCGCAGTGAGCCTGTTATGTCCCGTCTTGA
CAGATTTCTTTTCTCTAATAGCTGGTGTATTAAATTTAACGATCATCATTCGAAGAAGCTTTCTCGTTGCACATCAGATCATTTTCCTATTCTTCTGGATGATTCTTCTT
CTACTTGGGGCCCTTGTCCTTTTCGTTTCGACAATTATCTTCTGGACAATAAATCCTTCATTGGCAATGTTGAGCATTGGTGGACTGATACTTTCTGTGAGGGTTTTCCT
GGATTTTCTTTTATTCGCAGACTAAAGTTCTTAGCAAGGAAAGTCAAAAACTGGAAGCTTTCCAACACAGATTCCTTCAAGGAAAAGAAAAAGGCCATTTCCAACGAGAT
TGATCGCATTGATGCTCTTGAATCTTCGGGTTCTTTGGACGATATGGCAAAGCAACTTAGAAAATCTCTTAAAGCTGATCTGCAAGAAACAGCTCTCCTTGAAGCCCGTT
ATTGGAATCAGCGTTGTAAAAAGCTTTGGCTCAGCGATAGAGATGAGAACTCTGCTTTTTTCCATAAAATTTGTACTGCTCGCCGCCGAAGAAACCAAATCCATGAGTTA
TTTACAAAAGAAGGTATTAGCATTGTTTCTGATTTTTTGCTGGAAAAGGAAGTAATCGATCATTTTGCGGATATTTATGACTACAATCAGAATTCTGAATGGATTATTGT
CAATCTTAATTGGGAACCAATTAATTCTGTTTCTGCTAGAGGCTCCACTAATCTTGTGAGATGGGAAATTGTTTCTTCTTCTAAAGCTGAAGGTGGTCTTGGCATTCACA
AAATTAAAGAGACTAACGACGCTCTTCTCCTTAAGTGGATCTGGCGTTTCTTCAATGAGGAAAACACTCTTTGGAGGAATTTCATCAACAATAAATATTCCAGTCTTCAT
TTTGAGTGCTTTCCTTCAAGCAGCAAAGTCTCCAGCTCCAGATCTCCTTGGCATGCTATCTCTAAGCTTCAAGACATTTTCTTTGCCAATTTCAGATGGGATATTCGCAA
TGGTCGCTCTACTCTGTTTTGGCATGATAATTGGTCTTCCTTTGGCCCCCTAAAATTTGTCTGCAACCGTTTATATCAGCTATCATCCAACAAAAATCTCTCTATAGCTG
AAGTTTGGTCTTCTTCGGACAGAATGTGGAACTTTCAGCCCCGAAGACCTTTGTTCGATAGAGATTTACAAAGGTGGAGCGAATTTGCGGAATTATTGCCCACCCCTAAT
CCTCAAAGGGGTTCGGATATTCGGCGTTGGATGGTATCTGGAGATGGTCTATTTACTACTAAATCTGCGCGCGCTATTTTATCTGTTCTCCCTTCAAGACCGTTTCACAG
TCCTGGAGAAAAAATTCTCAACAATCTTTGGACCGCTGACATTCCAAAGAAAATCAAGGTCTTCATTTGGTCTCTCTTTCATCGTAGCATAAATACGTCGGATAGACTTC
AAGCGATTTTTCAGAATTCTCTTCACAACCCATCAGTCTGCATTCTTTGTTGGAAGAATTCTGAAGATATTGATCATCTTTTCATCCACTGTAGTCGCGCATCCTTTTTC
AGGAACAAAATCAACCTTGCTTTGGGCCTCTCTATGGTTCCTCCGGCAACTATAGATTCTTTTTGCGCTGATTTGTTTACATCCAAAGCTATTTCGCAAAGGCAATTGCT
CAGAAGAAACGTTTTCATAGCTACCCTTTGGTTATTATGGAACGAGCGTAATCGCCGTATTTTTGAAGATAAAGCTCGCACTCGAAATCAACTCTGGGAGGACATCGTCT
CTCTTGCTGCTCTTTGGGCTACGAAATCCAAAGTTTTCTCTGATTATAGTGCTTCTCATATTGCTTTAAATTGGAAATCTTTTCTGTAG
mRNA sequence
ATGGAAGATAAAGGAATCTCAAAAAACCTGTTGTCGACGTCGAACGACAAAGGGAGCACCGAAGAGAACCCTGCCGCAACTCGTGCTAAGAAGTGCGCTACCTTGTTGCT
TGCCCTAGGGCATTTGGAAAAGGATACCACCTTTGCTTTCTCCGCTAAAGAGACGATTTCATTCAGCACCACTCTCCCCGCTTCGACAAGAGATCTTGAGGAATCACGGA
TGATCCAACCTACTCCACTGGTCTCTGATTCTTTGTTCATCGAGACATCAGATTCCATATTATGTGAAACTAAATTGTGCAACACAAATAAGCGCATCATTAAATCTTTG
TGGAGTTCCATTAGTGTTAATTGGATTGCTCTTGATGCCTATGGATCCTCTGGAGGTATTATTATTATGTGGGATGAATTATGTTGCAACATCACTGATCATGTCAAAGG
TCTGTTCTCTGTTTCTGTTCTCGTTACTCTCTCAGATGGCTTCTCTTGGTGGTTGTCCGGTATTTATGGCCCTGCGAGTAGGAAAAAACGTAAGTCTTTTTGGAGAGAAC
TTTATGATCTTCACGGCTTATGTGGTGACTGTTGGCTTCTTGGTGGTGATTTTAATGTTTTCAGGCATTCTTCGGAAACATCCTCCAACAATCCTGCCAAATTAAGTATG
TCGAAGTTTAACAAATTCATTTCGGACACTGACCTCCTCGACCCGCCGCTTATTAATGGTCCATATACTTGGACAAATCTTCGCAGTGAGCCTGTTATGTCCCGTCTTGA
CAGATTTCTTTTCTCTAATAGCTGGTGTATTAAATTTAACGATCATCATTCGAAGAAGCTTTCTCGTTGCACATCAGATCATTTTCCTATTCTTCTGGATGATTCTTCTT
CTACTTGGGGCCCTTGTCCTTTTCGTTTCGACAATTATCTTCTGGACAATAAATCCTTCATTGGCAATGTTGAGCATTGGTGGACTGATACTTTCTGTGAGGGTTTTCCT
GGATTTTCTTTTATTCGCAGACTAAAGTTCTTAGCAAGGAAAGTCAAAAACTGGAAGCTTTCCAACACAGATTCCTTCAAGGAAAAGAAAAAGGCCATTTCCAACGAGAT
TGATCGCATTGATGCTCTTGAATCTTCGGGTTCTTTGGACGATATGGCAAAGCAACTTAGAAAATCTCTTAAAGCTGATCTGCAAGAAACAGCTCTCCTTGAAGCCCGTT
ATTGGAATCAGCGTTGTAAAAAGCTTTGGCTCAGCGATAGAGATGAGAACTCTGCTTTTTTCCATAAAATTTGTACTGCTCGCCGCCGAAGAAACCAAATCCATGAGTTA
TTTACAAAAGAAGGTATTAGCATTGTTTCTGATTTTTTGCTGGAAAAGGAAGTAATCGATCATTTTGCGGATATTTATGACTACAATCAGAATTCTGAATGGATTATTGT
CAATCTTAATTGGGAACCAATTAATTCTGTTTCTGCTAGAGGCTCCACTAATCTTGTGAGATGGGAAATTGTTTCTTCTTCTAAAGCTGAAGGTGGTCTTGGCATTCACA
AAATTAAAGAGACTAACGACGCTCTTCTCCTTAAGTGGATCTGGCGTTTCTTCAATGAGGAAAACACTCTTTGGAGGAATTTCATCAACAATAAATATTCCAGTCTTCAT
TTTGAGTGCTTTCCTTCAAGCAGCAAAGTCTCCAGCTCCAGATCTCCTTGGCATGCTATCTCTAAGCTTCAAGACATTTTCTTTGCCAATTTCAGATGGGATATTCGCAA
TGGTCGCTCTACTCTGTTTTGGCATGATAATTGGTCTTCCTTTGGCCCCCTAAAATTTGTCTGCAACCGTTTATATCAGCTATCATCCAACAAAAATCTCTCTATAGCTG
AAGTTTGGTCTTCTTCGGACAGAATGTGGAACTTTCAGCCCCGAAGACCTTTGTTCGATAGAGATTTACAAAGGTGGAGCGAATTTGCGGAATTATTGCCCACCCCTAAT
CCTCAAAGGGGTTCGGATATTCGGCGTTGGATGGTATCTGGAGATGGTCTATTTACTACTAAATCTGCGCGCGCTATTTTATCTGTTCTCCCTTCAAGACCGTTTCACAG
TCCTGGAGAAAAAATTCTCAACAATCTTTGGACCGCTGACATTCCAAAGAAAATCAAGGTCTTCATTTGGTCTCTCTTTCATCGTAGCATAAATACGTCGGATAGACTTC
AAGCGATTTTTCAGAATTCTCTTCACAACCCATCAGTCTGCATTCTTTGTTGGAAGAATTCTGAAGATATTGATCATCTTTTCATCCACTGTAGTCGCGCATCCTTTTTC
AGGAACAAAATCAACCTTGCTTTGGGCCTCTCTATGGTTCCTCCGGCAACTATAGATTCTTTTTGCGCTGATTTGTTTACATCCAAAGCTATTTCGCAAAGGCAATTGCT
CAGAAGAAACGTTTTCATAGCTACCCTTTGGTTATTATGGAACGAGCGTAATCGCCGTATTTTTGAAGATAAAGCTCGCACTCGAAATCAACTCTGGGAGGACATCGTCT
CTCTTGCTGCTCTTTGGGCTACGAAATCCAAAGTTTTCTCTGATTATAGTGCTTCTCATATTGCTTTAAATTGGAAATCTTTTCTGTAG
Protein sequence
MEDKGISKNLLSTSNDKGSTEENPAATRAKKCATLLLALGHLEKDTTFAFSAKETISFSTTLPASTRDLEESRMIQPTPLVSDSLFIETSDSILCETKLCNTNKRIIKSL
WSSISVNWIALDAYGSSGGIIIMWDELCCNITDHVKGLFSVSVLVTLSDGFSWWLSGIYGPASRKKRKSFWRELYDLHGLCGDCWLLGGDFNVFRHSSETSSNNPAKLSM
SKFNKFISDTDLLDPPLINGPYTWTNLRSEPVMSRLDRFLFSNSWCIKFNDHHSKKLSRCTSDHFPILLDDSSSTWGPCPFRFDNYLLDNKSFIGNVEHWWTDTFCEGFP
GFSFIRRLKFLARKVKNWKLSNTDSFKEKKKAISNEIDRIDALESSGSLDDMAKQLRKSLKADLQETALLEARYWNQRCKKLWLSDRDENSAFFHKICTARRRRNQIHEL
FTKEGISIVSDFLLEKEVIDHFADIYDYNQNSEWIIVNLNWEPINSVSARGSTNLVRWEIVSSSKAEGGLGIHKIKETNDALLLKWIWRFFNEENTLWRNFINNKYSSLH
FECFPSSSKVSSSRSPWHAISKLQDIFFANFRWDIRNGRSTLFWHDNWSSFGPLKFVCNRLYQLSSNKNLSIAEVWSSSDRMWNFQPRRPLFDRDLQRWSEFAELLPTPN
PQRGSDIRRWMVSGDGLFTTKSARAILSVLPSRPFHSPGEKILNNLWTADIPKKIKVFIWSLFHRSINTSDRLQAIFQNSLHNPSVCILCWKNSEDIDHLFIHCSRASFF
RNKINLALGLSMVPPATIDSFCADLFTSKAISQRQLLRRNVFIATLWLLWNERNRRIFEDKARTRNQLWEDIVSLAALWATKSKVFSDYSASHIALNWKSFL
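
The protein sequence above corresponds to a codon-by-codon translation of the CDS sequence. As a minimal sketch of that relationship, assuming the standard nuclear genetic code applies, the snippet below translates the first nine codons of the CDS. The function name `translate` and the variable `cds_start` are illustrative and not part of the database record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence into one-letter amino acid codes.

    Whitespace is ignored, unknown codons become 'X', and any trailing
    bases that do not form a full codon are dropped.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*" and to_stop:
            break  # stop codon: end of the open reading frame
        protein.append(aa)
    return "".join(protein)


# First 27 nucleotides of the CDS sequence listed above.
cds_start = "ATGGAAGATAAAGGAATCTCAAAAAAC"
print(translate(cds_start))  # MEDKGISKN
```

Pasting the full CDS into `translate` should give the full protein for comparison with the listing above; the final `TAG` codon is a stop and is not emitted when `to_stop` is true.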