; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg005966 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg005966
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationscaffold11:65354..73840
RNA-Seq ExpressionSpg005966
SyntenySpg005966
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR005135 - Endonuclease/exonuclease/phosphatase
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN72727.1 hypothetical protein VITISV_015094 [Vitis vinifera]2.3e-9128.32Show/hide
Query:  GVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFN
        G+ GGIVILW+   F   E V G + +++ L+  +  SFW+T VYG N    R+ FWLEL DL  L  P W +GGDFNV R   EK   S  T  M+ F+
Subjt:  GVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFN

Query:  RFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYLISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNN
         FI +S L D PL N  +TWS+ + +P    +DR+L S      FS +    L R TSDH PI L     KWG  PF+F N WL H  F      WW+  
Subjt:  RFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYLISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNN

Query:  LSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYSLNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFS
          +GW GH F+ KLK +K + K+WN   FG+ ++++  + SEL  +D  E+ G L+      RT  +  L  L   EE+ WRQK ++KW++EGD NS F 
Subjt:  LSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYSLNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFS

Query:  HRVMTAHKRKNSIMEILAESGASLTCDDDIEKE-----------------------EAPLPGES----NRRRRSPEVA-------AREPHAPSEISAAAA
        HRV    + +  I  +++E G +L+  + I +E                        AP+ GES    NR     EV          +   P + + A  
Subjt:  HRVMTAHKRKNSIMEILAESGASLTCDDDIEKE-----------------------EAPLPGES----NRRRRSPEVA-------AREPHAPSEISAAAA

Query:  TR--------------RSRLLQFACSAQFRFV--------------------------------------------------------------------
        +R              R  L +    +Q  FV                                                                    
Subjt:  TR--------------RSRLLQFACSAQFRFV--------------------------------------------------------------------

Query:  -------------FVEFTPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHPLYSH
                     +V+ +   I   IEK+ R FLW G  + K  HL++W  +  P + GGLG      +N++LL KW WRF  E   LW + ++  +Y  
Subjt:  -------------FVEFTPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHPLYSH

Query:  HSHRYSIYRLRTAKIV---------------------DLW-----------------------------NSTEGAWNLHLRRRLRDSEIMEWALLSHHLS
        H + +    +  A++                      DLW                             NS+  +WN +  R L DSEI     L   LS
Subjt:  HSHRYSIYRLRTAKIV---------------------DLW-----------------------------NSTEGAWNLHLRRRLRDSEIMEWALLSHHLS

Query:  TFSFRDVE-DTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHL
        +  F     D+  W L+ +G+FS  +    L+  S P        LW   +P KVK   W ++H  +NT D +Q   P  +L P  C +C R  ES  H+
Subjt:  TFSFRDVE-DTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHL

Query:  FSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTL--LGHPFKKDTYLLWRNFLYAFFWNLWLERNDRIFNSKHKNIQAFIESTSYLAMYWSSQISPFC
        F +C      W  + +  G  +  P  +  +L  T   LG+  +  T  LW+       W +W ERN+ IF  K +  +   +   + +  W+S I  F 
Subjt:  FSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTL--LGHPFKKDTYLLWRNFLYAFFWNLWLERNDRIFNSKHKNIQAFIESTSYLAMYWSSQISPFC

Query:  NYPLSSLISQW
          PL+ L   W
Subjt:  NYPLSSLISQW

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]1.7e-9124.06Show/hide
Query:  KVKVVGSIMIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLS
        +++V    M IISWN RG+GS KKR ++KDF+ S  P +V+ QETK    +R+ + S+W++RN  W ++ A G +GGI+I+W+      +E++ G +++S
Subjt:  KVKVVGSIMIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLS

Query:  IHLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPT
        I  +L    S W++ VYG N+S  R+  W+EL D+  L  P W +GGDFNV R + EK   S  T +MK F+ FI D +L D+PL +  +TWS+ + NP 
Subjt:  IHLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPT

Query:  MTLIDRYLISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSV
           +DR+L S+     F  +    L R TSDH+PI L     KWGP PF+F N WL H SF      WW+     GW GH F+ KL+ +K +LK WN++ 
Subjt:  MTLIDRYLISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSV

Query:  FGNTKQQRYSLNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDD
        FG   +++  + S L + D  E+ G LS +   +R   K  L  L   EEI WRQK ++KW++EGD NS F H+V    + +  I E+  E+G  +   +
Subjt:  FGNTKQQRYSLNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDD

Query:  DIEKE-----------------------EAPLPGESNRRRRSP---------------------------------------------------------
         I++E                        +P+ GES  R  SP                                                         
Subjt:  DIEKE-----------------------EAPLPGESNRRRRSP---------------------------------------------------------

Query:  ------------------------------------------EVAAREPHAP------------------------------------------------
                                                  EV     H+                                                 
Subjt:  ------------------------------------------EVAAREPHAP------------------------------------------------

Query:  -----------------------SEISAA----------------------------------------------------AATRRSRL--LQFA-----
                               S +S A                                                        R+R+  LQFA     
Subjt:  -----------------------SEISAA----------------------------------------------------AATRRSRL--LQFA-----

Query:  ----------------------------------------------------CSAQ----------------------------------FRFVFVEF--
                                                            C A                                   ++  ++ F  
Subjt:  ----------------------------------------------------CSAQ----------------------------------FRFVFVEF--

Query:  ---------------------TPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHP
                              P  +   IE++ R FLW G  + K  HL+ W  +  P   GGLG   I  +NV+LL KW WR+  E  ALW ++++  
Subjt:  ---------------------TPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHP

Query:  LYSHH-----------SHR---------YSIYRLRTAKIV----------DLW-----------------------------NSTEGAWNLHLRRRLRDS
          SH            SHR         Y  +   T  +V          DLW                             ++   +WN   RR L DS
Subjt:  LYSHH-----------SHR---------YSIYRLRTAKIV----------DLW-----------------------------NSTEGAWNLHLRRRLRDS

Query:  EIMEWALLSHHLSTFSF-RDVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCC
        EI +   L            V D   W L+ +G+F+  +    L+  S+         +W   +P KVK F+W ++H  +NT D++Q R P  +LSP+ C
Subjt:  EIMEWALLSHHLSTFSF-RDVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCC

Query:  SMCYRAEESQIHLFSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLWRNFLYAFFWNLWLERNDRIFNSKHKNIQAFIESTSYLA
         +C +  E+  HLF +C      W  +  +    +  P  I  +L     G  F K   +LW+N   A  W +W ERN RIF  K +N +   +S  +L 
Subjt:  SMCYRAEESQIHLFSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLWRNFLYAFFWNLWLERNDRIFNSKHKNIQAFIESTSYLA

Query:  MYWSSQISPFCNYPLSSLISQW
         +W+     F   PL+ L   W
Subjt:  MYWSSQISPFCNYPLSSLISQW

RVW91038.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]5.1e-9125.11Show/hide
Query:  GMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADGYSFWITGVY
        G+GS KKR ++K+F+SS  P +V++QETK    +R+++ S+WS RN  W ++ A G +GGI+I+W+      +E+V G +++SI  ++    S W++ VY
Subjt:  GMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADGYSFWITGVY

Query:  GSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYLISDSIVTKF
        G N+S  R+ FW+EL D+  L  P W +GGDFNV R + EK   S  T  MK F+ FI D +L D PL +  YTWS+ + NP    +DR+L S+     F
Subjt:  GSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYLISDSIVTKF

Query:  SVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYSLNSELSD
          +    L R TSDH+PI L     KWGP PF+F N WL H SF      WW      GW GH F+ KL+ +K +LK+WN++ FG   +++  + + L++
Subjt:  SVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYSLNSELSD

Query:  LDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDDDIEKE------------
         D  E+ G LS++   +R   K  L  L   EEI WRQK ++KW++EGD NS F H+V    + +  I E+  ESG  L   + I++E            
Subjt:  LDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDDDIEKE------------

Query:  -----------EAPLPGESNRRRRSP--------------------------------------------------------------------------
                    +P+ GES  R  SP                                                                          
Subjt:  -----------EAPLPGESNRRRRSP--------------------------------------------------------------------------

Query:  -------------EV---------------------------------------------------------------------AAR-----EPHAPSEI
                     E+                                                                     A+R     +P +P   
Subjt:  -------------EV---------------------------------------------------------------------AAR-----EPHAPSEI

Query:  SAAA---------------------ATRRSRL--LQFACSAQF------------RFVFVEF--------------------------------------
        +  A                        R+R+  LQFA    F            + V + F                                      
Subjt:  SAAA---------------------ATRRSRL--LQFACSAQF------------RFVFVEF--------------------------------------

Query:  -------------------------------------TPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWR
                                              P  +   IE++ R FLW G  + K  HL+ W  +  P   GGLG   I  +NV+LL KW WR
Subjt:  -------------------------------------TPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWR

Query:  FYHEPEALWRRLMVHPLYSHHSHRYSI---------------------YRLRTAKIV----------DLW-----------------------------N
        +  E  ALW ++++  +Y  HS+ + +                     +   T  +V          DLW                              
Subjt:  FYHEPEALWRRLMVHPLYSHHSHRYSI---------------------YRLRTAKIV----------DLW-----------------------------N

Query:  STEGAWNLHLRRRLRDSEIMEWALLSHHLSTFSFR-DVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPKKVKFFIWELSHACINTA
        S   +WN + RR L DSEI +   L   L        V D   W ++ +G+F+  +    L+ +S          +W   +P KVK F+W ++H  +NT 
Subjt:  STEGAWNLHLRRRLRDSEIMEWALLSHHLSTFSFR-DVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPKKVKFFIWELSHACINTA

Query:  DIIQRRFPSSSLSPNCCSMCYRAEESQIHLFSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLWRNFLYAFFWNLWLERNDRIF
        D++Q R P  +LSPN C +C +  E+  HLF +C      W  +       +  P  I  +      G    K   +LW++   A  W +W ERN RIF
Subjt:  DIIQRRFPSSSLSPNCCSMCYRAEESQIHLFSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLWRNFLYAFFWNLWLERNDRIF

RVX15530.1 putative ribonuclease H protein [Vitis vinifera]2.0e-10328.41Show/hide
Query:  IISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADGYS
        I+SWN RG+GS KKR +++ F+S+ NP +V+LQETK    +R+ + S+W  + + W ++ A G +GGIVILW+ S F+  E V G +++++  +  +  S
Subjt:  IISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADGYS

Query:  FWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYLIS
        FW+T VYG  +   R+ FWLEL DL  L  P W +GGDFNV R   EK   +  T  M+ F+ FI +S L D PL N  +TWS+ + +P    +DR+L S
Subjt:  FWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYLIS

Query:  DSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYS
            T FS +    L R TSDH PI L     KWGP PF+F N WL H  F     +WW+    +GW GH F+ KLK +K +LK+WN   FG+ K+++  
Subjt:  DSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYS

Query:  LNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHK-RKNSI--MEILAESGASL----TCDDDIE
        +  +LS +D  E+ G L+      RT  +  L  +   EE+ WRQK ++KW++EGD NS F HRV T    R   I  + I  ESG  L    T ++D+ 
Subjt:  LNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHK-RKNSI--MEILAESGASL----TCDDDIE

Query:  K----------------------------------------EEAPLPGES------------NRRRRSPE------------------------------
        +                                        + A + G              + +RRS E                              
Subjt:  K----------------------------------------EEAPLPGES------------NRRRRSPE------------------------------

Query:  ----------------------------------VAAREPHAPSEISAAAATRRSRLLQFACSAQ----FRFVFVEF----------------------T
                                          +   +P +P   +  A      +     S +     + + + F                       
Subjt:  ----------------------------------VAAREPHAPSEISAAAATRRSRLLQFACSAQ----FRFVFVEF----------------------T

Query:  PK------------------KITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHPLYSH
        PK                   I   IEK+ R FLW G  + K  HL++W  +  P + GGLG   I  +N++LL KW WRF  E   LW +++V  +Y  
Subjt:  PK------------------KITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHPLYSH

Query:  HSHRYSIYRLRTAKIVDLWNSTEGAWNLHLRRRLRDSEIMEWALLSHHLSTFSFRDVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPM
        H + +       A +V  W S    W   + +  ++  +    ++ +       +       W L+ +G+FS  +    L+  S P        LW   +
Subjt:  HSHRYSIYRLRTAKIVDLWNSTEGAWNLHLRRRLRDSEIMEWALLSHHLSTFSFRDVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPM

Query:  PKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHLFSNCEFASAFWD--FIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLW
        P KVK   W ++H  +NT D +Q R P  SL P  C +C    ES  HLF  C      W+  F  +   W   R  + + ++ +  LG+  +  T  LW
Subjt:  PKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHLFSNCEFASAFWD--FIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLW

Query:  RNFLYAFFWNLWLERNDRIFNSKHKNIQAFIESTSYLAMYWSSQISPFCNYPLSSLISQWR
        +       W +W ERN+RIF  K ++ +   +   + +  W+S  + F   PL+ L   WR
Subjt:  RNFLYAFFWNLWLERNDRIFNSKHKNIQAFIESTSYLAMYWSSQISPFCNYPLSSLISQWR

XP_022158956.1 uncharacterized protein LOC111025405 [Momordica charantia]2.2e-10245.59Show/hide
Query:  MIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADG
        M  ++WNVRG+ SWKK ALIK FIS  NP++VILQETK++ ++  I+KS+WS+  I W+++DA G+A GI+ILWN+      E++EG+++L+I+  L+DG
Subjt:  MIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADG

Query:  YSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYL
        + FW++G+YG ++++   LFW EL+DL  LC  +WIL GDFNVTRW+WEK      T++M  FN FIEDS L D+PL+NG++TWS    N + +LID +L
Subjt:  YSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYL

Query:  ISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQR
        +++  + K  +  A+R+ R TSDHFPI L  G+  WG  PF+F N WLSH +F   ++ WW N    GWPGHG + KLK LK  +K W    F     Q+
Subjt:  ISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQR

Query:  YSLNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDDDIEKE
          L + ++ LD  E    ++  ++  R + K  L+ + A EE  WRQ+CK KWL EGD N+ F HR +   +R++ I EIL++ G  LT   DIE+E
Subjt:  YSLNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDDDIEKE

TrEMBL top hitse value%identityAlignment
A0A438GDE7 LINE-1 retrotransposable element ORF2 protein8.4e-9224.06Show/hide
Query:  KVKVVGSIMIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLS
        +++V    M IISWN RG+GS KKR ++KDF+ S  P +V+ QETK    +R+ + S+W++RN  W ++ A G +GGI+I+W+      +E++ G +++S
Subjt:  KVKVVGSIMIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLS

Query:  IHLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPT
        I  +L    S W++ VYG N+S  R+  W+EL D+  L  P W +GGDFNV R + EK   S  T +MK F+ FI D +L D+PL +  +TWS+ + NP 
Subjt:  IHLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPT

Query:  MTLIDRYLISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSV
           +DR+L S+     F  +    L R TSDH+PI L     KWGP PF+F N WL H SF      WW+     GW GH F+ KL+ +K +LK WN++ 
Subjt:  MTLIDRYLISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSV

Query:  FGNTKQQRYSLNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDD
        FG   +++  + S L + D  E+ G LS +   +R   K  L  L   EEI WRQK ++KW++EGD NS F H+V    + +  I E+  E+G  +   +
Subjt:  FGNTKQQRYSLNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDD

Query:  DIEKE-----------------------EAPLPGESNRRRRSP---------------------------------------------------------
         I++E                        +P+ GES  R  SP                                                         
Subjt:  DIEKE-----------------------EAPLPGESNRRRRSP---------------------------------------------------------

Query:  ------------------------------------------EVAAREPHAP------------------------------------------------
                                                  EV     H+                                                 
Subjt:  ------------------------------------------EVAAREPHAP------------------------------------------------

Query:  -----------------------SEISAA----------------------------------------------------AATRRSRL--LQFA-----
                               S +S A                                                        R+R+  LQFA     
Subjt:  -----------------------SEISAA----------------------------------------------------AATRRSRL--LQFA-----

Query:  ----------------------------------------------------CSAQ----------------------------------FRFVFVEF--
                                                            C A                                   ++  ++ F  
Subjt:  ----------------------------------------------------CSAQ----------------------------------FRFVFVEF--

Query:  ---------------------TPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHP
                              P  +   IE++ R FLW G  + K  HL+ W  +  P   GGLG   I  +NV+LL KW WR+  E  ALW ++++  
Subjt:  ---------------------TPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHP

Query:  LYSHH-----------SHR---------YSIYRLRTAKIV----------DLW-----------------------------NSTEGAWNLHLRRRLRDS
          SH            SHR         Y  +   T  +V          DLW                             ++   +WN   RR L DS
Subjt:  LYSHH-----------SHR---------YSIYRLRTAKIV----------DLW-----------------------------NSTEGAWNLHLRRRLRDS

Query:  EIMEWALLSHHLSTFSF-RDVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCC
        EI +   L            V D   W L+ +G+F+  +    L+  S+         +W   +P KVK F+W ++H  +NT D++Q R P  +LSP+ C
Subjt:  EIMEWALLSHHLSTFSF-RDVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCC

Query:  SMCYRAEESQIHLFSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLWRNFLYAFFWNLWLERNDRIFNSKHKNIQAFIESTSYLA
         +C +  E+  HLF +C      W  +  +    +  P  I  +L     G  F K   +LW+N   A  W +W ERN RIF  K +N +   +S  +L 
Subjt:  SMCYRAEESQIHLFSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLWRNFLYAFFWNLWLERNDRIFNSKHKNIQAFIESTSYLA

Query:  MYWSSQISPFCNYPLSSLISQW
         +W+     F   PL+ L   W
Subjt:  MYWSSQISPFCNYPLSSLISQW

A0A438I2T6 Transposon TX1 uncharacterized 149 kDa protein2.4e-9125.11Show/hide
Query:  GMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADGYSFWITGVY
        G+GS KKR ++K+F+SS  P +V++QETK    +R+++ S+WS RN  W ++ A G +GGI+I+W+      +E+V G +++SI  ++    S W++ VY
Subjt:  GMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADGYSFWITGVY

Query:  GSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYLISDSIVTKF
        G N+S  R+ FW+EL D+  L  P W +GGDFNV R + EK   S  T  MK F+ FI D +L D PL +  YTWS+ + NP    +DR+L S+     F
Subjt:  GSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYLISDSIVTKF

Query:  SVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYSLNSELSD
          +    L R TSDH+PI L     KWGP PF+F N WL H SF      WW      GW GH F+ KL+ +K +LK+WN++ FG   +++  + + L++
Subjt:  SVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYSLNSELSD

Query:  LDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDDDIEKE------------
         D  E+ G LS++   +R   K  L  L   EEI WRQK ++KW++EGD NS F H+V    + +  I E+  ESG  L   + I++E            
Subjt:  LDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDDDIEKE------------

Query:  -----------EAPLPGESNRRRRSP--------------------------------------------------------------------------
                    +P+ GES  R  SP                                                                          
Subjt:  -----------EAPLPGESNRRRRSP--------------------------------------------------------------------------

Query:  -------------EV---------------------------------------------------------------------AAR-----EPHAPSEI
                     E+                                                                     A+R     +P +P   
Subjt:  -------------EV---------------------------------------------------------------------AAR-----EPHAPSEI

Query:  SAAA---------------------ATRRSRL--LQFACSAQF------------RFVFVEF--------------------------------------
        +  A                        R+R+  LQFA    F            + V + F                                      
Subjt:  SAAA---------------------ATRRSRL--LQFACSAQF------------RFVFVEF--------------------------------------

Query:  -------------------------------------TPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWR
                                              P  +   IE++ R FLW G  + K  HL+ W  +  P   GGLG   I  +NV+LL KW WR
Subjt:  -------------------------------------TPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWR

Query:  FYHEPEALWRRLMVHPLYSHHSHRYSI---------------------YRLRTAKIV----------DLW-----------------------------N
        +  E  ALW ++++  +Y  HS+ + +                     +   T  +V          DLW                              
Subjt:  FYHEPEALWRRLMVHPLYSHHSHRYSI---------------------YRLRTAKIV----------DLW-----------------------------N

Query:  STEGAWNLHLRRRLRDSEIMEWALLSHHLSTFSFR-DVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPKKVKFFIWELSHACINTA
        S   +WN + RR L DSEI +   L   L        V D   W ++ +G+F+  +    L+ +S          +W   +P KVK F+W ++H  +NT 
Subjt:  STEGAWNLHLRRRLRDSEIMEWALLSHHLSTFSFR-DVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPKKVKFFIWELSHACINTA

Query:  DIIQRRFPSSSLSPNCCSMCYRAEESQIHLFSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLWRNFLYAFFWNLWLERNDRIF
        D++Q R P  +LSPN C +C +  E+  HLF +C      W  +       +  P  I  +      G    K   +LW++   A  W +W ERN RIF
Subjt:  DIIQRRFPSSSLSPNCCSMCYRAEESQIHLFSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLWRNFLYAFFWNLWLERNDRIF

A0A438K2W1 Putative ribonuclease H protein9.6e-10428.41Show/hide
Query:  IISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADGYS
        I+SWN RG+GS KKR +++ F+S+ NP +V+LQETK    +R+ + S+W  + + W ++ A G +GGIVILW+ S F+  E V G +++++  +  +  S
Subjt:  IISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADGYS

Query:  FWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYLIS
        FW+T VYG  +   R+ FWLEL DL  L  P W +GGDFNV R   EK   +  T  M+ F+ FI +S L D PL N  +TWS+ + +P    +DR+L S
Subjt:  FWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYLIS

Query:  DSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYS
            T FS +    L R TSDH PI L     KWGP PF+F N WL H  F     +WW+    +GW GH F+ KLK +K +LK+WN   FG+ K+++  
Subjt:  DSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYS

Query:  LNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHK-RKNSI--MEILAESGASL----TCDDDIE
        +  +LS +D  E+ G L+      RT  +  L  +   EE+ WRQK ++KW++EGD NS F HRV T    R   I  + I  ESG  L    T ++D+ 
Subjt:  LNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHK-RKNSI--MEILAESGASL----TCDDDIE

Query:  K----------------------------------------EEAPLPGES------------NRRRRSPE------------------------------
        +                                        + A + G              + +RRS E                              
Subjt:  K----------------------------------------EEAPLPGES------------NRRRRSPE------------------------------

Query:  ----------------------------------VAAREPHAPSEISAAAATRRSRLLQFACSAQ----FRFVFVEF----------------------T
                                          +   +P +P   +  A      +     S +     + + + F                       
Subjt:  ----------------------------------VAAREPHAPSEISAAAATRRSRLLQFACSAQ----FRFVFVEF----------------------T

Query:  PK------------------KITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHPLYSH
        PK                   I   IEK+ R FLW G  + K  HL++W  +  P + GGLG   I  +N++LL KW WRF  E   LW +++V  +Y  
Subjt:  PK------------------KITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHPLYSH

Query:  HSHRYSIYRLRTAKIVDLWNSTEGAWNLHLRRRLRDSEIMEWALLSHHLSTFSFRDVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPM
        H + +       A +V  W S    W   + +  ++  +    ++ +       +       W L+ +G+FS  +    L+  S P        LW   +
Subjt:  HSHRYSIYRLRTAKIVDLWNSTEGAWNLHLRRRLRDSEIMEWALLSHHLSTFSFRDVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPM

Query:  PKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHLFSNCEFASAFWD--FIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLW
        P KVK   W ++H  +NT D +Q R P  SL P  C +C    ES  HLF  C      W+  F  +   W   R  + + ++ +  LG+  +  T  LW
Subjt:  PKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHLFSNCEFASAFWD--FIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLW

Query:  RNFLYAFFWNLWLERNDRIFNSKHKNIQAFIESTSYLAMYWSSQISPFCNYPLSSLISQWR
        +       W +W ERN+RIF  K ++ +   +   + +  W+S  + F   PL+ L   WR
Subjt:  RNFLYAFFWNLWLERNDRIFNSKHKNIQAFIESTSYLAMYWSSQISPFCNYPLSSLISQWR

A0A6J1E2G6 uncharacterized protein LOC1110254051.1e-10245.59Show/hide
Query:  MIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADG
        M  ++WNVRG+ SWKK ALIK FIS  NP++VILQETK++ ++  I+KS+WS+  I W+++DA G+A GI+ILWN+      E++EG+++L+I+  L+DG
Subjt:  MIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADG

Query:  YSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYL
        + FW++G+YG ++++   LFW EL+DL  LC  +WIL GDFNVTRW+WEK      T++M  FN FIEDS L D+PL+NG++TWS    N + +LID +L
Subjt:  YSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYL

Query:  ISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQR
        +++  + K  +  A+R+ R TSDHFPI L  G+  WG  PF+F N WLSH +F   ++ WW N    GWPGHG + KLK LK  +K W    F     Q+
Subjt:  ISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQR

Query:  YSLNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDDDIEKE
          L + ++ LD  E    ++  ++  R + K  L+ + A EE  WRQ+CK KWL EGD N+ F HR +   +R++ I EIL++ G  LT   DIE+E
Subjt:  YSLNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDDDIEKE

A5BV05 Uncharacterized protein1.1e-9128.32Show/hide
Query:  GVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFN
        G+ GGIVILW+   F   E V G + +++ L+  +  SFW+T VYG N    R+ FWLEL DL  L  P W +GGDFNV R   EK   S  T  M+ F+
Subjt:  GVAGGIVILWNESSFDVKEIVEGMYTLSIHLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFN

Query:  RFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYLISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNN
         FI +S L D PL N  +TWS+ + +P    +DR+L S      FS +    L R TSDH PI L     KWG  PF+F N WL H  F      WW+  
Subjt:  RFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLIDRYLISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNN

Query:  LSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYSLNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFS
          +GW GH F+ KLK +K + K+WN   FG+ ++++  + SEL  +D  E+ G L+      RT  +  L  L   EE+ WRQK ++KW++EGD NS F 
Subjt:  LSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYSLNSELSDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFS

Query:  HRVMTAHKRKNSIMEILAESGASLTCDDDIEKE-----------------------EAPLPGES----NRRRRSPEVA-------AREPHAPSEISAAAA
        HRV    + +  I  +++E G +L+  + I +E                        AP+ GES    NR     EV          +   P + + A  
Subjt:  HRVMTAHKRKNSIMEILAESGASLTCDDDIEKE-----------------------EAPLPGES----NRRRRSPEVA-------AREPHAPSEISAAAA

Query:  TR--------------RSRLLQFACSAQFRFV--------------------------------------------------------------------
        +R              R  L +    +Q  FV                                                                    
Subjt:  TR--------------RSRLLQFACSAQFRFV--------------------------------------------------------------------

Query:  -------------FVEFTPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHPLYSH
                     +V+ +   I   IEK+ R FLW G  + K  HL++W  +  P + GGLG      +N++LL KW WRF  E   LW + ++  +Y  
Subjt:  -------------FVEFTPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHPLYSH

Query:  HSHRYSIYRLRTAKIV---------------------DLW-----------------------------NSTEGAWNLHLRRRLRDSEIMEWALLSHHLS
        H + +    +  A++                      DLW                             NS+  +WN +  R L DSEI     L   LS
Subjt:  HSHRYSIYRLRTAKIV---------------------DLW-----------------------------NSTEGAWNLHLRRRLRDSEIMEWALLSHHLS

Query:  TFSFRDVE-DTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHL
        +  F     D+  W L+ +G+FS  +    L+  S P        LW   +P KVK   W ++H  +NT D +Q   P  +L P  C +C R  ES  H+
Subjt:  TFSFRDVE-DTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHL

Query:  FSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTL--LGHPFKKDTYLLWRNFLYAFFWNLWLERNDRIFNSKHKNIQAFIESTSYLAMYWSSQISPFC
        F +C      W  + +  G  +  P  +  +L  T   LG+  +  T  LW+       W +W ERN+ IF  K +  +   +   + +  W+S I  F 
Subjt:  FSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTL--LGHPFKKDTYLLWRNFLYAFFWNLWLERNDRIFNSKHKNIQAFIESTSYLAMYWSSQISPFC

Query:  NYPLSSLISQW
          PL+ L   W
Subjt:  NYPLSSLISQW

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein4.0e-0624.08Show/hide
Query:  SIMIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMA--NINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSF---DVKEIVEGMYTLSI
        S + I++ NV G+ S  KR  +  +I S +PS+  +QET +   + +R  IK  W      + +   +  AG  +++ +++ F    +K   EG Y + +
Subjt:  SIMIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMA--NINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSF---DVKEIVEGMYTLSI

Query:  HLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDI-----PLSNGKYTWSSFR
          S+       I  +Y  N+   R +  + L DLQ     + ++ GDFN      ++ T     +  ++ N  +  +DL DI     P S  +YT+ S  
Subjt:  HLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDI-----PLSNGKYTWSSFR

Query:  PNPTMTLIDRYLISDSIVTKFSVASARRLDRIT---SDHFPISLTL---GKEKWGPAPFKFNNAWLSHHSFHN----TVDIWWKNNLSQGWPGHGFIHKL
        P+ T + ID  + S ++++K      +R + IT   SDH  I L L      +     +K NN  L+ +  HN     + ++++ N ++           
Subjt:  PNPTMTLIDRYLISDSIVTKFSVASARRLDRIT---SDHFPISLTL---GKEKWGPAPFKFNNAWLSHHSFHN----TVDIWWKNNLSQGWPGHGFIHKL

Query:  KGL-KKELKQWNQSVFGNTKQQRYSLNSELSDLDKREELGRLSEQEAHRRTEI
        K + + +    N       + +  +L S+L +L+K+E+    +  +A RR EI
Subjt:  KGL-KKELKQWNQSVFGNTKQQRYSLNSELSDLDKREELGRLSEQEAHRRTEI

P0C2F6 Putative ribonuclease H protein At1g657505.0e-1723.74Show/hide
Query:  PKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHPLYSHHSHRYSIYRLRTAKIVDL
        P+ I   +++L R FLW   ++KK  HL+KWS +  P KEGGLG+      N +L++K  WR   E  +LW  ++          +Y +  +R ++    
Subjt:  PKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRLMVHPLYSHHSHRYSIYRLRTAKIVDL

Query:  WNSTEGAWNLHLRR---RLRD-------------SEIMEWA--------LL----SHHLSTFSFRDVEDTWI----------------------------
        W   +G+W+   R     LRD              +I  W         LL        +       +D WI                            
Subjt:  WNSTEGAWNLHLRR---RLRD-------------SEIMEWA--------LL----SHHLSTFSFRDVEDTWI----------------------------

Query:  ----------WHLNENGVFSTGTLTRNLASNSL--PNSTDFYSQLWKGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHLF
                  W  +++G FS  +    L  + +  PN   F++ LWK  +P++VK F+W + +  + T +   RR  S+S   N C +C    ES +H+ 
Subjt:  ----------WHLNENGVFSTGTLTRNLASNSL--PNSTDFYSQLWKGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHLF

Query:  SNCEFASAFW-----------DFIQSAFGWQFGRPGD
         +C      W            F +S F W +   GD
Subjt:  SNCEFASAFW-----------DFIQSAFGWQFGRPGD

P11369 LINE-1 retrotransposable element ORF2 protein2.3e-0624.64Show/hide
Query:  TAKKKVKVVGSIMIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSI-DAEGV---AGGIVILWNESSFD---V
        T   K+K   +   +IS N+ G+ S  KR  + D++   +P+   LQET +   +R  +      R   W +I  A G+   AG  +++ ++  F    +
Subjt:  TAKKKVKVVGSIMIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSI-DAEGV---AGGIVILWNESSFD---V

Query:  KEIVEGMYTLSIHLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDI-----P
        K+  EG + L     L +  S  I  +Y  N ++        L+ L+A   P+ I+ GDFN    + ++       R   K    ++  DL DI     P
Subjt:  KEIVEGMYTLSIHLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDI-----P

Query:  LSNGKYTWSSFRPNPTMTLIDRYLISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAP---FKFNNAWLS
         + G YT+ S  P+ T + ID  +   + + ++   +   +  I SDH  + L          P   +K NN  L+
Subjt:  LSNGKYTWSSFRPNPTMTLIDRYLISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAP---FKFNNAWLS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.8e-1730.04Show/hide
Query:  ILGGDFNVTRWTWEKFT---HSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFR-PNPTMTLIDRYLISDSIVTKFSVASARRLDRITSDHFPISLTL
        IL GDF+    T + ++    S P R +++F   + DSDL DIP     YTWS+ +  NP +  +DR + +    + F  A A       SDH P  + L
Subjt:  ILGGDFNVTRWTWEKFT---HSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFR-PNPTMTLIDRYLISDSIVTKFSVASARRLDRITSDHFPISLTL

Query:  -GKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGN----TKQQRYSLNSELSDL--DKREELGRLSEQEA
            K     F++ +   +H +F  ++ + W+  +  G         LK  KK  K  N+  FGN    TK+   SL S  S L  +  + L R+ E  A
Subjt:  -GKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGN----TKQQRYSLNSELSDL--DKREELGRLSEQEA

Query:  HRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDDDIEKE
         ++          +A  E  +RQK ++KWL++GD N+ F H+V+ A++ KN I          L  DDD+  E
Subjt:  HRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDDDIEKE

AT2G02520.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.0e-0426.97Show/hide
Query:  NSLPNSTDFYSQLW-KGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHLFSNCEFASAFWDFIQSAFGWQFGRPGDILSLL
        N +    D++  +W KG +PK   F  W      ++T D   R      + P  C  C   +E++ HLF +CEFA   W +  S       R      LL
Subjt:  NSLPNSTDFYSQLW-KGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHLFSNCEFASAFWDFIQSAFGWQFGRPGDILSLL

Query:  H---YTLLGHPFK-KDTYLLWRNFLYAFFWNLWLERNDRIFNSKHKNIQAFI
               L +P + K+   + R   +A  + +W ERN R+ +S  +   A I
Subjt:  H---YTLLGHPFK-KDTYLLWRNFLYAFFWNLWLERNDRIFNSKHKNIQAFI

AT3G09510.1 Ribonuclease H-like superfamily protein4.5e-0521.4Show/hide
Query:  DTWIWHLNENGVFSTGT---LTRNLASNSLP------NSTDFYSQLWKGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHL
        D  IW+ N  G ++  +   L  +  S ++P       S D  +++W  P+  K+K F+W      + T + +  R     + P+ C  C+R  ES  H 
Subjt:  DTWIWHLNENGVFSTGT---LTRNLASNSLP------NSTDFYSQLWKGPMPKKVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHL

Query:  FSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLWRNFLYA-------------FFWNLWLERNDRIFNSKHKN-----IQAFIES
           C FA+  W    S             SL+   L+ + F+++   +  NF+                 W +W  RN+ +FN   ++     + A  E+
Subjt:  FSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLWRNFLYA-------------FFWNLWLERNDRIFNSKHKN-----IQAFIES

Query:  TSYLAMYWSSQISPFCNYPLSSLISQWRS
          +L    S + +P     ++    +WR+
Subjt:  TSYLAMYWSSQISPFCNYPLSSLISQWRS

AT4G29090.1 Ribonuclease H-like superfamily protein5.7e-0835.14Show/hide
Query:  PKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRL
        PK + K I  +   F WR   + KG H   W ++     EGG+G  DI   N++LL K  WR    PE+L  ++
Subjt:  PKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGCCCTTCAGCAGCACGTATCCGCATTTTCTTCCATCAACCCTCTCCTACCAGACAAAGCTCTCCTAGCCTGTGAAGACGAAGAGCAAGCTTGCACATTAGCTAA
AATCCGAGGTTGGTATAAGGTTGGGAAATACCATCTTCGATTCTACCCATGGAGTGCCGAAGTCATGAATAGCGAGCCGAAAGTTCCATCCTACGGAGGTTGGATTAAGA
TAAGAAACTTGCCTCTCGACAAATGGTCCATTGAGACCTTCCGAAAAATTGGAGATGATTGCGGAGGATATTTGGAGACTGCCAATAAAACTCTAGCAAGAATGGATATG
ATGGAGGTATGTATTAAGATCAAGGAAAATAGTTTCGGCTTTATCCCAGCGGAAATACACTTACCTTCTTCATCCGCCAGCCCATCAATCGCCAAGATCGATCCCTTTTT
CATGGAAGACTACAGCATTGGTTATATTGCCAGCGTTCATGGTAAAATACCAGCCACCGTGGTGGATCGAGAAGCTGCTGAGGAAGACGATGGACATGCACGCGCCGTCC
CTGCGCGTCTGGAAACAATTGCGGAAAAAGTTCAGACCTCGTACAAGTTGGAAACCCAACGATGCCCAAATGATGAAATTTATTCAGTCTCGGATTTGACCCCAGCCGTA
TTCACAGAGAGGGCCCCACATGTAACCTCATCTCACACTCAACCTCAACCCCCAAAATCCCTTATCATCGGCCAGCCGTATTCACAGAGAGGGACCCACACACCAAAAGC
CCCCACTAATATTCCCTCCGCTCTCCTAGACAAAAAAAACCCATATCCAGCCCATCCTCATAAAGACGCCCAACCCAACACCCTCACAAACCCTGCCTCATGCCTTGAAA
ATCGTACCAACCGGAAAAAACCCATCACTATCAATAATAAGGAAATTTTTCTTCTCACTGGTACGAAATTCTCCACCAATACACAGCTTCTCTTATCAGATTCGGACGAA
GGAGTCTCCTCCCCGTGTTCCACACCCATGGAGCAATCTCCCATGATCCCCAGAGGCAACCCCCCATCTGTTTCCCCACCATCTATCGGTATCCTTTTTGAAGAAGAGAA
TAATCAACAGCTTCTGATGGACCACCCCCGTCCATTAAGAATCGAGGAACCCGATCACAGAAATACAATTTTATCCATTGACAAGACCGACCAATCTCTAATTGATATCA
GTGTGGAAGAAGAAGACAGTGATGAATTTTACATGGAGACAGTGCACAACGACCCAGCGACATATCTTCCATTATTATTTCCTTGGCTTACTGAACATGGCATGTGCATT
ATGCCAATGCCCAACAGACAAAAGCTCTCCAACACAGCAAAGAAGAAAGTCAAGGTCGTCGGCTCCATCATGATTATTATCTCCTGGAATGTCCGTGGCATGGGCTCTTG
GAAAAAGAGAGCTCTTATCAAAGATTTTATTTCCTCCCATAATCCATCTCTGGTGATTCTCCAAGAAACCAAAATGGCCAACATTAACAGGAAGATCATTAAGTCCATTT
GGAGTTCTCGGAATATCGCTTGGACCTCTATAGACGCTGAAGGAGTTGCTGGTGGCATTGTGATCCTATGGAATGAATCCTCCTTTGATGTCAAGGAGATTGTCGAAGGT
ATGTACACTCTATCTATCCACCTATCTTTGGCTGATGGCTACTCCTTTTGGATCACAGGTGTTTATGGATCCAACTCCTCTAAGGAGAGACGTTTATTCTGGTTAGAATT
AATGGATCTTCAAGCCCTTTGTCTCCCTAATTGGATTTTGGGTGGTGATTTTAACGTGACTCGGTGGACATGGGAGAAATTCACCCATTCGGCACCAACTCGAGCTATGA
AGAAGTTCAATCGTTTTATAGAAGATTCTGATCTTCAAGACATTCCCCTGAGCAATGGTAAATATACATGGTCTAGCTTTAGGCCTAACCCCACCATGACCCTCATTGAT
CGGTATCTCATATCCGACAGCATTGTCACCAAATTTTCAGTTGCTTCTGCCCGTAGATTGGATAGAATTACGTCGGACCATTTCCCTATCAGTCTCACATTAGGGAAGGA
AAAATGGGGACCAGCCCCTTTCAAATTCAACAATGCCTGGCTTTCACATCACTCCTTCCATAATACAGTCGATATTTGGTGGAAGAACAACCTTTCTCAAGGGTGGCCAG
GTCACGGGTTCATACACAAATTGAAAGGTCTCAAAAAGGAATTAAAGCAGTGGAATCAATCTGTTTTTGGTAATACTAAACAGCAAAGATATAGTTTGAACTCAGAACTA
TCAGATCTGGACAAGAGGGAGGAACTCGGTCGATTATCTGAACAAGAAGCCCATAGAAGAACAGAGATAAAAGCCCATCTTATAATGCTATCAGCAAACGAAGAGATCAT
GTGGAGACAGAAATGTAAACTTAAATGGCTTAGAGAGGGCGATTTTAATTCGGCCTTTTCCCACAGAGTTATGACAGCCCACAAAAGGAAAAACTCTATCATGGAAATTC
TTGCAGAATCTGGTGCCAGTTTGACCTGCGATGATGATATAGAAAAGGAAGAAGCGCCGCTGCCAGGGGAGTCGAATCGCCGTCGTCGTTCGCCGGAAGTAGCTGCGCGT
GAGCCACACGCGCCGTCGGAGATTTCTGCAGCAGCCGCCACGCGCCGGTCACGTTTGCTTCAGTTTGCGTGCTCGGCTCAATTTCGGTTTGTTTTTGTTGAGTTCACACC
CAAGAAGATCACCAAATCCATCGAGAAGTTGTTCAGATTGTTCCTTTGGCGAGGTGGTTCTGATAAGAAGGGTTGTCATCTTCTGAAATGGTCCTATATTCAATTGCCCA
CCAAGGAGGGAGGTTTGGGTCTCTACGACATTCATAAGAAAAATGTCTCTCTTTTAGCGAAATGGGCTTGGAGATTTTACCATGAGCCCGAGGCCCTCTGGAGGAGACTC
ATGGTGCATCCTCTTTACAGTCATCATTCCCATCGCTATTCAATTTATCGCTTAAGAACTGCTAAGATCGTTGATTTATGGAATTCAACCGAGGGGGCTTGGAACTTGCA
CCTTAGGAGACGTCTTCGGGACTCCGAAATTATGGAATGGGCTTTATTATCGCATCATCTATCCACCTTTTCCTTCAGAGATGTAGAAGACACATGGATATGGCATCTTA
ACGAAAATGGTGTTTTCTCTACTGGAACCCTTACCAGAAATTTGGCTTCAAATTCTTTACCGAACAGCACTGATTTTTACAGTCAATTGTGGAAAGGTCCTATGCCTAAA
AAAGTCAAATTCTTCATTTGGGAGCTCAGCCACGCTTGTATCAATACCGCTGACATTATTCAGAGGAGATTTCCAAGTTCTTCACTATCCCCCAATTGTTGCAGCATGTG
TTATAGGGCTGAAGAATCTCAAATCCATCTATTCAGTAATTGTGAGTTTGCTTCAGCTTTTTGGGACTTCATTCAAAGCGCTTTCGGATGGCAATTCGGTCGACCGGGTG
ATATTCTCTCCCTTCTTCACTATACTCTCCTTGGTCACCCTTTTAAAAAAGACACTTATTTGCTTTGGAGGAACTTTTTGTATGCTTTCTTCTGGAACTTATGGTTAGAG
AGGAATGACAGAATCTTCAACTCCAAACACAAGAATATACAAGCCTTTATTGAATCCACATCTTATTTAGCTATGTATTGGAGTAGTCAAATCTCCCCATTTTGTAATTA
TCCTTTATCTTCCCTCATATCTCAATGGAGATCATTATTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGCCCTTCAGCAGCACGTATCCGCATTTTCTTCCATCAACCCTCTCCTACCAGACAAAGCTCTCCTAGCCTGTGAAGACGAAGAGCAAGCTTGCACATTAGCTAA
AATCCGAGGTTGGTATAAGGTTGGGAAATACCATCTTCGATTCTACCCATGGAGTGCCGAAGTCATGAATAGCGAGCCGAAAGTTCCATCCTACGGAGGTTGGATTAAGA
TAAGAAACTTGCCTCTCGACAAATGGTCCATTGAGACCTTCCGAAAAATTGGAGATGATTGCGGAGGATATTTGGAGACTGCCAATAAAACTCTAGCAAGAATGGATATG
ATGGAGGTATGTATTAAGATCAAGGAAAATAGTTTCGGCTTTATCCCAGCGGAAATACACTTACCTTCTTCATCCGCCAGCCCATCAATCGCCAAGATCGATCCCTTTTT
CATGGAAGACTACAGCATTGGTTATATTGCCAGCGTTCATGGTAAAATACCAGCCACCGTGGTGGATCGAGAAGCTGCTGAGGAAGACGATGGACATGCACGCGCCGTCC
CTGCGCGTCTGGAAACAATTGCGGAAAAAGTTCAGACCTCGTACAAGTTGGAAACCCAACGATGCCCAAATGATGAAATTTATTCAGTCTCGGATTTGACCCCAGCCGTA
TTCACAGAGAGGGCCCCACATGTAACCTCATCTCACACTCAACCTCAACCCCCAAAATCCCTTATCATCGGCCAGCCGTATTCACAGAGAGGGACCCACACACCAAAAGC
CCCCACTAATATTCCCTCCGCTCTCCTAGACAAAAAAAACCCATATCCAGCCCATCCTCATAAAGACGCCCAACCCAACACCCTCACAAACCCTGCCTCATGCCTTGAAA
ATCGTACCAACCGGAAAAAACCCATCACTATCAATAATAAGGAAATTTTTCTTCTCACTGGTACGAAATTCTCCACCAATACACAGCTTCTCTTATCAGATTCGGACGAA
GGAGTCTCCTCCCCGTGTTCCACACCCATGGAGCAATCTCCCATGATCCCCAGAGGCAACCCCCCATCTGTTTCCCCACCATCTATCGGTATCCTTTTTGAAGAAGAGAA
TAATCAACAGCTTCTGATGGACCACCCCCGTCCATTAAGAATCGAGGAACCCGATCACAGAAATACAATTTTATCCATTGACAAGACCGACCAATCTCTAATTGATATCA
GTGTGGAAGAAGAAGACAGTGATGAATTTTACATGGAGACAGTGCACAACGACCCAGCGACATATCTTCCATTATTATTTCCTTGGCTTACTGAACATGGCATGTGCATT
ATGCCAATGCCCAACAGACAAAAGCTCTCCAACACAGCAAAGAAGAAAGTCAAGGTCGTCGGCTCCATCATGATTATTATCTCCTGGAATGTCCGTGGCATGGGCTCTTG
GAAAAAGAGAGCTCTTATCAAAGATTTTATTTCCTCCCATAATCCATCTCTGGTGATTCTCCAAGAAACCAAAATGGCCAACATTAACAGGAAGATCATTAAGTCCATTT
GGAGTTCTCGGAATATCGCTTGGACCTCTATAGACGCTGAAGGAGTTGCTGGTGGCATTGTGATCCTATGGAATGAATCCTCCTTTGATGTCAAGGAGATTGTCGAAGGT
ATGTACACTCTATCTATCCACCTATCTTTGGCTGATGGCTACTCCTTTTGGATCACAGGTGTTTATGGATCCAACTCCTCTAAGGAGAGACGTTTATTCTGGTTAGAATT
AATGGATCTTCAAGCCCTTTGTCTCCCTAATTGGATTTTGGGTGGTGATTTTAACGTGACTCGGTGGACATGGGAGAAATTCACCCATTCGGCACCAACTCGAGCTATGA
AGAAGTTCAATCGTTTTATAGAAGATTCTGATCTTCAAGACATTCCCCTGAGCAATGGTAAATATACATGGTCTAGCTTTAGGCCTAACCCCACCATGACCCTCATTGAT
CGGTATCTCATATCCGACAGCATTGTCACCAAATTTTCAGTTGCTTCTGCCCGTAGATTGGATAGAATTACGTCGGACCATTTCCCTATCAGTCTCACATTAGGGAAGGA
AAAATGGGGACCAGCCCCTTTCAAATTCAACAATGCCTGGCTTTCACATCACTCCTTCCATAATACAGTCGATATTTGGTGGAAGAACAACCTTTCTCAAGGGTGGCCAG
GTCACGGGTTCATACACAAATTGAAAGGTCTCAAAAAGGAATTAAAGCAGTGGAATCAATCTGTTTTTGGTAATACTAAACAGCAAAGATATAGTTTGAACTCAGAACTA
TCAGATCTGGACAAGAGGGAGGAACTCGGTCGATTATCTGAACAAGAAGCCCATAGAAGAACAGAGATAAAAGCCCATCTTATAATGCTATCAGCAAACGAAGAGATCAT
GTGGAGACAGAAATGTAAACTTAAATGGCTTAGAGAGGGCGATTTTAATTCGGCCTTTTCCCACAGAGTTATGACAGCCCACAAAAGGAAAAACTCTATCATGGAAATTC
TTGCAGAATCTGGTGCCAGTTTGACCTGCGATGATGATATAGAAAAGGAAGAAGCGCCGCTGCCAGGGGAGTCGAATCGCCGTCGTCGTTCGCCGGAAGTAGCTGCGCGT
GAGCCACACGCGCCGTCGGAGATTTCTGCAGCAGCCGCCACGCGCCGGTCACGTTTGCTTCAGTTTGCGTGCTCGGCTCAATTTCGGTTTGTTTTTGTTGAGTTCACACC
CAAGAAGATCACCAAATCCATCGAGAAGTTGTTCAGATTGTTCCTTTGGCGAGGTGGTTCTGATAAGAAGGGTTGTCATCTTCTGAAATGGTCCTATATTCAATTGCCCA
CCAAGGAGGGAGGTTTGGGTCTCTACGACATTCATAAGAAAAATGTCTCTCTTTTAGCGAAATGGGCTTGGAGATTTTACCATGAGCCCGAGGCCCTCTGGAGGAGACTC
ATGGTGCATCCTCTTTACAGTCATCATTCCCATCGCTATTCAATTTATCGCTTAAGAACTGCTAAGATCGTTGATTTATGGAATTCAACCGAGGGGGCTTGGAACTTGCA
CCTTAGGAGACGTCTTCGGGACTCCGAAATTATGGAATGGGCTTTATTATCGCATCATCTATCCACCTTTTCCTTCAGAGATGTAGAAGACACATGGATATGGCATCTTA
ACGAAAATGGTGTTTTCTCTACTGGAACCCTTACCAGAAATTTGGCTTCAAATTCTTTACCGAACAGCACTGATTTTTACAGTCAATTGTGGAAAGGTCCTATGCCTAAA
AAAGTCAAATTCTTCATTTGGGAGCTCAGCCACGCTTGTATCAATACCGCTGACATTATTCAGAGGAGATTTCCAAGTTCTTCACTATCCCCCAATTGTTGCAGCATGTG
TTATAGGGCTGAAGAATCTCAAATCCATCTATTCAGTAATTGTGAGTTTGCTTCAGCTTTTTGGGACTTCATTCAAAGCGCTTTCGGATGGCAATTCGGTCGACCGGGTG
ATATTCTCTCCCTTCTTCACTATACTCTCCTTGGTCACCCTTTTAAAAAAGACACTTATTTGCTTTGGAGGAACTTTTTGTATGCTTTCTTCTGGAACTTATGGTTAGAG
AGGAATGACAGAATCTTCAACTCCAAACACAAGAATATACAAGCCTTTATTGAATCCACATCTTATTTAGCTATGTATTGGAGTAGTCAAATCTCCCCATTTTGTAATTA
TCCTTTATCTTCCCTCATATCTCAATGGAGATCATTATTGTAA
Protein sequenceShow/hide protein sequence
MRALQQHVSAFSSINPLLPDKALLACEDEEQACTLAKIRGWYKVGKYHLRFYPWSAEVMNSEPKVPSYGGWIKIRNLPLDKWSIETFRKIGDDCGGYLETANKTLARMDM
MEVCIKIKENSFGFIPAEIHLPSSSASPSIAKIDPFFMEDYSIGYIASVHGKIPATVVDREAAEEDDGHARAVPARLETIAEKVQTSYKLETQRCPNDEIYSVSDLTPAV
FTERAPHVTSSHTQPQPPKSLIIGQPYSQRGTHTPKAPTNIPSALLDKKNPYPAHPHKDAQPNTLTNPASCLENRTNRKKPITINNKEIFLLTGTKFSTNTQLLLSDSDE
GVSSPCSTPMEQSPMIPRGNPPSVSPPSIGILFEEENNQQLLMDHPRPLRIEEPDHRNTILSIDKTDQSLIDISVEEEDSDEFYMETVHNDPATYLPLLFPWLTEHGMCI
MPMPNRQKLSNTAKKKVKVVGSIMIIISWNVRGMGSWKKRALIKDFISSHNPSLVILQETKMANINRKIIKSIWSSRNIAWTSIDAEGVAGGIVILWNESSFDVKEIVEG
MYTLSIHLSLADGYSFWITGVYGSNSSKERRLFWLELMDLQALCLPNWILGGDFNVTRWTWEKFTHSAPTRAMKKFNRFIEDSDLQDIPLSNGKYTWSSFRPNPTMTLID
RYLISDSIVTKFSVASARRLDRITSDHFPISLTLGKEKWGPAPFKFNNAWLSHHSFHNTVDIWWKNNLSQGWPGHGFIHKLKGLKKELKQWNQSVFGNTKQQRYSLNSEL
SDLDKREELGRLSEQEAHRRTEIKAHLIMLSANEEIMWRQKCKLKWLREGDFNSAFSHRVMTAHKRKNSIMEILAESGASLTCDDDIEKEEAPLPGESNRRRRSPEVAAR
EPHAPSEISAAAATRRSRLLQFACSAQFRFVFVEFTPKKITKSIEKLFRLFLWRGGSDKKGCHLLKWSYIQLPTKEGGLGLYDIHKKNVSLLAKWAWRFYHEPEALWRRL
MVHPLYSHHSHRYSIYRLRTAKIVDLWNSTEGAWNLHLRRRLRDSEIMEWALLSHHLSTFSFRDVEDTWIWHLNENGVFSTGTLTRNLASNSLPNSTDFYSQLWKGPMPK
KVKFFIWELSHACINTADIIQRRFPSSSLSPNCCSMCYRAEESQIHLFSNCEFASAFWDFIQSAFGWQFGRPGDILSLLHYTLLGHPFKKDTYLLWRNFLYAFFWNLWLE
RNDRIFNSKHKNIQAFIESTSYLAMYWSSQISPFCNYPLSSLISQWRSLL