; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028602 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028602
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr8:26087506..26094359
RNA-Seq ExpressionLag0028602
SyntenyLag0028602
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.1e-9530.41Show/hide
Query:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDP--HEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKA
        L + +SVKLD  N+ LW+ +VL ++ G K+DGY+L T   P E +  ++  + ++      Q  D       +++  T  +       ++ +TS+++W  
Subjt:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDP--HEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKA

Query:  LEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLI-RPTT----LLAALN-----LNHMVI----PTRILGVAIGAEVVA----
         + + GA ++++I  ++    + +KG MKM +YL  MK    NL + L L   P +    ++  LN      N +V+     T +  V + A+++     
Subjt:  LEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLI-RPTT----LLAALN-----LNHMVI----PTRILGVAIGAEVVA----

Query:  VIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSG------RRGMSGALSAEAGFRILGLGRYIC-----------APNNNQGGNN--GGSSTYLASPEI
        +  +   + L +++       S  + K+S   W G      R G     S +   ++ GL  +I            + +N+  G++  G  + +LAS   
Subjt:  VIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSG------RRGMSGALSAEAGFRILGLGRYIC-----------APNNNQGGNN--GGSSTYLASPEI

Query:  LSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCD
        + D  W  DS A+NHVT          +++GK  L V N  KL +   G+S + S      + L DIL+VP+I +NL+S+++L ADNN  VEF    +C 
Subjt:  LSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCD

Query:  GQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVS-KSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLS
          + K +  V  + LL             L      + + S  VS K  WH+ LGH + KV + VL SC      ++  SF + CQY K H LPF  S S
Subjt:  GQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVS-KSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLS

Query:  SSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPH
         ++ PLEL+H D+WGP+P+ ++ G++YY+ FVD++SR T+IYPLK KS+  + F Q+K L EN+F+K++K +Q D   E++      + + I  R  CP+
Subjt:  SSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPH

Query:  TRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------------
        T QQNG  ERKH  + E GLTL+AQA +PL +WW+AFS+AV++ NRLP+QV    SP+    +   D   L+ FGCAC+PCL+P                
Subjt:  TRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------------

Query:  --------TKLTNSNTTVPNVCSLVI----------VLPTKS--STTASSPQSVSVPTQSPNIASPTLSPVSRPLSPIAQS--PALCLSSGSEASHSLPS
                 K  NS+  +     ++            L T+S   TT + P +      + N+      P+    +P   +   +  ++S +E +++ PS
Subjt:  --------TKLTNSNTTVPNVCSLVI----------VLPTKS--STTASSPQSVSVPTQSPNIASPTLSPVSRPLSPIAQS--PALCLSSGSEASHSLPS

Query:  S-----ESSLDKS-------------------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTN
              E +LD +                   TR+KSG+ KPK    ++    + +   EP   KEAL+ PLW++AM  E  AL  N+TW LV      N
Subjt:  S-----ESSLDKS-------------------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTN

Query:  LIGNKWIFKLK
        ++ +KW+FK K
Subjt:  LIGNKWIFKLK

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]2.7e-9430.15Show/hide
Query:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNI-------KTSR
        L +++SVKLD  N+ LW+ +VL+++ G K+DGY+L T   P + V  A+          ++V+  F    ++  Q L   L  +  ++I       +TS+
Subjt:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNI-------KTSR

Query:  EVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA---
        ++W   + + GA +K+RI  ++    N +KG MKM EYL  MK  S+ L        N  L+  T        N +V+       +  V + A+++A   
Subjt:  EVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA---

Query:  -VIIVERTSGLPVSSAVTPDLHSWKQFK----TSMIMW--SGRRGM-------------------SGALSAEAGFRILGLGRYICAPNNNQGGNNGGSST
         +      SGL ++++   +  +  +F+     S   W  S  RGM                   +G ++ +  +R      Y     + +    G  S 
Subjt:  -VIIVERTSGLPVSSAVTPDLHSWKQFK----TSMIMW--SGRRGM-------------------SGALSAEAGFRILGLGRYICAPNNNQGGNNGGSST

Query:  YLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEF
        ++ASP    D +W  DS A NHVT          ++NGK  L V N  KL +   G      S   + + L D+L+VP I +NL+S+++L ADNN  VEF
Subjt:  YLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEF

Query:  HPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLP
              D         ++ Q LL          LS+     ++   +S       WH+ LGH + KV + VL+ CN     ++  SF + CQ+ K H LP
Subjt:  HPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLP

Query:  FSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDS
        F  S S  + PL LIH D+WGP+P+ S  G++YY+ F+D++SR T+I+PLK KSD    F Q+K L EN+F+KK+K +Q D   E+++     I + I  
Subjt:  FSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDS

Query:  RHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------
        R  CP+T QQNG  ERKH  V E+GLTL+AQA +PL++WW+AFS+AV++ NRLP+ V    SP+   F+   D + L+ FGCAC+PCL+P          
Subjt:  RHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------

Query:  --------------TKLTNSN-----------------------------TTVPNVCSLVIVLPTKSSTTASSPQSVSVPTQSPNIASPTLSPVSRPLSP
                       K  NS+                              T+ +  S+++   +  +TT  + +  +  T   N  S   S  +     
Subjt:  --------------TKLTNSN-----------------------------TTVPNVCSLVIVLPTKSSTTASSPQSVSVPTQSPNIASPTLSPVSRPLSP

Query:  IAQSPALCLSSGS-----EASHSLPS-------------SESSLDKS------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMT
        +  S     ++ S     EA +S+ S              ++  D S      TR+K G+ KPK    ++    +  +  EP+++KEAL  P+W++AM  
Subjt:  IAQSPALCLSSGS-----EASHSLPS-------------SESSLDKS------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMT

Query:  EGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
        E  AL  N TW+LV      N+I +KWIFK K
Subjt:  EGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]1.1e-9531.24Show/hide
Query:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALE
        L +++SVKLD  N+ LW+ +VL ++ G K DGY+L T   P + V  A+  +  +   +  +    A   +       ++      ++ +TS+++W   +
Subjt:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALE

Query:  EVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA----VIIVER
         + GA +K+RI  ++    N +KG MKM EYL  MK  S+ L        N  L+  T        N +V+       +  V + A+++A    +  +  
Subjt:  EVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA----VIIVER

Query:  TSGLPVSSAVT---------PDLHSWKQFKTSMIM-WSGRRGMSGALSAEAGFRILGLGR------------YICAPNNNQGGNNGGSSTYLASPEILSD
         SGL ++++              HS   ++ S      G RG  G +S        G G             Y     + +    G  S ++ASP    D
Subjt:  TSGLPVSSAVT---------PDLHSWKQFKTSMIM-WSGRRGMSGALSAEAGFRILGLGR------------YICAPNNNQGGNNGGSSTYLASPEILSD

Query:  PKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQR
         +W  DS A+NHVT          ++NGK  L V N  KL +   G      S   + + L D+L+VP I +NL+S+++L ADNN FVEF      D   
Subjt:  PKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQR

Query:  YKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLSSSKC
              ++ Q LL          LS ++   + +      V +S WH+ LGH + KV   VL+ CN     ++  SF + CQ+ K H LPF  S S  + 
Subjt:  YKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLSSSKC

Query:  PLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQ
        PL LIH D+WGP+P+ S  G++YY+ F+D++SR T+I+PLK KSD    F Q+K L EN+F+KK+K +Q D   E+++     I + I  R  CP+T QQ
Subjt:  PLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQ

Query:  NGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTKLTNSNTTVPNVCSLVIVL
        NG  ERKH  VVE+GLTL+AQA +PL++WW+AFS+AV++ NRL + V    SP+   F+   D + L+ FGCAC+PCL+P               +    
Subjt:  NGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTKLTNSNTTVPNVCSLVIVL

Query:  PTKSSTTASSPQSVS------VPTQSPNIASPTLSPVSRPLSPIAQSPALCLSSGSEASHSLPSSESSLDKS-----------------------TRAKS
          K STT  +  S +        T   N  S   S  +           +  ++GS     + +   S D++                       TR+K+
Subjt:  PTKSSTTASSPQSVS------VPTQSPNIASPTLSPVSRPLSPIAQSPALCLSSGSEASHSLPSSESSLDKS-----------------------TRAKS

Query:  GVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
        G+ KPK    ++    +  +  EP ++KEAL  P+W++AM  E  AL  N TW+LV      N+I +KWIFK K
Subjt:  GVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK

RVW85836.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]7.9e-9430.78Show/hide
Query:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQP-----SEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREV
        L+  L VKLD  N++LW+  +  ++     + ++  +   P     S ++  A +   R D              +S   +  +    A  +   +S   
Subjt:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQP-----SEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREV

Query:  WKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTT-------LLAAL--NLNHMVIPTRILGVAIGAEVVAVII---
        W ALE+ + ++S+ARI  +R  LQ+ KKG++ M++Y+  +K A+ +L     +  P +       LL  L  + N +V    I    I  E V  ++   
Subjt:  WKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTT-------LLAAL--NLNHMVIPTRILGVAIGAEVVAVII---

Query:  ---VERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGR----------------------------------------YICA
           +E+ S +   S ++ +  S    +     ++G RG +   +  + +   G GR                                        Y  +
Subjt:  ---VERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGR----------------------------------------YICA

Query:  PNNNQGGNNGGS----STYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQR
         ++N   +N G+       +AS   L+D  W  DS A++H+T    NL   + Y G  K+T+ N   L++ N G+  + S+  +    LK + HVP I  
Subjt:  PNNNQGGNNGGS----STYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQR

Query:  NLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAF--AYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAY
        NLIS+A+  +DNNA +EF          + +      Q L         L+   +AF  A +  +S  C    ++WH  LGHAS  +   +++SCN S  
Subjt:  NLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAF--AYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAY

Query:  VNE---VDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKT
         N+     +    CQ +KSHRLP   SLS +  PLEL+H DLWGP+PV ST G RY+I F+D+YSR T+ YPL+ K     VF ++K  VEN+FD K+K 
Subjt:  VNE---VDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKT

Query:  LQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFL
        LQSD   EFRSF +FL  + I  R  CP+   QNG VERKH  VVE GL L+A ASLP++FW  AF +A F+ NR+P++VL  +SP+   F+ V D   L
Subjt:  LQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFL

Query:  RVFGCACFPCLRPTK----------------------------LTNSNTTVPNVCSLVIVLPTKSSTTASSPQSVSVPTQSPNI----ASPTLSPVSRPL
        RVFGC C+P +RP                              LT      P+V       P   +   S  +  S  T +P I     +PT      P 
Subjt:  RVFGCACFPCLRPTK----------------------------LTNSNTTVPNVCSLVIVLPTKSSTTASSPQSVSVPTQSPNI----ASPTLSPVSRPL

Query:  SPIAQSPALCLSSGSEASHSL----------------PSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALY
        S ++ SP++  +S S +S ++                PSS  +   +TR   G+ + K          S  ++SEP T+K+AL  P W QAM  E  AL+
Subjt:  SPIAQSPALCLSSGSEASHSL----------------PSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALY

Query:  HNRTWSLVSPPPNTNLIGNKWIFKLK
         N+TW LV  PP  NLIG KW++KLK
Subjt:  HNRTWSLVSPPPNTNLIGNKWIFKLK

RVX23584.1 Retrovirus-related Pol polyprotein from transposon RE1 [Vitis vinifera]7.9e-9430.68Show/hide
Query:  SSPATVPISASTIVS----LSFGHP----LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQP-----SEMVEKAEIERPRHDPHEQRVDFHF
        SSP     S+ +I S        HP    L+  L VKLD  N++LW+  +  ++     + ++  +   P     S ++  A +   R D          
Subjt:  SSPATVPISASTIVS----LSFGHP----LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQP-----SEMVEKAEIERPRHDPHEQRVDFHF

Query:  AAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTT-------LLAAL-
            +S   +  +    A  +   +S   W ALE+++ ++S+ARI  +R  LQ+ KKG++ M++Y+  +K A+++L     +  P +       LL  L 
Subjt:  AAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTT-------LLAAL-

Query:  -NLNHMVIPTRILGVAIGAEVVAVII------VERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGR---------------
         + N +V    I    I  E V  ++      +E+ S +   S ++ +  S    +     ++G RG +   +  + +   G GR               
Subjt:  -NLNHMVIPTRILGVAIGAEVVAVII------VERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGR---------------

Query:  -------------------------YICAPNNNQGGNNGGS----STYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVY
                                 Y  + ++N   +N G+       +AS   L+D  W  DS A++H+T    NL   + Y G  K+T+ N   L++ 
Subjt:  -------------------------YICAPNNNQGGNNGGS----STYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVY

Query:  NVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAF--AYHVESSKSCKV
        N G+  + S+  +    LK + HVP I  NLIS+A+  +DNNA +EF          + +      Q L         L+   +AF  A +  +S  C  
Subjt:  NVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAF--AYHVESSKSCKV

Query:  SKSIWHQCLGHASVKVFNSVLRSCNQSAYVNE---VDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIY
          ++WH  LGHAS  +   +++SCN S   N+     +    CQ +KSHRLP   SLS +  PLEL+H DLWGP+PV ST G RY+I F+D+YSR T+ Y
Subjt:  SKSIWHQCLGHASVKVFNSVLRSCNQSAYVNE---VDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIY

Query:  PLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVF
        PL+ K     VF ++K  VEN+FD K+K LQSD   EFRSF +FL  + I  R  CP+   QNG VERKH  VVE GL L+A ASLP++FW  AF +A F
Subjt:  PLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVF

Query:  ITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTK----------------------------LTNSNTTVPNVCSLVIVLPTKSSTTASS
        + NR+P++VL  +SP+   F+ V D   LRVFGC C+P +RP                              LT      P+V       P   +   S 
Subjt:  ITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTK----------------------------LTNSNTTVPNVCSLVIVLPTKSSTTASS

Query:  PQSVSVPTQSPNI----ASPTLSPVSRPLSPIAQSPALCLSSGSEASHSL----------------PSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSRE
         +  S  T +P I     +PT      P S ++ SP++  +S S +S ++                PSS  +   +TR   G+ + K          S  
Subjt:  PQSVSVPTQSPNI----ASPTLSPVSRPLSPIAQSPALCLSSGSEASHSL----------------PSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSRE

Query:  QLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
        ++SEP T+K+AL  P W QAM  E  AL+ N+TW LV  PP  NLIG KW++KLK
Subjt:  QLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK

TrEMBL top hitse value%identityAlignment
A0A2K3NEN7 Copia-like polyprotein (Fragment)5.3e-9631.24Show/hide
Query:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALE
        L +++SVKLD  N+ LW+ +VL ++ G K DGY+L T   P + V  A+  +  +   +  +    A   +       ++      ++ +TS+++W   +
Subjt:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALE

Query:  EVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA----VIIVER
         + GA +K+RI  ++    N +KG MKM EYL  MK  S+ L        N  L+  T        N +V+       +  V + A+++A    +  +  
Subjt:  EVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA----VIIVER

Query:  TSGLPVSSAVT---------PDLHSWKQFKTSMIM-WSGRRGMSGALSAEAGFRILGLGR------------YICAPNNNQGGNNGGSSTYLASPEILSD
         SGL ++++              HS   ++ S      G RG  G +S        G G             Y     + +    G  S ++ASP    D
Subjt:  TSGLPVSSAVT---------PDLHSWKQFKTSMIM-WSGRRGMSGALSAEAGFRILGLGR------------YICAPNNNQGGNNGGSSTYLASPEILSD

Query:  PKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQR
         +W  DS A+NHVT          ++NGK  L V N  KL +   G      S   + + L D+L+VP I +NL+S+++L ADNN FVEF      D   
Subjt:  PKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQR

Query:  YKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLSSSKC
              ++ Q LL          LS ++   + +      V +S WH+ LGH + KV   VL+ CN     ++  SF + CQ+ K H LPF  S S  + 
Subjt:  YKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLSSSKC

Query:  PLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQ
        PL LIH D+WGP+P+ S  G++YY+ F+D++SR T+I+PLK KSD    F Q+K L EN+F+KK+K +Q D   E+++     I + I  R  CP+T QQ
Subjt:  PLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQ

Query:  NGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTKLTNSNTTVPNVCSLVIVL
        NG  ERKH  VVE+GLTL+AQA +PL++WW+AFS+AV++ NRL + V    SP+   F+   D + L+ FGCAC+PCL+P               +    
Subjt:  NGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTKLTNSNTTVPNVCSLVIVL

Query:  PTKSSTTASSPQSVS------VPTQSPNIASPTLSPVSRPLSPIAQSPALCLSSGSEASHSLPSSESSLDKS-----------------------TRAKS
          K STT  +  S +        T   N  S   S  +           +  ++GS     + +   S D++                       TR+K+
Subjt:  PTKSSTTASSPQSVS------VPTQSPNIASPTLSPVSRPLSPIAQSPALCLSSGSEASHSLPSSESSLDKS-----------------------TRAKS

Query:  GVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
        G+ KPK    ++    +  +  EP ++KEAL  P+W++AM  E  AL  N TW+LV      N+I +KWIFK K
Subjt:  GVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK

A0A2Z6MBG6 Integrase catalytic domain-containing protein5.3e-9630.41Show/hide
Query:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDP--HEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKA
        L + +SVKLD  N+ LW+ +VL ++ G K+DGY+L T   P E +  ++  + ++      Q  D       +++  T  +       ++ +TS+++W  
Subjt:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDP--HEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKA

Query:  LEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLI-RPTT----LLAALN-----LNHMVI----PTRILGVAIGAEVVA----
         + + GA ++++I  ++    + +KG MKM +YL  MK    NL + L L   P +    ++  LN      N +V+     T +  V + A+++     
Subjt:  LEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLI-RPTT----LLAALN-----LNHMVI----PTRILGVAIGAEVVA----

Query:  VIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSG------RRGMSGALSAEAGFRILGLGRYIC-----------APNNNQGGNN--GGSSTYLASPEI
        +  +   + L +++       S  + K+S   W G      R G     S +   ++ GL  +I            + +N+  G++  G  + +LAS   
Subjt:  VIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSG------RRGMSGALSAEAGFRILGLGRYIC-----------APNNNQGGNN--GGSSTYLASPEI

Query:  LSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCD
        + D  W  DS A+NHVT          +++GK  L V N  KL +   G+S + S      + L DIL+VP+I +NL+S+++L ADNN  VEF    +C 
Subjt:  LSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCD

Query:  GQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVS-KSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLS
          + K +  V  + LL             L      + + S  VS K  WH+ LGH + KV + VL SC      ++  SF + CQY K H LPF  S S
Subjt:  GQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVS-KSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLS

Query:  SSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPH
         ++ PLEL+H D+WGP+P+ ++ G++YY+ FVD++SR T+IYPLK KS+  + F Q+K L EN+F+K++K +Q D   E++      + + I  R  CP+
Subjt:  SSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPH

Query:  TRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------------
        T QQNG  ERKH  + E GLTL+AQA +PL +WW+AFS+AV++ NRLP+QV    SP+    +   D   L+ FGCAC+PCL+P                
Subjt:  TRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------------

Query:  --------TKLTNSNTTVPNVCSLVI----------VLPTKS--STTASSPQSVSVPTQSPNIASPTLSPVSRPLSPIAQS--PALCLSSGSEASHSLPS
                 K  NS+  +     ++            L T+S   TT + P +      + N+      P+    +P   +   +  ++S +E +++ PS
Subjt:  --------TKLTNSNTTVPNVCSLVI----------VLPTKS--STTASSPQSVSVPTQSPNIASPTLSPVSRPLSPIAQS--PALCLSSGSEASHSLPS

Query:  S-----ESSLDKS-------------------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTN
              E +LD +                   TR+KSG+ KPK    ++    + +   EP   KEAL+ PLW++AM  E  AL  N+TW LV      N
Subjt:  S-----ESSLDKS-------------------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTN

Query:  LIGNKWIFKLK
        ++ +KW+FK K
Subjt:  LIGNKWIFKLK

A0A2Z6P4D5 Integrase catalytic domain-containing protein1.3e-9430.15Show/hide
Query:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNI-------KTSR
        L +++SVKLD  N+ LW+ +VL+++ G K+DGY+L T   P + V  A+          ++V+  F    ++  Q L   L  +  ++I       +TS+
Subjt:  LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNI-------KTSR

Query:  EVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA---
        ++W   + + GA +K+RI  ++    N +KG MKM EYL  MK  S+ L        N  L+  T        N +V+       +  V + A+++A   
Subjt:  EVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENL------YENLFLIRPTTLLAALNLNHMVI----PTRILGVAIGAEVVA---

Query:  -VIIVERTSGLPVSSAVTPDLHSWKQFK----TSMIMW--SGRRGM-------------------SGALSAEAGFRILGLGRYICAPNNNQGGNNGGSST
         +      SGL ++++   +  +  +F+     S   W  S  RGM                   +G ++ +  +R      Y     + +    G  S 
Subjt:  -VIIVERTSGLPVSSAVTPDLHSWKQFK----TSMIMW--SGRRGM-------------------SGALSAEAGFRILGLGRYICAPNNNQGGNNGGSST

Query:  YLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEF
        ++ASP    D +W  DS A NHVT          ++NGK  L V N  KL +   G      S   + + L D+L+VP I +NL+S+++L ADNN  VEF
Subjt:  YLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEF

Query:  HPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLP
              D         ++ Q LL          LS+     ++   +S       WH+ LGH + KV + VL+ CN     ++  SF + CQ+ K H LP
Subjt:  HPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLP

Query:  FSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDS
        F  S S  + PL LIH D+WGP+P+ S  G++YY+ F+D++SR T+I+PLK KSD    F Q+K L EN+F+KK+K +Q D   E+++     I + I  
Subjt:  FSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDS

Query:  RHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------
        R  CP+T QQNG  ERKH  V E+GLTL+AQA +PL++WW+AFS+AV++ NRLP+ V    SP+   F+   D + L+ FGCAC+PCL+P          
Subjt:  RHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP----------

Query:  --------------TKLTNSN-----------------------------TTVPNVCSLVIVLPTKSSTTASSPQSVSVPTQSPNIASPTLSPVSRPLSP
                       K  NS+                              T+ +  S+++   +  +TT  + +  +  T   N  S   S  +     
Subjt:  --------------TKLTNSN-----------------------------TTVPNVCSLVIVLPTKSSTTASSPQSVSVPTQSPNIASPTLSPVSRPLSP

Query:  IAQSPALCLSSGS-----EASHSLPS-------------SESSLDKS------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMT
        +  S     ++ S     EA +S+ S              ++  D S      TR+K G+ KPK    ++    +  +  EP+++KEAL  P+W++AM  
Subjt:  IAQSPALCLSSGS-----EASHSLPS-------------SESSLDKS------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMT

Query:  EGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
        E  AL  N TW+LV      N+I +KWIFK K
Subjt:  EGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK

A0A438KQV8 Retrovirus-related Pol polyprotein from transposon RE13.8e-9430.68Show/hide
Query:  SSPATVPISASTIVS----LSFGHP----LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQP-----SEMVEKAEIERPRHDPHEQRVDFHF
        SSP     S+ +I S        HP    L+  L VKLD  N++LW+  +  ++     + ++  +   P     S ++  A +   R D          
Subjt:  SSPATVPISASTIVS----LSFGHP----LSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQP-----SEMVEKAEIERPRHDPHEQRVDFHF

Query:  AAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTT-------LLAAL-
            +S   +  +    A  +   +S   W ALE+++ ++S+ARI  +R  LQ+ KKG++ M++Y+  +K A+++L     +  P +       LL  L 
Subjt:  AAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTT-------LLAAL-

Query:  -NLNHMVIPTRILGVAIGAEVVAVII------VERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGR---------------
         + N +V    I    I  E V  ++      +E+ S +   S ++ +  S    +     ++G RG +   +  + +   G GR               
Subjt:  -NLNHMVIPTRILGVAIGAEVVAVII------VERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGR---------------

Query:  -------------------------YICAPNNNQGGNNGGS----STYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVY
                                 Y  + ++N   +N G+       +AS   L+D  W  DS A++H+T    NL   + Y G  K+T+ N   L++ 
Subjt:  -------------------------YICAPNNNQGGNNGGS----STYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVY

Query:  NVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAF--AYHVESSKSCKV
        N G+  + S+  +    LK + HVP I  NLIS+A+  +DNNA +EF          + +      Q L         L+   +AF  A +  +S  C  
Subjt:  NVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAF--AYHVESSKSCKV

Query:  SKSIWHQCLGHASVKVFNSVLRSCNQSAYVNE---VDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIY
          ++WH  LGHAS  +   +++SCN S   N+     +    CQ +KSHRLP   SLS +  PLEL+H DLWGP+PV ST G RY+I F+D+YSR T+ Y
Subjt:  SKSIWHQCLGHASVKVFNSVLRSCNQSAYVNE---VDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIY

Query:  PLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVF
        PL+ K     VF ++K  VEN+FD K+K LQSD   EFRSF +FL  + I  R  CP+   QNG VERKH  VVE GL L+A ASLP++FW  AF +A F
Subjt:  PLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVF

Query:  ITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTK----------------------------LTNSNTTVPNVCSLVIVLPTKSSTTASS
        + NR+P++VL  +SP+   F+ V D   LRVFGC C+P +RP                              LT      P+V       P   +   S 
Subjt:  ITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTK----------------------------LTNSNTTVPNVCSLVIVLPTKSSTTASS

Query:  PQSVSVPTQSPNI----ASPTLSPVSRPLSPIAQSPALCLSSGSEASHSL----------------PSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSRE
         +  S  T +P I     +PT      P S ++ SP++  +S S +S ++                PSS  +   +TR   G+ + K          S  
Subjt:  PQSVSVPTQSPNI----ASPTLSPVSRPLSPIAQSPALCLSSGSEASHSL----------------PSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSRE

Query:  QLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
        ++SEP T+K+AL  P W QAM  E  AL+ N+TW LV  PP  NLIG KW++KLK
Subjt:  QLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK

A0A803PM38 Uncharacterized protein1.4e-10132.99Show/hide
Query:  ASTIVSLSFGHPLSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVN
        A  IV   FG  L+   ++KLD  NF LWR MV AI+ G ++DGY+  T+ +P E +   +++       +    F      +   Q L   L+ +    
Subjt:  ASTIVSLSFGHPLSTVLSVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVN

Query:  I-------KTSREVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQ-------ASENLYENLFLIRPTTLLAALNLNHMVIPTRILGVAI
        I        +S  +W ALEE++GA SKA+++  R  +Q  +KGA+ M +YL   +Q       A E   EN  +   + +L+ L++ +  +P  +L  A 
Subjt:  I-------KTSREVWKALEEVYGATSKARINSVRGILQNKKKGAMKMVEYLAIMKQ-------ASENLYENLFLIRPTTLLAALNLNHMVIPTRILGVAI

Query:  GAEVVAVIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGRYICAPNNNQGG--NNGGSS---------------TYLA
        G+     +   +   L + S +   LHS   F  S  +       S +L+ +        G +    NNN+GG  NN GS+               T   
Subjt:  GAEVVAVIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSAEAGFRILGLGRYICAPNNNQGG--NNGGSS---------------TYLA

Query:  SPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVG-NSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHP
          +         +  A+NH+TS+   + +K +YNGK+K+TV+N ++L ++++G  SL   SA  SP++LK+ILHVP I +NL+SI++L +DNN  VEF  
Subjt:  SPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVG-NSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHP

Query:  LLSC-----DGQ-----RYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSI-----------WHQCLGHASVKVFNSVLRSCNQSAYV
         L        GQ     + K+       P  T + +SN       +F+  V S+    V+K +           WH+ LGH S++V ++VL   N    +
Subjt:  LLSC-----DGQ-----RYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSI-----------WHQCLGHASVKVFNSVLRSCNQSAYV

Query:  NEVDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSD
        N   SF D CQ  KSH LPF  +   +  PLEL+H D+WGPSP+ S   +RYYI F+D++SR T+IYPLK KS+    F Q+K LVEN+F+ +VK +Q+D
Subjt:  NEVDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSD

Query:  LEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFG
           E++ F  F     I  +HPCPHT  QNG  ERKH  +VEMGLTL+AQA +P K+WWDAF +AV++ NRLPT VL   +P+E  F+   D  FL+VFG
Subjt:  LEREFRSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFG

Query:  CACFPCLR----------PTKLTN-------------SNTTVPNVCSLVIV----LPTKS----STTASSPQSVSVP-----------------------
         +CFPCLR           TK  N             S+T    +   VI      P KS    +    +P SV VP                       
Subjt:  CACFPCLR----------PTKLTN-------------SNTTVPNVCSLVIV----LPTKS----STTASSPQSVSVP-----------------------

Query:  ---TQSPNIASPTLSPVSRPLSPI------------------------AQSPALCLSSG------SEASHSLPSSESSLDKSTRAKSGVFKPKDWGSFLS
           T   +  +PT S V   LS                            +    L S       S + H+L +  S+    TRAK+G+FKPK   ++L+
Subjt:  ---TQSPNIASPTLSPVSRPLSPI------------------------AQSPALCLSSG------SEASHSLPSSESSLDKSTRAKSGVFKPKDWGSFLS

Query:  TCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
                SEP++I+EAL    W  AM +E  AL  N TW LV   P+ ++I NKW++K K
Subjt:  TCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.6e-2325.51Show/hide
Query:  ILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSC
        ++ +  ++ DS A++H+ +D        +     K+ V+   +  +Y     +V        I L+D+L       NL+S+ RL  +    +EF      
Subjt:  ILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSC

Query:  DGQRYKESPSVSSQPLLTAASNS--NNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRS--CNQSAYVNEVDSFYDVCQ---YSKSHRL
             K   ++S   L+   ++   NN+ + +   AY + +    K +  +WH+  GH S      + R    +  + +N ++   ++C+     K  RL
Subjt:  DGQRYKESPSVSSQPLLTAASNS--NNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRS--CNQSAYVNEVDSFYDVCQ---YSKSHRL

Query:  PFS--RSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRS--FSSFLIA
        PF   +  +  K PL ++H D+ GP    +     Y++ FVD+++     Y +K KSD F +F  + A  E  F+ KV  L  D  RE+ S     F + 
Subjt:  PFS--RSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRS--FSSFLIA

Query:  SRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSS--PWECAFRIVLDISFLRVFGCACF
          I      PHT Q NG+ ER    + E   T+++ A L   FW +A  +A ++ NR+P++ L  SS  P+E        +  LRVFG   +
Subjt:  SRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSS--PWECAFRIVLDISFLRVFGCACF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.9e-3225.29Show/hide
Query:  YLASPEILSDPKWLADSSATNHVT--SDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFV
        +L+ PE     +W+ D++A++H T   D     +  D+     + + N S   +  +G+  + ++ +   ++LKD+ HVP ++ NLIS   L  D     
Subjt:  YLASPEILSDPKWLADSSATNHVT--SDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFV

Query:  EFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHR
         +    +    R  +   V ++ +        N  +          ++   ++S  +WH+ +GH S K    + +    S          D C + K HR
Subjt:  EFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHR

Query:  LPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREF--RSFSSFLIAS
        + F  S       L+L++ D+ GP  + S  G +Y+++F+D+ SR  ++Y LK K   F+VF ++ ALVE +  +K+K L+SD   E+  R F  +  + 
Subjt:  LPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREF--RSFSSFLIAS

Query:  RIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACF---PCLRPTKL
         I      P T Q NG+ ER +  +VE   +++  A LP  FW +A  +A ++ NR P+  L    P        +  S L+VFGC  F   P  + TKL
Subjt:  RIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACF---PCLRPTKL

Query:  TNSN--------------------TTVPNVCSLVIVLPTKSSTTASS----------PQSVSVPTQS--PNIASPTLSPVS----RPLSPIAQSPALCLS
         + +                         + S  +V       TA+           P  V++P+ S  P  A  T   VS    +P   I Q     L 
Subjt:  TNSN--------------------TTVPNVCSLVIVLPTKSSTTASS----------PQSVSVPTQS--PNIASPTLSPVS----RPLSPIAQSPALCLS

Query:  SGSEASHSLPSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQ---AMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKL
         G E        E       R++    + + + S      S ++  EP ++KE L+ P   Q   AM  E  +L  N T+ LV  P     +  KW+FKL
Subjt:  SGSEASHSLPSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQ---AMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKL

Query:  K
        K
Subjt:  K

Q07163 Transposon TyH3 Gag-Pol polyprotein2.7e-1225.08Show/hide
Query:  ILHVPHIQRNLISIARLIA-DNNAFVEFHPLLSCDGQ------RYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVK
        +LH P+I  +L+S+  L A D  A    + L   DG       +Y +   VS + LL      +N+S+ ++   +  ES++  K      H+ L HA+ +
Subjt:  ILHVPHIQRNLISIARLIA-DNNAFVEFHPLLSCDGQ------RYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVK

Query:  VFNSVLRSCNQSAYVNEVDSFYD-----------VCQYSKSHRLPFSR-SLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPL--KL
             L++ N   Y NE D  +            + + +K   +  SR    +S  P + +H D++GP          Y+ISF DE ++  ++YPL  + 
Subjt:  VFNSVLRSCNQSAYVNEVDSFYD-----------VCQYSKSHRLPFSR-SLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPL--KL

Query:  KSDPFRVFCQYKALVENKFDKKVKTLQSDLEREF--RSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFIT
        +     VF    A ++N+F   V  +Q D   E+  R+   FL  + I   +      + +G+ ER +  +++   T +  + LP   W+ A   +  + 
Subjt:  KSDPFRVFCQYKALVENKFDKKVKTLQSDLEREF--RSFSSFLIASRIDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFIT

Query:  NRLPTQVLDGSSPWECAFRIVLDISFLRVFG
        N L +      S  + A    LDIS L  FG
Subjt:  NRLPTQVLDGSSPWECAFRIVLDISFLRVFG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.5e-6330.35Show/hide
Query:  LASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFH
        L SP   S   WL DS AT+H+TSD  NL++   Y G   + V++ S + + + G++ +  S  + P+ L +IL+VP+I +NLIS+ RL   N   VEF 
Subjt:  LASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFH

Query:  PLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVE--SSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDV--CQYSKSH
        P        ++     +  PLL      + L    +A +  V   +S S K + S WH  LGH +  + NSV+ + + S  +N    F     C  +KS+
Subjt:  PLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVE--SSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDV--CQYSKSH

Query:  RLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASR
        ++PFS+S  +S  PLE I+ D+W  SP+ S   YRYY+ FVD ++R T++YPLK KS     F  +K L+EN+F  ++ T  SD   EF +   +     
Subjt:  RLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASR

Query:  IDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP-------
        I      PHT + NG+ ERKH  +VE GLTL++ AS+P  +W  AF+ AV++ NRLPT +L   SP++  F    +   LRVFGCAC+P LRP       
Subjt:  IDSRHPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRP-------

Query:  ----------TKLTNS--------------------------------------------------NTTVPNVCSLVIVLPTKSST----TASSPQSVSV
                    LT S                                                  +TT+P   +   VLP  S +     A+ P S S 
Subjt:  ----------TKLTNS--------------------------------------------------NTTVPNVCSLVIVLPTKSST----TASSPQSVSV

Query:  PTQSPNIASPTL----------------------SPVSRPL---------------SPIAQSP---ALCLSSGSEASHSLPSSESSLDKS----------
        P ++  ++S  L                       P ++P                +P  +SP   A  LS+ +++S S PS  +S   S          
Subjt:  PTQSPNIASPTL----------------------SPVSRPL---------------SPIAQSP---ALCLSSGSEASHSLPSSESSLDKS----------

Query:  -------------------------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPP-NTNLIGNKW
                                 TRAK+G+ KP    S   + ++    SEPRT  +AL    WR AM +E  A   N TW LV PPP +  ++G +W
Subjt:  -------------------------TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPP-NTNLIGNKW

Query:  IFKLK
        IF  K
Subjt:  IFKLK

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.0e-6431.56Show/hide
Query:  WLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQRYK
        WL DS AT+H+TSD  NL+    Y G   + +++ S + + + G++ +P+S+ +  + L  +L+VP+I +NLIS+ RL   N   VEF P        ++
Subjt:  WLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQRNLISIARLIADNNAFVEFHPLLSCDGQRYK

Query:  ESPSVSSQPLLTAASNSNNLSLS-SLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYV----NEVDSFYDVCQYSKSHRLPFSRSLSS
             +  PLL   +         + + A  + +S   K + S WH  LGH S+ + NSV+   N S  V    +++ S  D C  +KSH++PFS S  +
Subjt:  ESPSVSSQPLLTAASNSNNLSLS-SLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYV----NEVDSFYDVCQYSKSHRLPFSRSLSS

Query:  SKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHT
        S  PLE I+ D+W  SP+ S   YRYY+ FVD ++R T++YPLK KS     F  +K+LVEN+F  ++ TL SD   EF     +L    I      PHT
Subjt:  SKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSRHPCPHT

Query:  RQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLR------------------
         + NG+ ERKH  +VEMGLTL++ AS+P  +W  AFS AV++ NRLPT +L   SP++  F    +   L+VFGCAC+P LR                  
Subjt:  RQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLR------------------

Query:  -------------------------------PTKLTN------------------SNTTVP-------------------------------------NV
                                       P   TN                  S+TT+P                                     N+
Subjt:  -------------------------------PTKLTN------------------SNTTVP-------------------------------------NV

Query:  CSLVIVLPTKSSTTASS---PQSVSVP--TQSPNIASPTL--------SPVS------RPLSPIAQSPALCLSSGSEASHSLPSSESSL-----------
         S  I  P+ S  TA S   PQ  + P  TQ+ N  SP L        SP S       P SPI+ SP +   S S +  + PSS S+            
Subjt:  CSLVIVLPTKSSTTASS---PQSVSVP--TQSPNIASPTL--------SPVS------RPLSPIAQSPALCLSSGSEASHSLPSSESSL-----------

Query:  --------------DKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLV-SPPPNTNLIGNKWIFKLK
                        +TRAK G+ KP    S+ ++ ++    SEPRT  +A+    WRQAM +E  A   N TW LV  PPP+  ++G +WIF  K
Subjt:  --------------DKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLV-SPPPNTNLIGNKWIFKLK

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.1e-0440.38Show/hide
Query:  EPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
        EP T  EA    +W  AM  E  A+    TW + + PPN   IG KW++K+K
Subjt:  EPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK

ATMG00300.1 Gag-Pol-related retrotransposon family protein3.3e-0529.63Show/hide
Query:  SKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVP
        +++ K    +WH  L H S +    +++     +       F + C Y K+HR+ FS    ++K PL+ +H DLWG   VP
Subjt:  SKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVCQYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVP

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.7e-0943.04Show/hide
Query:  TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK
        TR+K+G+ K     S   T + ++   EP+++  AL  P W QAM  E  AL  N+TW LV PP N N++G KW+FK K
Subjt:  TRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAMMTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGATACCCAAGAATTCGGCAGTAACTTCAAGCAGGGGATTTATCTTCAGAAAGATGAGAAAGAAATCAGACAAGATTTTTTCAAGAACCCTCTTTACTTAGCTCT
GATACCAATTGTTGAGCGTGAAGGATCAAAGCCTCTCAATCAAAAGCAGAATTTAGATGAAGTAAAGACAAAAGAGCGTATTTTTTTTCTTCCCTGCGCTATGGCGGCTG
ACGATCTTTCGTTCTCTTCAAATGCTGGTATTTCTTCTCCAGCCACCGTACCTATTTCTGCCTCGACCATTGTTAGTTTGTCGTTTGGTCACCCTCTGAGCACTGTGTTA
TCAGTCAAGCTCGATGACAAGAATTTCTTACTGTGGAGAGGAATGGTTCTAGCCATTCTTTGCGGTCAAAAGGTGGATGGCTATGTGCTTGAGACTGTAGCCCAACCCTC
TGAAATGGTTGAGAAGGCCGAGATCGAACGTCCTCGTCACGATCCCCACGAACAACGGGTTGATTTCCACTTCGCGGCCCCATCTGTTTCCGCCTTCCAAACGCTCTGTT
CTGTTCTGTTTACGGCTCCCAAAGTCAATATCAAAACCTCTCGTGAAGTCTGGAAAGCTCTTGAGGAAGTTTATGGTGCAACAAGCAAGGCTCGGATCAACTCTGTTCGC
GGAATCCTACAAAATAAAAAGAAAGGCGCTATGAAAATGGTCGAATACCTGGCAATCATGAAGCAGGCCTCTGAAAATCTTTACGAAAATTTATTTCTAATCAGACCAAC
AACTCTTCTGGCAGCCCTCAATCTCAACCATATGGTAATCCCCACTCGAATTTTAGGGGTGGCAATCGGAGCAGAGGTCGTGGCCGTAATCATCGTGGAAAGAACTTCAG
GCCTTCCTGTCAGCTCTGCTGTAACACCCGATCTTCACTCTTGGAAGCAGTTTAAGACGAGTATGATAATGTGGTCGGGTCGCAGGGGAATGTCGGGGGCCTTAAGTGCC
GAAGCCGGATTCCGAATCCTGGGCCTGGGACGTTACATCTGCGCTCCCAACAACAACCAAGGAGGGAACAATGGAGGCTCATCTACTTATTTGGCCTCTCCTGAAATTCT
CTCTGATCCGAAGTGGCTAGCTGATAGCAGTGCCACCAATCATGTGACCTCTGACGCTGGAAATTTAGCCATTAAGGCTGATTACAATGGTAAACAGAAGCTCACTGTTA
GTAACGATTCTAAACTCACTGTTTATAATGTTGGAAATAGCTTGGTGCCTTCTTCTGCTCTTACTTCTCCTATATTGCTCAAAGATATACTTCATGTTCCGCATATTCAA
CGAAATTTGATAAGCATAGCTCGTCTTATTGCTGACAATAATGCTTTTGTTGAATTTCACCCTCTCTTGTCTTGTGATGGACAAAGATACAAAGAATCGCCCTCTGTTAG
TTCTCAGCCTTTGCTTACTGCTGCTTCTAATTCCAATAATCTGTCATTATCCAGTTTAGCGTTTGCCTATCATGTTGAAAGTTCTAAGTCATGTAAGGTGTCCAAATCTA
TATGGCACCAATGTCTCGGTCATGCTTCGGTTAAAGTCTTTAATAGTGTCCTTCGATCTTGTAATCAGTCTGCTTATGTTAATGAAGTAGACTCTTTCTATGATGTTTGT
CAATATAGTAAATCGCACCGTTTACCTTTCTCTCGTTCATTATCTTCATCCAAATGCCCGCTTGAACTAATCCATTGTGACCTTTGGGGTCCATCCCCTGTTCCCTCAAC
CCATGGCTATAGATATTACATTAGTTTTGTAGATGAATACTCTCGTTTAACCTATATATATCCACTAAAACTAAAGAGTGATCCCTTTCGAGTCTTTTGCCAATACAAAG
CCCTTGTTGAGAACAAATTTGATAAGAAAGTTAAAACTCTTCAGTCTGATTTGGAAAGGGAATTTAGATCCTTTTCCTCCTTTCTCATAGCTTCTAGGATAGATTCCAGG
CACCCTTGTCCACATACGAGACAACAAAATGGGATTGTAGAAAGGAAACACCATCAGGTTGTCGAAATGGGACTCACATTGATTGCTCAAGCGTCTCTACCATTGAAATT
CTGGTGGGATGCTTTCTCTTCAGCTGTCTTTATCACCAATAGGTTACCTACTCAAGTTCTTGATGGATCCTCACCTTGGGAATGTGCCTTTCGTATAGTCCTTGATATTT
CCTTCCTTAGAGTGTTTGGTTGTGCTTGTTTCCCTTGCCTTCGACCTACCAAGCTCACAAATTCCAATACCACAGTACCAAATGTGTGTTCATTGGTTATAGTGCTGCCC
ACGAAGTCATCCACCACTGCCTCATCTCCTCAGTCTGTTTCTGTTCCAACTCAGTCTCCAAATATAGCTTCACCTACCTTGTCTCCTGTTTCCAGACCTCTCTCACCGAT
AGCTCAATCTCCAGCCCTTTGTCTATCCTCAGGATCTGAAGCATCACATTCTCTGCCTTCCTCTGAGTCCTCTCTTGACAAATCTACTAGGGCTAAAAGTGGGGTCTTCA
AACCAAAAGACTGGGGTTCCTTCTTGTCTACATGTTCTTCCCGTGAGCAACTTTCGGAACCTCGCACCATCAAGGAAGCTTTGACTTCCCCTCTTTGGAGACAGGCCATG
ATGACTGAAGGGGTTGCTCTTTACCACAATAGAACTTGGTCTCTTGTCTCTCCACCTCCGAATACCAATCTCATAGGTAACAAGTGGATTTTCAAATTAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGATACCCAAGAATTCGGCAGTAACTTCAAGCAGGGGATTTATCTTCAGAAAGATGAGAAAGAAATCAGACAAGATTTTTTCAAGAACCCTCTTTACTTAGCTCT
GATACCAATTGTTGAGCGTGAAGGATCAAAGCCTCTCAATCAAAAGCAGAATTTAGATGAAGTAAAGACAAAAGAGCGTATTTTTTTTCTTCCCTGCGCTATGGCGGCTG
ACGATCTTTCGTTCTCTTCAAATGCTGGTATTTCTTCTCCAGCCACCGTACCTATTTCTGCCTCGACCATTGTTAGTTTGTCGTTTGGTCACCCTCTGAGCACTGTGTTA
TCAGTCAAGCTCGATGACAAGAATTTCTTACTGTGGAGAGGAATGGTTCTAGCCATTCTTTGCGGTCAAAAGGTGGATGGCTATGTGCTTGAGACTGTAGCCCAACCCTC
TGAAATGGTTGAGAAGGCCGAGATCGAACGTCCTCGTCACGATCCCCACGAACAACGGGTTGATTTCCACTTCGCGGCCCCATCTGTTTCCGCCTTCCAAACGCTCTGTT
CTGTTCTGTTTACGGCTCCCAAAGTCAATATCAAAACCTCTCGTGAAGTCTGGAAAGCTCTTGAGGAAGTTTATGGTGCAACAAGCAAGGCTCGGATCAACTCTGTTCGC
GGAATCCTACAAAATAAAAAGAAAGGCGCTATGAAAATGGTCGAATACCTGGCAATCATGAAGCAGGCCTCTGAAAATCTTTACGAAAATTTATTTCTAATCAGACCAAC
AACTCTTCTGGCAGCCCTCAATCTCAACCATATGGTAATCCCCACTCGAATTTTAGGGGTGGCAATCGGAGCAGAGGTCGTGGCCGTAATCATCGTGGAAAGAACTTCAG
GCCTTCCTGTCAGCTCTGCTGTAACACCCGATCTTCACTCTTGGAAGCAGTTTAAGACGAGTATGATAATGTGGTCGGGTCGCAGGGGAATGTCGGGGGCCTTAAGTGCC
GAAGCCGGATTCCGAATCCTGGGCCTGGGACGTTACATCTGCGCTCCCAACAACAACCAAGGAGGGAACAATGGAGGCTCATCTACTTATTTGGCCTCTCCTGAAATTCT
CTCTGATCCGAAGTGGCTAGCTGATAGCAGTGCCACCAATCATGTGACCTCTGACGCTGGAAATTTAGCCATTAAGGCTGATTACAATGGTAAACAGAAGCTCACTGTTA
GTAACGATTCTAAACTCACTGTTTATAATGTTGGAAATAGCTTGGTGCCTTCTTCTGCTCTTACTTCTCCTATATTGCTCAAAGATATACTTCATGTTCCGCATATTCAA
CGAAATTTGATAAGCATAGCTCGTCTTATTGCTGACAATAATGCTTTTGTTGAATTTCACCCTCTCTTGTCTTGTGATGGACAAAGATACAAAGAATCGCCCTCTGTTAG
TTCTCAGCCTTTGCTTACTGCTGCTTCTAATTCCAATAATCTGTCATTATCCAGTTTAGCGTTTGCCTATCATGTTGAAAGTTCTAAGTCATGTAAGGTGTCCAAATCTA
TATGGCACCAATGTCTCGGTCATGCTTCGGTTAAAGTCTTTAATAGTGTCCTTCGATCTTGTAATCAGTCTGCTTATGTTAATGAAGTAGACTCTTTCTATGATGTTTGT
CAATATAGTAAATCGCACCGTTTACCTTTCTCTCGTTCATTATCTTCATCCAAATGCCCGCTTGAACTAATCCATTGTGACCTTTGGGGTCCATCCCCTGTTCCCTCAAC
CCATGGCTATAGATATTACATTAGTTTTGTAGATGAATACTCTCGTTTAACCTATATATATCCACTAAAACTAAAGAGTGATCCCTTTCGAGTCTTTTGCCAATACAAAG
CCCTTGTTGAGAACAAATTTGATAAGAAAGTTAAAACTCTTCAGTCTGATTTGGAAAGGGAATTTAGATCCTTTTCCTCCTTTCTCATAGCTTCTAGGATAGATTCCAGG
CACCCTTGTCCACATACGAGACAACAAAATGGGATTGTAGAAAGGAAACACCATCAGGTTGTCGAAATGGGACTCACATTGATTGCTCAAGCGTCTCTACCATTGAAATT
CTGGTGGGATGCTTTCTCTTCAGCTGTCTTTATCACCAATAGGTTACCTACTCAAGTTCTTGATGGATCCTCACCTTGGGAATGTGCCTTTCGTATAGTCCTTGATATTT
CCTTCCTTAGAGTGTTTGGTTGTGCTTGTTTCCCTTGCCTTCGACCTACCAAGCTCACAAATTCCAATACCACAGTACCAAATGTGTGTTCATTGGTTATAGTGCTGCCC
ACGAAGTCATCCACCACTGCCTCATCTCCTCAGTCTGTTTCTGTTCCAACTCAGTCTCCAAATATAGCTTCACCTACCTTGTCTCCTGTTTCCAGACCTCTCTCACCGAT
AGCTCAATCTCCAGCCCTTTGTCTATCCTCAGGATCTGAAGCATCACATTCTCTGCCTTCCTCTGAGTCCTCTCTTGACAAATCTACTAGGGCTAAAAGTGGGGTCTTCA
AACCAAAAGACTGGGGTTCCTTCTTGTCTACATGTTCTTCCCGTGAGCAACTTTCGGAACCTCGCACCATCAAGGAAGCTTTGACTTCCCCTCTTTGGAGACAGGCCATG
ATGACTGAAGGGGTTGCTCTTTACCACAATAGAACTTGGTCTCTTGTCTCTCCACCTCCGAATACCAATCTCATAGGTAACAAGTGGATTTTCAAATTAAAGTAA
Protein sequenceShow/hide protein sequence
MRDTQEFGSNFKQGIYLQKDEKEIRQDFFKNPLYLALIPIVEREGSKPLNQKQNLDEVKTKERIFFLPCAMAADDLSFSSNAGISSPATVPISASTIVSLSFGHPLSTVL
SVKLDDKNFLLWRGMVLAILCGQKVDGYVLETVAQPSEMVEKAEIERPRHDPHEQRVDFHFAAPSVSAFQTLCSVLFTAPKVNIKTSREVWKALEEVYGATSKARINSVR
GILQNKKKGAMKMVEYLAIMKQASENLYENLFLIRPTTLLAALNLNHMVIPTRILGVAIGAEVVAVIIVERTSGLPVSSAVTPDLHSWKQFKTSMIMWSGRRGMSGALSA
EAGFRILGLGRYICAPNNNQGGNNGGSSTYLASPEILSDPKWLADSSATNHVTSDAGNLAIKADYNGKQKLTVSNDSKLTVYNVGNSLVPSSALTSPILLKDILHVPHIQ
RNLISIARLIADNNAFVEFHPLLSCDGQRYKESPSVSSQPLLTAASNSNNLSLSSLAFAYHVESSKSCKVSKSIWHQCLGHASVKVFNSVLRSCNQSAYVNEVDSFYDVC
QYSKSHRLPFSRSLSSSKCPLELIHCDLWGPSPVPSTHGYRYYISFVDEYSRLTYIYPLKLKSDPFRVFCQYKALVENKFDKKVKTLQSDLEREFRSFSSFLIASRIDSR
HPCPHTRQQNGIVERKHHQVVEMGLTLIAQASLPLKFWWDAFSSAVFITNRLPTQVLDGSSPWECAFRIVLDISFLRVFGCACFPCLRPTKLTNSNTTVPNVCSLVIVLP
TKSSTTASSPQSVSVPTQSPNIASPTLSPVSRPLSPIAQSPALCLSSGSEASHSLPSSESSLDKSTRAKSGVFKPKDWGSFLSTCSSREQLSEPRTIKEALTSPLWRQAM
MTEGVALYHNRTWSLVSPPPNTNLIGNKWIFKLK