CuGenDBv2

Lag0038488 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0038488
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: chr2:18537595..18540661
RNA-Seq Expression: Lag0038488
Synteny: Lag0038488
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR025724 - GAG-pre-integrase domain
    IPR036397 - Ribonuclease H superfamily
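
The genome location above uses the form seqid:start..end. As a minimal sketch, the following Python splits such a string and derives the span length. It assumes the coordinates are 1-based and inclusive, which this record does not state; the names GenomeLocation and parse_location are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'chr2:18537595..18540661'."""
    match = re.fullmatch(r"(\S+?):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"not a seqid:start..end location: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("chr2:18537595..18540661")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span reported for Lag0038488 covers 3,067 bases on chr2.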


Homology
GenBank top hits (e value, % identity, alignment)
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum] (e value: 3.4e-161; % identity: 38.67)
Query:  LSTAATSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNP
        +++AA S N + L   +    ++KLDR+N+ LWK+L LP++R  KL+G++ G    P ++I                  TSS+          ++   N 
Subjt:  LSTAATSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNP

Query:  LYESWIVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQV
         +  W   DQ LLGW+ NSMT E+ATQ++  E++++LW   Q L G  +R++  YL+  F   RKG +KM DYL  MKN  D L  AG+PV++  L+ Q 
Subjt:  LYESWIVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQV

Query:  LLGLDEEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYG
        L GLD EYNPVV  +  +T +SW ++ A+LL FE R+E  N L N    + NA+ N+AN     + RG+ S      NN +RG+   G   GRGRG+   
Subjt:  LLGLDEEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYG

Query:  QYNNNKPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANP
           + K  CQVCG   H A+ C+ RF+K           TY   N SA         AF+ +QN         +V D +WY DSGASNHVT       + 
Subjt:  QYNNNKPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANP

Query:  TEYEGNQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAES
        TE+ G   ++VG+G  L I  TG+S L     +L+L ++L VP+I KNL+SVSKLA DNNI VEF ++ CFVKDK TGKV+LKG+L +GLY+        
Subjt:  TEYEGNQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAES

Query:  VDVTKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHT
                          SG   N  S++V+V   K  WHRRLGHP+ KVLD V++ C +    ++ F FC+AC+YGK H LPF +S S A    +LVHT
Subjt:  VDVTKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHT

Query:  DLWGPAPITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERK
        D+WGPAPI +S+GF+YY                                                              GIQ R++CPYTSQQNGRAERK
Subjt:  DLWGPAPITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERK

Query:  HRHVVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKC
        HRH+ E GLTLLAQA MPL +WW+AF T+  LIN LPSQV   +SP  L+  ++PD   L+TFGCAC+PCLKPY  +K  +HT +CV+LG S  HKG+KC
Subjt:  HRHVVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKC

Query:  ISSSGRVFISRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNN
        ++S GR+FISRHV FNED FPFH GF    +   TT + P  S    +P+ T+ +       + S+P  + ++P      A   T       S     NN
Subjt:  ISSSGRVFISRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNN

Query:  HPFVPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPK---VWLTQSTTD
         P          SE N+  + +  +    +   +S N +NT           +H + TR K+GI KPK   + LT++  D
Subjt:  HPFVPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPK---VWLTQSTTD
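
In the alignment blocks of this record, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identities can be tallied from that middle line alone. The function name midline_identity is hypothetical, and the example segment is the first 15 columns of one block above. A percentage computed this way over a short segment is not expected to reproduce the per-hit % identity reported by the database.

```python
from __future__ import annotations


def midline_identity(query: str, midline: str) -> tuple[int, int, float]:
    """Count identities and positives in a BLAST-style midline.

    Returns (identical, positives, percent_identity), where the percentage
    is taken over the number of alignment columns in `query`.
    """
    columns = len(query)
    if columns == 0:
        raise ValueError("empty alignment segment")
    # Trailing spaces of the midline are often stripped; pad them back.
    padded = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in padded if ch.isalpha())
    positives = identical + padded.count("+")
    return identical, positives, 100.0 * identical / columns


# First 15 columns of the block beginning "HRHVVETGLTLLAQA..." above.
query_segment = "HRHVVETGLTLLAQA"
midline_segment = "HRH+ E GLTLLAQA"
identical, positives, pct = midline_identity(query_segment, midline_segment)
print(identical, positives, round(pct, 2))
```

Gap columns (runs of "-" in the Query line) count toward the column total here; whether the database's own % identity does the same is not stated in this record.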

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum] (e value: 1.3e-152; % identity: 37.78)
Query:  TSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESW
        +S   SP    L  I ++KLDR N+ LWK+L L ++R  KL+G++ G    P Q++                  TS++           + +VNP +  W
Subjt:  TSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESW

Query:  IVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLD
        I  DQ LLGWL NSM  ++ATQ++  E++++LW   Q L G  +++   YL+  F  +RKG +KM +YL  MKN +D L  AGSP+++  L+ Q L GLD
Subjt:  IVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLD

Query:  EEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYNNN
         EYNPVV  +  + N+SW ++ A+LL FE RL+  N   N    + NAS N AN  E          F G + N  RGN R    RG   GRG G+ +N 
Subjt:  EEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYNNN

Query:  KPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEG
        K  CQVC   GH A+ C  RF++ Y+G +         G+ SA                    +ASP    D  WY DSGA+NHVT   +      E+ G
Subjt:  KPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEG

Query:  NQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVTK
           ++VG+G  L I  +G++ L    N L+L +VL VP I KNL+SVSKL  DNNI VEF  + C VKDK TG+ LLKG L +GLY+             
Subjt:  NQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVTK

Query:  SASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGP
                               Y++V   K  WHR+LGHP+ KVLD V+K CN+    ++ F FC+AC++GK H LPF  S S       L+H+D+WGP
Subjt:  SASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGP

Query:  APITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRHVV
        API S +GF+YY                                                              GIQ R++CPYTSQQNGRAERKHRHV 
Subjt:  APITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRHVV

Query:  ETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSG
        E GLTLLAQA MPL++WW+AF T+  LIN LPS V   +SP  L+F R+PD  AL+ FGCAC+PCLKPY  +K  FHT +CV++G S  HKG+KCI+S G
Subjt:  ETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSG

Query:  RVFISRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPN--NHPF
        R+F+SRHV FNE+ FPFHGGF               + T  P    T +SS  + + +     Q    P  N       T S   T S+ +S N  N   
Subjt:  RVFISRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPN--NHPF

Query:  VPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPKV
        V +  F  N+ +NS  Q      ++ +   ++   + T +         TH M TR K GI KPK+
Subjt:  VPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPKV

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] (e value: 9.3e-151; % identity: 41.01)
Query:  MTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEYNPVVAMIQGRT
        MT EVATQ++  E++Q++W   Q L G  +R+   +L+  F ++RKG LKM +YL  MK  AD+L  AGS V++  LV+Q L GLD EYNP+V  +  + 
Subjt:  MTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEYNPVVAMIQGRT

Query:  NISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYNNNKPVCQVCGKVGHTA
        +++W EM A+LL +E RLE  N   N  + + N S N+  S  + N+RG+ ++F G      RG Q   G RG   GRG G+   ++ VCQVC K GH A
Subjt:  NISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYNNNKPVCQVCGKVGHTA

Query:  LMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEGNQQVIVGDGNNLNI
          CY RFNK Y G          N +   S+    Q      N N    VASP TV D +WY DSGASNHVT D N +    E +G   + VG+G NL I
Subjt:  LMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEGNQQVIVGDGNNLNI

Query:  AYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVTKSASQSRGASGVNKS
           G+S L     +L+L ++L VP I KNL+S+SKL  DN+I+VEFHD  CFVKDK TG++LL+G + +GLY+                   G++  NK 
Subjt:  AYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVTKSASQSRGASGVNKS

Query:  GVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPITSSNGFRYY--
            +V  S       K  WHR+LGHP+ KVL+ V+K CN+     E F+FC+AC++GK+H LPF NSVS A    DLVH+D+WGPAPI+S +GF+YY  
Subjt:  GVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPITSSNGFRYY--

Query:  ------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRHVVETGLTLLAQASMPL
                                                                    GIQ+R +CPYTS QNGRAERKHRHVVE+GLTLLAQA MPL
Subjt:  ------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRHVVETGLTLLAQASMPL

Query:  QFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSGRVFISRHVQFNEDD
         +WW+AF T+  LIN LP+QV+  KSP + LF + PD  A++TFGCAC+PCLKPY  +K  FHT KCV+LG S  HKG+KC++S+GR+FISRHV FNE  
Subjt:  QFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSGRVFISRHVQFNEDD

Query:  FPFHGGF----GAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSP-TRSLAASPNNHPFVPAFPFDNNSE
        FPFH GF      A+ +T  TS   PIS   P     ++    + ++N S  N + +H +  +        ++S  T + +   NN   +        S+
Subjt:  FPFHGGF----GAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSP-TRSLAASPNNHPFVPAFPFDNNSE

Query:  SNSLPQASPTVEALPNPLSSSPNPSNTPE
           +    P V A+   L     P  T E
Subjt:  SNSLPQASPTVEALPNPLSSSPNPSNTPE

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense] (e value: 3.4e-161; % identity: 38.17)
Query:  ATSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYES
        +++ N++  N L + + ++KLDR N+ LW+++ LPI+R  +L+G++ G    P ++I                            + + ++ + NP +E 
Subjt:  ATSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYES

Query:  WIVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGL
        W   DQ LLGWL NSMT  +ATQ++  E++ +LW   Q L G  +R++  YL+  F  +RKG +KM DYL  MKN AD L  AG+P+++  L+ Q L GL
Subjt:  WIVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGL

Query:  DEEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLA-NSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYN
        D EYNPVV  +  +T +SW ++ A+LL FE R+E  N+L N    + NA+ N+A  S   GN+    +++ G  NN +RG+   G   GRGRGR +    
Subjt:  DEEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLA-NSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYN

Query:  NNKPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEY
          K  CQVCG   H A+ C+ RF+K           TY   N SA+        AF+ +QN         ++ D +WY DSGASNHVT   +   N +E+
Subjt:  NNKPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEY

Query:  EGNQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDV
         G   +IVG+G  L I  TG+S L     +L+L ++L VP I KNL+SVSKLA DNNI VEF ++ CFVKDK TGK +L+G+L +GLY+  +  +     
Subjt:  EGNQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDV

Query:  TKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLW
                               S+YV++   K  WHR+LGHP+ KVLD V+K CN+    ++ F FC+AC+YGK H LPF  S S A    +LVHTD+W
Subjt:  TKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLW

Query:  GPAPITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRH
        GPAPI SS+GF+YY                                                              GIQ R++CPYTSQQNGRAERKHRH
Subjt:  GPAPITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRH

Query:  VVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISS
        + E GLTLLAQA MPL +WW+AF T+  LIN LPS V   KSP  LL  R+PD  +L+ FGCAC+P LKPY  +K  FHT +CV+LG S  HKG+KC++S
Subjt:  VVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISS

Query:  SGRVFISRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNNHPF
         GR+FISRHV FNED FPFH GF               ++T  P    T S SS         P ++       +PA          TR  A S  ++  
Subjt:  SGRVFISRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNNHPF

Query:  VPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKP
              +  +  N    A+P     P    +S N               TH M TR K GI KP
Subjt:  VPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKP

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense] (e value: 9.9e-161; % identity: 37.93)
Query:  LNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESWIVIDQLLLGWL
        L    ++KLDR NF LWK+L LP++R  K +G++ G    P Q++      + +++T                       ++NP Y+ W   DQ LLGWL
Subjt:  LNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESWIVIDQLLLGWL

Query:  YNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEYNPVVAMIQ
         NSMT ++ATQV+  E++++LW   Q L G  +R+   YL+  F  + K  +KM  YL  MKN AD L  AGSP++S  L+ Q L GLD EYNPVV  + 
Subjt:  YNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEYNPVVAMIQ

Query:  GRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNF-YRGNQRGGGNRGRGRGRGYGQYNN-NKPVCQVCGK
         +TNISW +  A+LL FE RL+  N   N    + NAS N A+  E G             N F  RG  RG  +RG   GRG  + +   +P+CQ+CGK
Subjt:  GRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNF-YRGNQRGGGNRGRGRGRGYGQYNN-NKPVCQVCGK

Query:  VGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEGNQQVIVGDG
         GHTA  CY RF+K Y   ++  +   G G+ SA                    VASP    D  WY DSGASNHVT     L +  E  G   ++VG+G
Subjt:  VGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEGNQQVIVGDG

Query:  NNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVTKSASQSRGAS
          L I  +G++ L D    ++L NVL VP I KNL+SVSKL  DNN  VEF +++C+VKDK TGK LLKG L +GLY+                      
Subjt:  NNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVTKSASQSRGAS

Query:  GVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPITSSNGF
          NK   +     +Y+++   K +WHR+LGHP+ KVL+ V+K  N+    ++ F FC+AC++GK H LPF  S S A    DL+HTD+WGPAPI S + F
Subjt:  GVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPITSSNGF

Query:  RYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRHVVETGLTLLAQ
        +YY                                                              GIQ +++CPYTSQQNGRAERKHRHV E GLTLLAQ
Subjt:  RYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRHVVETGLTLLAQ

Query:  ASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSGRVFISRHVQ
        A MPL +WW+AF T+  LIN LPS V   +SP  L+F ++PD  AL+ FGCAC+PCLKPY  +K  FHT +CV+LG S  HKG+KC++S GRVF+SRHV 
Subjt:  ASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSGRVFISRHVQ

Query:  FNEDDFPFHGGFGAADNVTTTTSSSPPIS-TWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNNHPFVPAFPFDNNS
        FNE+ FPF  GF    N     ++  PI    FP  +TT++++        ++ +QQ            P    ++     +   +         F N  
Subjt:  FNEDDFPFHGGFGAADNVTTTTSSSPPIS-TWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNNHPFVPAFPFDNNS

Query:  ESNSLPQAS-PTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPKV
          +S   A   ++E +  P++ +  P       P      TH M TR KAG++KPK+
Subjt:  ESNSLPQAS-PTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPKV

TrEMBL top hits (e value, % identity, alignment)
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment) (e value: 1.6e-161; % identity: 38.17)
Query:  ATSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYES
        +++ N++  N L + + ++KLDR N+ LW+++ LPI+R  +L+G++ G    P ++I                            + + ++ + NP +E 
Subjt:  ATSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYES

Query:  WIVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGL
        W   DQ LLGWL NSMT  +ATQ++  E++ +LW   Q L G  +R++  YL+  F  +RKG +KM DYL  MKN AD L  AG+P+++  L+ Q L GL
Subjt:  WIVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGL

Query:  DEEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLA-NSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYN
        D EYNPVV  +  +T +SW ++ A+LL FE R+E  N+L N    + NA+ N+A  S   GN+    +++ G  NN +RG+   G   GRGRGR +    
Subjt:  DEEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLA-NSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYN

Query:  NNKPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEY
          K  CQVCG   H A+ C+ RF+K           TY   N SA+        AF+ +QN         ++ D +WY DSGASNHVT   +   N +E+
Subjt:  NNKPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEY

Query:  EGNQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDV
         G   +IVG+G  L I  TG+S L     +L+L ++L VP I KNL+SVSKLA DNNI VEF ++ CFVKDK TGK +L+G+L +GLY+  +  +     
Subjt:  EGNQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDV

Query:  TKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLW
                               S+YV++   K  WHR+LGHP+ KVLD V+K CN+    ++ F FC+AC+YGK H LPF  S S A    +LVHTD+W
Subjt:  TKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLW

Query:  GPAPITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRH
        GPAPI SS+GF+YY                                                              GIQ R++CPYTSQQNGRAERKHRH
Subjt:  GPAPITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRH

Query:  VVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISS
        + E GLTLLAQA MPL +WW+AF T+  LIN LPS V   KSP  LL  R+PD  +L+ FGCAC+P LKPY  +K  FHT +CV+LG S  HKG+KC++S
Subjt:  VVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISS

Query:  SGRVFISRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNNHPF
         GR+FISRHV FNED FPFH GF               ++T  P    T S SS         P ++       +PA          TR  A S  ++  
Subjt:  SGRVFISRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNNHPF

Query:  VPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKP
              +  +  N    A+P     P    +S N               TH M TR K GI KP
Subjt:  VPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKP

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment) (e value: 4.8e-161; % identity: 37.93)
Query:  LNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESWIVIDQLLLGWL
        L    ++KLDR NF LWK+L LP++R  K +G++ G    P Q++      + +++T                       ++NP Y+ W   DQ LLGWL
Subjt:  LNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESWIVIDQLLLGWL

Query:  YNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEYNPVVAMIQ
         NSMT ++ATQV+  E++++LW   Q L G  +R+   YL+  F  + K  +KM  YL  MKN AD L  AGSP++S  L+ Q L GLD EYNPVV  + 
Subjt:  YNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEYNPVVAMIQ

Query:  GRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNF-YRGNQRGGGNRGRGRGRGYGQYNN-NKPVCQVCGK
         +TNISW +  A+LL FE RL+  N   N    + NAS N A+  E G             N F  RG  RG  +RG   GRG  + +   +P+CQ+CGK
Subjt:  GRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNF-YRGNQRGGGNRGRGRGRGYGQYNN-NKPVCQVCGK

Query:  VGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEGNQQVIVGDG
         GHTA  CY RF+K Y   ++  +   G G+ SA                    VASP    D  WY DSGASNHVT     L +  E  G   ++VG+G
Subjt:  VGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEGNQQVIVGDG

Query:  NNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVTKSASQSRGAS
          L I  +G++ L D    ++L NVL VP I KNL+SVSKL  DNN  VEF +++C+VKDK TGK LLKG L +GLY+                      
Subjt:  NNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVTKSASQSRGAS

Query:  GVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPITSSNGF
          NK   +     +Y+++   K +WHR+LGHP+ KVL+ V+K  N+    ++ F FC+AC++GK H LPF  S S A    DL+HTD+WGPAPI S + F
Subjt:  GVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPITSSNGF

Query:  RYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRHVVETGLTLLAQ
        +YY                                                              GIQ +++CPYTSQQNGRAERKHRHV E GLTLLAQ
Subjt:  RYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRHVVETGLTLLAQ

Query:  ASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSGRVFISRHVQ
        A MPL +WW+AF T+  LIN LPS V   +SP  L+F ++PD  AL+ FGCAC+PCLKPY  +K  FHT +CV+LG S  HKG+KC++S GRVF+SRHV 
Subjt:  ASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSGRVFISRHVQ

Query:  FNEDDFPFHGGFGAADNVTTTTSSSPPIS-TWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNNHPFVPAFPFDNNS
        FNE+ FPF  GF    N     ++  PI    FP  +TT++++        ++ +QQ            P    ++     +   +         F N  
Subjt:  FNEDDFPFHGGFGAADNVTTTTSSSPPIS-TWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNNHPFVPAFPFDNNS

Query:  ESNSLPQAS-PTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPKV
          +S   A   ++E +  P++ +  P       P      TH M TR KAG++KPK+
Subjt:  ESNSLPQAS-PTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPKV

A0A2Z6MBG6 Integrase catalytic domain-containing protein (e value: 1.6e-161; % identity: 38.67)
Query:  LSTAATSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNP
        +++AA S N + L   +    ++KLDR+N+ LWK+L LP++R  KL+G++ G    P ++I                  TSS+          ++   N 
Subjt:  LSTAATSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNP

Query:  LYESWIVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQV
         +  W   DQ LLGW+ NSMT E+ATQ++  E++++LW   Q L G  +R++  YL+  F   RKG +KM DYL  MKN  D L  AG+PV++  L+ Q 
Subjt:  LYESWIVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQV

Query:  LLGLDEEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYG
        L GLD EYNPVV  +  +T +SW ++ A+LL FE R+E  N L N    + NA+ N+AN     + RG+ S      NN +RG+   G   GRGRG+   
Subjt:  LLGLDEEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYG

Query:  QYNNNKPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANP
           + K  CQVCG   H A+ C+ RF+K           TY   N SA         AF+ +QN         +V D +WY DSGASNHVT       + 
Subjt:  QYNNNKPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANP

Query:  TEYEGNQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAES
        TE+ G   ++VG+G  L I  TG+S L     +L+L ++L VP+I KNL+SVSKLA DNNI VEF ++ CFVKDK TGKV+LKG+L +GLY+        
Subjt:  TEYEGNQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAES

Query:  VDVTKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHT
                          SG   N  S++V+V   K  WHRRLGHP+ KVLD V++ C +    ++ F FC+AC+YGK H LPF +S S A    +LVHT
Subjt:  VDVTKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHT

Query:  DLWGPAPITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERK
        D+WGPAPI +S+GF+YY                                                              GIQ R++CPYTSQQNGRAERK
Subjt:  DLWGPAPITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERK

Query:  HRHVVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKC
        HRH+ E GLTLLAQA MPL +WW+AF T+  LIN LPSQV   +SP  L+  ++PD   L+TFGCAC+PCLKPY  +K  +HT +CV+LG S  HKG+KC
Subjt:  HRHVVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKC

Query:  ISSSGRVFISRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNN
        ++S GR+FISRHV FNED FPFH GF    +   TT + P  S    +P+ T+ +       + S+P  + ++P      A   T       S     NN
Subjt:  ISSSGRVFISRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNN

Query:  HPFVPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPK---VWLTQSTTD
         P          SE N+  + +  +    +   +S N +NT           +H + TR K+GI KPK   + LT++  D
Subjt:  HPFVPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPK---VWLTQSTTD

A0A2Z6P4D5 Integrase catalytic domain-containing protein (e value: 6.3e-153; % identity: 37.78)
Query:  TSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESW
        +S   SP    L  I ++KLDR N+ LWK+L L ++R  KL+G++ G    P Q++                  TS++           + +VNP +  W
Subjt:  TSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESW

Query:  IVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLD
        I  DQ LLGWL NSM  ++ATQ++  E++++LW   Q L G  +++   YL+  F  +RKG +KM +YL  MKN +D L  AGSP+++  L+ Q L GLD
Subjt:  IVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLD

Query:  EEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYNNN
         EYNPVV  +  + N+SW ++ A+LL FE RL+  N   N    + NAS N AN  E          F G + N  RGN R    RG   GRG G+ +N 
Subjt:  EEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYNNN

Query:  KPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEG
        K  CQVC   GH A+ C  RF++ Y+G +         G+ SA                    +ASP    D  WY DSGA+NHVT   +      E+ G
Subjt:  KPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEG

Query:  NQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVTK
           ++VG+G  L I  +G++ L    N L+L +VL VP I KNL+SVSKL  DNNI VEF  + C VKDK TG+ LLKG L +GLY+             
Subjt:  NQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVTK

Query:  SASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGP
                               Y++V   K  WHR+LGHP+ KVLD V+K CN+    ++ F FC+AC++GK H LPF  S S       L+H+D+WGP
Subjt:  SASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGP

Query:  APITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRHVV
        API S +GF+YY                                                              GIQ R++CPYTSQQNGRAERKHRHV 
Subjt:  APITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRHVV

Query:  ETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSG
        E GLTLLAQA MPL++WW+AF T+  LIN LPS V   +SP  L+F R+PD  AL+ FGCAC+PCLKPY  +K  FHT +CV++G S  HKG+KCI+S G
Subjt:  ETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSG

Query:  RVFISRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPN--NHPF
        R+F+SRHV FNE+ FPFHGGF               + T  P    T +SS  + + +     Q    P  N       T S   T S+ +S N  N   
Subjt:  RVFISRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPN--NHPF

Query:  VPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPKV
        V +  F  N+ +NS  Q      ++ +   ++   + T +         TH M TR K GI KPK+
Subjt:  VPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPKV

A0A803PM38 Uncharacterized protein (e value: 3.8e-158; % identity: 38.2)
Query:  LNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESWIVIDQLLLGWL
        LNQ   +KLDR+NF LW+ +   I+R ++L+G+L G  P+P ++         L+ST  +G  +S               QVNP +E WIV DQLLLGWL
Subjt:  LNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESWIVIDQLLLGWL

Query:  YNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEYNPVVAMIQ
        Y SMT  +A +VMG +S+  LW A++ELFG  S+A+ D  R   Q +RKG L MADYLR  +  AD L  AG P     LVS VL GLD EY P+V +I+
Subjt:  YNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEYNPVVAMIQ

Query:  GRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRG---NQRGGGNRGRGRGRGYGQYNNNKPVCQVCG
         R + +W ++   LL  + ++E  ++   S   S+   V +  S  + N+     +  G  NN  RG   N RG  NR RGRG   G+ +  +P CQVCG
Subjt:  GRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRG---NQRGGGNRGRGRGRGYGQYNNNKPVCQVCG

Query:  KVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEGNQQVIVGD
        K GH+A  CY R                                                           GASNH+T++ N +    EY G ++V V +
Subjt:  KVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEGNQQVIVGD

Query:  GNNLNIAYTG-NSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFD-QSRAESVDVTKSASQSR
        GN L I + G  S  T   + L L  +L VPSI KNL+S+SKL  DNN+ VEF    CFVKDK+TG+V+LKG L +GLY+FD  +   S+   +S S   
Subjt:  GNNLNIAYTG-NSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFD-QSRAESVDVTKSASQSR

Query:  GASGVNKSGVSVNVLSSYVNVVVSKVV--WHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPIT
          SG+  S V  NV     N ++  +   WHRRLGHPS++VLD+V+ + N+    N    FCDAC+ GKSH+LPF  +  +A A  +LVHTD+WGP+PI 
Subjt:  GASGVNKSGVSVNVLSSYVNVVVSKVV--WHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPIT

Query:  SSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRHVVETGL
        S+  FRYY                                                              GI  +  CP+TS QNGRAERKHRH+VE GL
Subjt:  SSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRHVVETGL

Query:  TLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSGRVFI
        TLLAQA +P ++WWDAF T+  LIN LP+ VL  K+P E+LF ++PD   L+ FG +CFPCL+ YQ +KF FH+ KCV LG S  HKG+KC+SS+GR++I
Subjt:  TLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSGRVFI

Query:  SRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNNHPFVPAFPF
        SR V FNED+FPF  GF   +   T  S   P  T   +  + SSS +   SS  +    ++ H  P +    P   +         +  +H        
Subjt:  SRHVQFNEDDFPFHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNNHPFVPAFPF

Query:  DNNSESNSLPQA-SPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPKVWLTQS
        D  S+      A + T+E+  +P+ +S +  N        A   THPMITR KAGIFKPK +LTQ+
Subjt:  DNNSESNSLPQA-SPTVEALPNPLSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPKVWLTQS

SwissProt top hits | e value | %identity | Alignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.1e-29 | 22.27
Query:  ESWIVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYL-RQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVL
        E W  +D+     +   ++ +V   ++  ++A+ +W  ++ L+  ++   + YL +Q++            +L V       L   G  +        +L
Subjt:  ESWIVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYL-RQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVL

Query:  LGLDEEY-NPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGG--GNRGRGRGRG
          L   Y N    ++ G+T I   ++ + LL+ EK                         R+    +GQ     GR  ++ R +   G  G RG+ + R 
Subjt:  LGLDEEY-NPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGG--GNRGRGRGRG

Query:  YGQYNNNKPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYG--NGNRSASQVPSTQPTA-FVTNQNVGQIVASPETVVDPNWYADSGASNHVT--AD
          +  N    C  C + GH        F ++   P +G+  T G  N + +A+ V +      F+  +     ++ PE+     W  D+ AS+H T   D
Subjt:  YGQYNNNKPVCQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYG--NGNRSASQVPSTQPTA-FVTNQNVGQIVASPETVVDPNWYADSGASNHVT--AD

Query:  YNCLANPTEYEGNQQVIVGDGNNLNIAYTGNSCLTDGIN-ALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDT--GKVLLKGVLSEGL
          C     ++     V +G+ +   IA  G+ C+   +   L L +V  VP +  NL+S   L  D      +   F   K + T    V+ KGV    L
Subjt:  YNCLANPTEYEGNQQVIVGDGNNLNIAYTGNSCLTDGIN-ALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDT--GKVLLKGVLSEGL

Query:  YRFDQSRAESVDVTKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQ
        YR   + AE      +A+Q                        +S  +WH+R+GH S K L  + K+  +   +    K CD C +GK H + F  S  +
Subjt:  YRFDQSRAESVDVTKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQ

Query:  AHAKFDLVHTDLWGPAPITSSNGFRYY----------------------------------------------------------------GIQIRLTCP
             DLV++D+ GP  I S  G +Y+                                                                GI+   T P
Subjt:  AHAKFDLVHTDLWGPAPITSSNGFRYY----------------------------------------------------------------GIQIRLTCP

Query:  YTSQQNGRAERKHRHVVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVY
         T Q NG AER +R +VE   ++L  A +P  FW +A  T+  LIN  PS  L  + P  +   ++   + L+ FGC  F  +   Q  K    +  C++
Subjt:  YTSQQNGRAERKHRHVVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVY

Query:  LGPSPLHKGHKCISS-SGRVFISRHVQFNEDD
        +G      G++       +V  SR V F E +
Subjt:  LGPSPLHKGHKCISS-SGRVFISRHVQFNEDD

P93293 Uncharacterized mitochondrial protein AtMg00300 | 9.1e-08 | 35.14
Query:  VWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPITSS
        +WH RL H S + ++ +VK+  L + +    KFC+ C YGK+H + F+          D VH+DLWG   +  S
Subjt:  VWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPITSS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 7.3e-82 | 27.52
Query:  NTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESWIVI
        NTS LN  ++ +T  KL  +N+L+W      +   Y+L G L G    PP                         T+ + A     A +VNP Y  W   
Subjt:  NTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESWIVI

Query:  DQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEY
        D+L+   +  +++  V   V    +A ++W  +++++   S      LR   +Q  KG   + DY++ +    D L   G P+     V +VL  L EEY
Subjt:  DQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEY

Query:  NPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFY--RGNQRGGGNRGRGRGRGYGQYNNNK
         PV+  I  +      + P  L    +RL    +   +VS +    +  AN+    N     ++ NG +NN Y  R N        +     +   N +K
Subjt:  NPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFY--RGNQRGGGNRGRGRGRGYGQYNNNK

Query:  PV---CQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEY
        P    CQ+CG  GH+A  C Q                        S V S QP +  T       +A        NW  DSGA++H+T+D+N L+    Y
Subjt:  PV---CQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEY

Query:  EGNQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDV
         G   V+V DG+ + I++TG++ L+     L+L N+L VP+I KNL+SV +L + N + VEF  +   VKD +TG  LL+G   + LY +  + ++ V +
Subjt:  EGNQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDV

Query:  TKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEY-FKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDL
          S S     S                        WH RLGHP+  +L+SV+   +L      + F  C  C   KS+ +PF+ S   +    + +++D+
Subjt:  TKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEY-FKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDL

Query:  WGPAPITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHR
        W  +PI S + +RYY                                                              GI    + P+T + NG +ERKHR
Subjt:  WGPAPITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHR

Query:  HVVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCIS
        H+VETGLTLL+ AS+P  +W  AF  +  LIN LP+ +L  +SP + LFG  P+   LR FGCAC+P L+PY  +K    + +CV+LG S     + C+ 
Subjt:  HVVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCIS

Query:  -SSGRVFISRHVQFNEDDFPFHGGFGAADNV-----------------TTTTSSSPPISTWFPYPVTT--SSSSSPVQSSNISVPN--QQLQHPIPNSPA
          + R++ISRHV+F+E+ FPF         V                  T T   P  S   P+   T  SS S+P ++S +S  N         P+SP 
Subjt:  -SSGRVFISRHVQFNEDDFPFHGGFGAADNV-----------------TTTTSSSPPISTWFPYPVTT--SSSSSPVQSSNISVPN--QQLQHPIPNSPA

Query:  ATPPTCSLSPTRSLAASPNNHPFVPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPV-----------------------THPMI
         T P  +     +                 NN  + S  Q + ++       SSSP+P+ +   S TS  P                        TH M 
Subjt:  ATPPTCSLSPTRSLAASPNNHPFVPAFPFDNNSESNSLPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPV-----------------------THPMI

Query:  TRGKAGIFKP
        TR KAGI KP
Subjt:  TRGKAGIFKP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 4.0e-80 | 27.33
Query:  NTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESWIVI
        NT+ LN  ++ +T  KL  +N+L+W      +   Y+L G L G  P PP  I    V                              +VNP Y  W   
Subjt:  NTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVNPLYESWIVI

Query:  DQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEY
        D+L+   +  +++  V   V    +A ++W  +++++   S      LR I                      D L   G P+     V +VL  L ++Y
Subjt:  DQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEY

Query:  NPVVAMIQGR-TNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYNNNKP
         PV+  I  + T  S +E+   L+  E +L   N+ +     +   +    N+    N RG   ++N   N          G+R   R          KP
Subjt:  NPVVAMIQGR-TNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYNNNKP

Query:  V---CQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYE
            CQ+C   GH+A  C          P   Q ++  N  +S S     QP A +        V SP      NW  DSGA++H+T+D+N L+    Y 
Subjt:  V---CQVCGKVGHTALMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYE

Query:  GNQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVT
        G   V++ DG+ + I +TG++ L     +L L  VL VP+I KNL+SV +L + N + VEF  +   VKD +TG  LL+G   + LY +  + +++V + 
Subjt:  GNQQVIVGDGNNLNIAYTGNSCLTDGINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVT

Query:  KSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEY-FKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLW
         S       S                        WH RLGHPSL +L+SV+   +LP     +    C  C   KSH +PF+NS   +    + +++D+W
Subjt:  KSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVWHRRLGHPSLKVLDSVVKQCNLPTKRNEY-FKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLW

Query:  GPAPITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRH
          +PI S + +RYY                                                              GI    + P+T + NG +ERKHRH
Subjt:  GPAPITSSNGFRYY--------------------------------------------------------------GIQIRLTCPYTSQQNGRAERKHRH

Query:  VVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCIS-
        +VE GLTLL+ AS+P  +W  AF  +  LIN LP+ +L  +SP + LFG+ P+   L+ FGCAC+P L+PY  +K    +++C ++G S     + C+  
Subjt:  VVETGLTLLAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCIS-

Query:  SSGRVFISRHVQFNEDDFPFH-GGFGAADNVTTTTSSSP--PISTWFPYP---------------------------VTTSSSSSPVQSSNISVPNQQLQ
         +GR++ SRHVQF+E  FPF    FG + +    + S+P  P  T  P                              TT  SSS + SS+IS P+    
Subjt:  SSGRVFISRHVQFNEDDFPFH-GGFGAADNVTTTTSSSP--PISTWFPYP---------------------------VTTSSSSSPVQSSNISVPNQQLQ

Query:  HPIPNSPAATPPTCSLSPTRSLAASPN----NHPFVPAFPFDNNSESNS-LPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPV---------------
           P +P+   P  +  P ++  ++ N    N+P  P  P  N+   NS LPQ+  +   +P P +S   P N+P  S TS  P+               
Subjt:  HPIPNSPAATPPTCSLSPTRSLAASPN----NHPFVPAFPFDNNSESNS-LPQASPTVEALPNPLSSSPNPSNTPEISPTSAAPV---------------

Query:  ----THPMITRGKAGIFKPKVWLTQSTT
            TH M TR K GI KP    + +T+
Subjt:  ----THPMITRGKAGIFKPKVWLTQSTT

Arabidopsis top hits | e value | %identity | Alignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 2.2e-04 | 27.19
Query:  GASRSTTAAQVNPLYESWIVIDQLLLGWLYNSMTP-EVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQ
        G    T A  VN     W   D ++   LY ++TP +     +   +++++W  I+  F     A    L    +    G++++ADY R MK  AD+L  
Subjt:  GASRSTTAAQVNPLYESWIVIDQLLLGWLYNSMTP-EVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQ

Query:  AGSPVTSRSLVSQVLLGLDEEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQR
           PVT R+LV  VL GL+ +++ ++ +I+ R      +  A +L  E+    +    N      ++S  +    E       Q S  G     YRG  R
Subjt:  AGSPVTSRSLVSQVLLGLDEEYNPVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQR

Query:  GGGNRGRGRGRGYGQYN
         G N  RGRG  +  YN
Subjt:  GGGNRGRGRGRGYGQYN

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162) | 9.3e-08 | 22.58
Query:  SALSTAATSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQV
        + LS+   SF    +   +    T+ L++ N+ +W+ L   +  S+ + GH+ G                  +STP                        
Subjt:  SALSTAATSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQV

Query:  NPLYES-WIVIDQLLLGWLYNSMTPEVATQVMGFE-SAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSL
         P+ E  W   D L+  W+Y ++T  +   ++    +A++LW +++ LF     A         + +   +L + +Y + +K+ +D L    SP++ R L
Subjt:  NPLYES-WIVIDQLLLGWLYNSMTPEVATQVMGFE-SAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSL

Query:  VSQVLLGLDEEYNPVVAMIQGRTNI-SWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGR
        V  +L GL E+Y+ ++ +I+ ++   S++E  + LL+ E RL   N  K+S+S + + S++         Q      ++   +N  RG  +    + RG 
Subjt:  VSQVLLGLDEEYNPVVAMIQGRTNI-SWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGR

Query:  GRGYGQYNNN
        G   G+YNNN
Subjt:  GRGYGQYNNN

ATMG00300.1 Gag-Pol-related retrotransposon family protein | 6.5e-09 | 35.14
Query:  VWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPITSS
        +WH RL H S + ++ +VK+  L + +    KFC+ C YGK+H + F+          D VH+DLWG   +  S
Subjt:  VWHRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPITSS


Sequences
CDS sequence
ATGGCTAGCGTCATATCTGCCGGATCTTCAGCCCTGTCCACCGCAGCTACCAGTTTCAACACATCGCCACTTAATCAGCTATTAAATCAGATCACGACGATCAAATTGGA
TCGCAGTAACTTTCTTCTTTGGAAGAATTTGGCCCTTCCTATCTTGCGGAGCTACAAATTAGAAGGTCACCTATCAGGTGTTAACCCTCGCCCTCCTCAGTATATTCAAC
CCTCAGTTGTGGAGTCAGATCTGAATTCCACACCAGCAGAGGGAGGAGCAACCAGTTCTGAGACGGTTGCTAGTGGTGCATCTAGGTCTACAACAGCTGCACAGGTTAAT
CCATTGTACGAGTCTTGGATAGTGATTGATCAATTACTACTCGGGTGGTTGTATAACTCTATGACACCTGAAGTAGCTACCCAAGTGATGGGGTTTGAGAGCGCTCAAGA
ATTGTGGGCTGCTATACAAGAGTTATTCGGAGTTCAATCGCGTGCTGAGGAGGATTATCTCCGTCAAATCTTTCAACAATCAAGGAAAGGGAACTTGAAGATGGCAGACT
ATCTGCGGGTCATGAAGAATCACGCAGATAATTTGGGACAGGCAGGGAGTCCGGTTACCTCACGGTCCTTAGTTTCACAAGTTCTTTTGGGATTAGACGAGGAGTACAAT
CCGGTTGTTGCAATGATTCAAGGAAGAACTAATATATCTTGGTCAGAAATGCCGGCAGAACTGCTTGTTTTCGAGAAAAGACTAGAGATGCAAAACACCTTGAAGAACTC
TGTATCCTTCAGTCAAAATGCCTCAGTCAATTTAGCAAACAGCAGGGAGGTTGGAAATCAGCGAGGTCAACAAAGTTCCTTCAATGGGCGCCAGAACAACTTCTATAGGG
GAAATCAACGTGGAGGTGGAAACCGTGGCAGAGGCAGAGGTCGAGGGTATGGACAGTACAACAATAACAAGCCCGTTTGTCAGGTATGTGGGAAGGTAGGTCACACTGCT
CTTATGTGTTACCAACGTTTTAACAAAGAATACTCTGGTCCTTCCCAAGGTCAAAATAGAACATATGGAAATGGAAATCGTTCTGCTAGTCAGGTACCTTCGACCCAACC
TACTGCTTTTGTAACCAATCAGAATGTCGGTCAGATTGTAGCCTCACCTGAAACTGTTGTCGATCCCAATTGGTATGCTGATAGTGGGGCTTCGAATCATGTCACTGCTG
ATTATAATTGCCTTGCTAATCCAACGGAATATGAAGGTAATCAACAAGTTATAGTGGGTGATGGTAATAACTTAAACATTGCTTATACTGGAAATTCATGTTTGACTGAT
GGTATTAATGCTCTTAGCTTGACAAACGTTTTGTGTGTGCCTTCAATTGCAAAGAACTTAGTCAGTGTTTCTAAGTTAGCTCATGATAATAACATATTTGTGGAGTTTCA
TGATTCTTTCTGTTTTGTAAAGGACAAGGATACGGGCAAGGTTCTGCTGAAAGGAGTTCTTAGTGAAGGTCTTTACCGTTTTGATCAATCTCGAGCTGAATCTGTTGATG
TTACAAAGTCTGCCAGTCAGTCCAGAGGTGCTTCTGGTGTTAATAAGTCTGGTGTTTCTGTGAATGTTTTGTCTAGTTATGTTAATGTTGTTGTGTCCAAAGTTGTTTGG
CATAGGCGATTAGGCCATCCTTCCCTTAAAGTTCTTGATTCAGTGGTTAAACAGTGTAATCTCCCCACTAAGAGAAATGAATATTTCAAGTTTTGTGATGCCTGCAAGTA
TGGAAAGTCTCATGCATTGCCTTTTGCTAACTCTGTCTCTCAAGCACATGCTAAGTTTGATCTCGTTCATACAGACCTTTGGGGGCCTGCTCCAATAACCTCTTCTAATG
GCTTTAGATACTATGGTATCCAAATTAGGTTGACGTGTCCCTACACCTCTCAACAAAACGGGAGAGCCGAGAGAAAACATCGACATGTTGTGGAAACCGGGCTCACGTTA
CTTGCTCAGGCCTCTATGCCACTTCAGTTCTGGTGGGATGCATTCTTAACTTCTGCTCTACTGATTAATGTTCTTCCATCGCAGGTCCTTGGTGGCAAGTCACCAGTGGA
ACTCTTATTTGGCAGGAAACCGGATTTGGCTGCACTAAGAACGTTTGGATGTGCCTGCTTTCCGTGTTTGAAACCTTATCAAGCCAACAAGTTCCATTTTCACACCGAGA
AATGCGTTTATTTGGGGCCTAGTCCACTTCACAAAGGCCACAAATGCATCAGTTCAAGTGGCAGAGTGTTCATTTCACGTCATGTTCAGTTCAATGAGGATGATTTTCCG
TTTCATGGTGGTTTTGGTGCAGCTGATAACGTGACTACAACCACCAGTTCCTCTCCCCCGATATCCACCTGGTTTCCTTATCCTGTAACAACTTCATCCAGTTCATCCCC
TGTTCAGTCTTCCAATATAAGTGTACCTAATCAACAATTACAACATCCAATACCTAACAGTCCTGCTGCCACACCACCTACCTGTAGTCTGTCACCAACCAGAAGTCTTG
CTGCCTCCCCAAACAATCACCCCTTCGTTCCAGCTTTCCCATTCGACAATAATTCAGAGTCAAATTCTTTGCCCCAAGCCTCACCTACTGTCGAGGCCTTACCTAATCCT
TTATCCTCATCTCCAAACCCATCAAACACTCCTGAAATAAGTCCTACATCAGCTGCACCAGTCACCCATCCCATGATCACAAGAGGCAAGGCCGGAATTTTTAAACCAAA
AGTTTGGTTAACTCAATCAACAACGGACTAG
mRNA sequence
ATGGCTAGCGTCATATCTGCCGGATCTTCAGCCCTGTCCACCGCAGCTACCAGTTTCAACACATCGCCACTTAATCAGCTATTAAATCAGATCACGACGATCAAATTGGA
TCGCAGTAACTTTCTTCTTTGGAAGAATTTGGCCCTTCCTATCTTGCGGAGCTACAAATTAGAAGGTCACCTATCAGGTGTTAACCCTCGCCCTCCTCAGTATATTCAAC
CCTCAGTTGTGGAGTCAGATCTGAATTCCACACCAGCAGAGGGAGGAGCAACCAGTTCTGAGACGGTTGCTAGTGGTGCATCTAGGTCTACAACAGCTGCACAGGTTAAT
CCATTGTACGAGTCTTGGATAGTGATTGATCAATTACTACTCGGGTGGTTGTATAACTCTATGACACCTGAAGTAGCTACCCAAGTGATGGGGTTTGAGAGCGCTCAAGA
ATTGTGGGCTGCTATACAAGAGTTATTCGGAGTTCAATCGCGTGCTGAGGAGGATTATCTCCGTCAAATCTTTCAACAATCAAGGAAAGGGAACTTGAAGATGGCAGACT
ATCTGCGGGTCATGAAGAATCACGCAGATAATTTGGGACAGGCAGGGAGTCCGGTTACCTCACGGTCCTTAGTTTCACAAGTTCTTTTGGGATTAGACGAGGAGTACAAT
CCGGTTGTTGCAATGATTCAAGGAAGAACTAATATATCTTGGTCAGAAATGCCGGCAGAACTGCTTGTTTTCGAGAAAAGACTAGAGATGCAAAACACCTTGAAGAACTC
TGTATCCTTCAGTCAAAATGCCTCAGTCAATTTAGCAAACAGCAGGGAGGTTGGAAATCAGCGAGGTCAACAAAGTTCCTTCAATGGGCGCCAGAACAACTTCTATAGGG
GAAATCAACGTGGAGGTGGAAACCGTGGCAGAGGCAGAGGTCGAGGGTATGGACAGTACAACAATAACAAGCCCGTTTGTCAGGTATGTGGGAAGGTAGGTCACACTGCT
CTTATGTGTTACCAACGTTTTAACAAAGAATACTCTGGTCCTTCCCAAGGTCAAAATAGAACATATGGAAATGGAAATCGTTCTGCTAGTCAGGTACCTTCGACCCAACC
TACTGCTTTTGTAACCAATCAGAATGTCGGTCAGATTGTAGCCTCACCTGAAACTGTTGTCGATCCCAATTGGTATGCTGATAGTGGGGCTTCGAATCATGTCACTGCTG
ATTATAATTGCCTTGCTAATCCAACGGAATATGAAGGTAATCAACAAGTTATAGTGGGTGATGGTAATAACTTAAACATTGCTTATACTGGAAATTCATGTTTGACTGAT
GGTATTAATGCTCTTAGCTTGACAAACGTTTTGTGTGTGCCTTCAATTGCAAAGAACTTAGTCAGTGTTTCTAAGTTAGCTCATGATAATAACATATTTGTGGAGTTTCA
TGATTCTTTCTGTTTTGTAAAGGACAAGGATACGGGCAAGGTTCTGCTGAAAGGAGTTCTTAGTGAAGGTCTTTACCGTTTTGATCAATCTCGAGCTGAATCTGTTGATG
TTACAAAGTCTGCCAGTCAGTCCAGAGGTGCTTCTGGTGTTAATAAGTCTGGTGTTTCTGTGAATGTTTTGTCTAGTTATGTTAATGTTGTTGTGTCCAAAGTTGTTTGG
CATAGGCGATTAGGCCATCCTTCCCTTAAAGTTCTTGATTCAGTGGTTAAACAGTGTAATCTCCCCACTAAGAGAAATGAATATTTCAAGTTTTGTGATGCCTGCAAGTA
TGGAAAGTCTCATGCATTGCCTTTTGCTAACTCTGTCTCTCAAGCACATGCTAAGTTTGATCTCGTTCATACAGACCTTTGGGGGCCTGCTCCAATAACCTCTTCTAATG
GCTTTAGATACTATGGTATCCAAATTAGGTTGACGTGTCCCTACACCTCTCAACAAAACGGGAGAGCCGAGAGAAAACATCGACATGTTGTGGAAACCGGGCTCACGTTA
CTTGCTCAGGCCTCTATGCCACTTCAGTTCTGGTGGGATGCATTCTTAACTTCTGCTCTACTGATTAATGTTCTTCCATCGCAGGTCCTTGGTGGCAAGTCACCAGTGGA
ACTCTTATTTGGCAGGAAACCGGATTTGGCTGCACTAAGAACGTTTGGATGTGCCTGCTTTCCGTGTTTGAAACCTTATCAAGCCAACAAGTTCCATTTTCACACCGAGA
AATGCGTTTATTTGGGGCCTAGTCCACTTCACAAAGGCCACAAATGCATCAGTTCAAGTGGCAGAGTGTTCATTTCACGTCATGTTCAGTTCAATGAGGATGATTTTCCG
TTTCATGGTGGTTTTGGTGCAGCTGATAACGTGACTACAACCACCAGTTCCTCTCCCCCGATATCCACCTGGTTTCCTTATCCTGTAACAACTTCATCCAGTTCATCCCC
TGTTCAGTCTTCCAATATAAGTGTACCTAATCAACAATTACAACATCCAATACCTAACAGTCCTGCTGCCACACCACCTACCTGTAGTCTGTCACCAACCAGAAGTCTTG
CTGCCTCCCCAAACAATCACCCCTTCGTTCCAGCTTTCCCATTCGACAATAATTCAGAGTCAAATTCTTTGCCCCAAGCCTCACCTACTGTCGAGGCCTTACCTAATCCT
TTATCCTCATCTCCAAACCCATCAAACACTCCTGAAATAAGTCCTACATCAGCTGCACCAGTCACCCATCCCATGATCACAAGAGGCAAGGCCGGAATTTTTAAACCAAA
AGTTTGGTTAACTCAATCAACAACGGACTAG
Protein sequence
MASVISAGSSALSTAATSFNTSPLNQLLNQITTIKLDRSNFLLWKNLALPILRSYKLEGHLSGVNPRPPQYIQPSVVESDLNSTPAEGGATSSETVASGASRSTTAAQVN
PLYESWIVIDQLLLGWLYNSMTPEVATQVMGFESAQELWAAIQELFGVQSRAEEDYLRQIFQQSRKGNLKMADYLRVMKNHADNLGQAGSPVTSRSLVSQVLLGLDEEYN
PVVAMIQGRTNISWSEMPAELLVFEKRLEMQNTLKNSVSFSQNASVNLANSREVGNQRGQQSSFNGRQNNFYRGNQRGGGNRGRGRGRGYGQYNNNKPVCQVCGKVGHTA
LMCYQRFNKEYSGPSQGQNRTYGNGNRSASQVPSTQPTAFVTNQNVGQIVASPETVVDPNWYADSGASNHVTADYNCLANPTEYEGNQQVIVGDGNNLNIAYTGNSCLTD
GINALSLTNVLCVPSIAKNLVSVSKLAHDNNIFVEFHDSFCFVKDKDTGKVLLKGVLSEGLYRFDQSRAESVDVTKSASQSRGASGVNKSGVSVNVLSSYVNVVVSKVVW
HRRLGHPSLKVLDSVVKQCNLPTKRNEYFKFCDACKYGKSHALPFANSVSQAHAKFDLVHTDLWGPAPITSSNGFRYYGIQIRLTCPYTSQQNGRAERKHRHVVETGLTL
LAQASMPLQFWWDAFLTSALLINVLPSQVLGGKSPVELLFGRKPDLAALRTFGCACFPCLKPYQANKFHFHTEKCVYLGPSPLHKGHKCISSSGRVFISRHVQFNEDDFP
FHGGFGAADNVTTTTSSSPPISTWFPYPVTTSSSSSPVQSSNISVPNQQLQHPIPNSPAATPPTCSLSPTRSLAASPNNHPFVPAFPFDNNSESNSLPQASPTVEALPNP
LSSSPNPSNTPEISPTSAAPVTHPMITRGKAGIFKPKVWLTQSTTD