; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0006095 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0006095
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr6:37197227..37200093
RNA-Seq ExpressionLag0006095
SyntenyLag0006095
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]1.3e-18944.47Show/hide
Query:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHL-------------TDEPKGLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWI
        N L  + S+KLDRNN+ LW+++ LP++R  KL+G++             +D  K     N  +  W A DQ L+GW+ NSMTTE+A+Q+  CET+++LW 
Subjt:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHL-------------TDEPKGLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWI

Query:  ALQEFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQ
          Q   G  + SQ  YLK      RK  MKM +YL  MK+  D L LAG+PV   DLI   + GLD E+ P+VV + +Q  ++W  +Q+QLLTFE R EQ
Subjt:  ALQEFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQ

Query:  LQALKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQG
        L  L  +      + N  A   N+++ +    N ++ G+N++    GRGRG+   + K  CQVCG + H A  CF+RF+K +  +R +++AG     KQG
Subjt:  LQALKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQG

Query:  VRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNAV
          N  A +A+  ++ D  WY DSGASNH+T         +++ G   + VGNG +L I + GS    SK ++L L  +L+VP I+KNL+S+S+L  DN +
Subjt:  VRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNAV

Query:  VVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTLN
        +VEF ++ C VKDK++GKV+L+G LKDGLYQ+              +K+N +  AFVS                         K +WH R GHP+ + L+
Subjt:  VVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTLN

Query:  QLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVEN
        ++L SCK+K   ++N SFCEACQY K H LPF +S S A +PLEL+HTD+WGPAPI +S+G++YY+ F+DDFSRFTWI+PLK KS+    F QFK     
Subjt:  QLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVEN

Query:  LFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFL
                       N  EN F  +IK+++CD GGE+KP+   A E GI  +++CP+TS QNGR ERKHRHI E GL LLA A+MPL+YWWEAF TAV+L
Subjt:  LFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFL

Query:  INRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQSTT
        INR+P+ V Q + PY+++  ++PDY  L+ FG ACYPCL+PY QHK  +HTTRCVFLGYSN+H+GY+CL+  GRI++SRHV F+E+ FPF D F++  + 
Subjt:  INRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQSTT

Query:  LSLLI
        L   I
Subjt:  LSLLI

GAU51268.1 hypothetical protein TSUD_412550 [Trifolium subterraneum]4.8e-17943.23Show/hide
Query:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHL---TDEPKGL-------QMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQ
        N L    S+KLDR+N+ LW+++ L ++R  KL+G++   T+ P+         +  NP++  WIA DQ L+GWL NSM  ++A+Q+  CET+++LW   Q
Subjt:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHL---TDEPKGL-------QMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQ

Query:  EFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQLQA
           G  + S+  YLK     TRK  MKM EYL  MK+ SD L LAGSP+   DL+   + GLD E+ P+VV + +Q  ++W  VQ+QLL FE R +Q   
Subjt:  EFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQLQA

Query:  LKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSR----GRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQ
          G+      + N +A   N+   + ++ N    GN  + N R    GRG+GR   N+K  CQVC   GH A  C YRF++ +     S  A      KQ
Subjt:  LKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSR----GRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQ

Query:  GVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNA
        G  + +A IA+P +  D  WY DSGA+NH+T   +     +++ G   + VGNG +L I + GS    +K  NL L  +L+VPQI+KNL+S+S+LT DN 
Subjt:  GVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNA

Query:  VVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTL
        ++VEF  + C VKDK++G+ LL+G LKDGLYQ+                               +  +P +  S          K +WH + GHP+ + L
Subjt:  VVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTL

Query:  NQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVE
        +++L  C +K   ++  SFCEACQ+ K H LPF  S S   +PL LIH+D+WGPAPI S +G++YY+ F+DDFSRFTWIFPLK KSD    F QFK    
Subjt:  NQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVE

Query:  NLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVF
                        N  EN F  KIKI++CD GGE+K +   + E GI  +++CP+TS QNGR ERKHRH+ E GL LLA AKMPL YWWEAF TAV+
Subjt:  NLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVF

Query:  LINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQST
        LINR+P++V   + PY++++  +PDYN+L+ FG ACYPCL+PY QHK  FHTTRCVF+GYSN+H+GY+C++  GRI+VSRHV F+E  FPF   F+D   
Subjt:  LINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQST

Query:  TLSLL
         L  L
Subjt:  TLSLL

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]6.5e-17645.25Show/hide
Query:  MTTEVASQVTSCETAQELWIALQEFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQE
        MT EVA+Q+  CET+Q++W   Q   G  + S+  +LK    +TRK  +KM EYL+ MK  +D+L LAGS V   DL++  +AGLD E+ PIVV + ++E
Subjt:  MTTEVASQVTSCETAQELWIALQEFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQE

Query:  -MTWNQVQSQLLTFEKRQEQLQALKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEK
         +TW ++Q+QLLT+E R EQ+   +  +++N PS+N++    N+   +++       G  N+    GRGRGR     + +CQVC K GH A+ C++RF K
Subjt:  -MTWNQVQSQLLTFEKRQEQLQALKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEK

Query:  NFNNNRGSYTAGSNNQSKQGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLH
        N+           + + K+   N  A +A+P  + D  WY DSGASNH+T D N +   ++  G   +TVGNG  L I + G S + ++ ++L LK +L+
Subjt:  NFNNNRGSYTAGSNNQSKQGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLH

Query:  VPQISKNLISISRLTMDNAVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVL
        VP+I+KNL+SIS+LT DN + VEFHD  C VKDK++G++LLEG +KDGLYQ+P    +++ R                                    V 
Subjt:  VPQISKNLISISRLTMDNAVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVL

Query:  YCSKSTWHSRFGHPSLRTLNQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFP
        +  K TWH + GHP+ + LN+++  C ++    EN  FCEACQ+ K+H LPF NS S A +PL+L+H+D+WGPAPI+S +G++YY+ FLDD+SRFTWI+P
Subjt:  YCSKSTWHSRFGHPSLRTLNQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFP

Query:  LKHKSDATTLFKQFKNQVENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALL
        LK KSD    F QF+                    N VEN F  +IK ++CD GGEFK L     + GI ++ +CP+TSAQNGR ERKHRH+VE+GL LL
Subjt:  LKHKSDATTLFKQFKNQVENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALL

Query:  AHAKMPLNYWWEAFHTAVFLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRH
        A AKMPL+YWWEAF TAVFLINR+PT V++ K PY  L+++ PDY +++ FG ACYPCL+PY QHK  FHTT+CVFLGYS +H+GY+CL+ TGRI++SRH
Subjt:  AHAKMPLNYWWEAFHTAVFLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRH

Query:  VCFDEEKFPFADKFMD
        V F+E  FPF D F++
Subjt:  VCFDEEKFPFADKFMD

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]2.7e-19045.66Show/hide
Query:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHLTDEPK----------GLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQ
        N L    S+KLDR+N+ LWQ++ LPI+R  +L+G++  + K            +  NPE++ W A DQ L+GWL NSMT  +A+Q+  CET+ +LW   Q
Subjt:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHLTDEPK----------GLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQ

Query:  EFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQLQA
           G  + SQ  YLK     TRK  MKM +YL  MK+ +D L LAG+P+   DLI   + GLD E+ P+VV + +Q  ++W  +Q+QLLTFE R EQL +
Subjt:  EFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQLQA

Query:  LKGVISINQPSANLAATGTNQNNQQASQHN-RSFNGNNNQKNSR----GRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSK
        L   +++N  +AN+A    ++ N+  S +N R  N N    N R    GRGRGR+F   K  CQVCG   H A  CFYRF+K +  +R +++A   N  K
Subjt:  LKGVISINQPSANLAATGTNQNNQQASQHN-RSFNGNNNQKNSR----GRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSK

Query:  QGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDN
        QG  N  A +A+  ++ D  WY DSGASNH+T   +     S++ G   + VGNG +L I + GS    SK ++L L  +L+VP+I+KNL+S+S+L  DN
Subjt:  QGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDN

Query:  AVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRT
         ++VEF ++ C VKDK++GK +L G LKDGLYQ+              S+K+ +  A+VS                         K +WH + GHP+ + 
Subjt:  AVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRT

Query:  LNQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQV
        L+ +L SC +K   ++  SFCEACQY K H LPF  S S A + LEL+HTD+WGPAPI SS+G++YY+ F+DDF+RFTWI+PLK KSD    F QFK   
Subjt:  LNQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQV

Query:  ENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAV
                         N VEN F  KIK ++CD GGE+KP+   A E GI  +++CP+TS QNGR ERKHRHI E GL LLA AKMPLNYWWEAF TAV
Subjt:  ENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAV

Query:  FLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQS
        +LINR+P++V   K PY++L+  +PDYNSL+ FG ACYP L+PY +HK  FHTTRCVFLGYSN+H+GY+C++  GRI++SRHV F+E+ FPF D F++  
Subjt:  FLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQS

Query:  TTLSLL
          L  L
Subjt:  TTLSLL

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]6.0e-19044.19Show/hide
Query:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHLTDEPK----------GLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQ
        N L    S+KLDR+NF LW+++ LP++R  K +G++    K            +  NP+Y  W A DQ L+GWL NSMT ++A+QV  CET+++LW   Q
Subjt:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHLTDEPK----------GLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQ

Query:  EFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQLQA
           G  + S+  YLK     T K  MKM +YL+ MK+ +D L LAGSP+   DL+   + GLD E+ P+VV + +Q  ++W   Q+QLL FE R +QL  
Subjt:  EFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQLQA

Query:  LKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQGVRN
            I++N  SAN A+   +  N+  S+    + G+N++    GRGR R     +PICQ+CGK GHTAA C+YRF+K       SYT    N   +G  +
Subjt:  LKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQGVRN

Query:  PTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNAVVVE
         +A +A+P +  D  WY DSGASNH+T     L   ++  G   + VGNG +L I + GS    +K  ++ L+ +L+VP+I+KNL+S+S+LT+DN  +VE
Subjt:  PTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNAVVVE

Query:  FHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTLNQLL
        F +++C VKDK++GK LL+G LKDGLYQ+                             S N++ P     P +++ L   K  WH + GHP+ + L ++L
Subjt:  FHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTLNQLL

Query:  ASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVENLFE
            +K   ++  +FCEACQ+ K H LPF  S S A +PL+LIHTD+WGPAPI S + ++YY+ FLDDFSRFTWIFPLK KS+    F QFK        
Subjt:  ASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVENLFE

Query:  TKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFLINR
                    N VEN F  KIK++RCD GGE+KP+   A + GI  Q++CP+TS QNGR ERKHRH+ E GL LLA AKMPL+YWWEAF TAV+LINR
Subjt:  TKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFLINR

Query:  MPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQSTTLSL
        +P++V   + PYT+++ ++PDY +L+ FG ACYPCL+PY QHK  FHTTRCVFLGYSN+H+GY+C++  GR++VSRHV F+E  FPF + F+D    + +
Subjt:  MPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQSTTLSL

Query:  L
        +
Subjt:  L

TrEMBL top hitse value%identityAlignment
A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)1.3e-19045.66Show/hide
Query:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHLTDEPK----------GLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQ
        N L    S+KLDR+N+ LWQ++ LPI+R  +L+G++  + K            +  NPE++ W A DQ L+GWL NSMT  +A+Q+  CET+ +LW   Q
Subjt:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHLTDEPK----------GLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQ

Query:  EFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQLQA
           G  + SQ  YLK     TRK  MKM +YL  MK+ +D L LAG+P+   DLI   + GLD E+ P+VV + +Q  ++W  +Q+QLLTFE R EQL +
Subjt:  EFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQLQA

Query:  LKGVISINQPSANLAATGTNQNNQQASQHN-RSFNGNNNQKNSR----GRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSK
        L   +++N  +AN+A    ++ N+  S +N R  N N    N R    GRGRGR+F   K  CQVCG   H A  CFYRF+K +  +R +++A   N  K
Subjt:  LKGVISINQPSANLAATGTNQNNQQASQHN-RSFNGNNNQKNSR----GRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSK

Query:  QGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDN
        QG  N  A +A+  ++ D  WY DSGASNH+T   +     S++ G   + VGNG +L I + GS    SK ++L L  +L+VP+I+KNL+S+S+L  DN
Subjt:  QGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDN

Query:  AVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRT
         ++VEF ++ C VKDK++GK +L G LKDGLYQ+              S+K+ +  A+VS                         K +WH + GHP+ + 
Subjt:  AVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRT

Query:  LNQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQV
        L+ +L SC +K   ++  SFCEACQY K H LPF  S S A + LEL+HTD+WGPAPI SS+G++YY+ F+DDF+RFTWI+PLK KSD    F QFK   
Subjt:  LNQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQV

Query:  ENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAV
                         N VEN F  KIK ++CD GGE+KP+   A E GI  +++CP+TS QNGR ERKHRHI E GL LLA AKMPLNYWWEAF TAV
Subjt:  ENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAV

Query:  FLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQS
        +LINR+P++V   K PY++L+  +PDYNSL+ FG ACYP L+PY +HK  FHTTRCVFLGYSN+H+GY+C++  GRI++SRHV F+E+ FPF D F++  
Subjt:  FLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQS

Query:  TTLSLL
          L  L
Subjt:  TTLSLL

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)2.9e-19044.19Show/hide
Query:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHLTDEPK----------GLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQ
        N L    S+KLDR+NF LW+++ LP++R  K +G++    K            +  NP+Y  W A DQ L+GWL NSMT ++A+QV  CET+++LW   Q
Subjt:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHLTDEPK----------GLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQ

Query:  EFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQLQA
           G  + S+  YLK     T K  MKM +YL+ MK+ +D L LAGSP+   DL+   + GLD E+ P+VV + +Q  ++W   Q+QLL FE R +QL  
Subjt:  EFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQLQA

Query:  LKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQGVRN
            I++N  SAN A+   +  N+  S+    + G+N++    GRGR R     +PICQ+CGK GHTAA C+YRF+K       SYT    N   +G  +
Subjt:  LKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQGVRN

Query:  PTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNAVVVE
         +A +A+P +  D  WY DSGASNH+T     L   ++  G   + VGNG +L I + GS    +K  ++ L+ +L+VP+I+KNL+S+S+LT+DN  +VE
Subjt:  PTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNAVVVE

Query:  FHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTLNQLL
        F +++C VKDK++GK LL+G LKDGLYQ+                             S N++ P     P +++ L   K  WH + GHP+ + L ++L
Subjt:  FHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTLNQLL

Query:  ASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVENLFE
            +K   ++  +FCEACQ+ K H LPF  S S A +PL+LIHTD+WGPAPI S + ++YY+ FLDDFSRFTWIFPLK KS+    F QFK        
Subjt:  ASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVENLFE

Query:  TKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFLINR
                    N VEN F  KIK++RCD GGE+KP+   A + GI  Q++CP+TS QNGR ERKHRH+ E GL LLA AKMPL+YWWEAF TAV+LINR
Subjt:  TKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFLINR

Query:  MPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQSTTLSL
        +P++V   + PYT+++ ++PDY +L+ FG ACYPCL+PY QHK  FHTTRCVFLGYSN+H+GY+C++  GR++VSRHV F+E  FPF + F+D    + +
Subjt:  MPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQSTTLSL

Query:  L
        +
Subjt:  L

A0A2Z6MBG6 Integrase catalytic domain-containing protein6.5e-19044.47Show/hide
Query:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHL-------------TDEPKGLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWI
        N L  + S+KLDRNN+ LW+++ LP++R  KL+G++             +D  K     N  +  W A DQ L+GW+ NSMTTE+A+Q+  CET+++LW 
Subjt:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHL-------------TDEPKGLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWI

Query:  ALQEFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQ
          Q   G  + SQ  YLK      RK  MKM +YL  MK+  D L LAG+PV   DLI   + GLD E+ P+VV + +Q  ++W  +Q+QLLTFE R EQ
Subjt:  ALQEFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQ

Query:  LQALKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQG
        L  L  +      + N  A   N+++ +    N ++ G+N++    GRGRG+   + K  CQVCG + H A  CF+RF+K +  +R +++AG     KQG
Subjt:  LQALKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQG

Query:  VRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNAV
          N  A +A+  ++ D  WY DSGASNH+T         +++ G   + VGNG +L I + GS    SK ++L L  +L+VP I+KNL+S+S+L  DN +
Subjt:  VRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNAV

Query:  VVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTLN
        +VEF ++ C VKDK++GKV+L+G LKDGLYQ+              +K+N +  AFVS                         K +WH R GHP+ + L+
Subjt:  VVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTLN

Query:  QLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVEN
        ++L SCK+K   ++N SFCEACQY K H LPF +S S A +PLEL+HTD+WGPAPI +S+G++YY+ F+DDFSRFTWI+PLK KS+    F QFK     
Subjt:  QLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVEN

Query:  LFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFL
                       N  EN F  +IK+++CD GGE+KP+   A E GI  +++CP+TS QNGR ERKHRHI E GL LLA A+MPL+YWWEAF TAV+L
Subjt:  LFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFL

Query:  INRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQSTT
        INR+P+ V Q + PY+++  ++PDY  L+ FG ACYPCL+PY QHK  +HTTRCVFLGYSN+H+GY+CL+  GRI++SRHV F+E+ FPF D F++  + 
Subjt:  INRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQSTT

Query:  LSLLI
        L   I
Subjt:  LSLLI

A0A2Z6P4D5 Integrase catalytic domain-containing protein2.3e-17943.23Show/hide
Query:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHL---TDEPKGL-------QMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQ
        N L    S+KLDR+N+ LW+++ L ++R  KL+G++   T+ P+         +  NP++  WIA DQ L+GWL NSM  ++A+Q+  CET+++LW   Q
Subjt:  NLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHL---TDEPKGL-------QMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQ

Query:  EFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQLQA
           G  + S+  YLK     TRK  MKM EYL  MK+ SD L LAGSP+   DL+   + GLD E+ P+VV + +Q  ++W  VQ+QLL FE R +Q   
Subjt:  EFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLTFEKRQEQLQA

Query:  LKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSR----GRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQ
          G+      + N +A   N+   + ++ N    GN  + N R    GRG+GR   N+K  CQVC   GH A  C YRF++ +     S  A      KQ
Subjt:  LKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSR----GRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQ

Query:  GVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNA
        G  + +A IA+P +  D  WY DSGA+NH+T   +     +++ G   + VGNG +L I + GS    +K  NL L  +L+VPQI+KNL+S+S+LT DN 
Subjt:  GVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNA

Query:  VVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTL
        ++VEF  + C VKDK++G+ LL+G LKDGLYQ+                               +  +P +  S          K +WH + GHP+ + L
Subjt:  VVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTL

Query:  NQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVE
        +++L  C +K   ++  SFCEACQ+ K H LPF  S S   +PL LIH+D+WGPAPI S +G++YY+ F+DDFSRFTWIFPLK KSD    F QFK    
Subjt:  NQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVE

Query:  NLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVF
                        N  EN F  KIKI++CD GGE+K +   + E GI  +++CP+TS QNGR ERKHRH+ E GL LLA AKMPL YWWEAF TAV+
Subjt:  NLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVF

Query:  LINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQST
        LINR+P++V   + PY++++  +PDYN+L+ FG ACYPCL+PY QHK  FHTTRCVF+GYSN+H+GY+C++  GRI+VSRHV F+E  FPF   F+D   
Subjt:  LINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQST

Query:  TLSLL
         L  L
Subjt:  TLSLL

A0A803PM38 Uncharacterized protein8.5e-18242.89Show/hide
Query:  PAFTNLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHL-------------TD---EPKGLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCE
        P F + LNQ  ++KLDRNNF LW+ +   I+R ++L+G+L             TD       +   NP ++ WI  DQLL+GWLY SMT  +A +V  C+
Subjt:  PAFTNLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHL-------------TD---EPKGLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCE

Query:  TAQELWIALQEFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLT
        ++  LW AL+E +G  S ++ D  +  +Q  RK ++ M +YL   + ++D L LAG P     L+S V++GLD E+ P+V++I+ +   TW Q+Q  LL+
Subjt:  TAQELWIALQEFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQ-EMTWNQVQSQLLT

Query:  FEKRQEQLQALKGVISIN----QPSANLAATGTN--QNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRG
         + + E+L +  G   +      PSA+LA  G +   N    + +NR  + NN   N+R RGRG      +P CQVCGK GH+AA C+ R          
Subjt:  FEKRQEQLQALKGVISIN----QPSANLAATGTN--QNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRG

Query:  SYTAGSNNQSKQGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRN-LMLKGLLHVPQISK
                                            GASNH+TS++N + LK +Y G +++TV NG +LPIH IG   + +   + L+LK +LHVP I+K
Subjt:  SYTAGSNNQSKQGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRN-LMLKGLLHVPQISK

Query:  NLISISRLTMDNAVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQI-PPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKS
        NL+SIS+LT DN V VEF    C VKDK +G+V+L+G LKDGLYQ   P    S S  + +S         VS + S        VT P ++ +L   K 
Subjt:  NLISISRLTMDNAVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQI-PPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKS

Query:  TWHSRFGHPSLRTLNQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKS
         WH R GHPS+R L+ +L    +K  IN +LSFC+ACQ  KSH LPF  +  RAT PLEL+HTD+WGP+PI S+  ++YYI F+DDFSR+TWI+PLK KS
Subjt:  TWHSRFGHPSLRTLNQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKS

Query:  DATTLFKQFKNQVENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKM
        +A   F QFK                      VEN F +++K V+ D GGE++    F S+ GI  Q  CPHTS QNGR ERKHRHIVE GL LLA A +
Subjt:  DATTLFKQFKNQVENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKM

Query:  PLNYWWEAFHTAVFLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDE
        P  YWW+AF TAV+LINR+PT VL+ K P+ VL+ +QPDY  L+VFG +C+PCLR YQ HKF FH+T+CV LGYS+ H+GY+CLS TGR+Y+SR V F+E
Subjt:  PLNYWWEAFHTAVFLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDE

Query:  EKFPFADKFMDQS---TTLSLLI
        ++FPF   F++ +   T +S+L+
Subjt:  EKFPFADKFMDQS---TTLSLLI

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.4e-4023.92Show/hide
Query:  MPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQEFYGVQSI-SQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKD
        MPN   D W  A++     +   ++    +  TS  TA+++   L   Y  +S+ SQ    KR+L       M +  +  +       L  AG+ ++  D
Subjt:  MPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQEFYGVQSI-SQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKD

Query:  LISYVIAGLDEEFTPIVVVIQ---NQEMTWNQVQSQLLTFEKRQEQLQALKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTF
         IS+++  L   +  I+  I+    + +T   V+++LL  E            I I           T++    A  HN +    NN   +R     + F
Subjt:  LISYVIAGLDEEFTPIVVVIQ---NQEMTWNQVQSQLLTFEKRQEQLQALKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTF

Query:  -GNS--KPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVG
         GNS  K  C  CG+ GH    CF+ +++  NN            +  G+      +     + +  + +DSGAS+HL +D +  T   +     +I V 
Subjt:  -GNS--KPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVG

Query:  NGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNAVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNV
           +    +    V       + L+ +L   + + NL+S+ RL  +  + +EF  S   +              K+GL  +   G  ++  +       +
Subjt:  NGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDNAVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNV

Query:  TFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTL-----NQLLASCKLKTKINENLSFCEACQYAKSHRLPFP--NSKSRATKPLE
         F A+      +N                  +   WH RFGH S   L       + +   L   +  +   CE C   K  RLPF     K+   +PL 
Subjt:  TFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTL-----NQLLASCKLKTKINENLSFCEACQYAKSHRLPFP--NSKSRATKPLE

Query:  LIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFA
        ++H+D+ GP    + +   Y++ F+D F+ +   + +K+KSD  ++F+ F  + E  F  K+  +  D G   + N                   +  F 
Subjt:  LIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFA

Query:  SEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFLINRMPTAVL--QGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPY
         + GI   L  PHT   NG  ER  R I E    +++ AK+  ++W EA  TA +LINR+P+  L    K PY + +N++P    LRVFG+  Y  ++  
Subjt:  SEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFLINRMPTAVL--QGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPY

Query:  QQHKFDFHTTRCVFLGYSNNHRGYRCLSPTG-RIYVSRHVCFDE
        +Q KFD  + + +F+GY  N  G++       +  V+R V  DE
Subjt:  QQHKFDFHTTRCVFLGYSNNHRGYRCLSPTG-RIYVSRHVCFDE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.3e-4925.29Show/hide
Query:  NNFLLWQNIALPILRSYKLEGHLTDEPKGLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQEFYGVQSISQKDYLKRMLQQTRK
        N F  WQ     +L    L   L  + K  +    + + W   D+     +   ++ +V + +   +TA+ +W  L+  Y  ++++ K YLK+ L     
Subjt:  NNFLLWQNIALPILRSYKLEGHLTDEPKGLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQEFYGVQSISQKDYLKRMLQQTRK

Query:  WSMKMTE------YLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQEMT--WNQVQSQLLTFEKRQEQLQALKGVISINQPSANLAA
        +++ M+E      +L++       L   G  ++ +D    ++  L   +  +   I + + T     V S LL  EK +++ +        NQ  A L  
Subjt:  WSMKMTE------YLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQEMT--WNQVQSQLLTFEKRQEQLQALKGVISINQPSANLAA

Query:  TGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQGVRNPTAMIATPENL-----
         G  ++ Q++S +N   +G   +  +R + R R        C  C + GH     F R   N    +G  T+G  N       N  AM+   +N+     
Subjt:  TGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSKQGVRNPTAMIATPENL-----

Query:  ----------HDNAWYMDSGASNHLT--SDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFR-NLMLKGLLHVPQISKNLISISRLTMDNAVV
                   ++ W +D+ AS+H T   DL    +  D+  +K   +GN     I  IG   I +     L+LK + HVP +  NLIS   L  D    
Subjt:  ----------HDNAWYMDSGASNHLT--SDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFR-NLMLKGLLHVPQISKNLISISRLTMDNAVV

Query:  VEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTLNQ
           +  + L K  +   V+ +G  +  LY+                    T         +  QD+               S   WH R GH S + L  
Subjt:  VEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTLNQ

Query:  LLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVENL
        L     +       +  C+ C + K HR+ F  S  R    L+L+++D+ GP  I S  G +Y+++F+DD SR  W++ LK K     +F++F   VE  
Subjt:  LLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVENL

Query:  FETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFLI
           K+K +R D GG      FE                   + S  GI  +   P T   NG  ER +R IVE   ++L  AK+P ++W EA  TA +LI
Subjt:  FETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFLI

Query:  NRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPT-GRIYVSRHVCFDEEK
        NR P+  L  ++P  V  N++  Y+ L+VFG   +  +   Q+ K D  +  C+F+GY +   GYR   P   ++  SR V F E +
Subjt:  NRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPT-GRIYVSRHVCFDEEK

Q03494 Transposon Ty2-DR2 Gag-Pol polyprotein6.0e-1522.82Show/hide
Query:  KQGVRNPTAMIATPENLHDNAWYMDSGASNHLTSD---LNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGL--LHVPQISKNLISIS
        +Q    PT  I + + L D+   +DSGAS  L      L++ T  S+      I       +PI++IG+  ++  F+N     +  LH P I+ +L+S+S
Subjt:  KQGVRNPTAMIATPENLHDNAWYMDSGASNHLTSD---LNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGL--LHVPQISKNLISIS

Query:  RLTMDNAVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFG
         L   N          C  ++       LE S  DG    P      H     +SKK +   + +S++   N +    V      L+        H   G
Subjt:  RLTMDNAVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFG

Query:  HPSLRTLNQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATK-----------PLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPL
        H + R++ + L    +      ++ +  A  Y     L   ++K R  K           P + +HTD++GP      +   Y+ISF D+ +RF W++PL
Subjt:  HPSLRTLNQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATK-----------PLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPL

Query:  KHKSDATTLFKQFKNQVENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEF--KPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLAL
          + + + L         N+F + +  ++         N F  ++ +++ D G E+  K L  F +  GI         S  +G  ER +R ++     L
Subjt:  KHKSDATTLFKQFKNQVENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEF--KPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLAL

Query:  LAHAKMPLNYWWEAFHTAVFLINRM
        L  + +P + W+ A   +  + N +
Subjt:  LAHAKMPLNYWWEAFHTAVFLINRM

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.4e-11935.13Show/hide
Query:  LNQATSIKLDRNNFLLWQNIALPILRSYKLEGHL---------TDEPKGLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQEFY
        +N +   KL   N+L+W      +   Y+L G L         T         NP+Y  W   D+L+   +  +++  V   V+   TA ++W  L++ Y
Subjt:  LNQATSIKLDRNNFLLWQNIALPILRSYKLEGHL---------TDEPKGLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQEFY

Query:  GVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIV--VVIQNQEMTWNQVQSQLLTFEKRQEQLQALK
           S      L+  L+Q  K +  + +Y+  + +  D L L G P+D  + +  V+  L EE+ P++  +  ++   T  ++  +LL  E +   + +  
Subjt:  GVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIV--VVIQNQEMTWNQVQSQLLTFEKRQEQLQALK

Query:  GVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTF----GNSKPI---CQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSK
         VI I   + +   T T  NN   +++NR  N NNN  +   +     F      SKP    CQ+CG  GH+A  C     ++F ++  S    S     
Subjt:  GVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTF----GNSKPI---CQVCGKAGHTAAVCFYRFEKNFNNNRGSYTAGSNNQSK

Query:  QGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDN
        Q    P A +A       N W +DSGA++H+TSD NNL+L   YTG   + V +G  +PI   GS+ + +K R L L  +L+VP I KNLIS+ RL   N
Subjt:  QGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMDN

Query:  AVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRT
         V VEF  +   VKD  +G  LL+G  KD LY+ P   +      Q VS                      L  SP S      + S+WH+R GHP+   
Subjt:  AVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRT

Query:  LNQLLASCKLKTKINENLSF--CEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKN
        LN ++++  L   +N +  F  C  C   KS+++PF  S   +T+PLE I++D+W  +PI S + Y+YY+ F+D F+R+TW++PLK KS     F  FK 
Subjt:  LNQLLASCKLKTKINENLSF--CEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKN

Query:  QVENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHT
                           N +EN F+T+I     D GGEF  L  + S+ GI    + PHT   NG  ERKHRHIVETGL LL+HA +P  YW  AF  
Subjt:  QVENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHT

Query:  AVFLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLS-PTGRIYVSRHVCFDEEKFPFAD
        AV+LINR+PT +LQ + P+  L+   P+Y+ LRVFG ACYP LRPY QHK D  + +CVFLGYS     Y CL   T R+Y+SRHV FDE  FPF++
Subjt:  AVFLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLS-PTGRIYVSRHVCFDEEKFPFAD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.9e-11433.91Show/hide
Query:  TNLL--NQATSIKLDRNNFLLWQNIALPILRSYKLEGHL---------TDEPKGLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIA
        TN+L  N +   KL   N+L+W      +   Y+L G L         T     +   NP+Y  W   D+L+   +  +++  V   V+   TA ++W  
Subjt:  TNLL--NQATSIKLDRNNFLLWQNIALPILRSYKLEGHL---------TDEPKGLQMPNPEYDVWIAADQLLVGWLYNSMTTEVASQVTSCETAQELWIA

Query:  LQEFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIV--VVIQNQEMTWNQVQSQLLTFEKRQEQ
        L++ Y   S      L+ + +                    D L L G P+D  + +  V+  L +++ P++  +  ++   +  ++  +L+  E +   
Subjt:  LQEFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIV--VVIQNQEMTWNQVQSQLLTFEKRQEQ

Query:  LQALKGV-ISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNS-RGRGRGRTFGNSKP-----ICQVCGKAGHTAAVC--FYRFEKNFNNNRGSYTA
        L + + V I+ N  +     T  NQNN+     NR++N NNN+ NS +    G    N +P      CQ+C   GH+A  C   ++F+   N  + +   
Subjt:  LQALKGV-ISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNS-RGRGRGRTFGNSKP-----ICQVCGKAGHTAAVC--FYRFEKNFNNNRGSYTA

Query:  GSNNQSKQGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISI
             S      P A +A     + N W +DSGA++H+TSD NNL+    YTG   + + +G  +PI   GS+ + +  R+L L  +L+VP I KNLIS+
Subjt:  GSNNQSKQGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISI

Query:  SRLTMDNAVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKST---WH
         RL   N V VEF  +   VKD  +G  LL+G  KD LY+ P   +      Q VS                      +  SP       CSK+T   WH
Subjt:  SRLTMDNAVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKST---WH

Query:  SRFGHPSLRTLNQLLASCKLKT-KINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDA
        SR GHPSL  LN ++++  L     +  L  C  C   KSH++PF NS   ++KPLE I++D+W  +PI S + Y+YY+ F+D F+R+TW++PLK KS  
Subjt:  SRFGHPSLRTLNQLLASCKLKT-KINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDA

Query:  TTLFKQFKNQVENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPL
           F  FK                    + VEN F+T+I  +  D GGEF  L  + S+ GI    + PHT   NG  ERKHRHIVE GL LL+HA +P 
Subjt:  TTLFKQFKNQVENLFETKIKIVRCDEGGNQVENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPL

Query:  NYWWEAFHTAVFLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLS-PTGRIYVSRHVCFDEE
         YW  AF  AV+LINR+PT +LQ + P+  L+ + P+Y  L+VFG ACYP LRPY +HK +  + +C F+GYS     Y CL  PTGR+Y SRHV FDE 
Subjt:  NYWWEAFHTAVFLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLS-PTGRIYVSRHVCFDEE

Query:  KFPFADKFMDQSTT
         FPF+      ST+
Subjt:  KFPFADKFMDQSTT

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.5e-0524.36Show/hide
Query:  IKLDRNNFLLWQNIALPILRSYKLEGHLTDEPKGLQMPNPEYDV-WIAADQLLVGWLYNSMTTE--VASQVTSCETAQELWIALQEFYGVQSISQKDYLK
        + ++ +N+  W+ + L    S+ + GH+     G  +P    DV W   D ++   LY ++T +    S VTS  T++++W+ ++  +     ++   L 
Subjt:  IKLDRNNFLLWQNIALPILRSYKLEGHLTDEPKGLQMPNPEYDV-WIAADQLLVGWLYNSMTTE--VASQVTSCETAQELWIALQEFYGVQSISQKDYLK

Query:  RMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQE--MTWNQVQSQLLTFEKRQEQLQALKGVISINQPSANL
          L+      M++ +Y   MK  +D+L     PV  ++L+ YV+ GL+ +F  I+ VI++++   +++   + L      QE+   LK  I  N    + 
Subjt:  RMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVIQNQE--MTWNQVQSQLLTFEKRQEQLQALKGVISINQPSANL

Query:  AATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRG
        +++ T     +A           NQ   RGRGRG
Subjt:  AATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRG

ATMG00300.1 Gag-Pol-related retrotransposon family protein6.6e-0936.99Show/hide
Query:  WHSRFGHPSLRTLNQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSS
        WHSR  H S R +  L+    L +    +L FCE C Y K+HR+ F   +     PL+ +H+DLWG   +  S
Subjt:  WHSRFGHPSLRTLNQLLASCKLKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSS

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.7e-0432.35Show/hide
Query:  HRHIVETGLALLAHAKMPLNYWWEAFHTAVFLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACY
        +R I+E   ++L    +P  +  +A +TAV +IN+ P+  +   VP  V +   P Y+ LR FG   Y
Subjt:  HRHIVETGLALLAHAKMPLNYWWEAFHTAVFLINRMPTAVLQGKVPYTVLYNEQPDYNSLRVFGSACY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTAACACCGAAAAATCCTCTTCCTCTCCTCCTAAGTCTCCCGTCACTCAGTCTACTCTTGCCCAAACACCAACTCTCAAAAAACCTGGTCCGGTTCATATACCCAA
GGTCCCAGGAGCAACGGAGGTGCGATATGGAACCACTCCAGTGGGTACCCCTGCATTTACCAACTTGTTGAACCAAGCTACCTCTATCAAACTTGACCGCAACAATTTCC
TACTTTGGCAGAATATTGCACTACCAATCCTCAGAAGCTATAAGCTAGAAGGGCATCTTACAGATGAACCCAAAGGGCTGCAAATGCCCAATCCTGAATATGATGTATGG
ATAGCTGCGGATCAACTTCTTGTAGGCTGGTTGTACAACTCCATGACAACCGAAGTGGCGTCTCAAGTCACAAGTTGTGAAACTGCTCAAGAGCTTTGGATAGCCCTACA
AGAATTCTATGGTGTTCAATCCATATCCCAGAAGGATTACCTCAAGAGGATGCTTCAGCAAACTAGAAAATGGAGTATGAAAATGACAGAGTATCTTAGTTTAATGAAAA
GTTACTCAGATAATTTGTATTTGGCAGGTTCTCCTGTGGATCCAAAGGATCTTATCTCCTATGTGATAGCTGGATTAGACGAAGAGTTCACACCCATTGTGGTAGTGATT
CAGAACCAAGAAATGACTTGGAATCAAGTTCAATCACAACTTCTGACCTTTGAGAAACGTCAGGAACAACTTCAAGCTCTAAAGGGAGTGATCTCTATCAATCAACCCTC
TGCAAATTTGGCTGCTACTGGAACAAACCAAAACAACCAGCAAGCTTCACAACACAACCGTTCTTTCAATGGAAACAACAACCAGAAAAATAGTCGTGGCAGGGGTAGGG
GACGTACCTTTGGGAACTCAAAGCCAATTTGTCAAGTGTGTGGAAAGGCTGGCCATACTGCTGCTGTGTGTTTCTATAGGTTTGAAAAGAACTTTAACAACAATCGAGGC
TCATATACTGCTGGTTCAAACAACCAATCCAAGCAAGGTGTTCGAAATCCTACTGCCATGATTGCTACTCCTGAAAATCTGCATGATAATGCTTGGTATATGGATAGTGG
GGCCAGCAATCACTTGACTTCGGATCTCAACAATCTCACTCTCAAATCTGACTACACAGGTATGAAACAAATAACTGTTGGTAATGGTTGTCAGTTGCCTATTCACTCTA
TTGGAAGTTCAGTTATTTACTCAAAGTTCAGAAATCTTATGTTGAAAGGTTTACTTCATGTTCCTCAGATTAGTAAGAATCTAATTAGCATATCTCGCTTGACTATGGAT
AATGCGGTTGTGGTTGAATTTCATGATTCCTTTTGTCTTGTTAAGGACAAGGTTTCGGGGAAGGTTCTTCTGGAAGGGAGTCTTAAAGATGGATTGTATCAGATACCGCC
TCTGGGAGCTGCATCTCATAGCCGTCTACAAGAAGTTTCGAAGAAAAATGTCACTTTTGTTGCGTTTGTCTCTAGGATGGCTAGTAGGAACCAGGATGATCCAATGTTAG
TAACATCTCCTGGGAGTCACCTCGTTTTGTATTGTTCAAAAAGTACATGGCATAGTCGGTTTGGCCATCCATCTCTTCGAACCTTAAATCAGTTGTTGGCTTCTTGTAAG
TTGAAAACTAAAATAAATGAGAATCTATCCTTTTGTGAGGCTTGTCAGTATGCAAAATCTCATAGACTTCCATTTCCTAACTCTAAGTCTCGAGCTACTAAACCTCTTGA
ACTCATTCACACAGATTTATGGGGTCCTGCTCCCATAACTTCATCTAATGGTTATCAATACTACATCAGTTTTCTGGATGATTTTTCTCGATTTACTTGGATATTTCCAT
TGAAACATAAAAGTGATGCTACCACTTTGTTCAAGCAGTTCAAGAATCAAGTTGAAAATTTATTTGAAACAAAGATTAAAATTGTGCGTTGTGATGAAGGAGGGAATCAG
GTTGAAAATTTATTTGAAACAAAGATTAAAATTGTGCGTTGTGATGAAGGAGGGGAGTTTAAACCACTGATTCACTTTGCCTCAGAAGTTGGTATTCATATTCAGTTAGC
TTGTCCTCATACTTCAGCCCAAAATGGACGTGTGGAACGAAAACATCGGCATATTGTTGAAACCGGCCTTGCTCTTTTGGCTCACGCTAAAATGCCTTTAAACTATTGGT
GGGAAGCTTTTCATACCGCTGTTTTTTTGATAAATCGTATGCCCACTGCTGTTCTTCAAGGAAAAGTCCCTTATACTGTCTTATATAATGAACAACCTGACTACAACAGC
CTACGGGTCTTTGGTTCAGCATGTTACCCTTGTCTCAGGCCATATCAGCAGCATAAATTTGATTTTCACACCACCCGATGTGTTTTTCTTGGCTATAGCAATAATCATCG
AGGATATAGGTGCCTCAGTCCAACTGGAAGGATATATGTGTCACGCCATGTCTGTTTTGATGAGGAAAAGTTTCCTTTTGCTGATAAATTTATGGACCAATCCACCACAC
TTTCACTGCTCATATTGATCTTCTTTCTTGGTTGCCTATTCAAATTCCCTCCAGTAGCCATTCTAACACTACAACTCCTTGTCCAGCACCTTCAGTCTTGCCATTACCTC
CATCAAATTCACAAATTTCCAGCCCTCCATCAAATTCACCAATTTCCAGCCCCCAATCACAAATTTCCAGCCCCCATTCTGATTCCCCCATTCATCACTCCTCTACTGGT
AGCCTCTCATCCTTTAGTCCCTCCTCCAGTACCTCATCCATGCCCTCCCCCACCGTTTTTTCTCCACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCTAACACCGAAAAATCCTCTTCCTCTCCTCCTAAGTCTCCCGTCACTCAGTCTACTCTTGCCCAAACACCAACTCTCAAAAAACCTGGTCCGGTTCATATACCCAA
GGTCCCAGGAGCAACGGAGGTGCGATATGGAACCACTCCAGTGGGTACCCCTGCATTTACCAACTTGTTGAACCAAGCTACCTCTATCAAACTTGACCGCAACAATTTCC
TACTTTGGCAGAATATTGCACTACCAATCCTCAGAAGCTATAAGCTAGAAGGGCATCTTACAGATGAACCCAAAGGGCTGCAAATGCCCAATCCTGAATATGATGTATGG
ATAGCTGCGGATCAACTTCTTGTAGGCTGGTTGTACAACTCCATGACAACCGAAGTGGCGTCTCAAGTCACAAGTTGTGAAACTGCTCAAGAGCTTTGGATAGCCCTACA
AGAATTCTATGGTGTTCAATCCATATCCCAGAAGGATTACCTCAAGAGGATGCTTCAGCAAACTAGAAAATGGAGTATGAAAATGACAGAGTATCTTAGTTTAATGAAAA
GTTACTCAGATAATTTGTATTTGGCAGGTTCTCCTGTGGATCCAAAGGATCTTATCTCCTATGTGATAGCTGGATTAGACGAAGAGTTCACACCCATTGTGGTAGTGATT
CAGAACCAAGAAATGACTTGGAATCAAGTTCAATCACAACTTCTGACCTTTGAGAAACGTCAGGAACAACTTCAAGCTCTAAAGGGAGTGATCTCTATCAATCAACCCTC
TGCAAATTTGGCTGCTACTGGAACAAACCAAAACAACCAGCAAGCTTCACAACACAACCGTTCTTTCAATGGAAACAACAACCAGAAAAATAGTCGTGGCAGGGGTAGGG
GACGTACCTTTGGGAACTCAAAGCCAATTTGTCAAGTGTGTGGAAAGGCTGGCCATACTGCTGCTGTGTGTTTCTATAGGTTTGAAAAGAACTTTAACAACAATCGAGGC
TCATATACTGCTGGTTCAAACAACCAATCCAAGCAAGGTGTTCGAAATCCTACTGCCATGATTGCTACTCCTGAAAATCTGCATGATAATGCTTGGTATATGGATAGTGG
GGCCAGCAATCACTTGACTTCGGATCTCAACAATCTCACTCTCAAATCTGACTACACAGGTATGAAACAAATAACTGTTGGTAATGGTTGTCAGTTGCCTATTCACTCTA
TTGGAAGTTCAGTTATTTACTCAAAGTTCAGAAATCTTATGTTGAAAGGTTTACTTCATGTTCCTCAGATTAGTAAGAATCTAATTAGCATATCTCGCTTGACTATGGAT
AATGCGGTTGTGGTTGAATTTCATGATTCCTTTTGTCTTGTTAAGGACAAGGTTTCGGGGAAGGTTCTTCTGGAAGGGAGTCTTAAAGATGGATTGTATCAGATACCGCC
TCTGGGAGCTGCATCTCATAGCCGTCTACAAGAAGTTTCGAAGAAAAATGTCACTTTTGTTGCGTTTGTCTCTAGGATGGCTAGTAGGAACCAGGATGATCCAATGTTAG
TAACATCTCCTGGGAGTCACCTCGTTTTGTATTGTTCAAAAAGTACATGGCATAGTCGGTTTGGCCATCCATCTCTTCGAACCTTAAATCAGTTGTTGGCTTCTTGTAAG
TTGAAAACTAAAATAAATGAGAATCTATCCTTTTGTGAGGCTTGTCAGTATGCAAAATCTCATAGACTTCCATTTCCTAACTCTAAGTCTCGAGCTACTAAACCTCTTGA
ACTCATTCACACAGATTTATGGGGTCCTGCTCCCATAACTTCATCTAATGGTTATCAATACTACATCAGTTTTCTGGATGATTTTTCTCGATTTACTTGGATATTTCCAT
TGAAACATAAAAGTGATGCTACCACTTTGTTCAAGCAGTTCAAGAATCAAGTTGAAAATTTATTTGAAACAAAGATTAAAATTGTGCGTTGTGATGAAGGAGGGAATCAG
GTTGAAAATTTATTTGAAACAAAGATTAAAATTGTGCGTTGTGATGAAGGAGGGGAGTTTAAACCACTGATTCACTTTGCCTCAGAAGTTGGTATTCATATTCAGTTAGC
TTGTCCTCATACTTCAGCCCAAAATGGACGTGTGGAACGAAAACATCGGCATATTGTTGAAACCGGCCTTGCTCTTTTGGCTCACGCTAAAATGCCTTTAAACTATTGGT
GGGAAGCTTTTCATACCGCTGTTTTTTTGATAAATCGTATGCCCACTGCTGTTCTTCAAGGAAAAGTCCCTTATACTGTCTTATATAATGAACAACCTGACTACAACAGC
CTACGGGTCTTTGGTTCAGCATGTTACCCTTGTCTCAGGCCATATCAGCAGCATAAATTTGATTTTCACACCACCCGATGTGTTTTTCTTGGCTATAGCAATAATCATCG
AGGATATAGGTGCCTCAGTCCAACTGGAAGGATATATGTGTCACGCCATGTCTGTTTTGATGAGGAAAAGTTTCCTTTTGCTGATAAATTTATGGACCAATCCACCACAC
TTTCACTGCTCATATTGATCTTCTTTCTTGGTTGCCTATTCAAATTCCCTCCAGTAGCCATTCTAACACTACAACTCCTTGTCCAGCACCTTCAGTCTTGCCATTACCTC
CATCAAATTCACAAATTTCCAGCCCTCCATCAAATTCACCAATTTCCAGCCCCCAATCACAAATTTCCAGCCCCCATTCTGATTCCCCCATTCATCACTCCTCTACTGGT
AGCCTCTCATCCTTTAGTCCCTCCTCCAGTACCTCATCCATGCCCTCCCCCACCGTTTTTTCTCCACTAG
Protein sequenceShow/hide protein sequence
MANTEKSSSSPPKSPVTQSTLAQTPTLKKPGPVHIPKVPGATEVRYGTTPVGTPAFTNLLNQATSIKLDRNNFLLWQNIALPILRSYKLEGHLTDEPKGLQMPNPEYDVW
IAADQLLVGWLYNSMTTEVASQVTSCETAQELWIALQEFYGVQSISQKDYLKRMLQQTRKWSMKMTEYLSLMKSYSDNLYLAGSPVDPKDLISYVIAGLDEEFTPIVVVI
QNQEMTWNQVQSQLLTFEKRQEQLQALKGVISINQPSANLAATGTNQNNQQASQHNRSFNGNNNQKNSRGRGRGRTFGNSKPICQVCGKAGHTAAVCFYRFEKNFNNNRG
SYTAGSNNQSKQGVRNPTAMIATPENLHDNAWYMDSGASNHLTSDLNNLTLKSDYTGMKQITVGNGCQLPIHSIGSSVIYSKFRNLMLKGLLHVPQISKNLISISRLTMD
NAVVVEFHDSFCLVKDKVSGKVLLEGSLKDGLYQIPPLGAASHSRLQEVSKKNVTFVAFVSRMASRNQDDPMLVTSPGSHLVLYCSKSTWHSRFGHPSLRTLNQLLASCK
LKTKINENLSFCEACQYAKSHRLPFPNSKSRATKPLELIHTDLWGPAPITSSNGYQYYISFLDDFSRFTWIFPLKHKSDATTLFKQFKNQVENLFETKIKIVRCDEGGNQ
VENLFETKIKIVRCDEGGEFKPLIHFASEVGIHIQLACPHTSAQNGRVERKHRHIVETGLALLAHAKMPLNYWWEAFHTAVFLINRMPTAVLQGKVPYTVLYNEQPDYNS
LRVFGSACYPCLRPYQQHKFDFHTTRCVFLGYSNNHRGYRCLSPTGRIYVSRHVCFDEEKFPFADKFMDQSTTLSLLILIFFLGCLFKFPPVAILTLQLLVQHLQSCHYL
HQIHKFPALHQIHQFPAPNHKFPAPILIPPFITPLLVASHPLVPPPVPHPCPPPPFFLH