; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014412 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014412
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr12:522368..526671
RNA-Seq ExpressionLag0014412
SyntenyLag0014412
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
GAU19483.1 hypothetical protein TSUD_77270 [Trifolium subterraneum]2.7e-11940.44Show/hide
Query:  LLNPITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAA
        +L  I   KLD G+++GT+ CP + +            T +D +              N A+  W A DQ LLGW+ NSMT E+A Q++  E S+ LW  
Subjt:  LLNPITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAA

Query:  IQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQ
         Q L G  +R++  YL+  F   RKG MKM DYL  MK+  D L  AG+PV+T  L+ Q L GLD EYNPVV  +  +  ++W ++QA+LL FE R+E  
Subjt:  IQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQ

Query:  TSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQAQNRG
         +L T+LT++   + N+ +  D  G+     S NN + +  RG++  GGRGRG+         + K  CQVCG   H  + C+ RF+K +S  N   + G
Subjt:  TSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQAQNRG

Query:  DPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCV
          +Q                 S N+F+AS  +V D  WY DSGAS+HVT       +  E+ G   + VG+G  L I + G+S L    K+L+L +IL V
Subjt:  DPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCV

Query:  PSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRL
        P+I KNL+SVSKLA DN + VEF ++CCFVKDK TG V+LKG L DGLY+  G +                             N    V  K  WHRRL
Subjt:  PSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRL

Query:  GHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAF
        GHP+ KVL+ ++ SC +    ++ F FCEACQ+GK H LPF +S S      ELVHTD+WGPAP+ ++ GF++YV F+DD+SRF+WIYPLK+K+  + AF
Subjt:  GHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAF

Query:  THFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS
          F  L +NQFN  IK +Q D GGEY  + +L    G+Q R+S
Subjt:  THFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS

KYP50444.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]5.1e-11843.46Show/hide
Query:  MTPEVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRF
        MT EVA Q++  E SQ +W   Q L G  +R+   +L+  F + RKG +KM +YL  MK   D+L  AGS V+T  LV+Q L GLD EYNP+V  +  + 
Subjt:  MTPEVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRF

Query:  GITWSEMQAELLVFEKRLELQTSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQ--RGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGH
         +TW EMQA+LL +E RLE Q + +++LT++  ++++ +           N+ G +     GRG Q  RG   GRGRGR     +  ++ VCQVC K GH
Subjt:  GITWSEMQAELLVFEKRLELQTSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQ--RGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGH

Query:  TTLMCYQRFNKEFSGPNQAQNRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNI
            CY RFNK + G N  + + +  +               N + N++VASP TV D  WY DSGAS+HVT D N +    E +G + +TVG+G++L I
Subjt:  TTLMCYQRFNKEFSGPNQAQNRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNI

Query:  KSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSI
         + G+S L    K+L+L++IL VP I KNL+S+SKL  DN ++VEFHD  CFVKDK TG +LL+G + DGLY+  G          S     +  V  SI
Subjt:  KSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSI

Query:  ESACVLSNTVNLVISKNVWHRRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLF
                       K  WHR+LGHP+ KVL  +++ CN+     E F+FCEACQFGK+H LPF NSVS      +LVH+D+WGPAP+ S  GF++YVLF
Subjt:  ESACVLSNTVNLVISKNVWHRRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLF

Query:  LDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS
        LDD+SRF+WIYPLK+K+    AF  F  LV+NQFN  IKTLQ D GGE+  + ++    G+Q+R S
Subjt:  LDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS

PNX76291.1 gag/pol polyprotein - maize retrotransposon Hopscotch, partial [Trifolium pratense]1.2e-11940.09Show/hide
Query:  NQLLNPITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLW
        + +L  I   +LD G+++G K CP + + A+ D S                      ++ NP +E W A DQ LLGWL NSMT  +A Q++  E S  LW
Subjt:  NQLLNPITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLW

Query:  AAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLE
           Q L G  +R++  YL+  F   RKG MKM DYL  MK+  D L  AG+P++T  L+ Q L GLD EYNPVV  +  +  ++W ++QA+LL FE R+E
Subjt:  AAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLE

Query:  LQTSLKTSLTISQGTSVNMVSNKDSSGQR-NQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQAQ
           SL T+LT++   + N+    D  G R N N +     NN+     RG   GRGRGR +       K  CQVCG   H  + C+ RF+K +S  N + 
Subjt:  LQTSLKTSLTISQGTSVNMVSNKDSSGQR-NQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQAQ

Query:  NRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENI
        N                       S N+F+AS  ++ D  WY DSGAS+HVT   +   N  E+ G   + VG+G  L I + G+S L    K+L+L +I
Subjt:  NRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENI

Query:  LCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWH
        L VP I KNL+SVSKLA DN + VEF ++CCFVKDK TG  +L+G L DGLY+         +  +SAY  +                       K  WH
Subjt:  LCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWH

Query:  RRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAM
        R+LGHP+ KVL+ +++SCN+    +++F FCEACQ+GK H LPF  S S      ELVHTD+WGPAP+ S+ GF++YV F+DD++RF+WIYPLK+K+   
Subjt:  RRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAM

Query:  IAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS
         AF  F  +V+NQF+  IKT+Q D GGEY  + +     G+Q R+S
Subjt:  IAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS

PNX94503.1 putative retrotransposon Ty1-copia subclass protein, partial [Trifolium pratense]1.3e-11339.4Show/hide
Query:  GHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQELFGVQSRAE
        G+++GTK CP                   DQ  TS  +T    E+INP Y+ W A DQ LLGWL NSMT ++A QV+  E S+ LW   Q L G  +R+ 
Subjt:  GHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQELFGVQSRAE

Query:  EDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQTSLKTSLTISQG
          YL+  F    K  MKM  YL  MK+  D L  AGSP+++  L+ Q L GLD EYNPVV  +  +  I+W + QA+LL FE RL+            Q 
Subjt:  EDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQTSLKTSLTISQG

Query:  TSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQ-AQNRGDPRQNGQLSTT
         + N ++   S+   ++N+SG N+  + G G++    RG   GRG    S   +P+CQ+CGK GHT   CY RF+K ++  N  A+  G           
Subjt:  TSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQ-AQNRGDPRQNGQLSTT

Query:  VQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVS
                  S ++FVASP    D  WY DSGAS+HVT     + +  E  G   + VG+G  L I + G++ L D    ++L N+L VP I KNL+SVS
Subjt:  VQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVS

Query:  KLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLGHPSLKVLESL
        KL  DN   VEF ++ C+VKDK TG  LLKG L DGLY+    +    +    AY                       +  K +WHR+LGHP+ KVLE +
Subjt:  KLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLGHPSLKVLESL

Query:  VRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQF
        ++  N+    +++F FCEACQFGK H LPF  S S      +L+HTD+WGPAP+ S   F++YV FLDD+SRF+WI+PLK+K+  + AF  F  LV+NQF
Subjt:  VRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQF

Query:  NTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS
        N  IK ++ D GGEY  + +     G+Q ++S
Subjt:  NTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS

PNY01489.1 copia-like polyprotein, partial [Trifolium pratense]8.4e-11339.72Show/hide
Query:  ITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQEL
        I   K D G+++GTK CP + +  S D+S                      +++NP ++ W+A DQ LLGWL NSM  ++A Q++  E S+ LW   Q L
Subjt:  ITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQEL

Query:  FGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQTSLK
         G  +++   YL+  F   RKG MKM +YL  MK+ +D L  +GSP++   L+ Q L GLD EYNPVV  +  +  ++W ++QA+LL FE RL+ Q +  
Subjt:  FGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQTSLK

Query:  TSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPN---QAQNRGD
        + LT++   S N  +  +  G +  ++ GN R++NF RG +  GGRG+GR         N K  CQVC   GHT + C  RF++ ++G N   +A  +G 
Subjt:  TSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPN---QAQNRGD

Query:  PRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCVP
                            S ++FVASP    D  WY DSGAS+HVT   +      E+ G   + VG+G  L I + G++ L     TL+L ++L VP
Subjt:  PRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCVP

Query:  SIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLG
         I KNL+SVSKL  DN +FVEF  +CC VKDK TG  LLKG L DGLY+         DVS             S +  CV  +       K  WHR+LG
Subjt:  SIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLG

Query:  HPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFT
        HP+ KVLE +++ CN+    +++F FCEACQFGK H LPF +S S V     L+H+D+WGPAP+ S  GF++YV F+DD+SRF+WI+PLK+K+  + AF 
Subjt:  HPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFT

Query:  HFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS
         F  L +NQFN  IK +Q D GGEY  + ++    G+Q R+S
Subjt:  HFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS

TrEMBL top hitse value%identityAlignment
A0A151S6M8 Retrovirus-related Pol polyprotein from transposon TNT 1-942.5e-11843.46Show/hide
Query:  MTPEVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRF
        MT EVA Q++  E SQ +W   Q L G  +R+   +L+  F + RKG +KM +YL  MK   D+L  AGS V+T  LV+Q L GLD EYNP+V  +  + 
Subjt:  MTPEVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRF

Query:  GITWSEMQAELLVFEKRLELQTSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQ--RGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGH
         +TW EMQA+LL +E RLE Q + +++LT++  ++++ +           N+ G +     GRG Q  RG   GRGRGR     +  ++ VCQVC K GH
Subjt:  GITWSEMQAELLVFEKRLELQTSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQ--RGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGH

Query:  TTLMCYQRFNKEFSGPNQAQNRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNI
            CY RFNK + G N  + + +  +               N + N++VASP TV D  WY DSGAS+HVT D N +    E +G + +TVG+G++L I
Subjt:  TTLMCYQRFNKEFSGPNQAQNRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNI

Query:  KSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSI
         + G+S L    K+L+L++IL VP I KNL+S+SKL  DN ++VEFHD  CFVKDK TG +LL+G + DGLY+  G          S     +  V  SI
Subjt:  KSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSI

Query:  ESACVLSNTVNLVISKNVWHRRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLF
                       K  WHR+LGHP+ KVL  +++ CN+     E F+FCEACQFGK+H LPF NSVS      +LVH+D+WGPAP+ S  GF++YVLF
Subjt:  ESACVLSNTVNLVISKNVWHRRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLF

Query:  LDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS
        LDD+SRF+WIYPLK+K+    AF  F  LV+NQFN  IKTLQ D GGE+  + ++    G+Q+R S
Subjt:  LDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS

A0A2K3LCM1 Gag/pol polyprotein-maize retrotransposon Hopscotch (Fragment)5.8e-12040.09Show/hide
Query:  NQLLNPITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLW
        + +L  I   +LD G+++G K CP + + A+ D S                      ++ NP +E W A DQ LLGWL NSMT  +A Q++  E S  LW
Subjt:  NQLLNPITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLW

Query:  AAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLE
           Q L G  +R++  YL+  F   RKG MKM DYL  MK+  D L  AG+P++T  L+ Q L GLD EYNPVV  +  +  ++W ++QA+LL FE R+E
Subjt:  AAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLE

Query:  LQTSLKTSLTISQGTSVNMVSNKDSSGQR-NQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQAQ
           SL T+LT++   + N+    D  G R N N +     NN+     RG   GRGRGR +       K  CQVCG   H  + C+ RF+K +S  N + 
Subjt:  LQTSLKTSLTISQGTSVNMVSNKDSSGQR-NQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQAQ

Query:  NRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENI
        N                       S N+F+AS  ++ D  WY DSGAS+HVT   +   N  E+ G   + VG+G  L I + G+S L    K+L+L +I
Subjt:  NRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENI

Query:  LCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWH
        L VP I KNL+SVSKLA DN + VEF ++CCFVKDK TG  +L+G L DGLY+         +  +SAY  +                       K  WH
Subjt:  LCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWH

Query:  RRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAM
        R+LGHP+ KVL+ +++SCN+    +++F FCEACQ+GK H LPF  S S      ELVHTD+WGPAP+ S+ GF++YV F+DD++RF+WIYPLK+K+   
Subjt:  RRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAM

Query:  IAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS
         AF  F  +V+NQF+  IKT+Q D GGEY  + +     G+Q R+S
Subjt:  IAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS

A0A2K3MUJ9 Putative retrotransposon Ty1-copia subclass protein (Fragment)6.3e-11439.4Show/hide
Query:  GHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQELFGVQSRAE
        G+++GTK CP                   DQ  TS  +T    E+INP Y+ W A DQ LLGWL NSMT ++A QV+  E S+ LW   Q L G  +R+ 
Subjt:  GHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQELFGVQSRAE

Query:  EDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQTSLKTSLTISQG
          YL+  F    K  MKM  YL  MK+  D L  AGSP+++  L+ Q L GLD EYNPVV  +  +  I+W + QA+LL FE RL+            Q 
Subjt:  EDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQTSLKTSLTISQG

Query:  TSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQ-AQNRGDPRQNGQLSTT
         + N ++   S+   ++N+SG N+  + G G++    RG   GRG    S   +P+CQ+CGK GHT   CY RF+K ++  N  A+  G           
Subjt:  TSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQ-AQNRGDPRQNGQLSTT

Query:  VQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVS
                  S ++FVASP    D  WY DSGAS+HVT     + +  E  G   + VG+G  L I + G++ L D    ++L N+L VP I KNL+SVS
Subjt:  VQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVS

Query:  KLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLGHPSLKVLESL
        KL  DN   VEF ++ C+VKDK TG  LLKG L DGLY+    +    +    AY                       +  K +WHR+LGHP+ KVLE +
Subjt:  KLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLGHPSLKVLESL

Query:  VRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQF
        ++  N+    +++F FCEACQFGK H LPF  S S      +L+HTD+WGPAP+ S   F++YV FLDD+SRF+WI+PLK+K+  + AF  F  LV+NQF
Subjt:  VRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQF

Query:  NTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS
        N  IK ++ D GGEY  + +     G+Q ++S
Subjt:  NTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS

A0A2Z6MBG6 Integrase catalytic domain-containing protein1.3e-11940.44Show/hide
Query:  LLNPITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAA
        +L  I   KLD G+++GT+ CP + +            T +D +              N A+  W A DQ LLGW+ NSMT E+A Q++  E S+ LW  
Subjt:  LLNPITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAA

Query:  IQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQ
         Q L G  +R++  YL+  F   RKG MKM DYL  MK+  D L  AG+PV+T  L+ Q L GLD EYNPVV  +  +  ++W ++QA+LL FE R+E  
Subjt:  IQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQ

Query:  TSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQAQNRG
         +L T+LT++   + N+ +  D  G+     S NN + +  RG++  GGRGRG+         + K  CQVCG   H  + C+ RF+K +S  N   + G
Subjt:  TSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQAQNRG

Query:  DPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCV
          +Q                 S N+F+AS  +V D  WY DSGAS+HVT       +  E+ G   + VG+G  L I + G+S L    K+L+L +IL V
Subjt:  DPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCV

Query:  PSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRL
        P+I KNL+SVSKLA DN + VEF ++CCFVKDK TG V+LKG L DGLY+  G +                             N    V  K  WHRRL
Subjt:  PSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRL

Query:  GHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAF
        GHP+ KVL+ ++ SC +    ++ F FCEACQ+GK H LPF +S S      ELVHTD+WGPAP+ ++ GF++YV F+DD+SRF+WIYPLK+K+  + AF
Subjt:  GHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAF

Query:  THFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS
          F  L +NQFN  IK +Q D GGEY  + +L    G+Q R+S
Subjt:  THFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQVRLS

A0A803PEH4 Uncharacterized protein7.9e-11739.35Show/hide
Query:  ATMTSVNSSGISALSSG----------NSFSSPPLNQLLNPITSVKLDR--------------------GHLIGTKPCPPKILQASVDRSGPSSSTGTDQ
        +T +S  +S ++  SS           N+F+ P LNQ      S+KLDR                    G+L GT  CPP+ +                 
Subjt:  ATMTSVNSSGISALSSG----------NSFSSPPLNQLLNPITSVKLDR--------------------GHLIGTKPCPPKILQASVDRSGPSSSTGTDQ

Query:  AATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDN
             G T +     NP YE W+  DQLL+GWLY+SMT  +A +VMG  ++ +L   ++ L+G  S+++ D  R + Q  RKGS  M++YLR  K+ ++ 
Subjt:  AATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDN

Query:  LGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLE-LQT-SLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFG
        L  AG P     LV+ VL GLD EY  +V  I+ R   TW E+Q  LL F+ ++E LQ  +L ++   S     NM +  +++G+    QS N   N+ G
Subjt:  LGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLE-LQT-SLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFG

Query:  RGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSG--PNQAQNRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWY
              G   R RGRG G GS  ++P CQV GK GHT  +CY RF++ + G  PN   N              Q  A   N + ++FVA+PE +   +W+
Subjt:  RGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSG--PNQAQNRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWY

Query:  ADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLT-DGVKTLSLENILCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMV
        ADSGAS+H+T+D   +    +Y G   V VG+GS L I  +GN  L  +    L L+++L VP IAKNLVSVSKLA DN V +EF+ + C VKDK T  V
Subjt:  ADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLT-DGVKTLSLENILCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMV

Query:  LLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQ--SKVNNSIESACVLSNTVNLVISK-NVWHRRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGK
        LL G L D LY+ D         S+  YQ+    S    S++S    S T +L+IS+ +V HRRLGHPS+KVL  ++ S N+    N     C+ACQ+GK
Subjt:  LLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQ--SKVNNSIESACVLSNTVNLVISK-NVWHRRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGK

Query:  SHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCET
        +HALPF +S +R  S  +L+HTDLWGPAP+ SN    +Y+ F+DDYSR++W+YPLK K+ A+ AF  F ALV+NQF   IK+L+SD+GGEY     L +T
Subjt:  SHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCET

Query:  MGVQ
         G++
Subjt:  MGVQ

SwissProt top hitse value%identityAlignment
P0C2I3 Transposon Ty1-DR6 Gag-Pol polyprotein4.6e-1329.45Show/hide
Query:  HRRLGHPSLKVLESLVRSCNLPTKTNE---------EFKFCEACQFGKS--------HALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLD
        HR L H + + +   +++ N  T  NE         +++ C  C  GKS          L + NS       F+ +HTD++GP     N    +++ F D
Subjt:  HRRLGHPSLKVLESLVRSCNLPTKTNE---------EFKFCEACQFGKS--------HALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLD

Query:  DYSRFSWIYPL--KRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTH--IHRLCETMGV
        + ++F W+YPL  +R++  +  FT   A +KNQF  S+  +Q D G EYT+  +H+  E  G+
Subjt:  DYSRFSWIYPL--KRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTH--IHRLCETMGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-946.4e-3123.6Show/hide
Query:  EAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYL-RQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVL
        E W  +D+     +   ++ +V   ++  + ++ +W  ++ L+  ++   + YL +Q++            +L V       L   G  +        +L
Subjt:  EAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYL-RQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVL

Query:  LGLDEEY-NPVVAMIQGRFGITWSEMQAELLVFEKRLELQTSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGY
          L   Y N    ++ G+  I   ++ + LL+ EK  +   +   +L I++G                + +S     NN+GR   RG  + R + R    
Subjt:  LGLDEEY-NPVVAMIQGRFGITWSEMQAELLVFEKRLELQTSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGY

Query:  GSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQAQNRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVV-----DPSWYADSGASSHVTADYNAI
                C  C + GH    C          PN  + +G+   +GQ +     +    N ++  F+   E  +     +  W  D+ AS H T   +  
Subjt:  GSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQAQNRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVV-----DPSWYADSGASSHVTADYNAI

Query:  ANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVK-TLSLENILCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDT--GMVLLKGTLSDGLYRFD
           V  +    V +G+ S   I  +G+ C+   V  TL L+++  VP +  NL+S   L +D      +  +    K + T   +V+ KG     LYR +
Subjt:  ANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVK-TLSLENILCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDT--GMVLLKGTLSDGLYRFD

Query:  GVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKF
                      +  Q ++N + +            IS ++WH+R+GH S K L+ L +   +        K C+ C FGK H + F  S  R  +  
Subjt:  GVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKF

Query:  ELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYT--HIHRLCETMGVQ
        +LV++D+ GP  +ES  G +++V F+DD SR  W+Y LK K+     F  F ALV+ +    +K L+SDNGGEYT       C + G++
Subjt:  ELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYT--HIHRLCETMGVQ

Q12141 Transposon Ty1-GR1 Gag-Pol polyprotein2.3e-1228.83Show/hide
Query:  HRRLGHPSLKVLESLVRSCNLPTKTNE---------EFKFCEACQFGKS--------HALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLD
        HR L H + + +   +++ N  T  NE         +++ C  C  GKS          L + NS       F+ +HTD++GP          +++ F D
Subjt:  HRRLGHPSLKVLESLVRSCNLPTKTNE---------EFKFCEACQFGKS--------HALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLD

Query:  DYSRFSWIYPL--KRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTH--IHRLCETMGV
        + ++F W+YPL  +R++  +  FT   A +KNQF  S+  +Q D G EYT+  +H+  E  G+
Subjt:  DYSRFSWIYPL--KRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTH--IHRLCETMGV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.2e-5929.36Show/hide
Query:  GTDQAATSGGSTTI--------VVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMN
        G + A    GSTT+            +NP Y  W   D+L+   +  +++  V   V     +  +W  +++++   S      LR   +Q  KG+  ++
Subjt:  GTDQAATSGGSTTI--------VVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMN

Query:  DYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGR-FGITWSEMQAELLVFEKRLELQTSLKTSLTISQGTSVNMVSNKDSSGQRNQN
        DY++ + +  D L   G P+     V +VL  L EEY PV+  I  +    T +E+   LL  E ++ L  S  T + I    + N VS+++++   N N
Subjt:  DYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGR-FGITWSEMQAELLVFEKRLELQTSLKTSLTISQGTSVNMVSNKDSSGQRNQN

Query:  QSG-NNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPV---CQVCGKIGHTTLMCYQRFNKEFSGPNQAQNRGDPRQNGQLSTTVQPSAYVANQSLNSF
            NNR +N  R         +     +   +  +KP    CQ+CG  GH+   C Q   + F     +Q    P       T  QP A +A       
Subjt:  QSG-NNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPV---CQVCGKIGHTTLMCYQRFNKEFSGPNQAQNRGDPRQNGQLSTTVQPSAYVANQSLNSF

Query:  VASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVSKLAQDNRVFVEFHDH
        + SP +    +W  DSGA+ H+T+D+N ++    Y G   V V DGS++ I   G++ L+   + L+L NIL VP+I KNL+SV +L   N V VEF   
Subjt:  VASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVSKLAQDNRVFVEFHDH

Query:  CCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLGHPSLKVLESLVRSCNLPT-KTNEEF
           VKD +TG+ LL+G   D LY +       V    S +    SK  +S                   WH RLGHP+  +L S++ + +L     + +F
Subjt:  CCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLGHPSLKVLESLVRSCNLPT-KTNEEF

Query:  KFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGE
          C  C   KS+ +PF+ S        E +++D+W  +P+ S+  +R+YV+F+D ++R++W+YPLK+K+     F  F  L++N+F T I T  SDNGGE
Subjt:  KFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGE

Query:  YTHIHRLCETMGV
        +  +       G+
Subjt:  YTHIHRLCETMGV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.0e-5727.58Show/hide
Query:  IRATMTSVNSSGISALSSGNSFS-SPPLNQLLNPITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQ
        +   + +VN S ++ L+S N    S  ++ L +         G L G+ P PP             ++ GTD            V  +NP Y  W   D+
Subjt:  IRATMTSVNSSGISALSSGNSFS-SPPLNQLLNPITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQ

Query:  LLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNP
        L+   +  +++  V   V     +  +W  +++++   S      LR +                   +  D L   G P+     V +VL  L ++Y P
Subjt:  LLLGWLYNSMTPEVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNP

Query:  VVAMIQGRFGITWSEMQAELLVFEKRLELQTSLKTSLTISQGTSV--NMVSNKDSSGQRNQNQSGNNR----QNNFGRGFQRGGGRGRGRGRGYGYGSFN
        V+  I  +      +    L    +RL  + S   +L  ++   +  N+V++++++  RNQN  G+NR     NN    +Q      R   R        
Subjt:  VVAMIQGRFGITWSEMQAELLVFEKRLELQTSLKTSLTISQGTSV--NMVSNKDSSGQRNQNQSGNNR----QNNFGRGFQRGGGRGRGRGRGYGYGSFN

Query:  NKPV---CQVCGKIGHTTLMCYQRFNKEFSGPNQAQNRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEY
         KP    CQ+C   GH+   C Q         +Q Q+  + +Q+    T  QP A +A       V SP      +W  DSGA+ H+T+D+N ++    Y
Subjt:  NKPV---CQVCGKIGHTTLMCYQRFNKEFSGPNQAQNRGDPRQNGQLSTTVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEY

Query:  EGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDV
         G   V + DGS++ I   G++ L    ++L L  +L VP+I KNL+SV +L   NRV VEF      VKD +TG+ LL+G   D LY +      +V +
Subjt:  EGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVSKLAQDNRVFVEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDV

Query:  SNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLGHPSLKVLESLVRSCNLPT-KTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLW
          S   K                       + + WH RLGHPSL +L S++ + +LP    + +   C  C   KSH +PF+NS        E +++D+W
Subjt:  SNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLGHPSLKVLESLVRSCNLPT-KTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLW

Query:  GPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGV
          +P+ S   +R+YV+F+D ++R++W+YPLK+K+     F  F +LV+N+F T I TL SDNGGE+  +       G+
Subjt:  GPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGV

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.1e-0427.1Show/hide
Query:  WVAVDQLLLGWLYNSMTP-EVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLG
        W   D ++   LY ++TP +     +    S+D+W  I+  F     A    L    +    G M++ DY R MK   D+L     PV  R+LV  VL G
Subjt:  WVAVDQLLLGWLYNSMTP-EVAVQVMGFENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLG

Query:  LDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQTSLKTSLT-ISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGY--
        L+ +++ ++ +I+ R     S   A  ++ E+   L+ ++K + T +   +S  +++  ++    N  +SG N+    GRG  RG    RGRG  + Y  
Subjt:  LDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQTSLKTSLT-ISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGY--

Query:  ----GSFNNKPVCQ
             S+N  P  Q
Subjt:  ----GSFNNKPVCQ

AT5G48050.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)9.5e-0626.11Show/hide
Query:  WVAVDQLLLGWLYNSMTPEVAVQVMGFE-NSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLG
        W   D L+  W+Y ++T  +   ++     ++DLW +++ LF     A         +      + +++Y + +KS +D L    SP++ R LV  +L G
Subjt:  WVAVDQLLLGWLYNSMTPEVAVQVMGFE-NSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLG

Query:  LDEEYNPVVAMIQGRFGI-TWSEMQAELLVFEKRLELQTSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGS
        L E+Y+ ++ +I+ +    +++E ++ LL+ E RL  ++  K+SL+ +   S++ V       Q    Q  +N  +N GRG  +   RG G   G  Y +
Subjt:  LDEEYNPVVAMIQGRFGI-TWSEMQAELLVFEKRLELQTSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGS

Query:  FNN
         NN
Subjt:  FNN

ATMG00300.1 Gag-Pol-related retrotransposon family protein7.0e-0936.62Show/hide
Query:  VWHRRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPV
        +WH RL H S + +E LV+   L +      KFCE C +GK+H + F+       +  + VH+DLWG   V
Subjt:  VWHRRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEACQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAAAATACACATTAGTGGCATGGGGATGCAGGGTCTAAGCCTTTTCTACTCGGACAGAAAAAAAAAACTTCAGAACTTGAATTGTCTGTTACAAATTGTTCAACA
TAATGAAGCTATTCCAGAGGGACACATGGTCTCAACTACTGGAGTTTGCCAGCTTTACAAAGTCTTTAGAGATAAAACCATGAACGAGCAGGATTCAAGTACCAGAAACA
AGGGCCAGGAAGATGAGCAGCAGGACTCCATTGATGAGTATGAAATTGAGTATTATCTCTGCCAATCATCATTGAGTCTTGGTATCAGAGCTACTATGACTAGCGTCAAT
TCGTCCGGTATTTCAGCCTTGTCCTCCGGCAACAGTTTCAGCAGTCCACCATTAAATCAATTGTTGAATCCGATTACCTCAGTGAAGCTCGATAGAGGGCATTTGATTGG
GACTAAACCTTGCCCTCCTAAAATTCTTCAAGCCTCTGTTGATAGATCTGGTCCTTCGTCCTCGACCGGAACAGATCAGGCAGCAACCAGTGGAGGTTCAACTACAATTG
TTGTTGAAGAAATTAACCCAGCGTATGAAGCTTGGGTAGCAGTCGACCAACTATTGTTAGGCTGGTTATATAATTCGATGACCCCAGAGGTAGCAGTTCAGGTAATGGGC
TTTGAGAATTCTCAAGATCTATGGGCAGCTATACAGGAACTCTTCGGAGTTCAGTCTCGAGCCGAGGAAGATTACCTTCGGCAAGTTTTTCAGCAGTGCAGGAAAGGAAG
TATGAAGATGAATGACTACTTACGAGTTATGAAGAGCCACACTGACAACCTAGGTCAAGCTGGAAGCCCTGTCGCCACCCGCTCTCTAGTTTCTCAAGTTCTTTTGGGGT
TGGATGAGGAGTACAACCCAGTGGTAGCCATGATTCAAGGGAGATTTGGAATCACCTGGTCTGAGATGCAGGCTGAGCTATTGGTATTTGAAAAACGATTGGAGCTTCAA
ACTAGTCTGAAAACATCCTTGACAATCAGTCAAGGAACCTCTGTCAACATGGTTAGCAATAAGGACTCATCAGGACAGAGAAATCAGAATCAATCTGGGAATAACAGGCA
GAACAATTTTGGAAGAGGCTTTCAGAGAGGGGGTGGCAGAGGACGAGGAAGAGGCAGAGGCTATGGCTATGGTTCGTTTAACAACAAACCTGTGTGTCAGGTATGTGGTA
AGATAGGGCATACAACATTAATGTGTTATCAGCGATTCAATAAAGAGTTTTCAGGACCAAATCAAGCTCAAAACAGGGGAGATCCTAGACAAAATGGGCAACTGTCAACC
ACTGTGCAACCATCAGCTTATGTTGCAAATCAGAGTTTGAATTCCTTTGTTGCATCTCCAGAAACTGTAGTTGACCCAAGCTGGTATGCAGACAGTGGTGCATCCAGCCA
TGTGACCGCTGATTACAATGCCATTGCGAATCCTGTGGAGTATGAAGGTAATGCGTGTGTTACGGTGGGAGATGGTAGTAGCCTGAATATTAAATCTGTTGGAAATTCTT
GCTTGACCGATGGTGTTAAAACTCTAAGTCTTGAGAATATTTTGTGTGTTCCAAGCATAGCTAAAAATCTTGTGAGTGTTTCTAAGCTTGCTCAAGATAATCGTGTATTC
GTTGAATTTCATGATCACTGTTGTTTTGTTAAGGACAAGGATACGGGCATGGTGTTGCTGAAAGGAACGCTCAGTGATGGTCTTTATCGCTTTGATGGAGTACGTGTTGA
TTCAGTGGATGTGTCAAATTCAGCTTATCAGAAAGTTCAGTCTAAAGTTAATAATAGTATTGAGTCTGCTTGTGTCTTGTCGAATACTGTAAATCTTGTGATTTCCAAGA
ATGTGTGGCATAGACGTCTTGGTCACCCCTCGTTAAAAGTTCTTGAATCTTTAGTAAGGTCATGTAATTTACCTACTAAAACTAATGAAGAGTTTAAGTTCTGTGAAGCC
TGTCAGTTTGGGAAGTCTCATGCTCTGCCCTTCAATAATTCTGTTTCTAGAGTTTGTTCTAAGTTTGAATTGGTGCATACTGATCTCTGGGGGCCTGCTCCTGTTGAGTC
TAATCAGGGGTTCAGATTTTATGTGTTGTTTCTTGATGATTATAGCAGGTTCTCTTGGATTTATCCCTTGAAACGCAAGAATGTTGCTATGATTGCATTTACTCACTTTA
CTGCCTTGGTGAAGAATCAATTTAATACATCTATCAAAACTTTGCAATCAGATAATGGAGGGGAATATACACATATCCATCGACTATGTGAGACAATGGGAGTTCAAGTT
CGGTTATCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAAAATACACATTAGTGGCATGGGGATGCAGGGTCTAAGCCTTTTCTACTCGGACAGAAAAAAAAAACTTCAGAACTTGAATTGTCTGTTACAAATTGTTCAACA
TAATGAAGCTATTCCAGAGGGACACATGGTCTCAACTACTGGAGTTTGCCAGCTTTACAAAGTCTTTAGAGATAAAACCATGAACGAGCAGGATTCAAGTACCAGAAACA
AGGGCCAGGAAGATGAGCAGCAGGACTCCATTGATGAGTATGAAATTGAGTATTATCTCTGCCAATCATCATTGAGTCTTGGTATCAGAGCTACTATGACTAGCGTCAAT
TCGTCCGGTATTTCAGCCTTGTCCTCCGGCAACAGTTTCAGCAGTCCACCATTAAATCAATTGTTGAATCCGATTACCTCAGTGAAGCTCGATAGAGGGCATTTGATTGG
GACTAAACCTTGCCCTCCTAAAATTCTTCAAGCCTCTGTTGATAGATCTGGTCCTTCGTCCTCGACCGGAACAGATCAGGCAGCAACCAGTGGAGGTTCAACTACAATTG
TTGTTGAAGAAATTAACCCAGCGTATGAAGCTTGGGTAGCAGTCGACCAACTATTGTTAGGCTGGTTATATAATTCGATGACCCCAGAGGTAGCAGTTCAGGTAATGGGC
TTTGAGAATTCTCAAGATCTATGGGCAGCTATACAGGAACTCTTCGGAGTTCAGTCTCGAGCCGAGGAAGATTACCTTCGGCAAGTTTTTCAGCAGTGCAGGAAAGGAAG
TATGAAGATGAATGACTACTTACGAGTTATGAAGAGCCACACTGACAACCTAGGTCAAGCTGGAAGCCCTGTCGCCACCCGCTCTCTAGTTTCTCAAGTTCTTTTGGGGT
TGGATGAGGAGTACAACCCAGTGGTAGCCATGATTCAAGGGAGATTTGGAATCACCTGGTCTGAGATGCAGGCTGAGCTATTGGTATTTGAAAAACGATTGGAGCTTCAA
ACTAGTCTGAAAACATCCTTGACAATCAGTCAAGGAACCTCTGTCAACATGGTTAGCAATAAGGACTCATCAGGACAGAGAAATCAGAATCAATCTGGGAATAACAGGCA
GAACAATTTTGGAAGAGGCTTTCAGAGAGGGGGTGGCAGAGGACGAGGAAGAGGCAGAGGCTATGGCTATGGTTCGTTTAACAACAAACCTGTGTGTCAGGTATGTGGTA
AGATAGGGCATACAACATTAATGTGTTATCAGCGATTCAATAAAGAGTTTTCAGGACCAAATCAAGCTCAAAACAGGGGAGATCCTAGACAAAATGGGCAACTGTCAACC
ACTGTGCAACCATCAGCTTATGTTGCAAATCAGAGTTTGAATTCCTTTGTTGCATCTCCAGAAACTGTAGTTGACCCAAGCTGGTATGCAGACAGTGGTGCATCCAGCCA
TGTGACCGCTGATTACAATGCCATTGCGAATCCTGTGGAGTATGAAGGTAATGCGTGTGTTACGGTGGGAGATGGTAGTAGCCTGAATATTAAATCTGTTGGAAATTCTT
GCTTGACCGATGGTGTTAAAACTCTAAGTCTTGAGAATATTTTGTGTGTTCCAAGCATAGCTAAAAATCTTGTGAGTGTTTCTAAGCTTGCTCAAGATAATCGTGTATTC
GTTGAATTTCATGATCACTGTTGTTTTGTTAAGGACAAGGATACGGGCATGGTGTTGCTGAAAGGAACGCTCAGTGATGGTCTTTATCGCTTTGATGGAGTACGTGTTGA
TTCAGTGGATGTGTCAAATTCAGCTTATCAGAAAGTTCAGTCTAAAGTTAATAATAGTATTGAGTCTGCTTGTGTCTTGTCGAATACTGTAAATCTTGTGATTTCCAAGA
ATGTGTGGCATAGACGTCTTGGTCACCCCTCGTTAAAAGTTCTTGAATCTTTAGTAAGGTCATGTAATTTACCTACTAAAACTAATGAAGAGTTTAAGTTCTGTGAAGCC
TGTCAGTTTGGGAAGTCTCATGCTCTGCCCTTCAATAATTCTGTTTCTAGAGTTTGTTCTAAGTTTGAATTGGTGCATACTGATCTCTGGGGGCCTGCTCCTGTTGAGTC
TAATCAGGGGTTCAGATTTTATGTGTTGTTTCTTGATGATTATAGCAGGTTCTCTTGGATTTATCCCTTGAAACGCAAGAATGTTGCTATGATTGCATTTACTCACTTTA
CTGCCTTGGTGAAGAATCAATTTAATACATCTATCAAAACTTTGCAATCAGATAATGGAGGGGAATATACACATATCCATCGACTATGTGAGACAATGGGAGTTCAAGTT
CGGTTATCCTAA
Protein sequenceShow/hide protein sequence
MGKIHISGMGMQGLSLFYSDRKKKLQNLNCLLQIVQHNEAIPEGHMVSTTGVCQLYKVFRDKTMNEQDSSTRNKGQEDEQQDSIDEYEIEYYLCQSSLSLGIRATMTSVN
SSGISALSSGNSFSSPPLNQLLNPITSVKLDRGHLIGTKPCPPKILQASVDRSGPSSSTGTDQAATSGGSTTIVVEEINPAYEAWVAVDQLLLGWLYNSMTPEVAVQVMG
FENSQDLWAAIQELFGVQSRAEEDYLRQVFQQCRKGSMKMNDYLRVMKSHTDNLGQAGSPVATRSLVSQVLLGLDEEYNPVVAMIQGRFGITWSEMQAELLVFEKRLELQ
TSLKTSLTISQGTSVNMVSNKDSSGQRNQNQSGNNRQNNFGRGFQRGGGRGRGRGRGYGYGSFNNKPVCQVCGKIGHTTLMCYQRFNKEFSGPNQAQNRGDPRQNGQLST
TVQPSAYVANQSLNSFVASPETVVDPSWYADSGASSHVTADYNAIANPVEYEGNACVTVGDGSSLNIKSVGNSCLTDGVKTLSLENILCVPSIAKNLVSVSKLAQDNRVF
VEFHDHCCFVKDKDTGMVLLKGTLSDGLYRFDGVRVDSVDVSNSAYQKVQSKVNNSIESACVLSNTVNLVISKNVWHRRLGHPSLKVLESLVRSCNLPTKTNEEFKFCEA
CQFGKSHALPFNNSVSRVCSKFELVHTDLWGPAPVESNQGFRFYVLFLDDYSRFSWIYPLKRKNVAMIAFTHFTALVKNQFNTSIKTLQSDNGGEYTHIHRLCETMGVQV
RLS