; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G17410 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G17410
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationChr6:15649187..15653059
RNA-Seq ExpressionCSPI06G17410
SyntenyCSPI06G17410
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN66323.1 hypothetical protein VITISV_007384 [Vitis vinifera]1.5e-19548.65Show/hide
Query:  IKDYSEDFDFGPWFFTPHKPDNKTDKEWELCHRKVCGFMRLW----------------------------------------MMKLKYQDGAPMLDHLNT
        +KDY     + P  F   +P+NK D EW L HR+VCG++R W                                        MM LKYQDG PM DHLNT
Subjt:  IKDYSEDFDFGPWFFTPHKPDNKTDKEWELCHRKVCGFMRLW----------------------------------------MMKLKYQDGAPMLDHLNT

Query:  FQGILNQLSRMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGML------------WLLKRGGGVKVRVQEVITEAKA------------------
        FQGI+NQL  MNIKFE+E+ GLW+LGTLP+ W+ FRTSLSNSA +G++             + ++  G   +   ++TE K                   
Subjt:  FQGILNQLSRMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGML------------WLLKRGGGVKVRVQEVITEAKA------------------

Query:  ---------------------------------------GVTSLQMLSVTIAMKKAYKE--VLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVR
                                               G    Q+ + T      Y    V    ++  WVIDSGAS+ AT +++FF SYT GDFGSVR
Subjt:  ---------------------------------------GVTSLQMLSVTIAMKKAYKE--VLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVR

Query:  MGNDGSTNTVGIEDVSLMMKVSAIPSTMTYGST-KGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKST
        MGNDGS   +G+ D SLMMK SA PS +  GS+ +GSMVI +G K SSLY M A++++S IN V+D++  ELWH RL H+SEKGL IL K N L  +K  
Subjt:  MGNDGSTNTVGIEDVSLMMKVSAIPSTMTYGST-KGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKST

Query:  PLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDS
         LKRC HCLAGK TRV FK+ +H RKP +  LV+SDVCGPMKTK+LGG+LYFVTF DDHSRKIWVYTLKTKDQVL  FKQFHA VER++GEKLKC+RTD+
Subjt:  PLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDS

Query:  GCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFG
        G EY GPFDEYCR HDIRHQK PPKTPQLNG+AER+N+TLVERVRCLLS+SQLP+SFW EALNTVV+V NLTPCVPL  +V + IWS  +ISY HLRVFG
Subjt:  GCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFG

Query:  CKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDV---------------------------------------IEDEVQNEQF-
        CKAFVH+PKDERSKLD KT+ CVF+GYGQDE GYR YDP +KKL+RSRDV                                       +EDE  ++Q  
Subjt:  CKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDV---------------------------------------IEDEVQNEQF-

Query:  -----------SDTYESFEHVG---------TEDSVQEQLAETVVPTDVSLRRSIRDRRPSTRYSPNESLLLTDG
                    +T++    +G          +D V EQ      P+D+ LRR  RDR PSTRYS ++ +LLTDG
Subjt:  -----------SDTYESFEHVG---------TEDSVQEQLAETVVPTDVSLRRSIRDRRPSTRYSPNESLLLTDG

KAA0040427.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.3e-19966.21Show/hide
Query:  KVCGFMRLWMMKLKYQDGAPMLDHL---NTFQGILNQLS-----RMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGMLWLLKRGGGVKVRVQEVI
        KVCGFMRLW+     +D    L+H+      Q + N+L      +  IKFEDEI GLWVLGTLPDSW+IFRTSLSNSA NG+L +      VK  V    
Subjt:  KVCGFMRLWMMKLKYQDGAPMLDHL---NTFQGILNQLS-----RMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGMLWLLKRGGGVKVRVQEVI

Query:  TEAKAGVTSLQMLSVTIAMKKAYK----EVLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIEDVSL---------MMKVSA
           K+  +S+Q   +    +   K     V   I++  WVIDSGASV ATSKREFFASYTPGDFGSVRMGNDG TN VGI DV L         +  V  
Subjt:  TEAKAGVTSLQMLSVTIAMKKAYK----EVLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIEDVSL---------MMKVSA

Query:  IP------------------STMTYG---STKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKSTPLK
        IP                  +T   G    TKGSMVI  GQKFSSLYYMDAKI++ DINTVNDEANVELWHKRLSH+SEKGLKILTKKNHL DLKSTPLK
Subjt:  IP------------------STMTYG---STKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKSTPLK

Query:  RCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCE
        RC HCLAGK TRVTFKSSQH RK N+L+LVHS+VCG MKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQV   FKQFHA VER+TGEKLKC+RTD+G E
Subjt:  RCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCE

Query:  YCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKA
        YCGPFDEYCRNH IRHQK PPK+ QLNGIA+RLN+TLVERVRCLL++SQLPQSFWGEALNTV++V NLTPCVPLGSEVPN IWSGKDISYSHLRVFGCKA
Subjt:  YCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKA

Query:  FVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDEVQN----EQFSDTYESFEHVGTEDSVQEQ
        FVHVPKDERSKLDAKTK CVFLGYGQDEFGYR+Y   KKKLIRSRDV+  E Q     E+  +     +    E++++++
Subjt:  FVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDEVQN----EQFSDTYESFEHVGTEDSVQEQ

KAA0047570.1 putative retrotransposon [Cucumis melo var. makuwa]1.6e-20060.53Show/hide
Query:  MNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGML------------WLLKRGGGVKVRVQEVITEAKAGVTSLQMLSVTIAMKKA-----------
        MNIKFE+EIHGLWVLG L DSW+IFRTSLSNSA NG+L             + ++     V+   ++TE +    S      + +  K+           
Subjt:  MNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGML------------WLLKRGGGVKVRVQEVITEAKAGVTSLQMLSVTIAMKKA-----------

Query:  --------YKEVLSKIEKRH----------------------------------------WVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTV
                Y   L +  K H                                        WVIDSGASV ATSK +FFASYTP DFGSVRMGNDGS N V
Subjt:  --------YKEVLSKIEKRH----------------------------------------WVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTV

Query:  GIEDVSL-----------MMKVSAIPSTM------------------TYGSTKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISE
        GI DV L           +  +S I   +                   +  TKGS+VI +G KFSSLYYMDAKI++SDINTVNDE N+ELWHKRLSH+SE
Subjt:  GIEDVSL-----------MMKVSAIPSTM------------------TYGSTKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISE

Query:  KGLKILTKK-----------NHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTK
        KGLKILTKK           NHLPDLKSTPLKRC HCLAGK TRVTFKSSQH RKPN+L+LVHS+VCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTK
Subjt:  KGLKILTKK-----------NHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTK

Query:  DQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNL
        DQVLQ FKQFHA VER+TGEKLKC+RTD+G EYCGPFDEYCRNH IRHQK PPKTPQLNGIAERLN+TLVERVRCLLS+SQLPQSFWGEALNTVV+V NL
Subjt:  DQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNL

Query:  TPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDV---------------------
        TPCVPLGSEVPN IWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTK CVFLGYGQDEFGYRLYDP KKKLIRSRDV                     
Subjt:  TPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDV---------------------

Query:  ------------------IEDEVQNEQFSDTYESFEHVGTEDSVQE
                          IEDE+QNEQF DT ES E VG EDSVQE
Subjt:  ------------------IEDEVQNEQFSDTYESFEHVGTEDSVQE

RVW85908.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]7.1e-19347.97Show/hide
Query:  FTPHKPDNKTDKEWELCHRKVCGFMRLW----------------------------------------MMKLKYQDGAPMLDHLNTFQGILNQLSRMNIK
        F   +P+NKTD EW L HR+VCGF+R W                                        MM LKYQDG P+ DHLNTFQGI+NQL+ MNIK
Subjt:  FTPHKPDNKTDKEWELCHRKVCGFMRLW----------------------------------------MMKLKYQDGAPMLDHLNTFQGILNQLSRMNIK

Query:  FEDEIHGLWVLGTLPDSWKIFRTSLSNSASNG----------------------------MLWLLKRG-------------------------------G
        FE+E+ GLW+LGTLPDSW+ FRTSLSNSA +G                            +L + KRG                               G
Subjt:  FEDEIHGLWVLGTLPDSWKIFRTSLSNSASNG----------------------------MLWLLKRG-------------------------------G

Query:  GVKVRVQEVITEAKAGVTSL----------QMLSVTIAMKKAYKE--VLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIED
         +K   +++  + K G              Q+ + T      Y    V    ++  WVIDSGAS+ AT +++FF SYT GDFGSVRMGNDGS   + + D
Subjt:  GVKVRVQEVITEAKAGVTSL----------QMLSVTIAMKKAYKE--VLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIED

Query:  VSL---------MMKVSAIPS---------------------TMTYGSTKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGL
        V L         +  V  IP                         +  T+GSMVI +G K SSLY M A++++S IN V+D++  ELWH RL H+SEKGL
Subjt:  VSL---------MMKVSAIPS---------------------TMTYGSTKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGL

Query:  KILTKKNHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFV
         IL KKN L  +K   LKRC HCLAGK TRV FK+ +H RKP +L LV+SDVCGPMKTK+LGG+LYFVTF DDHSRKIWVYTLKTKDQVL  FKQFHA V
Subjt:  KILTKKNHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFV

Query:  ERKTGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNII
        ER++GEKLKC+RTD+G EY GPFDEYCR H IRHQK PPKTPQLNG+AER+N+TLVERVRCLLS+SQLP+SFWGEALNTVV+V NLTPCVPL  +VP+ I
Subjt:  ERKTGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNII

Query:  WSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDV-----------------------------------
        WS  +ISY HLRVFGCKAFVH+PKDERSKLDAKT+ CVF+GYGQDE GYR YDP +KKL+RSRDV                                   
Subjt:  WSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDV-----------------------------------

Query:  ----IEDE----------------VQNEQFSDTYESFE-----HVGTEDSVQEQLAETVVPTDVSLRRSIRDRRPSTRYSPNESLLLTDG
            +EDE                V++E   D ++  +      V  +D V EQ      P+D+ LRRS RDR PSTRYS ++ +LLTDG
Subjt:  ----IEDE----------------VQNEQFSDTYESFE-----HVGTEDSVQEQLAETVVPTDVSLRRSIRDRRPSTRYSPNESLLLTDG

TYJ98688.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa]1.6e-20066.38Show/hide
Query:  KVCGFMRLWMMKLKYQDGAPMLDHL---NTFQGILNQLS-----RMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGMLWLLKRGGGVKVRVQEVI
        KVCGFMRLW+     +D    L+H+      Q + N+L      +  IKFEDEI GLWVLGTLPDSW+IFRTSLSNSA NG+L +      VK  V    
Subjt:  KVCGFMRLWMMKLKYQDGAPMLDHL---NTFQGILNQLS-----RMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGMLWLLKRGGGVKVRVQEVI

Query:  TEAKAGVTSLQMLSVTIAMKKAYK----EVLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIEDVSL---------MMKVSA
           K+  +S+Q   +    +   K     V   I++  WVIDSGASV ATSKREFFASYTPGDFGSVRMGNDG TN VGI DV L         +  V  
Subjt:  TEAKAGVTSLQMLSVTIAMKKAYK----EVLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIEDVSL---------MMKVSA

Query:  IP------------------STMTYG---STKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKSTPLK
        IP                  +T   G    TKGSMVI  GQKFSSLYYMDAKI++ DINTVNDEANVELWHKRLSH+SEKGLKILTKKNHL DLKSTPLK
Subjt:  IP------------------STMTYG---STKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKSTPLK

Query:  RCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCE
        RC HCLAGK TRVTFKSSQH RK N+L+LVHS+VCG MKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQV   FKQFHA VER+TGEKLKC+RTD+G E
Subjt:  RCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCE

Query:  YCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKA
        YCGPFDEYCRNH IRHQK PPK+ QLNGIA+RLN+TLVERVRCLL++SQLPQSFWGEALNTV++V NLTPCVPLGSEVPN IWSGKDISYSHLRVFGCKA
Subjt:  YCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKA

Query:  FVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDEVQN----EQFSDTYESFEHVGTEDSVQEQ
        FVHVPKDERSKLDAKTK CVFLGYGQDEFGYR+YD  KKKLIRSRDV+  E Q     E+  +     +    E++++++
Subjt:  FVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDEVQN----EQFSDTYESFEHVGTEDSVQEQ

TrEMBL top hitse value%identityAlignment
A0A438HN89 Retrovirus-related Pol polyprotein from transposon TNT 1-943.4e-19347.97Show/hide
Query:  FTPHKPDNKTDKEWELCHRKVCGFMRLW----------------------------------------MMKLKYQDGAPMLDHLNTFQGILNQLSRMNIK
        F   +P+NKTD EW L HR+VCGF+R W                                        MM LKYQDG P+ DHLNTFQGI+NQL+ MNIK
Subjt:  FTPHKPDNKTDKEWELCHRKVCGFMRLW----------------------------------------MMKLKYQDGAPMLDHLNTFQGILNQLSRMNIK

Query:  FEDEIHGLWVLGTLPDSWKIFRTSLSNSASNG----------------------------MLWLLKRG-------------------------------G
        FE+E+ GLW+LGTLPDSW+ FRTSLSNSA +G                            +L + KRG                               G
Subjt:  FEDEIHGLWVLGTLPDSWKIFRTSLSNSASNG----------------------------MLWLLKRG-------------------------------G

Query:  GVKVRVQEVITEAKAGVTSL----------QMLSVTIAMKKAYKE--VLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIED
         +K   +++  + K G              Q+ + T      Y    V    ++  WVIDSGAS+ AT +++FF SYT GDFGSVRMGNDGS   + + D
Subjt:  GVKVRVQEVITEAKAGVTSL----------QMLSVTIAMKKAYKE--VLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIED

Query:  VSL---------MMKVSAIPS---------------------TMTYGSTKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGL
        V L         +  V  IP                         +  T+GSMVI +G K SSLY M A++++S IN V+D++  ELWH RL H+SEKGL
Subjt:  VSL---------MMKVSAIPS---------------------TMTYGSTKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGL

Query:  KILTKKNHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFV
         IL KKN L  +K   LKRC HCLAGK TRV FK+ +H RKP +L LV+SDVCGPMKTK+LGG+LYFVTF DDHSRKIWVYTLKTKDQVL  FKQFHA V
Subjt:  KILTKKNHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFV

Query:  ERKTGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNII
        ER++GEKLKC+RTD+G EY GPFDEYCR H IRHQK PPKTPQLNG+AER+N+TLVERVRCLLS+SQLP+SFWGEALNTVV+V NLTPCVPL  +VP+ I
Subjt:  ERKTGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNII

Query:  WSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDV-----------------------------------
        WS  +ISY HLRVFGCKAFVH+PKDERSKLDAKT+ CVF+GYGQDE GYR YDP +KKL+RSRDV                                   
Subjt:  WSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDV-----------------------------------

Query:  ----IEDE----------------VQNEQFSDTYESFE-----HVGTEDSVQEQLAETVVPTDVSLRRSIRDRRPSTRYSPNESLLLTDG
            +EDE                V++E   D ++  +      V  +D V EQ      P+D+ LRRS RDR PSTRYS ++ +LLTDG
Subjt:  ----IEDE----------------VQNEQFSDTYESFE-----HVGTEDSVQEQLAETVVPTDVSLRRSIRDRRPSTRYSPNESLLLTDG

A0A5A7TFU1 Retrovirus-related Pol polyprotein from transposon TNT 1-946.4e-20066.21Show/hide
Query:  KVCGFMRLWMMKLKYQDGAPMLDHL---NTFQGILNQLS-----RMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGMLWLLKRGGGVKVRVQEVI
        KVCGFMRLW+     +D    L+H+      Q + N+L      +  IKFEDEI GLWVLGTLPDSW+IFRTSLSNSA NG+L +      VK  V    
Subjt:  KVCGFMRLWMMKLKYQDGAPMLDHL---NTFQGILNQLS-----RMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGMLWLLKRGGGVKVRVQEVI

Query:  TEAKAGVTSLQMLSVTIAMKKAYK----EVLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIEDVSL---------MMKVSA
           K+  +S+Q   +    +   K     V   I++  WVIDSGASV ATSKREFFASYTPGDFGSVRMGNDG TN VGI DV L         +  V  
Subjt:  TEAKAGVTSLQMLSVTIAMKKAYK----EVLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIEDVSL---------MMKVSA

Query:  IP------------------STMTYG---STKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKSTPLK
        IP                  +T   G    TKGSMVI  GQKFSSLYYMDAKI++ DINTVNDEANVELWHKRLSH+SEKGLKILTKKNHL DLKSTPLK
Subjt:  IP------------------STMTYG---STKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKSTPLK

Query:  RCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCE
        RC HCLAGK TRVTFKSSQH RK N+L+LVHS+VCG MKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQV   FKQFHA VER+TGEKLKC+RTD+G E
Subjt:  RCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCE

Query:  YCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKA
        YCGPFDEYCRNH IRHQK PPK+ QLNGIA+RLN+TLVERVRCLL++SQLPQSFWGEALNTV++V NLTPCVPLGSEVPN IWSGKDISYSHLRVFGCKA
Subjt:  YCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKA

Query:  FVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDEVQN----EQFSDTYESFEHVGTEDSVQEQ
        FVHVPKDERSKLDAKTK CVFLGYGQDEFGYR+Y   KKKLIRSRDV+  E Q     E+  +     +    E++++++
Subjt:  FVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDEVQN----EQFSDTYESFEHVGTEDSVQEQ

A0A5D3BKF7 Retrovirus-related Pol polyprotein from transposon TNT 1-947.6e-20166.38Show/hide
Query:  KVCGFMRLWMMKLKYQDGAPMLDHL---NTFQGILNQLS-----RMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGMLWLLKRGGGVKVRVQEVI
        KVCGFMRLW+     +D    L+H+      Q + N+L      +  IKFEDEI GLWVLGTLPDSW+IFRTSLSNSA NG+L +      VK  V    
Subjt:  KVCGFMRLWMMKLKYQDGAPMLDHL---NTFQGILNQLS-----RMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGMLWLLKRGGGVKVRVQEVI

Query:  TEAKAGVTSLQMLSVTIAMKKAYK----EVLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIEDVSL---------MMKVSA
           K+  +S+Q   +    +   K     V   I++  WVIDSGASV ATSKREFFASYTPGDFGSVRMGNDG TN VGI DV L         +  V  
Subjt:  TEAKAGVTSLQMLSVTIAMKKAYK----EVLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIEDVSL---------MMKVSA

Query:  IP------------------STMTYG---STKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKSTPLK
        IP                  +T   G    TKGSMVI  GQKFSSLYYMDAKI++ DINTVNDEANVELWHKRLSH+SEKGLKILTKKNHL DLKSTPLK
Subjt:  IP------------------STMTYG---STKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKSTPLK

Query:  RCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCE
        RC HCLAGK TRVTFKSSQH RK N+L+LVHS+VCG MKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQV   FKQFHA VER+TGEKLKC+RTD+G E
Subjt:  RCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCE

Query:  YCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKA
        YCGPFDEYCRNH IRHQK PPK+ QLNGIA+RLN+TLVERVRCLL++SQLPQSFWGEALNTV++V NLTPCVPLGSEVPN IWSGKDISYSHLRVFGCKA
Subjt:  YCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKA

Query:  FVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDEVQN----EQFSDTYESFEHVGTEDSVQEQ
        FVHVPKDERSKLDAKTK CVFLGYGQDEFGYR+YD  KKKLIRSRDV+  E Q     E+  +     +    E++++++
Subjt:  FVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDEVQN----EQFSDTYESFEHVGTEDSVQEQ

A0A5D3CVK2 Putative retrotransposon7.6e-20160.53Show/hide
Query:  MNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGML------------WLLKRGGGVKVRVQEVITEAKAGVTSLQMLSVTIAMKKA-----------
        MNIKFE+EIHGLWVLG L DSW+IFRTSLSNSA NG+L             + ++     V+   ++TE +    S      + +  K+           
Subjt:  MNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGML------------WLLKRGGGVKVRVQEVITEAKAGVTSLQMLSVTIAMKKA-----------

Query:  --------YKEVLSKIEKRH----------------------------------------WVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTV
                Y   L +  K H                                        WVIDSGASV ATSK +FFASYTP DFGSVRMGNDGS N V
Subjt:  --------YKEVLSKIEKRH----------------------------------------WVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTV

Query:  GIEDVSL-----------MMKVSAIPSTM------------------TYGSTKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISE
        GI DV L           +  +S I   +                   +  TKGS+VI +G KFSSLYYMDAKI++SDINTVNDE N+ELWHKRLSH+SE
Subjt:  GIEDVSL-----------MMKVSAIPSTM------------------TYGSTKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISE

Query:  KGLKILTKK-----------NHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTK
        KGLKILTKK           NHLPDLKSTPLKRC HCLAGK TRVTFKSSQH RKPN+L+LVHS+VCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTK
Subjt:  KGLKILTKK-----------NHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTK

Query:  DQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNL
        DQVLQ FKQFHA VER+TGEKLKC+RTD+G EYCGPFDEYCRNH IRHQK PPKTPQLNGIAERLN+TLVERVRCLLS+SQLPQSFWGEALNTVV+V NL
Subjt:  DQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNL

Query:  TPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDV---------------------
        TPCVPLGSEVPN IWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTK CVFLGYGQDEFGYRLYDP KKKLIRSRDV                     
Subjt:  TPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDV---------------------

Query:  ------------------IEDEVQNEQFSDTYESFEHVGTEDSVQE
                          IEDE+QNEQF DT ES E VG EDSVQE
Subjt:  ------------------IEDEVQNEQFSDTYESFEHVGTEDSVQE

A5C3L0 Integrase catalytic domain-containing protein7.4e-19648.65Show/hide
Query:  IKDYSEDFDFGPWFFTPHKPDNKTDKEWELCHRKVCGFMRLW----------------------------------------MMKLKYQDGAPMLDHLNT
        +KDY     + P  F   +P+NK D EW L HR+VCG++R W                                        MM LKYQDG PM DHLNT
Subjt:  IKDYSEDFDFGPWFFTPHKPDNKTDKEWELCHRKVCGFMRLW----------------------------------------MMKLKYQDGAPMLDHLNT

Query:  FQGILNQLSRMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGML------------WLLKRGGGVKVRVQEVITEAKA------------------
        FQGI+NQL  MNIKFE+E+ GLW+LGTLP+ W+ FRTSLSNSA +G++             + ++  G   +   ++TE K                   
Subjt:  FQGILNQLSRMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGML------------WLLKRGGGVKVRVQEVITEAKA------------------

Query:  ---------------------------------------GVTSLQMLSVTIAMKKAYKE--VLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVR
                                               G    Q+ + T      Y    V    ++  WVIDSGAS+ AT +++FF SYT GDFGSVR
Subjt:  ---------------------------------------GVTSLQMLSVTIAMKKAYKE--VLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVR

Query:  MGNDGSTNTVGIEDVSLMMKVSAIPSTMTYGST-KGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKST
        MGNDGS   +G+ D SLMMK SA PS +  GS+ +GSMVI +G K SSLY M A++++S IN V+D++  ELWH RL H+SEKGL IL K N L  +K  
Subjt:  MGNDGSTNTVGIEDVSLMMKVSAIPSTMTYGST-KGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKST

Query:  PLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDS
         LKRC HCLAGK TRV FK+ +H RKP +  LV+SDVCGPMKTK+LGG+LYFVTF DDHSRKIWVYTLKTKDQVL  FKQFHA VER++GEKLKC+RTD+
Subjt:  PLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDS

Query:  GCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFG
        G EY GPFDEYCR HDIRHQK PPKTPQLNG+AER+N+TLVERVRCLLS+SQLP+SFW EALNTVV+V NLTPCVPL  +V + IWS  +ISY HLRVFG
Subjt:  GCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFG

Query:  CKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDV---------------------------------------IEDEVQNEQF-
        CKAFVH+PKDERSKLD KT+ CVF+GYGQDE GYR YDP +KKL+RSRDV                                       +EDE  ++Q  
Subjt:  CKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDV---------------------------------------IEDEVQNEQF-

Query:  -----------SDTYESFEHVG---------TEDSVQEQLAETVVPTDVSLRRSIRDRRPSTRYSPNESLLLTDG
                    +T++    +G          +D V EQ      P+D+ LRR  RDR PSTRYS ++ +LLTDG
Subjt:  -----------SDTYESFEHVG---------TEDSVQEQLAETVVPTDVSLRRSIRDRRPSTRYSPNESLLLTDG

SwissProt top hitse value%identityAlignment
P04146 Copia protein8.8e-4535.09Show/hide
Query:  NVELWHKRLSHISEKGLKILTKKNHLPD---LKSTPL--KRCLHCLAGKHTRVTF---KSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHS
        N  LWH+R  HIS+  L  + +KN   D   L +  L  + C  CL GK  R+ F   K   H+++P  L +VHSDVCGP+   +L    YFV F D  +
Subjt:  NVELWHKRLSHISEKGLKILTKKNHLPD---LKSTPL--KRCLHCLAGKHTRVTF---KSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHS

Query:  RKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEY-CGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWG
             Y +K K  V   F+ F A  E     K+  +  D+G EY      ++C    I +    P TPQLNG++ER+ +T+ E+ R ++S ++L +SFWG
Subjt:  RKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEY-CGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWG

Query:  EALNTVVYVFNLTPCVPL--GSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDEVQN
        EA+ T  Y+ N  P   L   S+ P  +W  K     HLRVFG   +VH+ K+++ K D K+   +F+GY  +  G++L+D   +K I +RDV+ DE   
Subjt:  EALNTVVYVFNLTPCVPL--GSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDEVQN

Query:  EQFSDTYESFEHVGTEDSVQEQ
           +     FE V  +DS + +
Subjt:  EQFSDTYESFEHVGTEDSVQEQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-12037.84Show/hide
Query:  FMRLWMMKLKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSN--------SASNGMLWLLKRGGGVKVRVQEVITEAK
        +++  +  L   +G   L HLN F G++ QL+ + +K E+E   + +L +LP S+    T++ +          ++ +L   K     + + Q +ITE +
Subjt:  FMRLWMMKLKYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSN--------SASNGMLWLLKRGGGVKVRVQEVITEAK

Query:  A--------------------------------------------------GVTSLQML-SVTIAMKKAYKEVLSKIEKR-----------HWVIDSGAS
                                                           G TS Q     T AM +    V+  I +             WV+D+ AS
Subjt:  A--------------------------------------------------GVTSLQML-SVTIAMKKAYKEVLSKIEKR-----------HWVIDSGAS

Query:  VRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIEDV--------SLMMK-VSAIP---------------------STMTYGSTKGSMVIPQGQKFSSL
          AT  R+ F  Y  GDFG+V+MGN   +   GI D+        +L++K V  +P                     +   +  TKGS+VI +G    +L
Subjt:  VRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIEDV--------SLMMK-VSAIP---------------------STMTYGSTKGSMVIPQGQKFSSL

Query:  YYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGA
        Y  +A+I + ++N   DE +V+LWHKR+ H+SEKGL+IL KK+ +   K T +K C +CL GK  RV+F++S   RK N+L LV+SDVCGPM+ +S+GG 
Subjt:  YYMDAKIMESDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGA

Query:  LYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEYCG-PFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLL
         YFVTF DD SRK+WVY LKTKDQV Q F++FHA VER+TG KLK +R+D+G EY    F+EYC +H IRH+K  P TPQ NG+AER+N+T+VE+VR +L
Subjt:  LYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEYCG-PFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLL

Query:  SKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSR
          ++LP+SFWGEA+ T  Y+ N +P VPL  E+P  +W+ K++SYSHL+VFGC+AF HVPK++R+KLD K+  C+F+GYG +EFGYRL+DP KKK+IRSR
Subjt:  SKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSR

Query:  DVIEDEVQNEQFSDTYESFEHVGTEDSVQEQLAETVVPTDVSLRRSIRDRRPSTRYSPNESLLLTD
        DV+  E               V T   + E++   ++P  V++        PST  +P  +   TD
Subjt:  DVIEDEVQNEQFSDTYESFEHVGTEDSVQEQLAETVVPTDVSLRRSIRDRRPSTRYSPNESLLLTD

Q12337 Transposon Ty2-GR1 Gag-Pol polyprotein1.2e-2025.6Show/hide
Query:  SSLYYMDAKIMESDINTVNDEANVE-----LWHKRLSHISEKGLKILTKKNHLPDLKSTPLK-------RCLHCLAGKHTRVTFKSSQHLRKPNL---LK
        S  Y + + I +  IN VN   +V      L H+ L H + + ++   KKN +  LK + ++       +C  CL GK T+        L+        +
Subjt:  SSLYYMDAKIMESDINTVNDEANVE-----LWHKRLSHISEKGLKILTKKNHLPDLKSTPLK-------RCLHCLAGKHTRVTFKSSQHLRKPNL---LK

Query:  LVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTL--KTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEYCG-PFDEYCRNHDIRHQKAPPKTPQ
         +H+D+ GP+         YF++FTD+ +R  WVY L  + ++ +L  F    AF++ +   ++  ++ D G EY      ++  N  I          +
Subjt:  LVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTL--KTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEYCG-PFDEYCRNHDIRHQKAPPKTPQ

Query:  LNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFN-LTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGY
         +G+AERLN+TL+   R LL  S LP   W  A+     + N L       S   +   +G DI  + +  FG    V+    + SK+  +      L  
Subjt:  LNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFN-LTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGY

Query:  GQDEFGYRLYDPTKKKLIRSRDVIEDEVQNEQ
         ++ +GY +Y P+ KK + + + +   +QN+Q
Subjt:  GQDEFGYRLYDPTKKKLIRSRDVIEDEVQNEQ

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.5e-3327.25Show/hide
Query:  SKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVG-IEDVSLMMK--------------------------------VSAIPSTMTY
        S     +W++DSGA+   TS     + + P   G   M  DGST  +      SL  K                                V   P++   
Subjt:  SKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVG-IEDVSLMMK--------------------------------VSAIPSTMTY

Query:  GSTKGSMVIPQGQKFSSLYYMDAKIME--SDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLK-STPLKRCLHCLAGKHTRVTFKSSQHLRKPN
              + + QG+    LY       +  S   + + +A    WH RL H +   L  +     L  L  S     C  CL  K  +V F  S  +    
Subjt:  GSTKGSMVIPQGQKFSSLYYMDAKIME--SDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLK-STPLKRCLHCLAGKHTRVTFKSSQHLRKPN

Query:  LLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQKAPPKTPQ
         L+ ++SDV       S     Y+V F D  +R  W+Y LK K QV + F  F   +E +   ++    +D+G E+   + EY   H I H  +PP TP+
Subjt:  LLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQKAPPKTPQ

Query:  LNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYG
         NG++ER ++ +VE    LLS + +P+++W  A    VY+ N  P   L  E P     G   +Y  LRVFGC  +  +    + KLD K++ CVFLGY 
Subjt:  LNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKACVFLGYG

Query:  QDEFGYRLYDPTKKKLIRSRDVIEDEVQNEQFSDTYESFEHVGTEDSVQEQLAET
          +  Y        +L  SR V  DE       + +    ++ T   VQEQ  E+
Subjt:  QDEFGYRLYDPTKKKLIRSRDVIEDEVQNEQFSDTYESFEHVGTEDSVQEQLAET

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE27.0e-3427.25Show/hide
Query:  KAYKEVLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGST-------------------------------NTVGIEDVSLMMKVSA--
        +A   V S     +W++DSGA+   TS     + + P   G   M  DGST                               N + +  +    +VS   
Subjt:  KAYKEVLSKIEKRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGST-------------------------------NTVGIEDVSLMMKVSA--

Query:  IPSTMTYGSTKGSMVIPQGQKFSSLYYMDAKIME--SDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLK-STPLKRCLHCLAGKHTRVTFKSS
         P++         + + QG+    LY       +  S   +   +A    WH RL H S   L  +   + LP L  S  L  C  C   K  +V F +S
Subjt:  IPSTMTYGSTKGSMVIPQGQKFSSLYYMDAKIME--SDINTVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLK-STPLKRCLHCLAGKHTRVTFKSS

Query:  QHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQK
          +     L+ ++SDV       S+    Y+V F D  +R  W+Y LK K QV   F  F + VE +   ++  + +D+G E+     +Y   H I H  
Subjt:  QHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERKTGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQK

Query:  APPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKA
        +PP TP+ NG++ER ++ +VE    LLS + +P+++W  A +  VY+ N  P   L  + P     G+  +Y  L+VFGC  +  +    R KL+ K+K 
Subjt:  APPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTKA

Query:  CVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDE
        C F+GY   +  Y        +L  SR V  DE
Subjt:  CVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDE

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein1.1e-1339.81Show/hide
Query:  KGSMVIPQGQKFSSLYYMDAKIMESDIN---TVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLK
        KG   I +G +  SLY +   +   + N   T  DE    LWH RL+H+S++G+++L KK  L   K + LK C  C+ GK  RV F + QH  K N L 
Subjt:  KGSMVIPQGQKFSSLYYMDAKIMESDIN---TVNDEANVELWHKRLSHISEKGLKILTKKNHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLK

Query:  LVHSDVCG
         VHSD+ G
Subjt:  LVHSDVCG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.7e-1135.29Show/hide
Query:  LNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTK
        +N+T++E+VR +L +  LP++F  +A NT V++ N  P   +   VP+ +W     +YS+LR FGC A++H    +  KL  + K
Subjt:  LNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRVFGCKAFVHVPKDERSKLDAKTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGGTGGCCGTACAGTCTGGCGGTTTTGAAAGGCGTGAAAAGGCTGTTTCTTTCTTACAAAAGCGGGGGACTCTTTTGGGGATCGATTGGTGGTCCGATCAAGCTGA
AATTTTGACTGGTGTATTCTTGGCACGAGTGGAGATTTTATACTTGCCTCTAGTTGTGACTAGGGAGAACTTTGTTGTACCCATTAAAGATTATAGTGAAGATTTTGACT
TCGGTCCGTGGTTTTTTACTCCTCACAAGCCTGACAACAAAACTGATAAAGAATGGGAATTATGTCATAGGAAAGTGTGTGGGTTTATGAGGCTATGGATGATGAAGTTA
AAGTATCAAGATGGAGCACCTATGTTAGATCACTTGAATACATTTCAAGGTATTTTGAATCAGCTATCTAGAATGAATATCAAGTTTGAGGATGAGATACATGGGTTATG
GGTGCTTGGTACATTGCCGGACTCGTGGAAAATATTTAGAACTTCCTTATCGAACTCAGCCTCAAATGGTATGTTATGGTTACTGAAAAGAGGGGGAGGAGTAAAAGTAA
GAGTCCAAGAGGTAATAACAGAAGCAAAAGCAGGAGTGACCAGTTTGCAAATGTTGAGTGTCACTATTGCCATGAAGAAGGCATATAAAGAAGTATTGTCGAAAATTGAA
AAGAGACATTGGGTGATTGATAGTGGTGCATCAGTTCGTGCTACTTCGAAAAGGGAATTTTTTGCATCCTATACTCCTGGTGATTTTGGCAGTGTTAGGATGGGTAATGA
CGGATCAACAAATACAGTTGGCATCGAAGATGTAAGCTTGATGATGAAGGTTTCTGCAATACCTTCGACAATGACATATGGAAGCACTAAAGGTTCAATGGTTATACCAC
AGGGACAAAAATTTTCTTCACTGTACTACATGGATGCAAAAATCATGGAGTCTGATATAAATACAGTGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTTAGC
CATATAAGTGAGAAGGGTTTAAAGATTTTAACCAAGAAAAATCATCTTCCTGATTTAAAGAGTACACCTCTAAAACGATGTCTTCATTGTTTGGCAGGAAAGCATACGAG
AGTTACCTTTAAATCATCTCAACATTTAAGGAAGCCAAATTTACTAAAGTTAGTACATTCTGATGTGTGTGGTCCCATGAAAACAAAGTCTCTTGGGGGTGCTTTGTATT
TTGTGACATTTACTGATGATCATTCAAGGAAAATATGGGTTTACACCTTGAAGACTAAAGATCAAGTGTTGCAAGCGTTTAAACAATTTCATGCTTTTGTTGAGAGAAAA
ACTGGTGAAAAGCTCAAGTGTGTTAGAACTGATAGTGGATGTGAGTATTGTGGACCTTTTGATGAATATTGTAGAAATCATGACATTCGACATCAAAAGGCACCTCCTAA
GACCCCACAGTTAAATGGAATTGCTGAAAGATTGAACAAAACATTGGTTGAGAGAGTGAGATGCTTATTATCTAAATCACAGTTGCCACAATCATTTTGGGGTGAAGCTT
TAAATACAGTTGTATATGTTTTCAATCTCACACCATGTGTTCCTTTGGGATCAGAAGTTCCAAACATAATATGGTCAGGTAAGGATATATCTTACAGTCACCTACGTGTC
TTTGGTTGTAAAGCTTTTGTTCATGTACCTAAAGATGAGAGATCAAAGCTTGATGCAAAAACCAAAGCATGTGTGTTTCTTGGTTATGGCCAAGATGAGTTTGGTTATAG
ATTATATGATCCAACTAAGAAAAAGCTTATAAGAAGTCGAGATGTTATAGAAGATGAGGTTCAAAATGAACAGTTTTCTGATACATATGAGAGTTTTGAGCACGTTGGGA
CAGAGGATAGTGTTCAGGAACAGTTAGCTGAAACAGTTGTTCCTACAGATGTTTCGCTCAGGAGATCTATCAGAGATCGACGTCCGTCAACAAGATATTCACCTAATGAA
TCTTTGCTATTGACTGACGGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACGGTGGCCGTACAGTCTGGCGGTTTTGAAAGGCGTGAAAAGGCTGTTTCTTTCTTACAAAAGCGGGGGACTCTTTTGGGGATCGATTGGTGGTCCGATCAAGCTGA
AATTTTGACTGGTGTATTCTTGGCACGAGTGGAGATTTTATACTTGCCTCTAGTTGTGACTAGGGAGAACTTTGTTGTACCCATTAAAGATTATAGTGAAGATTTTGACT
TCGGTCCGTGGTTTTTTACTCCTCACAAGCCTGACAACAAAACTGATAAAGAATGGGAATTATGTCATAGGAAAGTGTGTGGGTTTATGAGGCTATGGATGATGAAGTTA
AAGTATCAAGATGGAGCACCTATGTTAGATCACTTGAATACATTTCAAGGTATTTTGAATCAGCTATCTAGAATGAATATCAAGTTTGAGGATGAGATACATGGGTTATG
GGTGCTTGGTACATTGCCGGACTCGTGGAAAATATTTAGAACTTCCTTATCGAACTCAGCCTCAAATGGTATGTTATGGTTACTGAAAAGAGGGGGAGGAGTAAAAGTAA
GAGTCCAAGAGGTAATAACAGAAGCAAAAGCAGGAGTGACCAGTTTGCAAATGTTGAGTGTCACTATTGCCATGAAGAAGGCATATAAAGAAGTATTGTCGAAAATTGAA
AAGAGACATTGGGTGATTGATAGTGGTGCATCAGTTCGTGCTACTTCGAAAAGGGAATTTTTTGCATCCTATACTCCTGGTGATTTTGGCAGTGTTAGGATGGGTAATGA
CGGATCAACAAATACAGTTGGCATCGAAGATGTAAGCTTGATGATGAAGGTTTCTGCAATACCTTCGACAATGACATATGGAAGCACTAAAGGTTCAATGGTTATACCAC
AGGGACAAAAATTTTCTTCACTGTACTACATGGATGCAAAAATCATGGAGTCTGATATAAATACAGTGAATGATGAAGCAAATGTTGAGCTTTGGCATAAGAGACTTAGC
CATATAAGTGAGAAGGGTTTAAAGATTTTAACCAAGAAAAATCATCTTCCTGATTTAAAGAGTACACCTCTAAAACGATGTCTTCATTGTTTGGCAGGAAAGCATACGAG
AGTTACCTTTAAATCATCTCAACATTTAAGGAAGCCAAATTTACTAAAGTTAGTACATTCTGATGTGTGTGGTCCCATGAAAACAAAGTCTCTTGGGGGTGCTTTGTATT
TTGTGACATTTACTGATGATCATTCAAGGAAAATATGGGTTTACACCTTGAAGACTAAAGATCAAGTGTTGCAAGCGTTTAAACAATTTCATGCTTTTGTTGAGAGAAAA
ACTGGTGAAAAGCTCAAGTGTGTTAGAACTGATAGTGGATGTGAGTATTGTGGACCTTTTGATGAATATTGTAGAAATCATGACATTCGACATCAAAAGGCACCTCCTAA
GACCCCACAGTTAAATGGAATTGCTGAAAGATTGAACAAAACATTGGTTGAGAGAGTGAGATGCTTATTATCTAAATCACAGTTGCCACAATCATTTTGGGGTGAAGCTT
TAAATACAGTTGTATATGTTTTCAATCTCACACCATGTGTTCCTTTGGGATCAGAAGTTCCAAACATAATATGGTCAGGTAAGGATATATCTTACAGTCACCTACGTGTC
TTTGGTTGTAAAGCTTTTGTTCATGTACCTAAAGATGAGAGATCAAAGCTTGATGCAAAAACCAAAGCATGTGTGTTTCTTGGTTATGGCCAAGATGAGTTTGGTTATAG
ATTATATGATCCAACTAAGAAAAAGCTTATAAGAAGTCGAGATGTTATAGAAGATGAGGTTCAAAATGAACAGTTTTCTGATACATATGAGAGTTTTGAGCACGTTGGGA
CAGAGGATAGTGTTCAGGAACAGTTAGCTGAAACAGTTGTTCCTACAGATGTTTCGCTCAGGAGATCTATCAGAGATCGACGTCCGTCAACAAGATATTCACCTAATGAA
TCTTTGCTATTGACTGACGGGTGA
Protein sequenceShow/hide protein sequence
MTVAVQSGGFERREKAVSFLQKRGTLLGIDWWSDQAEILTGVFLARVEILYLPLVVTRENFVVPIKDYSEDFDFGPWFFTPHKPDNKTDKEWELCHRKVCGFMRLWMMKL
KYQDGAPMLDHLNTFQGILNQLSRMNIKFEDEIHGLWVLGTLPDSWKIFRTSLSNSASNGMLWLLKRGGGVKVRVQEVITEAKAGVTSLQMLSVTIAMKKAYKEVLSKIE
KRHWVIDSGASVRATSKREFFASYTPGDFGSVRMGNDGSTNTVGIEDVSLMMKVSAIPSTMTYGSTKGSMVIPQGQKFSSLYYMDAKIMESDINTVNDEANVELWHKRLS
HISEKGLKILTKKNHLPDLKSTPLKRCLHCLAGKHTRVTFKSSQHLRKPNLLKLVHSDVCGPMKTKSLGGALYFVTFTDDHSRKIWVYTLKTKDQVLQAFKQFHAFVERK
TGEKLKCVRTDSGCEYCGPFDEYCRNHDIRHQKAPPKTPQLNGIAERLNKTLVERVRCLLSKSQLPQSFWGEALNTVVYVFNLTPCVPLGSEVPNIIWSGKDISYSHLRV
FGCKAFVHVPKDERSKLDAKTKACVFLGYGQDEFGYRLYDPTKKKLIRSRDVIEDEVQNEQFSDTYESFEHVGTEDSVQEQLAETVVPTDVSLRRSIRDRRPSTRYSPNE
SLLLTDG