CuGenDBv2

Cla97C02G036263 (gene) of Watermelon (97103) v2.5 genome

Gene ID: Cla97C02G036263
Organism: Citrullus lanatus subsp. vulgaris cv. 97103 (Watermelon (97103) v2.5)
Description: Transposable element protein
Genome location: Cla97Chr02:16558609..16560322
RNA-Seq Expression: Cla97C02G036263
Synteny: Cla97C02G036263
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily
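
The genome location above follows a `sequence:start..end` pattern. A minimal sketch for splitting it into its parts, assuming the coordinates are 1-based and inclusive (the page does not state the convention); `parse_location` is a hypothetical helper name, not part of CuGenDB:

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)

seqid, start, end = parse_location("Cla97Chr02:16558609..16560322")
# Span length under the assumed 1-based, inclusive convention.
span = end - start + 1
print(seqid, start, end, span)
```

Under that assumption the gene spans 1714 bp on Cla97Chr02.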


Homology
GenBank top hits (e value, %identity, alignment)
WP_217833156.1 hypothetical protein, partial [Synechococcus sp. PCC 7002] (e value: 3.5e-251; %identity: 77.89)
Query:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFA+DKFRSYLLGSKIVVHTDHAALKYLFVKKD KPRLMRWILLLQ FDLEI
Subjt:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ
        KDRKGCENVVADHLSRIENE+AKSWP IV+MFPDEQ YQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLH VKSYHWEDP LYKVCADNMIRKCVPQ
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ

Query:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVAV
        EEVV+ILNSCHASPYGG FGPTRTAAKVLQS FYWPSLFKDCYTFVKSCDRCQR GNISRQHELPMKPILEVELFDVWGIDFMG FPIS           
Subjt:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVAV

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWAL
                                                                 YNV HKIATAYHPQTNGLA+LSNREIKQVLEK VKTNRKDWAL
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ
        KLDDALWAYRTAFKTPIGTS Y+LVFGKACHLPVELEHRAYWAIKKLNMDFEK GEKRLLELNEMEEFRA+AYEN KLYK+RTARWHDKKIT  T LP Q
Subjt:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ

Query:  R---------------------------VSPHGAVELQGNNGTTFKVNGQRLKHYIGDEERGLENLTFIA
        R                           VSPHGAVELQGNNGTTFKVNG RLKHYIGDEER LENL F A
Subjt:  R---------------------------VSPHGAVELQGNNGTTFKVNGQRLKHYIGDEERGLENLTFIA

XP_012833379.1 PREDICTED: uncharacterized protein LOC105954252 [Erythranthe guttata] (e value: 1.2e-217; %identity: 64.86)
Query:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF KKD KPRL+RW+LLLQ FDLEI
Subjt:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ
        +D+KG ENVVADHLSR+  EE  +  +I + FPDEQ   +    PW+AD+ N+LA G +P D++Y QKK+FLH  + Y W++P L++   D +IR+CVP+
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ

Query:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVAV
         EV  IL  CH+SP GG  G +RTAAKVLQS F+WP+LF+D Y FVK CDRCQR GN+S + ++P+  + EVELFDVWGIDFMG FP SSNG LYIL+AV
Subjt:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVAV

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWAL
        DYVSKWVEAIAT  NDARTVLKF HKNIF+RFGTPRAIISDEGSHFCNKL  ++  K  ++HKIA AYHPQTNGLA+LSNREIKQ+LEKTV TNRKDWAL
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ
        KLDDALWAYRTAFKTPIG SPYKLVFGKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  AYEN K+YKE+T +WHDK+IT +    G 
Subjt:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ

Query:  RV---------------------------SPHGAVELQGNNGTTFKVNGQRLKHY
        +V                           +P G +E++G +G +FKVNGQR+KHY
Subjt:  RV---------------------------SPHGAVELQGNNGTTFKVNGQRLKHY

XP_012833687.1 PREDICTED: uncharacterized protein LOC105954563 [Erythranthe guttata] (e value: 6.7e-218; %identity: 64.86)
Query:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF KKD KPRL+RW+LLLQ FDLEI
Subjt:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ
        +D+KG ENVVADHLSR+  EE  +  +I + FPDEQ   +    PW+AD+ N+LA G +P D++Y QKK+FLH  + Y W++P L++   D +IR+CVP+
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ

Query:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVAV
         EV  IL  CH+SP GG  G +RTAAKVLQS F+WP+LF+D Y FVK CDRCQR GN+S + ++P+  + EVELFDVWGIDFMG FP SSNG LYIL+AV
Subjt:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVAV

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWAL
        DYVSKWVEAIAT TNDARTVLKF HKNIF+RFGTPRAIISDEGSHFCNKL  ++  K  ++HKIA AYHPQTNGLA+LSNREIKQ+LEKTV TNRKDWAL
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ
        KLDDALWAYRTAFKTPIG SPYKLV+GKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  AYEN K+YKE+T +WHDK+IT +    G 
Subjt:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ

Query:  RV---------------------------SPHGAVELQGNNGTTFKVNGQRLKHY
        +V                           +P G +E++G +G +FKVNGQR+KHY
Subjt:  RV---------------------------SPHGAVELQGNNGTTFKVNGQRLKHY

XP_012847037.1 PREDICTED: uncharacterized protein LOC105967019 [Erythranthe guttata] (e value: 6.7e-218; %identity: 64.86)
Query:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYA+GAVLGQRRD +F+AIYY+SRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF KKD KPRL+RW+LLLQ FDLEI
Subjt:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ
        +D+KG ENVVADHLSR+  EE  +  +I + FPDEQ   +    PW+AD+ N+LA G +P D++Y QKK+FLH  + Y W++P L++   D +IR+CVP+
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ

Query:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVAV
         EV  IL  CH+SP GG  G +RTAAKVLQS F+WP+LF+D Y FVK CDRCQR GN+S + ++P+  + EVELFDVWGIDFMG FP SSNG LYIL+AV
Subjt:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVAV

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWAL
        DYVSKWVEAIAT TNDARTVLKF HKNIF+RFGTPRAIISDEGSHFCNKL  ++  K  ++HKIA AYHPQTNGLA+LSNREIKQ+LEKTV TNRKDWAL
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ
        KLDDALWAYRTAFKTPIG SPYKLV+GKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  AYEN K+YKE+T +WHDK+IT +    G 
Subjt:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ

Query:  RV---------------------------SPHGAVELQGNNGTTFKVNGQRLKHY
        +V                           +P G +E++G +G +FKVNGQR+KHY
Subjt:  RV---------------------------SPHGAVELQGNNGTTFKVNGQRLKHY

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber] (e value: 5.7e-225; %identity: 67.8)
Query:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASD+ALGAVLGQRRD +FRAIYYASRTL+  Q  YTTTEKE+LAVVFA DKFRSYL+ +K++V TDHAAL+YLF KKD KPRL+RWILLLQ FDLE+
Subjt:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ
        +D+KG EN VADHLSR+E EE +    I + FPDEQ +  +  LPW+ADIVN+LA   LPPD+ Y Q+K+FLH VK Y W++P L+K C D +IR+CVP+
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ

Query:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVAV
        EE+  IL+ CH+S YGG FG TRTAAKVLQS F+WPS+F+D YT VK+CDRCQR+GNISR+ ELP+K ILEVELFDVWGIDFMG FP  S G +YIL+AV
Subjt:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVAV

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWAL
        DYVSKWVEAIAT TNDA+ VLKFLHKNIFTRFGTPRAIISDEG+HFCNKLF++++ KY VKHKIA AYHPQTNG A++SNREIK +LEKTV TNRKDWA 
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ
        KLDDALWAYRTAFKTPIG SPY+LVFGKACHLPVELEH+AYWA+KK N+D +  GEKRLL+LNEM+EFR  AYEN K+YKERT +WHDK+I  +   PGQ
Subjt:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ

Query:  ---------------------------RVSPHGAVELQGNNGTTFKVNGQRLKHYIGDE
                                   +VS  GA++L+   G  F+VNGQRLKHY G++
Subjt:  ---------------------------RVSPHGAVELQGNNGTTFKVNGQRLKHYIGDE

TrEMBL top hits (e value, %identity, alignment)
A0A2G9FWY3 Reverse transcriptase (e value: 1.5e-207; %identity: 62.39)
Query:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASD+A+GAVLGQR+D +FR+IYYAS+TL++ Q  YTTTEKELLAVVFA DKFRSYL+G+K++V+TDHAA++YL  KKD KPRL+RW+LLLQ FDLEI
Subjt:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVK-MFPDEQRYQ-VKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCV
        +DRKG EN +ADHLSR+E+      P+++   FPDEQ    V   +PW+ADIVNYL  G +P D++ QQKK+FL   + Y W+DPFL+K   DN++R+CV
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVK-MFPDEQRYQ-VKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCV

Query:  PQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILV
        P+ E+ +IL  CHASPYGG F   RTAAK+LQS F+WP+LFKD ++FV +CDRCQR GNISR+HE+P+  ILEVELFDVWGIDFMG F I S G++YILV
Subjt:  PQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILV

Query:  AVDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDW
        AVDYVSKWVEA A   ND++ V+ F+ KNIFTRFGTPRAIISD G+HFCN+ FE+++ KY VKHKI+T YHPQT+G  ++SNREIK++LEKTV + RKDW
Subjt:  AVDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDW

Query:  ALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLP
        + +LD+ALWAYRTA+KTPIG SPY+LVFGKACHLPVELEH AYWAI+KLN D +  GEKRLL+LNE++EFR  AYEN K+YKE+  RWH+KKI  +   P
Subjt:  ALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLP

Query:  GQ---------------------------RVSPHGAVELQGNNG-TTFKVNGQRLKHYIGD
        GQ                            V PHGAVEL+  N    FKVN QR+KHY G+
Subjt:  GQ---------------------------RVSPHGAVELQGNNG-TTFKVNGQRLKHYIGD

A0A4Y1RZC3 Transposable element protein (Fragment) (e value: 3.3e-202; %identity: 62.43)
Query:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASD+A+GAVLGQ+++ +   I+YASRTL++ Q  Y+TTEKELLAVVFA++KFR YL+GSK++V++DHAAL+YL  KKD KPRL+RWILLLQ FDLEI
Subjt:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRI-ENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVP
        +D+KGCENVVADHLSRI   E+ ++   + + FPDEQ Y  +   PW+AD VNYLA G L  D+ YQ KK+F   VK Y W++PFL+K C D +IR+CVP
Subjt:  KDRKGCENVVADHLSRI-ENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVP

Query:  QEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVA
        +EE  +IL   H    GG FG  +TA K+LQS F+WP+LFKD + F   CDRCQR+GNISR++ELP+K IL VELFDVWGIDFMG FP SS G+ YILVA
Subjt:  QEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVA

Query:  VDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWA
        VDYVSKWVEAIAT+TND + VLKFL  NIFTRFGTPRA+ISD GSHFCNKLFE++M+KYN+ H+++T YHPQT+G  ++SNREIKQ+LEK V + RKDWA
Subjt:  VDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWA

Query:  LKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPG
         KL+DALWAYRTA+KTPIG SPY+LVFGKACHLP+ELEH A+WAIKKLN D +K G  R  +LNE+EE R  +YEN KLYKERT  +HD+ I  +    G
Subjt:  LKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPG

Query:  Q------------RVSPHGAVELQG-NNGTTFKVNGQRLKHYIGDEERGLE
        +             VSP+GAVE+Q   +G+TFKVNGQRLK +      G++
Subjt:  Q------------RVSPHGAVELQG-NNGTTFKVNGQRLKHYIGDEERGLE

A0A540MQU7 Integrase catalytic domain-containing protein (e value: 6.6e-203; %identity: 60.71)
Query:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYA+GAVLGQR++ +   IYYASRTL++ Q  YTTTEKE+LAV+FA++KFRSYL+GSK++V+TDH ALKYL  KKD KPRL+RW+LLLQ FDL+I
Subjt:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSR---IENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKC
        +D+KG ENVVADHLSR   +++EE    P + + FPDEQ + + D +PW+ADI NYL  G LPPD++ Q +K+FL  VK Y W+DP+LYK C+D +IR+C
Subjt:  KDRKGCENVVADHLSR---IENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKC

Query:  VPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYIL
        VP  E  +IL  CH+   GG FGP++TAAKVLQS F+WP+LFKD Y F  +CDRCQR+GNIS+++E+P + +L VELFDVWGIDFMG FP S N   YIL
Subjt:  VPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYIL

Query:  VAVDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKD
        VAVDYVSKWVEAIAT TND + VL+FL   IF RFGTPR IISD G HF NK F ++M KYN+ H++AT YHPQT+G  ++SNREIK++LE TV  +RKD
Subjt:  VAVDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKD

Query:  WALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLL
        W+LKL DALWAYRTA+KTPIG SP++LV+GKACH PVELEHRAYWAIK+LN +++  GEKR L+LNE+EE+R  AYEN K+YKERT ++HDK I  +  +
Subjt:  WALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLL

Query:  PGQR---------------------------VSPHGAVELQG-NNGTTFKVNGQRLKHYI
        PGQ+                           V PHGA+E++    G  FKVNGQRLKHY+
Subjt:  PGQR---------------------------VSPHGAVELQG-NNGTTFKVNGQRLKHYI

A0A540NGH5 Integrase catalytic domain-containing protein (e value: 1.1e-202; %identity: 60.71)
Query:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYA+GAVLGQR++ +   IYYASRTL++ Q  YTTT+KE+LAV+FA++KFRSYL+GSK++V+TDH ALKYL  KKD KPRL+RW+LLLQ FDL+I
Subjt:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSR---IENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKC
        +D+KG ENVVADHLSR   +++EE    P + + FPDEQ + + D +PW+ADI NYL  G LPPD++ Q +K+FL  VK Y W+DP+LYK C+D +IR+C
Subjt:  KDRKGCENVVADHLSR---IENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKC

Query:  VPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYIL
        VP  E  +IL  CH+   GG FGP++TAAKVLQS F+WP+LFKD Y F  +CDRCQR+GNIS+++E+P + +L VELFDVWGIDFMG FP S N   YIL
Subjt:  VPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYIL

Query:  VAVDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKD
        VAVDYVSKWVEAIAT TND + VL+FL   IF RFGTPR IISD G HF NK F ++M KYN+ H++AT YHPQT+G  ++SNREIK++LE TV  +RKD
Subjt:  VAVDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKD

Query:  WALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLL
        W+LKL DALWAYRTA+KTPIG SP++LV+GKACHLPVELEHRAYWAIK+LN +++  GEKR L+LNE+EE+R  AYEN K+YKERT ++HDK I  +  +
Subjt:  WALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLL

Query:  PGQR---------------------------VSPHGAVELQG-NNGTTFKVNGQRLKHYI
        PGQ+                           V PHGA+E++    G  FKVNGQRLKHY+
Subjt:  PGQR---------------------------VSPHGAVELQG-NNGTTFKVNGQRLKHYI

A0A6P8CBX2 Reverse transcriptase (e value: 1.5e-202; %identity: 61.79)
Query:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYA+GAVLGQRR  +F AIYYASRTL+  Q+ Y TTEKELLAV+FA DKFR YL+GSKI+V+TDHAALKYLF K D KPRL+RWILLLQ FDLEI
Subjt:  MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQRYQVK-DSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVP
        +D KG ENVVADHLSR+E++   S   I + FPDEQ +  +   LPW+ADIVNY+     P  ++ QQKK+FLH VK Y W++P+L+K CAD +IR+CVP
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQRYQVK-DSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVP

Query:  QEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVA
        + E ++I+  CH+   GG FG  RTA K+L   FYWP +F DC  ++ SC  CQR GNISR+HE+P   IL +ELFDVWGIDFMG FP SS  + YILVA
Subjt:  QEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVA

Query:  VDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWA
        VDYVSKWVEA+A ++NDAR V++FL KNIF+RFG PRAIISD GSHFCN+ FE ++ KY V HKIAT YHPQT G  ++SNREIK++LEKTV  +RKDW+
Subjt:  VDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWA

Query:  LKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPG
        LKLDDALWAYRTAFKTPIG SPYK+V+GK+CHLPVELEH+AYWAIK LN D +  GEKRLL+LN+M E R  AYEN ++YKER  RWHD+ I  +  LPG
Subjt:  LKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPG

Query:  QR---------------------------VSPHGAVELQGNNGTTFKVNGQRLKHYIGDE
        Q+                           V P+GAVEL+  +  TFKVNG  LKHY   E
Subjt:  QR---------------------------VSPHGAVELQGNNGTTFKVNGQRLKHYIGDE

SwissProt top hits (e value, %identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein (e value: 1.9e-29; %identity: 25.75)
Query:  DASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAAL--KYLFVKKDYKPRLMRWILLLQLFD
        DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ ++  +R YL  +     + TDH  L  +     +    RL RW L LQ F+
Subjt:  DASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAAL--KYLFVKKDYKPRLMRWILLLQLFD

Query:  LEIKDRKGCENVVADHLSRIENE--------EAKSWPSIVKM-FPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKV
         EI  R G  N +AD LSRI +E        E  S   + ++   D+ + QV         ++N L       + N Q K   L   K     D  L  +
Subjt:  LEIKDRKGCENVVADHLSRIENE--------EAKSWPSIVKM-FPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKV

Query:  CADNMIRKCVPQEEVVNILNSCHASPYGGQFGP-TRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHEL--PMKPILEVEL-FDVWGIDFMG
          D  + +         I+   H    G    P       ++  RF W  + K    +V++C  CQ   N SR H+   P++PI   E  ++   +DF+ 
Subjt:  CADNMIRKCVPQEEVVNILNSCHASPYGGQFGP-TRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHEL--PMKPILEVEL-FDVWGIDFMG

Query:  LFPISSNGHLYILVAVDYVSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREI
          P  S+G+  + V VD  SK    +  T++  A    +   + +   FG P+ II+D    F ++ ++    KYN   K +  Y PQT+G  + +N+ +
Subjt:  LFPISSNGHLYILVAVDYVSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREI

Query:  KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHL-PVELEH-------------RAYWAIKK-LNMDFEKVGEKRLLELNEMEEF
        +++L     T+   W   +     +Y  A  +    +P+++V   +  L P+EL               + +  +K+ LN +  K+ +   +++ E+EEF
Subjt:  KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHL-PVELEH-------------RAYWAIKK-LNMDFEKVGEKRLLELNEMEEF

Query:  R
        +
Subjt:  R

P0CT35 Transposon Tf2-2 polyprotein (e value: 1.9e-29; %identity: 25.75)
Query:  DASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAAL--KYLFVKKDYKPRLMRWILLLQLFD
        DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ ++  +R YL  +     + TDH  L  +     +    RL RW L LQ F+
Subjt:  DASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAAL--KYLFVKKDYKPRLMRWILLLQLFD

Query:  LEIKDRKGCENVVADHLSRIENE--------EAKSWPSIVKM-FPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKV
         EI  R G  N +AD LSRI +E        E  S   + ++   D+ + QV         ++N L       + N Q K   L   K     D  L  +
Subjt:  LEIKDRKGCENVVADHLSRIENE--------EAKSWPSIVKM-FPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKV

Query:  CADNMIRKCVPQEEVVNILNSCHASPYGGQFGP-TRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHEL--PMKPILEVEL-FDVWGIDFMG
          D  + +         I+   H    G    P       ++  RF W  + K    +V++C  CQ   N SR H+   P++PI   E  ++   +DF+ 
Subjt:  CADNMIRKCVPQEEVVNILNSCHASPYGGQFGP-TRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHEL--PMKPILEVEL-FDVWGIDFMG

Query:  LFPISSNGHLYILVAVDYVSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREI
          P  S+G+  + V VD  SK    +  T++  A    +   + +   FG P+ II+D    F ++ ++    KYN   K +  Y PQT+G  + +N+ +
Subjt:  LFPISSNGHLYILVAVDYVSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREI

Query:  KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHL-PVELEH-------------RAYWAIKK-LNMDFEKVGEKRLLELNEMEEF
        +++L     T+   W   +     +Y  A  +    +P+++V   +  L P+EL               + +  +K+ LN +  K+ +   +++ E+EEF
Subjt:  KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHL-PVELEH-------------RAYWAIKK-LNMDFEKVGEKRLLELNEMEEF

Query:  R
        +
Subjt:  R

P0CT41 Transposon Tf2-12 polyprotein (e value: 1.9e-29; %identity: 25.75)
Query:  DASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAAL--KYLFVKKDYKPRLMRWILLLQLFD
        DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ ++  +R YL  +     + TDH  L  +     +    RL RW L LQ F+
Subjt:  DASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAAL--KYLFVKKDYKPRLMRWILLLQLFD

Query:  LEIKDRKGCENVVADHLSRIENE--------EAKSWPSIVKM-FPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKV
         EI  R G  N +AD LSRI +E        E  S   + ++   D+ + QV         ++N L       + N Q K   L   K     D  L  +
Subjt:  LEIKDRKGCENVVADHLSRIENE--------EAKSWPSIVKM-FPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKV

Query:  CADNMIRKCVPQEEVVNILNSCHASPYGGQFGP-TRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHEL--PMKPILEVEL-FDVWGIDFMG
          D  + +         I+   H    G    P       ++  RF W  + K    +V++C  CQ   N SR H+   P++PI   E  ++   +DF+ 
Subjt:  CADNMIRKCVPQEEVVNILNSCHASPYGGQFGP-TRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHEL--PMKPILEVEL-FDVWGIDFMG

Query:  LFPISSNGHLYILVAVDYVSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREI
          P  S+G+  + V VD  SK    +  T++  A    +   + +   FG P+ II+D    F ++ ++    KYN   K +  Y PQT+G  + +N+ +
Subjt:  LFPISSNGHLYILVAVDYVSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREI

Query:  KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHL-PVELEH-------------RAYWAIKK-LNMDFEKVGEKRLLELNEMEEF
        +++L     T+   W   +     +Y  A  +    +P+++V   +  L P+EL               + +  +K+ LN +  K+ +   +++ E+EEF
Subjt:  KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHL-PVELEH-------------RAYWAIKK-LNMDFEKVGEKRLLELNEMEEF

Query:  R
        +
Subjt:  R

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein (e value: 3.1e-40; %identity: 27.82)
Query:  DASDYALGAVLGQ--RRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        DAS   +GAVL +   ++ +   + Y S++L++ Q+ Y   E ELL ++ A+  FR  L G    + TDH +L  L  K +   R+ RW+  L  +D  +
Subjt:  DASDYALGAVLGQ--RRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSR----IENE-----EAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHKV-KSYHWEDPFLY
        +   G +NVVAD +SR    I  E     + +SW S  K  P          L    ++  +      P DM    +YQ+K        K+Y  ED  +Y
Subjt:  KDRKGCENVVADHLSR----IENE-----EAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHKV-KSYHWEDPFLY

Query:  KVCADNMIRKCVPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRI-GNISRQHEL--PMKPILEVELFDVWGIDFM
            D ++     Q  V+ + +    + +GG FG T T AK+    +YWP L      ++++C +CQ I  +  R H L  P+ PI E    D+  +DF+
Subjt:  KVCADNMIRKCVPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRI-GNISRQHEL--PMKPILEVELFDVWGIDFM

Query:  GLFPISSNGHLYILVAVDYVSKWVEAIATR-TNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNRE
           P +SN    ILV VD  SK    IATR T DA  ++  L + IF+  G PR I SD         ++ + ++  +K  +++A HPQT+G ++ + + 
Subjt:  GLFPISSNGHLYILVAVDYVSKWVEAIATR-TNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNRE

Query:  IKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKER
        + ++L   V TN ++W + L    + Y +     +G SP+++  G   + P                            +   +E  AR++  V+L K  
Subjt:  IKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKER

Query:  TARWHDKKITSQTLLPGQRVSPHGAVELQGNN
            H K +T QT    +    H  +E++ NN
Subjt:  TARWHDKKITSQTLLPGQRVSPHGAVELQGNN

Q9UR07 Transposon Tf2-11 polyprotein (e value: 1.9e-29; %identity: 25.75)
Query:  DASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAAL--KYLFVKKDYKPRLMRWILLLQLFD
        DASD A+GAVL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ ++  +R YL  +     + TDH  L  +     +    RL RW L LQ F+
Subjt:  DASDYALGAVLGQRR-DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAAL--KYLFVKKDYKPRLMRWILLLQLFD

Query:  LEIKDRKGCENVVADHLSRIENE--------EAKSWPSIVKM-FPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKV
         EI  R G  N +AD LSRI +E        E  S   + ++   D+ + QV         ++N L       + N Q K   L   K     D  L  +
Subjt:  LEIKDRKGCENVVADHLSRIENE--------EAKSWPSIVKM-FPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKV

Query:  CADNMIRKCVPQEEVVNILNSCHASPYGGQFGP-TRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHEL--PMKPILEVEL-FDVWGIDFMG
          D  + +         I+   H    G    P       ++  RF W  + K    +V++C  CQ   N SR H+   P++PI   E  ++   +DF+ 
Subjt:  CADNMIRKCVPQEEVVNILNSCHASPYGGQFGP-TRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHEL--PMKPILEVEL-FDVWGIDFMG

Query:  LFPISSNGHLYILVAVDYVSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREI
          P  S+G+  + V VD  SK    +  T++  A    +   + +   FG P+ II+D    F ++ ++    KYN   K +  Y PQT+G  + +N+ +
Subjt:  LFPISSNGHLYILVAVDYVSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREI

Query:  KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHL-PVELEH-------------RAYWAIKK-LNMDFEKVGEKRLLELNEMEEF
        +++L     T+   W   +     +Y  A  +    +P+++V   +  L P+EL               + +  +K+ LN +  K+ +   +++ E+EEF
Subjt:  KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHL-PVELEH-------------RAYWAIKK-LNMDFEKVGEKRLLELNEMEEF

Query:  R
        +
Subjt:  R

Arabidopsis top hits (e value, %identity, alignment)
ATMG00750.1 GAG/POL/ENV polyprotein (e value: 7.1e-16; %identity: 62.5)
Query:  VLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFM
        VLQ+ FYWP+ FKD + FV SCD CQR GN ++++E+P   ILEVE+FDVWGI FM
Subjt:  VLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFM
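
The %identity figure for a hit can be checked against the middle line of its alignment. The sketch below assumes the usual BLAST-style convention, which this page does not spell out: a letter in the midline marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap, with identity taken as identical columns divided by alignment length. `percent_identity` is a hypothetical helper name.

```python
def percent_identity(midline: str) -> float:
    """Percentage of alignment columns whose midline shows an identical residue."""
    # Letters mark identities; '+' and spaces are not counted.
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100 * identical / len(midline), 2)

# Midline of the ATMG00750.1 alignment above, without its leading indentation.
midline = "VLQ+ FYWP+ FKD + FV SCD CQR GN ++++E+P   ILEVE+FDVWGI FM"
print(percent_identity(midline))
```

For this 56-column alignment, 35 columns are identical, which agrees with the 62.5 listed for the hit.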


Sequences
CDS sequence
ATGTGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTAGAGATAACATGTTTAGGGCCATTTACTATGCTAGTAGGACTCTTGATAATACTCAACAGAA
ATACACTACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAATTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGACCATGCTGCTTTAA
AGTATTTGTTTGTTAAGAAAGATTATAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGTTATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTC
GCAGACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCTCAATTGTTAAGATGTTCCCTGATGAACAACGGTATCAGGTAAAAGATAGTTTGCCCTGGTT
TGCTGACATAGTTAATTATCTTGCAGGAGGACATTTGCCACCTGACATGAACTATCAACAAAAGAAAAGATTCCTGCACAAAGTAAAGTCTTACCATTGGGAGGACCCAT
TTCTCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTGCCTCAAGAGGAAGTGGTAAATATTCTAAATTCATGTCATGCCTCACCCTATGGAGGTCAATTTGGA
CCCACTAGAACTGCAGCTAAGGTACTTCAGTCAAGATTTTATTGGCCATCCCTTTTTAAAGACTGTTATACCTTTGTTAAGTCATGTGATAGGTGCCAACGTATTGGCAA
TATTTCTAGACAACATGAGCTTCCAATGAAACCTATCTTGGAAGTGGAGTTATTTGATGTTTGGGGTATTGACTTTATGGGGCTCTTTCCTATTTCTTCTAATGGTCACC
TATACATTCTAGTTGCAGTAGATTATGTATCTAAATGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTTTAAAATTCTTGCATAAAAACATTTTCACA
CGTTTTGGTACACCTAGAGCTATTATTAGTGATGAAGGCTCTCACTTTTGCAATAAACTGTTTGAATCCATGATGCAAAAATATAATGTAAAACATAAAATTGCTACAGC
TTATCATCCTCAAACTAATGGCCTTGCTAAGTTATCTAACAGGGAAATCAAACAAGTTTTGGAAAAGACAGTCAAGACCAATAGGAAGGATTGGGCCCTTAAGCTCGATG
ATGCACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTATAAGTTGGTGTTTGGAAAGGCTTGTCACTTACCGGTAGAGCTCGAGCATAGAGCT
TATTGGGCCATCAAGAAGCTGAACATGGATTTCGAGAAAGTTGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCGAGCTTATGAGAATGTCAA
ACTTTACAAAGAGCGCACTGCGAGATGGCATGACAAGAAGATAACATCACAAACCTTGCTTCCAGGACAAAGAGTATCCCCCCACGGAGCCGTGGAACTACAAGGTAACA
ATGGAACAACCTTCAAAGTGAATGGTCAACGATTAAAGCACTACATCGGTGATGAAGAACGCGGACTTGAGAACCTGACTTTCATTGCATAA
mRNA sequence
ATGTGTGATGCTAGTGATTATGCTTTAGGAGCTGTTTTAGGCCAACGTAGAGATAACATGTTTAGGGCCATTTACTATGCTAGTAGGACTCTTGATAATACTCAACAGAA
ATACACTACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAATTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGACCATGCTGCTTTAA
AGTATTTGTTTGTTAAGAAAGATTATAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGTTATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTC
GCAGACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCTCAATTGTTAAGATGTTCCCTGATGAACAACGGTATCAGGTAAAAGATAGTTTGCCCTGGTT
TGCTGACATAGTTAATTATCTTGCAGGAGGACATTTGCCACCTGACATGAACTATCAACAAAAGAAAAGATTCCTGCACAAAGTAAAGTCTTACCATTGGGAGGACCCAT
TTCTCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTGCCTCAAGAGGAAGTGGTAAATATTCTAAATTCATGTCATGCCTCACCCTATGGAGGTCAATTTGGA
CCCACTAGAACTGCAGCTAAGGTACTTCAGTCAAGATTTTATTGGCCATCCCTTTTTAAAGACTGTTATACCTTTGTTAAGTCATGTGATAGGTGCCAACGTATTGGCAA
TATTTCTAGACAACATGAGCTTCCAATGAAACCTATCTTGGAAGTGGAGTTATTTGATGTTTGGGGTATTGACTTTATGGGGCTCTTTCCTATTTCTTCTAATGGTCACC
TATACATTCTAGTTGCAGTAGATTATGTATCTAAATGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTTTAAAATTCTTGCATAAAAACATTTTCACA
CGTTTTGGTACACCTAGAGCTATTATTAGTGATGAAGGCTCTCACTTTTGCAATAAACTGTTTGAATCCATGATGCAAAAATATAATGTAAAACATAAAATTGCTACAGC
TTATCATCCTCAAACTAATGGCCTTGCTAAGTTATCTAACAGGGAAATCAAACAAGTTTTGGAAAAGACAGTCAAGACCAATAGGAAGGATTGGGCCCTTAAGCTCGATG
ATGCACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTATAAGTTGGTGTTTGGAAAGGCTTGTCACTTACCGGTAGAGCTCGAGCATAGAGCT
TATTGGGCCATCAAGAAGCTGAACATGGATTTCGAGAAAGTTGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCGAGCTTATGAGAATGTCAA
ACTTTACAAAGAGCGCACTGCGAGATGGCATGACAAGAAGATAACATCACAAACCTTGCTTCCAGGACAAAGAGTATCCCCCCACGGAGCCGTGGAACTACAAGGTAACA
ATGGAACAACCTTCAAAGTGAATGGTCAACGATTAAAGCACTACATCGGTGATGAAGAACGCGGACTTGAGAACCTGACTTTCATTGCATAA
Protein sequence
MCDASDYALGAVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEIKDRKGCENVV
ADHLSRIENEEAKSWPSIVKMFPDEQRYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQEEVVNILNSCHASPYGGQFG
PTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDVWGIDFMGLFPISSNGHLYILVAVDYVSKWVEAIATRTNDARTVLKFLHKNIFT
RFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAKLSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRA
YWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQRVSPHGAVELQGNNGTTFKVNGQRLKHYIGDEERGLENLTFIA
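
The protein sequence can be cross-checked against the CDS by translating codons. This is a minimal sketch that assumes the standard genetic code applies to this nuclear gene; `translate` is a hypothetical helper, and only the first 24 and final 12 bases of the CDS are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    for i in range(0, usable, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTGTGATGCTAGTGATTATGCT"))  # start of the CDS
print(translate("TTCATTGCATAA"))              # end of the CDS, including the stop codon
```

The two fragments translate to MCDASDYA and FIA, matching the first and last residues of the protein sequence listed above.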