; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G10490 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G10490
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTransposable element protein
Genome locationClcChr02:16488583..16490296
RNA-Seq ExpressionClc02G10490
SyntenyClc02G10490
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
WP_217833156.1 hypothetical protein, partial [Synechococcus sp. PCC 7002]2.7e-25177.72Show/hide
Query:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYALG VLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFA+DKFRSYLLGSKIVVHTDHAALKYLFVKKD KPRLMRWILLLQ FDLEI
Subjt:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ
        KDRKGCENVVADHLSRIENE+AKSWP IV+MFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLH VKSYHWEDP LYKVCADNMIRKCVPQ
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ

Query:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVAV
        EEVV+ILNSCHASPYGG FGPTRTAAKVLQS FYWPSLFKDCYTFVKSCDRCQR GNISRQHELPMKPILEVELFD+WGIDFMG  PIS           
Subjt:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVAV

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWAL
                                                                 YNV HKIATAYHPQTNGLAELSNREIKQVLEK VKTNRKDWAL
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ
        KLDDALWAYRTAFKTPIGTS Y+LVFGKACHLPVELEHRAYWAIKKLNMDFEK GEKRLLELNEMEEFRA+AYEN KLYK+RTARWHDKKIT  T LP Q
Subjt:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ

Query:  R---------------------------VSPHGAVELQGNNGTTFKVNGQRLKHYIGDEERGLENLTFIA
        R                           VSPHGAVELQGNNGTTFKVNG RLKHYIGDEER LENL F A
Subjt:  R---------------------------VSPHGAVELQGNNGTTFKVNGQRLKHYIGDEERGLENLTFIA

XP_012833379.1 PREDICTED: uncharacterized protein LOC105954252 [Erythranthe guttata]8.8e-21864.68Show/hide
Query:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYA+G VLGQRRD +F+AIYY+SRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF KKD KPRL+RW+LLLQ FDLEI
Subjt:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ
        +D+KG ENVVADHLSR+  EE  +  +I + FPDEQL  +    PW+AD+ N+LA G +P D++Y QKK+FLH  + Y W++P L++   D +IR+CVP+
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ

Query:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVAV
         EV  IL  CH+SP GG  G +RTAAKVLQS F+WP+LF+D Y FVK CDRCQR GN+S + ++P+  + EVELFD+WGIDFMG  P SSNG LYIL+AV
Subjt:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVAV

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWAL
        DYVSKWVEAIAT  NDARTVLKF HKNIF+RFGTPRAIISDEGSHFCNKL  ++  K  ++HKIA AYHPQTNGLAELSNREIKQ+LEKTV TNRKDWAL
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ
        KLDDALWAYRTAFKTPIG SPYKLVFGKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  AYEN K+YKE+T +WHDK+IT +    G 
Subjt:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ

Query:  RV---------------------------SPHGAVELQGNNGTTFKVNGQRLKHY
        +V                           +P G +E++G +G +FKVNGQR+KHY
Subjt:  RV---------------------------SPHGAVELQGNNGTTFKVNGQRLKHY

XP_012833687.1 PREDICTED: uncharacterized protein LOC105954563 [Erythranthe guttata]5.2e-21864.68Show/hide
Query:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYA+G VLGQRRD +F+AIYY+SRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF KKD KPRL+RW+LLLQ FDLEI
Subjt:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ
        +D+KG ENVVADHLSR+  EE  +  +I + FPDEQL  +    PW+AD+ N+LA G +P D++Y QKK+FLH  + Y W++P L++   D +IR+CVP+
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ

Query:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVAV
         EV  IL  CH+SP GG  G +RTAAKVLQS F+WP+LF+D Y FVK CDRCQR GN+S + ++P+  + EVELFD+WGIDFMG  P SSNG LYIL+AV
Subjt:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVAV

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWAL
        DYVSKWVEAIAT TNDARTVLKF HKNIF+RFGTPRAIISDEGSHFCNKL  ++  K  ++HKIA AYHPQTNGLAELSNREIKQ+LEKTV TNRKDWAL
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ
        KLDDALWAYRTAFKTPIG SPYKLV+GKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  AYEN K+YKE+T +WHDK+IT +    G 
Subjt:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ

Query:  RV---------------------------SPHGAVELQGNNGTTFKVNGQRLKHY
        +V                           +P G +E++G +G +FKVNGQR+KHY
Subjt:  RV---------------------------SPHGAVELQGNNGTTFKVNGQRLKHY

XP_012847037.1 PREDICTED: uncharacterized protein LOC105967019 [Erythranthe guttata]5.2e-21864.68Show/hide
Query:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYA+G VLGQRRD +F+AIYY+SRTLD  Q+ Y+TTEKE+LAVV+A+DKFR Y+LGS+++++TDHAA++YLF KKD KPRL+RW+LLLQ FDLEI
Subjt:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ
        +D+KG ENVVADHLSR+  EE  +  +I + FPDEQL  +    PW+AD+ N+LA G +P D++Y QKK+FLH  + Y W++P L++   D +IR+CVP+
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ

Query:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVAV
         EV  IL  CH+SP GG  G +RTAAKVLQS F+WP+LF+D Y FVK CDRCQR GN+S + ++P+  + EVELFD+WGIDFMG  P SSNG LYIL+AV
Subjt:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVAV

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWAL
        DYVSKWVEAIAT TNDARTVLKF HKNIF+RFGTPRAIISDEGSHFCNKL  ++  K  ++HKIA AYHPQTNGLAELSNREIKQ+LEKTV TNRKDWAL
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ
        KLDDALWAYRTAFKTPIG SPYKLV+GKACHLPVELEHRAYWA+KKLN D    G++RLL+LNEMEEFR  AYEN K+YKE+T +WHDK+IT +    G 
Subjt:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ

Query:  RV---------------------------SPHGAVELQGNNGTTFKVNGQRLKHY
        +V                           +P G +E++G +G +FKVNGQR+KHY
Subjt:  RV---------------------------SPHGAVELQGNNGTTFKVNGQRLKHY

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]5.7e-22567.62Show/hide
Query:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASD+ALG VLGQRRD +FRAIYYASRTL+  Q  YTTTEKE+LAVVFA DKFRSYL+ +K++V TDHAAL+YLF KKD KPRL+RWILLLQ FDLE+
Subjt:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ
        +D+KG EN VADHLSR+E EE +    I + FPDEQL+  +  LPW+ADIVN+LA   LPPD+ Y Q+K+FLH VK Y W++P L+K C D +IR+CVP+
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQ

Query:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVAV
        EE+  IL+ CH+S YGG FG TRTAAKVLQS F+WPS+F+D YT VK+CDRCQR+GNISR+ ELP+K ILEVELFD+WGIDFMG  P  S G +YIL+AV
Subjt:  EEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVAV

Query:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWAL
        DYVSKWVEAIAT TNDA+ VLKFLHKNIFTRFGTPRAIISDEG+HFCNKLF++++ KY VKHKIA AYHPQTNG AE+SNREIK +LEKTV TNRKDWA 
Subjt:  DYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWAL

Query:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ
        KLDDALWAYRTAFKTPIG SPY+LVFGKACHLPVELEH+AYWA+KK N+D +  GEKRLL+LNEM+EFR  AYEN K+YKERT +WHDK+I  +   PGQ
Subjt:  KLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQ

Query:  ---------------------------RVSPHGAVELQGNNGTTFKVNGQRLKHYIGDE
                                   +VS  GA++L+   G  F+VNGQRLKHY G++
Subjt:  ---------------------------RVSPHGAVELQGNNGTTFKVNGQRLKHYIGDE

TrEMBL top hitse value%identityAlignment
A0A2G9FWY3 Reverse transcriptase1.5e-20762.21Show/hide
Query:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASD+A+G VLGQR+D +FR+IYYAS+TL++ Q  YTTTEKELLAVVFA DKFRSYL+G+K++V+TDHAA++YL  KKD KPRL+RW+LLLQ FDLEI
Subjt:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVK-MFPDEQLYQ-VKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCV
        +DRKG EN +ADHLSR+E+      P+++   FPDEQL   V   +PW+ADIVNYL  G +P D++ QQKK+FL   + Y W+DPFL+K   DN++R+CV
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVK-MFPDEQLYQ-VKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCV

Query:  PQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILV
        P+ E+ +IL  CHASPYGG F   RTAAK+LQS F+WP+LFKD ++FV +CDRCQR GNISR+HE+P+  ILEVELFD+WGIDFMG   I S G++YILV
Subjt:  PQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILV

Query:  AVDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDW
        AVDYVSKWVEA A   ND++ V+ F+ KNIFTRFGTPRAIISD G+HFCN+ FE+++ KY VKHKI+T YHPQT+G  E+SNREIK++LEKTV + RKDW
Subjt:  AVDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDW

Query:  ALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLP
        + +LD+ALWAYRTA+KTPIG SPY+LVFGKACHLPVELEH AYWAI+KLN D +  GEKRLL+LNE++EFR  AYEN K+YKE+  RWH+KKI  +   P
Subjt:  ALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLP

Query:  GQ---------------------------RVSPHGAVELQGNNG-TTFKVNGQRLKHYIGD
        GQ                            V PHGAVEL+  N    FKVN QR+KHY G+
Subjt:  GQ---------------------------RVSPHGAVELQGNNG-TTFKVNGQRLKHYIGD

A0A4Y1RZC3 Transposable element protein (Fragment)3.3e-20262.25Show/hide
Query:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASD+A+G VLGQ+++ +   I+YASRTL++ Q  Y+TTEKELLAVVFA++KFR YL+GSK++V++DHAAL+YL  KKD KPRL+RWILLLQ FDLEI
Subjt:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRI-ENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVP
        +D+KGCENVVADHLSRI   E+ ++   + + FPDEQLY  +   PW+AD VNYLA G L  D+ YQ KK+F   VK Y W++PFL+K C D +IR+CVP
Subjt:  KDRKGCENVVADHLSRI-ENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVP

Query:  QEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVA
        +EE  +IL   H    GG FG  +TA K+LQS F+WP+LFKD + F   CDRCQR+GNISR++ELP+K IL VELFD+WGIDFMG  P SS G+ YILVA
Subjt:  QEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVA

Query:  VDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWA
        VDYVSKWVEAIAT+TND + VLKFL  NIFTRFGTPRA+ISD GSHFCNKLFE++M+KYN+ H+++T YHPQT+G  E+SNREIKQ+LEK V + RKDWA
Subjt:  VDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWA

Query:  LKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPG
         KL+DALWAYRTA+KTPIG SPY+LVFGKACHLP+ELEH A+WAIKKLN D +K G  R  +LNE+EE R  +YEN KLYKERT  +HD+ I  +    G
Subjt:  LKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPG

Query:  Q------------RVSPHGAVELQG-NNGTTFKVNGQRLKHYIGDEERGLE
        +             VSP+GAVE+Q   +G+TFKVNGQRLK +      G++
Subjt:  Q------------RVSPHGAVELQG-NNGTTFKVNGQRLKHYIGDEERGLE

A0A540MQU7 Integrase catalytic domain-containing protein5.1e-20360.54Show/hide
Query:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYA+G VLGQR++ +   IYYASRTL++ Q  YTTTEKE+LAV+FA++KFRSYL+GSK++V+TDH ALKYL  KKD KPRL+RW+LLLQ FDL+I
Subjt:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSR---IENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKC
        +D+KG ENVVADHLSR   +++EE    P + + FPDEQL+ + D +PW+ADI NYL  G LPPD++ Q +K+FL  VK Y W+DP+LYK C+D +IR+C
Subjt:  KDRKGCENVVADHLSR---IENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKC

Query:  VPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYIL
        VP  E  +IL  CH+   GG FGP++TAAKVLQS F+WP+LFKD Y F  +CDRCQR+GNIS+++E+P + +L VELFD+WGIDFMG  P S N   YIL
Subjt:  VPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYIL

Query:  VAVDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKD
        VAVDYVSKWVEAIAT TND + VL+FL   IF RFGTPR IISD G HF NK F ++M KYN+ H++AT YHPQT+G  E+SNREIK++LE TV  +RKD
Subjt:  VAVDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKD

Query:  WALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLL
        W+LKL DALWAYRTA+KTPIG SP++LV+GKACH PVELEHRAYWAIK+LN +++  GEKR L+LNE+EE+R  AYEN K+YKERT ++HDK I  +  +
Subjt:  WALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLL

Query:  PGQR---------------------------VSPHGAVELQG-NNGTTFKVNGQRLKHYI
        PGQ+                           V PHGA+E++    G  FKVNGQRLKHY+
Subjt:  PGQR---------------------------VSPHGAVELQG-NNGTTFKVNGQRLKHYI

A0A540NGH5 Integrase catalytic domain-containing protein8.6e-20360.54Show/hide
Query:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYA+G VLGQR++ +   IYYASRTL++ Q  YTTT+KE+LAV+FA++KFRSYL+GSK++V+TDH ALKYL  KKD KPRL+RW+LLLQ FDL+I
Subjt:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSR---IENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKC
        +D+KG ENVVADHLSR   +++EE    P + + FPDEQL+ + D +PW+ADI NYL  G LPPD++ Q +K+FL  VK Y W+DP+LYK C+D +IR+C
Subjt:  KDRKGCENVVADHLSR---IENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKC

Query:  VPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYIL
        VP  E  +IL  CH+   GG FGP++TAAKVLQS F+WP+LFKD Y F  +CDRCQR+GNIS+++E+P + +L VELFD+WGIDFMG  P S N   YIL
Subjt:  VPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYIL

Query:  VAVDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKD
        VAVDYVSKWVEAIAT TND + VL+FL   IF RFGTPR IISD G HF NK F ++M KYN+ H++AT YHPQT+G  E+SNREIK++LE TV  +RKD
Subjt:  VAVDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKD

Query:  WALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLL
        W+LKL DALWAYRTA+KTPIG SP++LV+GKACHLPVELEHRAYWAIK+LN +++  GEKR L+LNE+EE+R  AYEN K+YKERT ++HDK I  +  +
Subjt:  WALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLL

Query:  PGQR---------------------------VSPHGAVELQG-NNGTTFKVNGQRLKHYI
        PGQ+                           V PHGA+E++    G  FKVNGQRLKHY+
Subjt:  PGQR---------------------------VSPHGAVELQG-NNGTTFKVNGQRLKHYI

A0A6P8CBX2 Reverse transcriptase1.1e-20261.61Show/hide
Query:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        MCDASDYA+G VLGQRR  +F AIYYASRTL+  Q+ Y TTEKELLAV+FA DKFR YL+GSKI+V+TDHAALKYLF K D KPRL+RWILLLQ FDLEI
Subjt:  MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQLYQVK-DSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVP
        +D KG ENVVADHLSR+E++   S   I + FPDEQL+  +   LPW+ADIVNY+     P  ++ QQKK+FLH VK Y W++P+L+K CAD +IR+CVP
Subjt:  KDRKGCENVVADHLSRIENEEAKSWPSIVKMFPDEQLYQVK-DSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVP

Query:  QEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVA
        + E ++I+  CH+   GG FG  RTA K+L   FYWP +F DC  ++ SC  CQR GNISR+HE+P   IL +ELFD+WGIDFMG  P SS  + YILVA
Subjt:  QEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVA

Query:  VDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWA
        VDYVSKWVEA+A ++NDAR V++FL KNIF+RFG PRAIISD GSHFCN+ FE ++ KY V HKIAT YHPQT G  E+SNREIK++LEKTV  +RKDW+
Subjt:  VDYVSKWVEAIATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWA

Query:  LKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPG
        LKLDDALWAYRTAFKTPIG SPYK+V+GK+CHLPVELEH+AYWAIK LN D +  GEKRLL+LN+M E R  AYEN ++YKER  RWHD+ I  +  LPG
Subjt:  LKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPG

Query:  QR---------------------------VSPHGAVELQGNNGTTFKVNGQRLKHYIGDE
        Q+                           V P+GAVEL+  +  TFKVNG  LKHY   E
Subjt:  QR---------------------------VSPHGAVELQGNNGTTFKVNGQRLKHYIGDE

SwissProt top hitse value%identityAlignment
P0CT41 Transposon Tf2-12 polyprotein1.4e-2925.95Show/hide
Query:  DASDYALGTVLGQRR-DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAAL--KYLFVKKDYKPRLMRWILLLQLFD
        DASD A+G VL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ ++  +R YL  +     + TDH  L  +     +    RL RW L LQ F+
Subjt:  DASDYALGTVLGQRR-DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAAL--KYLFVKKDYKPRLMRWILLLQLFD

Query:  LEIKDRKGCENVVADHLSRIENE--------EAKSWPSIVKM-FPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKV
         EI  R G  N +AD LSRI +E        E  S   + ++   D+   QV         ++N L       + N Q K   L   K     D  L  +
Subjt:  LEIKDRKGCENVVADHLSRIENE--------EAKSWPSIVKM-FPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKV

Query:  CADNMIRKCVPQEEVVNILNSCHASPYGGQFGP-TRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHEL--PMKPILEVEL-FDIWGIDFMG
          D  + +         I+   H    G    P       ++  RF W  + K    +V++C  CQ   N SR H+   P++PI   E  ++   +DF+ 
Subjt:  CADNMIRKCVPQEEVVNILNSCHASPYGGQFGP-TRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHEL--PMKPILEVEL-FDIWGIDFMG

Query:  LLPISSNGHLYILVAVDYVSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREI
         LP  S+G+  + V VD  SK    +  T++  A    +   + +   FG P+ II+D    F ++ ++    KYN   K +  Y PQT+G  E +N+ +
Subjt:  LLPISSNGHLYILVAVDYVSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREI

Query:  KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHL-PVELEH-------------RAYWAIKK-LNMDFEKVGEKRLLELNEMEEF
        +++L     T+   W   +     +Y  A  +    +P+++V   +  L P+EL               + +  +K+ LN +  K+ +   +++ E+EEF
Subjt:  KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHL-PVELEH-------------RAYWAIKK-LNMDFEKVGEKRLLELNEMEEF

Query:  R
        +
Subjt:  R

P10394 Retrovirus-related Pol polyprotein from transposon 4122.7e-4425.79Show/hide
Query:  DASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEIKD
        DAS  A G VL Q  +     + YASR     +   +TTE+EL A+ +AI  FR Y+ G    V TDH  L YLF   +   +L R  L L+ ++  ++ 
Subjt:  DASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEIKD

Query:  RKGCENVVADHLSRI------------------------------------ENEEAKSWPSIVKMFPDEQL-----YQVKDSLPWFA---------DIVN
         KG +N VAD LSRI                                    + +E  S P++ ++  ++++      Q+ DS+  F          D+ +
Subjt:  RKGCENVVADHLSRI------------------------------------ENEEAKSWPSIVKMFPDEQL-----YQVKDSLPWFA---------DIVN

Query:  YLAGGHLPPDMNYQQKK-----RFLHKVKSYHWEDPFLY-----------------KVCADNMIRKCVPQEEVVNILNSCHASP-YGGQFGPTRTAAKVL
            G L  D   Q+ +       + ++K   W+  F +                 KV   N + +   ++E   IL++ H  P  GG  G T+T AKV 
Subjt:  YLAGGHLPPDMNYQQKK-----RFLHKVKSYHWEDPFLY-----------------KVCADNMIRKCVPQEEVVNILNSCHASP-YGGQFGPTRTAAKVL

Query:  QSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEV--ELFDIWGIDFMGLLPISSNGHLYILVAVDYVSKWVEAIATRTNDARTVLKFLHKN
        +  +YW ++ K    +V+ C +CQ+    ++  + PM  I E     FD   +D +G LP S NG+ Y +  +  ++K++ AI      A+TV K + ++
Subjt:  QSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEV--ELFDIWGIDFMGLLPISSNGHLYILVAVDYVSKWVEAIATRTNDARTVLKFLHKN

Query:  IFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFG
           ++G  +  I+D G+ + N +   + +   +K+  +TA+H QT G+ E S+R + + +   + T++ DW + L   ++ + T         PY+LVFG
Subjt:  IFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFG

Query:  KACHLPVELEH-RAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQRVSPHGAVELQGNNGTTFKVNGQRLK
        +  +LP       +   I  ++ D+ K  + RL      E   ARA + ++ +KE+    +D K+    L  G +V     V     +   FK  G    
Subjt:  KACHLPVELEH-RAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQRVSPHGAVELQGNNGTTFKVNGQRLK

Query:  HYIGD
          IGD
Subjt:  HYIGD

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein4.8e-4128.28Show/hide
Query:  DASDYALGTVLGQ--RRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        DAS   +G VL +   ++ +   + Y S++L++ Q+ Y   E ELL ++ A+  FR  L G    + TDH +L  L  K +   R+ RW+  L  +D  +
Subjt:  DASDYALGTVLGQ--RRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSR----IENE-----EAKSWPSIVKMFP--DEQLYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHKV-KSYHWEDPF
        +   G +NVVAD +SR    I  E     + +SW S  K  P     L  +K+                 P DM    +YQ+K        K+Y  ED  
Subjt:  KDRKGCENVVADHLSR----IENE-----EAKSWPSIVKMFP--DEQLYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHKV-KSYHWEDPF

Query:  LYKVCADNMIRKCVPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRI-GNISRQHEL--PMKPILEVELFDIWGID
        +Y    D ++     Q  V+ + +    + +GG FG T T AK+    +YWP L      ++++C +CQ I  +  R H L  P+ PI E    DI  +D
Subjt:  LYKVCADNMIRKCVPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRI-GNISRQHEL--PMKPILEVELFDIWGID

Query:  FMGLLPISSNGHLYILVAVDYVSKWVEAIATR-TNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSN
        F+  LP +SN    ILV VD  SK    IATR T DA  ++  L + IF+  G PR I SD         ++ + ++  +K  +++A HPQT+G +E + 
Subjt:  FMGLLPISSNGHLYILVAVDYVSKWVEAIATR-TNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSN

Query:  REIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYK
        + + ++L   V TN ++W + L    + Y +     +G SP+++  G   + P                            +   +E  AR++  V+L K
Subjt:  REIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYK

Query:  ERTARWHDKKITSQTLLPGQRVSPHGAVELQGNN
              H K +T QT    +    H  +E++ NN
Subjt:  ERTARWHDKKITSQTLLPGQRVSPHGAVELQGNN

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.4e-4028.09Show/hide
Query:  DASDYALGTVLGQ--RRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI
        DAS   +G VL +   ++ +   + Y S++L++ Q+ Y   E ELL ++ A+  FR  L G    + TDH +L  L  K +   R+ RW+  L  +D  +
Subjt:  DASDYALGTVLGQ--RRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEI

Query:  KDRKGCENVVADHLSR----IENE-----EAKSWPSIVKMFP--DEQLYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHKV-KSYHWEDPF
        +   G +NVVAD +SR    I  E     + +SW S  K  P     L  +K+                 P DM    +YQ+K        K+Y  ED  
Subjt:  KDRKGCENVVADHLSR----IENE-----EAKSWPSIVKMFP--DEQLYQVKDSLPWFADIVNYLAGGHLPPDM----NYQQKKRFLHKV-KSYHWEDPF

Query:  LYKVCADNMIRKCVPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRI-GNISRQHEL--PMKPILEVELFDIWGID
        +Y    D ++     Q  V+ + +    + +GG FG T T AK+    +YWP L      ++++C +CQ I  +  R H L  P+ PI E    DI  +D
Subjt:  LYKVCADNMIRKCVPQEEVVNILNSCHASPYGGQFGPTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRI-GNISRQHEL--PMKPILEVELFDIWGID

Query:  FMGLLPISSNGHLYILVAVDYVSKWVEAIATR-TNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSN
        F+  LP +SN    ILV VD  SK    IATR T DA  ++  L + IF+  G PR I SD         ++ + ++  +K  +++A HPQT+G +E + 
Subjt:  FMGLLPISSNGHLYILVAVDYVSKWVEAIATR-TNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSN

Query:  REIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYK
        + + ++L     TN ++W + L    + Y +     +G SP+++  G   + P                            +   +E  AR++  V+L K
Subjt:  REIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRAYWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYK

Query:  ERTARWHDKKITSQTLLPGQRVSPHGAVELQGNN
              H K +T QT    +    H  +E++ NN
Subjt:  ERTARWHDKKITSQTLLPGQRVSPHGAVELQGNN

Q9UR07 Transposon Tf2-11 polyprotein1.4e-2925.95Show/hide
Query:  DASDYALGTVLGQRR-DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAAL--KYLFVKKDYKPRLMRWILLLQLFD
        DASD A+G VL Q+  D+ +  + Y S  +   Q  Y+ ++KE+LA++ ++  +R YL  +     + TDH  L  +     +    RL RW L LQ F+
Subjt:  DASDYALGTVLGQRR-DNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGS--KIVVHTDHAAL--KYLFVKKDYKPRLMRWILLLQLFD

Query:  LEIKDRKGCENVVADHLSRIENE--------EAKSWPSIVKM-FPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKV
         EI  R G  N +AD LSRI +E        E  S   + ++   D+   QV         ++N L       + N Q K   L   K     D  L  +
Subjt:  LEIKDRKGCENVVADHLSRIENE--------EAKSWPSIVKM-FPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKV

Query:  CADNMIRKCVPQEEVVNILNSCHASPYGGQFGP-TRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHEL--PMKPILEVEL-FDIWGIDFMG
          D  + +         I+   H    G    P       ++  RF W  + K    +V++C  CQ   N SR H+   P++PI   E  ++   +DF+ 
Subjt:  CADNMIRKCVPQEEVVNILNSCHASPYGGQFGP-TRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHEL--PMKPILEVEL-FDIWGIDFMG

Query:  LLPISSNGHLYILVAVDYVSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREI
         LP  S+G+  + V VD  SK    +  T++  A    +   + +   FG P+ II+D    F ++ ++    KYN   K +  Y PQT+G  E +N+ +
Subjt:  LLPISSNGHLYILVAVDYVSKWVEAI-ATRTNDARTVLKFLHKNIFTRFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREI

Query:  KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHL-PVELEH-------------RAYWAIKK-LNMDFEKVGEKRLLELNEMEEF
        +++L     T+   W   +     +Y  A  +    +P+++V   +  L P+EL               + +  +K+ LN +  K+ +   +++ E+EEF
Subjt:  KQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHL-PVELEH-------------RAYWAIKK-LNMDFEKVGEKRLLELNEMEEF

Query:  R
        +
Subjt:  R

Arabidopsis top hitse value%identityAlignment
ATMG00750.1 GAG/POL/ENV polyprotein5.5e-1654.29Show/hide
Query:  VLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFM-----GLLPISSNG
        VLQ+ FYWP+ FKD + FV SCD CQR GN ++++E+P   ILEVE+FD+WGI FM        PI  NG
Subjt:  VLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFM-----GLLPISSNG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGTGATGCTAGTGATTATGCTTTAGGAACTGTTTTAGGCCAACGTAGAGATAACATGTTTAGGGCCATTTACTATGCTAGTAGGACTCTTGATAATACTCAACAGAA
ATACACTACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAATTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGACCATGCTGCTTTAA
AGTATTTGTTTGTTAAGAAAGATTATAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGTTATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTC
GCAGACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCTCAATTGTTAAGATGTTCCCTGATGAACAACTGTATCAGGTAAAAGATAGTTTGCCCTGGTT
TGCTGACATAGTTAATTATCTTGCAGGAGGACATTTGCCACCTGACATGAACTATCAACAAAAGAAAAGATTCCTGCACAAAGTAAAGTCTTACCATTGGGAGGACCCAT
TTCTCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTGCCTCAAGAGGAAGTGGTAAATATTCTAAATTCATGTCATGCCTCACCCTATGGAGGTCAATTTGGA
CCCACTAGAACTGCAGCTAAGGTACTTCAGTCAAGATTTTATTGGCCATCCCTTTTTAAAGACTGTTATACCTTTGTTAAGTCATGTGATAGGTGCCAACGTATTGGCAA
TATTTCTAGACAACATGAGCTTCCAATGAAACCTATCTTGGAAGTGGAGTTATTTGATATTTGGGGTATTGACTTTATGGGGCTCTTGCCTATTTCTTCTAATGGTCACC
TATACATTCTAGTTGCAGTAGATTATGTATCTAAATGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTTTAAAATTCTTGCATAAAAACATTTTCACA
CGTTTTGGTACACCTAGAGCTATTATTAGTGATGAAGGCTCTCACTTTTGCAATAAACTGTTTGAATCCATGATGCAAAAATATAATGTAAAACATAAAATTGCTACAGC
TTATCATCCTCAAACTAATGGCCTTGCTGAGTTATCTAACAGGGAAATCAAACAAGTTTTGGAAAAGACAGTCAAGACCAATAGGAAGGATTGGGCCCTTAAGCTCGATG
ATGCACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTATAAGTTGGTGTTTGGAAAGGCTTGTCACTTACCGGTAGAGCTCGAGCATAGAGCT
TATTGGGCCATCAAGAAGCTGAACATGGATTTCGAGAAAGTTGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCGAGCTTATGAGAATGTCAA
ACTTTACAAAGAGCGCACTGCGAGATGGCATGACAAGAAGATAACATCACAAACCTTGCTTCCAGGACAAAGAGTATCCCCCCACGGAGCCGTGGAACTACAAGGTAACA
ATGGAACAACCTTCAAAGTGAATGGTCAACGATTAAAGCACTACATCGGTGATGAAGAACGCGGACTTGAGAACCTGACTTTCATTGCATAA
mRNA sequenceShow/hide mRNA sequence
ATGTGTGATGCTAGTGATTATGCTTTAGGAACTGTTTTAGGCCAACGTAGAGATAACATGTTTAGGGCCATTTACTATGCTAGTAGGACTCTTGATAATACTCAACAGAA
ATACACTACTACTGAAAAAGAACTACTAGCTGTTGTGTTTGCCATTGATAAATTTAGATCATACTTGCTTGGCTCTAAAATAGTAGTGCATACTGACCATGCTGCTTTAA
AGTATTTGTTTGTTAAGAAAGATTATAAACCTAGGCTAATGAGGTGGATATTATTGTTACAGTTATTTGACCTAGAAATCAAAGACAGGAAAGGATGTGAAAATGTGGTC
GCAGACCACTTATCTAGAATTGAGAATGAGGAAGCTAAATCATGGCCCTCAATTGTTAAGATGTTCCCTGATGAACAACTGTATCAGGTAAAAGATAGTTTGCCCTGGTT
TGCTGACATAGTTAATTATCTTGCAGGAGGACATTTGCCACCTGACATGAACTATCAACAAAAGAAAAGATTCCTGCACAAAGTAAAGTCTTACCATTGGGAGGACCCAT
TTCTCTACAAGGTTTGTGCTGACAATATGATAAGAAAGTGTGTGCCTCAAGAGGAAGTGGTAAATATTCTAAATTCATGTCATGCCTCACCCTATGGAGGTCAATTTGGA
CCCACTAGAACTGCAGCTAAGGTACTTCAGTCAAGATTTTATTGGCCATCCCTTTTTAAAGACTGTTATACCTTTGTTAAGTCATGTGATAGGTGCCAACGTATTGGCAA
TATTTCTAGACAACATGAGCTTCCAATGAAACCTATCTTGGAAGTGGAGTTATTTGATATTTGGGGTATTGACTTTATGGGGCTCTTGCCTATTTCTTCTAATGGTCACC
TATACATTCTAGTTGCAGTAGATTATGTATCTAAATGGGTAGAAGCCATAGCTACTAGGACCAATGATGCTCGCACTGTTTTAAAATTCTTGCATAAAAACATTTTCACA
CGTTTTGGTACACCTAGAGCTATTATTAGTGATGAAGGCTCTCACTTTTGCAATAAACTGTTTGAATCCATGATGCAAAAATATAATGTAAAACATAAAATTGCTACAGC
TTATCATCCTCAAACTAATGGCCTTGCTGAGTTATCTAACAGGGAAATCAAACAAGTTTTGGAAAAGACAGTCAAGACCAATAGGAAGGATTGGGCCCTTAAGCTCGATG
ATGCACTGTGGGCCTACCGCACAGCTTTCAAAACCCCAATTGGTACTTCCCCGTATAAGTTGGTGTTTGGAAAGGCTTGTCACTTACCGGTAGAGCTCGAGCATAGAGCT
TATTGGGCCATCAAGAAGCTGAACATGGATTTCGAGAAAGTTGGTGAGAAACGCCTCTTAGAACTCAATGAGATGGAGGAGTTCCGTGCTCGAGCTTATGAGAATGTCAA
ACTTTACAAAGAGCGCACTGCGAGATGGCATGACAAGAAGATAACATCACAAACCTTGCTTCCAGGACAAAGAGTATCCCCCCACGGAGCCGTGGAACTACAAGGTAACA
ATGGAACAACCTTCAAAGTGAATGGTCAACGATTAAAGCACTACATCGGTGATGAAGAACGCGGACTTGAGAACCTGACTTTCATTGCATAA
Protein sequenceShow/hide protein sequence
MCDASDYALGTVLGQRRDNMFRAIYYASRTLDNTQQKYTTTEKELLAVVFAIDKFRSYLLGSKIVVHTDHAALKYLFVKKDYKPRLMRWILLLQLFDLEIKDRKGCENVV
ADHLSRIENEEAKSWPSIVKMFPDEQLYQVKDSLPWFADIVNYLAGGHLPPDMNYQQKKRFLHKVKSYHWEDPFLYKVCADNMIRKCVPQEEVVNILNSCHASPYGGQFG
PTRTAAKVLQSRFYWPSLFKDCYTFVKSCDRCQRIGNISRQHELPMKPILEVELFDIWGIDFMGLLPISSNGHLYILVAVDYVSKWVEAIATRTNDARTVLKFLHKNIFT
RFGTPRAIISDEGSHFCNKLFESMMQKYNVKHKIATAYHPQTNGLAELSNREIKQVLEKTVKTNRKDWALKLDDALWAYRTAFKTPIGTSPYKLVFGKACHLPVELEHRA
YWAIKKLNMDFEKVGEKRLLELNEMEEFRARAYENVKLYKERTARWHDKKITSQTLLPGQRVSPHGAVELQGNNGTTFKVNGQRLKHYIGDEERGLENLTFIA