; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI01G32640 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI01G32640
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr1:27500554..27503101
RNA-Seq ExpressionCSPI01G32640
SyntenyCSPI01G32640
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0010158 - abaxial cell fate specification (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus]1.8e-25280.41Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM
        MLQA VIRPSH+P SSP+LLVKKKDGGWRFCVDYRKLNQ T +DKFPIPVIEELLDELH ATVFSKLD+KS YHQIRM+EED+EKTAF THEGHYEFLVM
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM

Query:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP
        PFGLTNAPATFQSL+NQVFKPFLRRCVLVFFDDILVYS DI EHEKHLGMVFA+LRD+  FANK+KCVIAHS+IQYLGH+IS +GV+ADEEKI+D+V WP
Subjt:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP

Query:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ
        QP++VTGLRGFLGL+GYYRRFVKGYGEIAAPLT+LLQKNSF+W+E+AT+AF KLK A TTIPVLALP W+LPF++ETDA G GLGAVLSQNGHPIAFFSQ
Subjt:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ

Query:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT
        KLS +AQAKS+YERELM V LSVQKWRHYLLG+KFTII DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQ G +NKAADALSR+E  LE+N +TT 
Subjt:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT

Query:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE
        GIV+ME+ ++EV QD+EL++ I  LKQ     SK  WEN KL YK R+VLSK SS+IP LLHTFH+S+LG HSGFLRTYKRM G+L+W+GMK D+K+YVE
Subjt:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE

Query:  QCEICQRNKYEATKPAGVLQPIPISEKILEE
        QCEICQRNKYEATKPAGVL PIP  + ILEE
Subjt:  QCEICQRNKYEATKPAGVLQPIPISEKILEE

KAE8637598.1 hypothetical protein CSA_022681 [Cucumis sativus]8.6e-25079.1Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM
        MLQA VIRPS +P SSP+LLVKKKDGGWRFCVDYRKLNQ T ADKFPIPVIEELLDELH AT FSKLDLKSGYHQIRM+EED+EKTAF THEGHYEFLVM
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM

Query:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP
        PFGLTNAPATFQSL+N+VFKPFLRRCVLVFF DILVYS+DI EH KHLGMVFAILRDH+ FAN+ KCVIAHSQ+QYLGHLIS RGVEADE+KI+ +VNWP
Subjt:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP

Query:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ
        +P+++TGLRGFLGLTGYYRRFVK YGEIAAPLTKLLQKN+F WNEEATIAF++LK+A TT+PVLALP+W+ PF +ETDA G+GLGAVLSQ+GHPIAFFSQ
Subjt:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ

Query:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT
        KLSP+AQ KS+YERELMAV LSVQKWRHYLLG+KFTI+ DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQ G +NK ADALSR +  +E+N MTTT
Subjt:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT

Query:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE
        GIV++EI  +EV+ D EL++II  LK   D+  K++W N +L YK R+VL + SSLIP+LLHTFH+S+LG HSGFLRTYKRM G+L WKGMK D+KRYVE
Subjt:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE

Query:  QCEICQRNKYEATKPAGVLQPIPISEKILEE
        +C+ CQRNK+EATKPAGVLQPIPI +KILE+
Subjt:  QCEICQRNKYEATKPAGVLQPIPISEKILEE

KGN62557.2 hypothetical protein Csa_018739 [Cucumis sativus]6.1e-25680.98Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM
        MLQ  +IRPSH+P SSP+LLV+KKDGGWRFCVDYRKLNQ T +DKFPIPVIEELLDELH ATVFSKLDLKSGYHQIRMKEED+EKTAF THEGHYEFLVM
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM

Query:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP
        PFGLTNAPATFQSL+N VFKPFLRRCVLVFFDDIL+YS ++ EHEKHL MVFA++RD+Q  ANK+KCVIAHSQIQYLGHLIS RGVEAD +KI+D+VNWP
Subjt:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP

Query:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ
        QP++VTGLRGFLGLTGYYRRFVKGYGE+A PLTKLLQKNSFLW EEAT AF+KLK+A TT+PVLALP+WNLPFI+ETDA GI LGAVLSQNGHPIAFFSQ
Subjt:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ

Query:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT
        KLS +A+ KS+YERELMAV LSVQKWRHYLLG+KFTII DQ+ALKFLLEQREVQPQFQKWLTKLLGYDFEILYQ G +NKAADALSR+EQP+E+  M+TT
Subjt:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT

Query:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE
        GIVNME+  +EV+ D+ELK II+ LKQ  DE SK +W N  LWYK RIVLSK+S+LIP LLHTFH+S+LG HSGFLRTYKRM G+L+WKGMK DVK+YV+
Subjt:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE

Query:  QCEICQRNKYEATKPAGVLQPIPISEKILEE
        +CE+CQRNK EATKPAGVLQPIPI E+ILE+
Subjt:  QCEICQRNKYEATKPAGVLQPIPISEKILEE

TYJ96663.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.8e-24276.65Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM
        MLQ  +IRPSH+P SSP+LLVKKKDGGWRFCVDYRKLN+ T ADKFPIPVIEELLDELH ATVFSKLDLKSGYHQIRM+EEDIEKTAF THEGHYEF+VM
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM

Query:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP
        PFGLTNAPATFQSL+NQVFKPFLRRCVLVFFDDILVYS DI EHEKHLGMVFA LRD+Q +AN++KCV AHSQI YLGH+IS+ GVEAD++K++ ++ WP
Subjt:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP

Query:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ
        +P++VTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN+F W+E AT+AF  LK A +TIPVLALP+W+LPF++ETDA G GLGAVLSQN HPIAFFSQ
Subjt:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ

Query:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT
        KLS +AQAKS+YERELMAV LSVQKWRHYLLG++FTI+ DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQ G +NKAADALSR++  +E+  ++TT
Subjt:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT

Query:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE
        GIV+ME+  +EV++D+EL+ +I  L+       K+   N  L YK R+VLSK SS+IP+LLHTFH+S+LG HSGFLRTYKRM G+L WKGMK D+K+YVE
Subjt:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE

Query:  QCEICQRNKYEATKPAGVLQPIPISEKILEE
        QCEICQRNK EATKPAGVLQP+PI ++ILE+
Subjt:  QCEICQRNKYEATKPAGVLQPIPISEKILEE

TYK28944.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.8e-24276.65Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM
        MLQ  +IRPSH+P SSP+LLVKKKDGGWRFCVDYRKLN+ T ADKFPIPVIEELLDELH ATVFSKLDLKSGYHQIRM+EEDIEKTAF THEGHYEF+VM
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM

Query:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP
        PFGLTNAPATFQSL+NQVFKPFLRRCVLVFFDDILVYS DI EHEKHLGMVFA LRD+Q +AN++KCV AHSQI YLGH+IS+ GVEAD++K++ ++ WP
Subjt:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP

Query:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ
        +P++VTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN+F W+E AT+AF  LK A +TIPVLALP+W+LPF++ETDA G GLGAVLSQN HPIAFFSQ
Subjt:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ

Query:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT
        KLS +AQAKS+YERELMAV LSVQKWRHYLLG++FTI+ DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQ G +NKAADALSR++  +E+  ++TT
Subjt:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT

Query:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE
        GIV+ME+  +EV++D+EL+ +I  L+       K+   N  L YK R+VLSK SS+IP+LLHTFH+S+LG HSGFLRTYKRM G+L WKGMK D+K+YVE
Subjt:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE

Query:  QCEICQRNKYEATKPAGVLQPIPISEKILEE
        QCEICQRNK EATKPAGVLQP+PI ++ILE+
Subjt:  QCEICQRNKYEATKPAGVLQPIPISEKILEE

TrEMBL top hitse value%identityAlignment
A0A5D3BBH7 Ty3/gypsy retrotransposon protein1.9e-24276.65Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM
        MLQ  +IRPSH+P SSP+LLVKKKDGGWRFCVDYRKLN+ T ADKFPIPVIEELLDELH ATVFSKLDLKSGYHQIRM+EEDIEKTAF THEGHYEF+VM
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM

Query:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP
        PFGLTNAPATFQSL+NQVFKPFLRRCVLVFFDDILVYS DI EHEKHLGMVFA LRD+Q +AN++KCV AHSQI YLGH+IS+ GVEAD++K++ ++ WP
Subjt:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP

Query:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ
        +P++VTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN+F W+E AT+AF  LK A +TIPVLALP+W+LPF++ETDA G GLGAVLSQN HPIAFFSQ
Subjt:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ

Query:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT
        KLS +AQAKS+YERELMAV LSVQKWRHYLLG++FTI+ DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQ G +NKAADALSR++  +E+  ++TT
Subjt:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT

Query:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE
        GIV+ME+  +EV++D+EL+ +I  L+       K+   N  L YK R+VLSK SS+IP+LLHTFH+S+LG HSGFLRTYKRM G+L WKGMK D+K+YVE
Subjt:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE

Query:  QCEICQRNKYEATKPAGVLQPIPISEKILEE
        QCEICQRNK EATKPAGVLQP+PI ++ILE+
Subjt:  QCEICQRNKYEATKPAGVLQPIPISEKILEE

A0A5D3DU86 Ty3/gypsy retrotransposon protein1.9e-24276.65Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM
        MLQ  +IRPSH+P SSP+LLVKKKDGGWRFCVDYRKLN+ T ADKFPIPVIEELLDELH ATVFSKLDLKSGYHQIRM+EEDIEKTAF THEGHYEF+VM
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM

Query:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP
        PFGLTNAPATFQSL+NQVFKPFLRRCVLVFFDDILVYS DI EHEKHLGMVFA LRD+Q +AN++KCV AHSQI YLGH+IS+ GVEAD++K++ ++ WP
Subjt:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP

Query:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ
        +P++VTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN+F W+E AT+AF  LK A +TIPVLALP+W+LPF++ETDA G GLGAVLSQN HPIAFFSQ
Subjt:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ

Query:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT
        KLS +AQAKS+YERELMAV LSVQKWRHYLLG++FTI+ DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQ G +NKAADALSR++  +E+  ++TT
Subjt:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT

Query:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE
        GIV+ME+  +EV++D+EL+ +I  L+       K+   N  L YK R+VLSK SS+IP+LLHTFH+S+LG HSGFLRTYKRM G+L WKGMK D+K+YVE
Subjt:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE

Query:  QCEICQRNKYEATKPAGVLQPIPISEKILEE
        QCEICQRNK EATKPAGVLQP+PI ++ILE+
Subjt:  QCEICQRNKYEATKPAGVLQPIPISEKILEE

A0A5D3DWA9 Ty3/gypsy retrotransposon protein1.9e-24276.65Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM
        MLQ  +IRPSH+P SSP+LLVKKKDGGWRFCVDYRKLN+ T ADKFPIPVIEELLDELH ATVFSKLDLKSGYHQIRM+EEDIEKTAF THEGHYEF+VM
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM

Query:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP
        PFGLTNAPATFQSL+NQVFKPFLRRCVLVFFDDILVYS DI EHEKHLGMVFA LRD+Q +AN++KCV AHSQI YLGH+IS+ GVEAD++K++ ++ WP
Subjt:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP

Query:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ
        +P++VTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN+F W+E AT+AF  LK A +TIPVLALP+W+LPF++ETDA G GLGAVLSQN HPIAFFSQ
Subjt:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ

Query:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT
        KLS +AQAKS+YERELMAV LSVQKWRHYLLG++FTI+ DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQ G +NKAADALSR++  +E+  ++TT
Subjt:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT

Query:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE
        GIV+ME+  +EV++D+EL+ +I  L+       K+   N  L YK R+VLSK SS+IP+LLHTFH+S+LG HSGFLRTYKRM G+L WKGMK D+K+YVE
Subjt:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE

Query:  QCEICQRNKYEATKPAGVLQPIPISEKILEE
        QCEICQRNK EATKPAGVLQP+PI ++ILE+
Subjt:  QCEICQRNKYEATKPAGVLQPIPISEKILEE

A0A5D3DZK6 Ty3/gypsy retrotransposon protein1.9e-24276.65Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM
        MLQ  +IRPSH+P SSP+LLVKKKDGGWRFCVDYRKLN+ T ADKFPIPVIEELLDELH ATVFSKLDLKSGYHQIRM+EEDIEKTAF THEGHYEF+VM
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM

Query:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP
        PFGLTNAPATFQSL+NQVFKPFLRRCVLVFFDDILVYS DI EHEKHLGMVFA LRD+Q +AN++KCV AHSQI YLGH+IS+ GVEAD++K++ ++ WP
Subjt:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP

Query:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ
        +P++VTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN+F W+E AT+AF  LK A +TIPVLALP+W+LPF++ETDA G GLGAVLSQN HPIAFFSQ
Subjt:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ

Query:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT
        KLS +AQAKS+YERELMAV LSVQKWRHYLLG++FTI+ DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQ G +NKAADALSR++  +E+  ++TT
Subjt:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT

Query:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE
        GIV+ME+  +EV++D+EL+ +I  L+       K+   N  L YK R+VLSK SS+IP+LLHTFH+S+LG HSGFLRTYKRM G+L WKGMK D+K+YVE
Subjt:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE

Query:  QCEICQRNKYEATKPAGVLQPIPISEKILEE
        QCEICQRNK EATKPAGVLQP+PI ++ILE+
Subjt:  QCEICQRNKYEATKPAGVLQPIPISEKILEE

A0A5D3E325 Ty3/gypsy retrotransposon protein1.9e-24276.65Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM
        MLQ  +IRPSH+P SSP+LLVKKKDGGWRFCVDYRKLN+ T ADKFPIPVIEELLDELH ATVFSKLDLKSGYHQIRM+EEDIEKTAF THEGHYEF+VM
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVM

Query:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP
        PFGLTNAPATFQSL+NQVFKPFLRRCVLVFFDDILVYS DI EHEKHLGMVFA LRD+Q +AN++KCV AHSQI YLGH+IS+ GVEAD++K++ ++ WP
Subjt:  PFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWP

Query:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ
        +P++VTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN+F W+E AT+AF  LK A +TIPVLALP+W+LPF++ETDA G GLGAVLSQN HPIAFFSQ
Subjt:  QPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQ

Query:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT
        KLS +AQAKS+YERELMAV LSVQKWRHYLLG++FTI+ DQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQ G +NKAADALSR++  +E+  ++TT
Subjt:  KLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTT

Query:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE
        GIV+ME+  +EV++D+EL+ +I  L+       K+   N  L YK R+VLSK SS+IP+LLHTFH+S+LG HSGFLRTYKRM G+L WKGMK D+K+YVE
Subjt:  GIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVE

Query:  QCEICQRNKYEATKPAGVLQPIPISEKILEE
        QCEICQRNK EATKPAGVLQP+PI ++ILE+
Subjt:  QCEICQRNKYEATKPAGVLQPIPISEKILEE

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.68.2e-8640.91Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGG-----WRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHY
        ML   +IR S++P +SPI +V KK        +R  +DYRKLN+ T  D+ PIP ++E+L +L     F+ +DL  G+HQI M  E + KTAF T  GHY
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGG-----WRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHY

Query:  EFLVMPFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQD
        E+L MPFGL NAPATFQ  +N + +P L +  LV+ DDI+V+S  + EH + LG+VF  L          KC     +  +LGH+++  G++ + EKI+ 
Subjt:  EFLVMPFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQD

Query:  LVNWPQPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFL--WNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGH
        +  +P P     ++ FLGLTGYYR+F+  + +IA P+TK L+KN  +   N E   AF KLK   +  P+L +P++   F L TDA  + LGAVLSQ+GH
Subjt:  LVNWPQPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFL--WNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGH

Query:  PIAFFSQKLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVE
        P+++ S+ L+      S  E+EL+A+  + + +RHYLLG+ F I  D + L +L   ++   +  +W  KL  +DF+I Y  G +N  ADALSR++
Subjt:  PIAFFSQKLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVE

P0CT34 Transposon Tf2-1 polyprotein5.7e-7932.48Show/hide
Query:  LQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVMP
        L++ +IR S   ++ P++ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++  +T+F+KLDLKS YH IR+++ D  K AF    G +E+LVMP
Subjt:  LQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVMP

Query:  FGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWPQ
        +G++ APA FQ  +N +        V+ + DDIL++S    EH KH+  V   L++     N+ KC    SQ++++G+ IS +G    +E I  ++ W Q
Subjt:  FGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWPQ

Query:  PRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN-SFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNG-----HPI
        P+N   LR FLG   Y R+F+    ++  PL  LL+K+  + W    T A   +K    + PVL   +++   +LETDA  + +GAVLSQ       +P+
Subjt:  PRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN-SFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNG-----HPI

Query:  AFFSQKLSPKAQAKSVYERELMAVALSVQKWRHYLLG--KKFTIIFDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQP
         ++S K+S      SV ++E++A+  S++ WRHYL    + F I+ D + L  +   E      +  +W   L  ++FEI Y+ GS N  ADALSR+   
Subjt:  AFFSQKLSPKAQAKSVYERELMAVALSVQKWRHYLLG--KKFTIIFDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQP

Query:  LE----------INIMTTTGIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWY--KNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTY
         E          IN +    I + +  N+ V +     ++++ L   +    ++    D L    K++I+L   + L   ++  +H      H G     
Subjt:  LE----------INIMTTTGIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWY--KNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTY

Query:  KRMRGKLHWKGMKTDVKRYVEQCEICQRNKYEATKPAGVLQPIPISEK
          +  +  WKG++  ++ YV+ C  CQ NK    KP G LQPIP SE+
Subjt:  KRMRGKLHWKGMKTDVKRYVEQCEICQRNKYEATKPAGVLQPIPISEK

P0CT35 Transposon Tf2-2 polyprotein5.7e-7932.48Show/hide
Query:  LQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVMP
        L++ +IR S   ++ P++ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++  +T+F+KLDLKS YH IR+++ D  K AF    G +E+LVMP
Subjt:  LQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVMP

Query:  FGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWPQ
        +G++ APA FQ  +N +        V+ + DDIL++S    EH KH+  V   L++     N+ KC    SQ++++G+ IS +G    +E I  ++ W Q
Subjt:  FGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWPQ

Query:  PRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN-SFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNG-----HPI
        P+N   LR FLG   Y R+F+    ++  PL  LL+K+  + W    T A   +K    + PVL   +++   +LETDA  + +GAVLSQ       +P+
Subjt:  PRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN-SFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNG-----HPI

Query:  AFFSQKLSPKAQAKSVYERELMAVALSVQKWRHYLLG--KKFTIIFDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQP
         ++S K+S      SV ++E++A+  S++ WRHYL    + F I+ D + L  +   E      +  +W   L  ++FEI Y+ GS N  ADALSR+   
Subjt:  AFFSQKLSPKAQAKSVYERELMAVALSVQKWRHYLLG--KKFTIIFDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQP

Query:  LE----------INIMTTTGIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWY--KNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTY
         E          IN +    I + +  N+ V +     ++++ L   +    ++    D L    K++I+L   + L   ++  +H      H G     
Subjt:  LE----------INIMTTTGIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWY--KNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTY

Query:  KRMRGKLHWKGMKTDVKRYVEQCEICQRNKYEATKPAGVLQPIPISEK
          +  +  WKG++  ++ YV+ C  CQ NK    KP G LQPIP SE+
Subjt:  KRMRGKLHWKGMKTDVKRYVEQCEICQRNKYEATKPAGVLQPIPISEK

P0CT41 Transposon Tf2-12 polyprotein5.7e-7932.48Show/hide
Query:  LQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVMP
        L++ +IR S   ++ P++ V KK+G  R  VDY+ LN+    + +P+P+IE+LL ++  +T+F+KLDLKS YH IR+++ D  K AF    G +E+LVMP
Subjt:  LQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVMP

Query:  FGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWPQ
        +G++ APA FQ  +N +        V+ + DDIL++S    EH KH+  V   L++     N+ KC    SQ++++G+ IS +G    +E I  ++ W Q
Subjt:  FGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWPQ

Query:  PRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN-SFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNG-----HPI
        P+N   LR FLG   Y R+F+    ++  PL  LL+K+  + W    T A   +K    + PVL   +++   +LETDA  + +GAVLSQ       +P+
Subjt:  PRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKN-SFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNG-----HPI

Query:  AFFSQKLSPKAQAKSVYERELMAVALSVQKWRHYLLG--KKFTIIFDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQP
         ++S K+S      SV ++E++A+  S++ WRHYL    + F I+ D + L  +   E      +  +W   L  ++FEI Y+ GS N  ADALSR+   
Subjt:  AFFSQKLSPKAQAKSVYERELMAVALSVQKWRHYLLG--KKFTIIFDQKAL--KFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQP

Query:  LE----------INIMTTTGIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWY--KNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTY
         E          IN +    I + +  N+ V +     ++++ L   +    ++    D L    K++I+L   + L   ++  +H      H G     
Subjt:  LE----------INIMTTTGIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWENDKLWY--KNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTY

Query:  KRMRGKLHWKGMKTDVKRYVEQCEICQRNKYEATKPAGVLQPIPISEK
          +  +  WKG++  ++ YV+ C  CQ NK    KP G LQPIP SE+
Subjt:  KRMRGKLHWKGMKTDVKRYVEQCEICQRNKYEATKPAGVLQPIPISEK

P20825 Retrovirus-related Pol polyprotein from transposon 2971.6e-8641.16Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGG-----WRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHY
        ML   +IR S++P +SP  +V KK        +R  +DYRKLN+ T  D++PIP ++E+L +L +   F+ +DL  G+HQI M EE I KTAF T  GHY
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGG-----WRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHY

Query:  EFLVMPFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQD
        E+L MPFGL NAPATFQ  +N + +P L +  LV+ DDI+++S  + EH   + +VF  L D        KC     +  +LGH+++  G++ +  K++ 
Subjt:  EFLVMPFGLTNAPATFQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQD

Query:  LVNWPQPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNE--EATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGH
        +V++P P     +R FLGLTGYYR+F+  Y +IA P+T  L+K + +  +  E   AF KLK      P+L LP++   F+L TDA  + LGAVLSQNGH
Subjt:  LVNWPQPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWNE--EATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGH

Query:  PIAFFSQKLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVE
        PI+F S+ L+      S  E+EL+A+  + + +RHYLLG++F I  D + L++L   +E   + ++W  +L  Y F+I Y  G +N  ADALSR++
Subjt:  PIAFFSQKLSPKAQAKSVYERELMAVALSVQKWRHYLLGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVE

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein7.2e-0567.86Show/hide
Query:  MLQAEVIRPSHNPDSSPILLVKKKDGGW
        ML+A +I+PS +P SSP+LLV+KKDGGW
Subjt:  MLQAEVIRPSHNPDSSPILLVKKKDGGW

ATMG00860.1 DNA/RNA polymerases superfamily protein9.1e-4059.23Show/hide
Query:  HLGMVFAILRDHQPFANKRKCVIAHSQIQYLG--HLISRRGVEADEEKIQDLVNWPQPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWN
        HLGMV  I   HQ +AN++KC     QI YLG  H+IS  GV AD  K++ +V WP+P+N T LRGFLGLTGYYRRFVK YG+I  PLT+LL+KNS  W 
Subjt:  HLGMVFAILRDHQPFANKRKCVIAHSQIQYLG--HLISRRGVEADEEKIQDLVNWPQPRNVTGLRGFLGLTGYYRRFVKGYGEIAAPLTKLLQKNSFLWN

Query:  EEATIAFNKLKIARTTIPVLALPEWNLPFI
        E A +AF  LK A TT+PVLALP+  LPF+
Subjt:  EEATIAFNKLKIARTTIPVLALPEWNLPFI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCAAGCTGAGGTAATAAGGCCGAGTCACAACCCCGATTCTAGCCCCATTTTATTGGTGAAGAAGAAGGATGGAGGGTGGAGATTTTGTGTAGACTACCGAAAGCT
CAACCAAGCCACTACAGCTGACAAATTCCCCATTCCTGTAATAGAAGAATTACTAGATGAACTGCACGAGGCTACAGTGTTCTCAAAGTTAGATTTGAAGTCCGGTTATC
ACCAAATAAGGATGAAGGAAGAAGACATAGAGAAGACAGCCTTCTGGACTCATGAAGGCCATTATGAATTCTTGGTCATGCCGTTTGGCCTCACCAACGCTCCTGCAACC
TTCCAATCATTAGTGAACCAGGTATTTAAACCGTTCTTAAGACGCTGTGTTTTGGTTTTTTTTGATGACATTCTGGTGTATAGCTTGGATATCATCGAACATGAGAAACA
CTTAGGCATGGTGTTCGCTATATTGAGGGATCATCAACCGTTTGCCAATAAAAGGAAGTGCGTTATAGCTCATTCCCAAATCCAGTACTTGGGCCATTTAATTTCCAGAA
GAGGGGTAGAAGCTGATGAAGAAAAAATACAAGATTTGGTTAATTGGCCACAACCAAGGAATGTCACTGGATTGAGGGGATTCTTGGGGTTAACCGGCTATTATAGAAGG
TTTGTCAAAGGCTATGGCGAAATCGCAGCTCCTCTCACTAAGTTGCTGCAGAAGAATTCTTTTTTATGGAATGAAGAAGCCACAATTGCTTTTAATAAGCTGAAGATAGC
AAGGACAACGATACCCGTTTTAGCACTTCCTGAATGGAACCTGCCATTCATTTTGGAAACAGATGCATTAGGAATAGGATTGGGAGCTGTGTTGTCTCAGAATGGTCACC
CTATCGCGTTCTTTAGCCAAAAACTATCTCCCAAGGCACAAGCTAAGTCAGTATATGAGCGAGAGTTAATGGCCGTGGCGCTCTCTGTGCAGAAATGGAGGCATTATTTA
TTGGGAAAGAAGTTCACCATCATATTCGACCAGAAGGCTTTAAAGTTCCTTTTAGAACAGAGAGAGGTTCAACCTCAATTCCAAAAATGGCTAACTAAGCTATTAGGATA
TGATTTCGAGATTCTTTATCAACTCGGATCTAAGAACAAGGCTGCTGACGCACTGTCTCGAGTGGAACAACCACTGGAAATTAACATTATGACTACTACGGGTATTGTGA
ACATGGAGATAGCCAATGAAGAAGTGCAGCAAGATGATGAACTTAAGAGGATTATAGATGGGCTAAAACAAAGGGAAGATGAGACAAGCAAACATCGTTGGGAGAATGAC
AAACTGTGGTATAAAAACCGAATAGTGTTATCGAAACAGTCTTCATTGATACCGAATCTGTTGCACACATTTCATAACTCTGTTCTAGGAAGCCATTCCGGATTTCTAAG
AACATATAAGAGGATGAGAGGGAAATTACATTGGAAAGGAATGAAAACCGATGTCAAGAGATATGTGGAGCAATGTGAGATTTGCCAGAGAAATAAGTACGAGGCAACTA
AACCAGCAGGGGTTCTACAACCCATTCCAATTTCGGAGAAAATCCTTGAAGAATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAAATTTAGAGCCAGACTCGATAAATGCCTATGTTTTAAATGTAATGAGAGACACTCACCTGGGCATAAATGCAAAATGAAAAAAAAGAGGGAATTGATGTTATTC
ATCATGAATGAAGAGGAGAGTGTCGAAGAGGAGAACCAAAGGGAGGAGAATTCAGAAGAGGTGGTGGAGTTGAAACAGCTGGACCTCACAGAAGAAACAAAGATAGAATT
GAGAGTAGTCACCAGGTTGACAAATAGAGGAATGATGAAGCTCAATGGAGAGATAAGAGGGAAGGAAGTGGTAGTACTTATCGACAGTGGAGCCACCCACAATTTTTTAC
ACATTAAAATAGTAGAGGAGCAACAAATAACCGTCGAGGACGGAACACCCTTTGGAGTAACGATTGGTAATGGCACGAAGTGCAGAGGAATAGGAGTATGTAGAAAAGTG
GAGATGAAATTGAAGGGGCTCACTGTTGTGACCAACTTCTTAGCCATTGATTTGGGCAGTGTGGATGTCGTTTTAGGGATGCAGTGGCTAGGTACCATTAAAACTATGAA
AATCAATTGGCCTTCCTTGATTATGAAGTTTTGGGTTGGAACAAGGCAACTTACTCTTAAGGGGGATCCTTCCCTGGTGAGATCAGAATGTTCCTTGAGAACGATTGAGA
AGACATGGGAAAAGGAAGATCAATGTTTCCTTTTAGAACTACAAAACTATGATACAGAAATGAATGAAGACATGGGGGAAAATCAAGAAATTAAGGGAGACGAAGAAGAT
ACTCCGATGTTAAGGTTTCTGTTACAACAATATGTGGATATTTTCGAGGATCCAAAAGGGTTACCTCCTAAAAGAGAAGTTGATCATTGAATCTTGATGATGCCGAGCAG
AAACCAATTAACGTGAGACCTTGCAAGAATGGACACGTTCAAAAGGAAGAGATTGAGAAGTTAGTAGAGGAGATGCTCCAAGCTGAGGTAATAAGGCCGAGTCACAACCC
CGATTCTAGCCCCATTTTATTGGTGAAGAAGAAGGATGGAGGGTGGAGATTTTGTGTAGACTACCGAAAGCTCAACCAAGCCACTACAGCTGACAAATTCCCCATTCCTG
TAATAGAAGAATTACTAGATGAACTGCACGAGGCTACAGTGTTCTCAAAGTTAGATTTGAAGTCCGGTTATCACCAAATAAGGATGAAGGAAGAAGACATAGAGAAGACA
GCCTTCTGGACTCATGAAGGCCATTATGAATTCTTGGTCATGCCGTTTGGCCTCACCAACGCTCCTGCAACCTTCCAATCATTAGTGAACCAGGTATTTAAACCGTTCTT
AAGACGCTGTGTTTTGGTTTTTTTTGATGACATTCTGGTGTATAGCTTGGATATCATCGAACATGAGAAACACTTAGGCATGGTGTTCGCTATATTGAGGGATCATCAAC
CGTTTGCCAATAAAAGGAAGTGCGTTATAGCTCATTCCCAAATCCAGTACTTGGGCCATTTAATTTCCAGAAGAGGGGTAGAAGCTGATGAAGAAAAAATACAAGATTTG
GTTAATTGGCCACAACCAAGGAATGTCACTGGATTGAGGGGATTCTTGGGGTTAACCGGCTATTATAGAAGGTTTGTCAAAGGCTATGGCGAAATCGCAGCTCCTCTCAC
TAAGTTGCTGCAGAAGAATTCTTTTTTATGGAATGAAGAAGCCACAATTGCTTTTAATAAGCTGAAGATAGCAAGGACAACGATACCCGTTTTAGCACTTCCTGAATGGA
ACCTGCCATTCATTTTGGAAACAGATGCATTAGGAATAGGATTGGGAGCTGTGTTGTCTCAGAATGGTCACCCTATCGCGTTCTTTAGCCAAAAACTATCTCCCAAGGCA
CAAGCTAAGTCAGTATATGAGCGAGAGTTAATGGCCGTGGCGCTCTCTGTGCAGAAATGGAGGCATTATTTATTGGGAAAGAAGTTCACCATCATATTCGACCAGAAGGC
TTTAAAGTTCCTTTTAGAACAGAGAGAGGTTCAACCTCAATTCCAAAAATGGCTAACTAAGCTATTAGGATATGATTTCGAGATTCTTTATCAACTCGGATCTAAGAACA
AGGCTGCTGACGCACTGTCTCGAGTGGAACAACCACTGGAAATTAACATTATGACTACTACGGGTATTGTGAACATGGAGATAGCCAATGAAGAAGTGCAGCAAGATGAT
GAACTTAAGAGGATTATAGATGGGCTAAAACAAAGGGAAGATGAGACAAGCAAACATCGTTGGGAGAATGACAAACTGTGGTATAAAAACCGAATAGTGTTATCGAAACA
GTCTTCATTGATACCGAATCTGTTGCACACATTTCATAACTCTGTTCTAGGAAGCCATTCCGGATTTCTAAGAACATATAAGAGGATGAGAGGGAAATTACATTGGAAAG
GAATGAAAACCGATGTCAAGAGATATGTGGAGCAATGTGAGATTTGCCAGAGAAATAAGTACGAGGCAACTAAACCAGCAGGGGTTCTACAACCCATTCCAATTTCGGAG
AAAATCCTTGAAGAATGA
Protein sequenceShow/hide protein sequence
MLQAEVIRPSHNPDSSPILLVKKKDGGWRFCVDYRKLNQATTADKFPIPVIEELLDELHEATVFSKLDLKSGYHQIRMKEEDIEKTAFWTHEGHYEFLVMPFGLTNAPAT
FQSLVNQVFKPFLRRCVLVFFDDILVYSLDIIEHEKHLGMVFAILRDHQPFANKRKCVIAHSQIQYLGHLISRRGVEADEEKIQDLVNWPQPRNVTGLRGFLGLTGYYRR
FVKGYGEIAAPLTKLLQKNSFLWNEEATIAFNKLKIARTTIPVLALPEWNLPFILETDALGIGLGAVLSQNGHPIAFFSQKLSPKAQAKSVYERELMAVALSVQKWRHYL
LGKKFTIIFDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQLGSKNKAADALSRVEQPLEINIMTTTGIVNMEIANEEVQQDDELKRIIDGLKQREDETSKHRWEND
KLWYKNRIVLSKQSSLIPNLLHTFHNSVLGSHSGFLRTYKRMRGKLHWKGMKTDVKRYVEQCEICQRNKYEATKPAGVLQPIPISEKILEE