CuGenDBv2

CSPI07G09800 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI07G09800
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Ty3/gypsy retrotransposon protein
Genome location: Chr7:7766154..7767789
RNA-Seq Expression: CSPI07G09800
Synteny: CSPI07G09800
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0010158 - abaxial cell fate specification (biological process)
GO:0015074 - DNA integration (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
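The genome location string in the record above can be split into chromosome, start and end with a few lines of Python. This is a minimal sketch, not part of the database: the names `Locus` and `parse_location` are hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval such as 'Chr7:7766154..7767789'."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'Chrom:start..end' string into a Locus."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.groups()
    return Locus(chrom, int(start), int(end))


locus = parse_location("Chr7:7766154..7767789")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the gene spans 1636 bp on Chr7.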


Homology
GenBank top hits (e value, % identity, alignment)
KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus] (e value: 5.0e-217; % identity: 76.37)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        VFSKLD+KS YHQIRM+E+DVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLR CVLVFFDDILVYS D++EHEKHLGMVFAV+RDN L A
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP
        NKKKCVIAHS+IQYLGH+ISS+GV+AD EKIKDMV WPQPKDVTGLRGFLGL+GYYRRFVKGYGEIA PLT+LLQKNSF+W E+AT AF+KLK AMTT+P
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP

Query:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT
        VLALP+W+LPF+IETDASG  LGAVLSQNGHPIAFFSQ LS RA+ KSIYERELM VVLS                            REVQPQFQKWLT
Subjt:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT

Query:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH
        KLLGYDFEILYQPGLQNKA DALS++E  +E+ S++T GIV+M V++KEV  DEEL+  I+ELK+NP   +KF W NG LLYKKR+VLSK S++IPTLLH
Subjt:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH

Query:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL
        TFHDSILGGHSGFLRTYKRM GELYW+GMKAD+KKYV++CE+CQRNK EATKPAGVL PIP P+ ILE WSMDFIEGLPKAGGMN   +++
Subjt:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL

KGN62557.2 hypothetical protein Csa_018739 [Cucumis sativus] (e value: 2.5e-248; % identity: 88.8)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        VFSKLDLKSGYHQIRMKE+DVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMN VFKPFLR CVLVFFDDIL+YST+LTEHEKHL MVFAVMRDNQLVA
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP
        NKKKCVIAHSQIQYLGHLISSRGVEADG+KIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGE+A PLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP

Query:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT
        VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQ LSNRAKTKSIYERELMAVVLS                            REVQPQFQKWLT
Subjt:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT

Query:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH
        KLLGYDFEILYQPGLQNKA DALS+IEQPVEM++MSTTGIVNM VVEKEVELDEELKAIIEELK+NP E +KFQWVNGNL YKKRIVLSKESTLIPTLLH
Subjt:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH

Query:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL
        TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILE WSMDFIEGLPKAGGMN   +++
Subjt:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL

TYJ96663.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 1.3e-212; % identity: 74.13)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        VFSKLDLKSGYHQIRM+E+D+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMNQVFKPFLR CVLVFFDDILVYS+D+TEHEKHLGMVFA +RDNQL A
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP
        N+KKCV AHSQI YLGH+IS  GVEAD +K+K M+ WP+PKDVTGLRGFLGLTGYYRRFVKGYGEIA PLTKLLQKN+F W E AT AF+ LK AM+T+P
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP

Query:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT
        VLALPDW+LPF+IETDASG  LGAVLSQN HPIAFFSQ LS RA+ KSIYERELMAVVLS                            REVQPQFQKWLT
Subjt:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT

Query:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH
        KLLGYDFEILYQPGLQNKA DALS+++  +E++++STTGIV+M VV KEVE DEEL+ +I++L+ NP    K+   NG L+YK R+VLSK S++IP+LLH
Subjt:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH

Query:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL
        TFHDSILGGHSGFLRTYKRM GEL+WKGMK D+KKYV++CE+CQRNK EATKPAGVLQP+PIP+RILE W+MDFIEGLPKAGGMN   +++
Subjt:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL

TYK21035.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 1.3e-212; % identity: 74.13)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        VFSKLDLKSGYHQIRM+E+D+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMNQVFKPFLR CVLVFFDDILVYS+D+TEHEKHLGMVFA +RDNQL A
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP
        N+KKCV AHSQI YLGH+IS  GVEAD +K+K M+ WP+PKDVTGLRGFLGLTGYYRRFVKGYGEIA PLTKLLQKN+F W E AT AF+ LK AM+T+P
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP

Query:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT
        VLALPDW+LPF+IETDASG  LGAVLSQN HPIAFFSQ LS RA+ KSIYERELMAVVLS                            REVQPQFQKWLT
Subjt:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT

Query:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH
        KLLGYDFEILYQPGLQNKA DALS+++  +E++++STTGIV+M VV KEVE DEEL+ +I++L+ NP    K+   NG L+YK R+VLSK S++IP+LLH
Subjt:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH

Query:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL
        TFHDSILGGHSGFLRTYKRM GEL+WKGMK D+KKYV++CE+CQRNK EATKPAGVLQP+PIP+RILE W+MDFIEGLPKAGGMN   +++
Subjt:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL

TYK28944.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 1.3e-212; % identity: 74.13)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        VFSKLDLKSGYHQIRM+E+D+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMNQVFKPFLR CVLVFFDDILVYS+D+TEHEKHLGMVFA +RDNQL A
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP
        N+KKCV AHSQI YLGH+IS  GVEAD +K+K M+ WP+PKDVTGLRGFLGLTGYYRRFVKGYGEIA PLTKLLQKN+F W E AT AF+ LK AM+T+P
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP

Query:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT
        VLALPDW+LPF+IETDASG  LGAVLSQN HPIAFFSQ LS RA+ KSIYERELMAVVLS                            REVQPQFQKWLT
Subjt:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT

Query:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH
        KLLGYDFEILYQPGLQNKA DALS+++  +E++++STTGIV+M VV KEVE DEEL+ +I++L+ NP    K+   NG L+YK R+VLSK S++IP+LLH
Subjt:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH

Query:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL
        TFHDSILGGHSGFLRTYKRM GEL+WKGMK D+KKYV++CE+CQRNK EATKPAGVLQP+PIP+RILE W+MDFIEGLPKAGGMN   +++
Subjt:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL

TrEMBL top hits (e value, % identity, alignment)
A0A5D3BBH7 Ty3/gypsy retrotransposon protein (e value: 6.2e-213; % identity: 74.13)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        VFSKLDLKSGYHQIRM+E+D+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMNQVFKPFLR CVLVFFDDILVYS+D+TEHEKHLGMVFA +RDNQL A
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP
        N+KKCV AHSQI YLGH+IS  GVEAD +K+K M+ WP+PKDVTGLRGFLGLTGYYRRFVKGYGEIA PLTKLLQKN+F W E AT AF+ LK AM+T+P
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP

Query:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT
        VLALPDW+LPF+IETDASG  LGAVLSQN HPIAFFSQ LS RA+ KSIYERELMAVVLS                            REVQPQFQKWLT
Subjt:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT

Query:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH
        KLLGYDFEILYQPGLQNKA DALS+++  +E++++STTGIV+M VV KEVE DEEL+ +I++L+ NP    K+   NG L+YK R+VLSK S++IP+LLH
Subjt:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH

Query:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL
        TFHDSILGGHSGFLRTYKRM GEL+WKGMK D+KKYV++CE+CQRNK EATKPAGVLQP+PIP+RILE W+MDFIEGLPKAGGMN   +++
Subjt:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL

A0A5D3DU86 Ty3/gypsy retrotransposon protein (e value: 6.2e-213; % identity: 74.13)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        VFSKLDLKSGYHQIRM+E+D+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMNQVFKPFLR CVLVFFDDILVYS+D+TEHEKHLGMVFA +RDNQL A
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP
        N+KKCV AHSQI YLGH+IS  GVEAD +K+K M+ WP+PKDVTGLRGFLGLTGYYRRFVKGYGEIA PLTKLLQKN+F W E AT AF+ LK AM+T+P
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP

Query:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT
        VLALPDW+LPF+IETDASG  LGAVLSQN HPIAFFSQ LS RA+ KSIYERELMAVVLS                            REVQPQFQKWLT
Subjt:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT

Query:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH
        KLLGYDFEILYQPGLQNKA DALS+++  +E++++STTGIV+M VV KEVE DEEL+ +I++L+ NP    K+   NG L+YK R+VLSK S++IP+LLH
Subjt:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH

Query:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL
        TFHDSILGGHSGFLRTYKRM GEL+WKGMK D+KKYV++CE+CQRNK EATKPAGVLQP+PIP+RILE W+MDFIEGLPKAGGMN   +++
Subjt:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL

A0A5D3DWA9 Ty3/gypsy retrotransposon protein (e value: 6.2e-213; % identity: 74.13)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        VFSKLDLKSGYHQIRM+E+D+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMNQVFKPFLR CVLVFFDDILVYS+D+TEHEKHLGMVFA +RDNQL A
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP
        N+KKCV AHSQI YLGH+IS  GVEAD +K+K M+ WP+PKDVTGLRGFLGLTGYYRRFVKGYGEIA PLTKLLQKN+F W E AT AF+ LK AM+T+P
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP

Query:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT
        VLALPDW+LPF+IETDASG  LGAVLSQN HPIAFFSQ LS RA+ KSIYERELMAVVLS                            REVQPQFQKWLT
Subjt:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT

Query:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH
        KLLGYDFEILYQPGLQNKA DALS+++  +E++++STTGIV+M VV KEVE DEEL+ +I++L+ NP    K+   NG L+YK R+VLSK S++IP+LLH
Subjt:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH

Query:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL
        TFHDSILGGHSGFLRTYKRM GEL+WKGMK D+KKYV++CE+CQRNK EATKPAGVLQP+PIP+RILE W+MDFIEGLPKAGGMN   +++
Subjt:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL

A0A5D3DZK6 Ty3/gypsy retrotransposon protein (e value: 6.2e-213; % identity: 74.13)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        VFSKLDLKSGYHQIRM+E+D+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMNQVFKPFLR CVLVFFDDILVYS+D+TEHEKHLGMVFA +RDNQL A
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP
        N+KKCV AHSQI YLGH+IS  GVEAD +K+K M+ WP+PKDVTGLRGFLGLTGYYRRFVKGYGEIA PLTKLLQKN+F W E AT AF+ LK AM+T+P
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP

Query:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT
        VLALPDW+LPF+IETDASG  LGAVLSQN HPIAFFSQ LS RA+ KSIYERELMAVVLS                            REVQPQFQKWLT
Subjt:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT

Query:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH
        KLLGYDFEILYQPGLQNKA DALS+++  +E++++STTGIV+M VV KEVE DEEL+ +I++L+ NP    K+   NG L+YK R+VLSK S++IP+LLH
Subjt:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH

Query:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL
        TFHDSILGGHSGFLRTYKRM GEL+WKGMK D+KKYV++CE+CQRNK EATKPAGVLQP+PIP+RILE W+MDFIEGLPKAGGMN   +++
Subjt:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL

A0A5D3E325 Ty3/gypsy retrotransposon protein (e value: 6.2e-213; % identity: 74.13)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        VFSKLDLKSGYHQIRM+E+D+EKTAFRTHEGHYEF+VMPFGLTNAPATFQSLMNQVFKPFLR CVLVFFDDILVYS+D+TEHEKHLGMVFA +RDNQL A
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP
        N+KKCV AHSQI YLGH+IS  GVEAD +K+K M+ WP+PKDVTGLRGFLGLTGYYRRFVKGYGEIA PLTKLLQKN+F W E AT AF+ LK AM+T+P
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLP

Query:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT
        VLALPDW+LPF+IETDASG  LGAVLSQN HPIAFFSQ LS RA+ KSIYERELMAVVLS                            REVQPQFQKWLT
Subjt:  VLALPDWNLPFIIETDASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLS----------------------------REVQPQFQKWLT

Query:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH
        KLLGYDFEILYQPGLQNKA DALS+++  +E++++STTGIV+M VV KEVE DEEL+ +I++L+ NP    K+   NG L+YK R+VLSK S++IP+LLH
Subjt:  KLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDEELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLH

Query:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL
        TFHDSILGGHSGFLRTYKRM GEL+WKGMK D+KKYV++CE+CQRNK EATKPAGVLQP+PIP+RILE W+MDFIEGLPKAGGMN   +++
Subjt:  TFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEGLPKAGGMNAEQILL

SwissProt top hits (e value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein (e value: 8.1e-69; % identity: 32.42)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        +F+KLDLKS YH IR+++ D  K AFR   G +E+LVMP+G++ APA FQ  +N +      S V+ + DDIL++S   +EH KH+  V   +++  L+ 
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKN-SFLWGEEATEAFDKLKLAMTTL
        N+ KC    SQ++++G+ IS +G     E I  ++ W QPK+   LR FLG   Y R+F+    ++  PL  LL+K+  + W    T+A + +K  + + 
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKN-SFLWGEEATEAFDKLKLAMTTL

Query:  PVLALPDWNLPFIIETDASGIALGAVLSQNG-----HPIAFFSQILSNRAKTKSIYERELMAVV-----------------------------LSREVQP
        PVL   D++   ++ETDAS +A+GAVLSQ       +P+ ++S  +S      S+ ++E++A++                             ++ E +P
Subjt:  PVLALPDWNLPFIIETDASGIALGAVLSQNG-----HPIAFFSQILSNRAKTKSIYERELMAVV-----------------------------LSREVQP

Query:  Q---FQKWLTKLLGYDFEILYQPGLQNKAVDALSQI---EQPVEMRSM-STTGIVNMV--------VVEKEVELDEELKAIIEELKKNPHESNKFQWVNG
        +     +W   L  ++FEI Y+PG  N   DALS+I    +P+   S  ++   VN +         V  E   D +L  ++    K   E+   Q  +G
Subjt:  Q---FQKWLTKLLGYDFEILYQPGLQNKAVDALSQI---EQPVEMRSM-STTGIVNMV--------VVEKEVELDEELKAIIEELKKNPHESNKFQWVNG

Query:  NLLYKK-RIVLSKESTLIPTLLHTFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEG
         L+  K +I+L  ++ L  T++  +H+     H G       +     WKG++  +++YVQ C  CQ NK    KP G LQPIP  ER  E  SMDFI  
Subjt:  NLLYKK-RIVLSKESTLIPTLLHTFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEG

Query:  LPKAGGMNA
        LP++ G NA
Subjt:  LPKAGGMNA

P0CT35 Transposon Tf2-2 polyprotein (e value: 8.1e-69; % identity: 32.42)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        +F+KLDLKS YH IR+++ D  K AFR   G +E+LVMP+G++ APA FQ  +N +      S V+ + DDIL++S   +EH KH+  V   +++  L+ 
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKN-SFLWGEEATEAFDKLKLAMTTL
        N+ KC    SQ++++G+ IS +G     E I  ++ W QPK+   LR FLG   Y R+F+    ++  PL  LL+K+  + W    T+A + +K  + + 
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKN-SFLWGEEATEAFDKLKLAMTTL

Query:  PVLALPDWNLPFIIETDASGIALGAVLSQNG-----HPIAFFSQILSNRAKTKSIYERELMAVV-----------------------------LSREVQP
        PVL   D++   ++ETDAS +A+GAVLSQ       +P+ ++S  +S      S+ ++E++A++                             ++ E +P
Subjt:  PVLALPDWNLPFIIETDASGIALGAVLSQNG-----HPIAFFSQILSNRAKTKSIYERELMAVV-----------------------------LSREVQP

Query:  Q---FQKWLTKLLGYDFEILYQPGLQNKAVDALSQI---EQPVEMRSM-STTGIVNMV--------VVEKEVELDEELKAIIEELKKNPHESNKFQWVNG
        +     +W   L  ++FEI Y+PG  N   DALS+I    +P+   S  ++   VN +         V  E   D +L  ++    K   E+   Q  +G
Subjt:  Q---FQKWLTKLLGYDFEILYQPGLQNKAVDALSQI---EQPVEMRSM-STTGIVNMV--------VVEKEVELDEELKAIIEELKKNPHESNKFQWVNG

Query:  NLLYKK-RIVLSKESTLIPTLLHTFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEG
         L+  K +I+L  ++ L  T++  +H+     H G       +     WKG++  +++YVQ C  CQ NK    KP G LQPIP  ER  E  SMDFI  
Subjt:  NLLYKK-RIVLSKESTLIPTLLHTFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEG

Query:  LPKAGGMNA
        LP++ G NA
Subjt:  LPKAGGMNA

P0CT36 Transposon Tf2-3 polyprotein (e value: 8.1e-69; % identity: 32.42)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        +F+KLDLKS YH IR+++ D  K AFR   G +E+LVMP+G++ APA FQ  +N +      S V+ + DDIL++S   +EH KH+  V   +++  L+ 
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKN-SFLWGEEATEAFDKLKLAMTTL
        N+ KC    SQ++++G+ IS +G     E I  ++ W QPK+   LR FLG   Y R+F+    ++  PL  LL+K+  + W    T+A + +K  + + 
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKN-SFLWGEEATEAFDKLKLAMTTL

Query:  PVLALPDWNLPFIIETDASGIALGAVLSQNG-----HPIAFFSQILSNRAKTKSIYERELMAVV-----------------------------LSREVQP
        PVL   D++   ++ETDAS +A+GAVLSQ       +P+ ++S  +S      S+ ++E++A++                             ++ E +P
Subjt:  PVLALPDWNLPFIIETDASGIALGAVLSQNG-----HPIAFFSQILSNRAKTKSIYERELMAVV-----------------------------LSREVQP

Query:  Q---FQKWLTKLLGYDFEILYQPGLQNKAVDALSQI---EQPVEMRSM-STTGIVNMV--------VVEKEVELDEELKAIIEELKKNPHESNKFQWVNG
        +     +W   L  ++FEI Y+PG  N   DALS+I    +P+   S  ++   VN +         V  E   D +L  ++    K   E+   Q  +G
Subjt:  Q---FQKWLTKLLGYDFEILYQPGLQNKAVDALSQI---EQPVEMRSM-STTGIVNMV--------VVEKEVELDEELKAIIEELKKNPHESNKFQWVNG

Query:  NLLYKK-RIVLSKESTLIPTLLHTFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEG
         L+  K +I+L  ++ L  T++  +H+     H G       +     WKG++  +++YVQ C  CQ NK    KP G LQPIP  ER  E  SMDFI  
Subjt:  NLLYKK-RIVLSKESTLIPTLLHTFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEG

Query:  LPKAGGMNA
        LP++ G NA
Subjt:  LPKAGGMNA

P0CT37 Transposon Tf2-4 polyprotein (e value: 8.1e-69; % identity: 32.42)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        +F+KLDLKS YH IR+++ D  K AFR   G +E+LVMP+G++ APA FQ  +N +      S V+ + DDIL++S   +EH KH+  V   +++  L+ 
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKN-SFLWGEEATEAFDKLKLAMTTL
        N+ KC    SQ++++G+ IS +G     E I  ++ W QPK+   LR FLG   Y R+F+    ++  PL  LL+K+  + W    T+A + +K  + + 
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKN-SFLWGEEATEAFDKLKLAMTTL

Query:  PVLALPDWNLPFIIETDASGIALGAVLSQNG-----HPIAFFSQILSNRAKTKSIYERELMAVV-----------------------------LSREVQP
        PVL   D++   ++ETDAS +A+GAVLSQ       +P+ ++S  +S      S+ ++E++A++                             ++ E +P
Subjt:  PVLALPDWNLPFIIETDASGIALGAVLSQNG-----HPIAFFSQILSNRAKTKSIYERELMAVV-----------------------------LSREVQP

Query:  Q---FQKWLTKLLGYDFEILYQPGLQNKAVDALSQI---EQPVEMRSM-STTGIVNMV--------VVEKEVELDEELKAIIEELKKNPHESNKFQWVNG
        +     +W   L  ++FEI Y+PG  N   DALS+I    +P+   S  ++   VN +         V  E   D +L  ++    K   E+   Q  +G
Subjt:  Q---FQKWLTKLLGYDFEILYQPGLQNKAVDALSQI---EQPVEMRSM-STTGIVNMV--------VVEKEVELDEELKAIIEELKKNPHESNKFQWVNG

Query:  NLLYKK-RIVLSKESTLIPTLLHTFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEG
         L+  K +I+L  ++ L  T++  +H+     H G       +     WKG++  +++YVQ C  CQ NK    KP G LQPIP  ER  E  SMDFI  
Subjt:  NLLYKK-RIVLSKESTLIPTLLHTFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEG

Query:  LPKAGGMNA
        LP++ G NA
Subjt:  LPKAGGMNA

P0CT41 Transposon Tf2-12 polyprotein (e value: 8.1e-69; % identity: 32.42)
Query:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA
        +F+KLDLKS YH IR+++ D  K AFR   G +E+LVMP+G++ APA FQ  +N +      S V+ + DDIL++S   +EH KH+  V   +++  L+ 
Subjt:  VFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVA

Query:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKN-SFLWGEEATEAFDKLKLAMTTL
        N+ KC    SQ++++G+ IS +G     E I  ++ W QPK+   LR FLG   Y R+F+    ++  PL  LL+K+  + W    T+A + +K  + + 
Subjt:  NKKKCVIAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKN-SFLWGEEATEAFDKLKLAMTTL

Query:  PVLALPDWNLPFIIETDASGIALGAVLSQNG-----HPIAFFSQILSNRAKTKSIYERELMAVV-----------------------------LSREVQP
        PVL   D++   ++ETDAS +A+GAVLSQ       +P+ ++S  +S      S+ ++E++A++                             ++ E +P
Subjt:  PVLALPDWNLPFIIETDASGIALGAVLSQNG-----HPIAFFSQILSNRAKTKSIYERELMAVV-----------------------------LSREVQP

Query:  Q---FQKWLTKLLGYDFEILYQPGLQNKAVDALSQI---EQPVEMRSM-STTGIVNMV--------VVEKEVELDEELKAIIEELKKNPHESNKFQWVNG
        +     +W   L  ++FEI Y+PG  N   DALS+I    +P+   S  ++   VN +         V  E   D +L  ++    K   E+   Q  +G
Subjt:  Q---FQKWLTKLLGYDFEILYQPGLQNKAVDALSQI---EQPVEMRSM-STTGIVNMV--------VVEKEVELDEELKAIIEELKKNPHESNKFQWVNG

Query:  NLLYKK-RIVLSKESTLIPTLLHTFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEG
         L+  K +I+L  ++ L  T++  +H+     H G       +     WKG++  +++YVQ C  CQ NK    KP G LQPIP  ER  E  SMDFI  
Subjt:  NLLYKK-RIVLSKESTLIPTLLHTFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPERILEGWSMDFIEG

Query:  LPKAGGMNA
        LP++ G NA
Subjt:  LPKAGGMNA

Arabidopsis top hits (e value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e value: 1.8e-39; % identity: 60.77)
Query:  HLGMVFAVMRDNQLVANKKKCVIAHSQIQYLG--HLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWG
        HLGMV  +   +Q  AN+KKC     QI YLG  H+IS  GV AD  K++ MV WP+PK+ T LRGFLGLTGYYRRFVK YG+I  PLT+LL+KNS  W 
Subjt:  HLGMVFAVMRDNQLVANKKKCVIAHSQIQYLG--HLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWG

Query:  EEATEAFDKLKLAMTTLPVLALPDWNLPFI
        E A  AF  LK A+TTLPVLALPD  LPF+
Subjt:  EEATEAFDKLKLAMTTLPVLALPDWNLPFI
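In the alignment blocks above, the unlabeled middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for one segment. The function name `midline_stats` is hypothetical, and the percentages cover only the segment passed in, so they will generally differ from the whole-hit % identity reported in the hit header.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Summarise one aligned segment from its query line and midline.

    The midline is padded to the query length because trailing spaces
    (mismatches at the end of a segment) are often lost in plain text.
    """
    length = len(query)
    padded = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in padded)
    positives = identities + padded.count("+")
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": round(100.0 * identities / length, 2),
        "pct_positives": round(100.0 * positives / length, 2),
    }


# Last segment of the ATMG00860.1 alignment, copied from the record.
query = "EEATEAFDKLKLAMTTLPVLALPDWNLPFI"
midline = "E A  AF  LK A+TTLPVLALPD  LPF+"
print(midline_stats(query, midline))
```

To reproduce a whole-hit figure, the same counts would have to be summed over every segment of that hit before dividing by the total alignment length.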


Sequences
CDS sequence
ATGGAGGGTGGAGTTTTCTCAAAATTGGATCTCAAATCTGGATATCATCAGATACGGATGAAGGAGGATGATGTGGAAAAGACAGCGTTCAGAACCCATGAAGGACACTA
CGAGTTTCTGGTCATGCCGTTCGGCCTCACTAACGCACCAGCTACTTTCCAATCATTAATGAATCAGGTCTTTAAACCCTTCCTCAGGAGTTGTGTACTGGTATTTTTTG
ATGATATATTAGTGTACAGCACAGATCTCACAGAGCACGAAAAGCATTTGGGCATGGTTTTTGCAGTAATGCGGGATAACCAATTAGTTGCCAACAAAAAAAAATGCGTA
ATAGCTCATTCACAGATTCAATATTTGGGGCATTTAATATCCAGTAGAGGGGTTGAAGCTGATGGAGAAAAGATCAAGGATATGGTAAATTGGCCCCAACCCAAAGATGT
AACCGGATTGAGGGGGTTCTTAGGTCTGACTGGTTACTATAGAAGATTCGTCAAAGGGTATGGAGAGATAGCAGGACCGTTGACCAAATTATTGCAGAAGAACTCATTCC
TATGGGGGGAAGAAGCAACAGAGGCGTTTGATAAGCTGAAATTAGCCATGACAACCCTACCTGTACTAGCTCTACCAGATTGGAACCTACCTTTCATCATTGAAACGGAT
GCGTCCGGGATTGCTTTAGGGGCAGTTCTATCTCAAAATGGCCATCCCATAGCTTTTTTCAGTCAAATACTATCGAACCGAGCTAAAACCAAATCCATATATGAGAGGGA
ATTGATGGCTGTGGTTCTGTCGAGGGAAGTGCAACCCCAATTCCAGAAGTGGCTGACTAAACTCCTCGGGTATGACTTCGAAATACTTTACCAACCCGGATTACAAAACA
AAGCAGTCGATGCCCTCTCCCAAATCGAACAGCCAGTGGAAATGAGGAGTATGTCCACCACGGGTATTGTCAACATGGTGGTGGTTGAGAAGGAAGTTGAGTTAGATGAA
GAGCTCAAGGCAATCATTGAAGAATTAAAAAAAAATCCTCACGAGTCCAATAAATTCCAATGGGTGAATGGAAACCTACTGTATAAGAAGCGAATTGTTTTGTCAAAAGA
ATCCACTCTGATCCCCACCTTACTACATACGTTTCATGACTCCATTTTAGGAGGCCATTCCGGATTCTTAAGGACGTATAAAAGGATGTGTGGGGAATTGTATTGGAAAG
GTATGAAGGCTGATGTTAAAAAATATGTGCAGGAATGCGAGGTTTGCCAGAGAAATAAGTTGGAAGCAACTAAACCAGCTGGAGTTCTGCAGCCAATTCCAATTCCAGAA
AGAATCTTGGAAGGCTGGTCCATGGACTTCATTGAAGGGCTACCTAAAGCAGGAGGTATGAATGCTGAGCAAATACTCCTACTTTATCACCATGAGGCATCCTTTTAA
mRNA sequence
ATGGAGGGTGGAGTTTTCTCAAAATTGGATCTCAAATCTGGATATCATCAGATACGGATGAAGGAGGATGATGTGGAAAAGACAGCGTTCAGAACCCATGAAGGACACTA
CGAGTTTCTGGTCATGCCGTTCGGCCTCACTAACGCACCAGCTACTTTCCAATCATTAATGAATCAGGTCTTTAAACCCTTCCTCAGGAGTTGTGTACTGGTATTTTTTG
ATGATATATTAGTGTACAGCACAGATCTCACAGAGCACGAAAAGCATTTGGGCATGGTTTTTGCAGTAATGCGGGATAACCAATTAGTTGCCAACAAAAAAAAATGCGTA
ATAGCTCATTCACAGATTCAATATTTGGGGCATTTAATATCCAGTAGAGGGGTTGAAGCTGATGGAGAAAAGATCAAGGATATGGTAAATTGGCCCCAACCCAAAGATGT
AACCGGATTGAGGGGGTTCTTAGGTCTGACTGGTTACTATAGAAGATTCGTCAAAGGGTATGGAGAGATAGCAGGACCGTTGACCAAATTATTGCAGAAGAACTCATTCC
TATGGGGGGAAGAAGCAACAGAGGCGTTTGATAAGCTGAAATTAGCCATGACAACCCTACCTGTACTAGCTCTACCAGATTGGAACCTACCTTTCATCATTGAAACGGAT
GCGTCCGGGATTGCTTTAGGGGCAGTTCTATCTCAAAATGGCCATCCCATAGCTTTTTTCAGTCAAATACTATCGAACCGAGCTAAAACCAAATCCATATATGAGAGGGA
ATTGATGGCTGTGGTTCTGTCGAGGGAAGTGCAACCCCAATTCCAGAAGTGGCTGACTAAACTCCTCGGGTATGACTTCGAAATACTTTACCAACCCGGATTACAAAACA
AAGCAGTCGATGCCCTCTCCCAAATCGAACAGCCAGTGGAAATGAGGAGTATGTCCACCACGGGTATTGTCAACATGGTGGTGGTTGAGAAGGAAGTTGAGTTAGATGAA
GAGCTCAAGGCAATCATTGAAGAATTAAAAAAAAATCCTCACGAGTCCAATAAATTCCAATGGGTGAATGGAAACCTACTGTATAAGAAGCGAATTGTTTTGTCAAAAGA
ATCCACTCTGATCCCCACCTTACTACATACGTTTCATGACTCCATTTTAGGAGGCCATTCCGGATTCTTAAGGACGTATAAAAGGATGTGTGGGGAATTGTATTGGAAAG
GTATGAAGGCTGATGTTAAAAAATATGTGCAGGAATGCGAGGTTTGCCAGAGAAATAAGTTGGAAGCAACTAAACCAGCTGGAGTTCTGCAGCCAATTCCAATTCCAGAA
AGAATCTTGGAAGGCTGGTCCATGGACTTCATTGAAGGGCTACCTAAAGCAGGAGGTATGAATGCTGAGCAAATACTCCTACTTTATCACCATGAGGCATCCTTTTAA
Protein sequence
MEGGVFSKLDLKSGYHQIRMKEDDVEKTAFRTHEGHYEFLVMPFGLTNAPATFQSLMNQVFKPFLRSCVLVFFDDILVYSTDLTEHEKHLGMVFAVMRDNQLVANKKKCV
IAHSQIQYLGHLISSRGVEADGEKIKDMVNWPQPKDVTGLRGFLGLTGYYRRFVKGYGEIAGPLTKLLQKNSFLWGEEATEAFDKLKLAMTTLPVLALPDWNLPFIIETD
ASGIALGAVLSQNGHPIAFFSQILSNRAKTKSIYERELMAVVLSREVQPQFQKWLTKLLGYDFEILYQPGLQNKAVDALSQIEQPVEMRSMSTTGIVNMVVVEKEVELDE
ELKAIIEELKKNPHESNKFQWVNGNLLYKKRIVLSKESTLIPTLLHTFHDSILGGHSGFLRTYKRMCGELYWKGMKADVKKYVQECEVCQRNKLEATKPAGVLQPIPIPE
RILEGWSMDFIEGLPKAGGMNAEQILLLYHHEASF
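The CDS and protein sequences listed above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch under that assumption (the record does not say which translation table was used); `CODON_TABLE` and `translate` are illustrative names, not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 51 nucleotides of the CDS listed in this record.
print(translate("ATGGAGGGTGGAGTTTTCTCAAAATTGGATCTCAAATCTGGATATCATCAG"))
```

The printed fragment should match the first 17 residues of the protein sequence, `MEGGVFSKLDLKSGYHQ`. Passing the full CDS, with its line breaks left in, gives the whole protein for comparison.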