; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G12070 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G12070
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr4:10388171..10389485
RNA-Seq ExpressionCSPI04G12070
SyntenyCSPI04G12070
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0015074 - DNA integration (biological process)
GO:0030154 - cell differentiation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus]1.1e-12069.23Show/hide
Query:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        MPFGLTNAPATF S+ ++           VFFDDILVYS DI EH KHLGMVF VL+DN LFANKKKCVIAHS+IQY+GH+IS KGV+ADEEKIK+M+ W
Subjt:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------
        P PKDV+ LR FLGL+ YYRRFVK YGEI A LT+LLQKN F W+ +ATVAFE LK AMTT+ VLALP+W+LPF IET+ASG  LG              
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------

Query:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT
         K   RAQ KSIYERELM VVLSVQKWRHYLLG+KFTIISDQKALKFLLEQR+VQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRME  +E+NS+TT
Subjt:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT

Query:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKYQW
          IVD+E+I +E   DEELQK  + L++N     K+ W
Subjt:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKYQW

KAE8637598.1 hypothetical protein CSA_022681 [Cucumis sativus]6.7e-12670.41Show/hide
Query:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        MPFGLTNAPATF S+ +E           VFF DILVYS DIDEH KHLGMVF +L+D++LFAN+ KCVIAHSQ+QY+GHLIS +GVEADE+KI++M+NW
Subjt:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------
        P PKD++ LR FLGLT YYRRFVKSYGEI A LTKLLQKN F+WN EAT+AF+ LKLAMTTL VLALPDW+ PFTIET+ASG+ LG              
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------

Query:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT
         K  PRAQ KSIYERELMAVVLSVQKWRHYLLG+KFTI+SDQKALKFLLEQR+VQPQFQKWLTKLLGYDFEILYQPGLQNK ADALSR +  VELN+MTT
Subjt:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT

Query:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKYQW
        T IVD+E+I +E + D+ELQKI   L+   ++  KYQW
Subjt:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKYQW

KGN62557.2 hypothetical protein Csa_018739 [Cucumis sativus]5.9e-12267.73Show/hide
Query:  MPFGLTNAPATFSITDE-----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        MPFGLTNAPATF                 VFFDDIL+YS ++ EH KHL MVF V++DNQL ANKKKCVIAHSQIQY+GHLIS +GVEAD +KIK+M+NW
Subjt:  MPFGLTNAPATFSITDE-----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------
        P PKDV+ LR FLGLT YYRRFVK YGE+   LTKLLQKN F W  EAT AF+ LKLAMTTL VLALPDWNLPF IET+ASGI LG              
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------

Query:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT
         K   RA+TKSIYERELMAVVLSVQKWRHYLLG+KFTIISDQ+ALKFLLEQR+VQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSR+E+PVE+ +M+T
Subjt:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT

Query:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKYQWVTPDFY
        T IV++E++ +E + DEEL+ I   L++N +E  K+QWV  + +
Subjt:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKYQWVTPDFY

TYK28944.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.0e-11867.56Show/hide
Query:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        MPFGLTNAPATF S+ ++           VFFDDILVYS+DI EH KHLGMVF  L+DNQL+AN+KKCV AHSQI Y+GH+ISK GVEAD++K+K+M+ W
Subjt:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------
        P PKDV+ LR FLGLT YYRRFVK YGEI A LTKLLQKN F W+  AT+AFESLK AM+T+ VLALPDW+LPF IET+ASG  LG              
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------

Query:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT
         K   RAQ KSIYERELMAVVLSVQKWRHYLLG++FTI+SDQKALKFLLEQR+VQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRM+  +EL +++T
Subjt:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT

Query:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKY
        T IVD+E++ +E + DEELQ + ++L+ N   E KY
Subjt:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKY

XP_031745419.1 uncharacterized protein LOC116405630 [Cucumis sativus]6.7e-12670.41Show/hide
Query:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        MPFGLTNAPATF S+ +E           VFF DILVYS DIDEH KHLGMVF +L+D++LFAN+ KCVIAHSQ+QY+GHLIS +GVEADE+KI++M+NW
Subjt:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------
        P PKD++ LR FLGLT YYRRFVKSYGEI A LTKLLQKN F+WN EAT+AF+ LKLAMTTL VLALPDW+ PFTIET+ASG+ LG              
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------

Query:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT
         K  PRAQ KSIYERELMAVVLSVQKWRHYLLG+KFTI+SDQKALKFLLEQR+VQPQFQKWLTKLLGYDFEILYQPGLQNK ADALSR +  VELN+MTT
Subjt:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT

Query:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKYQW
        T IVD+E+I +E + D+ELQKI   L+   ++  KYQW
Subjt:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKYQW

TrEMBL top hitse value%identityAlignment
A0A5D3BBH7 Ty3/gypsy retrotransposon protein5.0e-11967.56Show/hide
Query:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        MPFGLTNAPATF S+ ++           VFFDDILVYS+DI EH KHLGMVF  L+DNQL+AN+KKCV AHSQI Y+GH+ISK GVEAD++K+K+M+ W
Subjt:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------
        P PKDV+ LR FLGLT YYRRFVK YGEI A LTKLLQKN F W+  AT+AFESLK AM+T+ VLALPDW+LPF IET+ASG  LG              
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------

Query:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT
         K   RAQ KSIYERELMAVVLSVQKWRHYLLG++FTI+SDQKALKFLLEQR+VQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRM+  +EL +++T
Subjt:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT

Query:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKY
        T IVD+E++ +E + DEELQ + ++L+ N   E KY
Subjt:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKY

A0A5D3DU86 Ty3/gypsy retrotransposon protein5.0e-11967.56Show/hide
Query:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        MPFGLTNAPATF S+ ++           VFFDDILVYS+DI EH KHLGMVF  L+DNQL+AN+KKCV AHSQI Y+GH+ISK GVEAD++K+K+M+ W
Subjt:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------
        P PKDV+ LR FLGLT YYRRFVK YGEI A LTKLLQKN F W+  AT+AFESLK AM+T+ VLALPDW+LPF IET+ASG  LG              
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------

Query:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT
         K   RAQ KSIYERELMAVVLSVQKWRHYLLG++FTI+SDQKALKFLLEQR+VQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRM+  +EL +++T
Subjt:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT

Query:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKY
        T IVD+E++ +E + DEELQ + ++L+ N   E KY
Subjt:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKY

A0A5D3DWA9 Ty3/gypsy retrotransposon protein5.0e-11967.56Show/hide
Query:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        MPFGLTNAPATF S+ ++           VFFDDILVYS+DI EH KHLGMVF  L+DNQL+AN+KKCV AHSQI Y+GH+ISK GVEAD++K+K+M+ W
Subjt:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------
        P PKDV+ LR FLGLT YYRRFVK YGEI A LTKLLQKN F W+  AT+AFESLK AM+T+ VLALPDW+LPF IET+ASG  LG              
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------

Query:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT
         K   RAQ KSIYERELMAVVLSVQKWRHYLLG++FTI+SDQKALKFLLEQR+VQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRM+  +EL +++T
Subjt:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT

Query:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKY
        T IVD+E++ +E + DEELQ + ++L+ N   E KY
Subjt:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKY

A0A5D3DZK6 Ty3/gypsy retrotransposon protein5.0e-11967.56Show/hide
Query:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        MPFGLTNAPATF S+ ++           VFFDDILVYS+DI EH KHLGMVF  L+DNQL+AN+KKCV AHSQI Y+GH+ISK GVEAD++K+K+M+ W
Subjt:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------
        P PKDV+ LR FLGLT YYRRFVK YGEI A LTKLLQKN F W+  AT+AFESLK AM+T+ VLALPDW+LPF IET+ASG  LG              
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------

Query:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT
         K   RAQ KSIYERELMAVVLSVQKWRHYLLG++FTI+SDQKALKFLLEQR+VQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRM+  +EL +++T
Subjt:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT

Query:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKY
        T IVD+E++ +E + DEELQ + ++L+ N   E KY
Subjt:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKY

A0A5D3E325 Ty3/gypsy retrotransposon protein5.0e-11967.56Show/hide
Query:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        MPFGLTNAPATF S+ ++           VFFDDILVYS+DI EH KHLGMVF  L+DNQL+AN+KKCV AHSQI Y+GH+ISK GVEAD++K+K+M+ W
Subjt:  MPFGLTNAPATF-SITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------
        P PKDV+ LR FLGLT YYRRFVK YGEI A LTKLLQKN F W+  AT+AFESLK AM+T+ VLALPDW+LPF IET+ASG  LG              
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV-------------

Query:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT
         K   RAQ KSIYERELMAVVLSVQKWRHYLLG++FTI+SDQKALKFLLEQR+VQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRM+  +EL +++T
Subjt:  PKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTT

Query:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKY
        T IVD+E++ +E + DEELQ + ++L+ N   E KY
Subjt:  TEIVDVELICEEAKSDEELQKITRRLERNTNEEQKY

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.64.3e-4334.93Show/hide
Query:  MPFGLTNAPATFS---------ITDER--VFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        MPFGL NAPATF          + ++   V+ DDI+V+S  +DEH + LG+VF+ L    L     KC     +  ++GH+++  G++ + EKI+ +  +
Subjt:  MPFGLTNAPATFS---------ITDER--VFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNG--FNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLG------------
        PIP     ++ FLGLT YYR+F+ ++ +I   +TK L+KN      N E   AF+ LK  ++   +L +PD+   FT+ T+AS + LG            
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNG--FNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLG------------

Query:  VPKTIPRAQTK-SIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRME
        + +T+   +   S  E+EL+A+V + + +RHYLLG+ F I SD + L +L   +    +  +W  KL  +DF+I Y  G +N  ADALSR++
Subjt:  VPKTIPRAQTK-SIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRME

P20825 Retrovirus-related Pol polyprotein from transposon 2976.9e-4133.22Show/hide
Query:  MPFGLTNAPATFSITDER-----------VFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        MPFGL NAPATF                 V+ DDI+++S  + EH   + +VF  L D  L     KC     +  ++GH+++  G++ +  K+K ++++
Subjt:  MPFGLTNAPATFSITDER-----------VFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNG--FNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLG------------
        PIP     +R FLGLT YYR+F+ +Y +I   +T  L+K        +E   AFE LK  +    +L LPD+   F + T+AS + LG            
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNG--FNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLG------------

Query:  VPKTIPRAQTK-SIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRME
        + +T+   +   S  E+EL+A+V + + +RHYLLG++F I SD + L++L   ++   + ++W  +L  Y F+I Y  G +N  ADALSR++
Subjt:  VPKTIPRAQTK-SIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRME

P92523 Uncharacterized mitochondrial protein AtMg008605.3e-3354.26Show/hide
Query:  HLGMVFDVLKDNQLFANKKKCVIAHSQIQYMG--HLISKKGVEADEEKIKNMINWPIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWN
        HLGMV  + + +Q +AN+KKC     QI Y+G  H+IS +GV AD  K++ M+ WP PK+ + LR FLGLT YYRRFVK+YG+I   LT+LL+KN   W 
Subjt:  HLGMVFDVLKDNQLFANKKKCVIAHSQIQYMG--HLISKKGVEADEEKIKNMINWPIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWN

Query:  VEATVAFESLKLAMTTLSVLALPDWNLPF
          A +AF++LK A+TTL VLALPD  LPF
Subjt:  VEATVAFESLKLAMTTLSVLALPDWNLPF

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.1e-3031.16Show/hide
Query:  MPFGLTNAPATFS---------ITDERVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINWPI
        MPFGL NAP+TF+         +    V+ DDIL++S   +EH KHL  V + LK+  L   KKKC  A  + +++G+ I  + +   + K   + ++P 
Subjt:  MPFGLTNAPATFS---------ITDERVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINWPI

Query:  PKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV---------------
        PK V   +RFLG+  YYRRF+ +  +I A   +L   +   W  +   A E LK A+    VL   +    + + T+AS   +G                
Subjt:  PKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGV---------------

Query:  ---PKTIPRAQTK-SIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSR
            K++  AQ      E EL+ ++ ++  +R+ L GK FT+ +D  +L  L  + +   + Q+WL  L  YDF + Y  G +N  ADA+SR
Subjt:  ---PKTIPRAQTK-SIYERELMAVVLSVQKWRHYLLGKKFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus1.5e-3530.6Show/hide
Query:  MPFGLTNAPATFS-ITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW
        +PFGL NAPA F  + D+           V+ DDI+V+S D D H K+L +V   L    L  N +K     +Q++++G++++  G++AD +K++ +   
Subjt:  MPFGLTNAPATFS-ITDE----------RVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINW

Query:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQ------KNGFNWNVEATV------AFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGVP
        P P  V  L+RFLG+T YYR+F++ Y ++   LT L +      K+  +  V  T+      +F  LK  + +  +LA P +  PF + T+AS   +G  
Subjt:  PIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQ------KNGFNWNVEATV------AFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGVP

Query:  KT------------IPRAQTK-----SIYERELMAVVLSVQKWRHYLLGK-KFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAA
         +            I R+  K     +  E+E++A++ S+   R YL G     + +D + L F L  R    + ++W  ++  Y+ E++Y+PG  N  A
Subjt:  KT------------IPRAQTK-----SIYERELMAVVLSVQKWRHYLLGK-KFTIISDQKALKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAA

Query:  DALSRMERPVELNSMTT
        DALSR+  P +LN ++T
Subjt:  DALSRMERPVELNSMTT

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein3.8e-3454.26Show/hide
Query:  HLGMVFDVLKDNQLFANKKKCVIAHSQIQYMG--HLISKKGVEADEEKIKNMINWPIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWN
        HLGMV  + + +Q +AN+KKC     QI Y+G  H+IS +GV AD  K++ M+ WP PK+ + LR FLGLT YYRRFVK+YG+I   LT+LL+KN   W 
Subjt:  HLGMVFDVLKDNQLFANKKKCVIAHSQIQYMG--HLISKKGVEADEEKIKNMINWPIPKDVSSLRRFLGLTRYYRRFVKSYGEICALLTKLLQKNGFNWN

Query:  VEATVAFESLKLAMTTLSVLALPDWNLPF
          A +AF++LK A+TTL VLALPD  LPF
Subjt:  VEATVAFESLKLAMTTLSVLALPDWNLPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTTTTGGCCTCACAAACGCCCCAGCTACCTTTTCAATCACTGATGAACGAGTATTTTTTGACGATATACTAGTCTACAGTGCTGATATTGATGAGCATGCTAAGCA
TTTGGGAATGGTATTTGACGTACTAAAGGATAATCAACTCTTTGCTAATAAGAAGAAATGTGTTATAGCCCATTCGCAGATACAATACATGGGACATTTGATATCAAAAA
AAGGGGTAGAGGCAGACGAAGAGAAGATCAAGAATATGATAAACTGGCCAATTCCGAAGGACGTTTCTAGCTTGAGGAGATTTTTAGGTCTCACGAGATACTATAGACGA
TTTGTAAAGAGTTATGGAGAGATATGTGCCCTGCTAACTAAGTTATTACAAAAAAATGGCTTCAATTGGAATGTGGAAGCAACTGTGGCCTTCGAAAGTTTAAAGTTAGC
GATGACAACTCTTTCAGTATTAGCTTTACCCGATTGGAATTTACCCTTCACCATTGAAACGAATGCATCTGGAATAAGGCTAGGAGTTCCAAAAACTATCCCAAGAGCTC
AAACCAAATCTATTTATGAAAGGGAATTGATGGCGGTAGTTCTTTCAGTCCAAAAATGGAGACACTATTTATTGGGGAAAAAGTTTACCATTATTTCTGACCAAAAGGCT
CTTAAATTCTTATTAGAGCAAAGGAAAGTACAACCCCAATTTCAGAAGTGGTTAACGAAACTTCTTGGATATGATTTTGAGATATTATATCAGCCTGGACTTCAAAACAA
AGCTGCAGATGCCCTTTCAAGAATGGAGCGGCCTGTGGAACTGAACAGCATGACAACCACTGAAATTGTAGATGTGGAACTCATTTGTGAGGAGGCTAAAAGTGATGAGG
AACTTCAAAAGATTACAAGAAGACTGGAAAGAAACACGAATGAGGAACAGAAATACCAATGGGTCACTCCGGATTTCTACAAACTTATAAGAGAATCAGTGGCGACTTAT
ACTGGAAGGCCAATTCCTGTTCCGGATCGAATACTATGGATTTCATTGAGGGGTTACCCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGCCTTTTGGCCTCACAAACGCCCCAGCTACCTTTTCAATCACTGATGAACGAGTATTTTTTGACGATATACTAGTCTACAGTGCTGATATTGATGAGCATGCTAAGCA
TTTGGGAATGGTATTTGACGTACTAAAGGATAATCAACTCTTTGCTAATAAGAAGAAATGTGTTATAGCCCATTCGCAGATACAATACATGGGACATTTGATATCAAAAA
AAGGGGTAGAGGCAGACGAAGAGAAGATCAAGAATATGATAAACTGGCCAATTCCGAAGGACGTTTCTAGCTTGAGGAGATTTTTAGGTCTCACGAGATACTATAGACGA
TTTGTAAAGAGTTATGGAGAGATATGTGCCCTGCTAACTAAGTTATTACAAAAAAATGGCTTCAATTGGAATGTGGAAGCAACTGTGGCCTTCGAAAGTTTAAAGTTAGC
GATGACAACTCTTTCAGTATTAGCTTTACCCGATTGGAATTTACCCTTCACCATTGAAACGAATGCATCTGGAATAAGGCTAGGAGTTCCAAAAACTATCCCAAGAGCTC
AAACCAAATCTATTTATGAAAGGGAATTGATGGCGGTAGTTCTTTCAGTCCAAAAATGGAGACACTATTTATTGGGGAAAAAGTTTACCATTATTTCTGACCAAAAGGCT
CTTAAATTCTTATTAGAGCAAAGGAAAGTACAACCCCAATTTCAGAAGTGGTTAACGAAACTTCTTGGATATGATTTTGAGATATTATATCAGCCTGGACTTCAAAACAA
AGCTGCAGATGCCCTTTCAAGAATGGAGCGGCCTGTGGAACTGAACAGCATGACAACCACTGAAATTGTAGATGTGGAACTCATTTGTGAGGAGGCTAAAAGTGATGAGG
AACTTCAAAAGATTACAAGAAGACTGGAAAGAAACACGAATGAGGAACAGAAATACCAATGGGTCACTCCGGATTTCTACAAACTTATAAGAGAATCAGTGGCGACTTAT
ACTGGAAGGCCAATTCCTGTTCCGGATCGAATACTATGGATTTCATTGAGGGGTTACCCCTAG
Protein sequenceShow/hide protein sequence
MPFGLTNAPATFSITDERVFFDDILVYSADIDEHAKHLGMVFDVLKDNQLFANKKKCVIAHSQIQYMGHLISKKGVEADEEKIKNMINWPIPKDVSSLRRFLGLTRYYRR
FVKSYGEICALLTKLLQKNGFNWNVEATVAFESLKLAMTTLSVLALPDWNLPFTIETNASGIRLGVPKTIPRAQTKSIYERELMAVVLSVQKWRHYLLGKKFTIISDQKA
LKFLLEQRKVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRMERPVELNSMTTTEIVDVELICEEAKSDEELQKITRRLERNTNEEQKYQWVTPDFYKLIRESVATY
TGRPIPVPDRILWISLRGYP