; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0015628 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0015628
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr11:17281182..17283153
RNA-Seq ExpressionIVF0015628
SyntenyIVF0015628
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0015074 - DNA integration (biological process)
GO:0030154 - cell differentiation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0062071.1 Transposon Tf2-6 polyprotein [Cucumis melo var. makuwa]0.089.33Show/hide
Query:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
        MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAP QKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
Subjt:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT

Query:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHE-------------------------
        VSDKFPLPVIEDL DELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHE                         
Subjt:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHE-------------------------

Query:  -----------SGVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFI
                    GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFI
Subjt:  -----------SGVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFI

Query:  IETDALGKGLGQSLSQNGHPIAFFSQKLSICAQVQKWRHYVLGRKFTIISDQKALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLK
        IETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ         G+    I +++    L+  GEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLK
Subjt:  IETDALGKGLGQSLSQNGHPIAFFSQKLSICAQVQKWRHYVLGRKFTIISDQKALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLK

Query:  TTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGE
        TTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGE
Subjt:  TTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGE

Query:  IYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPVLELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGV
        IYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPV ELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGV
Subjt:  IYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPVLELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGV

Query:  PKAIITDQD
        PKAIITDQD
Subjt:  PKAIITDQD

KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus]1.49e-27761.67Show/hide
Query:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
        MIK+LL+QY ++F  P +LPPKR IDH IL  P Q+PINVRPYKYGY+QK EIE LV EMLQAGVI+PSH PYS+PVLLVKKKDGGW F +DYRKLNQ T
Subjt:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT

Query:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP-----------------------------
        +SDKFP+PVIE+LLDELHGA++FSKLD++S YHQI+M+E+D+EKT FR HEGHY+FLVM FGLT  P+ F                              
Subjt:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP-----------------------------

Query:  ---------------------------ISHES-----------GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTF
                                   I+H             GV+AD +K++ M+ WPQPKDVTGLRGFLGL GYYRRFV+GYG+IAAPLTRLL  N+F
Subjt:  ---------------------------ISHES-----------GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTF

Query:  KWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ---------------VQKWRHYVLGRKFTIISDQ
         WD  A  AFEKLKTAM + PVLALP+W LPF+IETDA G GLG  LSQNGHPIAFFSQKLSI AQ               VQKWRHY+LGRKFTIISDQ
Subjt:  KWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ---------------VQKWRHYVLGRKFTIISDQ

Query:  KALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQR
        KALKFLLEQ EVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSR+    E   E+  ++  GI+D+EV+ KEV  D ELQ+ I+ ++++P+  +K+  
Subjt:  KALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQR

Query:  SNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCD---RNKTESTLPAGLLQTIPVLELMLEDWTMDFV
         N KLLYK R+VLSK S++IPTLLHTF DSILGG+S FLRTYKRM GE+YW+GMK DIK Y EQC+   RNK E+T PAG+L  IP  + +LE+W+MDF+
Subjt:  SNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCD---RNKTESTLPAGLLQTIPVLELMLEDWTMDFV

Query:  EGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
        EGLP AG MN IM+VVDRLSKYAYF+T++HPF+AKQVA  FI+K++ KHGVPK+I++D+D
Subjt:  EGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

KGN62557.2 hypothetical protein Csa_018739 [Cucumis sativus]2.06e-28061.67Show/hide
Query:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
        MI+ LL+QY DLF  P  LPPKR  DH IL    QKPINVRPYKYG+ QK EIE L+ EMLQ G+I+PSH PYS+PVLLV+KKDGGW F +DYRKLNQ T
Subjt:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT

Query:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP-----------------------------
        +SDKFP+PVIE+LLDELHGA++FSKLDL+SGYHQI+MKE+D+EKT FR HEGHY+FLVM FGLT  P+ F                              
Subjt:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP-----------------------------

Query:  ---------------------------ISHES-----------GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTF
                                   I+H             GVEADGDK++ M+NWPQPKDVTGLRGFLGL GYYRRFV+GYG++A PLT+LL  N+F
Subjt:  ---------------------------ISHES-----------GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTF

Query:  KWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLS---------------ICAQVQKWRHYVLGRKFTIISDQ
         W   A EAF+KLK AM + PVLALPDW+LPFIIETDA G  LG  LSQNGHPIAFFSQKLS               +   VQKWRHY+LGRKFTIISDQ
Subjt:  KWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLS---------------ICAQVQKWRHYVLGRKFTIISDQ

Query:  KALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQR
        +ALKFLLEQ EVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSR+    EQP E+  MS  GI+++EVV KEVE D EL+ +IE ++++P+  +K+Q 
Subjt:  KALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQR

Query:  SNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCD---RNKTESTLPAGLLQTIPVLELMLEDWTMDFV
         N  L YK R+VLSK STLIPTLLHTF DSILGG+S FLRTYKRM GE+YW GMK D+K Y ++C+   RNK E+T PAG+LQ IP+ E +LEDW+MDF+
Subjt:  SNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCD---RNKTESTLPAGLLQTIPVLELMLEDWTMDFV

Query:  EGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
        EGLP AG MN IM++VDRLSKY+YF+T+RHPF+A+QVAEVFID+V+ +HG+PK+II+D+D
Subjt:  EGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

TYJ96663.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.29e-27562.27Show/hide
Query:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
        MI+ LL QY D+F +P+ LPPKR IDH ILT P QKPINVRPYKYG+ QK EIE LV EMLQ G+I+PSH P+S+PVLLVKKKDGGW F +DYRKLN+ T
Subjt:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT

Query:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP-----------------------------
        ++DKFP+PVIE+LLDELHGA++FSKLDL+SGYHQI+M+E+DIEKT FR HEGHY+F+VM FGLT  P+ F                              
Subjt:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP-----------------------------

Query:  ISHES--------------------------------------GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTF
          HE                                       GVEAD DKV+ M+ WP+PKDVTGLRGFLGL GYYRRFV+GYG+IAAPLT+LL  N F
Subjt:  ISHES--------------------------------------GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTF

Query:  KWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ---------------VQKWRHYVLGRKFTIISDQ
        KWD  A  AFE LK+AM + PVLALPDWSLPF+IETDA G GLG  LSQN HPIAFFSQKLS  AQ               VQKWRHY+LGR+FTI+SDQ
Subjt:  KWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ---------------VQKWRHYVLGRKFTIISDQ

Query:  KALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQR
        KALKFLLEQ EVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSR+    +   EL  +S  GI+D+EVV KEVE D ELQ LI+ +Q +P    KY  
Subjt:  KALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQR

Query:  SNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCD---RNKTESTLPAGLLQTIPVLELMLEDWTMDFV
        +N  L+YKGR+VLSK S++IP+LLHTF DSILGG+S FLRTYKRM GE++W GMK+DIK Y EQC+   RNK+E+T PAG+LQ +P+ + +LEDWTMDF+
Subjt:  SNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCD---RNKTESTLPAGLLQTIPVLELMLEDWTMDFV

Query:  EGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
        EGLP AG MN IM+VVDRLSKYAYFVT++HPFSAKQVA  FIDK++R+HG+PK+II+D+D
Subjt:  EGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

TYK04949.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]0.095.64Show/hide
Query:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
        MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
Subjt:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT

Query:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHESGVEADGDKVQVMMNWPQPKDVTGL
        VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHESGVEADGDKVQVMMNWPQPKDVTGL
Subjt:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHESGVEADGDKVQVMMNWPQPKDVTGL

Query:  RGFLGLKGYYRRFVEGYGKIAAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQVQK
        RGFLGLKGYYRRFVEGYGKIAAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ   
Subjt:  RGFLGLKGYYRRFVEGYGKIAAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQVQK

Query:  WRHYVLGRKFTIISDQKALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRL
              G+    I +++    L+  GEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRL
Subjt:  WRHYVLGRKFTIISDQKALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRL

Query:  IEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPV
        IEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPV
Subjt:  IEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPV

Query:  LELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
        LELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
Subjt:  LELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

TrEMBL top hitse value%identityAlignment
A0A5A7V8Q6 Transposon Tf2-6 polyprotein2.6e-30989.16Show/hide
Query:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
        MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAP QKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
Subjt:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT

Query:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHE-------------------------
        VSDKFPLPVIEDL DELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHE                         
Subjt:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHE-------------------------

Query:  -----------SGVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFI
                    GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFI
Subjt:  -----------SGVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFI

Query:  IETDALGKGLGQSLSQNGHPIAFFSQKLSICAQVQKWRHYVLGRKFTIISDQKALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLK
        IETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ +             I +++    L+  GEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLK
Subjt:  IETDALGKGLGQSLSQNGHPIAFFSQKLSICAQVQKWRHYVLGRKFTIISDQKALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLK

Query:  TTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGE
        TTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGE
Subjt:  TTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGE

Query:  IYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPVLELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGV
        IYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPV ELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGV
Subjt:  IYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPVLELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGV

Query:  PKAIITDQD
        PKAIITDQD
Subjt:  PKAIITDQD

A0A5D3BBH7 Ty3/gypsy retrotransposon protein1.0e-23062.27Show/hide
Query:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
        MI+ LL QY D+F +P+ LPPKR IDH ILT P QKPINVRPYKYG+ QK EIE LV EMLQ G+I+PSH P+S+PVLLVKKKDGGW F +DYRKLN+ T
Subjt:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT

Query:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP-----------------------------
        ++DKFP+PVIE+LLDELHGA++FSKLDL+SGYHQI+M+E+DIEKT FR HEGHY+F+VM FGLT  P+ F                              
Subjt:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP-----------------------------

Query:  ISHES--------------------------------------GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTF
          HE                                       GVEAD DKV+ M+ WP+PKDVTGLRGFLGL GYYRRFV+GYG+IAAPLT+LL  N F
Subjt:  ISHES--------------------------------------GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTF

Query:  KWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ---------------VQKWRHYVLGRKFTIISDQ
        KWD  A  AFE LK+AM + PVLALPDWSLPF+IETDA G GLG  LSQN HPIAFFSQKLS  AQ               VQKWRHY+LGR+FTI+SDQ
Subjt:  KWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ---------------VQKWRHYVLGRKFTIISDQ

Query:  KALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQR
        KALKFLLEQ EVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSR+    +   EL  +S  GI+D+EVV KEVE D ELQ LI+ +Q +P    KY  
Subjt:  KALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQR

Query:  SNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCD---RNKTESTLPAGLLQTIPVLELMLEDWTMDFV
        +N  L+YKGR+VLSK S++IP+LLHTF DSILGG+S FLRTYKRM GE++W GMK+DIK Y EQC+   RNK+E+T PAG+LQ +P+ + +LEDWTMDF+
Subjt:  SNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCD---RNKTESTLPAGLLQTIPVLELMLEDWTMDFV

Query:  EGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
        EGLP AG MN IM+VVDRLSKYAYFVT++HPFSAKQVA  FIDK++R+HG+PK+II+D+D
Subjt:  EGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

A0A5D3C3J4 Ty3-gypsy retrotransposon protein0.0e+0095.46Show/hide
Query:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
        MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
Subjt:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT

Query:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHESGVEADGDKVQVMMNWPQPKDVTGL
        VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHESGVEADGDKVQVMMNWPQPKDVTGL
Subjt:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHESGVEADGDKVQVMMNWPQPKDVTGL

Query:  RGFLGLKGYYRRFVEGYGKIAAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQVQK
        RGFLGLKGYYRRFVEGYGKIAAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ + 
Subjt:  RGFLGLKGYYRRFVEGYGKIAAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQVQK

Query:  WRHYVLGRKFTIISDQKALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRL
                    I +++    L+  GEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRL
Subjt:  WRHYVLGRKFTIISDQKALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRL

Query:  IEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPV
        IEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPV
Subjt:  IEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPV

Query:  LELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
        LELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
Subjt:  LELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

A0A5D3DWA9 Ty3/gypsy retrotransposon protein1.0e-23062.27Show/hide
Query:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
        MI+ LL QY D+F +P+ LPPKR IDH ILT P QKPINVRPYKYG+ QK EIE LV EMLQ G+I+PSH P+S+PVLLVKKKDGGW F +DYRKLN+ T
Subjt:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT

Query:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP-----------------------------
        ++DKFP+PVIE+LLDELHGA++FSKLDL+SGYHQI+M+E+DIEKT FR HEGHY+F+VM FGLT  P+ F                              
Subjt:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP-----------------------------

Query:  ISHES--------------------------------------GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTF
          HE                                       GVEAD DKV+ M+ WP+PKDVTGLRGFLGL GYYRRFV+GYG+IAAPLT+LL  N F
Subjt:  ISHES--------------------------------------GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTF

Query:  KWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ---------------VQKWRHYVLGRKFTIISDQ
        KWD  A  AFE LK+AM + PVLALPDWSLPF+IETDA G GLG  LSQN HPIAFFSQKLS  AQ               VQKWRHY+LGR+FTI+SDQ
Subjt:  KWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ---------------VQKWRHYVLGRKFTIISDQ

Query:  KALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQR
        KALKFLLEQ EVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSR+    +   EL  +S  GI+D+EVV KEVE D ELQ LI+ +Q +P    KY  
Subjt:  KALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQR

Query:  SNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCD---RNKTESTLPAGLLQTIPVLELMLEDWTMDFV
        +N  L+YKGR+VLSK S++IP+LLHTF DSILGG+S FLRTYKRM GE++W GMK+DIK Y EQC+   RNK+E+T PAG+LQ +P+ + +LEDWTMDF+
Subjt:  SNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCD---RNKTESTLPAGLLQTIPVLELMLEDWTMDFV

Query:  EGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
        EGLP AG MN IM+VVDRLSKYAYFVT++HPFSAKQVA  FIDK++R+HG+PK+II+D+D
Subjt:  EGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

A0A5D3DZK6 Ty3/gypsy retrotransposon protein1.0e-23062.27Show/hide
Query:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT
        MI+ LL QY D+F +P+ LPPKR IDH ILT P QKPINVRPYKYG+ QK EIE LV EMLQ G+I+PSH P+S+PVLLVKKKDGGW F +DYRKLN+ T
Subjt:  MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQAT

Query:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP-----------------------------
        ++DKFP+PVIE+LLDELHGA++FSKLDL+SGYHQI+M+E+DIEKT FR HEGHY+F+VM FGLT  P+ F                              
Subjt:  VSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP-----------------------------

Query:  ISHES--------------------------------------GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTF
          HE                                       GVEAD DKV+ M+ WP+PKDVTGLRGFLGL GYYRRFV+GYG+IAAPLT+LL  N F
Subjt:  ISHES--------------------------------------GVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTF

Query:  KWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ---------------VQKWRHYVLGRKFTIISDQ
        KWD  A  AFE LK+AM + PVLALPDWSLPF+IETDA G GLG  LSQN HPIAFFSQKLS  AQ               VQKWRHY+LGR+FTI+SDQ
Subjt:  KWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQ---------------VQKWRHYVLGRKFTIISDQ

Query:  KALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQR
        KALKFLLEQ EVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSR+    +   EL  +S  GI+D+EVV KEVE D ELQ LI+ +Q +P    KY  
Subjt:  KALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQR

Query:  SNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCD---RNKTESTLPAGLLQTIPVLELMLEDWTMDFV
        +N  L+YKGR+VLSK S++IP+LLHTF DSILGG+S FLRTYKRM GE++W GMK+DIK Y EQC+   RNK+E+T PAG+LQ +P+ + +LEDWTMDF+
Subjt:  SNDKLLYKGRLVLSKYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCD---RNKTESTLPAGLLQTIPVLELMLEDWTMDFV

Query:  EGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
        EGLP AG MN IM+VVDRLSKYAYFVT++HPFSAKQVA  FIDK++R+HG+PK+II+D+D
Subjt:  EGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein7.2e-6427.76Show/hide
Query:  IKSLLKQYEDLFS-TPSELPPK--RGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEM---LQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRK
        +  + K+++D+ + T +E  PK  +G++  +        + +R Y    L  G+++ + +E+   L++G+I+ S    + PV+ V KK+G     +DY+ 
Subjt:  IKSLLKQYEDLFS-TPSELPPK--RGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEM---LQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRK

Query:  LNQATVSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPI-----------------------
        LN+    + +PLP+IE LL ++ G++IF+KLDL+S YH I++++ D  K  FR   G +++LVM +G++  P+HF                         
Subjt:  LNQATVSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPI-----------------------

Query:  --------------------------------SHESGVEADG------------DKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL
                                         H+S V+  G            + +  ++ W QPK+   LR FLG   Y R+F+    ++  PL  LL
Subjt:  --------------------------------SHESGVEADG------------DKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL

Query:  N---TFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNG-----HPIAFFSQKLS---------------ICAQVQKWRHYV
             +KW  T  +A E +K  ++S PVL   D+S   ++ETDA    +G  LSQ       +P+ ++S K+S               I   ++ WRHY+
Subjt:  N---TFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNG-----HPIAFFSQKLS---------------ICAQVQKWRHYV

Query:  LG--RKFTIISDQKAL--KFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRL-KTTEEQPKE-----LAVMSALGIID--LEVVLKEVED
              F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N  ADALSR+   TE  PK+     +  ++ + I D     V+ E  +
Subjt:  LG--RKFTIISDQKAL--KFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRL-KTTEEQPKE-----LAVMSALGIID--LEVVLKEVED

Query:  DPELQRLIEAVQKDPERYAKYQRSNDKLLY--KGRLVLSKYSTLIPTLLHTFQDS---ILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDR---NK
        D +L  L+     + +R  +  +  D LL   K +++L   + L  T++  + +    I  G         R      W G++K I+ Y + C     NK
Subjt:  DPELQRLIEAVQKDPERYAKYQRSNDKLLY--KGRLVLSKYSTLIPTLLHTFQDS---ILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDR---NK

Query:  TESTLPAGLLQTIPVLELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
        + +  P G LQ IP  E   E  +MDF+  LP +   NA+ +VVDR SK A  V      +A+Q A +F  +VI   G PK II D D
Subjt:  TESTLPAGLLQTIPVLELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

P0CT35 Transposon Tf2-2 polyprotein7.2e-6427.76Show/hide
Query:  IKSLLKQYEDLFS-TPSELPPK--RGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEM---LQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRK
        +  + K+++D+ + T +E  PK  +G++  +        + +R Y    L  G+++ + +E+   L++G+I+ S    + PV+ V KK+G     +DY+ 
Subjt:  IKSLLKQYEDLFS-TPSELPPK--RGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEM---LQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRK

Query:  LNQATVSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPI-----------------------
        LN+    + +PLP+IE LL ++ G++IF+KLDL+S YH I++++ D  K  FR   G +++LVM +G++  P+HF                         
Subjt:  LNQATVSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPI-----------------------

Query:  --------------------------------SHESGVEADG------------DKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL
                                         H+S V+  G            + +  ++ W QPK+   LR FLG   Y R+F+    ++  PL  LL
Subjt:  --------------------------------SHESGVEADG------------DKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL

Query:  N---TFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNG-----HPIAFFSQKLS---------------ICAQVQKWRHYV
             +KW  T  +A E +K  ++S PVL   D+S   ++ETDA    +G  LSQ       +P+ ++S K+S               I   ++ WRHY+
Subjt:  N---TFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNG-----HPIAFFSQKLS---------------ICAQVQKWRHYV

Query:  LG--RKFTIISDQKAL--KFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRL-KTTEEQPKE-----LAVMSALGIID--LEVVLKEVED
              F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N  ADALSR+   TE  PK+     +  ++ + I D     V+ E  +
Subjt:  LG--RKFTIISDQKAL--KFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRL-KTTEEQPKE-----LAVMSALGIID--LEVVLKEVED

Query:  DPELQRLIEAVQKDPERYAKYQRSNDKLLY--KGRLVLSKYSTLIPTLLHTFQDS---ILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDR---NK
        D +L  L+     + +R  +  +  D LL   K +++L   + L  T++  + +    I  G         R      W G++K I+ Y + C     NK
Subjt:  DPELQRLIEAVQKDPERYAKYQRSNDKLLY--KGRLVLSKYSTLIPTLLHTFQDS---ILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDR---NK

Query:  TESTLPAGLLQTIPVLELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
        + +  P G LQ IP  E   E  +MDF+  LP +   NA+ +VVDR SK A  V      +A+Q A +F  +VI   G PK II D D
Subjt:  TESTLPAGLLQTIPVLELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

P0CT41 Transposon Tf2-12 polyprotein7.2e-6427.76Show/hide
Query:  IKSLLKQYEDLFS-TPSELPPK--RGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEM---LQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRK
        +  + K+++D+ + T +E  PK  +G++  +        + +R Y    L  G+++ + +E+   L++G+I+ S    + PV+ V KK+G     +DY+ 
Subjt:  IKSLLKQYEDLFS-TPSELPPK--RGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEM---LQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRK

Query:  LNQATVSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPI-----------------------
        LN+    + +PLP+IE LL ++ G++IF+KLDL+S YH I++++ D  K  FR   G +++LVM +G++  P+HF                         
Subjt:  LNQATVSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPI-----------------------

Query:  --------------------------------SHESGVEADG------------DKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL
                                         H+S V+  G            + +  ++ W QPK+   LR FLG   Y R+F+    ++  PL  LL
Subjt:  --------------------------------SHESGVEADG------------DKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL

Query:  N---TFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNG-----HPIAFFSQKLS---------------ICAQVQKWRHYV
             +KW  T  +A E +K  ++S PVL   D+S   ++ETDA    +G  LSQ       +P+ ++S K+S               I   ++ WRHY+
Subjt:  N---TFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNG-----HPIAFFSQKLS---------------ICAQVQKWRHYV

Query:  LG--RKFTIISDQKAL--KFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRL-KTTEEQPKE-----LAVMSALGIID--LEVVLKEVED
              F I++D + L  +   E      +  +W   L  ++FEI Y+PG  N  ADALSR+   TE  PK+     +  ++ + I D     V+ E  +
Subjt:  LG--RKFTIISDQKAL--KFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRL-KTTEEQPKE-----LAVMSALGIID--LEVVLKEVED

Query:  DPELQRLIEAVQKDPERYAKYQRSNDKLLY--KGRLVLSKYSTLIPTLLHTFQDS---ILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDR---NK
        D +L  L+     + +R  +  +  D LL   K +++L   + L  T++  + +    I  G         R      W G++K I+ Y + C     NK
Subjt:  DPELQRLIEAVQKDPERYAKYQRSNDKLLY--KGRLVLSKYSTLIPTLLHTFQDS---ILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDR---NK

Query:  TESTLPAGLLQTIPVLELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
        + +  P G LQ IP  E   E  +MDF+  LP +   NA+ +VVDR SK A  V      +A+Q A +F  +VI   G PK II D D
Subjt:  TESTLPAGLLQTIPVLELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.0e-6528.61Show/hide
Query:  LLKQYEDLFSTPSELPPKRG------IDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQ
        L ++Y ++    ++LPP+        + H I   P  +   ++PY      + EI  +V+++L    I PS  P S+PV+LV KKDG +   +DYR LN+
Subjt:  LLKQYEDLFSTPSELPPKRG------IDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQ

Query:  ATVSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP---------------------------
        AT+SD FPLP I++LL  +  A IF+ LDL SGYHQI M+  D  KT F    G Y++ VM FGL   PS F                            
Subjt:  ATVSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP---------------------------

Query:  -----------------------------------ISHESGVEADG---DKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL-NTFK
                                           + +  G++       K   + ++P PK V   + FLG+  YYRRF+    KIA P+   + +  +
Subjt:  -----------------------------------ISHESGVEADG---DKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL-NTFK

Query:  WDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHP------IAFFSQK---------------LSICAQVQKWRHYVLGRKFT
        W     +A EKLK A+ ++PVL   +    + + TDA   G+G  L +  +       + +FS+                L I   +  +R+ + G+ FT
Subjt:  WDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHP------IAFFSQK---------------LSICAQVQKWRHYVLGRKFT

Query:  IISDQKALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSR--------------LKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPEL
        + +D  +L  L  + E   + Q+WL  L  YDF + Y  G +N  ADA+SR               ++ +   K   + SA+ I   E+    V   PE 
Subjt:  IISDQKALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSR--------------LKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPEL

Query:  QRLIEAVQKD---PERYAK-YQRSNDKLLYKGRLVLS-KYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEE---QCDRNKTEST
             + QK     E + K Y   ++ + Y+ RLV+  K    +  L H    ++ GG+     T  ++    YW  ++  I  Y     QC   K+   
Subjt:  QRLIEAVQKD---PERYAK-YQRSNDKLLYKGRLVLS-KYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEE---QCDRNKTEST

Query:  LPAGLLQTIPVLELMLEDWTMDFVEGL-PTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
           GLLQ +P+ E    D +MDFV GL PT+ N+N I++VVDR SK A+F+  R    A Q+ ++    +   HG P+ I +D+D
Subjt:  LPAGLLQTIPVLELMLEDWTMDFVEGL-PTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.2e-6528.61Show/hide
Query:  LLKQYEDLFSTPSELPPKRG------IDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQ
        L ++Y ++    ++LPP+        + H I   P  +   ++PY      + EI  +V+++L    I PS  P S+PV+LV KKDG +   +DYR LN+
Subjt:  LLKQYEDLFSTPSELPPKRG------IDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQ

Query:  ATVSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP---------------------------
        AT+SD FPLP I++LL  +  A IF+ LDL SGYHQI M+  D  KT F    G Y++ VM FGL   PS F                            
Subjt:  ATVSDKFPLPVIEDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFP---------------------------

Query:  -----------------------------------ISHESGVEADG---DKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL-NTFK
                                           + +  G++       K   + ++P PK V   + FLG+  YYRRF+    KIA P+   + +  +
Subjt:  -----------------------------------ISHESGVEADG---DKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL-NTFK

Query:  WDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHP------IAFFSQK---------------LSICAQVQKWRHYVLGRKFT
        W     +A +KLK A+ ++PVL   +    + + TDA   G+G  L +  +       + +FS+                L I   +  +R+ + G+ FT
Subjt:  WDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHP------IAFFSQK---------------LSICAQVQKWRHYVLGRKFT

Query:  IISDQKALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTT--------------EEQPKELAVMSALGIIDLEVVLKEVEDDPEL
        + +D  +L  L  + E   + Q+WL  L  YDF + Y  G +N  ADA+SR   T              +   K   + SA+ I   E+    V   PE 
Subjt:  IISDQKALKFLLEQGEVQPQFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTT--------------EEQPKELAVMSALGIIDLEVVLKEVEDDPEL

Query:  QRLIEAVQKD---PERYAK-YQRSNDKLLYKGRLVLS-KYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEE---QCDRNKTEST
             + QK     E + K Y   ++ + Y+ RLV+  K    +  L H    ++ GG+     T  ++    YW  ++  I  Y     QC   K+   
Subjt:  QRLIEAVQKD---PERYAK-YQRSNDKLLYKGRLVLS-KYSTLIPTLLHTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEE---QCDRNKTEST

Query:  LPAGLLQTIPVLELMLEDWTMDFVEGL-PTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD
           GLLQ +P+ E    D +MDFV GL PT+ N+N I++VVDR SK A+F+  R    A Q+ ++    +   HG P+ I +D+D
Subjt:  LPAGLLQTIPVLELMLEDWTMDFVEGL-PTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQVAEVFIDKVIRKHGVPKAIITDQD

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein2.2e-0757.5Show/hide
Query:  LQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGW
        L++  ++N + EML+A +I+PS  PYS+PVLLV+KKDGGW
Subjt:  LQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGW

ATMG00860.1 DNA/RNA polymerases superfamily protein2.6e-2456.12Show/hide
Query:  HFPISHESGVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFI
        H  I    GV AD  K++ M+ WP+PK+ T LRGFLGL GYYRRFV+ YGKI  PLT LL  N+ KW   A  AF+ LK A+ + PVLALPD  LPF+
Subjt:  HFPISHESGVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKIAAPLTRLL--NTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCAAGTCTCTACTTAAACAATATGAAGATCTTTTTTCTACCCCATCCGAACTGCCACCTAAGAGAGGAATTGATCATTGCATCCTAACAGCACCCAGACAAAAACC
GATCAATGTAAGACCCTATAAATATGGTTACCTACAGAAAGGGGAAATTGAAAATCTGGTGGAAGAGATGTTACAGGCTGGAGTGATCAAACCTAGCCATGGTCCATACT
CCACTCCGGTTCTGTTGGTGAAGAAAAAAGATGGGGGCTGGTGTTTCTCCATTGATTATAGGAAGTTAAATCAAGCTACGGTCTCGGATAAGTTTCCATTACCGGTTATA
GAGGATCTTCTGGATGAATTACATGGAGCCTCCATATTTTCCAAGCTGGATTTACGATCGGGGTACCATCAAATCAAGATGAAGGAAGACGACATTGAGAAAACTGTTTT
TAGAATGCATGAAGGCCACTACAAATTTTTGGTAATGTCATTTGGCCTTACAAAATGCCCCAGCCACTTTCCAATCTCTCATGAATCAGGAGTAGAAGCAGATGGGGATA
AGGTACAGGTTATGATGAATTGGCCTCAACCAAAGGATGTAACTGGGTTGAGAGGATTTCTGGGACTGAAAGGTTATTATAGAAGATTTGTGGAAGGTTATGGTAAAATT
GCAGCTCCCTTGACCAGATTGTTGAACACATTTAAATGGGATGGGACAGCCATTGAAGCTTTTGAAAAATTGAAGACAGCGATGATATCAGCCCCTGTTTTAGCCCTTCC
GGATTGGTCCCTCCCATTTATCATTGAGACTGATGCTTTAGGAAAAGGGTTAGGGCAGTCTTTATCTCAAAATGGGCACCCTATTGCTTTCTTCAGCCAGAAGCTTTCTA
TTTGTGCTCAAGTGCAGAAATGGAGACATTATGTGCTTGGGAGGAAGTTTACTATCATCTCGGACCAAAAGGCTCTTAAGTTCTTGCTGGAGCAGGGGGAGGTGCAACCC
CAGTTTCAAAAATGGCTTACCAAACTACTCGGATATGACTTTGAAATTCTCTACCAACCTGGGTTACAGAACAAGGCTGCTGATGCCTTATCAAGACTGAAAACAACAGA
AGAACAACCTAAAGAATTAGCTGTAATGAGTGCTCTGGGTATCATTGACTTAGAGGTGGTTTTGAAAGAAGTTGAAGATGATCCTGAACTTCAAAGATTGATTGAAGCTG
TTCAAAAAGACCCAGAAAGGTATGCCAAATATCAGCGGTCTAATGACAAATTGCTTTATAAAGGAAGATTGGTCCTTTCCAAATACTCTACCTTGATACCAACCCTATTA
CACACATTCCAGGACTCTATTTTAGGAGGGTATTCTAGATTTCTAAGAACCTATAAACGCATGAGAGGAGAAATTTATTGGGATGGGATGAAAAAAGATATCAAGAATTA
TGAGGAGCAATGCGACAGAAACAAAACGGAGTCAACTTTACCAGCAGGGTTACTTCAGACTATCCCTGTACTCGAACTCATGCTTGAAGATTGGACCATGGATTTTGTGG
AAGGTCTACCAACGGCAGGAAATATGAATGCAATAATGATTGTGGTGGATAGGCTAAGCAAATATGCCTACTTTGTGACATTGAGACACCCCTTCTCGGCTAAACAAGTT
GCTGAAGTCTTTATTGATAAAGTGATAAGGAAACATGGAGTACCAAAAGCCATCATTACTGACCAAGACTAA
mRNA sequenceShow/hide mRNA sequence
ATGATCAAGTCTCTACTTAAACAATATGAAGATCTTTTTTCTACCCCATCCGAACTGCCACCTAAGAGAGGAATTGATCATTGCATCCTAACAGCACCCAGACAAAAACC
GATCAATGTAAGACCCTATAAATATGGTTACCTACAGAAAGGGGAAATTGAAAATCTGGTGGAAGAGATGTTACAGGCTGGAGTGATCAAACCTAGCCATGGTCCATACT
CCACTCCGGTTCTGTTGGTGAAGAAAAAAGATGGGGGCTGGTGTTTCTCCATTGATTATAGGAAGTTAAATCAAGCTACGGTCTCGGATAAGTTTCCATTACCGGTTATA
GAGGATCTTCTGGATGAATTACATGGAGCCTCCATATTTTCCAAGCTGGATTTACGATCGGGGTACCATCAAATCAAGATGAAGGAAGACGACATTGAGAAAACTGTTTT
TAGAATGCATGAAGGCCACTACAAATTTTTGGTAATGTCATTTGGCCTTACAAAATGCCCCAGCCACTTTCCAATCTCTCATGAATCAGGAGTAGAAGCAGATGGGGATA
AGGTACAGGTTATGATGAATTGGCCTCAACCAAAGGATGTAACTGGGTTGAGAGGATTTCTGGGACTGAAAGGTTATTATAGAAGATTTGTGGAAGGTTATGGTAAAATT
GCAGCTCCCTTGACCAGATTGTTGAACACATTTAAATGGGATGGGACAGCCATTGAAGCTTTTGAAAAATTGAAGACAGCGATGATATCAGCCCCTGTTTTAGCCCTTCC
GGATTGGTCCCTCCCATTTATCATTGAGACTGATGCTTTAGGAAAAGGGTTAGGGCAGTCTTTATCTCAAAATGGGCACCCTATTGCTTTCTTCAGCCAGAAGCTTTCTA
TTTGTGCTCAAGTGCAGAAATGGAGACATTATGTGCTTGGGAGGAAGTTTACTATCATCTCGGACCAAAAGGCTCTTAAGTTCTTGCTGGAGCAGGGGGAGGTGCAACCC
CAGTTTCAAAAATGGCTTACCAAACTACTCGGATATGACTTTGAAATTCTCTACCAACCTGGGTTACAGAACAAGGCTGCTGATGCCTTATCAAGACTGAAAACAACAGA
AGAACAACCTAAAGAATTAGCTGTAATGAGTGCTCTGGGTATCATTGACTTAGAGGTGGTTTTGAAAGAAGTTGAAGATGATCCTGAACTTCAAAGATTGATTGAAGCTG
TTCAAAAAGACCCAGAAAGGTATGCCAAATATCAGCGGTCTAATGACAAATTGCTTTATAAAGGAAGATTGGTCCTTTCCAAATACTCTACCTTGATACCAACCCTATTA
CACACATTCCAGGACTCTATTTTAGGAGGGTATTCTAGATTTCTAAGAACCTATAAACGCATGAGAGGAGAAATTTATTGGGATGGGATGAAAAAAGATATCAAGAATTA
TGAGGAGCAATGCGACAGAAACAAAACGGAGTCAACTTTACCAGCAGGGTTACTTCAGACTATCCCTGTACTCGAACTCATGCTTGAAGATTGGACCATGGATTTTGTGG
AAGGTCTACCAACGGCAGGAAATATGAATGCAATAATGATTGTGGTGGATAGGCTAAGCAAATATGCCTACTTTGTGACATTGAGACACCCCTTCTCGGCTAAACAAGTT
GCTGAAGTCTTTATTGATAAAGTGATAAGGAAACATGGAGTACCAAAAGCCATCATTACTGACCAAGACTAA
Protein sequenceShow/hide protein sequence
MIKSLLKQYEDLFSTPSELPPKRGIDHCILTAPRQKPINVRPYKYGYLQKGEIENLVEEMLQAGVIKPSHGPYSTPVLLVKKKDGGWCFSIDYRKLNQATVSDKFPLPVI
EDLLDELHGASIFSKLDLRSGYHQIKMKEDDIEKTVFRMHEGHYKFLVMSFGLTKCPSHFPISHESGVEADGDKVQVMMNWPQPKDVTGLRGFLGLKGYYRRFVEGYGKI
AAPLTRLLNTFKWDGTAIEAFEKLKTAMISAPVLALPDWSLPFIIETDALGKGLGQSLSQNGHPIAFFSQKLSICAQVQKWRHYVLGRKFTIISDQKALKFLLEQGEVQP
QFQKWLTKLLGYDFEILYQPGLQNKAADALSRLKTTEEQPKELAVMSALGIIDLEVVLKEVEDDPELQRLIEAVQKDPERYAKYQRSNDKLLYKGRLVLSKYSTLIPTLL
HTFQDSILGGYSRFLRTYKRMRGEIYWDGMKKDIKNYEEQCDRNKTESTLPAGLLQTIPVLELMLEDWTMDFVEGLPTAGNMNAIMIVVDRLSKYAYFVTLRHPFSAKQV
AEVFIDKVIRKHGVPKAIITDQD