; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0020480 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0020480
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationchr02:19570929..19572968
RNA-Seq ExpressionPI0020480
SyntenyPI0020480
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0015074 - DNA integration (biological process)
GO:0030154 - cell differentiation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR041588 - Integrase zinc-binding domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8637561.1 hypothetical protein CSA_017659 [Cucumis sativus]4.1e-24563.39Show/hide
Query:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----
        LL+QYA IF  P  LPPKR+IDH+IL  P Q+PINVRPYKY ++QKEEIEKLV EMLQAG+IRPSH+PYS+PVLLV+K+ GGWRFCVDYRKLNQVT    
Subjt:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----

Query:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMN-----------------------------
                  DELH AT+FSKLD++S YHQIRM+EEDV K AFRTHEGHYEFLVMPFGLTNAPATFQ+LMN                             
Subjt:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMN-----------------------------

Query:  -----------------------------QYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD
                                     QYLGH+IS++GV+AD EKI+ M +WPQP+DVT LRGFLGL+GYYRRFVK YG IAAPLT+LLQKN F WD+
Subjt:  -----------------------------QYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD

Query:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK
        QATVAFE+LK AM ++PVLALP+W LPF++ETDASGTGLGAVLSQNGHPIAFF                       +VQ+WRHY+LGR+F +ISDQKALK
Subjt:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK

Query:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER
        FLLEQREVQPQFQKWLTKLLGYDFEILYQ GLQNKAADALSR+E       E+++++T G++  E I KEV +D +LQK I  LK NP+   K+ W+  +
Subjt:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER

Query:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP
        L YK R+V+ K+S +IPTLLHTFHDSI+GGHSGFLRTYKR++GELYWEGMK D+KKYVEQCEICQRNK ++T P+G+L PIP PD ILEEWSMDF+EGLP
Subjt:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP

Query:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYFLSWG
         +G MNVIMVVVDRLSKYAYFIT+KHPFTAKQVA  F+E+I+SKHG+PKSI++DRD++    F    F + G
Subjt:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYFLSWG

KGN62557.2 hypothetical protein Csa_018739 [Cucumis sativus]9.1e-24562.63Show/hide
Query:  MIQCCLLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQV
        MIQ  LL+QY ++F  P GLPPKR+ DH+IL   GQKPINVRPYKY H QKEEIEKL+ EMLQ GIIRPSH+PYS+PVLLVRK+ GGWRFCVDYRKLNQV
Subjt:  MIQCCLLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQV

Query:  T-----------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMN------------------------
        T              DELH AT+FSKLDL+SGYHQIRMKEEDV K AFRTHEGHYEFLVMPFGLTNAPATFQ+LMN                        
Subjt:  T-----------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMN------------------------

Query:  ----------------------------------QYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNG
                                          QYLGHLIS+RGVEADG+KI+ M  WPQP+DVT LRGFLGLTGYYRRFVK YG +A PLTKLLQKN 
Subjt:  ----------------------------------QYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNG

Query:  FKWDDQATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISD
        F W ++AT AF++LK AM ++PVLALPDW+LPF++ETDASG  LGAVLSQNGHPIAFF                       +VQ+WRHY+LGR+F +ISD
Subjt:  FKWDDQATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISD

Query:  QKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYK
        Q+ALKFLLEQREVQPQFQKWLTKLLGYDFEILYQ GLQNKAADALSR+E       E+  MST G+++ E ++KEV+ D +L+ +I+ LK NP+   K++
Subjt:  QKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYK

Query:  WDRERLWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDF
        W    LWYK R+V+ K S +IPTLLHTFHDSI+GGHSGFLRTYKR+ GELYW+GMK DVKKYV++CE+CQRNK ++T P+G+LQPIPIP+ ILE+WSMDF
Subjt:  WDRERLWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDF

Query:  VEGLPMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYFLSWG
        +EGLP +G MNVIMV+VDRLSKY+YFIT++HPF A+QVA  F++R++S+HGIPKSII+DRDKI    F +  F S G
Subjt:  VEGLPMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYFLSWG

TYK03842.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]2.8e-24162.72Show/hide
Query:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----
        LL QY+++F  PT LPPKR IDH+ILT PGQKPINVRPYKY H QKEEIEKLV EMLQ GIIRPSH+P+S+PVLLV+K+ GGWRFCVDYRKLN++T    
Subjt:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----

Query:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------
                  DELH AT+FSKLDL+SGYHQIRM+EED+ K AFRTHEGHYEF+VMPFGLTNAPATFQ+LMNQ                            
Subjt:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------

Query:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD
                                      YLGH+IS  GVEAD +K++ M +WP+P+DVT LRGFLGLTGYYRRFVK YG IAAPLTKLLQKN FKWD+
Subjt:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD

Query:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK
         AT+AFE LK AM ++PVLALPDWSLPF++ETDASG+GLGAVLSQN HPIAFF                       +VQ+WRHY+LGRRF ++SDQKALK
Subjt:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK

Query:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER
        FLLEQREVQPQFQKWLTKLLGYDFEILYQ GLQNKAADALSR++       EL  +ST G++  E + KEV++D +LQ LI  L+ NP    KY      
Subjt:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER

Query:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP
        L YK R+V+ K S IIP+LLHTFHDSI+GGHSGFLRTYKR++GEL+W+GMK D+KKYVEQCEICQRNK+++T P+G+LQP+PIPD ILE+W+MDF+EGLP
Subjt:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP

Query:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF
         +G MNVIMVVVDRLSKYAYF+T+KHPF+AKQVA  F+++I+ +HGIPKSII+DRDKI    F +  F
Subjt:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF

TYK21035.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.6e-24162.87Show/hide
Query:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----
        LL QY+++FN PT LPPKR IDH+ILT PGQKPINVRPYKY H QKEEIEKLV EMLQ GIIRPSH+P+S+PVLLV+K+ GGWRFCVDYRKLN++T    
Subjt:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----

Query:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------
                  DELH AT+FSKLDL+SGYHQIRM+EED+ K AFRTHEGHYEF+VMPFGLTNAPATFQ+LMNQ                            
Subjt:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------

Query:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD
                                      YLGH+IS  GVEAD +K++ M +WP+P+DVT LRGFLGLTGYYRRFVK YG IAAPLTKLLQKN FKWD+
Subjt:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD

Query:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK
         AT+AFE LK AM ++PVLALPDWSLPF++ETDASG+GLGAVLSQN HPIAFF                       +VQ+WRHY+LGRRF ++SDQKALK
Subjt:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK

Query:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER
        FLLEQREVQPQFQKWLTKLLGYDFEILYQ GLQNKAADALSR++       EL  +ST G++  E + KEV++D +LQ LI  L+ NP    KY      
Subjt:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER

Query:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP
        L YK R+V+ K S IIP+LLHTFHDSI+GGHSGFLRTYKR++GEL+W+GMK D+KKYVEQCEICQRNK+++T P+G+LQP+PIPD ILE+W+MDF+EGLP
Subjt:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP

Query:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF
         +G MNVIMVVVDRLSKYAYF+T+KHPF+AKQVA  F+++I+ +HGIPKSII+DRDKI    F +  F
Subjt:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF

TYK28944.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]8.0e-24162.72Show/hide
Query:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----
        LL QY+++F  PT LPPKR IDH+ILT PGQKPINVRPYKY H QKEEIEKLV EMLQ GIIRPSH+P+S+PVLLV+K+ GGWRFCVDYRKLN++T    
Subjt:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----

Query:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------
                  DELH AT+FSKLDL+SGYHQIRM+EED+ K AFRTHEGHYEF+VMPFGLTNAPATFQ+LMNQ                            
Subjt:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------

Query:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD
                                      YLGH+IS  GVEAD +K++ M +WP+P+DVT LRGFLGLTGYYRRFVK YG IAAPLTKLLQKN FKWD+
Subjt:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD

Query:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK
         AT+AFE LK AM ++PVLALPDWSLPF++ETDASG+GLGAVLSQN HPIAFF                       +VQ+WRHY+LGRRF ++SDQKALK
Subjt:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK

Query:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER
        FLLEQREVQPQFQKWLTKLLGYDFEILYQ GLQNKAADALSR++       EL  +ST G++  E + KEV++D +LQ LI  L+ NP    KY      
Subjt:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER

Query:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP
        L YK R+V+ K S IIP+LLHTFHDSI+GGHSGFLRTYKR++GEL+W+GMK D+KKYVEQCEICQRNK+++T P+G+LQP+PIPD ILE+W+MDF+EGLP
Subjt:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP

Query:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF
         +G MNVIMVVVDRLSKYAYF+T+KHPF+AKQVA  F+++I+ +HGIPKSII+DRDKI    F +  F
Subjt:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF

TrEMBL top hitse value%identityAlignment
A0A5D3BBH7 Ty3/gypsy retrotransposon protein3.9e-24162.72Show/hide
Query:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----
        LL QY+++F  PT LPPKR IDH+ILT PGQKPINVRPYKY H QKEEIEKLV EMLQ GIIRPSH+P+S+PVLLV+K+ GGWRFCVDYRKLN++T    
Subjt:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----

Query:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------
                  DELH AT+FSKLDL+SGYHQIRM+EED+ K AFRTHEGHYEF+VMPFGLTNAPATFQ+LMNQ                            
Subjt:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------

Query:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD
                                      YLGH+IS  GVEAD +K++ M +WP+P+DVT LRGFLGLTGYYRRFVK YG IAAPLTKLLQKN FKWD+
Subjt:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD

Query:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK
         AT+AFE LK AM ++PVLALPDWSLPF++ETDASG+GLGAVLSQN HPIAFF                       +VQ+WRHY+LGRRF ++SDQKALK
Subjt:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK

Query:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER
        FLLEQREVQPQFQKWLTKLLGYDFEILYQ GLQNKAADALSR++       EL  +ST G++  E + KEV++D +LQ LI  L+ NP    KY      
Subjt:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER

Query:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP
        L YK R+V+ K S IIP+LLHTFHDSI+GGHSGFLRTYKR++GEL+W+GMK D+KKYVEQCEICQRNK+++T P+G+LQP+PIPD ILE+W+MDF+EGLP
Subjt:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP

Query:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF
         +G MNVIMVVVDRLSKYAYF+T+KHPF+AKQVA  F+++I+ +HGIPKSII+DRDKI    F +  F
Subjt:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF

A0A5D3BYA1 Ty3/gypsy retrotransposon protein1.3e-24162.72Show/hide
Query:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----
        LL QY+++F  PT LPPKR IDH+ILT PGQKPINVRPYKY H QKEEIEKLV EMLQ GIIRPSH+P+S+PVLLV+K+ GGWRFCVDYRKLN++T    
Subjt:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----

Query:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------
                  DELH AT+FSKLDL+SGYHQIRM+EED+ K AFRTHEGHYEF+VMPFGLTNAPATFQ+LMNQ                            
Subjt:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------

Query:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD
                                      YLGH+IS  GVEAD +K++ M +WP+P+DVT LRGFLGLTGYYRRFVK YG IAAPLTKLLQKN FKWD+
Subjt:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD

Query:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK
         AT+AFE LK AM ++PVLALPDWSLPF++ETDASG+GLGAVLSQN HPIAFF                       +VQ+WRHY+LGRRF ++SDQKALK
Subjt:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK

Query:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER
        FLLEQREVQPQFQKWLTKLLGYDFEILYQ GLQNKAADALSR++       EL  +ST G++  E + KEV++D +LQ LI  L+ NP    KY      
Subjt:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER

Query:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP
        L YK R+V+ K S IIP+LLHTFHDSI+GGHSGFLRTYKR++GEL+W+GMK D+KKYVEQCEICQRNK+++T P+G+LQP+PIPD ILE+W+MDF+EGLP
Subjt:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP

Query:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF
         +G MNVIMVVVDRLSKYAYF+T+KHPF+AKQVA  F+++I+ +HGIPKSII+DRDKI    F +  F
Subjt:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF

A0A5D3DWA9 Ty3/gypsy retrotransposon protein3.9e-24162.72Show/hide
Query:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----
        LL QY+++F  PT LPPKR IDH+ILT PGQKPINVRPYKY H QKEEIEKLV EMLQ GIIRPSH+P+S+PVLLV+K+ GGWRFCVDYRKLN++T    
Subjt:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----

Query:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------
                  DELH AT+FSKLDL+SGYHQIRM+EED+ K AFRTHEGHYEF+VMPFGLTNAPATFQ+LMNQ                            
Subjt:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------

Query:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD
                                      YLGH+IS  GVEAD +K++ M +WP+P+DVT LRGFLGLTGYYRRFVK YG IAAPLTKLLQKN FKWD+
Subjt:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD

Query:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK
         AT+AFE LK AM ++PVLALPDWSLPF++ETDASG+GLGAVLSQN HPIAFF                       +VQ+WRHY+LGRRF ++SDQKALK
Subjt:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK

Query:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER
        FLLEQREVQPQFQKWLTKLLGYDFEILYQ GLQNKAADALSR++       EL  +ST G++  E + KEV++D +LQ LI  L+ NP    KY      
Subjt:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER

Query:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP
        L YK R+V+ K S IIP+LLHTFHDSI+GGHSGFLRTYKR++GEL+W+GMK D+KKYVEQCEICQRNK+++T P+G+LQP+PIPD ILE+W+MDF+EGLP
Subjt:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP

Query:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF
         +G MNVIMVVVDRLSKYAYF+T+KHPF+AKQVA  F+++I+ +HGIPKSII+DRDKI    F +  F
Subjt:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF

A0A5D3DZK6 Ty3/gypsy retrotransposon protein3.9e-24162.72Show/hide
Query:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----
        LL QY+++F  PT LPPKR IDH+ILT PGQKPINVRPYKY H QKEEIEKLV EMLQ GIIRPSH+P+S+PVLLV+K+ GGWRFCVDYRKLN++T    
Subjt:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----

Query:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------
                  DELH AT+FSKLDL+SGYHQIRM+EED+ K AFRTHEGHYEF+VMPFGLTNAPATFQ+LMNQ                            
Subjt:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------

Query:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD
                                      YLGH+IS  GVEAD +K++ M +WP+P+DVT LRGFLGLTGYYRRFVK YG IAAPLTKLLQKN FKWD+
Subjt:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD

Query:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK
         AT+AFE LK AM ++PVLALPDWSLPF++ETDASG+GLGAVLSQN HPIAFF                       +VQ+WRHY+LGRRF ++SDQKALK
Subjt:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK

Query:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER
        FLLEQREVQPQFQKWLTKLLGYDFEILYQ GLQNKAADALSR++       EL  +ST G++  E + KEV++D +LQ LI  L+ NP    KY      
Subjt:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER

Query:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP
        L YK R+V+ K S IIP+LLHTFHDSI+GGHSGFLRTYKR++GEL+W+GMK D+KKYVEQCEICQRNK+++T P+G+LQP+PIPD ILE+W+MDF+EGLP
Subjt:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP

Query:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF
         +G MNVIMVVVDRLSKYAYF+T+KHPF+AKQVA  F+++I+ +HGIPKSII+DRDKI    F +  F
Subjt:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF

A0A5D3E325 Ty3/gypsy retrotransposon protein1.7e-24162.87Show/hide
Query:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----
        LL QY+++FN PT LPPKR IDH+ILT PGQKPINVRPYKY H QKEEIEKLV EMLQ GIIRPSH+P+S+PVLLV+K+ GGWRFCVDYRKLN++T    
Subjt:  LLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVT----

Query:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------
                  DELH AT+FSKLDL+SGYHQIRM+EED+ K AFRTHEGHYEF+VMPFGLTNAPATFQ+LMNQ                            
Subjt:  -------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQ----------------------------

Query:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD
                                      YLGH+IS  GVEAD +K++ M +WP+P+DVT LRGFLGLTGYYRRFVK YG IAAPLTKLLQKN FKWD+
Subjt:  ------------------------------YLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDD

Query:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK
         AT+AFE LK AM ++PVLALPDWSLPF++ETDASG+GLGAVLSQN HPIAFF                       +VQ+WRHY+LGRRF ++SDQKALK
Subjt:  QATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFF-----------------------TVQRWRHYVLGRRFIVISDQKALK

Query:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER
        FLLEQREVQPQFQKWLTKLLGYDFEILYQ GLQNKAADALSR++       EL  +ST G++  E + KEV++D +LQ LI  L+ NP    KY      
Subjt:  FLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRER

Query:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP
        L YK R+V+ K S IIP+LLHTFHDSI+GGHSGFLRTYKR++GEL+W+GMK D+KKYVEQCEICQRNK+++T P+G+LQP+PIPD ILE+W+MDF+EGLP
Subjt:  LWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLP

Query:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF
         +G MNVIMVVVDRLSKYAYF+T+KHPF+AKQVA  F+++I+ +HGIPKSII+DRDKI    F +  F
Subjt:  MSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKISSVVFGRSYF

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein1.9e-6727.27Show/hide
Query:  PKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQ-----------VTNCWDELHEA
        P + ++ ++        + +R Y     + + +   + + L++GIIR S    + PV+ V K+ G  R  VDY+ LN+           +     ++  +
Subjt:  PKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQ-----------VTNCWDELHEA

Query:  TIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMN---------------------------------------------
        TIF+KLDL+S YH IR+++ D  K AFR   G +E+LVMP+G++ APA FQ  +N                                             
Subjt:  TIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMN---------------------------------------------

Query:  -------------QYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKN-GFKWDDQATVAFEQLKKAMIS
                     +++G+ IS +G     E I  + +W QP++   LR FLG   Y R+F+     +  PL  LL+K+  +KW    T A E +K+ ++S
Subjt:  -------------QYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKN-GFKWDDQATVAFEQLKKAMIS

Query:  VPVLALPDWSLPFVVETDASGTGLGAVLSQNG-----HPIAFF-----------------------TVQRWRHYVLG--RRFIVISDQKAL--KFLLEQR
         PVL   D+S   ++ETDAS   +GAVLSQ       +P+ ++                       +++ WRHY+      F +++D + L  +   E  
Subjt:  VPVLALPDWSLPFVVETDASGTGLGAVLSQNG-----HPIAFF-----------------------TVQRWRHYVLG--RRFIVISDQKAL--KFLLEQR

Query:  EVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVE------PRKATEAELSTMSTLGLLS--TEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDR
            +  +W   L  ++FEI Y+ G  N  ADALSR+       P+ + +  ++ ++ + +       +  E   D  L  L++N     E + + K D 
Subjt:  EVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVE------PRKATEAELSTMSTLGLLS--TEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDR

Query:  ERLWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEG
          +  K+++++   + +  T++  +H+     H G       +     W+G++  +++YV+ C  CQ NK+ +  P G LQPIP  +   E  SMDF+  
Subjt:  ERLWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEG

Query:  LPMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKI
        LP S   N + VVVDR SK A  +      TA+Q A  F +R+I+  G PK II D D I
Subjt:  LPMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKI

P0CT35 Transposon Tf2-2 polyprotein1.9e-6727.27Show/hide
Query:  PKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQ-----------VTNCWDELHEA
        P + ++ ++        + +R Y     + + +   + + L++GIIR S    + PV+ V K+ G  R  VDY+ LN+           +     ++  +
Subjt:  PKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQ-----------VTNCWDELHEA

Query:  TIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMN---------------------------------------------
        TIF+KLDL+S YH IR+++ D  K AFR   G +E+LVMP+G++ APA FQ  +N                                             
Subjt:  TIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMN---------------------------------------------

Query:  -------------QYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKN-GFKWDDQATVAFEQLKKAMIS
                     +++G+ IS +G     E I  + +W QP++   LR FLG   Y R+F+     +  PL  LL+K+  +KW    T A E +K+ ++S
Subjt:  -------------QYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKN-GFKWDDQATVAFEQLKKAMIS

Query:  VPVLALPDWSLPFVVETDASGTGLGAVLSQNG-----HPIAFF-----------------------TVQRWRHYVLG--RRFIVISDQKAL--KFLLEQR
         PVL   D+S   ++ETDAS   +GAVLSQ       +P+ ++                       +++ WRHY+      F +++D + L  +   E  
Subjt:  VPVLALPDWSLPFVVETDASGTGLGAVLSQNG-----HPIAFF-----------------------TVQRWRHYVLG--RRFIVISDQKAL--KFLLEQR

Query:  EVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVE------PRKATEAELSTMSTLGLLS--TEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDR
            +  +W   L  ++FEI Y+ G  N  ADALSR+       P+ + +  ++ ++ + +       +  E   D  L  L++N     E + + K D 
Subjt:  EVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVE------PRKATEAELSTMSTLGLLS--TEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDR

Query:  ERLWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEG
          +  K+++++   + +  T++  +H+     H G       +     W+G++  +++YV+ C  CQ NK+ +  P G LQPIP  +   E  SMDF+  
Subjt:  ERLWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEG

Query:  LPMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKI
        LP S   N + VVVDR SK A  +      TA+Q A  F +R+I+  G PK II D D I
Subjt:  LPMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKI

P0CT41 Transposon Tf2-12 polyprotein1.9e-6727.27Show/hide
Query:  PKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQ-----------VTNCWDELHEA
        P + ++ ++        + +R Y     + + +   + + L++GIIR S    + PV+ V K+ G  R  VDY+ LN+           +     ++  +
Subjt:  PKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQ-----------VTNCWDELHEA

Query:  TIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMN---------------------------------------------
        TIF+KLDL+S YH IR+++ D  K AFR   G +E+LVMP+G++ APA FQ  +N                                             
Subjt:  TIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMN---------------------------------------------

Query:  -------------QYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKN-GFKWDDQATVAFEQLKKAMIS
                     +++G+ IS +G     E I  + +W QP++   LR FLG   Y R+F+     +  PL  LL+K+  +KW    T A E +K+ ++S
Subjt:  -------------QYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKN-GFKWDDQATVAFEQLKKAMIS

Query:  VPVLALPDWSLPFVVETDASGTGLGAVLSQNG-----HPIAFF-----------------------TVQRWRHYVLG--RRFIVISDQKAL--KFLLEQR
         PVL   D+S   ++ETDAS   +GAVLSQ       +P+ ++                       +++ WRHY+      F +++D + L  +   E  
Subjt:  VPVLALPDWSLPFVVETDASGTGLGAVLSQNG-----HPIAFF-----------------------TVQRWRHYVLG--RRFIVISDQKAL--KFLLEQR

Query:  EVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVE------PRKATEAELSTMSTLGLLS--TEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDR
            +  +W   L  ++FEI Y+ G  N  ADALSR+       P+ + +  ++ ++ + +       +  E   D  L  L++N     E + + K D 
Subjt:  EVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVE------PRKATEAELSTMSTLGLLS--TEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDR

Query:  ERLWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEG
          +  K+++++   + +  T++  +H+     H G       +     W+G++  +++YV+ C  CQ NK+ +  P G LQPIP  +   E  SMDF+  
Subjt:  ERLWYKNRLVILKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEG

Query:  LPMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKI
        LP S   N + VVVDR SK A  +      TA+Q A  F +R+I+  G PK II D D I
Subjt:  LPMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRDKI

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein1.0e-7329.38Show/hide
Query:  LLKQYAEIFNPPTGLPPKR------DIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQ
        L ++Y EI      LPP+        + H I   PG +   ++PY      ++EI K+V+++L    I PS +P S+PV+LV K+ G +R CVDYR LN+
Subjt:  LLKQYAEIFNPPTGLPPKR------DIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQ

Query:  VT-----------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALM------------------------
         T           N    +  A IF+ LDL SGYHQI M+ +D  K AF T  G YE+ VMPFGL NAP+TF   M                        
Subjt:  VT-----------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALM------------------------

Query:  --------------------------------NQYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGF
                                         ++LG+ I  + +     K   ++++P P+ V   + FLG+  YYRRF+ N   IA P+ +L   +  
Subjt:  --------------------------------NQYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGF

Query:  KWDDQATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVL----------------------SQNGHPIA-------FFTVQRWRHYVLGRRF
        +W ++   A E+LK A+ + PVL   +    + + TDAS  G+GAVL                      +Q  +P            +  +R+ + G+ F
Subjt:  KWDDQATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVL----------------------SQNGHPIA-------FFTVQRWRHYVLGRRF

Query:  IVISDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLK-----
         + +D  +L  L  + E   + Q+WL  L  YDF + Y +G +N  ADA+SR            T  T   + TE+ +   K DP    ++ ++K     
Subjt:  IVISDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLK-----

Query:  ------------------LNPEAHPKYKWDRERLWYKNRLVI-LKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQR
                          L+      Y  + E ++Y++RLV+ +K    +  L H    ++ GGH G   T  +++   YW  ++  + +Y+  C  CQ 
Subjt:  ------------------LNPEAHPKYKWDRERLWYKNRLVI-LKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQR

Query:  NKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGL-PMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRD
         K+      GLLQP+PI +    + SMDFV GL P S ++N+I+VVVDR SK A+FI  +    A Q+       I S HG P++I +DRD
Subjt:  NKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGL-PMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRD

Q99315 Transposon Ty3-G Gag-Pol polyprotein1.8e-7329.23Show/hide
Query:  LLKQYAEIFNPPTGLPPKR------DIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQ
        L ++Y EI      LPP+        + H I   PG +   ++PY      ++EI K+V+++L    I PS +P S+PV+LV K+ G +R CVDYR LN+
Subjt:  LLKQYAEIFNPPTGLPPKR------DIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQ

Query:  VT-----------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALM------------------------
         T           N    +  A IF+ LDL SGYHQI M+ +D  K AF T  G YE+ VMPFGL NAP+TF   M                        
Subjt:  VT-----------NCWDELHEATIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALM------------------------

Query:  --------------------------------NQYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGF
                                         ++LG+ I  + +     K   ++++P P+ V   + FLG+  YYRRF+ N   IA P+ +L   +  
Subjt:  --------------------------------NQYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGF

Query:  KWDDQATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVL----------------------SQNGHPIA-------FFTVQRWRHYVLGRRF
        +W ++   A ++LK A+ + PVL   +    + + TDAS  G+GAVL                      +Q  +P            +  +R+ + G+ F
Subjt:  KWDDQATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVL----------------------SQNGHPIA-------FFTVQRWRHYVLGRRF

Query:  IVISDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLK-----
         + +D  +L  L  + E   + Q+WL  L  YDF + Y +G +N  ADA+SR            T  T   + TE+ +   K DP    ++ ++K     
Subjt:  IVISDQKALKFLLEQREVQPQFQKWLTKLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLK-----

Query:  ------------------LNPEAHPKYKWDRERLWYKNRLVI-LKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQR
                          L+      Y  + E ++Y++RLV+ +K    +  L H    ++ GGH G   T  +++   YW  ++  + +Y+  C  CQ 
Subjt:  ------------------LNPEAHPKYKWDRERLWYKNRLVI-LKHSPIIPTLLHTFHDSIMGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQR

Query:  NKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGL-PMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRD
         K+      GLLQP+PI +    + SMDFV GL P S ++N+I+VVVDR SK A+FI  +    A Q+       I S HG P++I +DRD
Subjt:  NKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGL-PMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAFVERIISKHGIPKSIITDRD

Arabidopsis top hitse value%identityAlignment
ATMG00850.1 DNA/RNA polymerases superfamily protein2.1e-0552.5Show/hide
Query:  LQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGW
        L++  ++  + EML+A II+PS +PYS+PVLLV+K+ GGW
Subjt:  LQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGW

ATMG00860.1 DNA/RNA polymerases superfamily protein5.0e-3161.39Show/hide
Query:  YLG--HLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDDQATVAFEQLKKAMISVPVLALPDWSLPF
        YLG  H+IS  GV AD  K++ M  WP+P++ T LRGFLGLTGYYRRFVKNYG I  PLT+LL+KN  KW + A +AF+ LK A+ ++PVLALPD  LPF
Subjt:  YLG--HLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIAAPLTKLLQKNGFKWDDQATVAFEQLKKAMISVPVLALPDWSLPF

Query:  V
        V
Subjt:  V


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCCAATGCTGCTTATTGAAACAATATGCTGAGATTTTTAATCCTCCAACAGGTTTACCACCCAAAAGGGATATAGACCATCAGATTTTGACTGCACCAGGGCAAAA
ACCCATTAATGTGAGACCATATAAGTACGAACATTTGCAAAAGGAAGAGATTGAGAAGTTGGTTGAAGAAATGCTTCAAGCTGGAATTATCCGCCCCAGCCACAACCCAT
ACTCCAACCCAGTGCTCTTAGTAAGAAAAGAGATGGGGGGATGGCGCTTCTGTGTTGATTACCGAAAACTGAACCAAGTGACTAATTGTTGGGATGAACTTCACGAGGCT
ACTATCTTCTCGAAACTGGACCTACGATCGGGCTACCACCAGATAAGAATGAAAGAGGAAGATGTGGGGAAGGCAGCCTTTCGTACTCATGAAGGACACTATGAGTTCTT
GGTGATGCCCTTTGGCCTGACAAACGCCCCTGCCACTTTCCAAGCACTGATGAACCAGTATCTTGGCCACTTGATATCAAACAGGGGAGTTGAGGCTGATGGAGAGAAAA
TCCAAGTAATGAAAGAGTGGCCCCAACCCAGAGATGTAACCAGTTTGAGGGGATTTTTGGGCCTCACGGGGTACTACAGAAGGTTTGTTAAAAATTATGGTGTGATTGCA
GCTCCTTTGACTAAGTTGCTTCAGAAGAATGGATTCAAATGGGATGACCAAGCAACTGTAGCTTTTGAACAACTAAAGAAAGCCATGATATCAGTTCCTGTATTAGCTTT
GCCAGACTGGTCATTGCCCTTTGTAGTAGAAACTGATGCTTCTGGAACAGGTTTAGGGGCTGTCCTATCCCAAAATGGCCATCCTATAGCTTTCTTCACCGTTCAGAGAT
GGAGACATTATGTTTTGGGACGTCGTTTCATAGTGATATCAGATCAGAAGGCATTAAAATTCTTGCTGGAACAGAGGGAAGTCCAACCTCAATTTCAGAAGTGGCTCACC
AAACTTCTAGGCTACGATTTTGAAATCTTATACCAATCGGGCTTGCAGAATAAAGCAGCAGATGCCCTCTCAAGAGTGGAACCAAGGAAAGCTACTGAGGCTGAACTGTC
TACCATGTCCACCTTGGGGTTACTAAGTACTGAAGCCATTCAGAAGGAAGTGAAAGAAGACCCTGATTTGCAGAAACTGATTGATAACCTGAAACTGAACCCAGAGGCTC
ATCCTAAGTACAAATGGGATAGGGAACGGTTGTGGTACAAAAATAGGTTGGTAATTCTCAAGCACTCACCTATAATACCAACTCTGTTGCACACTTTTCATGATTCCATC
ATGGGAGGCCATTCGGGGTTTCTAAGAACCTATAAAAGGTTGACAGGGGAACTGTACTGGGAGGGGATGAAATTGGATGTCAAGAAGTATGTTGAGCAGTGTGAGATATG
TCAGCGTAATAAAACAGATTCCACATTTCCATCTGGCTTACTACAACCTATCCCTATTCCAGATTTAATCTTGGAAGAATGGTCAATGGATTTTGTGGAAGGACTACCTA
TGTCAGGGCACATGAATGTGATCATGGTTGTGGTGGATCGGCTGAGCAAATATGCCTACTTTATTACCCTCAAACATCCGTTCACAGCCAAACAGGTTGCAGCAGCTTTC
GTTGAGAGAATAATAAGTAAGCACGGGATACCTAAATCTATCATTACAGACAGGGACAAAATATCCTCAGTAGTTTTTGGAAGGAGCTATTTTCTGTCATGGGGACCTCC
TTAA
mRNA sequenceShow/hide mRNA sequence
ATGATCCAATGCTGCTTATTGAAACAATATGCTGAGATTTTTAATCCTCCAACAGGTTTACCACCCAAAAGGGATATAGACCATCAGATTTTGACTGCACCAGGGCAAAA
ACCCATTAATGTGAGACCATATAAGTACGAACATTTGCAAAAGGAAGAGATTGAGAAGTTGGTTGAAGAAATGCTTCAAGCTGGAATTATCCGCCCCAGCCACAACCCAT
ACTCCAACCCAGTGCTCTTAGTAAGAAAAGAGATGGGGGGATGGCGCTTCTGTGTTGATTACCGAAAACTGAACCAAGTGACTAATTGTTGGGATGAACTTCACGAGGCT
ACTATCTTCTCGAAACTGGACCTACGATCGGGCTACCACCAGATAAGAATGAAAGAGGAAGATGTGGGGAAGGCAGCCTTTCGTACTCATGAAGGACACTATGAGTTCTT
GGTGATGCCCTTTGGCCTGACAAACGCCCCTGCCACTTTCCAAGCACTGATGAACCAGTATCTTGGCCACTTGATATCAAACAGGGGAGTTGAGGCTGATGGAGAGAAAA
TCCAAGTAATGAAAGAGTGGCCCCAACCCAGAGATGTAACCAGTTTGAGGGGATTTTTGGGCCTCACGGGGTACTACAGAAGGTTTGTTAAAAATTATGGTGTGATTGCA
GCTCCTTTGACTAAGTTGCTTCAGAAGAATGGATTCAAATGGGATGACCAAGCAACTGTAGCTTTTGAACAACTAAAGAAAGCCATGATATCAGTTCCTGTATTAGCTTT
GCCAGACTGGTCATTGCCCTTTGTAGTAGAAACTGATGCTTCTGGAACAGGTTTAGGGGCTGTCCTATCCCAAAATGGCCATCCTATAGCTTTCTTCACCGTTCAGAGAT
GGAGACATTATGTTTTGGGACGTCGTTTCATAGTGATATCAGATCAGAAGGCATTAAAATTCTTGCTGGAACAGAGGGAAGTCCAACCTCAATTTCAGAAGTGGCTCACC
AAACTTCTAGGCTACGATTTTGAAATCTTATACCAATCGGGCTTGCAGAATAAAGCAGCAGATGCCCTCTCAAGAGTGGAACCAAGGAAAGCTACTGAGGCTGAACTGTC
TACCATGTCCACCTTGGGGTTACTAAGTACTGAAGCCATTCAGAAGGAAGTGAAAGAAGACCCTGATTTGCAGAAACTGATTGATAACCTGAAACTGAACCCAGAGGCTC
ATCCTAAGTACAAATGGGATAGGGAACGGTTGTGGTACAAAAATAGGTTGGTAATTCTCAAGCACTCACCTATAATACCAACTCTGTTGCACACTTTTCATGATTCCATC
ATGGGAGGCCATTCGGGGTTTCTAAGAACCTATAAAAGGTTGACAGGGGAACTGTACTGGGAGGGGATGAAATTGGATGTCAAGAAGTATGTTGAGCAGTGTGAGATATG
TCAGCGTAATAAAACAGATTCCACATTTCCATCTGGCTTACTACAACCTATCCCTATTCCAGATTTAATCTTGGAAGAATGGTCAATGGATTTTGTGGAAGGACTACCTA
TGTCAGGGCACATGAATGTGATCATGGTTGTGGTGGATCGGCTGAGCAAATATGCCTACTTTATTACCCTCAAACATCCGTTCACAGCCAAACAGGTTGCAGCAGCTTTC
GTTGAGAGAATAATAAGTAAGCACGGGATACCTAAATCTATCATTACAGACAGGGACAAAATATCCTCAGTAGTTTTTGGAAGGAGCTATTTTCTGTCATGGGGACCTCC
TTAA
Protein sequenceShow/hide protein sequence
MIQCCLLKQYAEIFNPPTGLPPKRDIDHQILTAPGQKPINVRPYKYEHLQKEEIEKLVEEMLQAGIIRPSHNPYSNPVLLVRKEMGGWRFCVDYRKLNQVTNCWDELHEA
TIFSKLDLRSGYHQIRMKEEDVGKAAFRTHEGHYEFLVMPFGLTNAPATFQALMNQYLGHLISNRGVEADGEKIQVMKEWPQPRDVTSLRGFLGLTGYYRRFVKNYGVIA
APLTKLLQKNGFKWDDQATVAFEQLKKAMISVPVLALPDWSLPFVVETDASGTGLGAVLSQNGHPIAFFTVQRWRHYVLGRRFIVISDQKALKFLLEQREVQPQFQKWLT
KLLGYDFEILYQSGLQNKAADALSRVEPRKATEAELSTMSTLGLLSTEAIQKEVKEDPDLQKLIDNLKLNPEAHPKYKWDRERLWYKNRLVILKHSPIIPTLLHTFHDSI
MGGHSGFLRTYKRLTGELYWEGMKLDVKKYVEQCEICQRNKTDSTFPSGLLQPIPIPDLILEEWSMDFVEGLPMSGHMNVIMVVVDRLSKYAYFITLKHPFTAKQVAAAF
VERIISKHGIPKSIITDRDKISSVVFGRSYFLSWGPP