CuGenDBv2

Cmc02g0052481 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc02g0052481
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: CMiso1.1chr02:19913572..19916029
RNA-Seq Expression: Cmc02g0052481
Synteny: Cmc02g0052481
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025314 - Domain of unknown function DUF4219
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
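
The genome location in the record above has the form `sequence:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the string can be split into its parts and the span length derived. The names `Location` and `parse_location` are illustrative and not part of CuGenDB.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are ASSUMED to be 1-based, inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both endpoints are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(seqid, start, end)


loc = parse_location("CMiso1.1chr02:19913572..19916029")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive assumption the gene spans 2458 bp; with a half-open convention the length would be one less.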


Homology
GenBank top hits (e value, % identity, alignment)
KAA0062322.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 90.55
Query:  MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE
        MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE
Subjt:  MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE

Query:  NTYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEE
        NTYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEE
Subjt:  NTYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEE

Query:  KLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ------
        KLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ      
Subjt:  KLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ------

Query:  -----------KKIEENANYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKH
                    ++EENANYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKH
Subjt:  -----------KKIEENANYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKH

Query:  EFISNVYYVPNMKNNILSLGQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNM
        EFISNVYYVPNMKNNILSLGQLLEKGYNILMKD SLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNM
Subjt:  EFISNVYYVPNMKNNILSLGQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNM

Query:  VKGLPYVKLPDQLCEGCLHGKQSRKSFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGY
        VKGLPYVKLPDQLCEGCLHGKQSRKSFPQESS RARRPLELVHTDLCGPIKPSSF                                         ESGY
Subjt:  VKGLPYVKLPDQLCEGCLHGKQSRKSFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGY

Query:  YIKALRSDRGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRK
        YIKALRSDRGGEFTSNEFKTFC ENGIRR MTVPFTPQ  GVVERKNRTILNMARSMLKCKKMPKEFWAQ VECAVYLSNRSPTRSLWNKT QQAWTGRK
Subjt:  YIKALRSDRGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRK

Query:  PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP
        PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNP+TKKTI+SRDVVFDEEASWNWNDE EDYKF  FP
Subjt:  PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP
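
In BLAST-style pairwise output such as the blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below counts those marks for a single block; the function name `midline_stats` is hypothetical, and the result covers one block only, so it will not necessarily match the whole-alignment % identity reported for the hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block.

    `query` is the residue string of a Query line (gaps as '-') and
    `midline` is the residue string of the line beneath it.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    # Trailing spaces of the midline are often stripped; restore them.
    midline = midline.ljust(len(query))

    identical = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    aligned = len(query)
    return {
        "aligned": aligned,
        "identical": identical,
        "positives": identical + conservative,
        "percent_identity": 100.0 * identical / aligned if aligned else 0.0,
    }


# Last block of the first GenBank hit, copied from the alignment above.
query = "PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP"
midline = "PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNP+TKKTI+SRDVVFDEEASWNWNDE EDYKF  FP"
print(midline_stats(query, midline))
```

Summing `identical` and `aligned` over every block of a hit before dividing gives a whole-alignment figure comparable to the reported one.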

TYJ96309.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 91.16
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH
        MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH

Query:  MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG
        MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG
Subjt:  MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG

Query:  SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ-----------------KKIEENANYAEKDE
        SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ                  ++EENANYAEKDE
Subjt:  SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ-----------------KKIEENANYAEKDE

Query:  ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK
        ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK
Subjt:  ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK

Query:  GYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK
        GYNILMKD SLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK
Subjt:  GYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK

Query:  SFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSNEFKTFCAEN
        SFPQESS RARRPLELVHTDLCGPIKPSSF                                         ESGYYIKALRSDRGGEFTSNEFKTFC EN
Subjt:  SFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSNEFKTFCAEN

Query:  GIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL
        GIRR MTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQ VECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL
Subjt:  GIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL

Query:  DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP
        DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKF FFP
Subjt:  DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP

TYK06895.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 91.16
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH
        MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH

Query:  MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG
        MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG
Subjt:  MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG

Query:  SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ-----------------KKIEENANYAEKDE
        SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ                  ++EENANYAEKDE
Subjt:  SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ-----------------KKIEENANYAEKDE

Query:  ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK
        ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK
Subjt:  ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK

Query:  GYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK
        GYNILMKD SLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK
Subjt:  GYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK

Query:  SFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSNEFKTFCAEN
        SFPQESS RARRPLELVHTDLCGPIKPSSF                                         ESGYYIKALRSDRGGEFTSNEFKTFC EN
Subjt:  SFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSNEFKTFCAEN

Query:  GIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL
        GIRR MTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQ VECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL
Subjt:  GIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL

Query:  DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP
        DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKF FFP
Subjt:  DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP

TYK13816.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 91.44
Query:  MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE
        MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE
Subjt:  MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE

Query:  NTYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEE
        NTYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEE
Subjt:  NTYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEE

Query:  KLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ------
        KLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ      
Subjt:  KLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ------

Query:  -----------KKIEENANYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKH
                    ++EENANYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKH
Subjt:  -----------KKIEENANYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKH

Query:  EFISNVYYVPNMKNNILSLGQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNM
        EFISNVYYVPNMKNNILSLGQLLEKGYNILMKD SLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNM
Subjt:  EFISNVYYVPNMKNNILSLGQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNM

Query:  VKGLPYVKLPDQLCEGCLHGKQSRKSFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGY
        VKGLPYVKLPDQLCEGCLHGKQSRKSFPQESS RARRPLELVHTDLCGPIKPSSF                                         ESGY
Subjt:  VKGLPYVKLPDQLCEGCLHGKQSRKSFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGY

Query:  YIKALRSDRGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRK
        YIKALRSDRGGEFTSNEFKTFC ENGIRR MTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQ VECAVYLSNRSPTRSLWNKTPQQAWTGRK
Subjt:  YIKALRSDRGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRK

Query:  PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP
        PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKF FFP
Subjt:  PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP

TYK18362.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 91.16
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH
        MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH

Query:  MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG
        MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG
Subjt:  MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG

Query:  SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ-----------------KKIEENANYAEKDE
        SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ                  ++EENANYAEKDE
Subjt:  SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ-----------------KKIEENANYAEKDE

Query:  ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK
        ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK
Subjt:  ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK

Query:  GYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK
        GYNILMKD SLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK
Subjt:  GYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK

Query:  SFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSNEFKTFCAEN
        SFPQESS RARRPLELVHTDLCGPIKPSSF                                         ESGYYIKALRSDRGGEFTSNEFKTFC EN
Subjt:  SFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSNEFKTFCAEN

Query:  GIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL
        GIRR MTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQ VECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL
Subjt:  GIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL

Query:  DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP
        DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKF FFP
Subjt:  DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP

TrEMBL top hits (e value, % identity, alignment)
A0A5A7V277 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 0.0e+00 | % identity: 90.55
Query:  MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE
        MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE
Subjt:  MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE

Query:  NTYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEE
        NTYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEE
Subjt:  NTYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEE

Query:  KLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ------
        KLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ      
Subjt:  KLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ------

Query:  -----------KKIEENANYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKH
                    ++EENANYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKH
Subjt:  -----------KKIEENANYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKH

Query:  EFISNVYYVPNMKNNILSLGQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNM
        EFISNVYYVPNMKNNILSLGQLLEKGYNILMKD SLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNM
Subjt:  EFISNVYYVPNMKNNILSLGQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNM

Query:  VKGLPYVKLPDQLCEGCLHGKQSRKSFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGY
        VKGLPYVKLPDQLCEGCLHGKQSRKSFPQESS RARRPLELVHTDLCGPIKPSSF                                         ESGY
Subjt:  VKGLPYVKLPDQLCEGCLHGKQSRKSFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGY

Query:  YIKALRSDRGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRK
        YIKALRSDRGGEFTSNEFKTFC ENGIRR MTVPFTPQ  GVVERKNRTILNMARSMLKCKKMPKEFWAQ VECAVYLSNRSPTRSLWNKT QQAWTGRK
Subjt:  YIKALRSDRGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRK

Query:  PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP
        PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNP+TKKTI+SRDVVFDEEASWNWNDE EDYKF  FP
Subjt:  PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP

A0A5D3B8P4 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 0.0e+00 | % identity: 91.16
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH
        MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH

Query:  MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG
        MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG
Subjt:  MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG

Query:  SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ-----------------KKIEENANYAEKDE
        SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ                  ++EENANYAEKDE
Subjt:  SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ-----------------KKIEENANYAEKDE

Query:  ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK
        ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK
Subjt:  ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK

Query:  GYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK
        GYNILMKD SLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK
Subjt:  GYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK

Query:  SFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSNEFKTFCAEN
        SFPQESS RARRPLELVHTDLCGPIKPSSF                                         ESGYYIKALRSDRGGEFTSNEFKTFC EN
Subjt:  SFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSNEFKTFCAEN

Query:  GIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL
        GIRR MTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQ VECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL
Subjt:  GIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL

Query:  DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP
        DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKF FFP
Subjt:  DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP

A0A5D3C639 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 0.0e+00 | % identity: 91.16
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH
        MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH

Query:  MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG
        MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG
Subjt:  MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG

Query:  SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ-----------------KKIEENANYAEKDE
        SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ                  ++EENANYAEKDE
Subjt:  SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ-----------------KKIEENANYAEKDE

Query:  ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK
        ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK
Subjt:  ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK

Query:  GYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK
        GYNILMKD SLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK
Subjt:  GYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK

Query:  SFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSNEFKTFCAEN
        SFPQESS RARRPLELVHTDLCGPIKPSSF                                         ESGYYIKALRSDRGGEFTSNEFKTFC EN
Subjt:  SFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSNEFKTFCAEN

Query:  GIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL
        GIRR MTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQ VECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL
Subjt:  GIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL

Query:  DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP
        DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKF FFP
Subjt:  DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP

A0A5D3CTU3 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 0.0e+00 | % identity: 91.44
Query:  MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE
        MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE
Subjt:  MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE

Query:  NTYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEE
        NTYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEE
Subjt:  NTYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEE

Query:  KLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ------
        KLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ      
Subjt:  KLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ------

Query:  -----------KKIEENANYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKH
                    ++EENANYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKH
Subjt:  -----------KKIEENANYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKH

Query:  EFISNVYYVPNMKNNILSLGQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNM
        EFISNVYYVPNMKNNILSLGQLLEKGYNILMKD SLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNM
Subjt:  EFISNVYYVPNMKNNILSLGQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNM

Query:  VKGLPYVKLPDQLCEGCLHGKQSRKSFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGY
        VKGLPYVKLPDQLCEGCLHGKQSRKSFPQESS RARRPLELVHTDLCGPIKPSSF                                         ESGY
Subjt:  VKGLPYVKLPDQLCEGCLHGKQSRKSFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGY

Query:  YIKALRSDRGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRK
        YIKALRSDRGGEFTSNEFKTFC ENGIRR MTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQ VECAVYLSNRSPTRSLWNKTPQQAWTGRK
Subjt:  YIKALRSDRGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRK

Query:  PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP
        PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKF FFP
Subjt:  PSIGHLRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP

A0A5D3D497 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 0.0e+00 | % identity: 91.16
Query:  MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH
        MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH
Subjt:  MKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVKKVRLQKLRGDYESLH

Query:  MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG
        MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG
Subjt:  MKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKLKDKEG

Query:  SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ-----------------KKIEENANYAEKDE
        SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ                  ++EENANYAEKDE
Subjt:  SLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQ-----------------KKIEENANYAEKDE

Query:  ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK
        ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK
Subjt:  ESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEK

Query:  GYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK
        GYNILMKD SLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK
Subjt:  GYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRK

Query:  SFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSNEFKTFCAEN
        SFPQESS RARRPLELVHTDLCGPIKPSSF                                         ESGYYIKALRSDRGGEFTSNEFKTFC EN
Subjt:  SFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSNEFKTFCAEN

Query:  GIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL
        GIRR MTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQ VECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL
Subjt:  GIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKL

Query:  DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP
        DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKF FFP
Subjt:  DDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFP

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 6.7e-57 | % identity: 28.33
Query:  ANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILEN
        A  N+ PF       E Y+ W  R++ALL  QDV  +V +G    E D +  +A+R A            TII    D      ++ AT+   A QILEN
Subjt:  ANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILEN

Query:  TYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEK
             +R        LR    SL +    S+  +      +++E+   G  I +   +  +L +L   ++ I+ AIE   +   +++  +   L   E K
Subjt:  TYKGVDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEK

Query:  LLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQKKIEENA
        +   +   ++++  +   +     +  K N  + R   + +  FK  G   Y   K    +        +   HY R      NN  + +++Q +   + 
Subjt:  LLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQKKIEENA

Query:  NYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKI-------LINLKNGKHEFISNVYYVPN
          A   +E  ++S+          +N  + LDSGAS+H+   +S++ +  E V          KI V  +G+        ++ L+N     + +V +   
Subjt:  NYAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKI-------LINLKNGKHEFISNVYYVPN

Query:  MKNNILSLGQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKG---LPYVK
           N++S+ +L E G +I      + I  N   ++    M  N + ++N Q   A  + +  K+   +WH RFGH++   L  + RKNM      L  ++
Subjt:  MKNNILSLGQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKG---LPYVK

Query:  LPDQLCEGCLHGKQSRKSFPQ-ESSSRARRPLELVHTDLCGPIKPSSFESGYY-----------------------------------------IKALRS
        L  ++CE CL+GKQ+R  F Q +  +  +RPL +VH+D+CGPI P + +   Y                                         +  L  
Subjt:  LPDQLCEGCLHGKQSRKSFPQ-ESSSRARRPLELVHTDLCGPIKPSSFESGYY-----------------------------------------IKALRS

Query:  DRGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSL--WNKTPQQAWTGRKPSIGH
        D G E+ SNE + FC + GI   +TVP TPQ NGV ER  RTI   AR+M+   K+ K FW + V  A YL NR P+R+L   +KTP + W  +KP + H
Subjt:  DRGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSL--WNKTPQQAWTGRKPSIGH

Query:  LRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDE
        LRVFG   Y HI + K+ K DDKS K +FVGY+ +  G+KL++ V +K IV+RDVV DE
Subjt:  LRVFGCMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDE

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 3.1e-70 | % identity: 28.7
Query:  VPFQVPRLTKEN-YSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKG
        V ++V +   +N +S+W  RM+ LL  Q +  ++    ++P++  A + A           D++A + I   + D+    I    TA   W  LE+ Y  
Subjt:  VPFQVPRLTKEN-YSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKG

Query:  VDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKK
             K+ L+K      +LHM E  +   + +    ++ ++   G  I +E     +L SL   ++ +   I   K  +T+ +  +  +L  + EK+ KK
Subjt:  VDRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKK

Query:  NKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQKKIEENAN---
         +   + L       + +  S ++ +   GR G RG    K + +     R     N   +  R        +       ND   D     ++ N N   
Subjt:  NKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQKKIEENAN---

Query:  YAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSL
        +  ++EE       +   G E    S W +D+ AS+H    + +F        G +  G+ +   + G G I I    G    + +V +VP+++ N++S 
Subjt:  YAEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSL

Query:  GQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWI--WHLRFGHLNFDGLRLLARKNMVKGLPYVK-LPDQLCEG
          L   GY     +    +      +IAK  + +  ++  N   ++ Q   +  +D   +  WH R GH++  GL++LA+K+++    Y K    + C+ 
Subjt:  GQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWI--WHLRFGHLNFDGLRLLARKNMVKGLPYVK-LPDQLCEG

Query:  CLHGKQSRKSFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSN
        CL GKQ R SF Q SS R    L+LV++D+CGP++  S                                          E+G  +K LRSD GGE+TS 
Subjt:  CLHGKQSRKSFPQESSSRARRPLELVHTDLCGPIKPSSF-----------------------------------------ESGYYIKALRSDRGGEFTSN

Query:  EFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAH
        EF+ +C+ +GIR   TVP TPQ NGV ER NRTI+   RSML+  K+PK FW + V+ A YL NRSP+  L  + P++ WT ++ S  HL+VFGC A+AH
Subjt:  EFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAH

Query:  IPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFPTNV
        +P ++R+KLDDKS   +F+GY     GY+L++PV KK I SRDVVF E       D  E  K    P  V
Subjt:  IPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFPTNV

Q12501 Transposon Ty2-OR2 Gag-Pol polyprotein (e value: 1.6e-13; % identity: 20.69)
Query:  NNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAW--QILEN
        NN++P   P  ++EN+S+W       L + ++ DI+ N   E                      ++ +T    A   N F+  +        W  QILE 
Subjt:  NNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAW--QILEN

Query:  TYKGVDRVKKVRLQKLRGDYE---------SLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLM
         Y  +  V    + K++ + +         +L    S S   +   +  ++  +K     +SD    + IL+ L   F ++     + +  + M + QL 
Subjt:  TYKGVDRVKKVRLQKLRGDYE---------SLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLM

Query:  GSLQA--HEEKLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRY
          +Q    E K++  NK       Q K   + K  S    N    +                   R +  +NS S     +     + S + R NND   
Subjt:  GSLQA--HEEKLLKKNKQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRY

Query:  DKRQKKIEENANYAEKDEESGDSSLFLACKGAETCENS-----AWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEF
        +        ++ Y   D E          K   T +++        +DSGAS  +  S         +   +IV      IP+   G +  N +NG    
Subjt:  DKRQKKIEENANYAEKDEESGDSSLFLACKGAETCENS-----AWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEF

Query:  ISNVYYVPNMKNNILSLGQLLEKGYNILMKDCSL----------LIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGL
        I    + PN+  ++LSL +L  +         +L          +++      ++K  +  + +  L I        KS  K P  + H   GH NF  +
Subjt:  ISNVYYVPNMKNNILSLGQLLEKGYNILMKDCSL----------LIRDNHDKIIAKVQMTKNRMFLLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGL

Query:  RLLARKNMV-----KGLPYVKLPDQLCEGCLHGKQSRKSFPQESS---SRARRPLELVHTDLCGPI------KPSSFES---------------------
        +   +KN V       + +       C  CL GK ++    + S      +  P + +HTD+ GP+       PS F S                     
Subjt:  RLLARKNMV-----KGLPYVKLPDQLCEGCLHGKQSRKSFPQESS---SRARRPLELVHTDLCGPI------KPSSFES---------------------

Query:  ----------------GYYIKALRSDRGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNR-
                           +  ++ DRG E+T+     F    GI    T     + +GV ER NRT+LN  R++L C  +P   W   VE +  + N  
Subjt:  ----------------GYYIKALRSDRGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNR-

Query:  -SPTRSLWNKTPQQAWTGRKPSIGHLRVFG--CMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVV
         SP     +K+ +Q        I  +  FG   +   H PD   SK+  +      +    +S GY +Y P  KKT+ + + V
Subjt:  -SPTRSLWNKTPQQAWTGRKPSIGHLRVFG--CMAYAHIPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 3.1e-38; % identity: 23.47)
Query:  VPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKAL--TIIHQAIDDNNFEKISGATTAYQAWQILENTY--KGV
        V +LT  NY  W  ++ AL    ++   +      P +    + A R     TR K Q  L  + +  AI  +    +S ATTA Q W+ L   Y     
Subjt:  VPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKAL--TIIHQAIDDNNFEKISGATTAYQAWQILENTY--KGV

Query:  DRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKN
          V ++R Q  +        K ++++ DY   L+   +++   G+ +  ++ VE++L +L E++  ++  I  +KD +  ++ ++   L  HE K+L  +
Subjt:  DRVKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKN

Query:  KQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSS--WERSNNDRRYDKRQKKIEE-NANY
              +  + +  ++   +    N   G   NR      D    +   + + +S++N + +  + + +  +      + ++ +R  + Q  +   N+  
Subjt:  KQMTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSS--WERSNNDRRYDKRQKKIEE-NANY

Query:  AEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGS-KSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSL
                     LA     +  N  W LDSGA++H+     ++ +    + G D++  D + IP+   G   ++ K+ +   + N+ YVPN+  N++S+
Subjt:  AEKDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGS-KSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSL

Query:  GQLLE-KGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTD--VAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGL-PYVKLPDQLCE
         +L    G ++     S  ++D +  +      TK+ ++   I +   V+       K  +  WH R GH     L  +     +  L P  K     C 
Subjt:  GQLLE-KGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTD--VAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGL-PYVKLPDQLCE

Query:  GCLHGKQSRKSFPQESSSRARRPLELVHTDLCG--------------------------PIKPSS------------FESGYY--IKALRSDRGGEFTSN
         CL  K ++  F Q S+  + RPLE +++D+                            P+K  S             E+ +   I    SD GGEF + 
Subjt:  GCLHGKQSRKSFPQESSSRARRPLELVHTDLCG--------------------------PIKPSS------------FESGYY--IKALRSDRGGEFTSN

Query:  EFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAH
            + +++GI  L + P TP+ NG+ ERK+R I+    ++L    +PK +W      AVYL NR PT  L  ++P Q   G  P+   LRVFGC  Y  
Subjt:  EFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAH

Query:  IPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFPTNVMSLVTLLLHQHRQS
        +    + KLDDKS + VF+GY  +   Y   +  T +  +SR V FDE               CF  +N ++ ++ +  Q R+S
Subjt:  IPDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFPTNVMSLVTLLLHQHRQS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.8e-33; % identity: 22.89)
Query:  VPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNT--RKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDR
        V +LT  NY  W  ++ AL    ++   +      P +    +   R     T  R++D+   + I  AI  +    +S ATTA Q W+ L   Y     
Subjt:  VPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNT--RKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDR

Query:  VKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQ
            +L                       R +   +++   G+ +  ++ VE++L +L + +  ++  I  +KD +  S+ ++   L   E KLL  N  
Subjt:  VKKVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQ

Query:  MTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSR---SSWERSNNDRRYDKRQKKIEENANYAE
            +  + +  ++   +  + NRG  R  N                  +  S+S S S   + + +  R    S +  +  R     Q +   N   + 
Subjt:  MTEQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSR---SSWERSNNDRRYDKRQKKIEENANYAE

Query:  KDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGS-KSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQ
                   LA        N  W LDSGA++H+     ++      + G D++  D + IP+   G   +   +   + ++ V YVPN+  N++S+ +
Subjt:  KDEESGDSSLFLACKGAETCENSAWYLDSGASNHMCGS-KSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQ

Query:  LLEKG-YNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTD--VAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQL--CEG
        L      ++     S  ++D +  +      TK+ ++   I +   V+     C K  +  WH R GH     L +L        LP +    +L  C  
Subjt:  LLEKG-YNILMKDCSLLIRDNHDKIIAKVQMTKNRMFLLNIQTD--VAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQL--CEG

Query:  CLHGKQSRKSFPQESSSRARRPLELVHTDLCG--------------------------PIKPSSFESGYY--------------IKALRSDRGGEFTSNE
        C   K  +  F   S+  + +PLE +++D+                            P+K  S     +              I  L SD GGEF    
Subjt:  CLHGKQSRKSFPQESSSRARRPLELVHTDLCG--------------------------PIKPSSFESGYY--------------IKALRSDRGGEFTSNE

Query:  FKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHI
         + + +++GI    + P TP+ NG+ ERK+R I+ M  ++L    +PK +W      AVYL NR PT  L  ++P Q   G+ P+   L+VFGC  Y  +
Subjt:  FKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHI

Query:  PDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDE
            R KL+DKS++  F+GY  +   Y   +  T +   SR V FDE
Subjt:  PDQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDE

Arabidopsis top hits (e value, % identity, alignment)
AT1G48720.1 unknown protein (e value: 2.0e-24; % identity: 54.35)
Query:  MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTA
        MA+NN VPFQVP LTK NY +W +RMKA+LG+ DVW+IV  G+ EPE++ +L+Q Q++ L+++RK+D+KAL +I+Q +D++ FEK+  AT+A
Subjt:  MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTA

AT3G21000.1 Gag-Pol-related retrotransposon family protein (e value: 5.9e-16; % identity: 23.13)
Query:  NYSSWCIRMKALLGSQDVWDIVSNGY-----EEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE--NTYKGVDRVK
        +Y  W    K+ L  Q +WD+V NG      + PE  A +   +    ++   KD KAL I+  ++ D+ F K   A++A   W +L   N    + R++
Subjt:  NYSSWCIRMKALLGSQDVWDIVSNGY-----EEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILE--NTYKGVDRVK

Query:  KVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMT
        +V +++L    E L M + ES S Y  + L ++  + R     SD ++ + +  +L   F+ +   +EE  D+  M+   L+      E    + ++  T
Subjt:  KVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMT

Query:  EQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQKKIEENANYAEKDEES
        E+     LK                                        +    S S +  G  + +  + E        DK +K+ E   +Y  +   +
Subjt:  EQLFQSKLKLKDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQKKIEENANYAEKDEES

Query:  GDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEKGY
                  GA+T ++  W +   A  +M      F  LD +    +   D T + V+GKG + I +K GK + I NV +VP +  N+LS G+++ K Y
Subjt:  GDSSLFLACKGAETCENSAWYLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEKGY

Query:  NI
        +I
Subjt:  NI

ATMG00300.1 Gag-Pol-related retrotransposon family protein (e value: 5.9e-08; % identity: 27.91)
Query:  LMKDCSLLIRDN-HDKIIAKVQMTKNRMFLLNIQTDVAQC-LKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRKSF
        ++K C  +++ N HD            +++L    +  +  L    KD   +WH R  H++  G+ LL +K  +       L  + CE C++GK  R +F
Subjt:  LMKDCSLLIRDN-HDKIIAKVQMTKNRMFLLNIQTDVAQC-LKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRKSF

Query:  PQESSSRARRPLELVHTDLCG-PIKPSSF
                + PL+ VH+DL G P  P SF
Subjt:  PQESSSRARRPLELVHTDLCG-PIKPSSF

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein (e value: 1.5e-08; % identity: 35.29)
Query:  NRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKLDDKSEK
        NRTI+   RSML    +PK F A     AV++ N+ P+ ++    P + W    P+  +LR FGC+AY H  + K      K E+
Subjt:  NRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIPDQKRSKLDDKSEK


Sequences
CDS sequence
ATGGCAAACAACAATTTAGTTCCCTTCCAAGTACCTCGACTTACGAAAGAAAATTATAGTAGTTGGTGTATTCGAATGAAAGCTCTACTTGGTTCACAAGATGTGTGGGA
CATTGTTAGTAATGGTTATGAAGAACCAGAAAGTGATGCAGCTTTGAATCAAGCTCAACGAGAAGCTTTACAAAATACAAGAAAAAAAGATCAAAAGGCTCTCACCATCA
TTCATCAAGCCATTGATGATAACAATTTTGAGAAAATTTCTGGAGCAACTACTGCATATCAAGCATGGCAAATTTTGGAGAATACGTATAAAGGAGTAGATCGAGTCAAG
AAGGTTCGCCTTCAAAAATTGAGAGGTGATTATGAATCACTACATATGAAGGAGTCTGAATCAGTTTCAGATTATACTTCAAGATTGCTAGCAGTAGTAAATGAAATGAA
AAGATTTGGTGAGACAATAAGCGATGAGCAAGTAGTAGAAAAGATACTTCGCTCATTAGATGAAAAATTTAATTTCATCGTTGTAGCTATTGAAGAATCAAAGGATTTGA
GTACAATGTCCATTGATCAACTTATGGGTTCTTTACAAGCCCATGAAGAGAAGCTTCTTAAGAAGAACAAGCAGATGACTGAGCAACTTTTTCAGTCAAAGTTGAAATTA
AAAGACAAGGAAGGCAGCCTAGAAAAAGGAAATCGAGGTCGAGGACGTGGTGGTAATCGTGGACGTGGTGATTTCAAAGATCGAGGTCAAGGAAGCTACGGTCAAAGAAA
ATTTGATGAGAGTAATTCAAACTCAAATTCATCAAGAGGTAGAGGAAGACAACATTATTCGAGGTCAAGTTGGGAAAGATCAAATAATGACAGGAGATATGACAAAAGAC
AGAAAAAAATTGAAGAAAATGCAAATTATGCTGAGAAAGATGAAGAAAGCGGTGATTCCTCATTGTTTCTAGCATGCAAGGGTGCTGAAACATGTGAAAACAGTGCATGG
TATCTCGATAGTGGTGCAAGCAATCACATGTGTGGAAGTAAATCAATGTTCATTGAACTTGATGAATCTGTTGGTGGCGATATCGTATTTGGTGATGCCACAAAAATTCC
AGTTAAAGGAAAAGGTAAGATTTTGATCAATTTGAAGAATGGGAAGCATGAGTTTATCTCTAATGTTTATTATGTGCCTAATATGAAGAACAACATTTTGAGTTTGGGAC
AACTCTTAGAGAAAGGCTATAATATTTTGATGAAGGATTGTAGTCTTTTGATAAGAGATAATCATGACAAAATTATTGCTAAAGTGCAAATGACGAAAAATAGAATGTTT
TTATTAAACATTCAAACTGATGTTGCTCAATGTTTAAAGTCATGTTTGAAAGATCCCAATTGGATTTGGCACTTGAGATTTGGGCATTTGAACTTTGATGGCTTAAGATT
ATTAGCCAGGAAGAACATGGTGAAAGGGTTGCCATATGTCAAACTTCCAGATCAACTTTGTGAAGGTTGTCTTCATGGCAAACAATCAAGGAAGAGTTTTCCACAAGAAT
CATCTTCGAGAGCAAGGAGACCACTGGAGTTAGTTCACACTGATCTTTGTGGACCGATCAAACCAAGTTCTTTTGAAAGTGGTTATTACATTAAAGCTTTGAGATCAGAC
AGGGGAGGTGAATTCACTTCAAATGAATTCAAAACTTTTTGCGCAGAAAATGGAATCCGTCGCCTTATGACAGTTCCATTTACTCCTCAACAAAATGGTGTTGTTGAAAG
GAAGAACCGAACAATACTTAACATGGCTCGAAGCATGTTGAAGTGTAAGAAGATGCCAAAAGAATTTTGGGCACAAGTTGTTGAGTGTGCAGTGTACTTGTCAAATCGTT
CCCCTACTAGAAGCTTATGGAACAAAACTCCTCAACAAGCATGGACAGGAAGAAAACCATCCATTGGTCATTTGAGAGTATTCGGATGCATGGCTTATGCGCATATACCT
GATCAAAAGCGTAGTAAGCTTGATGATAAAAGTGAGAAATATGTGTTTGTTGGCTATGATGCAAGCTCAAAAGGCTACAAGCTTTATAATCCTGTTACAAAGAAGACGAT
CGTAAGCAGAGATGTTGTGTTTGATGAAGAAGCATCATGGAATTGGAATGACGAACCAGAAGATTACAAATTTTGTTTTTTCCCGACGAACGTGATGAGCCTAGTGACAT
TGCTTCTCCACCAACATCGCCAATCACTCCACAACAAAGCACATCTTCATCATCTGCAAGTTCAAGTGAAGGGCCTCGTGGCATGA
mRNA sequence
ATGGCAAACAACAATTTAGTTCCCTTCCAAGTACCTCGACTTACGAAAGAAAATTATAGTAGTTGGTGTATTCGAATGAAAGCTCTACTTGGTTCACAAGATGTGTGGGA
CATTGTTAGTAATGGTTATGAAGAACCAGAAAGTGATGCAGCTTTGAATCAAGCTCAACGAGAAGCTTTACAAAATACAAGAAAAAAAGATCAAAAGGCTCTCACCATCA
TTCATCAAGCCATTGATGATAACAATTTTGAGAAAATTTCTGGAGCAACTACTGCATATCAAGCATGGCAAATTTTGGAGAATACGTATAAAGGAGTAGATCGAGTCAAG
AAGGTTCGCCTTCAAAAATTGAGAGGTGATTATGAATCACTACATATGAAGGAGTCTGAATCAGTTTCAGATTATACTTCAAGATTGCTAGCAGTAGTAAATGAAATGAA
AAGATTTGGTGAGACAATAAGCGATGAGCAAGTAGTAGAAAAGATACTTCGCTCATTAGATGAAAAATTTAATTTCATCGTTGTAGCTATTGAAGAATCAAAGGATTTGA
GTACAATGTCCATTGATCAACTTATGGGTTCTTTACAAGCCCATGAAGAGAAGCTTCTTAAGAAGAACAAGCAGATGACTGAGCAACTTTTTCAGTCAAAGTTGAAATTA
AAAGACAAGGAAGGCAGCCTAGAAAAAGGAAATCGAGGTCGAGGACGTGGTGGTAATCGTGGACGTGGTGATTTCAAAGATCGAGGTCAAGGAAGCTACGGTCAAAGAAA
ATTTGATGAGAGTAATTCAAACTCAAATTCATCAAGAGGTAGAGGAAGACAACATTATTCGAGGTCAAGTTGGGAAAGATCAAATAATGACAGGAGATATGACAAAAGAC
AGAAAAAAATTGAAGAAAATGCAAATTATGCTGAGAAAGATGAAGAAAGCGGTGATTCCTCATTGTTTCTAGCATGCAAGGGTGCTGAAACATGTGAAAACAGTGCATGG
TATCTCGATAGTGGTGCAAGCAATCACATGTGTGGAAGTAAATCAATGTTCATTGAACTTGATGAATCTGTTGGTGGCGATATCGTATTTGGTGATGCCACAAAAATTCC
AGTTAAAGGAAAAGGTAAGATTTTGATCAATTTGAAGAATGGGAAGCATGAGTTTATCTCTAATGTTTATTATGTGCCTAATATGAAGAACAACATTTTGAGTTTGGGAC
AACTCTTAGAGAAAGGCTATAATATTTTGATGAAGGATTGTAGTCTTTTGATAAGAGATAATCATGACAAAATTATTGCTAAAGTGCAAATGACGAAAAATAGAATGTTT
TTATTAAACATTCAAACTGATGTTGCTCAATGTTTAAAGTCATGTTTGAAAGATCCCAATTGGATTTGGCACTTGAGATTTGGGCATTTGAACTTTGATGGCTTAAGATT
ATTAGCCAGGAAGAACATGGTGAAAGGGTTGCCATATGTCAAACTTCCAGATCAACTTTGTGAAGGTTGTCTTCATGGCAAACAATCAAGGAAGAGTTTTCCACAAGAAT
CATCTTCGAGAGCAAGGAGACCACTGGAGTTAGTTCACACTGATCTTTGTGGACCGATCAAACCAAGTTCTTTTGAAAGTGGTTATTACATTAAAGCTTTGAGATCAGAC
AGGGGAGGTGAATTCACTTCAAATGAATTCAAAACTTTTTGCGCAGAAAATGGAATCCGTCGCCTTATGACAGTTCCATTTACTCCTCAACAAAATGGTGTTGTTGAAAG
GAAGAACCGAACAATACTTAACATGGCTCGAAGCATGTTGAAGTGTAAGAAGATGCCAAAAGAATTTTGGGCACAAGTTGTTGAGTGTGCAGTGTACTTGTCAAATCGTT
CCCCTACTAGAAGCTTATGGAACAAAACTCCTCAACAAGCATGGACAGGAAGAAAACCATCCATTGGTCATTTGAGAGTATTCGGATGCATGGCTTATGCGCATATACCT
GATCAAAAGCGTAGTAAGCTTGATGATAAAAGTGAGAAATATGTGTTTGTTGGCTATGATGCAAGCTCAAAAGGCTACAAGCTTTATAATCCTGTTACAAAGAAGACGAT
CGTAAGCAGAGATGTTGTGTTTGATGAAGAAGCATCATGGAATTGGAATGACGAACCAGAAGATTACAAATTTTGTTTTTTCCCGACGAACGTGATGAGCCTAGTGACAT
TGCTTCTCCACCAACATCGCCAATCACTCCACAACAAAGCACATCTTCATCATCTGCAAGTTCAAGTGAAGGGCCTCGTGGCATGA
Protein sequence
MANNNLVPFQVPRLTKENYSSWCIRMKALLGSQDVWDIVSNGYEEPESDAALNQAQREALQNTRKKDQKALTIIHQAIDDNNFEKISGATTAYQAWQILENTYKGVDRVK
KVRLQKLRGDYESLHMKESESVSDYTSRLLAVVNEMKRFGETISDEQVVEKILRSLDEKFNFIVVAIEESKDLSTMSIDQLMGSLQAHEEKLLKKNKQMTEQLFQSKLKL
KDKEGSLEKGNRGRGRGGNRGRGDFKDRGQGSYGQRKFDESNSNSNSSRGRGRQHYSRSSWERSNNDRRYDKRQKKIEENANYAEKDEESGDSSLFLACKGAETCENSAW
YLDSGASNHMCGSKSMFIELDESVGGDIVFGDATKIPVKGKGKILINLKNGKHEFISNVYYVPNMKNNILSLGQLLEKGYNILMKDCSLLIRDNHDKIIAKVQMTKNRMF
LLNIQTDVAQCLKSCLKDPNWIWHLRFGHLNFDGLRLLARKNMVKGLPYVKLPDQLCEGCLHGKQSRKSFPQESSSRARRPLELVHTDLCGPIKPSSFESGYYIKALRSD
RGGEFTSNEFKTFCAENGIRRLMTVPFTPQQNGVVERKNRTILNMARSMLKCKKMPKEFWAQVVECAVYLSNRSPTRSLWNKTPQQAWTGRKPSIGHLRVFGCMAYAHIP
DQKRSKLDDKSEKYVFVGYDASSKGYKLYNPVTKKTIVSRDVVFDEEASWNWNDEPEDYKFCFFPTNVMSLVTLLLHQHRQSLHNKAHLHHLQVQVKGLVA