CuGenDBv2

Cmc09g0249491 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc09g0249491
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-pol polyprotein
Genome location: CMiso1.1chr09:15389327..15390880
RNA-Seq Expression: Cmc09g0249491
Synteny: Cmc09g0249491
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
  IPR043502 - DNA/RNA polymerase superfamily
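
The genome location above can be split into a sequence ID and a coordinate pair. The sketch below is only an illustration: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which this record does not state explicitly.

```python
import re

# Hypothetical helper: split a "seqid:start..end" location string.
def parse_location(location):
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return seqid, start, end

# Assumption: coordinates are 1-based and inclusive, so both ends count.
def span_length(start, end):
    return end - start + 1

seqid, start, end = parse_location("CMiso1.1chr09:15389327..15390880")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the locus spans 1554 bp, which is the length expected for an intron-less coding sequence of 517 codons plus a stop codon.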


Homology
GenBank top hits | e-value | % identity (alignment follows each hit)
KAA0035157.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 7.4e-198 | 75.2
Query:  HNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDE
        H RVTIR+ TTVTLYELWK+RK NVKYFHVFGSTCYILADREY +KWDA+SEQGIFLGYSQN+ AYRV+NNRS  VMETINVVINDL+  IKQ+NDEEDE
Subjt:  HNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDE

Query:  TSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVD-SALR
        T NMSE RTTS+VE  KA   SD   KSL+KS +E I KK +LI SAHVKKNHPASSIIGDPS GMQTRRK+KIDY+KMVADLCYIST+E STVD SAL+
Subjt:  TSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVD-SALR

Query:  DEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMD
        DEYWLNAMQEELLQFR+NNVW LV KP GVNVIGTKW+FKNKTDE GCVTKNKA+LVAQGYTQVEG+DFDETFA VARLEAIRLLLGISCIQKFKLYQMD
Subjt:  DEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMD

Query:  VKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVN
        VKS FL+GYLNEEVYVAQPKGFVD EHPKH+YKLNKALYGLKQA  AWYD+LTVYLRG+GYSRGEIDKTLFI  KSDQLLV QIYVDDIIF GFP  L  
Subjt:  VKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVN

Query:  NFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYITAS
                                                       ARNKRTP   HVKLTKDTE +EVDHKLYRSIVGSLLY+TAS
Subjt:  NFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYITAS

KAA0053137.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 6.3e-197 | 71.66
Query:  ESVNIACHIHNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAI
        E+VN ACHIHNRV IR+ TT+TLYELWK+RK NVKYFHVFGSTCY+LADREYH+KWDA+SEQGIFLGYSQN+RAY+V+NN+SG VMETINVVINDL+  I
Subjt:  ESVNIACHIHNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAI

Query:  KQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEP
        KQ+NDEED+T NM E R                                                                                   
Subjt:  KQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEP

Query:  STVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQ
         T++SAL+DEYWLN MQEELLQFR+NNVWTL+SKPEGVNVIGTKW+FKNKTDE GCVTKNKA+LVAQGYTQVEG+DFDETFA VARLEAIRLLLGISCIQ
Subjt:  STVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQ

Query:  KFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFR
        KFKLYQ+DVKS FL+GYLNEEVYVAQPKGFVDSEHPKH+YKLNKALYGLKQA RAWYD+LTVYLRG+GYSRGEIDK LFI RKSDQLLVAQIYVDDIIF 
Subjt:  KFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFR

Query:  GFPHDLVNNFI-----EFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYITA
        GFP DL+NNFI     EFEMSMVGELSCFLGLQIKQKND IFISQEKYARNMVKKFGL+QARNKRTP ATHVKLTKDTE +EVDHKLYRSIVGSLLY+TA
Subjt:  GFPHDLVNNFI-----EFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYITA

Query:  S
        S
Subjt:  S

KAA0066164.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 2.5e-246 | 99.77
Query:  REYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKK
        REYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKK
Subjt:  REYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKK

Query:  LELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKN
        LELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKN
Subjt:  LELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKN

Query:  KTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGL
        KTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGL
Subjt:  KTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGL

Query:  KQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVNNFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVK
        KQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVNNFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVK
Subjt:  KQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVNNFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVK

Query:  KFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSI
        KFGLEQARNKRTP ATHVKLTKDTESSEVDHKLYRSI
Subjt:  KFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSI

TYJ98791.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 3.8e-218 | 87.95
Query:  RVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDETS
        RVTIRS  TVTL+ELWKDRK NVKYFHVFGSTCYILADREY +KWDAKSEQGIFLGYSQNSRAYRVFNNR   VMETINVVIND++ AIKQINDEEDE  
Subjt:  RVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDETS

Query:  NMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSALRDEY
        NMSEARTTSSV+  KA  PSDDS K LEKS +E+ITKK ELIS AHVKKNHPASSIIGDPS GMQTRRKEKIDYMKMVADLCYIST EPSTVD +LRDEY
Subjt:  NMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSALRDEY

Query:  WLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDVKS
         LNAMQEELLQF++NNVWTLV KPEGVNVIGTKWVFKNKTDEAGCVTKNKA+LVAQGYTQVEGIDFDETF+ VARLEAIRLLLGISCIQKFKLYQMDVKS
Subjt:  WLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDVKS

Query:  AFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVNNFI
        AFL+GYLNEEVYVAQPK FVDSEH KH+YKLNKALYGLKQA RAWYD+LTVYLRGKGYSRGEIDKTLFI RKSDQLLVAQIYVDDIIF GFP  LVNNFI
Subjt:  AFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVNNFI

Query:  -----EFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLE
             EFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFG +
Subjt:  -----EFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLE

TYK29237.1 gag-pol polyprotein [Cucumis melo var. makuwa] | 5.1e-199 | 79.05
Query:  HNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDE
        H RVTIR+ TTVTLYELWK+RK NVKYFHVFGSTCYILADREY +KWDA+SEQGIFLGYSQN+ AYRV+NNRS  VMETINVVINDL   IKQ+NDEEDE
Subjt:  HNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDE

Query:  TSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVD-SALR
        T NMSE RTTS+VE  KA   SD   KSL+KSS+E I KK +LISSAHVKKNHPASSIIGDPS GMQTRRK+KIDY+KMVADLCYIS++E STVD SA +
Subjt:  TSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVD-SALR

Query:  DEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMD
        DEYWLNAMQEELL+ R NNVW LV KP GVNVIGTKW+FKNKTDE GCVTKNKA+LVAQGYTQVEG+DFDETFA VARLEAIRLLLGISCIQKFKLYQMD
Subjt:  DEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMD

Query:  VKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVN
        VKS FL+GYLNEEVYVAQPKGFVD EHPKH+YKLNKALYGLKQA  AWYD+LTVYLRG+GYSRGEIDKTLFI  KSDQLLVAQIYVDDIIF GFP  L  
Subjt:  VKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVN

Query:  NFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTK
                              QKNDDIFISQEKYA+NMVKKFGLE ARNKRTP   HVKL K
Subjt:  NFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTK

TrEMBL top hits | e-value | % identity (alignment follows each hit)
A0A5A7T0Q0 Gag-pol polyprotein | 3.6e-198 | 75.2
Query:  HNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDE
        H RVTIR+ TTVTLYELWK+RK NVKYFHVFGSTCYILADREY +KWDA+SEQGIFLGYSQN+ AYRV+NNRS  VMETINVVINDL+  IKQ+NDEEDE
Subjt:  HNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDE

Query:  TSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVD-SALR
        T NMSE RTTS+VE  KA   SD   KSL+KS +E I KK +LI SAHVKKNHPASSIIGDPS GMQTRRK+KIDY+KMVADLCYIST+E STVD SAL+
Subjt:  TSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVD-SALR

Query:  DEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMD
        DEYWLNAMQEELLQFR+NNVW LV KP GVNVIGTKW+FKNKTDE GCVTKNKA+LVAQGYTQVEG+DFDETFA VARLEAIRLLLGISCIQKFKLYQMD
Subjt:  DEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMD

Query:  VKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVN
        VKS FL+GYLNEEVYVAQPKGFVD EHPKH+YKLNKALYGLKQA  AWYD+LTVYLRG+GYSRGEIDKTLFI  KSDQLLV QIYVDDIIF GFP  L  
Subjt:  VKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVN

Query:  NFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYITAS
                                                       ARNKRTP   HVKLTKDTE +EVDHKLYRSIVGSLLY+TAS
Subjt:  NFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYITAS

A0A5D3BJA9 Gag-pol polyprotein | 1.8e-218 | 87.95
Query:  RVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDETS
        RVTIRS  TVTL+ELWKDRK NVKYFHVFGSTCYILADREY +KWDAKSEQGIFLGYSQNSRAYRVFNNR   VMETINVVIND++ AIKQINDEEDE  
Subjt:  RVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDETS

Query:  NMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSALRDEY
        NMSEARTTSSV+  KA  PSDDS K LEKS +E+ITKK ELIS AHVKKNHPASSIIGDPS GMQTRRKEKIDYMKMVADLCYIST EPSTVD +LRDEY
Subjt:  NMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSALRDEY

Query:  WLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDVKS
         LNAMQEELLQF++NNVWTLV KPEGVNVIGTKWVFKNKTDEAGCVTKNKA+LVAQGYTQVEGIDFDETF+ VARLEAIRLLLGISCIQKFKLYQMDVKS
Subjt:  WLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDVKS

Query:  AFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVNNFI
        AFL+GYLNEEVYVAQPK FVDSEH KH+YKLNKALYGLKQA RAWYD+LTVYLRGKGYSRGEIDKTLFI RKSDQLLVAQIYVDDIIF GFP  LVNNFI
Subjt:  AFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVNNFI

Query:  -----EFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLE
             EFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFG +
Subjt:  -----EFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLE

A0A5D3BPB3 Gag-pol polyprotein | 3.0e-197 | 71.66
Query:  ESVNIACHIHNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAI
        E+VN ACHIHNRV IR+ TT+TLYELWK+RK NVKYFHVFGSTCY+LADREYH+KWDA+SEQGIFLGYSQN+RAY+V+NN+SG VMETINVVINDL+  I
Subjt:  ESVNIACHIHNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAI

Query:  KQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEP
        KQ+NDEED+T NM E R                                                                                   
Subjt:  KQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEP

Query:  STVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQ
         T++SAL+DEYWLN MQEELLQFR+NNVWTL+SKPEGVNVIGTKW+FKNKTDE GCVTKNKA+LVAQGYTQVEG+DFDETFA VARLEAIRLLLGISCIQ
Subjt:  STVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQ

Query:  KFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFR
        KFKLYQ+DVKS FL+GYLNEEVYVAQPKGFVDSEHPKH+YKLNKALYGLKQA RAWYD+LTVYLRG+GYSRGEIDK LFI RKSDQLLVAQIYVDDIIF 
Subjt:  KFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFR

Query:  GFPHDLVNNFI-----EFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYITA
        GFP DL+NNFI     EFEMSMVGELSCFLGLQIKQKND IFISQEKYARNMVKKFGL+QARNKRTP ATHVKLTKDTE +EVDHKLYRSIVGSLLY+TA
Subjt:  GFPHDLVNNFI-----EFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYITA

Query:  S
        S
Subjt:  S

A0A5D3CXU0 Gag-pol polyprotein | 1.2e-246 | 99.77
Query:  REYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKK
        REYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKK
Subjt:  REYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKK

Query:  LELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKN
        LELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKN
Subjt:  LELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKN

Query:  KTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGL
        KTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGL
Subjt:  KTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGL

Query:  KQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVNNFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVK
        KQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVNNFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVK
Subjt:  KQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVNNFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVK

Query:  KFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSI
        KFGLEQARNKRTP ATHVKLTKDTESSEVDHKLYRSI
Subjt:  KFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSI

A0A5D3DZD2 Gag-pol polyprotein | 2.5e-199 | 79.05
Query:  HNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDE
        H RVTIR+ TTVTLYELWK+RK NVKYFHVFGSTCYILADREY +KWDA+SEQGIFLGYSQN+ AYRV+NNRS  VMETINVVINDL   IKQ+NDEEDE
Subjt:  HNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETINVVINDLEPAIKQINDEEDE

Query:  TSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVD-SALR
        T NMSE RTTS+VE  KA   SD   KSL+KSS+E I KK +LISSAHVKKNHPASSIIGDPS GMQTRRK+KIDY+KMVADLCYIS++E STVD SA +
Subjt:  TSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVD-SALR

Query:  DEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMD
        DEYWLNAMQEELL+ R NNVW LV KP GVNVIGTKW+FKNKTDE GCVTKNKA+LVAQGYTQVEG+DFDETFA VARLEAIRLLLGISCIQKFKLYQMD
Subjt:  DEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMD

Query:  VKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVN
        VKS FL+GYLNEEVYVAQPKGFVD EHPKH+YKLNKALYGLKQA  AWYD+LTVYLRG+GYSRGEIDKTLFI  KSDQLLVAQIYVDDIIF GFP  L  
Subjt:  VKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVN

Query:  NFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTK
                              QKNDDIFISQEKYA+NMVKKFGLE ARNKRTP   HVKL K
Subjt:  NFIEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTK

SwissProt top hits | e-value | % identity (alignment follows each hit)
P04146 Copia protein | 3.8e-56 | 28.43
Query:  LQEMARVMIHAKNLPLCFWAESVNIACHIHNRVTIRS--ETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNS-RAYRV
        + E AR M+    L   FW E+V  A ++ NR+  R+  +++ T YE+W ++K  +K+  VFG+T Y+   +    K+D KS + IF+GY  N  + +  
Subjt:  LQEMARVMIHAKNLPLCFWAESVNIACHIHNRVTIRS--ETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNS-RAYRV

Query:  FNNR----SGCVMETINVVIN---DLEPAIKQINDEEDETSNMSEARTTSSVEFPKAGKPSD------------------DSRKSLE-------------
         N +       V++  N+V +     E    + + E +  +  +++R     EFP   K  D                  DSRK ++             
Subjt:  FNNR----SGCVMETINVVIN---DLEPAIKQINDEEDETSNMSEARTTSSVEFPKAGKPSD------------------DSRKSLE-------------

Query:  ---KSSKES-------------------------ITKKLELISSAHVKK---NHPASS----IIGDPSTGMQTR-----RKEKIDYMKMVADLCYISTVE
           K SKES                           +  E  ++ H+K+   ++P  +    II   S  ++T+      +E     K+V +   I    
Subjt:  ---KSSKES-------------------------ITKKLELISSAHVKK---NHPASS----IIGDPSTGMQTR-----RKEKIDYMKMVADLCYISTVE

Query:  PSTVDS-ALRDE--YWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGI
        P++ D    RD+   W  A+  EL   + NN WT+  +PE  N++ ++WVF  K +E G   + KA+LVA+G+TQ   ID++ETFA VAR+ + R +L +
Subjt:  PSTVDS-ALRDE--YWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGI

Query:  SCIQKFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKS--DQLLVAQIYV
              K++QMDVK+AFL+G L EE+Y+  P+G   S +  ++ KLNKA+YGLKQA+R W++     L+   +    +D+ ++I  K   ++ +   +YV
Subjt:  SCIQKFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKS--DQLLVAQIYV

Query:  DDIIFRGFPHDLVNNF-----IEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVD-HKLYRSIVG
        DD++        +NNF      +F M+ + E+  F+G++I+ + D I++SQ  Y + ++ KF +E      TP+ +  K+  +  +S+ D +   RS++G
Subjt:  DDIIFRGFPHDLVNNF-----IEFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVD-HKLYRSIVG

Query:  SLLYI
         L+YI
Subjt:  SLLYI

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 2.1e-54 | 28.36
Query:  EMARVMIHAKNLPLCFWAESVNIACHIHNRVTIRSETTVTLYEL----WKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVF
        E  R M+    LP  FW E+V  AC++ N    RS +    +E+    W +++ +  +  VFG   +    +E   K D KS   IF+GY      YR++
Subjt:  EMARVMIHAKNLPLCFWAESVNIACHIHNRVTIRSETTVTLYEL----WKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVF

Query:  NNRSGCVMETINVVINDLEPAIKQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKE-----SITKKLELISSAHVKKNHPASS------I
        +     V+ + +VV  + E     +    D +  +      + V  P        +  + ++ S++      + ++ E +     +  HP         +
Subjt:  NNRSGCVMETINVVINDLEPAIKQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKE-----SITKKLELISSAHVKKNHPASS------I

Query:  IGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSAL---RDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKL
               +++RR    +Y+ +  D       EP ++   L        + AMQEE+   ++N  + LV  P+G   +  KWVFK K D    + + KA+L
Subjt:  IGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSAL---RDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKL

Query:  VAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMY-KLNKALYGLKQASRAWYDQLTVY
        V +G+ Q +GIDFDE F+ V ++ +IR +L ++     ++ Q+DVK+AFL G L EE+Y+ QP+GF +    KHM  KLNK+LYGLKQA R WY +   +
Subjt:  VAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMY-KLNKALYGLKQASRAWYDQLTVY

Query:  LRGKGYSRGEIDKTLFIQRKSD-QLLVAQIYVDDIIFRGFPHDLVNNF-----IEFEMSMVGELSCFLGLQI--KQKNDDIFISQEKYARNMVKKFGLEQ
        ++ + Y +   D  ++ +R S+   ++  +YVDD++  G    L+          F+M  +G     LG++I  ++ +  +++SQEKY   ++++F ++ 
Subjt:  LRGKGYSRGEIDKTLFIQRKSD-QLLVAQIYVDDIIFRGFPHDLVNNF-----IEFEMSMVGELSCFLGLQI--KQKNDDIFISQEKYARNMVKKFGLEQ

Query:  ARNKRTPVATHVKLTKDTESSEVDHK------LYRSIVGSLLY
        A+   TP+A H+KL+K    + V+ K       Y S VGSL+Y
Subjt:  ARNKRTPVATHVKLTKDTESSEVDHK------LYRSIVGSLLY

P25600 Putative transposon Ty5-1 protein YCL074W | 3.8e-19 | 31.09
Query:  MDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDL
        MDV +AFL+  ++E +YV QP GFV+  +P ++++L   +YGLKQA   W + +   L+  G+ R E +  L+ +  SD  +   +YVDD++    P   
Subjt:  MDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDL

Query:  VNNFIE------FEMSMVGELSCFLGLQIKQ-KNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLY
        + + ++      + M  +G++  FLGL I Q  N DI +S + Y      +  +   +  +TP+     L + T     D   Y+SIVG LL+
Subjt:  VNNFIE------FEMSMVGELSCFLGLQIKQ-KNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | 9.8e-52 | 33.08
Query:  NDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISS----AHVKKNHPASSIIGDPSTGMQTRRKEKI--DYMKMVADLCYIST
        N  ++  +N S ++   S+  P     S  S  +   SS  S T    LI      A +  N+  + +    +  M TR K  I     K    +   + 
Subjt:  NDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISS----AHVKKNHPASSIIGDPSTGMQTRRKEKI--DYMKMVADLCYIST

Query:  VEPSTVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEG-VNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGI
         EP T   AL+DE W NAM  E+     N+ W LV  P   V ++G +W+F  K +  G + + KA+LVA+GY Q  G+D+ ETF+ V +  +IR++LG+
Subjt:  VEPSTVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEG-VNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGI

Query:  SCIQKFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDD
        +  + + + Q+DV +AFL G L ++VY++QP GF+D + P ++ KL KALYGLKQA RAWY +L  YL   G+     D +LF+ ++   ++   +YVDD
Subjt:  SCIQKFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDD

Query:  IIFRGFPHDLVNNFIE-----FEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLL
        I+  G    L++N ++     F +    EL  FLG++ K+    + +SQ +Y  +++ +  +  A+   TP+A   KL+  + +   D   YR IVGSL 
Subjt:  IIFRGFPHDLVNNFIE-----FEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLL

Query:  YI
        Y+
Subjt:  YI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | 2.4e-50 | 32.16
Query:  NDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKI--DYMKMVADLCYISTVEPS
        N     + N +     S +  P    PS    +    SS  + T  L  +  A       A + +   S  M TR K+ I     K        +  EP 
Subjt:  NDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKI--DYMKMVADLCYISTVEPS

Query:  TVDSALRDEYWLNAMQEELLQFRQNNVWTLV-SKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQ
        T   A++D+ W  AM  E+     N+ W LV   P  V ++G +W+F  K +  G + + KA+LVA+GY Q  G+D+ ETF+ V +  +IR++LG++  +
Subjt:  TVDSALRDEYWLNAMQEELLQFRQNNVWTLV-SKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQ

Query:  KFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFR
         + + Q+DV +AFL G L +EVY++QP GFVD + P ++ +L KA+YGLKQA RAWY +L  YL   G+     D +LF+ ++   ++   +YVDDI+  
Subjt:  KFKLYQMDVKSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFR

Query:  GFPHDLVNNFIE-----FEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYI
        G    L+ + ++     F +    +L  FLG++ K+    + +SQ +Y  +++ +  +  A+   TP+AT  KLT  + +   D   YR IVGSL Y+
Subjt:  GFPHDLVNNFIE-----FEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYI

Arabidopsis top hits | e-value | % identity (alignment follows each hit)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 3.6e-49 | 33.33
Query:  SNMSEARTTSSVE-FPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSALRD
        S+   + ++SS++  P A   +D    S+  S +   T+K   +   +   +  AS  I D S   Q    EK+  +     +C     EPST + A   
Subjt:  SNMSEARTTSSVE-FPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVEPSTVDSALRD

Query:  EYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDV
          W  AM +E+      + W + + P     IG KWV+K K +  G + + KA+LVA+GYTQ EGIDF ETF+ V +L +++L+L IS I  F L+Q+D+
Subjt:  EYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDV

Query:  KSAFLDGYLNEEVYVAQPKGFV----DSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHD
         +AFL+G L+EE+Y+  P G+     DS  P  +  L K++YGLKQASR W+ + +V L G G+ +   D T F++  +   L   +YVDDII       
Subjt:  KSAFLDGYLNEEVYVAQPKGFV----DSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHD

Query:  LVNNFIE-----FEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYI
         V+         F++  +G L  FLGL+I +    I I Q KYA +++ + GL   +    P+   V  +  +    VD K YR ++G L+Y+
Subjt:  LVNNFIE-----FEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYI

ATMG00810.1 DNA/RNA polymerases superfamily protein | 8.0e-09 | 34.86
Query:  IYVDDIIFRGFPHDLVNNFI-----EFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEV-DHKLYRS
        +YVDDI+  G  + L+N  I      F M  +G +  FLG+QIK     +F+SQ KYA  ++   G+   +   TP+   +KL     +++  D   +RS
Subjt:  IYVDDIIFRGFPHDLVNNFI-----EFEMSMVGELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEV-DHKLYRS

Query:  IVGSLLYIT
        IVG+L Y+T
Subjt:  IVGSLLYIT

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) | 2.9e-19 | 41.6
Query:  MQTRRKEKIDYMKMVADLCYISTV--EPSTVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQV
        M TR K  I+ +     L   +T+  EP +V  AL+D  W  AMQEEL    +N  W LV  P   N++G KWVFK K    G + + KA+LVA+G+ Q 
Subjt:  MQTRRKEKIDYMKMVADLCYISTV--EPSTVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQV

Query:  EGIDFDETFASVARLEAIRLLLGIS
        EGI F ET++ V R   IR +L ++
Subjt:  EGIDFDETFASVARLEAIRLLLGIS


Sequences
CDS sequence
ATGTTACAAGAAATGGCACGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGCAGAATCTGTAAATATTGCCTGTCACATTCATAACCGGGTAACTATAAG
ATCGGAAACAACTGTCACACTTTATGAACTTTGGAAAGATAGAAAGTCAAATGTTAAATACTTTCATGTGTTTGGAAGTACGTGTTATATCTTAGCTGACAGGGAATACC
ACAAGAAATGGGATGCTAAGTCAGAACAAGGAATTTTTCTCGGGTACTCTCAGAACAGCCGTGCCTATAGAGTCTTCAATAACAGATCTGGGTGTGTTATGGAAACAATC
AATGTAGTTATAAATGACCTCGAACCAGCTATCAAACAGATAAATGATGAGGAAGATGAGACTTCAAATATGTCTGAAGCTAGAACTACTAGCAGTGTAGAATTTCCTAA
AGCTGGTAAACCATCTGATGATTCCAGGAAAAGTTTGGAAAAATCATCAAAGGAAAGTATTACTAAGAAATTAGAACTAATTTCGTCTGCTCATGTGAAGAAAAATCATC
CAGCAAGCTCTATTATAGGTGATCCGTCAACTGGGATGCAGACCAGAAGGAAAGAAAAGATTGATTACATGAAGATGGTTGCTGATTTATGCTATATTTCCACCGTTGAA
CCTTCTACTGTTGACTCTGCGCTCAGGGATGAGTACTGGTTAAATGCTATGCAAGAGGAGTTACTTCAATTCAGACAAAACAATGTCTGGACATTAGTGTCAAAACCAGA
AGGTGTAAACGTTATCGGTACTAAATGGGTGTTCAAAAATAAAACTGATGAAGCCGGATGTGTGACGAAAAATAAAGCCAAATTAGTGGCTCAAGGGTATACTCAAGTTG
AAGGTATTGACTTTGATGAAACGTTTGCTTCTGTTGCTCGACTTGAAGCCATTCGACTATTACTTGGTATATCATGCATACAGAAATTTAAATTGTATCAGATGGATGTC
AAGAGTGCCTTCTTAGATGGGTACTTGAATGAGGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCCGAGCACCCGAAGCATATGTATAAGCTCAACAAAGCCTT
ATATGGACTAAAGCAAGCGTCGAGAGCTTGGTATGACCAGCTAACTGTATACTTGAGAGGTAAAGGATATTCCAGAGGAGAAATTGACAAGACTTTGTTCATACAAAGGA
AATCTGACCAACTATTGGTGGCTCAAATTTATGTTGATGACATCATTTTTCGAGGATTTCCTCATGATCTAGTAAATAATTTCATTGAATTCGAAATGAGCATGGTTGGA
GAGCTTTCATGCTTTCTGGGACTTCAAATTAAGCAAAAGAATGATGACATTTTCATATCTCAAGAAAAGTATGCCAGGAATATGGTCAAAAAGTTTGGTTTAGAACAGGC
TCGAAATAAGCGGACTCCAGTTGCGACACATGTTAAACTTACAAAAGACACTGAAAGTTCTGAAGTTGATCACAAACTTTACAGGAGTATAGTAGGCAGTCTATTATACA
TAACAGCAAGTTGA
mRNA sequence
ATGTTACAAGAAATGGCACGTGTTATGATACATGCCAAAAATTTACCTCTATGTTTTTGGGCAGAATCTGTAAATATTGCCTGTCACATTCATAACCGGGTAACTATAAG
ATCGGAAACAACTGTCACACTTTATGAACTTTGGAAAGATAGAAAGTCAAATGTTAAATACTTTCATGTGTTTGGAAGTACGTGTTATATCTTAGCTGACAGGGAATACC
ACAAGAAATGGGATGCTAAGTCAGAACAAGGAATTTTTCTCGGGTACTCTCAGAACAGCCGTGCCTATAGAGTCTTCAATAACAGATCTGGGTGTGTTATGGAAACAATC
AATGTAGTTATAAATGACCTCGAACCAGCTATCAAACAGATAAATGATGAGGAAGATGAGACTTCAAATATGTCTGAAGCTAGAACTACTAGCAGTGTAGAATTTCCTAA
AGCTGGTAAACCATCTGATGATTCCAGGAAAAGTTTGGAAAAATCATCAAAGGAAAGTATTACTAAGAAATTAGAACTAATTTCGTCTGCTCATGTGAAGAAAAATCATC
CAGCAAGCTCTATTATAGGTGATCCGTCAACTGGGATGCAGACCAGAAGGAAAGAAAAGATTGATTACATGAAGATGGTTGCTGATTTATGCTATATTTCCACCGTTGAA
CCTTCTACTGTTGACTCTGCGCTCAGGGATGAGTACTGGTTAAATGCTATGCAAGAGGAGTTACTTCAATTCAGACAAAACAATGTCTGGACATTAGTGTCAAAACCAGA
AGGTGTAAACGTTATCGGTACTAAATGGGTGTTCAAAAATAAAACTGATGAAGCCGGATGTGTGACGAAAAATAAAGCCAAATTAGTGGCTCAAGGGTATACTCAAGTTG
AAGGTATTGACTTTGATGAAACGTTTGCTTCTGTTGCTCGACTTGAAGCCATTCGACTATTACTTGGTATATCATGCATACAGAAATTTAAATTGTATCAGATGGATGTC
AAGAGTGCCTTCTTAGATGGGTACTTGAATGAGGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTTGATTCCGAGCACCCGAAGCATATGTATAAGCTCAACAAAGCCTT
ATATGGACTAAAGCAAGCGTCGAGAGCTTGGTATGACCAGCTAACTGTATACTTGAGAGGTAAAGGATATTCCAGAGGAGAAATTGACAAGACTTTGTTCATACAAAGGA
AATCTGACCAACTATTGGTGGCTCAAATTTATGTTGATGACATCATTTTTCGAGGATTTCCTCATGATCTAGTAAATAATTTCATTGAATTCGAAATGAGCATGGTTGGA
GAGCTTTCATGCTTTCTGGGACTTCAAATTAAGCAAAAGAATGATGACATTTTCATATCTCAAGAAAAGTATGCCAGGAATATGGTCAAAAAGTTTGGTTTAGAACAGGC
TCGAAATAAGCGGACTCCAGTTGCGACACATGTTAAACTTACAAAAGACACTGAAAGTTCTGAAGTTGATCACAAACTTTACAGGAGTATAGTAGGCAGTCTATTATACA
TAACAGCAAGTTGA
Protein sequence
MLQEMARVMIHAKNLPLCFWAESVNIACHIHNRVTIRSETTVTLYELWKDRKSNVKYFHVFGSTCYILADREYHKKWDAKSEQGIFLGYSQNSRAYRVFNNRSGCVMETI
NVVINDLEPAIKQINDEEDETSNMSEARTTSSVEFPKAGKPSDDSRKSLEKSSKESITKKLELISSAHVKKNHPASSIIGDPSTGMQTRRKEKIDYMKMVADLCYISTVE
PSTVDSALRDEYWLNAMQEELLQFRQNNVWTLVSKPEGVNVIGTKWVFKNKTDEAGCVTKNKAKLVAQGYTQVEGIDFDETFASVARLEAIRLLLGISCIQKFKLYQMDV
KSAFLDGYLNEEVYVAQPKGFVDSEHPKHMYKLNKALYGLKQASRAWYDQLTVYLRGKGYSRGEIDKTLFIQRKSDQLLVAQIYVDDIIFRGFPHDLVNNFIEFEMSMVG
ELSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPVATHVKLTKDTESSEVDHKLYRSIVGSLLYITAS
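
The relationship between the CDS and the protein sequence listed above can be spot-checked by translating the first and last codons. The following is a minimal sketch, not the database's own pipeline: the `translate` helper is hypothetical, and it assumes the standard nuclear genetic code (NCBI translation table 1).

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Hypothetical helper: translate an in-frame DNA string; "*" marks a stop codon.
def translate(dna):
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))

# First ten and last five codons of the CDS sequence listed above.
print(translate("ATGTTACAAGAAATGGCACGTGTTATGATA"))
print(translate("ATAACAGCAAGTTGA"))
```

The two fragments translate to the start (MLQEMARVMI) and the end (ITAS, followed by the stop codon) of the protein sequence above.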