; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0103101 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0103101
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr04:20109115..20110557
RNA-Seq ExpressionCmc04g0103101
SyntenyCmc04g0103101
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026117.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.0e-20679.16Show/hide
Query:  VVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDYMKMVA
        +VINDLDS IKQMNDEEDETPNM E RTTS VE SKADN SD  GKSL+KSSEEII KKSELIPSAH KKNHPASSIIG+PSAGMQTRRK+ IDY+KM  
Subjt:  VVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDYMKMVA

Query:  DLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAI
                                    ELLQFRRNNVWTLVSK EGVN+I TKW+FKNK DE GCVTKNKARLVAQGYTQV+GVDFDETFAPVARL+AI
Subjt:  DLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAI

Query:  RLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVA
        RLLLGISCI KFKLYQ+DV SAFLN YLNEEVYVAQ KGFVDSEH KHVYKLNK LYGLKQAPRAWY+RLTVYLRG+GY R EIDKT FIH+KS+QLLVA
Subjt:  RLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVA

Query:  QIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRS
        QIYVD+IIFGGFP DLV NFINIMQSEFEMS  GE SCF  LQIKQKND IFISQEKY KNMVKKF LEQARNK T A THVKLT+DT+G EVDHKLYRS
Subjt:  QIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRS

Query:  IVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        IVG+LLYLT SRPDIAYVV IC  YQADPR++HL+A+K+ILKYVHGTSDF MMYSYDTTPTLVGY DADWA SA+
Subjt:  IVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

KAA0033543.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.9e-21180.42Show/hide
Query:  METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDY
        METINVVINDLD A+KQ+NDE++ETPNM E RTTS +EVSKADNP DD GKSL+K SEE I+KKS+LI SAHVKKNHPASSIIGDPSAGMQTRRKE IDY
Subjt:  METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDY

Query:  MKMVADLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVA
         KMVADLCYTSTIEPSTVD ALK+EYWLNAMQE+LLQFRRNNVWTLVSK EGVN+I TKWVFKNKTDEA CVTKNKARLVAQGY QVKG+DF+ETFAPVA
Subjt:  MKMVADLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVA

Query:  RLEAIRLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSN
        RLEAIRLLLGISCI KFKLYQ+DV SAFLNGYLNEEVYVAQ   FVD EH KHVYKLNK LYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIH+KS+
Subjt:  RLEAIRLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSN

Query:  QLLVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDH
        QLL+                           EFEMSM GE S F  LQ KQKND IFISQEKY KNMV+KFGLEQARNK TPA THVKLTRDT G EVDH
Subjt:  QLLVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDH

Query:  KLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        KLYRSI+GSLLYLTASRPDIAY V IC RYQADPR+SHL+ +K+ILKYVHGTSDF MMYSYDTT TLVGY DADWASS+D
Subjt:  KLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

KAA0066164.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.6e-21079.38Show/hide
Query:  METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDY
        METINVVINDL+ AIKQ+NDEEDET NMSEARTTS+VE  KA  PSDD  KSLEKSS+E ITKK ELI SAHVKKNHPASSIIGDPS GMQTRRKE IDY
Subjt:  METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDY

Query:  MKMVADLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVA
        MKMVADLCY ST+EPSTVD AL+DEYWLNAMQEELLQFR+NNVWTLVSK EGVN+I TKWVFKNKTDEAGCVTKNKA+LVAQGYTQV+G+DFDETFA VA
Subjt:  MKMVADLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVA

Query:  RLEAIRLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSN
        RLEAIRLLLGISCI KFKLYQ+DV SAFL+GYLNEEVYVAQ KGFVDSEH KH+YKLNK LYGLKQA RAWY++LTVYLRGKGYSRGEIDKTLFI +KS+
Subjt:  RLEAIRLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSN

Query:  QLLVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDH
        QLLVAQIYVD+IIF GFP DLV NFI     EFEMSM GE SCF  LQIKQKND IFISQEKY +NMVKKFGLEQARNK TPA THVKLT+DT+ +EVDH
Subjt:  QLLVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDH

Query:  KLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        KLYRSI                         ADPR++HL+A+K+ILKYVHGTSDFGMMYSYDTTPTLVGY DA+WA S D
Subjt:  KLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

TYJ98791.1 gag-pol polyprotein [Cucumis melo var. makuwa]3.1e-19875.62Show/hide
Query:  METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDY
        METINVVIND+DSAIKQ+NDEEDE PNMSEARTTS+V+V+KADNPSDD GK LEKS EE ITKKSELI  AHVKKNHPASSIIGDPSAGMQTRRKE IDY
Subjt:  METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDY

Query:  MKMVADLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVA
        MKMVADLCY ST EPSTVDF+L+DEY LNAMQEELLQF+RNNVWTLV K EGVN+I TKWVFKNKTDEAGCVTKNKARLVAQGYTQV+G+DFDETF+PVA
Subjt:  MKMVADLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVA

Query:  RLEAIRLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSN
        RLEAIRLLLGISCI KFKLYQ+DV SAFLNGYLNEEVYVAQ K FVDSEH KHVYKLNK LYGLKQAPRAWY+RLTVYLRGKGYSRGEIDKTLFIH+KS+
Subjt:  RLEAIRLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSN

Query:  QLLVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDH
        QLLVAQIYVD+IIFGGFPQ LV NFI +MQSEFEMSM GE SCF  LQIKQKND IFISQEKY +NMVKKFG                            
Subjt:  QLLVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDH

Query:  KLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
                                     YQADPR+++L+A+K+ILKY+HGT+DFGMMY YDTTPTLVGY DADWA S D
Subjt:  KLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

TYK11575.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.2e-20578.95Show/hide
Query:  VVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDYMKMVA
        +VINDLDS IKQMNDEEDETPNM E RTTS VE SKADN SD  GKSL+KSSEEII KKSELIPSAH KKNHPASSIIG+PSAGMQTRRK+ IDY+KM  
Subjt:  VVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDYMKMVA

Query:  DLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAI
                                    ELLQFRRNNVWTLVSK EGVN+I TKW+FKNK DE GCVTKNKARLVAQGYTQV+GVDFDETFAPVARL+AI
Subjt:  DLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAI

Query:  RLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVA
        RLLLGISCI KFKLYQ+DV SAFLN YLNEEVYVAQ KGFVDSEH KHVYKLNK LYGLKQAPRAWY+RLT YLRG+GY R EIDKT FIH+KS+QLLVA
Subjt:  RLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVA

Query:  QIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRS
        QIYVD+IIFGGFP DLV NFINIMQSEFEMS  GE SCF  LQIKQKND IFISQEKY KNMVKKF LEQARNK T A THVKLT+DT+G EVDHKLYRS
Subjt:  QIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRS

Query:  IVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        IVG+LLYLT SRPDIAYVV IC  YQADPR++HL+A+K+ILKYVHGTSDF MMYSYDTTPTLVGY DADWA SA+
Subjt:  IVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

TrEMBL top hitse value%identityAlignment
A0A5A7SN07 Gag-pol polyprotein1.9e-20679.16Show/hide
Query:  VVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDYMKMVA
        +VINDLDS IKQMNDEEDETPNM E RTTS VE SKADN SD  GKSL+KSSEEII KKSELIPSAH KKNHPASSIIG+PSAGMQTRRK+ IDY+KM  
Subjt:  VVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDYMKMVA

Query:  DLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAI
                                    ELLQFRRNNVWTLVSK EGVN+I TKW+FKNK DE GCVTKNKARLVAQGYTQV+GVDFDETFAPVARL+AI
Subjt:  DLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAI

Query:  RLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVA
        RLLLGISCI KFKLYQ+DV SAFLN YLNEEVYVAQ KGFVDSEH KHVYKLNK LYGLKQAPRAWY+RLTVYLRG+GY R EIDKT FIH+KS+QLLVA
Subjt:  RLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVA

Query:  QIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRS
        QIYVD+IIFGGFP DLV NFINIMQSEFEMS  GE SCF  LQIKQKND IFISQEKY KNMVKKF LEQARNK T A THVKLT+DT+G EVDHKLYRS
Subjt:  QIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRS

Query:  IVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        IVG+LLYLT SRPDIAYVV IC  YQADPR++HL+A+K+ILKYVHGTSDF MMYSYDTTPTLVGY DADWA SA+
Subjt:  IVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

A0A5D3B9N4 Gag-pol polyprotein9.0e-21280.42Show/hide
Query:  METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDY
        METINVVINDLD A+KQ+NDE++ETPNM E RTTS +EVSKADNP DD GKSL+K SEE I+KKS+LI SAHVKKNHPASSIIGDPSAGMQTRRKE IDY
Subjt:  METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDY

Query:  MKMVADLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVA
         KMVADLCYTSTIEPSTVD ALK+EYWLNAMQE+LLQFRRNNVWTLVSK EGVN+I TKWVFKNKTDEA CVTKNKARLVAQGY QVKG+DF+ETFAPVA
Subjt:  MKMVADLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVA

Query:  RLEAIRLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSN
        RLEAIRLLLGISCI KFKLYQ+DV SAFLNGYLNEEVYVAQ   FVD EH KHVYKLNK LYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIH+KS+
Subjt:  RLEAIRLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSN

Query:  QLLVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDH
        QLL+                           EFEMSM GE S F  LQ KQKND IFISQEKY KNMV+KFGLEQARNK TPA THVKLTRDT G EVDH
Subjt:  QLLVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDH

Query:  KLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        KLYRSI+GSLLYLTASRPDIAY V IC RYQADPR+SHL+ +K+ILKYVHGTSDF MMYSYDTT TLVGY DADWASS+D
Subjt:  KLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

A0A5D3BJA9 Gag-pol polyprotein1.5e-19875.62Show/hide
Query:  METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDY
        METINVVIND+DSAIKQ+NDEEDE PNMSEARTTS+V+V+KADNPSDD GK LEKS EE ITKKSELI  AHVKKNHPASSIIGDPSAGMQTRRKE IDY
Subjt:  METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDY

Query:  MKMVADLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVA
        MKMVADLCY ST EPSTVDF+L+DEY LNAMQEELLQF+RNNVWTLV K EGVN+I TKWVFKNKTDEAGCVTKNKARLVAQGYTQV+G+DFDETF+PVA
Subjt:  MKMVADLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVA

Query:  RLEAIRLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSN
        RLEAIRLLLGISCI KFKLYQ+DV SAFLNGYLNEEVYVAQ K FVDSEH KHVYKLNK LYGLKQAPRAWY+RLTVYLRGKGYSRGEIDKTLFIH+KS+
Subjt:  RLEAIRLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSN

Query:  QLLVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDH
        QLLVAQIYVD+IIFGGFPQ LV NFI +MQSEFEMSM GE SCF  LQIKQKND IFISQEKY +NMVKKFG                            
Subjt:  QLLVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDH

Query:  KLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
                                     YQADPR+++L+A+K+ILKY+HGT+DFGMMY YDTTPTLVGY DADWA S D
Subjt:  KLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

A0A5D3CJ17 Gag-pol polyprotein5.6e-20678.95Show/hide
Query:  VVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDYMKMVA
        +VINDLDS IKQMNDEEDETPNM E RTTS VE SKADN SD  GKSL+KSSEEII KKSELIPSAH KKNHPASSIIG+PSAGMQTRRK+ IDY+KM  
Subjt:  VVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDYMKMVA

Query:  DLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAI
                                    ELLQFRRNNVWTLVSK EGVN+I TKW+FKNK DE GCVTKNKARLVAQGYTQV+GVDFDETFAPVARL+AI
Subjt:  DLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAI

Query:  RLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVA
        RLLLGISCI KFKLYQ+DV SAFLN YLNEEVYVAQ KGFVDSEH KHVYKLNK LYGLKQAPRAWY+RLT YLRG+GY R EIDKT FIH+KS+QLLVA
Subjt:  RLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVA

Query:  QIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRS
        QIYVD+IIFGGFP DLV NFINIMQSEFEMS  GE SCF  LQIKQKND IFISQEKY KNMVKKF LEQARNK T A THVKLT+DT+G EVDHKLYRS
Subjt:  QIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRS

Query:  IVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        IVG+LLYLT SRPDIAYVV IC  YQADPR++HL+A+K+ILKYVHGTSDF MMYSYDTTPTLVGY DADWA SA+
Subjt:  IVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

A0A5D3CXU0 Gag-pol polyprotein2.2e-21079.38Show/hide
Query:  METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDY
        METINVVINDL+ AIKQ+NDEEDET NMSEARTTS+VE  KA  PSDD  KSLEKSS+E ITKK ELI SAHVKKNHPASSIIGDPS GMQTRRKE IDY
Subjt:  METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDY

Query:  MKMVADLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVA
        MKMVADLCY ST+EPSTVD AL+DEYWLNAMQEELLQFR+NNVWTLVSK EGVN+I TKWVFKNKTDEAGCVTKNKA+LVAQGYTQV+G+DFDETFA VA
Subjt:  MKMVADLCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVA

Query:  RLEAIRLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSN
        RLEAIRLLLGISCI KFKLYQ+DV SAFL+GYLNEEVYVAQ KGFVDSEH KH+YKLNK LYGLKQA RAWY++LTVYLRGKGYSRGEIDKTLFI +KS+
Subjt:  RLEAIRLLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSN

Query:  QLLVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDH
        QLLVAQIYVD+IIF GFP DLV NFI     EFEMSM GE SCF  LQIKQKND IFISQEKY +NMVKKFGLEQARNK TPA THVKLT+DT+ +EVDH
Subjt:  QLLVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDH

Query:  KLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        KLYRSI                         ADPR++HL+A+K+ILKYVHGTSDFGMMYSYDTTPTLVGY DA+WA S D
Subjt:  KLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.0e-5533.33Show/hide
Query:  WLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAIRLLLGISCIHKFKLYQIDVNS
        W  A+  EL   + NN WT+  + E  NI+D++WVF  K +E G   + KARLVA+G+TQ   +D++ETFAPVAR+ + R +L +   +  K++Q+DV +
Subjt:  WLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAIRLLLGISCIHKFKLYQIDVNS

Query:  AFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKS--NQLLVAQIYVDNIIFGGFPQDLVCN
        AFLNG L EE+Y+   +G   S +  +V KLNK +YGLKQA R W+E     L+   +    +D+ ++I  K   N+ +   +YVD+++        + N
Subjt:  AFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKS--NQLLVAQIYVDNIIFGGFPQDLVCN

Query:  FINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRSIVGSLLY-LTASRPDIAYV
        F   +  +F M+   E   F  ++I+ + D I++SQ  Y K ++ KF +E      TP  + +      +  E  +   RS++G L+Y +  +RPD+   
Subjt:  FINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRSIVGSLLY-LTASRPDIAYV

Query:  VEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTT--PTLVGYLDADWASS
        V I  RY +       + +K++L+Y+ GT D  +++  +      ++GY+D+DWA S
Subjt:  VEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTT--PTLVGYLDADWASS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.7e-5630.34Show/hide
Query:  PNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDYMKMVADLCYTSTIEPSTVDFAL---
        P  +E+ T    E  +      ++G+ L++  EE+        P+   +++ P   +       +++RR  + +Y+ +  D       EP ++   L   
Subjt:  PNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDYMKMVADLCYTSTIEPSTVDFAL---

Query:  KDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAIRLLLGISCIHKFKLYQI
        +    + AMQEE+   ++N  + LV   +G   +  KWVFK K D    + + KARLV +G+ Q KG+DFDE F+PV ++ +IR +L ++     ++ Q+
Subjt:  KDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAIRLLLGISCIHKFKLYQI

Query:  DVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKS-NQLLVAQIYVDNIIFGGFPQDL
        DV +AFL+G L EE+Y+ Q +GF  +     V KLNK LYGLKQAPR WY +   +++ + Y +   D  ++  + S N  ++  +YVD+++  G  + L
Subjt:  DVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKS-NQLLVAQIYVDNIIFGGFPQDL

Query:  VCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDG--IFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHK------LYRSIVGSLLY-
        +      +   F+M   G       ++I ++     +++SQEKY + ++++F ++ A+   TP   H+KL++    T V+ K       Y S VGSL+Y 
Subjt:  VCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDG--IFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHK------LYRSIVGSLLY-

Query:  LTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        +  +RPDIA+ V +  R+  +P   H +A+K IL+Y+ GT+   + +   + P L GY DAD A   D
Subjt:  LTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

P25600 Putative transposon Ty5-1 protein YCL074W4.2e-2528.74Show/hide
Query:  IDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVAQIYVDNIIFGGFPQDL
        +DV++AFLN  ++E +YV Q  GFV+  +  +V++L   +YGLKQAP  W E +   L+  G+ R E +  L+    S+  +   +YVD+++       +
Subjt:  IDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVAQIYVDNIIFGGFPQDL

Query:  VCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDG-IFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRSIVGSLLY-LTASRPD
               +   + M   G+   F  L I Q ++G I +S + Y      +  +   +   TP      L   T     D   Y+SIVG LL+     RPD
Subjt:  VCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDG-IFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRSIVGSLLY-LTASRPD

Query:  IAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        I+Y V +  R+  +PR  HL++ +++L+Y++ T    + Y   +   L  Y DA   +  D
Subjt:  IAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.4e-6534.54Show/hide
Query:  NDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELI----PSAHVKKNHPASSIIGDPSAGMQTRRKENI--DYMKMVADLCYTST
        N+  +E+P+   A++ ST   S + +PS     S   SS    T  S LI    P A +  N+  + +    +  M TR K  I     K    +   + 
Subjt:  NDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELI----PSAHVKKNHPASSIIGDPSAGMQTRRKENI--DYMKMVADLCYTST

Query:  IEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEG-VNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAIRLLLGI
         EP T   ALKDE W NAM  E+     N+ W LV      V I+  +W+F  K +  G + + KARLVA+GY Q  G+D+ ETF+PV +  +IR++LG+
Subjt:  IEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEG-VNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAIRLLLGI

Query:  SCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVAQIYVDN
        +    + + Q+DVN+AFL G L ++VY++Q  GF+D +   +V KL K LYGLKQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVD+
Subjt:  SCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVAQIYVDN

Query:  IIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRSIVGSLL
        I+  G    L+ N ++ +   F +    E   F  ++ K+   G+ +SQ +Y  +++ +  +  A+   TP     KL+  +     D   YR IVGSL 
Subjt:  IIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRSIVGSLL

Query:  YLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        YL  +RPDI+Y V    ++   P   HL+A+K+IL+Y+ GT + G+      T +L  Y DADWA   D
Subjt:  YLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.6e-6432.19Show/hide
Query:  NDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKK-SELIPSAHVKKNHPASSIIGDPSAGMQTRRKENI--DYMKMVADLCYTSTIEP
        N     +PN +     S +       PS    +    SS    T     ++P+  + + +  + +    +  M TR K+ I     K        +  EP
Subjt:  NDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKK-SELIPSAHVKKNHPASSIIGDPSAGMQTRRKENI--DYMKMVADLCYTSTIEP

Query:  STVDFALKDEYWLNAMQEELLQFRRNNVWTLV-SKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAIRLLLGISCI
         T   A+KD+ W  AM  E+     N+ W LV      V I+  +W+F  K +  G + + KARLVA+GY Q  G+D+ ETF+PV +  +IR++LG++  
Subjt:  STVDFALKDEYWLNAMQEELLQFRRNNVWTLV-SKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAIRLLLGISCI

Query:  HKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVAQIYVDNIIF
          + + Q+DVN+AFL G L +EVY++Q  GFVD +   +V +L K +YGLKQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVD+I+ 
Subjt:  HKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVAQIYVDNIIF

Query:  GGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRSIVGSLLYLT
         G    L+ + ++ +   F +    +   F  ++ K+   G+ +SQ +YT +++ +  +  A+   TP  T  KLT  +     D   YR IVGSL YL 
Subjt:  GGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRSIVGSLLYLT

Query:  ASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
         +RPD++Y V    +Y   P   H  A+K++L+Y+ GT D G+      T +L  Y DADWA   D
Subjt:  ASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.7e-5934.39Show/hide
Query:  LCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAIR
        +C     EPST + A +   W  AM +E+      + W + +       I  KWV+K K +  G + + KARLVA+GYTQ +G+DF ETF+PV +L +++
Subjt:  LCYTSTIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAIR

Query:  LLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFV----DSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQL
        L+L IS I+ F L+Q+D+++AFLNG L+EE+Y+    G+     DS     V  L K +YGLKQA R W+ + +V L G G+ +   D T F+   +   
Subjt:  LLLGISCIHKFKLYQIDVNSAFLNGYLNEEVYVAQSKGFV----DSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQL

Query:  LVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKL
        L   +YVD+II        V    + ++S F++   G    F  L+I +   GI I Q KY  +++ + GL   +    P    V  +  + G  VD K 
Subjt:  LVAQIYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKL

Query:  YRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        YR ++G L+YL  +R DI++ V    ++   PR++H +A+ +IL Y+ GT   G+ YS      L  + DA + S  D
Subjt:  YRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

ATMG00240.1 Gag-Pol-related retrotransposon family protein4.1e-0734.29Show/hide
Query:  LYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD
        +YLT +RPD+ + V    ++ +  R + ++A+ ++L YV GT   G+ YS  +   L  + D+DWAS  D
Subjt:  LYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD

ATMG00810.1 DNA/RNA polymerases superfamily protein4.7e-1931.76Show/hide
Query:  IYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRSI
        +YVD+I+  G    L+   I  + S F M   G    F  +QIK    G+F+SQ KY + ++   G+   +   TP    +  +  T     D   +RSI
Subjt:  IYVDNIIFGGFPQDLVCNFINIMQSEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRSI

Query:  VGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWA
        VG+L YLT +RPDI+Y V I  +   +P ++    +K++L+YV GT   G+    ++   +  + D+DWA
Subjt:  VGSLLYLTASRPDIAYVVEICVRYQADPRMSHLKAIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWA

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)3.8e-2144.8Show/hide
Query:  MQTRRKENIDYMKMVADLCYTSTI--EPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQV
        M TR K  I+ +     L  T+TI  EP +V FALKD  W  AMQEEL    RN  W LV      NI+  KWVFK K    G + + KARLVA+G+ Q 
Subjt:  MQTRRKENIDYMKMVADLCYTSTI--EPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQV

Query:  KGVDFDETFAPVARLEAIRLLLGIS
        +G+ F ET++PV R   IR +L ++
Subjt:  KGVDFDETFAPVARLEAIRLLLGIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAACAATCAATGTAGTTATAAATGATCTCGATTCAGCTATCAAACAGATGAATGATGAGGAAGATGAGACTCCAAACATGTCTGAAGCTAGAACTACGAGTACTGT
AGAAGTTTCTAAAGCCGATAACCCATCTGATGATCGAGGTAAAAGTTTGGAAAAATCATCAGAAGAAATCATCACTAAAAAATCAGAACTAATTCCATCTGCTCATGTGA
AGAAAAATCATCCAGCAAGCTCTATAATAGGTGATCCGTCAGCTGGGATGCAGACCAGAAGGAAAGAAAATATTGATTACATGAAGATGGTTGCTGATCTATGTTATACT
TCTACTATTGAACCTTCTACTGTTGACTTTGCTCTGAAGGATGAGTATTGGCTCAATGCTATGCAAGAGGAGCTACTCCAATTCAGACGAAACAATGTCTGGACGTTAGT
TTCCAAGCTAGAAGGTGTAAACATTATTGACACCAAATGGGTATTTAAAAATAAGACTGATGAAGCTGGATGTGTGACGAAAAATAAAGCCAGATTAGTAGCTCAAGGGT
ATACTCAAGTTAAAGGTGTTGACTTTGATGAAACGTTTGCTCCAGTTGCTCGACTTGAAGCCATTCGATTATTACTTGGTATATCATGCATACATAAATTTAAATTATAT
CAGATTGATGTAAATAGTGCTTTCTTAAATGGATATTTGAATGAAGAGGTTTATGTTGCTCAATCAAAAGGTTTTGTTGATTCCGAGCACCTGAAGCATGTGTATAAGCT
CAACAAAGTTTTATATGGGCTAAAGCAAGCTCCGAGAGCTTGGTATGAACGGCTAACTGTTTACTTAAGAGGTAAAGGATATTCTAGAGGAGAAATTGACAAGACCTTGT
TTATACACAAAAAATCTAATCAACTTTTGGTTGCTCAAATTTATGTTGATAACATCATTTTTGGAGGTTTTCCTCAAGATCTTGTATGTAATTTCATTAACATCATGCAG
TCAGAATTCGAAATGAGCATGGCTGGAGAATTTTCATGCTTTTTTAGTCTTCAAATTAAGCAAAAGAATGACGGCATATTCATATCTCAAGAAAAGTATACCAAAAATAT
GGTTAAAAAGTTTGGTTTGGAACAGGCTCGAAATAAGTGGACTCCAGCTGGGACACATGTTAAACTTACAAGAGATACTGATGGTACAGAAGTTGATCACAAACTCTATA
GGAGTATAGTAGGCAGCTTATTGTATTTAACAGCAAGTCGACCTGACATAGCTTATGTTGTGGAAATATGTGTTCGTTATCAGGCTGATCCCCGCATGTCTCACCTAAAA
GCTATTAAACAAATTCTTAAGTATGTTCATGGGACCAGTGACTTTGGAATGATGTATTCCTATGATACCACCCCTACTCTTGTTGGATATCTTGATGCTGACTGGGCAAG
TTCGGCTGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAACAATCAATGTAGTTATAAATGATCTCGATTCAGCTATCAAACAGATGAATGATGAGGAAGATGAGACTCCAAACATGTCTGAAGCTAGAACTACGAGTACTGT
AGAAGTTTCTAAAGCCGATAACCCATCTGATGATCGAGGTAAAAGTTTGGAAAAATCATCAGAAGAAATCATCACTAAAAAATCAGAACTAATTCCATCTGCTCATGTGA
AGAAAAATCATCCAGCAAGCTCTATAATAGGTGATCCGTCAGCTGGGATGCAGACCAGAAGGAAAGAAAATATTGATTACATGAAGATGGTTGCTGATCTATGTTATACT
TCTACTATTGAACCTTCTACTGTTGACTTTGCTCTGAAGGATGAGTATTGGCTCAATGCTATGCAAGAGGAGCTACTCCAATTCAGACGAAACAATGTCTGGACGTTAGT
TTCCAAGCTAGAAGGTGTAAACATTATTGACACCAAATGGGTATTTAAAAATAAGACTGATGAAGCTGGATGTGTGACGAAAAATAAAGCCAGATTAGTAGCTCAAGGGT
ATACTCAAGTTAAAGGTGTTGACTTTGATGAAACGTTTGCTCCAGTTGCTCGACTTGAAGCCATTCGATTATTACTTGGTATATCATGCATACATAAATTTAAATTATAT
CAGATTGATGTAAATAGTGCTTTCTTAAATGGATATTTGAATGAAGAGGTTTATGTTGCTCAATCAAAAGGTTTTGTTGATTCCGAGCACCTGAAGCATGTGTATAAGCT
CAACAAAGTTTTATATGGGCTAAAGCAAGCTCCGAGAGCTTGGTATGAACGGCTAACTGTTTACTTAAGAGGTAAAGGATATTCTAGAGGAGAAATTGACAAGACCTTGT
TTATACACAAAAAATCTAATCAACTTTTGGTTGCTCAAATTTATGTTGATAACATCATTTTTGGAGGTTTTCCTCAAGATCTTGTATGTAATTTCATTAACATCATGCAG
TCAGAATTCGAAATGAGCATGGCTGGAGAATTTTCATGCTTTTTTAGTCTTCAAATTAAGCAAAAGAATGACGGCATATTCATATCTCAAGAAAAGTATACCAAAAATAT
GGTTAAAAAGTTTGGTTTGGAACAGGCTCGAAATAAGTGGACTCCAGCTGGGACACATGTTAAACTTACAAGAGATACTGATGGTACAGAAGTTGATCACAAACTCTATA
GGAGTATAGTAGGCAGCTTATTGTATTTAACAGCAAGTCGACCTGACATAGCTTATGTTGTGGAAATATGTGTTCGTTATCAGGCTGATCCCCGCATGTCTCACCTAAAA
GCTATTAAACAAATTCTTAAGTATGTTCATGGGACCAGTGACTTTGGAATGATGTATTCCTATGATACCACCCCTACTCTTGTTGGATATCTTGATGCTGACTGGGCAAG
TTCGGCTGACTGA
Protein sequenceShow/hide protein sequence
METINVVINDLDSAIKQMNDEEDETPNMSEARTTSTVEVSKADNPSDDRGKSLEKSSEEIITKKSELIPSAHVKKNHPASSIIGDPSAGMQTRRKENIDYMKMVADLCYT
STIEPSTVDFALKDEYWLNAMQEELLQFRRNNVWTLVSKLEGVNIIDTKWVFKNKTDEAGCVTKNKARLVAQGYTQVKGVDFDETFAPVARLEAIRLLLGISCIHKFKLY
QIDVNSAFLNGYLNEEVYVAQSKGFVDSEHLKHVYKLNKVLYGLKQAPRAWYERLTVYLRGKGYSRGEIDKTLFIHKKSNQLLVAQIYVDNIIFGGFPQDLVCNFINIMQ
SEFEMSMAGEFSCFFSLQIKQKNDGIFISQEKYTKNMVKKFGLEQARNKWTPAGTHVKLTRDTDGTEVDHKLYRSIVGSLLYLTASRPDIAYVVEICVRYQADPRMSHLK
AIKQILKYVHGTSDFGMMYSYDTTPTLVGYLDADWASSAD