CuGenDBv2

Cmc06g0160761 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc06g0160761
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-pol polyprotein
Genome location: CMiso1.1chr06:9115588..9117012
RNA-Seq Expression: Cmc06g0160761
Synteny: Cmc06g0160761
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily
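
The genome location field follows a seqid:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and its length computed as follows. The names Location and parse_location are hypothetical helpers introduced for illustration, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the length.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'CMiso1.1chr06:9115588..9117012'."""
    match = re.fullmatch(
        r"(?P<seqid>[^:\s]+):(?P<start>\d+)\.\.(?P<end>\d+)", text.strip()
    )
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(match["seqid"], start, end)


loc = parse_location("CMiso1.1chr06:9115588..9117012")
print(loc.seqid, loc.length)  # CMiso1.1chr06 1425
```

Under that assumption the span is 1425 bp, which equals 474 codons plus a stop codon and so agrees with the 474-residue protein sequence listed on this page if the gene is a single uninterrupted exon.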


Homology
GenBank top hits (e value, % identity, alignment)
KAA0026117.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value 1.1e-216; identity 84.22%
Query:  VVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVV
        +VINDLDSDIKQMNDEEDE  NM EVRT S VEESKADNS DGPGKSLKKSSEEIINKKS++I SAH KKNHP S IIG+PS GMQTRRKDKIDYLKM  
Subjt:  VVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVV

Query:  DSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAI
                                    ELLQF+RNNVWTLVSKPEGVNVI TKWIFKNK DETGCVTKNKARLVAQGYTQVE VDFDETFA VARL+AI
Subjt:  DSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAI

Query:  RLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA
        RLLLGISCIQKFKLYQMDVKS FLN YLNEEVYVAQPK FVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGY R EIDKT FIHRKSDQLLVA
Subjt:  RLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA

Query:  QIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKS
        QIYVDDIIFGGFP DLVNNFINIMQSEFEMS VG LSCFLGLQIKQKNDDIFISQEKYA+NMVKKF LEQARNKRT AAT+VKL KDTEGAEVDHKLY+S
Subjt:  QIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKS

Query:  IVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        IVG+LLYLTTSRPDIAY VGICACYQADPRITHLE +KRILKYVHGTSDF MMYSYDTT TL+ YCDAD
Subjt:  IVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

KAA0033021.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value 4.9e-196; identity 79.64%
Query:  MSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVVDSCYISTIGPLTVDSALKDEYW
        MSE RT ++VE  +ADN  D  GKSLKKSSEE I KKS++I  AHVKKNHP S IIGDPS GMQTRRK+KIDY+KMV D CYIST  P TVDSA +DEYW
Subjt:  MSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVVDSCYISTIGPLTVDSALKDEYW

Query:  LNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSV
        LNAMQ+ELL+F+RNNVWTLVSKPEGVNVI                                 +DFDETFA VARLEAIRLLLGISCIQKFKLYQMDVKS 
Subjt:  LNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSV

Query:  FLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFIN
        FLNGYLNEEVYVAQPK FVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRG+GYS+GEIDKTLFIHRKSDQLLVAQIYVDDIIFG FPQDL+NNFIN
Subjt:  FLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFIN

Query:  IMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKSIVGSLLYLTTSRPDIAYTVGIC
        IMQSE EMSMVG LSCFLGLQIKQKNDDI ISQEKYARNMVKKFGLEQARNKRTPAAT+VKL K+TEGAEVDHKLYKSIVG+LLYLT SRPDIAY VGI 
Subjt:  IMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKSIVGSLLYLTTSRPDIAYTVGIC

Query:  ACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        A YQADPRITHLEV+K+I+KYVHGTSDFGMM SYDT  TL+RYCDAD
Subjt:  ACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

KAA0033543.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value 1.2e-207; identity 79.54%
Query:  METINVVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDY
        METINVVINDLD  +KQ+NDE++E  NM E RT S +E SKADN  D PGKSLKK SEE I+KKSK+ILSAHVKKNHP S IIGDPS GMQTRRK+KIDY
Subjt:  METINVVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDY

Query:  LKMVVDSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVA
         KMV D CY STI P TVDSALK+EYWLNAMQE+LLQF+RNNVWTLVSKPEGVNVI TKW+FKNKTDE  CVTKNKARLVAQGY QV+ +DF+ETFA VA
Subjt:  LKMVVDSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVA

Query:  RLEAIRLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSD
        RLEAIRLLLGISCIQKFKLYQMDVKS FLNGYLNEEVYVAQP CFVD EHPKHVYKLNKALYGLKQAPRAWY+RLTVYLRG+GYSRGEIDKTLFIHRKSD
Subjt:  RLEAIRLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSD

Query:  QLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDH
        QLL+                           EFEMSMVG LS FL LQ KQKNDDIFISQEKYA+NMV+KFGLEQARNKRTPAAT+VKL +DT GAEVDH
Subjt:  QLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDH

Query:  KLYKSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        KLY+SI+GSLLYLT SRPDIAY VGICA YQADPRI+HLEV+KRILKYVHGTSDF MMYSYDTTSTL+ YCDAD
Subjt:  KLYKSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

KAA0066164.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value 3.6e-207; identity 79.54%
Query:  METINVVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDY
        METINVVINDL+  IKQ+NDEEDE SNMSE RT S+VE  KA    D   KSL+KSS+E I KK ++I SAHVKKNHP S IIGDPS GMQTRRK+KIDY
Subjt:  METINVVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDY

Query:  LKMVVDSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVA
        +KMV D CYIST+ P TVDSAL+DEYWLNAMQEELLQF++NNVWTLVSKPEGVNVI TKW+FKNKTDE GCVTKNKA+LVAQGYTQVE +DFDETFA VA
Subjt:  LKMVVDSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVA

Query:  RLEAIRLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSD
        RLEAIRLLLGISCIQKFKLYQMDVKS FL+GYLNEEVYVAQPK FVDSEHPKH+YKLNKALYGLKQA RAWYD+LTVYLRG+GYSRGEIDKTLFI RKSD
Subjt:  RLEAIRLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSD

Query:  QLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDH
        QLLVAQIYVDDIIF GFP DLVNNFI     EFEMSMVG LSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAAT+VKL KDTE +EVDH
Subjt:  QLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDH

Query:  KLYKSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        KLY+SI                         ADPRITHLE +KRILKYVHGTSDFGMMYSYDTT TL+ YCDA+
Subjt:  KLYKSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

TYK11575.1 gag-pol polyprotein [Cucumis melo var. makuwa]; e value 3.2e-216; identity 84.01%
Query:  VVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVV
        +VINDLDSDIKQMNDEEDE  NM EVRT S VEESKADNS DGPGKSLKKSSEEIINKKS++I SAH KKNHP S IIG+PS GMQTRRKDKIDYLKM  
Subjt:  VVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVV

Query:  DSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAI
                                    ELLQF+RNNVWTLVSKPEGVNVI TKWIFKNK DETGCVTKNKARLVAQGYTQVE VDFDETFA VARL+AI
Subjt:  DSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAI

Query:  RLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA
        RLLLGISCIQKFKLYQMDVKS FLN YLNEEVYVAQPK FVDSEHPKHVYKLNKALYGLKQAPRAWYDRLT YLRGRGY R EIDKT FIHRKSDQLLVA
Subjt:  RLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA

Query:  QIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKS
        QIYVDDIIFGGFP DLVNNFINIMQSEFEMS VG LSCFLGLQIKQKNDDIFISQEKYA+NMVKKF LEQARNKRT AAT+VKL KDTEGAEVDHKLY+S
Subjt:  QIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKS

Query:  IVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        IVG+LLYLTTSRPDIAY VGICACYQADPRITHLE +KRILKYVHGTSDF MMYSYDTT TL+ YCDAD
Subjt:  IVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SN07 Gag-pol polyprotein; e value 5.4e-217; identity 84.22%
Query:  VVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVV
        +VINDLDSDIKQMNDEEDE  NM EVRT S VEESKADNS DGPGKSLKKSSEEIINKKS++I SAH KKNHP S IIG+PS GMQTRRKDKIDYLKM  
Subjt:  VVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVV

Query:  DSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAI
                                    ELLQF+RNNVWTLVSKPEGVNVI TKWIFKNK DETGCVTKNKARLVAQGYTQVE VDFDETFA VARL+AI
Subjt:  DSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAI

Query:  RLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA
        RLLLGISCIQKFKLYQMDVKS FLN YLNEEVYVAQPK FVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGY R EIDKT FIHRKSDQLLVA
Subjt:  RLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA

Query:  QIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKS
        QIYVDDIIFGGFP DLVNNFINIMQSEFEMS VG LSCFLGLQIKQKNDDIFISQEKYA+NMVKKF LEQARNKRT AAT+VKL KDTEGAEVDHKLY+S
Subjt:  QIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKS

Query:  IVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        IVG+LLYLTTSRPDIAY VGICACYQADPRITHLE +KRILKYVHGTSDF MMYSYDTT TL+ YCDAD
Subjt:  IVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

A0A5D3B9N4 Gag-pol polyprotein; e value 6.0e-208; identity 79.54%
Query:  METINVVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDY
        METINVVINDLD  +KQ+NDE++E  NM E RT S +E SKADN  D PGKSLKK SEE I+KKSK+ILSAHVKKNHP S IIGDPS GMQTRRK+KIDY
Subjt:  METINVVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDY

Query:  LKMVVDSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVA
         KMV D CY STI P TVDSALK+EYWLNAMQE+LLQF+RNNVWTLVSKPEGVNVI TKW+FKNKTDE  CVTKNKARLVAQGY QV+ +DF+ETFA VA
Subjt:  LKMVVDSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVA

Query:  RLEAIRLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSD
        RLEAIRLLLGISCIQKFKLYQMDVKS FLNGYLNEEVYVAQP CFVD EHPKHVYKLNKALYGLKQAPRAWY+RLTVYLRG+GYSRGEIDKTLFIHRKSD
Subjt:  RLEAIRLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSD

Query:  QLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDH
        QLL+                           EFEMSMVG LS FL LQ KQKNDDIFISQEKYA+NMV+KFGLEQARNKRTPAAT+VKL +DT GAEVDH
Subjt:  QLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDH

Query:  KLYKSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        KLY+SI+GSLLYLT SRPDIAY VGICA YQADPRI+HLEV+KRILKYVHGTSDF MMYSYDTTSTL+ YCDAD
Subjt:  KLYKSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

A0A5D3BWU5 Gag-pol polyprotein; e value 2.3e-196; identity 79.64%
Query:  MSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVVDSCYISTIGPLTVDSALKDEYW
        MSE RT ++VE  +ADN  D  GKSLKKSSEE I KKS++I  AHVKKNHP S IIGDPS GMQTRRK+KIDY+KMV D CYIST  P TVDSA +DEYW
Subjt:  MSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVVDSCYISTIGPLTVDSALKDEYW

Query:  LNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSV
        LNAMQ+ELL+F+RNNVWTLVSKPEGVNVI                                 +DFDETFA VARLEAIRLLLGISCIQKFKLYQMDVKS 
Subjt:  LNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSV

Query:  FLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFIN
        FLNGYLNEEVYVAQPK FVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRG+GYS+GEIDKTLFIHRKSDQLLVAQIYVDDIIFG FPQDL+NNFIN
Subjt:  FLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFIN

Query:  IMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKSIVGSLLYLTTSRPDIAYTVGIC
        IMQSE EMSMVG LSCFLGLQIKQKNDDI ISQEKYARNMVKKFGLEQARNKRTPAAT+VKL K+TEGAEVDHKLYKSIVG+LLYLT SRPDIAY VGI 
Subjt:  IMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKSIVGSLLYLTTSRPDIAYTVGIC

Query:  ACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        A YQADPRITHLEV+K+I+KYVHGTSDFGMM SYDT  TL+RYCDAD
Subjt:  ACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

A0A5D3CJ17 Gag-pol polyprotein; e value 1.6e-216; identity 84.01%
Query:  VVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVV
        +VINDLDSDIKQMNDEEDE  NM EVRT S VEESKADNS DGPGKSLKKSSEEIINKKS++I SAH KKNHP S IIG+PS GMQTRRKDKIDYLKM  
Subjt:  VVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVV

Query:  DSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAI
                                    ELLQF+RNNVWTLVSKPEGVNVI TKWIFKNK DETGCVTKNKARLVAQGYTQVE VDFDETFA VARL+AI
Subjt:  DSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAI

Query:  RLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA
        RLLLGISCIQKFKLYQMDVKS FLN YLNEEVYVAQPK FVDSEHPKHVYKLNKALYGLKQAPRAWYDRLT YLRGRGY R EIDKT FIHRKSDQLLVA
Subjt:  RLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVA

Query:  QIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKS
        QIYVDDIIFGGFP DLVNNFINIMQSEFEMS VG LSCFLGLQIKQKNDDIFISQEKYA+NMVKKF LEQARNKRT AAT+VKL KDTEGAEVDHKLY+S
Subjt:  QIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKS

Query:  IVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        IVG+LLYLTTSRPDIAY VGICACYQADPRITHLE +KRILKYVHGTSDF MMYSYDTT TL+ YCDAD
Subjt:  IVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

A0A5D3CXU0 Gag-pol polyprotein; e value 1.7e-207; identity 79.54%
Query:  METINVVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDY
        METINVVINDL+  IKQ+NDEEDE SNMSE RT S+VE  KA    D   KSL+KSS+E I KK ++I SAHVKKNHP S IIGDPS GMQTRRK+KIDY
Subjt:  METINVVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDY

Query:  LKMVVDSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVA
        +KMV D CYIST+ P TVDSAL+DEYWLNAMQEELLQF++NNVWTLVSKPEGVNVI TKW+FKNKTDE GCVTKNKA+LVAQGYTQVE +DFDETFA VA
Subjt:  LKMVVDSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVA

Query:  RLEAIRLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSD
        RLEAIRLLLGISCIQKFKLYQMDVKS FL+GYLNEEVYVAQPK FVDSEHPKH+YKLNKALYGLKQA RAWYD+LTVYLRG+GYSRGEIDKTLFI RKSD
Subjt:  RLEAIRLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSD

Query:  QLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDH
        QLLVAQIYVDDIIF GFP DLVNNFI     EFEMSMVG LSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAAT+VKL KDTE +EVDH
Subjt:  QLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDH

Query:  KLYKSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        KLY+SI                         ADPRITHLE +KRILKYVHGTSDFGMMYSYDTT TL+ YCDA+
Subjt:  KLYKSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein; e value 1.8e-52; identity 30.17%
Query:  DEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVVDSCYISTIGPLTVD
        +E   + N +E R   T E  K +   D P    K    EIIN++S+      +K     S+   D S+             K+V+++  I    P + D
Subjt:  DEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVVDSCYISTIGPLTVD

Query:  S-ALKDE--YWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRLLLGISCIQK
            +D+   W  A+  EL   K NN WT+  +PE  N++ ++W+F  K +E G   + KARLVA+G+TQ   +D++ETFA VAR+ + R +L +     
Subjt:  S-ALKDE--YWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRLLLGISCIQK

Query:  FKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKS--DQLLVAQIYVDDIIF
         K++QMDVK+ FLNG L EE+Y+  P+    S +  +V KLNKA+YGLKQA R W++     L+   +    +D+ ++I  K   ++ +   +YVDD++ 
Subjt:  FKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKS--DQLLVAQIYVDDIIF

Query:  GGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVD-HKLYKSIVGSLLY-
               +NNF   +  +F M+ +  +  F+G++I+ + D I++SQ  Y + ++ KF +E      TP  +  K+  +   ++ D +   +S++G L+Y 
Subjt:  GGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVD-HKLYKSIVGSLLY-

Query:  LTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTT--STLIRYCDAD
        +  +RPD+   V I + Y +       + +KR+L+Y+ GT D  +++  +    + +I Y D+D
Subjt:  LTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTT--STLIRYCDAD

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 4.3e-54; identity 33.89%
Query:  LNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSV
        + AMQEE+   ++N  + LV  P+G   +  KW+FK K D    + + KARLV +G+ Q + +DFDE F+ V ++ +IR +L ++     ++ Q+DVK+ 
Subjt:  LNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSV

Query:  FLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSD-QLLVAQIYVDDIIFGGFPQDLVNNFI
        FL+G L EE+Y+ QP+ F  +     V KLNK+LYGLKQAPR WY +   +++ + Y +   D  ++  R S+   ++  +YVDD++  G  + L+    
Subjt:  FLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSD-QLLVAQIYVDDIIFGGFPQDLVNNFI

Query:  NIMQSEFEMSMVGVLSCFLGLQI--KQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHK------LYKSIVGSLLY-LTTSR
          +   F+M  +G     LG++I  ++ +  +++SQEKY   ++++F ++ A+   TP A ++KL+K      V+ K       Y S VGSL+Y +  +R
Subjt:  NIMQSEFEMSMVGVLSCFLGLQI--KQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHK------LYKSIVGSLLY-LTTSR

Query:  PDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        PDIA+ VG+ + +  +P   H E +K IL+Y+ GT+   + +   +   L  Y DAD
Subjt:  PDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

P25600 Putative transposon Ty5-1 protein YCL074W; e value 6.0e-32; identity 32.28%
Query:  MDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL
        MDV + FLN  ++E +YV QP  FV+  +P +V++L   +YGLKQAP  W + +   L+  G+ R E +  L+    SD  +   +YVDD++       +
Subjt:  MDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDL

Query:  VNNFINIMQSEFEMSMVGVLSCFLGLQIKQ-KNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKSIVGSLLY-LTTSRPD
         +     +   + M  +G +  FLGL I Q  N DI +S + Y      +  +   +  +TP      L + T     D   Y+SIVG LL+   T RPD
Subjt:  VNNFINIMQSEFEMSMVGVLSCFLGLQIKQ-KNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKSIVGSLLY-LTTSRPD

Query:  IAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDA
        I+Y V + + +  +PR  HLE  +R+L+Y++ T    + Y   +   L  YCDA
Subjt:  IAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 1.0e-63; identity 37.12%
Query:  PLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEG-VNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRLLLGISC
        P T   ALKDE W NAM  E+     N+ W LV  P   V ++  +WIF  K +  G + + KARLVA+GY Q   +D+ ETF+ V +  +IR++LG++ 
Subjt:  PLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEG-VNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRLLLGISC

Query:  IQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDII
         + + + Q+DV + FL G L ++VY++QP  F+D + P +V KL KALYGLKQAPRAWY  L  YL   G+     D +LF+ ++   ++   +YVDDI+
Subjt:  IQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDII

Query:  FGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKSIVGSLLYL
          G    L++N ++ +   F +     L  FLG++ K+    + +SQ +Y  +++ +  +  A+   TP A   KL+  +     D   Y+ IVGSL YL
Subjt:  FGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKSIVGSLLYL

Query:  TTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
          +RPDI+Y V   + +   P   HL+ +KRIL+Y+ GT + G+      T +L  Y DAD
Subjt:  TTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 8.6e-63; identity 35.31%
Query:  MQTRRKDKI--DYLKMVVDSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLV-SKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQ
        M TR KD I     K    +   +   P T   A+KD+ W  AM  E+     N+ W LV   P  V ++  +WIF  K +  G + + KARLVA+GY Q
Subjt:  MQTRRKDKI--DYLKMVVDSCYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLV-SKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQ

Query:  VESVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSR
           +D+ ETF+ V +  +IR++LG++  + + + Q+DV + FL G L +EVY++QP  FVD + P +V +L KA+YGLKQAPRAWY  L  YL   G+  
Subjt:  VESVDFDETFALVARLEAIRLLLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSR

Query:  GEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATY
           D +LF+ ++   ++   +YVDDI+  G    L+ + ++ +   F +     L  FLG++ K+    + +SQ +Y  +++ +  +  A+   TP AT 
Subjt:  GEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATY

Query:  VKLAKDTEGAEVDHKLYKSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
         KL   +     D   Y+ IVGSL YL  +RPD++Y V   + Y   P   H   +KR+L+Y+ GT D G+      T +L  Y DAD
Subjt:  VKLAKDTEGAEVDHKLYKSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value 7.8e-59; identity 34.32%
Query:  CYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRL
        C      P T + A +   W  AM +E+   +  + W + + P     I  KW++K K +  G + + KARLVA+GYTQ E +DF ETF+ V +L +++L
Subjt:  CYISTIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRL

Query:  LLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFV----DSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLL
        +L IS I  F L+Q+D+ + FLNG L+EE+Y+  P  +     DS  P  V  L K++YGLKQA R W+ + +V L G G+ +   D T F+   +   L
Subjt:  LLGISCIQKFKLYQMDVKSVFLNGYLNEEVYVAQPKCFV----DSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLL

Query:  VAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLY
           +YVDDII        V+   + ++S F++  +G L  FLGL+I +    I I Q KYA +++ + GL   +    P    V  +  + G  VD K Y
Subjt:  VAQIYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLY

Query:  KSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDA
        + ++G L+YL  +R DI++ V   + +   PR+ H + + +IL Y+ GT   G+ YS      L  + DA
Subjt:  KSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDA

ATMG00240.1 Gag-Pol-related retrotransposon family protein; e value 2.5e-04; identity 29.69%
Query:  LYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        +YLT +RPD+ + V   + + +  R   ++ + ++L YV GT   G+ YS  +   L  + D+D
Subjt:  LYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

ATMG00810.1 DNA/RNA polymerases superfamily protein; e value 7.6e-22; identity 34.91%
Query:  IYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEV-DHKLYKS
        +YVDDI+  G    L+N  I  + S F M  +G +  FLG+QIK     +F+SQ KYA  ++   G+   +   TP    +KL      A+  D   ++S
Subjt:  IYVDDIIFGGFPQDLVNNFINIMQSEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEV-DHKLYKS

Query:  IVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD
        IVG+L YLT +RPDI+Y V I      +P +   +++KR+L+YV GT   G+    ++   +  +CD+D
Subjt:  IVGSLLYLTTSRPDIAYTVGICACYQADPRITHLEVIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase); e value 2.5e-17; identity 40.8%
Query:  MQTRRKDKIDYLKMVVDSCYISTI--GPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQV
        M TR K  I+ L         +TI   P +V  ALKD  W  AMQEEL    RN  W LV  P   N++  KW+FK K    G + + KARLVA+G+ Q 
Subjt:  MQTRRKDKIDYLKMVVDSCYISTI--GPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQV

Query:  ESVDFDETFALVARLEAIRLLLGIS
        E + F ET++ V R   IR +L ++
Subjt:  ESVDFDETFALVARLEAIRLLLGIS


Sequences
CDS sequence
ATGGAAACAATCAATGTAGTTATAAATGATCTCGATTCTGATATCAAACAGATGAATGATGAGGAAGATGAGAATTCAAACATGTCTGAAGTCAGAACTATGAGTACTGT
GGAAGAGTCTAAAGCTGATAATTCATTCGACGGTCCAGGCAAAAGTTTGAAAAAATCATCAGAAGAAATTATCAATAAAAAATCTAAGATAATTCTGTCAGCTCATGTAA
AGAAAAATCATCCAACAAGCTTTATCATAGGTGATCCATCAGTTGGGATGCAGACCAGAAGGAAAGATAAGATTGACTATTTGAAGATGGTTGTTGACTCATGCTATATT
TCCACCATTGGACCTTTGACTGTTGACTCTGCTCTCAAGGATGAGTATTGGTTAAATGCTATGCAAGAGGAGCTACTGCAATTTAAACGAAACAATGTCTGGACGTTAGT
CTCAAAGCCAGAAGGTGTAAACGTTATTAGCACCAAATGGATATTTAAAAATAAGACTGATGAAACTGGATGTGTGACAAAAAATAAAGCCAGATTAGTAGCTCAAGGGT
ATACTCAAGTTGAAAGTGTTGACTTTGATGAAACATTTGCTCTTGTAGCTCGACTTGAAGCCATTCGACTTTTACTGGGCATATCATGCATACAGAAATTTAAGTTGTAT
CAGATGGATGTAAAGAGTGTCTTCTTAAATGGGTACTTGAATGAGGAGGTTTATGTTGCTCAACCAAAATGTTTTGTAGATTCCGAGCACCCGAAGCATGTGTATAAGCT
CAACAAAGCCTTATATGGACTAAAGCAAGCTCCGAGAGCTTGGTATGACCGGCTAACTGTGTACTTGAGAGGTAGAGGATATTCCAGAGGAGAAATTGACAAGACCTTGT
TCATACACAGGAAATCTGACCAACTGTTGGTGGCTCAAATTTATGTTGATGACATCATTTTTGGAGGATTTCCTCAAGATCTAGTAAATAATTTCATTAATATTATGCAG
TCAGAATTCGAAATGAGCATGGTTGGAGTACTTTCATGTTTTCTGGGACTTCAAATTAAGCAAAAAAATGATGACATTTTCATATCACAAGAAAAGTATGCCAGGAATAT
GGTCAAAAAGTTTGGTTTGGAACAGGCTCGAAATAAGCGGACTCCAGCTGCAACATATGTTAAACTTGCAAAAGACACTGAAGGTGCTGAAGTTGATCACAAACTTTACA
AGAGTATAGTAGGCAGCCTATTATACTTAACAACAAGTCGACCTGACATAGCTTATACTGTGGGAATATGTGCTTGTTATCAGGCGGATCCCCGCATCACTCACCTAGAA
GTTATTAAACGAATTCTTAAATACGTTCATGGGACCAGTGACTTTGGAATGATGTATTCCTATGATACCACTTCCACTCTAATTAGATATTGTGATGCTGACTAG
mRNA sequence
ATGGAAACAATCAATGTAGTTATAAATGATCTCGATTCTGATATCAAACAGATGAATGATGAGGAAGATGAGAATTCAAACATGTCTGAAGTCAGAACTATGAGTACTGT
GGAAGAGTCTAAAGCTGATAATTCATTCGACGGTCCAGGCAAAAGTTTGAAAAAATCATCAGAAGAAATTATCAATAAAAAATCTAAGATAATTCTGTCAGCTCATGTAA
AGAAAAATCATCCAACAAGCTTTATCATAGGTGATCCATCAGTTGGGATGCAGACCAGAAGGAAAGATAAGATTGACTATTTGAAGATGGTTGTTGACTCATGCTATATT
TCCACCATTGGACCTTTGACTGTTGACTCTGCTCTCAAGGATGAGTATTGGTTAAATGCTATGCAAGAGGAGCTACTGCAATTTAAACGAAACAATGTCTGGACGTTAGT
CTCAAAGCCAGAAGGTGTAAACGTTATTAGCACCAAATGGATATTTAAAAATAAGACTGATGAAACTGGATGTGTGACAAAAAATAAAGCCAGATTAGTAGCTCAAGGGT
ATACTCAAGTTGAAAGTGTTGACTTTGATGAAACATTTGCTCTTGTAGCTCGACTTGAAGCCATTCGACTTTTACTGGGCATATCATGCATACAGAAATTTAAGTTGTAT
CAGATGGATGTAAAGAGTGTCTTCTTAAATGGGTACTTGAATGAGGAGGTTTATGTTGCTCAACCAAAATGTTTTGTAGATTCCGAGCACCCGAAGCATGTGTATAAGCT
CAACAAAGCCTTATATGGACTAAAGCAAGCTCCGAGAGCTTGGTATGACCGGCTAACTGTGTACTTGAGAGGTAGAGGATATTCCAGAGGAGAAATTGACAAGACCTTGT
TCATACACAGGAAATCTGACCAACTGTTGGTGGCTCAAATTTATGTTGATGACATCATTTTTGGAGGATTTCCTCAAGATCTAGTAAATAATTTCATTAATATTATGCAG
TCAGAATTCGAAATGAGCATGGTTGGAGTACTTTCATGTTTTCTGGGACTTCAAATTAAGCAAAAAAATGATGACATTTTCATATCACAAGAAAAGTATGCCAGGAATAT
GGTCAAAAAGTTTGGTTTGGAACAGGCTCGAAATAAGCGGACTCCAGCTGCAACATATGTTAAACTTGCAAAAGACACTGAAGGTGCTGAAGTTGATCACAAACTTTACA
AGAGTATAGTAGGCAGCCTATTATACTTAACAACAAGTCGACCTGACATAGCTTATACTGTGGGAATATGTGCTTGTTATCAGGCGGATCCCCGCATCACTCACCTAGAA
GTTATTAAACGAATTCTTAAATACGTTCATGGGACCAGTGACTTTGGAATGATGTATTCCTATGATACCACTTCCACTCTAATTAGATATTGTGATGCTGACTAG
Protein sequence
METINVVINDLDSDIKQMNDEEDENSNMSEVRTMSTVEESKADNSFDGPGKSLKKSSEEIINKKSKIILSAHVKKNHPTSFIIGDPSVGMQTRRKDKIDYLKMVVDSCYI
STIGPLTVDSALKDEYWLNAMQEELLQFKRNNVWTLVSKPEGVNVISTKWIFKNKTDETGCVTKNKARLVAQGYTQVESVDFDETFALVARLEAIRLLLGISCIQKFKLY
QMDVKSVFLNGYLNEEVYVAQPKCFVDSEHPKHVYKLNKALYGLKQAPRAWYDRLTVYLRGRGYSRGEIDKTLFIHRKSDQLLVAQIYVDDIIFGGFPQDLVNNFINIMQ
SEFEMSMVGVLSCFLGLQIKQKNDDIFISQEKYARNMVKKFGLEQARNKRTPAATYVKLAKDTEGAEVDHKLYKSIVGSLLYLTTSRPDIAYTVGICACYQADPRITHLE
VIKRILKYVHGTSDFGMMYSYDTTSTLIRYCDAD