CuGenDBv2

Cmc02g0046841 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc02g0046841
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Gag-pol polyprotein
Genome location: CMiso1.1chr02:12383348..12384643
RNA-Seq Expression: Cmc02g0046841
Synteny: Cmc02g0046841
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0035157.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 2.4e-218; % identity: 96.51)
Query:  HNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDE
        H RVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDL+SDIKQMNDEEDE
Subjt:  HNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDE

Query:  TPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFK
        TPNMSEVRTTSTVEESKADNSSDS  KSLKKS EEIINKKSKLI SAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYIS+IEHSTVDHSA K
Subjt:  TPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFK

Query:  DEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMD
        DEYWLNAMQEELL+  RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMD
Subjt:  DEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMD

Query:  VKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVN
        VKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLV QIYVDDIIFGGFPQGL  
Subjt:  VKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVN

Query:  N
        N
Subjt:  N

KAA0053137.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 9.3e-154; % identity: 70.12)
Query:  EAINTVCHIHNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDI
        EA+N  CHIHNRV IRTGTT+TLYELWKERKPNVKYFHVFGSTCY+LADREY QKWDARSEQGIFLGYSQNN AY+VYNN+S SVMETINVVINDL+S+I
Subjt:  EAINTVCHIHNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDI

Query:  KQMNDEEDETPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEH
        KQMNDEED+T NM EVRT                                                                                  
Subjt:  KQMNDEEDETPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEH

Query:  STVDHSAFKDEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCI
            +SA KDEYWLN MQEELL+  RNNVW L+ KP GVNVIGTKWIFKNKTDE GCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCI
Subjt:  STVDHSAFKDEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCI

Query:  QKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIF
        QKFKLYQ+DVKS FLNGYLNEEVYVAQPKGFVD EHPKHVYKLNKALYGLKQA  AWYDRLTVYLRGRGYSRGEIDK LFI  KSDQLLVAQIYVDDIIF
Subjt:  QKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIF

Query:  GGFPQGLVNNFINIM
        GGFP  L+NNFINIM
Subjt:  GGFPQGLVNNFINIM

KAA0066164.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 1.4e-162; % identity: 82.92)
Query:  REYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDETPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKK
        REY +KWDA+SEQGIFLGYSQN+ AYRV+NNRS  VMETINVVINDL   IKQ+NDEEDET NMSE RTTS+VE  KA   SD   KSL+KSS+E I KK
Subjt:  REYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDETPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKK

Query:  SKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFKDEYWLNAMQEELLKLR-NNVWMLVLKPVGVNVIGTKWIFK
         +LISSAHVKKNHPASSIIGDPS GMQTRRK+KIDY+KMVADLCYIS++E STVD SA +DEYWLNAMQEELL+ R NNVW LV KP GVNVIGTKW+FK
Subjt:  SKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFKDEYWLNAMQEELLKLR-NNVWMLVLKPVGVNVIGTKWIFK

Query:  NKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYG
        NKTDE GCVTKNKA+LVAQGYTQVEG+DFDETFA VARLEAIRLLLGISCIQKFKLYQMDVKS FL+GYLNEEVYVAQPKGFVD EHPKH+YKLNKALYG
Subjt:  NKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYG

Query:  LKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVNNFI
        LKQA  AWYD+LTVYLRG+GYSRGEIDKTLFI  KSDQLLVAQIYVDDIIF GFP  LVNNFI
Subjt:  LKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVNNFI

TYJ98791.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 1.2e-188; % identity: 84.9)
Query:  RVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDETP
        RVTIR+G TVTL+ELWK+RKPNVKYFHVFGSTCYILADREYRQKWDA+SEQGIFLGYSQN+ AYRV+NNR  SVMETINVVIND++S IKQ+NDEEDE P
Subjt:  RVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDETP

Query:  NMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFKDE
        NMSE RTTS+V+ +KADN SD   K L+KS EE I KKS+LIS AHVKKNHPASSIIGDPSAGMQTRRK+KIDY+KMVADLCYIS+ E STVD S  +DE
Subjt:  NMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFKDE

Query:  YWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVK
        Y LNAMQEELL+  RNNVW LVLKP GVNVIGTKW+FKNKTDE GCVTKNKARLVAQGYTQVEG+DFDETF+PVARLEAIRLLLGISCIQKFKLYQMDVK
Subjt:  YWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVK

Query:  SDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVNNF
        S FLNGYLNEEVYVAQPK FVD EH KHVYKLNKALYGLKQAP AWYDRLTVYLRG+GYSRGEIDKTLFI  KSDQLLVAQIYVDDIIFGGFPQ LVNNF
Subjt:  SDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVNNF

Query:  INIM
        I +M
Subjt:  INIM

TYK29237.1 gag-pol polyprotein [Cucumis melo var. makuwa] (e value: 9.1e-226; % identity: 99.25)
Query:  HNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDE
        H RVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDE
Subjt:  HNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDE

Query:  TPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFK
        TPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFK
Subjt:  TPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFK

Query:  DEYWLNAMQEELLKLRNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDV
        DEYWLNAMQEELLKLRNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDV
Subjt:  DEYWLNAMQEELLKLRNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDV

Query:  KSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVNN
        KSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGL  N
Subjt:  KSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVNN

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7T0Q0 Gag-pol polyprotein (e value: 1.2e-218; % identity: 96.51)
Query:  HNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDE
        H RVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDL+SDIKQMNDEEDE
Subjt:  HNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDE

Query:  TPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFK
        TPNMSEVRTTSTVEESKADNSSDS  KSLKKS EEIINKKSKLI SAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYIS+IEHSTVDHSA K
Subjt:  TPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFK

Query:  DEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMD
        DEYWLNAMQEELL+  RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMD
Subjt:  DEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMD

Query:  VKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVN
        VKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLV QIYVDDIIFGGFPQGL  
Subjt:  VKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVN

Query:  N
        N
Subjt:  N

A0A5D3BJA9 Gag-pol polyprotein (e value: 5.6e-189; % identity: 84.9)
Query:  RVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDETP
        RVTIR+G TVTL+ELWK+RKPNVKYFHVFGSTCYILADREYRQKWDA+SEQGIFLGYSQN+ AYRV+NNR  SVMETINVVIND++S IKQ+NDEEDE P
Subjt:  RVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDETP

Query:  NMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFKDE
        NMSE RTTS+V+ +KADN SD   K L+KS EE I KKS+LIS AHVKKNHPASSIIGDPSAGMQTRRK+KIDY+KMVADLCYIS+ E STVD S  +DE
Subjt:  NMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFKDE

Query:  YWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVK
        Y LNAMQEELL+  RNNVW LVLKP GVNVIGTKW+FKNKTDE GCVTKNKARLVAQGYTQVEG+DFDETF+PVARLEAIRLLLGISCIQKFKLYQMDVK
Subjt:  YWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVK

Query:  SDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVNNF
        S FLNGYLNEEVYVAQPK FVD EH KHVYKLNKALYGLKQAP AWYDRLTVYLRG+GYSRGEIDKTLFI  KSDQLLVAQIYVDDIIFGGFPQ LVNNF
Subjt:  SDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVNNF

Query:  INIM
        I +M
Subjt:  INIM

A0A5D3BPB3 Gag-pol polyprotein (e value: 4.5e-154; % identity: 70.12)
Query:  EAINTVCHIHNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDI
        EA+N  CHIHNRV IRTGTT+TLYELWKERKPNVKYFHVFGSTCY+LADREY QKWDARSEQGIFLGYSQNN AY+VYNN+S SVMETINVVINDL+S+I
Subjt:  EAINTVCHIHNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDI

Query:  KQMNDEEDETPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEH
        KQMNDEED+T NM EVRT                                                                                  
Subjt:  KQMNDEEDETPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEH

Query:  STVDHSAFKDEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCI
            +SA KDEYWLN MQEELL+  RNNVW L+ KP GVNVIGTKWIFKNKTDE GCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCI
Subjt:  STVDHSAFKDEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCI

Query:  QKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIF
        QKFKLYQ+DVKS FLNGYLNEEVYVAQPKGFVD EHPKHVYKLNKALYGLKQA  AWYDRLTVYLRGRGYSRGEIDK LFI  KSDQLLVAQIYVDDIIF
Subjt:  QKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIF

Query:  GGFPQGLVNNFINIM
        GGFP  L+NNFINIM
Subjt:  GGFPQGLVNNFINIM

A0A5D3CXU0 Gag-pol polyprotein (e value: 6.9e-163; % identity: 82.92)
Query:  REYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDETPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKK
        REY +KWDA+SEQGIFLGYSQN+ AYRV+NNRS  VMETINVVINDL   IKQ+NDEEDET NMSE RTTS+VE  KA   SD   KSL+KSS+E I KK
Subjt:  REYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDETPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKK

Query:  SKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFKDEYWLNAMQEELLKLR-NNVWMLVLKPVGVNVIGTKWIFK
         +LISSAHVKKNHPASSIIGDPS GMQTRRK+KIDY+KMVADLCYIS++E STVD SA +DEYWLNAMQEELL+ R NNVW LV KP GVNVIGTKW+FK
Subjt:  SKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFKDEYWLNAMQEELLKLR-NNVWMLVLKPVGVNVIGTKWIFK

Query:  NKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYG
        NKTDE GCVTKNKA+LVAQGYTQVEG+DFDETFA VARLEAIRLLLGISCIQKFKLYQMDVKS FL+GYLNEEVYVAQPKGFVD EHPKH+YKLNKALYG
Subjt:  NKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYG

Query:  LKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVNNFI
        LKQA  AWYD+LTVYLRG+GYSRGEIDKTLFI  KSDQLLVAQIYVDDIIF GFP  LVNNFI
Subjt:  LKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVNNFI

A0A5D3DZD2 Gag-pol polyprotein (e value: 4.4e-226; % identity: 99.25)
Query:  HNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDE
        H RVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDE
Subjt:  HNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMETINVVINDLNSDIKQMNDEEDE

Query:  TPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFK
        TPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFK
Subjt:  TPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFK

Query:  DEYWLNAMQEELLKLRNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDV
        DEYWLNAMQEELLKLRNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDV
Subjt:  DEYWLNAMQEELLKLRNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDV

Query:  KSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVNN
        KSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGL  N
Subjt:  KSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQGLVNN

SwissProt top hits (columns: e value, % identity, alignment)
P04146 Copia protein (e value: 3.8e-41; % identity: 26.36)
Query:  ARVMIHTKNVPLCFWAEAINTVCHIHNRVTIR--TGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRS
        AR M+    +   FW EA+ T  ++ NR+  R    ++ T YE+W  +KP +K+  VFG+T Y+   +  + K+D +S + IF+GY  N   +++++  +
Subjt:  ARVMIHTKNVPLCFWAEAINTVCHIHNRVTIR--TGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRS

Query:  DSVMETINVVINDLN--------------SDIKQMNDEEDETPNMSEVRTTSTVEESKADN------SSDSPEKSLKKSSEEII----------------
        +  +   +VV+++ N               D K+  ++     +   ++T    E  + DN      S +S  K+    S +II                
Subjt:  DSVMETINVVINDLN--------------SDIKQMNDEEDETPNMSEVRTTSTVEESKADN------SSDSPEKSLKKSSEEII----------------

Query:  ---------------------------------NKKSKLISSAHVKK---NHPASS----IIGDPSAGMQTRRKDKIDY-------LKMVADLCYISSIE
                                         N+  +  ++ H+K+   ++P  +    II   S  ++T  K +I Y        K+V +   I +  
Subjt:  ---------------------------------NKKSKLISSAHVKK---NHPASS----IIGDPSAGMQTRRKDKIDY-------LKMVADLCYISSIE

Query:  HSTVDHSAFKDE--YWLNAMQEELLKLR-NNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGI
         ++ D   ++D+   W  A+  EL   + NN W +  +P   N++ ++W+F  K +E+G   + KARLVA+G+TQ   +D++ETFAPVAR+ + R +L +
Subjt:  HSTVDHSAFKDE--YWLNAMQEELLKLR-NNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGI

Query:  SCIQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKS--DQLLVAQIYV
              K++QMDVK+ FLNG L EE+Y+  P+G     +  +V KLNKA+YGLKQA   W++     L+   +    +D+ ++I  K   ++ +   +YV
Subjt:  SCIQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKS--DQLLVAQIYV

Query:  DDIIFGGFPQGLVNNF
        DD++        +NNF
Subjt:  DDIIFGGFPQGLVNNF

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.8e-43; % identity: 29)
Query:  RVMIHTKNVPLCFWAEAINTVCHIHNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSV
        R M+    +P  FW EA+ T C++ NR             +W  ++ +  +  VFG   +    +E R K D +S   IF+GY    + YR+++     V
Subjt:  RVMIHTKNVPLCFWAEAINTVCHIHNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSV

Query:  METINVVINDLNSDIKQMNDEEDET-----PNMSEVRTTS---TVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASS------IIGDP
        + + +VV  +  S+++   D  ++      PN   + +TS   T  ES  D  S+  E+       E+I +  +L      +  HP         +    
Subjt:  METINVVINDLNSDIKQMNDEEDET-----PNMSEVRTTS---TVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASS------IIGDP

Query:  SAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFKDEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYT
           +++RR    +Y+ +++D     S++   + H   +    + AMQEE+  L +N  + LV  P G   +  KW+FK K D    + + KARLV +G+ 
Subjt:  SAGMQTRRKDKIDYLKMVADLCYISSIEHSTVDHSAFKDEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYT

Query:  QVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYS
        Q +G+DFDE F+PV ++ +IR +L ++     ++ Q+DVK+ FL+G L EE+Y+ QP+GF        V KLNK+LYGLKQAP  WY +   +++ + Y 
Subjt:  QVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYS

Query:  RGEIDKTLFIDSKSD-QLLVAQIYVDDIIFGGFPQGLV
        +   D  ++    S+   ++  +YVDD++  G  +GL+
Subjt:  RGEIDKTLFIDSKSD-QLLVAQIYVDDIIFGGFPQGLV

P92520 Uncharacterized mitochondrial protein AtMg00820 (e value: 3.8e-17; % identity: 42.4)
Query:  MQTRRKDKIDYLKMVADLCYISSIEHSTVD-HSAFKDEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQV
        M TR K  I+ L     L   ++I+        A KD  W  AMQEEL  L RN  W+LV  PV  N++G KW+FK K    G + + KARLVA+G+ Q 
Subjt:  MQTRRKDKIDYLKMVADLCYISSIEHSTVD-HSAFKDEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQV

Query:  EGVDFDETFAPVARLEAIRLLLGIS
        EG+ F ET++PV R   IR +L ++
Subjt:  EGVDFDETFAPVARLEAIRLLLGIS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 5.9e-42; % identity: 41.83)
Query:  AFKDEYWLNAMQEEL-LKLRNNVWMLVLKPVG-VNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKL
        A KDE W NAM  E+  ++ N+ W LV  P   V ++G +WIF  K +  G + + KARLVA+GY Q  G+D+ ETF+PV +  +IR++LG++  + + +
Subjt:  AFKDEYWLNAMQEEL-LKLRNNVWMLVLKPVG-VNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKL

Query:  YQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQ
         Q+DV + FL G L ++VY++QP GF+D + P +V KL KALYGLKQAP AWY  L  YL   G+     D +LF+  +   ++   +YVDDI+  G   
Subjt:  YQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFPQ

Query:  GLVNNFIN
         L++N ++
Subjt:  GLVNNFIN

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e value: 1.1e-40; % identity: 39.66)
Query:  MQTRRKDKIDYLKMVADLCYISSIEHSTVDHS---AFKDEYWLNAMQEEL-LKLRNNVWMLV-LKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGY
        M TR KD I   K      Y +S+  ++   +   A KD+ W  AM  E+  ++ N+ W LV   P  V ++G +WIF  K +  G + + KARLVA+GY
Subjt:  MQTRRKDKIDYLKMVADLCYISSIEHSTVDHS---AFKDEYWLNAMQEEL-LKLRNNVWMLV-LKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGY

Query:  TQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGY
         Q  G+D+ ETF+PV +  +IR++LG++  + + + Q+DV + FL G L +EVY++QP GFVD + P +V +L KA+YGLKQAP AWY  L  YL   G+
Subjt:  TQVEGVDFDETFAPVARLEAIRLLLGISCIQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGY

Query:  SRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGG
             D +LF+  +   ++   +YVDDI+  G
Subjt:  SRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGG

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 5.8e-37; % identity: 39.15)
Query:  LCYISSIEHSTVDHSAFKDEYWLNAMQEELLKLR-NNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI
        +C   + E ST  + A +   W  AM +E+  +   + W +   P     IG KW++K K +  G + + KARLVA+GYTQ EG+DF ETF+PV +L ++
Subjt:  LCYISSIEHSTVDHSAFKDEYWLNAMQEELLKLR-NNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAI

Query:  RLLLGISCIQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFV----DFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQ
        +L+L IS I  F L+Q+D+ + FLNG L+EE+Y+  P G+     D   P  V  L K++YGLKQA   W+ + +V L G G+ +   D T F+   +  
Subjt:  RLLLGISCIQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFV----DFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQ

Query:  LLVAQIYVDDII
         L   +YVDDII
Subjt:  LLVAQIYVDDII

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 2.7e-18; % identity: 42.4)
Query:  MQTRRKDKIDYLKMVADLCYISSIEHSTVD-HSAFKDEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQV
        M TR K  I+ L     L   ++I+        A KD  W  AMQEEL  L RN  W+LV  PV  N++G KW+FK K    G + + KARLVA+G+ Q 
Subjt:  MQTRRKDKIDYLKMVADLCYISSIEHSTVD-HSAFKDEYWLNAMQEELLKL-RNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQV

Query:  EGVDFDETFAPVARLEAIRLLLGIS
        EG+ F ET++PV R   IR +L ++
Subjt:  EGVDFDETFAPVARLEAIRLLLGIS


Sequences
CDS sequence
ATGGCTCGTGTTATGATACATACCAAAAATGTACCTTTATGTTTTTGGGCAGAAGCTATAAATACTGTCTGTCACATTCATAACAGAGTAACTATTAGAACTGGA
ACGACTGTTACTCTTTATGAACTTTGGAAAGAGAGAAAACCAAATGTCAAATACTTCCATGTGTTTGGAAGTACATGTTATATCTTAGCTGACCGAGAATACCGT
CAGAAATGGGATGCAAGGTCAGAACAAGGAATCTTTCTCGGGTACTCTCAAAACAATTGGGCCTATAGAGTCTACAATAACAGATCCGACAGTGTTATGGAAACA
ATCAATGTAGTTATAAATGATCTCAATTCTGATATCAAACAGATGAATGATGAGGAAGATGAGACTCCAAACATGTCTGAAGTAAGAACTACGAGTACTGTGGAA
GAGTCTAAAGCTGATAATTCATCTGACAGTCCAGAAAAAAGTTTGAAAAAATCATCAGAAGAAATTATCAATAAAAAATCTAAACTAATTTCGTCAGCTCATGTA
AAGAAAAATCATCCAGCAAGTTCTATCATAGGTGATCCATCAGCTGGGATGCAAACCAGAAGGAAAGATAAGATTGACTATTTGAAGATGGTTGCTGACTTGTGC
TATATTTCCAGTATTGAACATTCGACTGTTGATCACTCTGCTTTCAAGGATGAGTATTGGTTAAATGCAATGCAAGAGGAGCTACTGAAATTACGAAACAATGTC
TGGATGTTAGTCTTAAAGCCAGTAGGTGTAAACGTTATTGGCACCAAATGGATATTTAAAAATAAGACTGATGAAATTGGATGTGTGACGAAAAATAAAGCCAGA
TTAGTAGCTCAAGGGTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCTGTTGCTCGACTTGAAGCCATTCGACTTTTACTGGGCATATCATGT
ATACAGAAATTTAAATTGTATCAGATGGATGTAAAGAGTGATTTCTTAAATGGGTACTTGAATGAGGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTAGATTTT
GAGCACCCGAAGCATGTGTATAAGCTCAACAAAGCCTTATATGGACTAAAGCAAGCTCCGGGAGCTTGGTATGACCGACTAACTGTGTACTTGAGAGGTAGAGGA
TATTCCAGAGGAGAAATTGACAAGACCTTGTTCATAGACAGTAAATCTGACCAACTGTTGGTGGCTCAAATTTATGTTGATGACATCATTTTTGGAGGATTTCCT
CAGGGTCTAGTAAATAATTTCATTAATATTATGTAG
mRNA sequence
ATGGCTCGTGTTATGATACATACCAAAAATGTACCTTTATGTTTTTGGGCAGAAGCTATAAATACTGTCTGTCACATTCATAACAGAGTAACTATTAGAACTGGA
ACGACTGTTACTCTTTATGAACTTTGGAAAGAGAGAAAACCAAATGTCAAATACTTCCATGTGTTTGGAAGTACATGTTATATCTTAGCTGACCGAGAATACCGT
CAGAAATGGGATGCAAGGTCAGAACAAGGAATCTTTCTCGGGTACTCTCAAAACAATTGGGCCTATAGAGTCTACAATAACAGATCCGACAGTGTTATGGAAACA
ATCAATGTAGTTATAAATGATCTCAATTCTGATATCAAACAGATGAATGATGAGGAAGATGAGACTCCAAACATGTCTGAAGTAAGAACTACGAGTACTGTGGAA
GAGTCTAAAGCTGATAATTCATCTGACAGTCCAGAAAAAAGTTTGAAAAAATCATCAGAAGAAATTATCAATAAAAAATCTAAACTAATTTCGTCAGCTCATGTA
AAGAAAAATCATCCAGCAAGTTCTATCATAGGTGATCCATCAGCTGGGATGCAAACCAGAAGGAAAGATAAGATTGACTATTTGAAGATGGTTGCTGACTTGTGC
TATATTTCCAGTATTGAACATTCGACTGTTGATCACTCTGCTTTCAAGGATGAGTATTGGTTAAATGCAATGCAAGAGGAGCTACTGAAATTACGAAACAATGTC
TGGATGTTAGTCTTAAAGCCAGTAGGTGTAAACGTTATTGGCACCAAATGGATATTTAAAAATAAGACTGATGAAATTGGATGTGTGACGAAAAATAAAGCCAGA
TTAGTAGCTCAAGGGTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTGCTCCTGTTGCTCGACTTGAAGCCATTCGACTTTTACTGGGCATATCATGT
ATACAGAAATTTAAATTGTATCAGATGGATGTAAAGAGTGATTTCTTAAATGGGTACTTGAATGAGGAGGTTTATGTTGCTCAACCAAAAGGTTTTGTAGATTTT
GAGCACCCGAAGCATGTGTATAAGCTCAACAAAGCCTTATATGGACTAAAGCAAGCTCCGGGAGCTTGGTATGACCGACTAACTGTGTACTTGAGAGGTAGAGGA
TATTCCAGAGGAGAAATTGACAAGACCTTGTTCATAGACAGTAAATCTGACCAACTGTTGGTGGCTCAAATTTATGTTGATGACATCATTTTTGGAGGATTTCCT
CAGGGTCTAGTAAATAATTTCATTAATATTATGTAG
Protein sequence
MARVMIHTKNVPLCFWAEAINTVCHIHNRVTIRTGTTVTLYELWKERKPNVKYFHVFGSTCYILADREYRQKWDARSEQGIFLGYSQNNWAYRVYNNRSDSVMET
INVVINDLNSDIKQMNDEEDETPNMSEVRTTSTVEESKADNSSDSPEKSLKKSSEEIINKKSKLISSAHVKKNHPASSIIGDPSAGMQTRRKDKIDYLKMVADLC
YISSIEHSTVDHSAFKDEYWLNAMQEELLKLRNNVWMLVLKPVGVNVIGTKWIFKNKTDEIGCVTKNKARLVAQGYTQVEGVDFDETFAPVARLEAIRLLLGISC
IQKFKLYQMDVKSDFLNGYLNEEVYVAQPKGFVDFEHPKHVYKLNKALYGLKQAPGAWYDRLTVYLRGRGYSRGEIDKTLFIDSKSDQLLVAQIYVDDIIFGGFP
QGLVNNFINIM
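
Consistency check (illustrative)
The genome location given for this gene can be checked against the listed sequences with a little arithmetic. The sketch below is not part of the database record. It assumes the coordinates are 1-based and inclusive, which is a common convention for such records but is not stated on this page, and the helper names parse_location and span_length are hypothetical.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end


def span_length(start: int, end: int) -> int:
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


# Location string copied from the record above.
seqid, start, end = parse_location("CMiso1.1chr02:12383348..12384643")
length = span_length(start, end)

# A span that is a single uninterrupted coding sequence should be a
# whole number of codons (the last one being the stop codon).
codons, remainder = divmod(length, 3)
print(seqid, length, codons, remainder)
```

Under that assumption the span is 1296 bp, or 432 codons with no remainder. That would correspond to a 431-residue protein plus a stop codon, which appears to agree with the CDS and protein sequences listed above and with the mRNA sequence being identical to the CDS.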