; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0225351 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0225351
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr08:16312731..16313912
RNA-Seq ExpressionCmc08g0225351
SyntenyCmc08g0225351
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0036855.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.5e-14565.39Show/hide
Query:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC
        +R L+K+I+NEA+V I  LD+NGKFFCG+CQ+GKQT+++H+ LKECYT  VL+LLH++LM PMQTESLGGK+YVLVVVDDYS++TWV FLK K DTV++C
Subjt:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC

Query:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR
         +LCL LQREK +KI RIRSD+GKEFDNE  N+ C  +GIHHEF+APITPQQNGVVERKNR LQEMA VMIHAKN PL FWAE VNTACHIH RVT R  
Subjt:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR

Query:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEART
        TT+TLYE WK RKPNVKYFH+FGS CYILADREY +KWD +S QGIFL YSQNSRAYRVFN KS +VME INVV+ND +S + Q   E+DET    +   
Subjt:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEART

Query:  TSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY
        T   E+ K D   +    N    ++E+I  ++ +   AHVKKNHP+SSIIGD SAG+ T+RKEK+ Y+ M+ DLCY S I+P +V++ALKDEY
Subjt:  TSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY

KAA0042995.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.8e-16778.37Show/hide
Query:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC
        MRGLEK+IKN+A+V I +LDVNG FFC DCQIGKQTRSTHKSLKECYTNRVL+LLHMDLM PMQT+SLG                      GKTDTVEIC
Subjt:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC

Query:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR
        KNLCLKLQRE+ KKITRIRSD+GKEFDNE FNSFCLL+G HHEFSAPITPQQNGVVERKN+TLQEMA VMIHAKNLPLCF+AEAVNTACHIHNRVTIRT 
Subjt:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR

Query:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEART
        TT+TLYE WKERK NVKYFHVFGS CYILADREY +KWDARSEQGIFL YSQ SRAYRV+NN+S SVMETIN  +NDLDS IK M DEEDET NMSE RT
Subjt:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEART

Query:  TSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY
        TSTVE SKADN SD  GK+L+K SEEII KK ELI  AHV+KNHP  SIIGDPSAGMQT+RK+KI YL MV +LCY STI+PSTVDSALK+EY
Subjt:  TSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY

KAA0054354.1 gag-pol polyprotein [Cucumis melo var. makuwa]9.7e-14285.28Show/hide
Query:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC
        MRGLEKIIKN+A+V I DL+VNG FFCG         STHKS K CYTNRVL+LLHMDLMRPM+TES GGKRYVLVVV DYSRYTWV FL+GKTDTVEIC
Subjt:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC

Query:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR
        KNLCLKLQREK KKITRIRSD+GKEFDNE FNSFCLL+GI HEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLP+CFWAEAVN ACHIHNRVTIRT 
Subjt:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR

Query:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEAR
         TVTLYE WKERKPNVKYFHVFGS CYILADREYRQKWDARSEQGIFLGYSQNS AYRVFNN+S +V+ETINVV+NDLDS +KQM DEED+TSNMSEAR
Subjt:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEAR

KAA0060126.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.4e-12459.29Show/hide
Query:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC
        +R L+K+++N+A+V I  LD+N KFFC    +G                                   GG++YVLVVVDDYSR+TWV FLKGK DT ++C
Subjt:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC

Query:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR
         NLCL LQREK +KI RIR ++G EF+NE  N+FC  +GIHHEF+APITPQQNGVVERKNRTLQEMA VMIHAKNLPL FWAEAVNTACHIH+RVT R+ 
Subjt:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR

Query:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEART
        TTVTLYE WK RKPN+KYFH+FGSICYILADR+Y +KWD +S+Q IFLGYSQNSRAYRVFN KS +VMETINVV+ND +S I Q   E DET    E  +
Subjt:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEART

Query:  TSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY
        T   E+SK ++  D   K     ++E+I  ++ L+  AHVKKNHP SS+IGDPSAG+ T+RKEK+ Y  M+ DLCY S I+P++V++ALKDEY
Subjt:  TSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY

TYK07876.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.7e-11768.02Show/hide
Query:  RVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPIT
        RVL+LLHMDLM PMQTESLGGKRYVLVV DDYSRYTWV FLKG+TD VEICKNLCLKLQRE                           KGIHHEFSAP+T
Subjt:  RVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPIT

Query:  PQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLG
        PQQNGVVERK RTLQEMA VMIHAKNLPLCFWA+ V+TACHIHNRVTIRT TTVTLY+ WKERKPNVKYFHVFGS CYILADREYRQKWDARSEQGIFLG
Subjt:  PQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLG

Query:  YSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEARTTSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSI
        YSQNSRAYRVFNN+S SVM+TINVV+  LDSTIKQM DEED+T NMSEARTTS                                               
Subjt:  YSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEARTTSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSI

Query:  IGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY
          DPS  MQT+RK+KI YL MV DLCYTSTI+PSTVDSA+KDEY
Subjt:  IGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY

TrEMBL top hitse value%identityAlignment
A0A5A7TNK7 Gag-pol polyprotein8.5e-16878.37Show/hide
Query:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC
        MRGLEK+IKN+A+V I +LDVNG FFC DCQIGKQTRSTHKSLKECYTNRVL+LLHMDLM PMQT+SLG                      GKTDTVEIC
Subjt:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC

Query:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR
        KNLCLKLQRE+ KKITRIRSD+GKEFDNE FNSFCLL+G HHEFSAPITPQQNGVVERKN+TLQEMA VMIHAKNLPLCF+AEAVNTACHIHNRVTIRT 
Subjt:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR

Query:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEART
        TT+TLYE WKERK NVKYFHVFGS CYILADREY +KWDARSEQGIFL YSQ SRAYRV+NN+S SVMETIN  +NDLDS IK M DEEDET NMSE RT
Subjt:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEART

Query:  TSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY
        TSTVE SKADN SD  GK+L+K SEEII KK ELI  AHV+KNHP  SIIGDPSAGMQT+RK+KI YL MV +LCY STI+PSTVDSALK+EY
Subjt:  TSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY

A0A5A7V0X1 Gag-pol polyprotein2.6e-12459.29Show/hide
Query:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC
        +R L+K+++N+A+V I  LD+N KFFC    +G                                   GG++YVLVVVDDYSR+TWV FLKGK DT ++C
Subjt:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC

Query:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR
         NLCL LQREK +KI RIR ++G EF+NE  N+FC  +GIHHEF+APITPQQNGVVERKNRTLQEMA VMIHAKNLPL FWAEAVNTACHIH+RVT R+ 
Subjt:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR

Query:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEART
        TTVTLYE WK RKPN+KYFH+FGSICYILADR+Y +KWD +S+Q IFLGYSQNSRAYRVFN KS +VMETINVV+ND +S I Q   E DET    E  +
Subjt:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEART

Query:  TSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY
        T   E+SK ++  D   K     ++E+I  ++ L+  AHVKKNHP SS+IGDPSAG+ T+RKEK+ Y  M+ DLCY S I+P++V++ALKDEY
Subjt:  TSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY

A0A5D3BA69 Gag-pol polyprotein2.7e-14565.39Show/hide
Query:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC
        +R L+K+I+NEA+V I  LD+NGKFFCG+CQ+GKQT+++H+ LKECYT  VL+LLH++LM PMQTESLGGK+YVLVVVDDYS++TWV FLK K DTV++C
Subjt:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC

Query:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR
         +LCL LQREK +KI RIRSD+GKEFDNE  N+ C  +GIHHEF+APITPQQNGVVERKNR LQEMA VMIHAKN PL FWAE VNTACHIH RVT R  
Subjt:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR

Query:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEART
        TT+TLYE WK RKPNVKYFH+FGS CYILADREY +KWD +S QGIFL YSQNSRAYRVFN KS +VME INVV+ND +S + Q   E+DET    +   
Subjt:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEART

Query:  TSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY
        T   E+ K D   +    N    ++E+I  ++ +   AHVKKNHP+SSIIGD SAG+ T+RKEK+ Y+ M+ DLCY S I+P +V++ALKDEY
Subjt:  TSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY

A0A5D3CBZ0 Gag-pol polyprotein8.1e-11868.02Show/hide
Query:  RVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPIT
        RVL+LLHMDLM PMQTESLGGKRYVLVV DDYSRYTWV FLKG+TD VEICKNLCLKLQRE                           KGIHHEFSAP+T
Subjt:  RVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPIT

Query:  PQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLG
        PQQNGVVERK RTLQEMA VMIHAKNLPLCFWA+ V+TACHIHNRVTIRT TTVTLY+ WKERKPNVKYFHVFGS CYILADREYRQKWDARSEQGIFLG
Subjt:  PQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLG

Query:  YSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEARTTSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSI
        YSQNSRAYRVFNN+S SVM+TINVV+  LDSTIKQM DEED+T NMSEARTTS                                               
Subjt:  YSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEARTTSTVEVSKADNPSDDLGKNLEKPSEEIITKKSELITFAHVKKNHPTSSI

Query:  IGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY
          DPS  MQT+RK+KI YL MV DLCYTSTI+PSTVDSA+KDEY
Subjt:  IGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY

A0A5D3CM30 Gag-pol polyprotein4.7e-14285.28Show/hide
Query:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC
        MRGLEKIIKN+A+V I DL+VNG FFCG         STHKS K CYTNRVL+LLHMDLMRPM+TES GGKRYVLVVV DYSRYTWV FL+GKTDTVEIC
Subjt:  MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEIC

Query:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR
        KNLCLKLQREK KKITRIRSD+GKEFDNE FNSFCLL+GI HEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLP+CFWAEAVN ACHIHNRVTIRT 
Subjt:  KNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTR

Query:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEAR
         TVTLYE WKERKPNVKYFHVFGS CYILADREYRQKWDARSEQGIFLGYSQNS AYRVFNN+S +V+ETINVV+NDLDS +KQM DEED+TSNMSEAR
Subjt:  TTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEAR

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-3128.7Show/hide
Query:  CGDCQIGKQTRSTHKSLKE-CYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKE
        C  C  GKQ R   K LK+  +  R L ++H D+  P+   +L  K Y ++ VD ++ Y   + +K K+D   + ++   K +     K+  +  DNG+E
Subjt:  CGDCQIGKQTRSTHKSLKE-CYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKE

Query:  FDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIR--TRTTVTLYEFWKERKPNVKYFHVFG
        + +     FC+ KGI +  + P TPQ NGV ER  RT+ E A  M+    L   FW EAV TA ++ NR+  R    ++ T YE W  +KP +K+  VFG
Subjt:  FDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIR--TRTTVTLYEFWKERKPNVKYFHVFG

Query:  SICYILADREYRQKWDARSEQGIFLGYSQNS-RAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEARTTSTVEVSKADNPSDDLGKNLEK
        +  Y+   +  + K+D +S + IF+GY  N  + +   N K +   + +              +DE    +NM  +R      V   D+   +  KN   
Subjt:  SICYILADREYRQKWDARSEQGIFLGYSQNS-RAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEARTTSTVEVSKADNPSDDLGKNLEK

Query:  PSEEII-------TKKSELITFAHVKKNHPTSSIIGDPSAGMQTK
         S +II       +K+ + I F    K     +   D    +QT+
Subjt:  PSEEII-------TKKSELITFAHVKKNHPTSSIIGDPSAGMQTK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.2e-3930.86Show/hide
Query:  CGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEF
        C  C  GKQ R + ++  E   N +L L++ D+  PM+ ES+GG +Y +  +DD SR  WV+ LK K    ++ +     ++RE  +K+ R+RSDNG E+
Subjt:  CGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEF

Query:  DNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYEFWKERKPNVKYFHVFGSIC
         +  F  +C   GI HE + P TPQ NGV ER NRT+ E    M+    LP  FW EAV TAC++ NR              W  ++ +  +  VFG   
Subjt:  DNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYEFWKERKPNVKYFHVFGSIC

Query:  YILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEARTTSTVEVS----KADNPSDDLGKNLEK
        +    +E R K D +S   IF+GY      YR+++     V+ + +VV    +S ++   D  ++  N       +    S     A++ +D++ +  E+
Subjt:  YILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEARTTSTVEVS----KADNPSDDLGKNLEK

Query:  PSEEIITKKSELITFAHVKKNHPT
        P E  + ++ E +     +  HPT
Subjt:  PSEEIITKKSELITFAHVKKNHPT

Q87040 Pro-Pol polyprotein7.3e-0735.05Show/hide
Query:  GKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEM
        G  YVLV+VD  + +TW++  K  + +  + K+L +       K    I SD G  F +  F  +   +GIH EFS P  PQ +G VERKN  ++ +
Subjt:  GKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEM

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE16.4e-1927.27Show/hide
Query:  LEKIIKNEAIVRISDLDVNGKFF-CGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKN
        L  +I N +   +S L+ + KF  C DC I K  +    S     + R L+ ++ D+       S    RY ++ VD ++RYTW++ LK K+   E    
Subjt:  LEKIIKNEAIVRISDLDVNGKFF-CGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKN

Query:  LCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTT
            L+   + +I    SDNG EF       +    GI H  S P TP+ NG+ ERK+R + E    ++   ++P  +W  A   A ++ NR+       
Subjt:  LCLKLQREKEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTT

Query:  VTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAY--------RVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSN
         + ++      PN     VFG  CY       + K D +S Q +FLGYS    AY        R++ ++ V   E      N L +T+  + ++  E+S 
Subjt:  VTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAY--------RVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSN

Query:  MSEARTTSTVEVSKADNPS
        +    TT          PS
Subjt:  MSEARTTSTVEVSKADNPS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.8e-2130.38Show/hide
Query:  CGDCQIGKQTRSTHK---SLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKT---DTVEICKNLCLKLQREKEKKITRIRS
        C DC I K    +HK   S     +++ L+ ++ D+       S+   RY ++ VD ++RYTW++ LK K+   DT  I K+L   ++   + +I  + S
Subjt:  CGDCQIGKQTRSTHK---SLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKT---DTVEICKNLCLKLQREKEKKITRIRS

Query:  DNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYEFWKERKPNVKYFH
        DNG EF       +    GI H  S P TP+ NG+ ERK+R + EM   ++   ++P  +W  A + A ++ NR+        + ++    + PN +   
Subjt:  DNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYEFWKERKPNVKYFH

Query:  VFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAY
        VFG  CY       R K + +S+Q  F+GYS    AY
Subjt:  VFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAY

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.7e-0429.82Show/hide
Query:  NRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRV
        NRT+ E    M+    LP  F A+A NTA HI N+            E W +  P   Y   FG + YI  D     K   R+++G      +   +Y +
Subjt:  NRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYEFWKERKPNVKYFHVFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRV

Query:  FNNKSVSVMETINV
          N+ VS++ TI +
Subjt:  FNNKSVSVMETINV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGGGCTTGGAAAAAATTATTAAAAATGAAGCAATTGTGAGAATTTCTGATTTAGATGTAAATGGAAAATTCTTCTGTGGAGACTGTCAAATTGGCAAACAAACAAG
GTCTACTCACAAAAGTCTGAAAGAATGTTATACCAATAGAGTCTTGAAACTGTTACATATGGATCTCATGAGACCAATGCAAACAGAAAGTCTGGGAGGAAAGAGGTACG
TGTTGGTTGTTGTTGATGATTACTCAAGATATACTTGGGTTTTCTTTCTCAAAGGAAAAACAGATACTGTTGAAATATGTAAAAATCTGTGTTTGAAGCTACAACGTGAA
AAAGAGAAAAAGATAACGAGAATCCGAAGTGATAATGGTAAGGAGTTTGATAATGAGTGCTTTAACAGTTTTTGTCTGTTAAAAGGAATACACCATGAATTTTCTGCACC
TATAACTCCTCAACAAAATGGTGTAGTAGAAAGAAAAAACAGGACGTTACAAGAGATGGCATGTGTTATGATACATGCAAAAAATTTACCTCTATGTTTTTGGGCAGAAG
CTGTAAATACTGCCTGTCACATTCATAACAGGGTAACTATTAGGACTAGAACGACTGTTACTCTTTATGAATTTTGGAAAGAGAGAAAGCCAAATGTTAAATACTTCCAT
GTGTTTGGAAGTATATGTTATATCTTAGCTGACAGGGAATACCGTCAGAAATGGGATGCTAGATCAGAACAAGGAATCTTTCTCGGGTACTCTCAGAACAGTCGGGCCTA
TAGAGTCTTCAATAACAAATCTGTGAGTGTTATGGAAACGATCAATGTAGTTTTAAATGATCTCGATTCAACCATCAAACAGATGATAGATGAGGAAGATGAGACTTCAA
ACATGTCTGAAGCTAGAACTACGAGTACTGTAGAAGTTTCTAAAGCTGATAACCCATCAGATGATCTAGGTAAAAATTTGGAAAAACCATCAGAAGAAATTATCACTAAA
AAATCAGAACTAATTACATTTGCTCATGTGAAGAAAAATCATCCAACAAGCTCTATAATAGGTGATCCGTCAGCTGGGATGCAGACCAAAAGGAAAGAAAAGATTGTTTA
CTTAAATATGGTTGATGATTTATGTTATACTTCTACCATTAAACCTTCTACTGTTGACTCTGCTCTCAAGGATGAGTATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGGGCTTGGAAAAAATTATTAAAAATGAAGCAATTGTGAGAATTTCTGATTTAGATGTAAATGGAAAATTCTTCTGTGGAGACTGTCAAATTGGCAAACAAACAAG
GTCTACTCACAAAAGTCTGAAAGAATGTTATACCAATAGAGTCTTGAAACTGTTACATATGGATCTCATGAGACCAATGCAAACAGAAAGTCTGGGAGGAAAGAGGTACG
TGTTGGTTGTTGTTGATGATTACTCAAGATATACTTGGGTTTTCTTTCTCAAAGGAAAAACAGATACTGTTGAAATATGTAAAAATCTGTGTTTGAAGCTACAACGTGAA
AAAGAGAAAAAGATAACGAGAATCCGAAGTGATAATGGTAAGGAGTTTGATAATGAGTGCTTTAACAGTTTTTGTCTGTTAAAAGGAATACACCATGAATTTTCTGCACC
TATAACTCCTCAACAAAATGGTGTAGTAGAAAGAAAAAACAGGACGTTACAAGAGATGGCATGTGTTATGATACATGCAAAAAATTTACCTCTATGTTTTTGGGCAGAAG
CTGTAAATACTGCCTGTCACATTCATAACAGGGTAACTATTAGGACTAGAACGACTGTTACTCTTTATGAATTTTGGAAAGAGAGAAAGCCAAATGTTAAATACTTCCAT
GTGTTTGGAAGTATATGTTATATCTTAGCTGACAGGGAATACCGTCAGAAATGGGATGCTAGATCAGAACAAGGAATCTTTCTCGGGTACTCTCAGAACAGTCGGGCCTA
TAGAGTCTTCAATAACAAATCTGTGAGTGTTATGGAAACGATCAATGTAGTTTTAAATGATCTCGATTCAACCATCAAACAGATGATAGATGAGGAAGATGAGACTTCAA
ACATGTCTGAAGCTAGAACTACGAGTACTGTAGAAGTTTCTAAAGCTGATAACCCATCAGATGATCTAGGTAAAAATTTGGAAAAACCATCAGAAGAAATTATCACTAAA
AAATCAGAACTAATTACATTTGCTCATGTGAAGAAAAATCATCCAACAAGCTCTATAATAGGTGATCCGTCAGCTGGGATGCAGACCAAAAGGAAAGAAAAGATTGTTTA
CTTAAATATGGTTGATGATTTATGTTATACTTCTACCATTAAACCTTCTACTGTTGACTCTGCTCTCAAGGATGAGTATTGA
Protein sequenceShow/hide protein sequence
MRGLEKIIKNEAIVRISDLDVNGKFFCGDCQIGKQTRSTHKSLKECYTNRVLKLLHMDLMRPMQTESLGGKRYVLVVVDDYSRYTWVFFLKGKTDTVEICKNLCLKLQRE
KEKKITRIRSDNGKEFDNECFNSFCLLKGIHHEFSAPITPQQNGVVERKNRTLQEMACVMIHAKNLPLCFWAEAVNTACHIHNRVTIRTRTTVTLYEFWKERKPNVKYFH
VFGSICYILADREYRQKWDARSEQGIFLGYSQNSRAYRVFNNKSVSVMETINVVLNDLDSTIKQMIDEEDETSNMSEARTTSTVEVSKADNPSDDLGKNLEKPSEEIITK
KSELITFAHVKKNHPTSSIIGDPSAGMQTKRKEKIVYLNMVDDLCYTSTIKPSTVDSALKDEY