; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0012836 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0012836
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRibonuclease H
Genome locationchr01:17756953..17759773
RNA-Seq ExpressionPay0012836
SyntenyPay0012836
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]5.4e-16765.02Show/hide
Query:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------
        ++ KFD+VMLEHVPR ENKRA+ALANLA AL MPD+                     E N+ TS+LIDEEDWRQPIIEYLEHGKLPKDS HKIE      
Subjt:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------

Query:  ---------------------------------------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFE
                                                           LRRMGYYWPKM+QDSIDY KKCE CQYH NFI+QPPEPLHPT+ASW FE
Subjt:  ---------------------------------------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFE

Query:  AWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGL
        AWGL+LVGPITPKS A HSYIL ATDYF +W E I LREAKKENVA+FIRTHII+RYGIPHRIVTDNG+Q SNSM+DKLCEKF FKQYKSSMYNA ANGL
Subjt:  AWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGL

Query:  AEAFNKTLYNLLKKIVSKSKRDWQERM--------------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQA
        AEAFNKTL NLLKKIVSKSKRDWQE++                                            EGLTTEDNVKLRLQELEALDEKRLEAQQA
Subjt:  AEAFNKTLYNLLKKIVSKSKRDWQERM--------------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQA

Query:  LKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA
        L+CYQARMSK FDK+VKPRSFQVGDLVLAVRR II TRHTGNKFT KWDGPYIVKEVYTNGAYKIVDQD L+IGPINGKFLKKFYA
Subjt:  LKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA

XP_031737372.1 uncharacterized protein LOC116402244 [Cucumis sativus]5.4e-16765.02Show/hide
Query:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------
        ++ KFD+VMLEHVPR ENKRA+ALANLA AL MPD+                     E N+ TS+LIDEEDWRQPIIEYLEHGKLPKDS HKIE      
Subjt:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------

Query:  ---------------------------------------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFE
                                                           LRRMGYYWPKM+QDSIDY KKCE CQYH NFI+QPPEPLHPT+ASW FE
Subjt:  ---------------------------------------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFE

Query:  AWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGL
        AWGL+LVGPITPKS A HSYIL ATDYF +W E I LREAKKENVA+FIRTHII+RYGIPHRIVTDNG+Q SNSM+DKLCEKF FKQYKSSMYNA ANGL
Subjt:  AWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGL

Query:  AEAFNKTLYNLLKKIVSKSKRDWQERM--------------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQA
        AEAFNKTL NLLKKIVSKSKRDWQE++                                            EGLTTEDNVKLRLQELEALDEKRLEAQQA
Subjt:  AEAFNKTLYNLLKKIVSKSKRDWQERM--------------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQA

Query:  LKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA
        L+CYQARMSK FDK+VKPRSFQVGDLVLAVRR II TRHTGNKFT KWDGPYIVKEVYTNGAYKIVDQD L+IGPINGKFLKKFYA
Subjt:  LKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]2.4e-16765.23Show/hide
Query:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------
        ++ KFD+VMLEHVPR ENKRA+ALANLA AL MPD+                     E N+ TS+LIDEEDWRQPIIEYLEHGKLPKDS HKIE      
Subjt:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------

Query:  ---------------------------------------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFE
                                                           LRRMGYYWPKM+QDSIDY KKCE CQYH NFI+QPPEPLHPT+ASW FE
Subjt:  ---------------------------------------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFE

Query:  AWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGL
        AWGL+LVGPITPKS A HSYIL ATDYF KW E I LREAKKENVA+FIRTHII+RYGIPHRIVTDNG+Q SNSM+DKLCEKF FKQYKSSMYNA ANGL
Subjt:  AWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGL

Query:  AEAFNKTLYNLLKKIVSKSKRDWQERM--------------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQA
        AEAFNKTL NLLKKIVSKSKRDWQE++                                            EGLTTEDNVKLRLQELEALDEKRLEAQQA
Subjt:  AEAFNKTLYNLLKKIVSKSKRDWQERM--------------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQA

Query:  LKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA
        L+CYQARMSK FDK+VKPRSFQVGDLVLAVRR II TRHTGNKFT KWDGPYIVKEVYTNGAYKIVDQD L+IGPINGKFLKKFYA
Subjt:  LKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]5.4e-16765.02Show/hide
Query:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------
        ++ KFD+VMLEHVPR ENKRA+ALANLA AL MPD+                     E N+ TS+LIDEEDWRQPIIEYLEHGKLPKDS HKIE      
Subjt:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------

Query:  ---------------------------------------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFE
                                                           LRRMGYYWPKM+QDSIDY KKCE CQYH NFI+QPPEPLHPT+ASW FE
Subjt:  ---------------------------------------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFE

Query:  AWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGL
        AWGL+LVGPITPKS A HSYIL ATDYF +W E I LREAKKENVA+FIRTHII+RYGIPHRIVTDNG+Q SNSM+DKLCEKF FKQYKSSMYNA ANGL
Subjt:  AWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGL

Query:  AEAFNKTLYNLLKKIVSKSKRDWQERM--------------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQA
        AEAFNKTL NLLKKIVSKSKRDWQE++                                            EGLTTEDNVKLRLQELEALDEKRLEAQQA
Subjt:  AEAFNKTLYNLLKKIVSKSKRDWQERM--------------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQA

Query:  LKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA
        L+CYQARMSK FDK+VKPRSFQVGDLVLAVRR II TRHTGNKFT KWDGPYIVKEVYTNGAYKIVDQD L+IGPINGKFLKKFYA
Subjt:  LKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]5.4e-16765.02Show/hide
Query:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------
        ++ KFD+VMLEHVPR ENKRA+ALANLA AL MPD+                     E N+ TS+LIDEEDWRQPIIEYLEHGKLPKDS HKIE      
Subjt:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------

Query:  ---------------------------------------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFE
                                                           LRRMGYYWPKM+QDSIDY KKCE CQYH NFI+QPPEPLHPT+ASW FE
Subjt:  ---------------------------------------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFE

Query:  AWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGL
        AWGL+LVGPITPKS A HSYIL ATDYF +W E I LREAKKENVA+FIRTHII+RYGIPHRIVTDNG+Q SNSM+DKLCEKF FKQYKSSMYNA ANGL
Subjt:  AWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGL

Query:  AEAFNKTLYNLLKKIVSKSKRDWQERM--------------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQA
        AEAFNKTL NLLKKIVSKSKRDWQE++                                            EGLTTEDNVKLRLQELEALDEKRLEAQQA
Subjt:  AEAFNKTLYNLLKKIVSKSKRDWQERM--------------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQA

Query:  LKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA
        L+CYQARMSK FDK+VKPRSFQVGDLVLAVRR II TRHTGNKFT KWDGPYIVKEVYTNGAYKIVDQD L+IGPINGKFLKKFYA
Subjt:  LKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA

TrEMBL top hitse value%identityAlignment
A0A5A7SPV8 Ribonuclease H9.9e-16765.02Show/hide
Query:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------
        ++ +FDSVML+HVPRTENKRA+ALANLA ALMMPDN                     EAN+T SHLI+EEDW QPIIEYLEHGKLPKDS HK E      
Subjt:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------

Query:  ---------------------------------------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFE
                                                           LRRMGYYWPKMVQDS+DY KKCEACQYH NFI+QP EPLHP++ASWLFE
Subjt:  ---------------------------------------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFE

Query:  AWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGL
        AWGL+LVGPITPKS A HSYIL AT+YF KW E IPLREAKKENV NFIRTHII+RYGIPHRIVTDNGRQ SNSMIDKLCEKF F+QYKSSMYNA ANGL
Subjt:  AWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGL

Query:  AEAFNKTLYNLLKKIVSKSKRDWQERM--------------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQA
        AEAFNKTL NLLKKIVSKSKRDWQER+                                            EGLTTEDNVKLRLQELE LDEKRLEAQQ 
Subjt:  AEAFNKTLYNLLKKIVSKSKRDWQERM--------------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQA

Query:  LKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA
        L+CYQARMSK FDK+VKPRSFQVGDLVLAVRR II TRH GNKFT KWDGPYIVKEVYTNGAYKIVD+D LKIG INGKFLKKFYA
Subjt:  LKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA

A0A5A7T485 Reverse transcriptase3.1e-16071.63Show/hide
Query:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDNEANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE---------------------------
        ++  FDSVMLEHVPR ENKRA+ALANLA ALMMPDNE N+TTSHLIDEED RQ IIEYLEHGKLPKDS HK E                           
Subjt:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDNEANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE---------------------------

Query:  LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRT
        LRRM YYWPKMVQDS+DY KKCEACQYH NFI+QP EPLHPTMASW FEAWGL+LVGPITPKS A H YIL ATDYF KW E IPLREAKKENVANFIRT
Subjt:  LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRT

Query:  HIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKRDWQERM------------------------
        HII+RYGIPHRIVTDNGRQ SNSMIDKLCEKF FKQYKSSMYNA ANGLAE F+KTL NLLKKIV KSKRDWQER+                        
Subjt:  HIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKRDWQERM------------------------

Query:  --------------------EGLTTEDNVKLRLQELEALDEKRLEAQQALKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGP
                            E LT EDNVKLRLQELEALDEKRLEAQQAL+CYQARMSK FDK+VKPRSFQVGDLVLAVRR I+ TRHTGNKFT KWDGP
Subjt:  --------------------EGLTTEDNVKLRLQELEALDEKRLEAQQALKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGP

Query:  YIVKEVYTNGAYKIVD
        YIVKEVY NGAYKIVD
Subjt:  YIVKEVYTNGAYKIVD

A0A5A7T485 Reverse transcriptase7.2e-1662.2Show/hide
Query:  MVDATTGHEALSFMDGS-------------------TLKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKFDSVMLEHVP
        MVDATTGHEALSFMDGS                   T K IYCYKVMPFGLKN GATYQRAMQ VFDDMLHK+    L   P
Subjt:  MVDATTGHEALSFMDGS-------------------TLKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKFDSVMLEHVP

A0A5A7T485 Reverse transcriptase3.1e-16071.63Show/hide
Query:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDNEANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE---------------------------
        ++  FDSVMLEHVPR ENKRA+ALANLA ALMMPDNE N+TTSHLIDEED RQ IIEYLEHGKLPKDS HK E                           
Subjt:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDNEANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE---------------------------

Query:  LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRT
        LRRM YYWPKMVQDS+DY KKCEACQYH NFI+QP EPLHPTMASW FEAWGL+LVGPITPKS A H YIL ATDYF KW E IPLREAKKENVANFIRT
Subjt:  LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRT

Query:  HIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKRDWQERM------------------------
        HII+RYGIPHRIVTDNGRQ SNSMIDKLCEKF FKQYKSSMYNA ANGLAE F+KTL NLLKKIV KSKRDWQER+                        
Subjt:  HIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKRDWQERM------------------------

Query:  --------------------EGLTTEDNVKLRLQELEALDEKRLEAQQALKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGP
                            E LT EDNVKLRLQELEALDEKRLEAQQAL+CYQARMSK FDK+VKPRSFQVGDLVLAVRR I+ TRHTGNKFT KWDGP
Subjt:  --------------------EGLTTEDNVKLRLQELEALDEKRLEAQQALKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGP

Query:  YIVKEVYTNGAYKIVD
        YIVKEVY NGAYKIVD
Subjt:  YIVKEVYTNGAYKIVD

A0A5A7U8J5 Ribonuclease H2.9e-16671.76Show/hide
Query:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIELRRMG-
        ++ +FDSVMLEHVPRTENKRA+ALANLA ALMMPDN                     EAN+TTSHLI+EEDWRQPIIEYL+HGKL KDS HK E+RR   
Subjt:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIELRRMG-

Query:  ---YYWPKMV-------------------------QDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYILTATDYF
           YY   +                          +DS+DY KKCEACQYH NFI+QPPEPLHP +ASW FEAW L+LVGPITPKS A HSYIL ATDYF
Subjt:  ---YYWPKMV-------------------------QDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYILTATDYF

Query:  LKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKRDWQERM-
         KW E IPLRE KK+NVANFIRTHII+RYGIPHRIVTDNGRQ SNSMIDKLCEKF FKQYKSSMYNA ANGLAEAFNKTL NLLKKIVSKSKRDWQER+ 
Subjt:  LKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKRDWQERM-

Query:  -----------------EGLTTEDNVKLRLQELEALDEKRLEAQQALKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIV
                         EGLT E NVKLRLQELEALDEKRLEAQQAL+CYQARMSK FDK+VKPRSFQVGDLVLAVRR II TRHTGNKFT KWDGPYIV
Subjt:  -----------------EGLTTEDNVKLRLQELEALDEKRLEAQQALKCYQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIV

Query:  KEVYTNGAYKIVDQDELKIGPINGKFLKKFYA
        KEVYTNGAYKIVD+D LKIGPINGKFLKKFYA
Subjt:  KEVYTNGAYKIVDQDELKIGPINGKFLKKFYA

A0A5D3BV77 Reverse transcriptase7.2e-1662.2Show/hide
Query:  MVDATTGHEALSFMDGS-------------------TLKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKFDSVMLEHVP
        MVDATTGHEALSFMDGS                   T K IYCYKVMPFGLKN GATYQRAMQ VFDDMLHK+    L   P
Subjt:  MVDATTGHEALSFMDGS-------------------TLKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKFDSVMLEHVP

A0A5D3C8N8 Ribonuclease H6.2e-16167.54Show/hide
Query:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------
        ++ +FDSVMLEHVPR ENKRA+ L NLA ALMM DN                     E NVTTSHLID+EDWRQPIIEYLEH KL KDS HK E      
Subjt:  MLHKFDSVMLEHVPRTENKRANALANLAPALMMPDN---------------------EANVTTSHLIDEEDWRQPIIEYLEHGKLPKDSGHKIE------

Query:  ---------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYILTATDYFLK
                             LRRM YYWPKMVQDS+DY KK +A QYH NFI+QPPEP HPT+ASW FEAWGL+L GPITPKS   HSYIL AT YF K
Subjt:  ---------------------LRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYILTATDYFLK

Query:  WDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKRDWQERM---
        W E IPLREAKKENVANFIRTHII+RYGIPHRI+TDNGRQ SNSMIDKLCEKF FKQYKSSMYNA ANGLAEAFNKTL NLLKKIVSK  RDWQER+   
Subjt:  WDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKRDWQERM---

Query:  -----------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQALKCYQARMSKVFDKYVKPRSFQVGDLVLAV
                                                 EGLTTEDNVKLRLQELEALDEKRLEAQQALKCYQARMSK FDK+VKPRSFQV DLVLAV
Subjt:  -----------------------------------------EGLTTEDNVKLRLQELEALDEKRLEAQQALKCYQARMSKVFDKYVKPRSFQVGDLVLAV

Query:  RRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA
        RR II TRHTGNKFT KWDGPYIVK VY NGAYKIVD+D LKIGPINGKFLKKFYA
Subjt:  RRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA

SwissProt top hitse value%identityAlignment
A4FUB7 Gypsy retrotransposon integrase-like protein 18.7e-1122.58Show/hide
Query:  IELRRMGYYWPKMVQDSIDYEKKCEACQYHVN-FIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANF
        + L    YYW  +  D   +   C+ CQ   N  I  P + L      W      ++L+GP    S   H Y +  TD F KW  ++PL +     ++  
Subjt:  IELRRMGYYWPKMVQDSIDYEKKCEACQYHVN-FIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANF

Query:  IRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKRDWQERMEGLT-----------------
        I  +I   YG P +I+ D   +  + +  +LCE F  KQ   S  +   N  AE+   T+   L K       DW + +  ++                 
Subjt:  IRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKRDWQERMEGLT-----------------

Query:  ------------------TEDNVKLRLQELEALDE--KRLE-------AQQALKCYQARMSKVF----DKYVKPRSFQVGDLVLAVRRSIIITRHTGNKF
                            DN  +  + L+A+ E  K +E         +   C++   SK+      K   P   +VG  VL  R++         +F
Subjt:  ------------------TEDNVKLRLQELEALDE--KRLE-------AQQALKCYQARMSKVF----DKYVKPRSFQVGDLVLAVRRSIIITRHTGNKF

Query:  TSKWDGPYIVKEVYTNGAYKIVDQDELKI-GPINGKFLKKF
         S+W GP ++  +  NG   + D    ++  PI    LK +
Subjt:  TSKWDGPYIVKEVYTNGAYKIVDQDELKI-GPINGKFLKKF

P03360 Gag-Pol polyprotein (Fragment)1.8e-0823.67Show/hide
Query:  EAWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANG
        E W ++    IT K    + Y+L   D F  W E  P +    + V   +   II R+G+P +I +DNG      +  +LCE           Y   ++G
Subjt:  EAWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANG

Query:  LAEAFNKTLYNLLKKIVSKSKRDWQERM-------------EGLTTEDNV---------------------KLRLQELEALDEKRLEAQQALKCYQARMS
          E  N+TL   + K+  ++  DW   +             EGL+  + +                     +  L+ L+AL   R  A+  L+    +  
Subjt:  LAEAFNKTLYNLLKKIVSKSKRDWQERM-------------EGLTTEDNV---------------------KLRLQELEALDEKRLEAQQALKCYQARMS

Query:  KVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIV
           D   +   FQ GDLV        + +H   +   +WDGPY V
Subjt:  KVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIV

Q66H30 Gypsy retrotransposon integrase-like protein 11.6e-0922.79Show/hide
Query:  LIDEEDWRQPIIEYLEHGK-LPKDSGHKIELRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYI
        ++ EE+ ++ + E  E+G  +       + L    YYW  +  D   +   C+ CQ   + +   P+  H ++    +    ++L+GP    S   H Y 
Subjt:  LIDEEDWRQPIIEYLEHGK-LPKDSGHKIELRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYI

Query:  LTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKR
        +  TD F KW  ++PL +     ++  I  +I   YG P +I+ D   +  + +  +L   F  K+   S  +   N  +E+   T+   L K  ++   
Subjt:  LTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKR

Query:  DWQERMEGLTTEDNV
         W E +  L+   NV
Subjt:  DWQERMEGLTTEDNV

Q8K259 Gypsy retrotransposon integrase-like protein 12.5e-1024.65Show/hide
Query:  LIDEEDWRQPIIEYLEHGK-LPKDSGHKIELRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYI
        ++ EE+ ++ + E  E+G  +       + L   GYYW  +  D   +   C+ CQ   N +   P+  H  M    +    ++L+GP    S   H Y 
Subjt:  LIDEEDWRQPIIEYLEHGK-LPKDSGHKIELRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYI

Query:  LTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKR
        +  TD F KW  ++PL +     ++  I  +I   YG P +I+ D   +    +  +L   F  K+   S  +   N  AE    T+   L K  +    
Subjt:  LTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKR

Query:  DWQERMEGLTTEDNV
         W E +  L+   NV
Subjt:  DWQERMEGLTTEDNV

Q9NXP7 Gypsy retrotransposon integrase-like protein 12.4e-0824.43Show/hide
Query:  LIDEEDWRQPIIEYLEHGKLPKDSG------HKIELRRMGYYWPKMVQDSIDYEKKCEACQYHVN-FIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSL
        ++ EE+ ++ + E  E+     DSG        + L    YYW  +  D   +   C+ CQ   N  I  P + L      W      ++L+GP    S 
Subjt:  LIDEEDWRQPIIEYLEHGKLPKDSG------HKIELRRMGYYWPKMVQDSIDYEKKCEACQYHVN-FIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSL

Query:  AEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKI
          H Y +  TD F KW  ++PL +     V+  I  +I   YG P +I+ D   +    +  +L   F  KQ   S  +   N   E+   T+   L K 
Subjt:  AEHSYILTATDYFLKWDEVIPLREAKKENVANFIRTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKI

Query:  VSKSKRDWQERMEGLTTEDNV
         +    +W + +  ++   NV
Subjt:  VSKSKRDWQERMEGLTTEDNV

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTGATGCAACTACTGGACACGAGGCGCTGTCCTTTATGGATGGGTCGACTCTAAAGGGAATATATTGTTACAAGGTGATGCCCTTTGGATTGAAAAATGTTGGTGC
CACTTATCAACGTGCTATGCAAAAAGTGTTTGATGATATGCTACATAAGTTTGACAGTGTAATGTTGGAGCATGTCCCTAGAACAGAAAACAAGAGAGCAAACGCATTGG
CAAATTTGGCCCCTGCCTTGATGATGCCGGATAATGAAGCGAACGTAACGACATCCCATTTGATTGATGAAGAAGATTGGCGTCAACCCATCATAGAGTATCTTGAGCAT
GGAAAGCTTCCAAAGGATTCTGGTCATAAAATTGAGTTGAGAAGAATGGGCTATTATTGGCCTAAGATGGTTCAAGATTCAATAGACTATGAAAAGAAGTGTGAAGCTTG
TCAATACCATGTAAACTTCATATATCAACCTCCAGAGCCTCTACATCCAACCATGGCTTCTTGGTTGTTTGAGGCTTGGGGACTTAATCTCGTTGGCCCTATTACACCGA
AATCATTAGCAGAACATTCTTATATCCTTACAGCAACAGATTATTTCTTAAAGTGGGATGAGGTCATTCCCTTGAGAGAGGCCAAGAAAGAGAACGTGGCAAACTTCATT
CGTACCCATATTATCCATCGATACGGTATTCCTCACCGAATTGTGACAGATAATGGAAGACAACTCTCCAATAGCATGATAGATAAATTATGTGAAAAATTCATGTTCAA
GCAATACAAGTCATCTATGTATAACGCAGTTGCAAATGGCCTAGCAGAGGCATTCAATAAAACGTTATACAATCTTCTGAAGAAAATTGTCTCCAAGTCGAAGAGGGATT
GGCAAGAAAGAATGGAGGGGTTGACTACCGAAGACAATGTCAAGTTACGTCTTCAAGAGTTAGAAGCACTTGATGAAAAACGATTGGAAGCTCAACAAGCATTGAAATGT
TACCAAGCGAGAATGTCTAAAGTTTTTGATAAATACGTCAAACCTCGCTCCTTTCAGGTTGGTGATCTAGTACTGGCCGTAAGAAGATCAATCATCATAACAAGACATAC
AGGAAATAAGTTCACATCTAAATGGGATGGACCCTACATTGTCAAAGAAGTGTACACAAACGGCGCATACAAGATTGTTGATCAAGATGAATTAAAAATTGGCCCAATCA
ACGGTAAATTTCTTAAGAAATTTTATGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTGATGCAACTACTGGACACGAGGCGCTGTCCTTTATGGATGGGTCGACTCTAAAGGGAATATATTGTTACAAGGTGATGCCCTTTGGATTGAAAAATGTTGGTGC
CACTTATCAACGTGCTATGCAAAAAGTGTTTGATGATATGCTACATAAGTTTGACAGTGTAATGTTGGAGCATGTCCCTAGAACAGAAAACAAGAGAGCAAACGCATTGG
CAAATTTGGCCCCTGCCTTGATGATGCCGGATAATGAAGCGAACGTAACGACATCCCATTTGATTGATGAAGAAGATTGGCGTCAACCCATCATAGAGTATCTTGAGCAT
GGAAAGCTTCCAAAGGATTCTGGTCATAAAATTGAGTTGAGAAGAATGGGCTATTATTGGCCTAAGATGGTTCAAGATTCAATAGACTATGAAAAGAAGTGTGAAGCTTG
TCAATACCATGTAAACTTCATATATCAACCTCCAGAGCCTCTACATCCAACCATGGCTTCTTGGTTGTTTGAGGCTTGGGGACTTAATCTCGTTGGCCCTATTACACCGA
AATCATTAGCAGAACATTCTTATATCCTTACAGCAACAGATTATTTCTTAAAGTGGGATGAGGTCATTCCCTTGAGAGAGGCCAAGAAAGAGAACGTGGCAAACTTCATT
CGTACCCATATTATCCATCGATACGGTATTCCTCACCGAATTGTGACAGATAATGGAAGACAACTCTCCAATAGCATGATAGATAAATTATGTGAAAAATTCATGTTCAA
GCAATACAAGTCATCTATGTATAACGCAGTTGCAAATGGCCTAGCAGAGGCATTCAATAAAACGTTATACAATCTTCTGAAGAAAATTGTCTCCAAGTCGAAGAGGGATT
GGCAAGAAAGAATGGAGGGGTTGACTACCGAAGACAATGTCAAGTTACGTCTTCAAGAGTTAGAAGCACTTGATGAAAAACGATTGGAAGCTCAACAAGCATTGAAATGT
TACCAAGCGAGAATGTCTAAAGTTTTTGATAAATACGTCAAACCTCGCTCCTTTCAGGTTGGTGATCTAGTACTGGCCGTAAGAAGATCAATCATCATAACAAGACATAC
AGGAAATAAGTTCACATCTAAATGGGATGGACCCTACATTGTCAAAGAAGTGTACACAAACGGCGCATACAAGATTGTTGATCAAGATGAATTAAAAATTGGCCCAATCA
ACGGTAAATTTCTTAAGAAATTTTATGCTTAA
Protein sequenceShow/hide protein sequence
MVDATTGHEALSFMDGSTLKGIYCYKVMPFGLKNVGATYQRAMQKVFDDMLHKFDSVMLEHVPRTENKRANALANLAPALMMPDNEANVTTSHLIDEEDWRQPIIEYLEH
GKLPKDSGHKIELRRMGYYWPKMVQDSIDYEKKCEACQYHVNFIYQPPEPLHPTMASWLFEAWGLNLVGPITPKSLAEHSYILTATDYFLKWDEVIPLREAKKENVANFI
RTHIIHRYGIPHRIVTDNGRQLSNSMIDKLCEKFMFKQYKSSMYNAVANGLAEAFNKTLYNLLKKIVSKSKRDWQERMEGLTTEDNVKLRLQELEALDEKRLEAQQALKC
YQARMSKVFDKYVKPRSFQVGDLVLAVRRSIIITRHTGNKFTSKWDGPYIVKEVYTNGAYKIVDQDELKIGPINGKFLKKFYA