; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0007544 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0007544
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionRibonuclease H
Genome locationchr10:17855755..17858467
RNA-Seq ExpressionPay0007544
SyntenyPay0007544
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0030430 - host cell cytoplasm (cellular component)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR001584 - Integrase, catalytic core
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_031737372.1 uncharacterized protein LOC116402244 [Cucumis sativus]0.0e+0078.59Show/hide
Query:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
        MALSDEE+TAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLH+YVECY DDLVVK+KRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
Subjt:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL

Query:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY
        GFIVRHRGIEIDQSKIDAIQKM RPKSLHDLR                              E F+WDEACQNAFDSIKKYLL PPVLGAPVP +PLILY
Subjt:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY

Query:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY
        IAAQE                       RTL+GAEVNYSPIEKMCLALFFAI KLRHYMQAFTVHLVAKADPIKYVLSRPII+GRLAKW ++LQQYDIVY
Subjt:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY

Query:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE
        I QKAIKGQALAD LADHPIPSDWKLC+DLPDDEVFFTE++EPWTMYFDGA RRS AGA IVLISPEKHM PYSFAL+ELCSNNVAEYQALIIGLQ+ALE
Subjt:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE

Query:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD----------------------------------------------RSE------
        IGVSFIE+YGDSKLIINQLSLQYDVKHEDLK YF YARQLME+FD                                              R E      
Subjt:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD----------------------------------------------RSE------

Query:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK
                            +GKLPKDSRHK ++RRRAAHFIYYKGTLYRRSLEGLFLRCLGKE+S+KAL+E HAGVCGAHQSGPKLQFQLRRMGYYWPK
Subjt:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK

Query:  MVQDSMNYAKKCE-----------ASEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH
        M+QDS++Y KKCE             EPLHPT+ASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFS+WAEAI LREAKKENVA+FI THIIYRYGIPH
Subjt:  MVQDSMNYAKKCE-----------ASEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH

Query:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
        RIVTDNG+QFSNSM+DKLCEKFKFKQYKSSMYNAA NGL EAFNKTLCNLLKKIVSKSKRDWQE+IGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
Subjt:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER

Query:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG
        EIPSLRMAVQEGLTTEDNVKLRLQ+LEALDEKRLEAQQALECYQARMSKAF KHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDG YIVKEVYTNG
Subjt:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG

Query:  AYKIVDRDGLKIGPINDKFLKKFYA
        AYKIVD+DGL+IGPIN KFLKKFYA
Subjt:  AYKIVDRDGLKIGPINDKFLKKFYA

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]0.0e+0078.7Show/hide
Query:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
        MALSDEE+TAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLH+YVECY DDLVVK+KRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
Subjt:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL

Query:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY
        GFIVRHRGIEIDQSKIDAIQKM RPKSLHDLR                              E F+WDEACQNAFDSIKKYLL PPVLGAPVP +PLILY
Subjt:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY

Query:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY
        IAAQE                       RTL+GAEVNYSPIEKMCLALFFAI KLRHYMQAFTVHLVAKADPIKYVLSRPII+GRLAKW ++LQQYDIVY
Subjt:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY

Query:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE
        I QKAIKGQALAD LADHPIPSDWKLC+DLPDDEVFFTE++EPWTMYFDGA RRS AGA IVLISPEKHM PYSFAL+ELCSNNVAEYQALIIGLQ+ALE
Subjt:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE

Query:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD----------------------------------------------RSE------
        IGVSFIE+YGDSKLIINQLSLQYDVKHEDLK YF YARQLME+FD                                              R E      
Subjt:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD----------------------------------------------RSE------

Query:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK
                            +GKLPKDSRHK ++RRRAAHFIYYKGTLYRRSLEGLFLRCLGKE+S+KAL+E HAGVCGAHQSGPKLQFQLRRMGYYWPK
Subjt:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK

Query:  MVQDSMNYAKKCE-----------ASEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH
        M+QDS++Y KKCE             EPLHPT+ASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAI LREAKKENVA+FI THIIYRYGIPH
Subjt:  MVQDSMNYAKKCE-----------ASEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH

Query:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
        RIVTDNG+QFSNSM+DKLCEKFKFKQYKSSMYNAA NGL EAFNKTLCNLLKKIVSKSKRDWQE+IGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
Subjt:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER

Query:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG
        EIPSLRMAVQEGLTTEDNVKLRLQ+LEALDEKRLEAQQALECYQARMSKAF KHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDG YIVKEVYTNG
Subjt:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG

Query:  AYKIVDRDGLKIGPINDKFLKKFYA
        AYKIVD+DGL+IGPIN KFLKKFYA
Subjt:  AYKIVDRDGLKIGPINDKFLKKFYA

XP_031742032.1 uncharacterized protein LOC116404025 [Cucumis sativus]0.0e+0078.59Show/hide
Query:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
        MALSDEE+TAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLH+YVECY DDLVVK+KRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
Subjt:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL

Query:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY
        GFIVRHRGIEIDQSKIDAIQKM RPKSLHDLR                              E F+WDEACQNAFDSIKKYLL PPVLGAPVP +PLILY
Subjt:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY

Query:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY
        IAAQE                       RTL+GAEVNYSPIEKMCLALFFAI KLRHYMQAFTVHLVAKADPIKYVLSRPII+GRLAKW ++LQQYDIVY
Subjt:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY

Query:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE
        I QKAIKGQALAD LADHPIPSDWKLC+DLPDDEVFFTE++EPWTMYFDGA RRS AGA IVLISPEKHM PYSFAL+ELCSNNVAEYQALIIGLQ+ALE
Subjt:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE

Query:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD----------------------------------------------RSE------
        IGVSFIE+YGDSKLIINQLSLQYDVKHEDLK YF YARQLME+FD                                              R E      
Subjt:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD----------------------------------------------RSE------

Query:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK
                            +GKLPKDSRHK ++RRRAAHFIYYKGTLYRRSLEGLFLRCLGKE+S+KAL+E HAGVCGAHQSGPKLQFQLRRMGYYWPK
Subjt:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK

Query:  MVQDSMNYAKKCE-----------ASEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH
        M+QDS++Y KKCE             EPLHPT+ASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFS+WAEAI LREAKKENVA+FI THIIYRYGIPH
Subjt:  MVQDSMNYAKKCE-----------ASEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH

Query:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
        RIVTDNG+QFSNSM+DKLCEKFKFKQYKSSMYNAA NGL EAFNKTLCNLLKKIVSKSKRDWQE+IGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
Subjt:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER

Query:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG
        EIPSLRMAVQEGLTTEDNVKLRLQ+LEALDEKRLEAQQALECYQARMSKAF KHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDG YIVKEVYTNG
Subjt:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG

Query:  AYKIVDRDGLKIGPINDKFLKKFYA
        AYKIVD+DGL+IGPIN KFLKKFYA
Subjt:  AYKIVDRDGLKIGPINDKFLKKFYA

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]0.0e+0078.59Show/hide
Query:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
        MALSDEE+TAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLH+YVECY DDLVVK+KRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
Subjt:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL

Query:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY
        GFIVRHRGIEIDQSKIDAIQKM RPKSLHDLR                              E F+WDEACQNAFDSIKKYLL PPVLGAPVP +PLILY
Subjt:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY

Query:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY
        IAAQE                       RTL+GAEVNYSPIEKMCLALFFAI KLRHYMQAFTVHLVAKADPIKYVLSRPII+GRLAKW ++LQQYDIVY
Subjt:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY

Query:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE
        I QKAIKGQALAD LADHPIPSDWKLC+DLPDDEVFFTE++EPWTMYFDGA RRS AGA IVLISPEKHM PYSFAL+ELCSNNVAEYQALIIGLQ+ALE
Subjt:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE

Query:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD----------------------------------------------RSE------
        IGVSFIE+YGDSKLIINQLSLQYDVKHEDLK YF YARQLME+FD                                              R E      
Subjt:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD----------------------------------------------RSE------

Query:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK
                            +GKLPKDSRHK ++RRRAAHFIYYKGTLYRRSLEGLFLRCLGKE+S+KAL+E HAGVCGAHQSGPKLQFQLRRMGYYWPK
Subjt:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK

Query:  MVQDSMNYAKKCE-----------ASEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH
        M+QDS++Y KKCE             EPLHPT+ASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFS+WAEAI LREAKKENVA+FI THIIYRYGIPH
Subjt:  MVQDSMNYAKKCE-----------ASEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH

Query:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
        RIVTDNG+QFSNSM+DKLCEKFKFKQYKSSMYNAA NGL EAFNKTLCNLLKKIVSKSKRDWQE+IGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
Subjt:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER

Query:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG
        EIPSLRMAVQEGLTTEDNVKLRLQ+LEALDEKRLEAQQALECYQARMSKAF KHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDG YIVKEVYTNG
Subjt:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG

Query:  AYKIVDRDGLKIGPINDKFLKKFYA
        AYKIVD+DGL+IGPIN KFLKKFYA
Subjt:  AYKIVDRDGLKIGPINDKFLKKFYA

XP_031742888.1 uncharacterized protein LOC116404510 [Cucumis sativus]0.0e+0078.7Show/hide
Query:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
        MALSDEE+TAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLH+YVECY DDLVVK+KRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
Subjt:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL

Query:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY
        GFIVRHRGIEIDQSKIDAIQKM RPKSLHDLR                              E F+WDEACQNAFDSIKKYLL PPVLGAPVP +PLILY
Subjt:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY

Query:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY
        IAAQE                       RTL+GAEVNYSPIEKMCLALFFAI KLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKW ++LQQYDIVY
Subjt:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY

Query:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE
        I QKAIKGQALAD LADHPIPSDWKLC+DLPDDEVFFTE++EPWTMYFDGA RRS AGA IVLISPEKHM PYSFAL+ELCSNNVAEYQALIIGLQ+ALE
Subjt:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE

Query:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD----------------------------------------------RSE------
        IGVSFIE+YGDSKLIINQLSLQYDVKHEDLK YF YARQLME+FD                                              R E      
Subjt:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD----------------------------------------------RSE------

Query:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK
                            +GKLPKDSRHK ++RRRAAHFIYYKGTLYRRSLEGLFLRCLGKE+S+KAL+E HAGVCGAHQSGPKLQFQLRRMGYYWPK
Subjt:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK

Query:  MVQDSMNYAKKCE-----------ASEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH
        M+QDS++Y KKCE             EPLHPT+ASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFS+WAEAI LREAKKENVA+FI THIIYRYGIPH
Subjt:  MVQDSMNYAKKCE-----------ASEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH

Query:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
        RIVTDNG+QFSNSM+DKLCEKFKFKQYKSSMYNAA NGL EAFNKTLCNLLKKIVSKSKRDWQE+IGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
Subjt:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER

Query:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG
        EIPSLRMAVQEGLTTEDNVKLRLQ+LEALDEKRLEAQQALECYQARMSKAF KHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDG YIVKEVYTNG
Subjt:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG

Query:  AYKIVDRDGLKIGPINDKFLKKFYA
        AYKIVD+DGL+IGPIN KFLKKFYA
Subjt:  AYKIVDRDGLKIGPINDKFLKKFYA

TrEMBL top hitse value%identityAlignment
A0A5A7SPV8 Ribonuclease H0.0e+0078.59Show/hide
Query:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
        MALSDEE+TAFRTPKGIYCYKV+PFGLKNAGATYQRAMQKVFDDMLHKYVECY DDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAF VTSGKFL
Subjt:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL

Query:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY
        GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLR                              E F+WDEACQNAFDSIKKYLLNPPVLGAP+P EPLILY
Subjt:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY

Query:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY
        IAAQE                       RTLVGAEVNYSPIEKMCLALFFAI KLRHYMQAFTVHLVAK DPIKYVLSRPIISG LAKW +ILQQYDIVY
Subjt:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY

Query:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE
        ISQK IKGQALA+ LADHPIPSDWKLCE LPDDEVFFT+MVEPWTMYFDGA RRS AGA IVLIS E+HM PYSF LAELC NNVAEYQALIIGLQMALE
Subjt:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE

Query:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD--------RSE--------------------------------------------
        I VSFI+IYGDSKLIINQLSLQYDVKHEDLK YFTYARQLMERFD        R+E                                            
Subjt:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD--------RSE--------------------------------------------

Query:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK
                            +GKLPKDSRHKT+VRRRAAHFIYYKGTLYRR LEG+FLRCLG++ESIKALEEAHAGVCGAHQSG KLQFQLRRMGYYWPK
Subjt:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK

Query:  MVQDSMNYAKKCEA-----------SEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH
        MVQDSM+YAKKCEA            EPLHP++ASW FEAWGLDLVGPITPKSSAGHSYILAAT+YFSKWAEAIPLREAKKENV NFI THIIYRYGIPH
Subjt:  MVQDSMNYAKKCEA-----------SEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH

Query:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
        RIVTDNGRQFSNSMIDKLCEKFKF+QYKSSMYNAA NGL EAFNKTLCNLLKKIVSKSKRDWQERI EALWAYRTTHRT T VTPYSLVYGVE  LPLER
Subjt:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER

Query:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG
        EIPSLRM VQEGLTTEDNVKLRLQ+LE LDEKRLEAQQ LECYQARMSKAF KHVKPRSFQVGDLVLAVRRPIITTRH GNKFTPKWDG YIVKEVYTNG
Subjt:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG

Query:  AYKIVDRDGLKIGPINDKFLKKFYA
        AYKIVDRDGLKIG IN KFLKKFYA
Subjt:  AYKIVDRDGLKIGPINDKFLKKFYA

A0A5A7TZU9 Ribonuclease H0.0e+0071.68Show/hide
Query:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
        MAL DEE TAFRTPKGIYCYKVMPFGLKNAGATYQRAMQ++FDDMLHK+VECY DDLVVKSK++ DHLKDLK+V DRLRKYQLRMNPLKCAFGVTSGKFL
Subjt:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL

Query:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRREK-----------------------------FLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY
        GFIVRHRGIE+D SKIDAIQKMP PK+LH+LRR +                             F WD++CQNAFDSIKKYLLNPPVL AP   +PLILY
Subjt:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRREK-----------------------------FLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY

Query:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY
        IAAQE                       RTL GAE+NYSPIEKMCLALFFAI KLRHYMQAFT+HLVAKADP+KY+LSRP+ISGRLAKW IILQQYDIVY
Subjt:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY

Query:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE
        I QKA+KGQALAD LADHP+PS+WKLC+DLPD+EV F E +EPW M+FDGA RRS AG  IV ISPEKHM PYSF L ELCSNNVAEYQA IIGLQMA E
Subjt:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE

Query:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD--------RSE--------------------------------------------
         G+  IEI+GDSKLIINQLS QY+VKH+DLK YF+YAR+LM+RFD        RSE                                            
Subjt:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD--------RSE--------------------------------------------

Query:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK
                            +GKLP D RH+ ++RRRAA FIYYK TLYRRS EGL LRCLGKEES KALEEAH+G+CGAHQSGPKLQ+QL+RMGYYWP 
Subjt:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK

Query:  MVQDSMNYAKKCEA-----------SEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH
        M+ DSM++AK CEA            EPLHPT+ASWPFEAWGLDLVGPITPKS+AGHSYILA TDYFSKWAEA+PLREAKKEN+ NF+ THIIYRYGIPH
Subjt:  MVQDSMNYAKKCEA-----------SEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH

Query:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
        RIVTDNGRQF+N+++DKLCEKF FKQ+KSSMYNAA NGL EAFNKTLC+LLKK+VSK+KRDWQE+IGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
Subjt:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER

Query:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG
        EIPSLRMA+QEGLTTEDN +LRL++LEALDEKRLEAQQALECYQARMSKAF K V+PRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDG YIVKEV+TNG
Subjt:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG

Query:  AYKIVDRDGLKIGPINDKFLKKFYA
        AYKI+D+DGL+IGPIN +FLKKFYA
Subjt:  AYKIVDRDGLKIGPINDKFLKKFYA

A0A5A7UID6 Ribonuclease H0.0e+0070.92Show/hide
Query:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
        MAL DEE TAFRTPKGIYCYKVMPFGLKNAGATYQRAMQ++FDDMLHK++ECY DDLVVKSK++ DHLKDLK+V DRLRKYQLRMNPLKCAFGVTSGKFL
Subjt:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL

Query:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRREK-----------------------------FLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY
        GFIVRHRGIE+D SKIDAIQKMP  K+LH+LRR +                             F WD++CQNAFDSIKKYLLNPPVL AP   +PLILY
Subjt:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRREK-----------------------------FLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY

Query:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY
        IAAQE                       RTL GAE+NYSPIEKMCLALFFAI KLRHYMQ FT+HLVAKADP+KY+LSRP+ISGRLAKW IILQQYDIVY
Subjt:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY

Query:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE
        I QKA+KGQALAD LADHP+PS+WKLC+DLPD+EV F E +E W M+FDG  RRS AG  IV ISPEKHM PYSF L ELCSNNVAEYQA IIGLQMA E
Subjt:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE

Query:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD--------RSE--------------------------------------------
         G+  IEI+GDSKLIINQLS QY VKH+DLK YF+YAR+LM+RFD        RSE                                            
Subjt:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD--------RSE--------------------------------------------

Query:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK
                            +GKLP D RH+ ++RRRAA FIYYK TLYRRS EGL LRCLGKEES KALEEAH+G+CGAHQSGPKLQ+QL+RMGYYWP 
Subjt:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK

Query:  MVQDSMNYAKKCEA-----------SEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH
        M+ DSM++AK CEA            EPLHPT+ASWPFEAWGLDLV PITPKS+AGHSYILA TDYFSKWAEA+PLREAKKEN+ NF+ THIIYRYGIPH
Subjt:  MVQDSMNYAKKCEA-----------SEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH

Query:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
        RIVTDNGRQF+N+++DKLCEKF FKQYKSSMYNAA NGL EAFNKTLC+LLKK+VSK+KRDWQE+IGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
Subjt:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER

Query:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG
        EIPSLRMA+Q+GLTTEDN +LRLQ+LEALDEKRLEAQQALECYQARMSKAF K V+PRSFQVGDLVLAVRRPIITTRH GNKFTPKWDG YIVKEV+TNG
Subjt:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG

Query:  AYKIVDRDGLKIGPINDKFLKKFYA
        AYKI+D+DGL+IGPIN +FLKKF A
Subjt:  AYKIVDRDGLKIGPINDKFLKKFYA

A0A5D3BTY1 Ribonuclease H0.0e+0071.24Show/hide
Query:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
        MAL DEE TAFRTPKGIYCYKVMPFGLKNAGATYQRAMQ++FDDMLHK++ECY DDLVVKSK++ DHLKDLK+V DRLRKYQLRMNPLKCAFGVTSGKFL
Subjt:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL

Query:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRREK-----------------------------FLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY
        GFIVRHRGIE+D SKIDAIQKMP PK+LH+LRR +                             F WD++CQNAFDSIKKYLLNPPVL AP   +PLILY
Subjt:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRREK-----------------------------FLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY

Query:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY
        IAAQE                       RTL GAE+NYSPIEKMCLALFFAI KLRHYMQAFT+HLVAKADP+KY+LSRP+ISGRLAKW IILQQYDIVY
Subjt:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY

Query:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE
        I QKA+KGQALAD LADHP+PS+WKLC+DLPD+EV F E +EPW M+FDGA RRS AG  IV ISPEKHM PYSF L ELCSNNVAEYQA IIGLQMA E
Subjt:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE

Query:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD--------RSE--------------------------------------------
         G+  IEI+GDSKLIINQLS QY+VKH+DLK YF+YAR+LM+RFD        RSE                                            
Subjt:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD--------RSE--------------------------------------------

Query:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK
                            +GKLP D RH+ ++RRRAA FIYYK TLYRRS EGL LRCLGKEES KALEEAH+G+CGAHQSGPKLQ+QL+RMGYYWP 
Subjt:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK

Query:  MVQDSMNYAKKCEA-----------SEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH
        M+ DSM++AK CEA            EPLHPT+ASWPFE WGLDLVGPITPKSSAGHSYILA TDYFS+WAEA+PLREAKKEN+ NF+ T+IIYRYGIPH
Subjt:  MVQDSMNYAKKCEA-----------SEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH

Query:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
        RIVTDNGRQF+N+++DKLCEKF FKQYKSSMYNAA NGL EAFNKTLC+LLKK+VSK+KRDWQE+IGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLE+
Subjt:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER

Query:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG
        EIPSLRM++QEGLTT+DN +L LQ+LEALDEKRLEAQQALECYQARMSKAF K V+PRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDG YIVKEV+TNG
Subjt:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG

Query:  AYKIVDRDGLKIGPINDKFLKKFYA
        AYKI+D+DGL+IGPIN KFLKKFYA
Subjt:  AYKIVDRDGLKIGPINDKFLKKFYA

A0A5D3D1E5 Ribonuclease H0.0e+0071.89Show/hide
Query:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL
        MAL DEE TAFRTPKGIYCYKVMPFGLKNAGATYQRAMQ++FDDMLHK+VECY DDLVVKSK++ DHLKDLK+V DRLRKYQLRMNPLKCAFGVTSGKFL
Subjt:  MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFL

Query:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRREK-----------------------------FLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY
        GFIVRHRGIE+D SKIDAIQKMP PK+LH+LRR +                             F WD++CQNAFDSIKKYLLNPPVL AP   +PLILY
Subjt:  GFIVRHRGIEIDQSKIDAIQKMPRPKSLHDLRREK-----------------------------FLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILY

Query:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY
        IAAQE                       RTL GAE+NYSPIEKMCLALFFAI KLRHYMQAFT+HLVAKADP+KY+LSRP+ISGRLAKW IILQQYDIVY
Subjt:  IAAQE-----------------------RTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVY

Query:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE
        I QKA+KGQALAD LADHP+PS+WKLC+DLPD+EV F E +EPW M+FDGA RRS AG  IV ISPEKHM PYSF L ELCSNNVAEYQA IIGLQMA E
Subjt:  ISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALE

Query:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD--------RSE--------------------------------------------
         G+  IEI+GDSKLIINQLS QY+VKH+DLK YF+YAR+LM+RFD        RSE                                            
Subjt:  IGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFD--------RSE--------------------------------------------

Query:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK
                            +GKLP D RH+ ++RRRAA FIYYK TLYRRS EGL LRCLGKEES KALEEAH+G+CGAHQSGPKLQ+QL+RMGYYWP 
Subjt:  -------------------RNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPK

Query:  MVQDSMNYAKKCEA-----------SEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH
        M+ DSM++AK CEA            EPLHPT+ASWPFEAWGLDLVGPITPKS+AGHSYILA TDYFSKWAEA+PLREAKKEN+ NF+ THIIYRYGIPH
Subjt:  MVQDSMNYAKKCEA-----------SEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSYILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPH

Query:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
        RIVTDNGRQF+N+++DKLCEKF FKQ+KSSMYNAA NGL EAFNKTLC+LLKK+VSK+KRDWQE+IGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER
Subjt:  RIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLER

Query:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG
        EIPSLRMA+QEGLTTEDN +LRLQ+LEALDEKRLEAQQALECYQARMSKAF K V+PRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDG YIVKEV+TNG
Subjt:  EIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNG

Query:  AYKIVDRDGLKIGPINDKFLKKFYA
        AYKI+D+DGL+IGPIN KFLKKFYA
Subjt:  AYKIVDRDGLKIGPINDKFLKKFYA

SwissProt top hitse value%identityAlignment
P0CT34 Transposon Tf2-1 polyprotein4.0e-3621.73Show/hide
Query:  DEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIV
        DE   AFR P+G++ Y VMP+G+  A A +Q  +  +  +    +V CY DD+++ SK   +H+K +K V  +L+   L +N  KC F  +  KF+G+ +
Subjt:  DEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIV

Query:  RHRGIEIDQSKIDAIQKMPRPKSLHDLRR--------EKFL---------------------WDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILYIAAQ
          +G    Q  ID + +  +PK+  +LR+         KF+                     W      A ++IK+ L++PPVL     S+ ++L   A 
Subjt:  RHRGIEIDQSKIDAIQKMPRPKSLHDLRR--------EKFL---------------------WDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILYIAAQ

Query:  ERTL---------------VG--------AEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISG-----------RLAKWTIIL
        +  +               VG        A++NYS  +K  LA+  ++   RHY++       +  +P K +     + G           RLA+W + L
Subjt:  ERTL---------------VG--------AEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISG-----------RLAKWTIIL

Query:  QQYDIVYISQKAIKGQALADVLA-----DHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEY
        Q ++   I+ +      +AD L+       PIP      +D  D+ + F                                                   
Subjt:  QQYDIVYISQKAIKGQALADVLA-----DHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEY

Query:  QALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFDRSERNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLR
                                   +NQ+S+  D K++ + +Y T   +L+   +         +D R + +++ +    I  K  +   +   L   
Subjt:  QALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFDRSERNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLR

Query:  CLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPKMVQDSMNYAKKCEASE-----------PLHPTMAS-WPFEAWGLDLVGPITPKSSAGHS
         + K      L   H G+        +L   +    + W  + +    Y + C   +           PL P   S  P+E+  +D +  + P+SS G++
Subjt:  CLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPKMVQDSMNYAKKCEASE-----------PLHPTMAS-WPFEAWGLDLVGPITPKSSAGHS

Query:  YILAATDYFSKWAEAIP-LREAKKENVANFILTHIIYRYGIPHRIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSK
         +    D FSK A  +P  +    E  A      +I  +G P  I+ DN   F++        K+ F    S  Y   T+G TE  N+T+  LL+ + S 
Subjt:  YILAATDYFSKWAEAIP-LREAKKENVANFILTHIIYRYGIPHRIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSK

Query:  SKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVE-AVLPLEREIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVK
            W + I     +Y     + T +TP+ +V+    A+ PL  E+PS          T++N +  +Q  + + E        L     +M K F   ++
Subjt:  SKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVE-AVLPLEREIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVK

Query:  P-RSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNGAYKIVDRDGLK
            FQ GDLV+ V+R      H  NK  P + G + V +      Y++   D +K
Subjt:  P-RSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNGAYKIVDRDGLK

P0CT35 Transposon Tf2-2 polyprotein4.0e-3621.73Show/hide
Query:  DEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIV
        DE   AFR P+G++ Y VMP+G+  A A +Q  +  +  +    +V CY DD+++ SK   +H+K +K V  +L+   L +N  KC F  +  KF+G+ +
Subjt:  DEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIV

Query:  RHRGIEIDQSKIDAIQKMPRPKSLHDLRR--------EKFL---------------------WDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILYIAAQ
          +G    Q  ID + +  +PK+  +LR+         KF+                     W      A ++IK+ L++PPVL     S+ ++L   A 
Subjt:  RHRGIEIDQSKIDAIQKMPRPKSLHDLRR--------EKFL---------------------WDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILYIAAQ

Query:  ERTL---------------VG--------AEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISG-----------RLAKWTIIL
        +  +               VG        A++NYS  +K  LA+  ++   RHY++       +  +P K +     + G           RLA+W + L
Subjt:  ERTL---------------VG--------AEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISG-----------RLAKWTIIL

Query:  QQYDIVYISQKAIKGQALADVLA-----DHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEY
        Q ++   I+ +      +AD L+       PIP      +D  D+ + F                                                   
Subjt:  QQYDIVYISQKAIKGQALADVLA-----DHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEY

Query:  QALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFDRSERNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLR
                                   +NQ+S+  D K++ + +Y T   +L+   +         +D R + +++ +    I  K  +   +   L   
Subjt:  QALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFDRSERNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLR

Query:  CLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPKMVQDSMNYAKKCEASE-----------PLHPTMAS-WPFEAWGLDLVGPITPKSSAGHS
         + K      L   H G+        +L   +    + W  + +    Y + C   +           PL P   S  P+E+  +D +  + P+SS G++
Subjt:  CLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPKMVQDSMNYAKKCEASE-----------PLHPTMAS-WPFEAWGLDLVGPITPKSSAGHS

Query:  YILAATDYFSKWAEAIP-LREAKKENVANFILTHIIYRYGIPHRIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSK
         +    D FSK A  +P  +    E  A      +I  +G P  I+ DN   F++        K+ F    S  Y   T+G TE  N+T+  LL+ + S 
Subjt:  YILAATDYFSKWAEAIP-LREAKKENVANFILTHIIYRYGIPHRIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSK

Query:  SKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVE-AVLPLEREIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVK
            W + I     +Y     + T +TP+ +V+    A+ PL  E+PS          T++N +  +Q  + + E        L     +M K F   ++
Subjt:  SKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVE-AVLPLEREIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVK

Query:  P-RSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNGAYKIVDRDGLK
            FQ GDLV+ V+R      H  NK  P + G + V +      Y++   D +K
Subjt:  P-RSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNGAYKIVDRDGLK

P0CT36 Transposon Tf2-3 polyprotein4.0e-3621.73Show/hide
Query:  DEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIV
        DE   AFR P+G++ Y VMP+G+  A A +Q  +  +  +    +V CY DD+++ SK   +H+K +K V  +L+   L +N  KC F  +  KF+G+ +
Subjt:  DEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIV

Query:  RHRGIEIDQSKIDAIQKMPRPKSLHDLRR--------EKFL---------------------WDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILYIAAQ
          +G    Q  ID + +  +PK+  +LR+         KF+                     W      A ++IK+ L++PPVL     S+ ++L   A 
Subjt:  RHRGIEIDQSKIDAIQKMPRPKSLHDLRR--------EKFL---------------------WDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILYIAAQ

Query:  ERTL---------------VG--------AEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISG-----------RLAKWTIIL
        +  +               VG        A++NYS  +K  LA+  ++   RHY++       +  +P K +     + G           RLA+W + L
Subjt:  ERTL---------------VG--------AEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISG-----------RLAKWTIIL

Query:  QQYDIVYISQKAIKGQALADVLA-----DHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEY
        Q ++   I+ +      +AD L+       PIP      +D  D+ + F                                                   
Subjt:  QQYDIVYISQKAIKGQALADVLA-----DHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEY

Query:  QALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFDRSERNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLR
                                   +NQ+S+  D K++ + +Y T   +L+   +         +D R + +++ +    I  K  +   +   L   
Subjt:  QALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFDRSERNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLR

Query:  CLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPKMVQDSMNYAKKCEASE-----------PLHPTMAS-WPFEAWGLDLVGPITPKSSAGHS
         + K      L   H G+        +L   +    + W  + +    Y + C   +           PL P   S  P+E+  +D +  + P+SS G++
Subjt:  CLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPKMVQDSMNYAKKCEASE-----------PLHPTMAS-WPFEAWGLDLVGPITPKSSAGHS

Query:  YILAATDYFSKWAEAIP-LREAKKENVANFILTHIIYRYGIPHRIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSK
         +    D FSK A  +P  +    E  A      +I  +G P  I+ DN   F++        K+ F    S  Y   T+G TE  N+T+  LL+ + S 
Subjt:  YILAATDYFSKWAEAIP-LREAKKENVANFILTHIIYRYGIPHRIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSK

Query:  SKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVE-AVLPLEREIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVK
            W + I     +Y     + T +TP+ +V+    A+ PL  E+PS          T++N +  +Q  + + E        L     +M K F   ++
Subjt:  SKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVE-AVLPLEREIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVK

Query:  P-RSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNGAYKIVDRDGLK
            FQ GDLV+ V+R      H  NK  P + G + V +      Y++   D +K
Subjt:  P-RSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNGAYKIVDRDGLK

P0CT41 Transposon Tf2-12 polyprotein4.0e-3621.73Show/hide
Query:  DEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIV
        DE   AFR P+G++ Y VMP+G+  A A +Q  +  +  +    +V CY DD+++ SK   +H+K +K V  +L+   L +N  KC F  +  KF+G+ +
Subjt:  DEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIV

Query:  RHRGIEIDQSKIDAIQKMPRPKSLHDLRR--------EKFL---------------------WDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILYIAAQ
          +G    Q  ID + +  +PK+  +LR+         KF+                     W      A ++IK+ L++PPVL     S+ ++L   A 
Subjt:  RHRGIEIDQSKIDAIQKMPRPKSLHDLRR--------EKFL---------------------WDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILYIAAQ

Query:  ERTL---------------VG--------AEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISG-----------RLAKWTIIL
        +  +               VG        A++NYS  +K  LA+  ++   RHY++       +  +P K +     + G           RLA+W + L
Subjt:  ERTL---------------VG--------AEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKADPIKYVLSRPIISG-----------RLAKWTIIL

Query:  QQYDIVYISQKAIKGQALADVLA-----DHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEY
        Q ++   I+ +      +AD L+       PIP      +D  D+ + F                                                   
Subjt:  QQYDIVYISQKAIKGQALADVLA-----DHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEY

Query:  QALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFDRSERNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLR
                                   +NQ+S+  D K++ + +Y T   +L+   +         +D R + +++ +    I  K  +   +   L   
Subjt:  QALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFDRSERNGKLPKDSRHKTDVRRRAAHFIYYKGTLYRRSLEGLFLR

Query:  CLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPKMVQDSMNYAKKCEASE-----------PLHPTMAS-WPFEAWGLDLVGPITPKSSAGHS
         + K      L   H G+        +L   +    + W  + +    Y + C   +           PL P   S  P+E+  +D +  + P+SS G++
Subjt:  CLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPKMVQDSMNYAKKCEASE-----------PLHPTMAS-WPFEAWGLDLVGPITPKSSAGHS

Query:  YILAATDYFSKWAEAIP-LREAKKENVANFILTHIIYRYGIPHRIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSK
         +    D FSK A  +P  +    E  A      +I  +G P  I+ DN   F++        K+ F    S  Y   T+G TE  N+T+  LL+ + S 
Subjt:  YILAATDYFSKWAEAIP-LREAKKENVANFILTHIIYRYGIPHRIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSK

Query:  SKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVE-AVLPLEREIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVK
            W + I     +Y     + T +TP+ +V+    A+ PL  E+PS          T++N +  +Q  + + E        L     +M K F   ++
Subjt:  SKRDWQERIGEALWAYRTTHRTPTGVTPYSLVYGVE-AVLPLEREIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVK

Query:  P-RSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNGAYKIVDRDGLK
            FQ GDLV+ V+R      H  NK  P + G + V +      Y++   D +K
Subjt:  P-RSFQVGDLVLAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNGAYKIVDRDGLK

P10394 Retrovirus-related Pol polyprotein from transposon 4123.2e-4624.1Show/hide
Query:  EITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRH
        +IT+F T  G Y +  +PFGLK A  ++QR M   F  +       Y DDL+V     +  LK+L  VF + R+Y L+++P KC+F +    FLG     
Subjt:  EITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIVRH

Query:  RGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLI--------
        +GI  D  K D IQ  P P      RR                               F W + CQ AF  +K  L+NP +L  P  S+           
Subjt:  RGIEIDQSKIDAIQKMPRPKSLHDLRR-----------------------------EKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLI--------

Query:  --------------LYIAAQERTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKAD--PIKYVLSRPIISGRLAKWTIILQQYDIVYISQKA
                      L +A   R     E N S  E+   A+ +AI   R Y+  +  H   K D  P+ Y+ S    S +L +  + L++Y+      K 
Subjt:  --------------LYIAAQERTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAFTVHLVAKAD--PIKYVLSRPIISGRLAKWTIILQQYDIVYISQKA

Query:  IKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSF
         K   +AD L+   I    K  +D+  + +  T   +        + ++SCAG + + +  +         + E+ +N+       ++ LQ+        
Subjt:  IKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSF

Query:  IEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFDRSERNGKLPKDSRHKTDVRRRAAHFIYYK--GTLYRRSLEGLFLRCLGKEESIKALEEAH
        I ++   K II     +YDV   DL  Y      L +   R E    +   S+ K    ++    +       +  + L+ L +  L     I   +E  
Subjt:  IEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFDRSERNGKLPKDSRHKTDVRRRAAHFIYYK--GTLYRRSLEGLFLRCLGKEESIKALEEAH

Query:  A--------GVCGAHQSGPKLQFQLRRMGYYWPKMVQDSMNYAKKCEASEPLHPT-----------MASWPFEAWGLDLVGPITPKSSAGHSYILAATDY
        A         + G H    K   +++R  YYW  M +    Y +KC+  +    T                F+   +D +GP+ PKS  G+ Y +     
Subjt:  A--------GVCGAHQSGPKLQFQLRRMGYYWPKMVQDSMNYAKKCEASEPLHPT-----------MASWPFEAWGLDLVGPITPKSSAGHSYILAATDY

Query:  FSKWAEAIPLREAKKENVANFILTHIIYRYGIPHRIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERI
         +K+  AIP+     + VA  I    I +YG     +TD G ++ NS+I  LC+  K K   S+ ++  T G+ E  ++TL   ++  +S  K DW   +
Subjt:  FSKWAEAIPLREAKKENVANFILTHIIYRYGIPHRIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQERI

Query:  GEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLV
           ++ + TT        PY LV+G  + LP  +    L  +++     +D  K    +LE    +   A++ LE ++ +  + +   VK    +VGD V
Subjt:  GEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLV

Query:  LAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNGAYKIVDRDGLKIGPINDKFLKKFYA
        L            G+K   K+ G Y ++ +  N    ++     K     D+ LKKF++
Subjt:  LAVRRPIITTRHTGNKFTPKWDGLYIVKEVYTNGAYKIVDRDGLKIGPINDKFLKKFYA

Arabidopsis top hitse value%identityAlignment
AT1G24090.1 RNase H family protein4.0e-0735.71Show/hide
Query:  FDGAVRRS--CAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMER
        FDGA + +   +GA  VL + +  +         + +NN AEY ALI+GL+ A+E G   I++ GDSKL+  Q+  Q+ V HE L +    A+ L  +
Subjt:  FDGAVRRS--CAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMER

AT3G01410.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.2e-0728.68Show/hide
Query:  TMYFDGAVRRS--CAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLME
        T+ FDGA + +   AGA  VL + +  +  Y        +NNVAEY+AL++GL+ AL+ G   + + GDS L+  Q+   +   H  + +    A++LM 
Subjt:  TMYFDGAVRRS--CAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLME

Query:  RFDRSERNGKLPKDSRHKTDVRRRAAHFI
         F   +    + ++   + D +  +A F+
Subjt:  RFDRSERNGKLPKDSRHKTDVRRRAAHFI

AT5G51080.1 RNase H family protein3.6e-0831.62Show/hide
Query:  ALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRS--CAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSFIE
        AL   L    +PS     E L + E       E   + FDGA + +   +GA  VL + +  +         + +NN AEY  LI+GL+ A+E G + I+
Subjt:  ALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRS--CAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSFIE

Query:  IYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMER
        +  DSKL+  Q+  Q+ V HE L +    A+QL ++
Subjt:  IYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMER

AT5G51080.2 RNase H family protein3.6e-0831.62Show/hide
Query:  ALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRS--CAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSFIE
        AL   L    +PS     E L + E       E   + FDGA + +   +GA  VL + +  +         + +NN AEY  LI+GL+ A+E G + I+
Subjt:  ALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRS--CAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSFIE

Query:  IYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMER
        +  DSKL+  Q+  Q+ V HE L +    A+QL ++
Subjt:  IYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMER

AT5G51080.3 RNase H family protein3.6e-0831.62Show/hide
Query:  ALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRS--CAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSFIE
        AL   L    +PS     E L + E       E   + FDGA + +   +GA  VL + +  +         + +NN AEY  LI+GL+ A+E G + I+
Subjt:  ALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRS--CAGADIVLISPEKHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSFIE

Query:  IYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMER
        +  DSKL+  Q+  Q+ V HE L +    A+QL ++
Subjt:  IYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMER


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCTTTCAGATGAAGAAATAACAGCTTTCAGGACCCCAAAGGGAATATATTGTTACAAGGTGATGCCCTTTGGATTAAAAAATGCTGGTGCCACTTATCAA
CGTGCTATGCAAAAAGTATTTGACGATATGCTACATAAATATGTCGAGTGCTACGCTGATGACCTTGTGGTCAAATCCAAGAGACGACAAGACCATTTGAAGGAT
CTAAAAGTTGTGTTCGACCGCTTGCGAAAATATCAGTTGAGGATGAACCCTCTCAAATGCGCGTTCGGTGTAACTTCAGGAAAGTTTCTTGGCTTTATTGTAAGG
CATCGAGGGATCGAGATAGACCAGTCCAAGATTGATGCCATTCAGAAGATGCCAAGGCCAAAGAGTTTGCATGACCTAAGAAGAGAAAAATTTTTGTGGGATGAG
GCTTGTCAGAATGCTTTTGATAGCATAAAGAAATACTTGCTCAATCCCCCAGTATTAGGAGCTCCAGTACCTAGCGAGCCATTAATATTGTACATTGCTGCACAA
GAAAGAACTTTAGTTGGGGCCGAAGTTAACTATTCTCCCATTGAGAAAATGTGCCTTGCACTTTTCTTTGCCATCGGTAAGTTGAGGCATTATATGCAGGCCTTC
ACCGTTCATCTAGTAGCAAAGGCGGACCCTATAAAGTATGTTCTGTCTAGGCCGATTATCTCTGGACGCTTAGCCAAATGGACAATTATACTCCAGCAATATGAC
ATTGTCTATATTTCCCAAAAGGCGATAAAAGGACAAGCGTTGGCAGATGTTTTGGCGGACCACCCAATTCCTTCAGATTGGAAGTTATGTGAGGATTTGCCAGAC
GATGAAGTTTTCTTCACGGAAATGGTGGAACCTTGGACTATGTATTTTGATGGTGCAGTGCGAAGGAGTTGTGCGGGGGCAGACATCGTCCTCATTTCTCCTGAA
AAACATATGTTTCCTTATAGCTTTGCGCTTGCTGAATTGTGCTCAAACAATGTGGCTGAATATCAAGCCTTGATAATTGGCCTTCAAATGGCATTAGAAATCGGA
GTATCGTTCATAGAAATTTACGGCGATTCAAAGTTGATAATCAATCAACTCTCGCTTCAGTATGACGTGAAACATGAAGACTTGAAGCAATACTTCACTTATGCT
AGACAATTGATGGAAAGGTTTGACAGAAGCGAACGTAACGGAAAACTTCCGAAGGATTCTCGTCATAAAACTGATGTACGAAGAAGAGCTGCACACTTTATTTAT
TACAAAGGAACTTTATATCGCCGTTCTCTTGAAGGACTCTTTCTTAGATGTCTTGGAAAGGAAGAGTCGATAAAAGCTCTAGAGGAAGCACATGCAGGTGTCTGT
GGAGCACATCAATCGGGACCAAAGCTTCAATTTCAGTTGAGAAGAATGGGCTATTATTGGCCTAAGATGGTTCAAGATTCAATGAACTATGCAAAGAAGTGTGAA
GCTTCAGAGCCTCTACATCCAACTATGGCTTCTTGGCCGTTTGAGGCTTGGGGACTCGATCTGGTTGGTCCTATTACACCAAAATCATCAGCAGGACATTCTTAT
ATCCTTGCAGCAACAGATTATTTCTCAAAATGGGCTGAGGCCATTCCCTTGAGAGAGGCCAAGAAGGAGAACGTGGCAAACTTTATTCTTACCCACATCATCTAT
CGATACGGTATTCCTCACCGGATTGTGACAGATAATGGAAGACAATTCTCCAATAGCATGATAGATAAATTATGTGAAAAATTCAAGTTCAAACAATACAAGTCA
TCTATGTACAACGCAGCCACGAATGGCCTAACAGAAGCATTCAATAAGACGTTATGCAATCTTCTAAAGAAAATTGTCTCCAAGTCGAAGAGGGATTGGCAAGAA
AGAATCGGTGAAGCATTGTGGGCCTATCGAACCACTCATCGCACTCCTACGGGGGTTACACCATATTCGCTTGTTTACGGAGTAGAGGCTGTCCTTCCCCTTGAA
AGAGAAATCCCATCATTAAGAATGGCAGTGCAAGAAGGGTTGACTACCGAAGACAATGTCAAGTTACGTCTTCAAAAGTTAGAAGCACTTGATGAAAAACGATTA
GAAGCTCAGCAAGCATTGGAATGTTACCAAGCCCGAATGTCTAAAGCTTTTTACAAACATGTAAAGCCCCGATCATTTCAAGTTGGTGATTTAGTACTTGCAGTA
AGAAGACCTATCATCACGACAAGGCATACGGGAAATAAGTTTACACCTAAATGGGACGGACTCTATATTGTCAAAGAAGTCTATACAAATGGTGCATACAAGATT
GTTGATCGAGATGGATTAAAAATTGGCCCAATCAACGACAAGTTTCTCAAGAAATTTTATGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCTTTCAGATGAAGAAATAACAGCTTTCAGGACCCCAAAGGGAATATATTGTTACAAGGTGATGCCCTTTGGATTAAAAAATGCTGGTGCCACTTATCAA
CGTGCTATGCAAAAAGTATTTGACGATATGCTACATAAATATGTCGAGTGCTACGCTGATGACCTTGTGGTCAAATCCAAGAGACGACAAGACCATTTGAAGGAT
CTAAAAGTTGTGTTCGACCGCTTGCGAAAATATCAGTTGAGGATGAACCCTCTCAAATGCGCGTTCGGTGTAACTTCAGGAAAGTTTCTTGGCTTTATTGTAAGG
CATCGAGGGATCGAGATAGACCAGTCCAAGATTGATGCCATTCAGAAGATGCCAAGGCCAAAGAGTTTGCATGACCTAAGAAGAGAAAAATTTTTGTGGGATGAG
GCTTGTCAGAATGCTTTTGATAGCATAAAGAAATACTTGCTCAATCCCCCAGTATTAGGAGCTCCAGTACCTAGCGAGCCATTAATATTGTACATTGCTGCACAA
GAAAGAACTTTAGTTGGGGCCGAAGTTAACTATTCTCCCATTGAGAAAATGTGCCTTGCACTTTTCTTTGCCATCGGTAAGTTGAGGCATTATATGCAGGCCTTC
ACCGTTCATCTAGTAGCAAAGGCGGACCCTATAAAGTATGTTCTGTCTAGGCCGATTATCTCTGGACGCTTAGCCAAATGGACAATTATACTCCAGCAATATGAC
ATTGTCTATATTTCCCAAAAGGCGATAAAAGGACAAGCGTTGGCAGATGTTTTGGCGGACCACCCAATTCCTTCAGATTGGAAGTTATGTGAGGATTTGCCAGAC
GATGAAGTTTTCTTCACGGAAATGGTGGAACCTTGGACTATGTATTTTGATGGTGCAGTGCGAAGGAGTTGTGCGGGGGCAGACATCGTCCTCATTTCTCCTGAA
AAACATATGTTTCCTTATAGCTTTGCGCTTGCTGAATTGTGCTCAAACAATGTGGCTGAATATCAAGCCTTGATAATTGGCCTTCAAATGGCATTAGAAATCGGA
GTATCGTTCATAGAAATTTACGGCGATTCAAAGTTGATAATCAATCAACTCTCGCTTCAGTATGACGTGAAACATGAAGACTTGAAGCAATACTTCACTTATGCT
AGACAATTGATGGAAAGGTTTGACAGAAGCGAACGTAACGGAAAACTTCCGAAGGATTCTCGTCATAAAACTGATGTACGAAGAAGAGCTGCACACTTTATTTAT
TACAAAGGAACTTTATATCGCCGTTCTCTTGAAGGACTCTTTCTTAGATGTCTTGGAAAGGAAGAGTCGATAAAAGCTCTAGAGGAAGCACATGCAGGTGTCTGT
GGAGCACATCAATCGGGACCAAAGCTTCAATTTCAGTTGAGAAGAATGGGCTATTATTGGCCTAAGATGGTTCAAGATTCAATGAACTATGCAAAGAAGTGTGAA
GCTTCAGAGCCTCTACATCCAACTATGGCTTCTTGGCCGTTTGAGGCTTGGGGACTCGATCTGGTTGGTCCTATTACACCAAAATCATCAGCAGGACATTCTTAT
ATCCTTGCAGCAACAGATTATTTCTCAAAATGGGCTGAGGCCATTCCCTTGAGAGAGGCCAAGAAGGAGAACGTGGCAAACTTTATTCTTACCCACATCATCTAT
CGATACGGTATTCCTCACCGGATTGTGACAGATAATGGAAGACAATTCTCCAATAGCATGATAGATAAATTATGTGAAAAATTCAAGTTCAAACAATACAAGTCA
TCTATGTACAACGCAGCCACGAATGGCCTAACAGAAGCATTCAATAAGACGTTATGCAATCTTCTAAAGAAAATTGTCTCCAAGTCGAAGAGGGATTGGCAAGAA
AGAATCGGTGAAGCATTGTGGGCCTATCGAACCACTCATCGCACTCCTACGGGGGTTACACCATATTCGCTTGTTTACGGAGTAGAGGCTGTCCTTCCCCTTGAA
AGAGAAATCCCATCATTAAGAATGGCAGTGCAAGAAGGGTTGACTACCGAAGACAATGTCAAGTTACGTCTTCAAAAGTTAGAAGCACTTGATGAAAAACGATTA
GAAGCTCAGCAAGCATTGGAATGTTACCAAGCCCGAATGTCTAAAGCTTTTTACAAACATGTAAAGCCCCGATCATTTCAAGTTGGTGATTTAGTACTTGCAGTA
AGAAGACCTATCATCACGACAAGGCATACGGGAAATAAGTTTACACCTAAATGGGACGGACTCTATATTGTCAAAGAAGTCTATACAAATGGTGCATACAAGATT
GTTGATCGAGATGGATTAAAAATTGGCCCAATCAACGACAAGTTTCTCAAGAAATTTTATGCTTAA
Protein sequenceShow/hide protein sequence
MALSDEEITAFRTPKGIYCYKVMPFGLKNAGATYQRAMQKVFDDMLHKYVECYADDLVVKSKRRQDHLKDLKVVFDRLRKYQLRMNPLKCAFGVTSGKFLGFIVR
HRGIEIDQSKIDAIQKMPRPKSLHDLRREKFLWDEACQNAFDSIKKYLLNPPVLGAPVPSEPLILYIAAQERTLVGAEVNYSPIEKMCLALFFAIGKLRHYMQAF
TVHLVAKADPIKYVLSRPIISGRLAKWTIILQQYDIVYISQKAIKGQALADVLADHPIPSDWKLCEDLPDDEVFFTEMVEPWTMYFDGAVRRSCAGADIVLISPE
KHMFPYSFALAELCSNNVAEYQALIIGLQMALEIGVSFIEIYGDSKLIINQLSLQYDVKHEDLKQYFTYARQLMERFDRSERNGKLPKDSRHKTDVRRRAAHFIY
YKGTLYRRSLEGLFLRCLGKEESIKALEEAHAGVCGAHQSGPKLQFQLRRMGYYWPKMVQDSMNYAKKCEASEPLHPTMASWPFEAWGLDLVGPITPKSSAGHSY
ILAATDYFSKWAEAIPLREAKKENVANFILTHIIYRYGIPHRIVTDNGRQFSNSMIDKLCEKFKFKQYKSSMYNAATNGLTEAFNKTLCNLLKKIVSKSKRDWQE
RIGEALWAYRTTHRTPTGVTPYSLVYGVEAVLPLEREIPSLRMAVQEGLTTEDNVKLRLQKLEALDEKRLEAQQALECYQARMSKAFYKHVKPRSFQVGDLVLAV
RRPIITTRHTGNKFTPKWDGLYIVKEVYTNGAYKIVDRDGLKIGPINDKFLKKFYA