CuGenDBv2

Cmc01g0023601 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc01g0023601
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr01:23503112..23504773
RNA-Seq Expression: Cmc01g0023601
Synteny: Cmc01g0023601
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domains:
IPR021109 - Aspartic peptidase domain superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
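
The genome location in this record follows a `sequence:start..end` pattern. The sketch below shows one way such a string might be parsed. It assumes 1-based, inclusive coordinates, which the page does not state, and `Location` and `parse_location` are hypothetical names used only for illustration, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(
        r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", text.strip()
    )
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError("start must not exceed end")
    return Location(match["seqid"], start, end)


loc = parse_location("CMiso1.1chr01:23503112..23504773")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the span reported for Cmc01g0023601 is 1662 bp.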


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0025469.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e-value 3.5e-221 | identity 77.08%
Query:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG
        M DFDVILGMDWLA+NHASIDCS +EV FNP    +FKFKG    +LP+VIS+++ +KLL+QGTW IL SVVDTR+ DV LSS+ VVRD PDVF EE PG
Subjt:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG

Query:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ
        LPPHR+++FAIELEPGTVPIS+A YRM  AELK+LK+QLQE LDKGFIRPSVSPWG PVLFVKKKD SMRLCIDY++LNKVTVKNKY LPR DDLFDQLQ
Subjt:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR
        GATVFSKIDLRSGYHQLRIKD D+PKT+F  RYGH+EFIVMSFGLTNA TVF +                          TEAEHEEHLR+VL+ LR N+
Subjt:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR

Query:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLI
        LYAKFSKCEFWL QVSFLGH+VSK GVSVDPA+IE VT TPL QL RK APFVWSK CEDSFQNLKQ LVTA VLTVPDGSGSFV YSDAS+KGLGCVL+
Subjt:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLI

Query:  QQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKV
        QQGKVVAYAS QLKSHE+N PTHDLELA +VFALKIWRHYLYGEKIQIFTDHKSL YFF QKELNMRQRRWLELVKDYDCEILYH G  NVV DALSRKV
Subjt:  QQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKV

Query:  SHSATL
        S+SA L
Subjt:  SHSATL

KAA0048687.1 pol protein [Cucumis melo var. makuwa] | e-value 5.1e-220 | identity 73.74%
Query:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG
        M DFDVILGMDWLA++HASIDCS +EV FNP    +FKFKG    +LP+VIS+++A+KLL+QGTW IL SVVDTREADV LSS+ VVRD PDVF EE PG
Subjt:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG

Query:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ
        LPPHR+++FAIELEPGTVPIS+A YRM  AELK+LKVQLQELLDKGFIRPSVSPWG PVLFVKKKD SMRLCIDY++LNKVTVKN+Y LPR DDLFDQLQ
Subjt:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR
        GATVFSKIDLRSGYHQLRIKD D+PKT+F  RYGH+EFIVMSFGLTNA  VF +                          TEAEHEEHLRMVL+ LR N+
Subjt:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR

Query:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVT-------------------------------TTPLIQLNRKEAPFVWSKFCEDSFQNLKQSL
        LYAKFSKCEFWL QVSFLGH+VSK GVSVDPA+IE VT                                TPL QL RK APFVWSK CEDSFQNLKQ L
Subjt:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVT-------------------------------TTPLIQLNRKEAPFVWSKFCEDSFQNLKQSL

Query:  VTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQR
        VTA VLTVPDGSGSFV YSDAS+KGLGCVL+QQGKVVAYASRQLKSHE+NYPTHDLELA +VFALKIWRHYLYGEKIQIFTDHKSL YFF QKELNMRQR
Subjt:  VTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQR

Query:  RWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSATL
        RWLELVKDYDCEILYH G  NVV DALSRKVSHSA L
Subjt:  RWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSATL

KAA0051744.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e-value 3.5e-221 | identity 77.08%
Query:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG
        M DFDVILGMDWLA+NHAS+DCS +EV FNP    +FKFKG    +LP+VIS+++A+KLL+QGTW IL SVVDTRE DV LSS+ VVRD PDVF EE PG
Subjt:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG

Query:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ
        LPPHR+++FAIELEPGTVPISKA YRM  AELK+LKVQLQELLDKGFIRPSVSPWG PVLFVKKKD  MRLCIDY++LNKVTVKN+Y LPR DDLFDQLQ
Subjt:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR
        GATVFSKIDLR G+HQLRIK+ D+PKT+F  RYGH+EFI+MSFGLTNA TVF +                          TEAEHEEHLRMVL+ LR N+
Subjt:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR

Query:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLI
        LYAKFSKCEFWL QVSFL H+VSK GVSVDPA+IE VT TPL QL RK APFVWSK CEDSFQNLKQ LVTA VLTVPDGSGSFV YSDAS+KGLGCVL+
Subjt:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLI

Query:  QQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKV
        QQGKVVAYASRQLKSHE+NYPTHDLELA +VFALKIWRHYLYGEKIQIFTD+KSL YFF QKELNMRQRRWLELVKDYDCEILYH G  NVV DALSRKV
Subjt:  QQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKV

Query:  SHSATL
        SHSA L
Subjt:  SHSATL

KAA0067622.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e-value 9.3e-222 | identity 77.27%
Query:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG
        M DFDVILGMDWLA+NHASIDCS +EV FNP    +FKFKG    +LP+VIS+++A+KLL+QGTW IL SVVDTRE DV LSS+ VVRD  DVF EE PG
Subjt:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG

Query:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ
        LPPHR+++FAIELEP TVPIS+A YRM  AELK+LKVQLQELLDKGFIRPSVSPWG PVLFVKKKD SMRLCIDY++LNKVTVKN+Y LPR DDLFDQLQ
Subjt:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR
        GATVFSKIDLRSGYHQLRIKD D+PKT+F  RYGH+EFIVMSFGLTNA  VF +                          T+AEHEEHLRMVL+ LR N+
Subjt:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR

Query:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLI
        LYAKFSKCEFWL QV+FLGH+VSK GVSVDPA+IE VT TPL QL RK  PFVWSK CEDSFQNLKQ LVTA VLTVPDGSGSFV YSDAS+KGLGCVL+
Subjt:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLI

Query:  QQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKV
        QQGKVVAYASRQLKSHE+NYPTHDLELA +VFALKIWRHYLYGEKIQIFTDHKSL YFF QKELNMRQRRWLELVKDYDCEILYH G  NVV DALSRKV
Subjt:  QQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKV

Query:  SHSATL
        SHSA L
Subjt:  SHSATL

TYK01613.1 pol protein [Cucumis melo var. makuwa] | e-value 8.7e-220 | identity 73.74%
Query:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG
        M DFDVILGMDWLA+NHASIDCS +EV FNP    +FKFKG    +LP+VIS+++A+KLL+QGTW IL SVVDTREADV LSS+ VVRD PDVF EE PG
Subjt:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG

Query:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ
        LPPHR+++FAIELEPGTVPIS+A YRM  AELK+LKVQLQELLDKGFIRPSVSPWG PVLFVKKKD SMRLCIDY++LNKVTVKN+Y LPR DDLFDQLQ
Subjt:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR
        GATVFSKIDLRSGYHQLRIKD D+PKT+F  RYGH+EFIVMSFGLTNA  VF +                          TEAEHEEHLRMVL+ LR N+
Subjt:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR

Query:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVT-------------------------------TTPLIQLNRKEAPFVWSKFCEDSFQNLKQSL
        LYAKFSKCEFWL QVSFLGH+VSK GVSVDPA+IE VT                                TPL QL RK APFVWSK CEDSFQ LKQ L
Subjt:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVT-------------------------------TTPLIQLNRKEAPFVWSKFCEDSFQNLKQSL

Query:  VTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQR
        VTA VLTVPDGSGSFV YSDAS+KGLGCVL+QQGKVVAYASRQLKSHE+NYPTHDLELA +VFALKIWRHYLYGEKIQIFTDHKSL YFF QKELNMRQR
Subjt:  VTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQR

Query:  RWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSATL
        RWLELVKDYDCEILYH G  NVV DALSRKVSHSA L
Subjt:  RWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSATL

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7SJH3 Reverse transcriptase | e-value 1.7e-221 | identity 77.08%
Query:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG
        M DFDVILGMDWLA+NHASIDCS +EV FNP    +FKFKG    +LP+VIS+++ +KLL+QGTW IL SVVDTR+ DV LSS+ VVRD PDVF EE PG
Subjt:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG

Query:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ
        LPPHR+++FAIELEPGTVPIS+A YRM  AELK+LK+QLQE LDKGFIRPSVSPWG PVLFVKKKD SMRLCIDY++LNKVTVKNKY LPR DDLFDQLQ
Subjt:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR
        GATVFSKIDLRSGYHQLRIKD D+PKT+F  RYGH+EFIVMSFGLTNA TVF +                          TEAEHEEHLR+VL+ LR N+
Subjt:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR

Query:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLI
        LYAKFSKCEFWL QVSFLGH+VSK GVSVDPA+IE VT TPL QL RK APFVWSK CEDSFQNLKQ LVTA VLTVPDGSGSFV YSDAS+KGLGCVL+
Subjt:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLI

Query:  QQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKV
        QQGKVVAYAS QLKSHE+N PTHDLELA +VFALKIWRHYLYGEKIQIFTDHKSL YFF QKELNMRQRRWLELVKDYDCEILYH G  NVV DALSRKV
Subjt:  QQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKV

Query:  SHSATL
        S+SA L
Subjt:  SHSATL

A0A5A7U330 Reverse transcriptase | e-value 2.5e-220 | identity 73.74%
Query:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG
        M DFDVILGMDWLA++HASIDCS +EV FNP    +FKFKG    +LP+VIS+++A+KLL+QGTW IL SVVDTREADV LSS+ VVRD PDVF EE PG
Subjt:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG

Query:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ
        LPPHR+++FAIELEPGTVPIS+A YRM  AELK+LKVQLQELLDKGFIRPSVSPWG PVLFVKKKD SMRLCIDY++LNKVTVKN+Y LPR DDLFDQLQ
Subjt:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR
        GATVFSKIDLRSGYHQLRIKD D+PKT+F  RYGH+EFIVMSFGLTNA  VF +                          TEAEHEEHLRMVL+ LR N+
Subjt:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR

Query:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVT-------------------------------TTPLIQLNRKEAPFVWSKFCEDSFQNLKQSL
        LYAKFSKCEFWL QVSFLGH+VSK GVSVDPA+IE VT                                TPL QL RK APFVWSK CEDSFQNLKQ L
Subjt:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVT-------------------------------TTPLIQLNRKEAPFVWSKFCEDSFQNLKQSL

Query:  VTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQR
        VTA VLTVPDGSGSFV YSDAS+KGLGCVL+QQGKVVAYASRQLKSHE+NYPTHDLELA +VFALKIWRHYLYGEKIQIFTDHKSL YFF QKELNMRQR
Subjt:  VTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQR

Query:  RWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSATL
        RWLELVKDYDCEILYH G  NVV DALSRKVSHSA L
Subjt:  RWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSATL

A0A5A7U8T5 Reverse transcriptase | e-value 1.7e-221 | identity 77.08%
Query:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG
        M DFDVILGMDWLA+NHAS+DCS +EV FNP    +FKFKG    +LP+VIS+++A+KLL+QGTW IL SVVDTRE DV LSS+ VVRD PDVF EE PG
Subjt:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG

Query:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ
        LPPHR+++FAIELEPGTVPISKA YRM  AELK+LKVQLQELLDKGFIRPSVSPWG PVLFVKKKD  MRLCIDY++LNKVTVKN+Y LPR DDLFDQLQ
Subjt:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR
        GATVFSKIDLR G+HQLRIK+ D+PKT+F  RYGH+EFI+MSFGLTNA TVF +                          TEAEHEEHLRMVL+ LR N+
Subjt:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR

Query:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLI
        LYAKFSKCEFWL QVSFL H+VSK GVSVDPA+IE VT TPL QL RK APFVWSK CEDSFQNLKQ LVTA VLTVPDGSGSFV YSDAS+KGLGCVL+
Subjt:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLI

Query:  QQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKV
        QQGKVVAYASRQLKSHE+NYPTHDLELA +VFALKIWRHYLYGEKIQIFTD+KSL YFF QKELNMRQRRWLELVKDYDCEILYH G  NVV DALSRKV
Subjt:  QQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKV

Query:  SHSATL
        SHSA L
Subjt:  SHSATL

A0A5A7VAL8 Pol protein | e-value 4.2e-220 | identity 73.74%
Query:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG
        M DFDVIL  DWLA+NHASIDCS +EV FNP    +FKFKG    +LP+VIS+++A+KLL+QGTW IL SVVDTR+ADV LSS+ VVRD PDVF EE PG
Subjt:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG

Query:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ
        LPPHR+++FAIELEPGTVPIS+A YRM  AELK+LKVQLQELLDKGFIRPSVSPWG PVLFVKKKD SMRLCIDY++LNKVTVKN+Y LPR DDLFDQLQ
Subjt:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR
        GATVFSKIDLRSGYHQLRIKD D+PKT+F  RYGH+EFIVMSFGLTNAL VF +                          TEAEHEEHLRMVL+ LR N+
Subjt:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR

Query:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTT-------------------------------TPLIQLNRKEAPFVWSKFCEDSFQNLKQSL
        LYAKFSKCEFWL QVSFLGH+VSK GVSVDPA+IE VT+                               TPL QL RK APFVWSK CEDSFQNLKQ L
Subjt:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTT-------------------------------TPLIQLNRKEAPFVWSKFCEDSFQNLKQSL

Query:  VTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQR
        VTA VLTVPDGSGSFV YSDAS+KGLGCVLIQQGKVVAYASRQLKSHE+NYPTHDLELA +VFALKIWRHYLYGEKIQIFTDHKSL YFF QKELNMRQR
Subjt:  VTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQR

Query:  RWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSATL
        RWLELVKDYDCEILYH G  NVV DALSRKVSHSA L
Subjt:  RWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSATL

A0A5A7VGW8 Reverse transcriptase | e-value 4.5e-222 | identity 77.27%
Query:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG
        M DFDVILGMDWLA+NHASIDCS +EV FNP    +FKFKG    +LP+VIS+++A+KLL+QGTW IL SVVDTRE DV LSS+ VVRD  DVF EE PG
Subjt:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPG

Query:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ
        LPPHR+++FAIELEP TVPIS+A YRM  AELK+LKVQLQELLDKGFIRPSVSPWG PVLFVKKKD SMRLCIDY++LNKVTVKN+Y LPR DDLFDQLQ
Subjt:  LPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQ

Query:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR
        GATVFSKIDLRSGYHQLRIKD D+PKT+F  RYGH+EFIVMSFGLTNA  VF +                          T+AEHEEHLRMVL+ LR N+
Subjt:  GATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNR

Query:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLI
        LYAKFSKCEFWL QV+FLGH+VSK GVSVDPA+IE VT TPL QL RK  PFVWSK CEDSFQNLKQ LVTA VLTVPDGSGSFV YSDAS+KGLGCVL+
Subjt:  LYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLI

Query:  QQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKV
        QQGKVVAYASRQLKSHE+NYPTHDLELA +VFALKIWRHYLYGEKIQIFTDHKSL YFF QKELNMRQRRWLELVKDYDCEILYH G  NVV DALSRKV
Subjt:  QQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKV

Query:  SHSATL
        SHSA L
Subjt:  SHSATL

SwissProt top hits (e-value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | e-value 9.2e-55 | identity 30.99%
Query:  ASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFV-KKKDVS----MRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQGATVFSKIDLRSGYHQL
        + Y    A  ++++ Q+Q++L++G IR S SP+  P+  V KK+D S     R+ IDY++LN++TV +++ +P  D++  +L     F+ IDL  G+HQ+
Subjt:  ASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFV-KKKDVS----MRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQGATVFSKIDLRSGYHQL

Query:  RIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNRLYAKFSKCEFWLDQVSF
         +    + KT+F  ++GH+E++ M FGL NA   F                           ++  EH + L +V E L    L  +  KCEF   + +F
Subjt:  RIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNRLYAKFSKCEFWLDQVSF

Query:  LGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFV-----WSKFC---------------------------EDSFQNLKQSLVTATVLTVPDGSGSFV
        LGH+++ DG+  +P +IE +   P+    ++   F+     + KF                            + +F+ LK  +    +L VPD +  F 
Subjt:  LGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFV-----WSKFC---------------------------EDSFQNLKQSLVTATVLTVPDGSGSFV

Query:  TYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYH
          +DAS   LG VL Q G  ++Y SR L  HE NY T + EL  IV+A K +RHYL G   +I +DH+ L + +  K+ N +  RW   + ++D +I Y 
Subjt:  TYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYH

Query:  LGTMNVVVDALSR
         G  N V DALSR
Subjt:  LGTMNVVVDALSR

P20825 Retrovirus-related Pol polyprotein from transposon 297 | e-value 2.9e-53 | identity 30.46%
Query:  PISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFV-KKKDVS----MRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQGATVFSKIDLRSG
        PI    Y +      +++ Q+QE+L++G IR S SP+  P   V KK D S     R+ IDY++LN++T+ ++Y +P  D++  +L     F+ IDL  G
Subjt:  PISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFV-KKKDVS----MRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQGATVFSKIDLRSG

Query:  YHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNRLYAKFSKCEFWLD
        +HQ+ + +  I KT+F  + GH+E++ M FGL NA   F                           ++  EH   +++V   L    L  +  KCEF   
Subjt:  YHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDE------------------------QSTEAEHEEHLRMVLEILRTNRLYAKFSKCEFWLD

Query:  QVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFV-----WSKFCE---------------------------DSFQNLKQSLVTATVLTVPDGS
        + +FLGH+V+ DG+  +P +++ + + P+   +++   F+     + KF                             ++F+ LK  ++   +L +PD  
Subjt:  QVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFV-----WSKFCE---------------------------DSFQNLKQSLVTATVLTVPDGS

Query:  GSFVTYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCE
          FV  +DAS   LG VL Q G  +++ SR L  HE NY   + EL  IV+A K +RHYL G +  I +DH+ L +  N KE   +  RW   + +Y  +
Subjt:  GSFVTYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCE

Query:  ILYHLGTMNVVVDALSR
        I Y  G  N V DALSR
Subjt:  ILYHLGTMNVVVDALSR

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein | e-value 1.8e-55 | identity 29.43%
Query:  KANKLLNQGTWSILVSVVDTREAD--------------VFLSSKL--VVRDDPDVFHEEFPGLPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQ
        +A+ L   G +S +VS + + E +              V+L  K   ++R+D      +   +P    +   IE++PG        Y +T    +++   
Subjt:  KANKLLNQGTWSILVSVVDTREAD--------------VFLSSKL--VVRDDPDVFHEEFPGLPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQ

Query:  LQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEF
        +Q+LLD  FI PS SP   PV+ V KKD + RLC+DY+ LNK T+ + + LPR D+L  ++  A +F+ +DL SGYHQ+ ++  D  KT+F    G +E+
Subjt:  LQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEF

Query:  IVMSFGLTNALTVFDEQSTEA----------------------EHEEHLRMVLEILRTNRLYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTT
         VM FGL NA + F     +                       EH +HL  VLE L+   L  K  KC+F  ++  FLG+ +    ++    +   +   
Subjt:  IVMSFGLTNALTVFDEQSTEA----------------------EHEEHLRMVLEILRTNRLYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTT

Query:  P-------------LIQLNRKEAP-----------FV-----WSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGK------V
        P             +I   R+  P           F+     W++  + + + LK +L  + VL   +   ++   +DAS+ G+G VL +         V
Subjt:  P-------------LIQLNRKEAP-----------FV-----WSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGK------V

Query:  VAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSAT
        V Y S+ L+S +KNYP  +LEL  I+ AL  +R+ L+G+   + TDH SL+   N+ E   R +RWL+ +  YD  + Y  G  NVV DA+SR + ++ T
Subjt:  VAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSAT

Query:  LSLDRPVCIKTWR
            RP+  ++W+
Subjt:  LSLDRPVCIKTWR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus | e-value 1.2e-54 | identity 28.35%
Query:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLL----NQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHE
        +H FD I+G D L    A +D  +  ++  P         G++   L +  +S+  N LL      GT  IL S++                + P +F  
Subjt:  MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLL----NQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHE

Query:  EFPGLPPHRKIDFAIELEPGT---VPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKK-----DVSMRLCIDYKQLNKVTVKNKYA
           G+     ++ A++ E  T    PI   SY        +++ Q+ ELL  G IRPS SP+  P+  V KK     +   R+ +D+K+LN VT+ + Y 
Subjt:  EFPGLPPHRKIDFAIELEPGT---VPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKK-----DVSMRLCIDYKQLNKVTVKNKYA

Query:  LPRNDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDEQSTEA------------------------EHEEH
        +P  +     L  A  F+ +DL SG+HQ+ +K+SDIPKT+F    G +EF+ + FGL NA  +F     +                          H ++
Subjt:  LPRNDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDEQSTEA------------------------EHEEH

Query:  LRMVLEILRTNRLYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVT-------------------------------TTPLIQLNR----------
        LR+VL  L    L     K  F   QV FLG++V+ DG+  DP ++  ++                                 PL  L R          
Subjt:  LRMVLEILRTNRLYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVT-------------------------------TTPLIQLNR----------

Query:  -KEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLIQ----QGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLY
          + P    +    SF +LK  L ++ +L  P  +  F   +DAS   +G VL Q    + + +AY SR L   E+NY T + E+  I+++L   R YLY
Subjt:  -KEAPFVWSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLIQ----QGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLY

Query:  GE-KIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSATLSLD
        G   I+++TDH+ L +    +  N + +RW   +++Y+CE++Y  G  NVV DALSR       LS D
Subjt:  GE-KIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSATLSLD

Q99315 Transposon Ty3-G Gag-Pol polyprotein | e-value 1.8e-55 | identity 29.63%
Query:  KANKLLNQGTWSILVSVVDTREAD--------------VFLSSKL--VVRDDPDVFHEEFPGLPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQ
        +A+ L   G +S +VS + + E +              V+L  K   ++R+D      +   +P    +   IE++PG        Y +T    +++   
Subjt:  KANKLLNQGTWSILVSVVDTREAD--------------VFLSSKL--VVRDDPDVFHEEFPGLPPHRKIDFAIELEPGTVPISKASYRMTAAELKKLKVQ

Query:  LQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEF
        +Q+LLD  FI PS SP   PV+ V KKD + RLC+DY+ LNK T+ + + LPR D+L  ++  A +F+ +DL SGYHQ+ ++  D  KT+F    G +E+
Subjt:  LQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQGATVFSKIDLRSGYHQLRIKDSDIPKTSFCLRYGHFEF

Query:  IVMSFGLTNALTVFDEQSTEA----------------------EHEEHLRMVLEILRTNRLYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTT
         VM FGL NA + F     +                       EH +HL  VLE L+   L  K  KC+F  ++  FLG+ +    ++    +   +   
Subjt:  IVMSFGLTNALTVFDEQSTEA----------------------EHEEHLRMVLEILRTNRLYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTT

Query:  P-------------LIQLNRKEAP-----------FV-----WSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGK------V
        P             +I   R+  P           F+     W++  + +   LK +L  + VL   +   ++   +DAS+ G+G VL +         V
Subjt:  P-------------LIQLNRKEAP-----------FV-----WSKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGK------V

Query:  VAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSAT
        V Y S+ L+S +KNYP  +LEL  I+ AL  +R+ L+G+   + TDH SL+   N+ E   R +RWL+ +  YD  + Y  G  NVV DA+SR V ++ T
Subjt:  VAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKELNMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSAT

Query:  LSLDRPVCIKTWR
            RP+  ++W+
Subjt:  LSLDRPVCIKTWR

Arabidopsis top hits (e-value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein | e-value 5.1e-08 | identity 29.55%
Query:  HLRMVLEILRTNRLYAKFSKCEFWLDQVSFLG--HMVSKDGVSVDPAEIETVT-------------------------------TTPLIQLNRKEAPFVW
        HL MVL+I   ++ YA   KC F   Q+++LG  H++S +GVS DPA++E +                                  PL +L +K +   W
Subjt:  HLRMVLEILRTNRLYAKFSKCEFWLDQVSFLG--HMVSKDGVSVDPAEIETVT-------------------------------TTPLIQLNRKEAPFVW

Query:  SKFCEDSFQNLKQSLVTATVLTVPDGSGSFVT
        ++    +F+ LK ++ T  VL +PD    FVT
Subjt:  SKFCEDSFQNLKQSLVTATVLTVPDGSGSFVT


Sequences
CDS sequence
ATGCATGATTTTGATGTGATTCTAGGTATGGATTGGCTAGCTTCTAATCATGCTAGTATAGATTGTTCCCATAGAGAGGTGGTGTTTAATCCTTCTAAAGGGACCAATTT
TAAGTTTAAAGGGGTAAGAAAAATAGCATTACCCAAAGTGATCTCAAGTATGAAAGCCAATAAACTACTTAACCAGGGTACTTGGAGTATCTTGGTCAGTGTGGTGGATA
CTAGAGAGGCTGATGTTTTCTTGTCATCGAAACTTGTGGTGAGGGATGATCCAGATGTTTTTCATGAAGAATTTCCAGGATTACCCCCTCACAGGAAGATTGATTTTGCT
ATCGAGCTGGAACCCGGTACTGTTCCTATATCTAAAGCTTCTTACAGAATGACCGCAGCAGAGTTGAAAAAGCTGAAAGTGCAGTTGCAAGAGTTGCTTGACAAAGGCTT
CATTCGACCGAGTGTGTCACCTTGGGGTGACCCAGTCTTGTTTGTTAAAAAGAAGGATGTGTCGATGCGCCTGTGTATTGATTACAAGCAGTTGAATAAGGTAACTGTTA
AGAACAAGTATGCTTTGCCCAGAAATGATGATCTATTTGACCAATTGCAGGGAGCTACAGTGTTCTCTAAGATCGACCTTCGATCAGGATATCATCAGCTGAGGATTAAG
GATAGTGATATACCGAAGACATCCTTTTGTTTGAGATATGGGCATTTTGAGTTCATTGTGATGTCCTTTGGATTGACGAATGCTCTAACAGTATTTGATGAACAGAGTAC
AGAGGCAGAGCATGAAGAGCATTTACGCATGGTTCTAGAGATCCTTCGAACCAATAGACTATATGCAAAGTTTTCAAAATGTGAGTTTTGGTTGGATCAGGTATCGTTTC
TAGGCCATATGGTTTCTAAAGATGGTGTTTCTGTGGATCCAGCTGAGATAGAAACTGTTACCACTACTCCCCTCATTCAGTTGAACAGGAAAGAAGCTCCATTTGTTTGG
AGTAAATTCTGTGAGGATAGTTTTCAGAACCTTAAACAGAGCCTCGTTACTGCAACGGTTCTTACCGTACCTGATGGTTCAGGGAGTTTTGTGACTTACAGTGATGCTTC
TCAGAAAGGTTTGGGTTGTGTTCTGATACAGCAAGGTAAGGTAGTTGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGAAGAATTACCCTACACATGATTTAGAGTTGG
CAACAATAGTTTTTGCACTGAAGATATGGAGACATTACTTGTATGGTGAGAAGATACAGATCTTTACGGACCATAAAAGCTTGATATACTTCTTCAATCAGAAGGAGTTG
AATATGAGACAGCGAAGATGGCTTGAATTAGTAAAAGATTATGATTGTGAGATATTGTATCATCTAGGTACGATGAATGTGGTAGTTGATGCTCTTAGTAGGAAGGTATC
ACATTCAGCAACACTCTCACTAGACAGGCCCGTTTGCATCAAGACTTGGAGAGAGCTGAGATTGCGGTGTCAGTAG
mRNA sequence
ATGCATGATTTTGATGTGATTCTAGGTATGGATTGGCTAGCTTCTAATCATGCTAGTATAGATTGTTCCCATAGAGAGGTGGTGTTTAATCCTTCTAAAGGGACCAATTT
TAAGTTTAAAGGGGTAAGAAAAATAGCATTACCCAAAGTGATCTCAAGTATGAAAGCCAATAAACTACTTAACCAGGGTACTTGGAGTATCTTGGTCAGTGTGGTGGATA
CTAGAGAGGCTGATGTTTTCTTGTCATCGAAACTTGTGGTGAGGGATGATCCAGATGTTTTTCATGAAGAATTTCCAGGATTACCCCCTCACAGGAAGATTGATTTTGCT
ATCGAGCTGGAACCCGGTACTGTTCCTATATCTAAAGCTTCTTACAGAATGACCGCAGCAGAGTTGAAAAAGCTGAAAGTGCAGTTGCAAGAGTTGCTTGACAAAGGCTT
CATTCGACCGAGTGTGTCACCTTGGGGTGACCCAGTCTTGTTTGTTAAAAAGAAGGATGTGTCGATGCGCCTGTGTATTGATTACAAGCAGTTGAATAAGGTAACTGTTA
AGAACAAGTATGCTTTGCCCAGAAATGATGATCTATTTGACCAATTGCAGGGAGCTACAGTGTTCTCTAAGATCGACCTTCGATCAGGATATCATCAGCTGAGGATTAAG
GATAGTGATATACCGAAGACATCCTTTTGTTTGAGATATGGGCATTTTGAGTTCATTGTGATGTCCTTTGGATTGACGAATGCTCTAACAGTATTTGATGAACAGAGTAC
AGAGGCAGAGCATGAAGAGCATTTACGCATGGTTCTAGAGATCCTTCGAACCAATAGACTATATGCAAAGTTTTCAAAATGTGAGTTTTGGTTGGATCAGGTATCGTTTC
TAGGCCATATGGTTTCTAAAGATGGTGTTTCTGTGGATCCAGCTGAGATAGAAACTGTTACCACTACTCCCCTCATTCAGTTGAACAGGAAAGAAGCTCCATTTGTTTGG
AGTAAATTCTGTGAGGATAGTTTTCAGAACCTTAAACAGAGCCTCGTTACTGCAACGGTTCTTACCGTACCTGATGGTTCAGGGAGTTTTGTGACTTACAGTGATGCTTC
TCAGAAAGGTTTGGGTTGTGTTCTGATACAGCAAGGTAAGGTAGTTGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGAAGAATTACCCTACACATGATTTAGAGTTGG
CAACAATAGTTTTTGCACTGAAGATATGGAGACATTACTTGTATGGTGAGAAGATACAGATCTTTACGGACCATAAAAGCTTGATATACTTCTTCAATCAGAAGGAGTTG
AATATGAGACAGCGAAGATGGCTTGAATTAGTAAAAGATTATGATTGTGAGATATTGTATCATCTAGGTACGATGAATGTGGTAGTTGATGCTCTTAGTAGGAAGGTATC
ACATTCAGCAACACTCTCACTAGACAGGCCCGTTTGCATCAAGACTTGGAGAGAGCTGAGATTGCGGTGTCAGTAG
Protein sequence
MHDFDVILGMDWLASNHASIDCSHREVVFNPSKGTNFKFKGVRKIALPKVISSMKANKLLNQGTWSILVSVVDTREADVFLSSKLVVRDDPDVFHEEFPGLPPHRKIDFA
IELEPGTVPISKASYRMTAAELKKLKVQLQELLDKGFIRPSVSPWGDPVLFVKKKDVSMRLCIDYKQLNKVTVKNKYALPRNDDLFDQLQGATVFSKIDLRSGYHQLRIK
DSDIPKTSFCLRYGHFEFIVMSFGLTNALTVFDEQSTEAEHEEHLRMVLEILRTNRLYAKFSKCEFWLDQVSFLGHMVSKDGVSVDPAEIETVTTTPLIQLNRKEAPFVW
SKFCEDSFQNLKQSLVTATVLTVPDGSGSFVTYSDASQKGLGCVLIQQGKVVAYASRQLKSHEKNYPTHDLELATIVFALKIWRHYLYGEKIQIFTDHKSLIYFFNQKEL
NMRQRRWLELVKDYDCEILYHLGTMNVVVDALSRKVSHSATLSLDRPVCIKTWRELRLRCQ
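
The protein sequence listed here is the conceptual translation of the CDS. The sketch below shows how that relationship could be checked. It assumes the standard genetic code (NCBI translation table 1) applies to this nuclear gene and that the CDS contains only unambiguous A/C/G/T bases; `translate` is a hypothetical helper written for illustration, not a function from any particular library.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Raises KeyError on ambiguous bases such as N (not handled here).
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 and last 24 nucleotides of the CDS listed for Cmc01g0023601.
print(translate("ATGCATGATTTTGATGTGATTCTAGGT"))  # MHDFDVILG
print(translate("GAGCTGAGATTGCGGTGTCAGTAG"))     # ELRLRCQ
```

Both fragments agree with the start (MHDFDVILG) and end (ELRLRCQ) of the protein sequence, and the final TAG codon is read as the stop.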