; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc06g0172031 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc06g0172031
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionReverse transcriptase
Genome locationCMiso1.1chr06:27834212..27834997
RNA-Seq ExpressionCmc06g0172031
SyntenyCmc06g0172031
Gene Ontology termsGO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026271.1 pol protein [Cucumis melo var. makuwa]7.9e-14096.55Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQ LR NKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAKIEAVT W
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLT+PDGSGSFVIYSDASKKGLGCVL+QQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR

KAA0042119.1 pol protein [Cucumis melo var. makuwa]2.7e-14096.55Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQ LR NKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAKIEAVTSW
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAP+LT+PDGSGSFVIYSDAS+KGLGCVL+QQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR

KAA0048687.1 pol protein [Cucumis melo var. makuwa]2.7e-14096.93Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQ LR NKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAKIEAVT W
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLT+PDGSGSFVIYSDASKKGLGCVL+QQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR

KAA0053412.1 pol protein [Cucumis melo var. makuwa]7.9e-14096.55Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        MSFGLTNA AVFMDLMNRVFREFLDTF+IVFIDDILIYSKTEAEHEEHLRMVLQ LR NKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAKIEAVTSW
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLT+PDGSGSFVIYSDASKKGLGCVL+QQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+R
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR

KAA0062719.1 pol protein [Cucumis melo var. makuwa]5.5e-14197.7Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQ LR NKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAKIEAVTSW
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLT+PDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR

TrEMBL top hitse value%identityAlignment
A0A5A7SPZ2 Pol protein3.8e-14096.55Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLR+VLQ LR NKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAKIEAVT W
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLT+PDGSGSFVIYSDASKKGLGCVL+QQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR

A0A5A7TLA3 Pol protein1.3e-14096.55Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQ LR NKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAKIEAVTSW
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAP+LT+PDGSGSFVIYSDAS+KGLGCVL+QQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR

A0A5A7U330 Reverse transcriptase1.3e-14096.93Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQ LR NKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAKIEAVT W
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLT+PDGSGSFVIYSDASKKGLGCVL+QQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR

A0A5A7UCD5 Reverse transcriptase3.8e-14096.55Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        MSFGLTNA AVFMDLMNRVFREFLDTF+IVFIDDILIYSKTEAEHEEHLRMVLQ LR NKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAKIEAVTSW
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLT+PDGSGSFVIYSDASKKGLGCVL+QQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQ+R
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR

A0A5A7VAL8 Pol protein2.6e-14197.7Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        MSFGLTNA AVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQ LR NKLYAKFSKCEFWLKQ+SFLGHVVSKAGVS+DPAKIEAVTSW
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA
        TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLT+PDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYA

Query:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
        SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
Subjt:  SRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.66.7e-4938.93Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        M FGL NA A F   MN + R  L+   +V++DDI+++S +  EH + L +V + L    L  +  KCEF  ++ +FLGHV++  G+  +P KIEA+  +
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDS-FQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAY
          P+   E+++FLGL GYYR+F+ NF+ IA P+T+  +K      +    DS F+ LK  +   P+L +PD +  F + +DAS   LG VL Q G  ++Y
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDS-FQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAY

Query:  ASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
         SR L  HE NY T + EL A+V+A K +RHYL G   +I +DH+ L + +  K+ N +  R
Subjt:  ASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR

P0CT41 Transposon Tf2-12 polyprotein1.3e-3935.18Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        M +G++ A A F   +N +  E  ++ V+ ++DDILI+SK+E+EH +H++ VLQ L+   L    +KCEF   Q+ F+G+ +S+ G +     I+ V  W
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGK-----
         +P    E+R FLG   Y R+F+   S++  PL  L +K   + W+     + +N+KQ LV+ PVL   D S   ++ +DAS   +G VL Q+       
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGK-----

Query:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSL
         V Y S ++   + NY   D E+ A++ +LK WRHYL    E  +I TDH++L
Subjt:  VVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYG--EKIQIFTDHKSL

P10401 Retrovirus-related Pol polyprotein from transposon gypsy1.0e-4436.13Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        + FGL NAS++F   ++ V RE +     V++DD++I+S+ E++H  H+  VL+ L    +     K  F+ + + +LG +VSK G   DP K++A+  +
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR-----------KGAPFVWSKACEDSFQNLKQKLVTAPV-LTLPDGSGSFVIYSDASKKGLGC
          P  V +VRSFLGLA YYR F+++F+ IA P+T + +           K  P  +++   ++FQ L+  L +  V L  PD    F + +DAS  G+G 
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR-----------KGAPFVWSKACEDSFQNLKQKLVTAPV-LTLPDGSGSFVIYSDASKKGLGC

Query:  VLIQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEK-IQIFTDHKSLKYFFTQKELNMRQRR
        VL Q+G+ +   SR LK  EQNY T++ EL A+V+AL   +++LYG + I IFTDH+ L +    +  N + +R
Subjt:  VLIQQGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEK-IQIFTDHKSLKYFFTQKELNMRQRR

P20825 Retrovirus-related Pol polyprotein from transposon 2973.0e-4938.55Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        M FGL NA A F   MN + R  L+   +V++DDI+I+S +  EH   +++V   L    L  +  KCEF  K+ +FLGH+V+  G+  +P K++A+ S+
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSK-ACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAY
          P+   E+R+FLGL GYYR+F+ N++ IA P+T   +K       K    ++F+ LK  ++  P+L LPD    FV+ +DAS   LG VL Q G  +++
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSK-ACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAY

Query:  ASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR
         SR L  HE NY   + EL A+V+A K +RHYL G +  I +DH+ L++    KE   +  R
Subjt:  ASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus8.7e-4132.85Show/hide
Query:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW
        + FGL NA A+F  +++ + RE +     V+IDDI+++S+    H ++LR+VL  L    L     K  F   Q+ FLG++V+  G+  DP K+ A++  
Subjt:  MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSW

Query:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR-----------KGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCV
          P++V E++ FLG+  YYR+F+++++++A PLT LTR              P    +    SF +LK  L ++ +L  P  +  F + +DAS   +G V
Subjt:  TRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTR-----------KGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCV

Query:  LIQ----QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELNMRQRR
        L Q    + + +AY SR L   E+NY T + E+ A++++L   R YLYG   I+++TDH+ L +    +  N + +R
Subjt:  LIQ----QGKVVAYASRQLKSHEQNYPTHDLELAAVVFALKIWRHYLYGE-KIQIFTDHKSLKYFFTQKELNMRQRR

Arabidopsis top hitse value%identityAlignment
ATMG00860.1 DNA/RNA polymerases superfamily protein1.1e-2746.56Show/hide
Query:  HLRMVLQILRGNKLYAKFSKCEFWLKQMSFLG--HVVSKAGVSMDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW
        HL MVLQI   ++ YA   KC F   Q+++LG  H++S  GVS DPAK+EA+  W  P   +E+R FLGL GYYRRFV+N+ +I  PLT+L +K +   W
Subjt:  HLRMVLQILRGNKLYAKFSKCEFWLKQMSFLG--HVVSKAGVSMDPAKIEAVTSWTRPSTVSEVRSFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVW

Query:  SKACEDSFQNLKQKLVTAPVLTLPDGSGSFV
        ++    +F+ LK  + T PVL LPD    FV
Subjt:  SKACEDSFQNLKQKLVTAPVLTLPDGSGSFV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTTGGTTTGACGAATGCTTCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATTGTGTTTATTGATGATATCTTGAT
ATACTCCAAGACGGAAGCCGAGCATGAGGAGCATTTACGTATGGTTTTGCAAATACTTCGGGGTAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGC
AGATGTCCTTTTTAGGCCATGTGGTTTCTAAGGCTGGAGTCTCTATGGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGACTCGACCTTCCACAGTCAGTGAGGTTCGT
AGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAA
GGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTTTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGA
AGGGTTTGGGTTGTGTTTTGATACAGCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTAGAGTTGGCAGCA
GTGGTTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGGGAAAAGATACAGATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATAT
GAGACAGCGAAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTTTGGTTTGACGAATGCTTCGGCAGTGTTTATGGACTTGATGAACAGAGTGTTTAGGGAGTTCCTAGATACTTTTGTGATTGTGTTTATTGATGATATCTTGAT
ATACTCCAAGACGGAAGCCGAGCATGAGGAGCATTTACGTATGGTTTTGCAAATACTTCGGGGTAATAAGTTGTATGCAAAGTTCTCGAAATGCGAGTTTTGGCTGAAGC
AGATGTCCTTTTTAGGCCATGTGGTTTCTAAGGCTGGAGTCTCTATGGATCCAGCTAAGATAGAGGCAGTCACCAGTTGGACTCGACCTTCCACAGTCAGTGAGGTTCGT
AGCTTTCTGGGTTTAGCAGGTTATTATCGACGGTTTGTGGAGAACTTTTCTCGTATAGCTACTCCTCTTACTCAGTTGACCAGGAAGGGAGCTCCTTTTGTTTGGAGCAA
GGCATGTGAGGACAGTTTCCAGAACCTTAAACAGAAGCTAGTTACCGCACCGGTTCTTACTTTACCTGATGGTTCTGGCAGTTTTGTGATTTATAGTGATGCTTCCAAGA
AGGGTTTGGGTTGTGTTTTGATACAGCAAGGTAAGGTAGTCGCTTATGCTTCTCGTCAGTTGAAGAGTCATGAGCAGAACTACCCTACACATGATTTAGAGTTGGCAGCA
GTGGTTTTTGCTTTGAAAATATGGAGGCATTACTTATATGGGGAAAAGATACAGATCTTCACGGATCATAAGAGCTTGAAATACTTCTTTACTCAGAAGGAATTGAATAT
GAGACAGCGAAGATGA
Protein sequenceShow/hide protein sequence
MSFGLTNASAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQILRGNKLYAKFSKCEFWLKQMSFLGHVVSKAGVSMDPAKIEAVTSWTRPSTVSEVR
SFLGLAGYYRRFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQNLKQKLVTAPVLTLPDGSGSFVIYSDASKKGLGCVLIQQGKVVAYASRQLKSHEQNYPTHDLELAA
VVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRR