CuGenDBv2

Cmc09g0247101 (gene) of Melon (Charmono) v1.1 genome

Gene ID: Cmc09g0247101
Organism: Cucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
Description: Reverse transcriptase
Genome location: CMiso1.1chr09:11499389..11500290
RNA-Seq Expression: Cmc09g0247101
Synteny: Cmc09g0247101
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016020 - membrane (cellular component)
GO:0043227 - membrane-bounded organelle (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
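
The Genome location field in the record above packs a sequence name and a coordinate pair into one string. The following is a minimal sketch of how such a string could be split and its span computed. The names `Location` and `parse_location` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string of the form 'seqid:start..end'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("CMiso1.1chr09:11499389..11500290")
print(loc.seqid, loc.start, loc.end, loc.span)
# CMiso1.1chr09 11499389 11500290 902
```

Under that assumption the gene spans 902 bp on chromosome 9.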


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0035365.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] | e-value: 4.5e-113 | % identity: 79.34
Query:  MSFGLTNAPTVFMNLMNKVF--FLD-------------------------HVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRI
        MSFGLTNAP VFM+LMN+VF  FLD                         HVVSK GV VDPAKIEA+TSWPRPSTVSEVRSF+GLAGYY RF+E+FSRI
Subjt:  MSFGLTNAPTVFMNLMNKVF--FLD-------------------------HVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRI

Query:  ATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWR
        ATP TQLTRKGAPFVWSKAC+DSFQNLKQKLVTA +L +PD SGSFVIYSDASKKGL  VLMQQGKVVTYASRQLKSHE+NY THDLELA VVF LKIWR
Subjt:  ATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWR

Query:  HYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK
        HYLYGEKIQIF  HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHP KAN+V DALSRKVSHSAALIT+
Subjt:  HYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK

KAA0036553.1 pol protein [Cucumis melo var. makuwa] | e-value: 2.2e-112 | % identity: 76.9
Query:  MSFGLTNAPTVFMNLMNKVF---------------------------------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFM
        MSFGLTNAP VFM+LMN+VF                                 FL HVVSK GV VDPAKIEA+T W RPSTVSE RSFLGLAGYY RF+
Subjt:  MSFGLTNAPTVFMNLMNKVF---------------------------------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFM

Query:  EDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVF
        E+FS IATP TQLTRKGAPFVWSKAC+DSFQNLKQKLVTA +LT+PD SGSFVIYSDASKKGL  VLMQQGKVV YASRQLKSHE+NY THDLELAAVVF
Subjt:  EDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVF

Query:  VLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK
         LKIWRHYLYGEKIQIF DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHP KAN+V DALSRKVSHSAALIT+
Subjt:  VLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK

KAA0051368.1 pol protein [Cucumis melo var. makuwa] | e-value: 1.6e-113 | % identity: 77.62
Query:  MSFGLTNAPTVFMNLMNKVF---------------------------------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFM
        MSFGLTNAP VFM+LMN+VF                                 FL HVVSK GV VDPAKIEA+T W RPSTVSEVRSFLGLAGYY RF+
Subjt:  MSFGLTNAPTVFMNLMNKVF---------------------------------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFM

Query:  EDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVF
        E+FSRIATP TQLTRKGAPFVWSKAC+DSFQNLKQKLVTA +LT+PD SGSFVIYSDASKKGL  VLMQQGKVV YASRQLKSHE+NY THDLELAAVVF
Subjt:  EDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVF

Query:  VLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK
         LKIWRHYLYGEKIQIF DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHP KAN+V DALSRKVSHSAALIT+
Subjt:  VLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK

KAA0067180.1 pol protein [Cucumis melo var. makuwa] | e-value: 1.8e-114 | % identity: 84.19
Query:  MSFGLTNAPTVFMNLMNKVF---------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSK
        MSFGLTNAP VFM+LMN+VF         FL HVVSK GV VD AKIEA+TSWPRPSTVSEVRSFLGLAGYY RF+E+FSRIATP TQLTRKGAPFVWSK
Subjt:  MSFGLTNAPTVFMNLMNKVF---------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSK

Query:  ACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLK
        AC+DSFQNLKQKLVTA    +PD SGSFVIYSDASKKGL  VLMQQGKVV YASRQLKSHE+NY THDLELAAVVF LK WRHYLYGEKIQIF DHKSLK
Subjt:  ACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLK

Query:  YFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK
        YFFTQKELNMRQRRWLELVKDYDCEILYHP KAN+V DALSRKVSHSAALIT+
Subjt:  YFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK

TYK26028.1 pol protein [Cucumis melo var. makuwa] | e-value: 2.8e-115 | % identity: 84.58
Query:  MSFGLTNAPTVFMNLMNKVF---------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSK
        MSFGLTNAP VFM+LMN+VF         FL HVVSK GV VD AKIEA+TSWPRPSTVSEVRSFLGLAGYY RF+E+FSRIATP TQLTRKGAPFVWSK
Subjt:  MSFGLTNAPTVFMNLMNKVF---------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSK

Query:  ACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLK
        AC+DSFQNLKQKLVTA    +PD SGSFVIYSDASKKGL  VLMQQGKVV YASRQLKSHE+NYLTHDLELAAVVF LK WRHYLYGEKIQIF DHKSLK
Subjt:  ACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLK

Query:  YFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK
        YFFTQKELNMRQRRWLELVKDYDCEILYHP KAN+V DALSRKVSHSAALIT+
Subjt:  YFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7T0Y9 Reverse transcriptase | e-value: 1.1e-112 | % identity: 76.9
Query:  MSFGLTNAPTVFMNLMNKVF---------------------------------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFM
        MSFGLTNAP VFM+LMN+VF                                 FL HVVSK GV VDPAKIEA+T W RPSTVSE RSFLGLAGYY RF+
Subjt:  MSFGLTNAPTVFMNLMNKVF---------------------------------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFM

Query:  EDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVF
        E+FS IATP TQLTRKGAPFVWSKAC+DSFQNLKQKLVTA +LT+PD SGSFVIYSDASKKGL  VLMQQGKVV YASRQLKSHE+NY THDLELAAVVF
Subjt:  EDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVF

Query:  VLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK
         LKIWRHYLYGEKIQIF DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHP KAN+V DALSRKVSHSAALIT+
Subjt:  VLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK

A0A5A7T1P4 Reverse transcriptase | e-value: 2.2e-113 | % identity: 79.34
Query:  MSFGLTNAPTVFMNLMNKVF--FLD-------------------------HVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRI
        MSFGLTNAP VFM+LMN+VF  FLD                         HVVSK GV VDPAKIEA+TSWPRPSTVSEVRSF+GLAGYY RF+E+FSRI
Subjt:  MSFGLTNAPTVFMNLMNKVF--FLD-------------------------HVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRI

Query:  ATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWR
        ATP TQLTRKGAPFVWSKAC+DSFQNLKQKLVTA +L +PD SGSFVIYSDASKKGL  VLMQQGKVVTYASRQLKSHE+NY THDLELA VVF LKIWR
Subjt:  ATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWR

Query:  HYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK
        HYLYGEKIQIF  HKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHP KAN+V DALSRKVSHSAALIT+
Subjt:  HYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK

A0A5A7U7V9 Reverse transcriptase | e-value: 7.5e-114 | % identity: 77.62
Query:  MSFGLTNAPTVFMNLMNKVF---------------------------------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFM
        MSFGLTNAP VFM+LMN+VF                                 FL HVVSK GV VDPAKIEA+T W RPSTVSEVRSFLGLAGYY RF+
Subjt:  MSFGLTNAPTVFMNLMNKVF---------------------------------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFM

Query:  EDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVF
        E+FSRIATP TQLTRKGAPFVWSKAC+DSFQNLKQKLVTA +LT+PD SGSFVIYSDASKKGL  VLMQQGKVV YASRQLKSHE+NY THDLELAAVVF
Subjt:  EDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVF

Query:  VLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK
         LKIWRHYLYGEKIQIF DHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHP KAN+V DALSRKVSHSAALIT+
Subjt:  VLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK

A0A5A7VIZ4 Pol protein | e-value: 8.9e-115 | % identity: 84.19
Query:  MSFGLTNAPTVFMNLMNKVF---------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSK
        MSFGLTNAP VFM+LMN+VF         FL HVVSK GV VD AKIEA+TSWPRPSTVSEVRSFLGLAGYY RF+E+FSRIATP TQLTRKGAPFVWSK
Subjt:  MSFGLTNAPTVFMNLMNKVF---------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSK

Query:  ACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLK
        AC+DSFQNLKQKLVTA    +PD SGSFVIYSDASKKGL  VLMQQGKVV YASRQLKSHE+NY THDLELAAVVF LK WRHYLYGEKIQIF DHKSLK
Subjt:  ACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLK

Query:  YFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK
        YFFTQKELNMRQRRWLELVKDYDCEILYHP KAN+V DALSRKVSHSAALIT+
Subjt:  YFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK

A0A5D3DR46 Pol protein | e-value: 1.4e-115 | % identity: 84.58
Query:  MSFGLTNAPTVFMNLMNKVF---------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSK
        MSFGLTNAP VFM+LMN+VF         FL HVVSK GV VD AKIEA+TSWPRPSTVSEVRSFLGLAGYY RF+E+FSRIATP TQLTRKGAPFVWSK
Subjt:  MSFGLTNAPTVFMNLMNKVF---------FLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSK

Query:  ACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLK
        AC+DSFQNLKQKLVTA    +PD SGSFVIYSDASKKGL  VLMQQGKVV YASRQLKSHE+NYLTHDLELAAVVF LK WRHYLYGEKIQIF DHKSLK
Subjt:  ACKDSFQNLKQKLVTASILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLK

Query:  YFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK
        YFFTQKELNMRQRRWLELVKDYDCEILYHP KAN+V DALSRKVSHSAALIT+
Subjt:  YFFTQKELNMRQRRWLELVKDYDCEILYHPSKANIVVDALSRKVSHSAALITK

SwissProt top hits (e-value, % identity, alignment)
P04323 Retrovirus-related Pol polyprotein from transposon 17.6 | e-value: 2.2e-38 | % identity: 39.09
Query:  LMNKVFFLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSKACKDS-FQNLKQKLVTASILTIP
        L  +  FL HV++ DG+  +P KIEAI  +P P+   E+++FLGL GYY +F+ +F+ IA P T+  +K      +    DS F+ LK  +    IL +P
Subjt:  LMNKVFFLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSKACKDS-FQNLKQKLVTASILTIP

Query:  DCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDY
        D +  F + +DAS   L  VL Q G  ++Y SR L  HE NY T + EL A+V+  K +RHYL G   +I  DH+ L + +  K+ N +  RW   + ++
Subjt:  DCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDY

Query:  DCEILYHPSKANIVVDALSR
        D +I Y   K N V DALSR
Subjt:  DCEILYHPSKANIVVDALSR

P10394 Retrovirus-related Pol polyprotein from transposon 412 | e-value: 2.2e-30 | % identity: 33.78
Query:  MNKVFFLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDC
        M++V FL H  +  G+L D  K + I ++P P      R F+    YY RF+++F+  +   T+L +K  PF W+  C+ +F +LK +L+  ++L  PD 
Subjt:  MNKVFFLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDC

Query:  SGSFVIYSDASKKGLSYVLMQQGK----VVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVK
        S  F I +DASK+    VL Q        V YASR     E N  T + ELAA+ + +  +R Y+YG+   +  DH+ L Y F+    + +  R    ++
Subjt:  SGSFVIYSDASKKGLSYVLMQQGK----VVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVK

Query:  DYDCEILYHPSKANIVVDALSR
        +Y+  + Y   K N V DALSR
Subjt:  DYDCEILYHPSKANIVVDALSR

P10401 Retrovirus-related Pol polyprotein from transposon gypsy | e-value: 3.8e-38 | % identity: 37.99
Query:  VFFLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTR-----------KGAPFVWSKACKDSFQNLKQKLVTA
        V +L  +VSKDG   DP K++AI  +P P  V +VRSFLGLA YY  F++DF+ IA P T + +           K  P  +++  +++FQ L+  L + 
Subjt:  VFFLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTR-----------KGAPFVWSKACKDSFQNLKQKLVTA

Query:  S-ILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEK-IQIFMDHKSLKYFFTQKELNMRQRR
          IL  PD    F + +DAS  G+  VL Q+G+ +T  SR LK  E+NY T++ EL A+V+ L   +++LYG + I IF DH+ L +    +  N + +R
Subjt:  S-ILTIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEK-IQIFMDHKSLKYFFTQKELNMRQRR

Query:  WLELVKDYDCEILYHPSKANIVVDALSRK
        W   +  ++ ++ Y P K N V DALSR+
Subjt:  WLELVKDYDCEILYHPSKANIVVDALSRK

P20825 Retrovirus-related Pol polyprotein from transposon 297 | e-value: 1.2e-36 | % identity: 37.73
Query:  LMNKVFFLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSK-ACKDSFQNLKQKLVTASILTIP
        L  +  FL H+V+ DG+  +P K++AI S+P P+   E+R+FLGL GYY +F+ +++ IA P T   +K       K    ++F+ LK  ++   IL +P
Subjt:  LMNKVFFLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSK-ACKDSFQNLKQKLVTASILTIP

Query:  DCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDY
        D    FV+ +DAS   L  VL Q G  +++ SR L  HE NY   + EL A+V+  K +RHYL G +  I  DH+ L++    KE   +  RW   + +Y
Subjt:  DCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDY

Query:  DCEILYHPSKANIVVDALSR
          +I Y   K N V DALSR
Subjt:  DCEILYHPSKANIVVDALSR

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus | e-value: 8.4e-38 | % identity: 34.89
Query:  LMNKVFFLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTR-----------KGAPFVWSKACKDSFQNLKQK
        L  +V FL ++V+ DG+  DP K+ AI+  P P++V E++ FLG+  YY +F++D++++A P T LTR              P    +    SF +LK  
Subjt:  LMNKVFFLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTR-----------KGAPFVWSKACKDSFQNLKQK

Query:  LVTASILTIPDCSGSFVIYSDASKKGLSYVLMQ----QGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGE-KIQIFMDHKSLKYFFTQKE
        L ++ IL  P  +  F + +DAS   +  VL Q    + + + Y SR L   EENY T + E+ A+++ L   R YLYG   I+++ DH+ L +    + 
Subjt:  LVTASILTIPDCSGSFVIYSDASKKGLSYVLMQ----QGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGE-KIQIFMDHKSLKYFFTQKE

Query:  LNMRQRRWLELVKDYDCEILYHPSKANIVVDALSR
         N + +RW   +++Y+CE++Y P K+N+V DALSR
Subjt:  LNMRQRRWLELVKDYDCEILYHPSKANIVVDALSR

Arabidopsis top hits (e-value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein | e-value: 9.0e-19 | % identity: 40.21
Query:  HVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFV
        H++S +GV  DPAK+EA+  WP P   +E+R FLGL GYY RF++++ +I  P T+L +K +   W++    +F+ LK  + T  +L +PD    FV
Subjt:  HVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASILTIPDCSGSFV


Sequences
CDS sequence
ATGTCCTTTGGATTGACAAATGCTCCAACAGTGTTTATGAATTTGATGAACAAAGTATTTTTTCTAGACCATGTGGTTTCTAAAGATGGTGTTTTGGTGGATCCAGCTAA
GATAGAGGCTATTACCAGTTGGCCCCGACCTTCCACAGTTAGTGAAGTTCGTAGCTTTTTAGGTTTAGCAGGTTATTACCATCGGTTTATGGAGGACTTTTCCCGTATAG
CAACTCCTTTTACTCAGTTGACCAGGAAGGGAGCTCCATTTGTTTGGAGCAAGGCATGTAAGGATAGTTTTCAGAACCTTAAACAGAAGCTCGTTACTGCATCGATTCTT
ACTATACCTGATTGTTCAGGGAGTTTTGTGATTTATAGTGATGCTTCTAAGAAAGGTTTGAGTTATGTTCTAATGCAGCAAGGTAAAGTCGTCACTTATGCTTCTCGTCA
GTTGAAGAGTCATGAGGAGAATTACCTTACCCATGATTTAGAGTTGGCAGCAGTAGTTTTTGTACTGAAGATATGGAGACATTATTTGTATGGCGAGAAGATACAAATTT
TTATGGACCATAAAAGCTTGAAATACTTCTTCACTCAAAAGGAGTTGAATATGAGGCAGCGAAGATGGCTTGAATTAGTAAAGGATTATGATTGTGAGATATTGTATCAT
CCAAGTAAGGCAAATATAGTAGTTGATGCCCTTAGTAGGAAGGTATCACACTCAGCAGCACTCATCACCAAATAG
mRNA sequence
ATGTCCTTTGGATTGACAAATGCTCCAACAGTGTTTATGAATTTGATGAACAAAGTATTTTTTCTAGACCATGTGGTTTCTAAAGATGGTGTTTTGGTGGATCCAGCTAA
GATAGAGGCTATTACCAGTTGGCCCCGACCTTCCACAGTTAGTGAAGTTCGTAGCTTTTTAGGTTTAGCAGGTTATTACCATCGGTTTATGGAGGACTTTTCCCGTATAG
CAACTCCTTTTACTCAGTTGACCAGGAAGGGAGCTCCATTTGTTTGGAGCAAGGCATGTAAGGATAGTTTTCAGAACCTTAAACAGAAGCTCGTTACTGCATCGATTCTT
ACTATACCTGATTGTTCAGGGAGTTTTGTGATTTATAGTGATGCTTCTAAGAAAGGTTTGAGTTATGTTCTAATGCAGCAAGGTAAAGTCGTCACTTATGCTTCTCGTCA
GTTGAAGAGTCATGAGGAGAATTACCTTACCCATGATTTAGAGTTGGCAGCAGTAGTTTTTGTACTGAAGATATGGAGACATTATTTGTATGGCGAGAAGATACAAATTT
TTATGGACCATAAAAGCTTGAAATACTTCTTCACTCAAAAGGAGTTGAATATGAGGCAGCGAAGATGGCTTGAATTAGTAAAGGATTATGATTGTGAGATATTGTATCAT
CCAAGTAAGGCAAATATAGTAGTTGATGCCCTTAGTAGGAAGGTATCACACTCAGCAGCACTCATCACCAAATAG
Protein sequence
MSFGLTNAPTVFMNLMNKVFFLDHVVSKDGVLVDPAKIEAITSWPRPSTVSEVRSFLGLAGYYHRFMEDFSRIATPFTQLTRKGAPFVWSKACKDSFQNLKQKLVTASIL
TIPDCSGSFVIYSDASKKGLSYVLMQQGKVVTYASRQLKSHEENYLTHDLELAAVVFVLKIWRHYLYGEKIQIFMDHKSLKYFFTQKELNMRQRRWLELVKDYDCEILYH
PSKANIVVDALSRKVSHSAALITK
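
The protein sequence above corresponds to the CDS sequence read codon by codon. The following is a minimal sketch of that translation, assuming the standard genetic code (the record does not name a translation table). The function `translate` and the variables `cds_start` and `cds_end` are hypothetical names. The two fragments are the first 33 and last 12 nucleotides of the CDS listed in this record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the record's protein was derived with this table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored, so a sequence wrapped over several lines
    can be passed in directly. Codons containing letters other than
    A, C, G, T raise KeyError.
    """
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


cds_start = "ATGTCCTTTGGATTGACAAATGCTCCAACAGTG"  # first 11 codons of the CDS
cds_end = "ATCACCAAATAG"                        # last 4 codons, including TAG

print(translate(cds_start))  # MSFGLTNAPTV
print(translate(cds_end))    # ITK
```

Both fragments agree with the start (MSFGLTNAPTV) and end (ITK) of the listed protein. Passing the full wrapped CDS to `translate` is one way to check the remaining residues.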