CuGenDBv2

Clc08G03385 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc08G03385
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: ClcChr08:8886821..8889519
RNA-Seq Expression: Clc08G03385
Synteny: Clc08G03385
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
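
The genome location above follows a seqid:start..end pattern. As a minimal sketch, the snippet below splits such a string and computes the length of the genomic span. The function names are hypothetical, and treating the coordinates as 1-based and inclusive is an assumption, not something this record states.

```python
import re


def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("ClcChr08:8886821..8889519")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span covers 2699 positions on ClcChr08.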


Homology
GenBank top hits (columns: e-value, % identity, alignment)
XP_016902197.1 PREDICTED: uncharacterized protein LOC107991581 isoform X1 [Cucumis melo] (e-value: 1.5e-40; identity: 35.45%)
Query:  YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEED
        +L  +  CP  FV      N   +E GA    GASSS +T           +  D LLLGWLYNSMT ++A Q+MG+   EDL +A Q  F VQSRAEED
Subjt:  YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEED

Query:  FLRQTFQHTRKVLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNV-------VCSSGA--------------SKNPTRGGFNPNASKGDEE
        FLRQ  Q TRK   GL E YN V+ ++QGKP+I+WL+MQ+KLL +++RL +QN        +  S A                N    G+N     G   
Subjt:  FLRQTFQHTRKVLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNV-------VCSSGA--------------SKNPTRGGFNPNASKGDEE

Query:  MEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLM
           N          GH A VCY+R+ KEF      NR  +  + S + N     P   ++TQN+ PF T  D V+D +WY+DS A+NHVT E +N++N  
Subjt:  MEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLM

Query:  EYG-----------------------------------------------------DNHVYIEFHDNCCLVKDKGTSR
        EY                                                      DNH+YIEFH  CC +KDK T +
Subjt:  EYG-----------------------------------------------------DNHVYIEFHDNCCLVKDKGTSR

XP_016902203.1 PREDICTED: uncharacterized protein LOC107991581 isoform X3 [Cucumis melo] (e-value: 2.9e-39; identity: 38.01%)
Query:  YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEED
        +L  +  CP  FV      N   +E GA    GASSS +T           +  D LLLGWLYNSMT ++A Q+MG+   EDL +A Q  F VQSRAEED
Subjt:  YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEED

Query:  FLRQTFQHTRKVLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNV-------VCSSGA--------------SKNPTRGGFNPNASKGDEE
        FLRQ  Q TRK   GL E YN V+ ++QGKP+I+WL+MQ+KLL +++RL +QN        +  S A                N    G+N     G   
Subjt:  FLRQTFQHTRKVLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNV-------VCSSGA--------------SKNPTRGGFNPNASKGDEE

Query:  MEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLM
           N          GH A VCY+R+ KEF      NR  +  + S + N     P   ++TQN+ PF T  D V+D +WY+DS A+NHVT E +N++N  
Subjt:  MEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLM

Query:  EYGDNHVYIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDI
        EY    +Y E                + +G L+DG YQLE +
Subjt:  EYGDNHVYIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDI

XP_022151683.1 uncharacterized protein LOC111019598 [Momordica charantia] (e-value: 7.4e-51; identity: 31.17%)
Query:  FSYLFGQIECPPMFVNPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQ
        F YL G   CPP  + P  +      + G++SSQ +S          +VVD+LLLGWLYNSM  ++A QVMG+    +L  A+Q+LF VQSRAE D+L+Q
Subjt:  FSYLFGQIECPPMFVNPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQ

Query:  TFQHTRK----------------------------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNVVCSS---GASKNPTRG
         FQ T K                                  VL GL E+YNP+V  +QGK  ++W EM  +LLTY++RL+YQN + S      ++ P+  
Subjt:  TFQHTRK----------------------------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNVVCSS---GASKNPTRG

Query:  GFNPNASKGDEEMEENMMGHIAFVCYHR---YEKEFVPNNNSNRG--TNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTT
          +  + + ++        H +    HR   Y++      N  RG       N T +N+G N    + A  +++  +T  + V+D SWY DS A++HVT 
Subjt:  GFNPNASKGDEEMEENMMGHIAFVCYHR---YEKEFVPNNNSNRG--TNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTT

Query:  EYNNLSNLMEYGDNHVYIEFHDNCCLVK-----------------------------DKGTSRVISKGILKDGLYQLE-------------------DIA
          NN+   ++Y      I  + N   +                              DK + R + KG LKD LY+L+                    + 
Subjt:  EYNNLSNLMEYGDNHVYIEFHDNCCLVK-----------------------------DKGTSRVISKGILKDGLYQLE-------------------DIA

Query:  AIKSLEVAKESKTNQF----------------------NSGIKTIQTDNGGEYIKIHQLCSQLGIQTHMSCPHTSEQNGQAKRKHRRVVEIGLTL-----
        ++ +  ++ E  T  F                      +  + T+Q+D GGEY  IH LC  LGIQ  +S P+TS QNG+A+RKHR +VE GLTL     
Subjt:  AIKSLEVAKESKTNQF----------------------NSGIKTIQTDNGGEYIKIHQLCSQLGIQTHMSCPHTSEQNGQAKRKHRRVVEIGLTL-----

Query:  -PLQFYWDAFTTATQLLNWRPTLVLAGKFLMERAAERKL
          L ++WD F TAT L+N  P +VL  K  ME    ++L
Subjt:  -PLQFYWDAFTTATQLLNWRPTLVLAGKFLMERAAERKL

XP_038905161.1 uncharacterized protein LOC120091275 isoform X1 [Benincasa hispida] (e-value: 5.5e-38; identity: 36.25%)
Query:  SGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRK--------------------
        SGASSS +T+ E        + VDQLLLGWLYNSMT E+A QVMG E  +DL  +I +LF VQSR EED+LR  FQ TRK                    
Subjt:  SGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRK--------------------

Query:  --------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQ---------NVVCSSGASKNPTR----------------------
                      VLLGL E+YN +VA++QG+ +++WL+MQ++LL Y+RRL++Q         N + ++  +   TR                      
Subjt:  --------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQ---------NVVCSSGASKNPTR----------------------

Query:  GGFNPNASKGDEEMEE-----NMMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDSASNHVTT
        GG      +G    +        +GHIAF C++RY ++FVPN+  N+     +N   T N +  PT +     SNPF+T  + + D++WY   ASNHVT+
Subjt:  GGFNPNASKGDEEMEE-----NMMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDSASNHVTT

Query:  EYNNLSNLMEYG--DNHVYIEFHDNCCLVKD
        ++NNL N +EY    N + I      CL  D
Subjt:  EYNNLSNLMEYG--DNHVYIEFHDNCCLVKD

XP_038905164.1 uncharacterized protein LOC120091275 isoform X4 [Benincasa hispida] (e-value: 1.2e-37; identity: 36.98%)
Query:  SGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRK--------------------
        SGASSS +T+ E        + VDQLLLGWLYNSMT E+A QVMG E  +DL  +I +LF VQSR EED+LR  FQ TRK                    
Subjt:  SGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRK--------------------

Query:  --------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQ---------NVVCSSGASKNPTR----------------------
                      VLLGL E+YN +VA++QG+ +++WL+MQ++LL Y+RRL++Q         N + ++  +   TR                      
Subjt:  --------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQ---------NVVCSSGASKNPTR----------------------

Query:  GGFNPNASKGDEEMEE-----NMMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDSASNHVTT
        GG      +G    +        +GHIAF C++RY ++FVPN+  N+     +N   T N +  PT +     SNPF+T  + + D++WY   ASNHVT+
Subjt:  GGFNPNASKGDEEMEE-----NMMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDSASNHVTT

Query:  EYNNLSNLMEY
        ++NNL N +EY
Subjt:  EYNNLSNLMEY

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A1S4E1U6 uncharacterized protein LOC107991581 isoform X1 (e-value: 7.5e-41; identity: 35.45%)
Query:  YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEED
        +L  +  CP  FV      N   +E GA    GASSS +T           +  D LLLGWLYNSMT ++A Q+MG+   EDL +A Q  F VQSRAEED
Subjt:  YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEED

Query:  FLRQTFQHTRKVLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNV-------VCSSGA--------------SKNPTRGGFNPNASKGDEE
        FLRQ  Q TRK   GL E YN V+ ++QGKP+I+WL+MQ+KLL +++RL +QN        +  S A                N    G+N     G   
Subjt:  FLRQTFQHTRKVLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNV-------VCSSGA--------------SKNPTRGGFNPNASKGDEE

Query:  MEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLM
           N          GH A VCY+R+ KEF      NR  +  + S + N     P   ++TQN+ PF T  D V+D +WY+DS A+NHVT E +N++N  
Subjt:  MEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLM

Query:  EYG-----------------------------------------------------DNHVYIEFHDNCCLVKDKGTSR
        EY                                                      DNH+YIEFH  CC +KDK T +
Subjt:  EYG-----------------------------------------------------DNHVYIEFHDNCCLVKDKGTSR

A0A1S4E1V2 uncharacterized protein LOC107991581 isoform X3 (e-value: 1.4e-39; identity: 38.01%)
Query:  YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEED
        +L  +  CP  FV      N   +E GA    GASSS +T           +  D LLLGWLYNSMT ++A Q+MG+   EDL +A Q  F VQSRAEED
Subjt:  YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEED

Query:  FLRQTFQHTRKVLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNV-------VCSSGA--------------SKNPTRGGFNPNASKGDEE
        FLRQ  Q TRK   GL E YN V+ ++QGKP+I+WL+MQ+KLL +++RL +QN        +  S A                N    G+N     G   
Subjt:  FLRQTFQHTRKVLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNV-------VCSSGA--------------SKNPTRGGFNPNASKGDEE

Query:  MEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLM
           N          GH A VCY+R+ KEF      NR  +  + S + N     P   ++TQN+ PF T  D V+D +WY+DS A+NHVT E +N++N  
Subjt:  MEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLM

Query:  EYGDNHVYIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDI
        EY    +Y E                + +G L+DG YQLE +
Subjt:  EYGDNHVYIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDI

A0A5A7SIT7 Uncharacterized protein (e-value: 2.2e-37; identity: 34.92%)
Query:  YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEED
        +L G+  CP  FV      N   +E GA    GASSS +T     S     +  D LLLGWLYNSMT ++A Q+MG+   EDL +A Q  F VQSRAEED
Subjt:  YLFGQIECPPMFV------NPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEED

Query:  FLRQTFQHTRK----------------------------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNVVCSSGASKNPTR
        FLRQ  Q TRK                                  VLLGL E YN V+ ++QGKP+I+WL+MQ+KLL +++ L +QN         N T+
Subjt:  FLRQTFQHTRK----------------------------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNVVCSSGASKNPTR

Query:  G-----------------------GFNPNASKGDEEMEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNS
                                G+N     G      N          GH A VCY+R+ KEF      +R  +  + S + N     P   ++TQN+
Subjt:  G-----------------------GFNPNASKGDEEMEEN--------MMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNS

Query:  NPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLMEYGDNHVYIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDI
         PF T  D V+D +WY+DS A+NHVT E +N++N  EY    +Y E                + +G L+DG YQLE +
Subjt:  NPFMTNTDGVLDSSWYVDS-ASNHVTTEYNNLSNLMEYGDNHVYIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDI

A0A5D3CPY2 Retrotransposon protein, putative, Ty1-copia subclass (e-value: 3.8e-37; identity: 38.89%)
Query:  DQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRKVLLGLVEDY-----NPVVAILQGKPEITWLEMQTKLLTYKRRL
        D LLLGW+YNSMT E+A Q+MG+   +DL EAIQ LF VQSR EEDFLR  FQ TRK     +EDY       V  + Q KP+I+WL+MQ++LL +++RL
Subjt:  DQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQTFQHTRKVLLGLVEDY-----NPVVAILQGKPEITWLEMQTKLLTYKRRL

Query:  DYQNVVCSSGASKNPTRGGFNPNASKGDEEMEENMMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSS
        ++Q                                                    NSN+ + G  ++ T +N     T  + T NSN F+T  + V+DS+
Subjt:  DYQNVVCSSGASKNPTRGGFNPNASKGDEEMEENMMGHIAFVCYHRYEKEFVPNNNSNRGTNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSS

Query:  WYVDS-ASNHVTTEYNNLSNLMEYG-----------DNHVYIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDIAAIKSLEVAKESK
        WYVD+ A+NHVT +Y+NLSN ++Y            DN+VY+EFH + C V +K T R I +G+LKDGLY LE +A +  L+ +   K
Subjt:  WYVDS-ASNHVTTEYNNLSNLMEYG-----------DNHVYIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDIAAIKSLEVAKESK

A0A6J1DCW4 uncharacterized protein LOC111019598 (e-value: 3.6e-51; identity: 31.17%)
Query:  FSYLFGQIECPPMFVNPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQ
        F YL G   CPP  + P  +      + G++SSQ +S          +VVD+LLLGWLYNSM  ++A QVMG+    +L  A+Q+LF VQSRAE D+L+Q
Subjt:  FSYLFGQIECPPMFVNPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEEDFLRQ

Query:  TFQHTRK----------------------------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNVVCSS---GASKNPTRG
         FQ T K                                  VL GL E+YNP+V  +QGK  ++W EM  +LLTY++RL+YQN + S      ++ P+  
Subjt:  TFQHTRK----------------------------------VLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNVVCSS---GASKNPTRG

Query:  GFNPNASKGDEEMEENMMGHIAFVCYHR---YEKEFVPNNNSNRG--TNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTT
          +  + + ++        H +    HR   Y++      N  RG       N T +N+G N    + A  +++  +T  + V+D SWY DS A++HVT 
Subjt:  GFNPNASKGDEEMEENMMGHIAFVCYHR---YEKEFVPNNNSNRG--TNGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDS-ASNHVTT

Query:  EYNNLSNLMEYGDNHVYIEFHDNCCLVK-----------------------------DKGTSRVISKGILKDGLYQLE-------------------DIA
          NN+   ++Y      I  + N   +                              DK + R + KG LKD LY+L+                    + 
Subjt:  EYNNLSNLMEYGDNHVYIEFHDNCCLVK-----------------------------DKGTSRVISKGILKDGLYQLE-------------------DIA

Query:  AIKSLEVAKESKTNQF----------------------NSGIKTIQTDNGGEYIKIHQLCSQLGIQTHMSCPHTSEQNGQAKRKHRRVVEIGLTL-----
        ++ +  ++ E  T  F                      +  + T+Q+D GGEY  IH LC  LGIQ  +S P+TS QNG+A+RKHR +VE GLTL     
Subjt:  AIKSLEVAKESKTNQF----------------------NSGIKTIQTDNGGEYIKIHQLCSQLGIQTHMSCPHTSEQNGQAKRKHRRVVEIGLTL-----

Query:  -PLQFYWDAFTTATQLLNWRPTLVLAGKFLMERAAERKL
          L ++WD F TAT L+N  P +VL  K  ME    ++L
Subjt:  -PLQFYWDAFTTATQLLNWRPTLVLAGKFLMERAAERKL

SwissProt top hits (columns: e-value, % identity, alignment)
Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 7.0e-12; identity: 39.29%)
Query:  LYQLEDIAAIKSLEVA-KESKTNQFNSGIKTIQTDNGGEYIKIHQLCSQLGIQTHMSCPHTSEQNGQAKRKHRRVVEIGLTL------PLQFYWDAFTTA
        LY L+  + +K   +  K    N+F + I T  +DNGGE++ + +  SQ GI    S PHT E NG ++RKHR +VE GLTL      P  ++  AF  A
Subjt:  LYQLEDIAAIKSLEVA-KESKTNQFNSGIKTIQTDNGGEYIKIHQLCSQLGIQTHMSCPHTSEQNGQAKRKHRRVVEIGLTL------PLQFYWDAFTTA

Query:  TQLLNWRPTLVL
          L+N  PT +L
Subjt:  TQLLNWRPTLVL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 1.1e-12; identity: 39.29%)
Query:  LYQLEDIAAIK-SLEVAKESKTNQFNSGIKTIQTDNGGEYIKIHQLCSQLGIQTHMSCPHTSEQNGQAKRKHRRVVEIGLTL------PLQFYWDAFTTA
        LY L+  + +K +  + K    N+F + I T+ +DNGGE++ +    SQ GI    S PHT E NG ++RKHR +VE+GLTL      P  ++  AF+ A
Subjt:  LYQLEDIAAIK-SLEVAKESKTNQFNSGIKTIQTDNGGEYIKIHQLCSQLGIQTHMSCPHTSEQNGQAKRKHRRVVEIGLTL------PLQFYWDAFTTA

Query:  TQLLNWRPTLVL
          L+N  PT +L
Subjt:  TQLLNWRPTLVL

Arabidopsis top hits (columns: e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCCAACGCCAATTCCTCCGCTAACAATGGTGCCCGAAATTTCAGCTACCTCTTTGGGCAAATTGAGTGCCCTCCTATGTTCGTGAATCCACCCGCCAGTGAAATCGG
AGCCAAAGTTATGTCTGGAGCATCAAGCTCACAAGTTACTAGCAGCGAGCAACAGTCAGGGTTAAGCTTGGCACTTGTCGTTGATCAACTACTACTTGGGTGGCTTTACA
ACTCCATGACACTAGAAATAGCCACTCAGGTAATGGGATATGAGAAACCGGAGGACCTTTTGGAAGCAATCCAGAAGTTGTTCGATGTTCAATCGAGAGCCGAAGAGGAT
TTTCTCAGGCAAACGTTCCAACATACCAGAAAAGTCCTTTTAGGGCTTGTTGAAGATTATAATCCAGTAGTTGCTATACTACAAGGAAAACCTGAAATAACGTGGCTTGA
GATGCAAACGAAGCTGTTGACCTATAAAAGACGACTGGACTACCAGAATGTTGTTTGTTCCAGCGGAGCCAGCAAAAATCCTACGCGTGGTGGATTCAATCCAAATGCAA
GTAAAGGGGACGAGGAAATGGAAGAGAACATGATGGGTCACATAGCCTTTGTTTGCTATCATAGATATGAAAAGGAATTTGTCCCCAACAACAATAGTAACAGAGGAACG
AATGGGGGGAGCAACTCTACCACAACTAATAATGGCAAGAATGCTCCAACAACTATGATGGCTACACAAAATAGTAATCCTTTCATGACTAATACTGATGGTGTGCTCGA
CTCAAGCTGGTATGTAGATAGTGCTTCCAACCATGTTACAACCGAATACAATAACCTAAGCAATCTGATGGAGTATGGAGATAATCACGTTTACATTGAATTTCATGATA
ATTGTTGCCTTGTAAAGGACAAGGGTACAAGTCGAGTAATTTCGAAGGGAATTCTTAAGGATGGGTTGTACCAACTGGAAGATATTGCTGCTATCAAGAGTCTTGAAGTT
GCTAAAGAGTCAAAGACGAATCAGTTTAATTCTGGAATTAAAACTATACAAACAGACAATGGTGGAGAGTACATTAAAATTCATCAACTATGCTCTCAATTGGGTATTCA
AACCCATATGTCATGCCCTCACACATCAGAACAAAATGGCCAAGCTAAGCGGAAACACAGACGTGTTGTTGAAATAGGCCTTACTCTACCATTGCAATTCTACTGGGATG
CTTTTACCACAGCAACACAGTTGCTCAATTGGCGGCCAACTCTAGTCCTTGCAGGTAAGTTTCTAATGGAGAGGGCTGCCGAAAGGAAACTCCATGACGAAGAACTCGAA
TTTCATCAAAAACTCGAGCTTGCTCCGGCTATCGGGCTGTCGGGTGAGTTTTGGAGTGGTTGTGTGTGCGGTGTGGCTTGGGCCGTGGCCGGAAGGAAGAACCATTTGAG
GTTATTTTGA
mRNA sequence
ATGGCCAACGCCAATTCCTCCGCTAACAATGGTGCCCGAAATTTCAGCTACCTCTTTGGGCAAATTGAGTGCCCTCCTATGTTCGTGAATCCACCCGCCAGTGAAATCGG
AGCCAAAGTTATGTCTGGAGCATCAAGCTCACAAGTTACTAGCAGCGAGCAACAGTCAGGGTTAAGCTTGGCACTTGTCGTTGATCAACTACTACTTGGGTGGCTTTACA
ACTCCATGACACTAGAAATAGCCACTCAGGTAATGGGATATGAGAAACCGGAGGACCTTTTGGAAGCAATCCAGAAGTTGTTCGATGTTCAATCGAGAGCCGAAGAGGAT
TTTCTCAGGCAAACGTTCCAACATACCAGAAAAGTCCTTTTAGGGCTTGTTGAAGATTATAATCCAGTAGTTGCTATACTACAAGGAAAACCTGAAATAACGTGGCTTGA
GATGCAAACGAAGCTGTTGACCTATAAAAGACGACTGGACTACCAGAATGTTGTTTGTTCCAGCGGAGCCAGCAAAAATCCTACGCGTGGTGGATTCAATCCAAATGCAA
GTAAAGGGGACGAGGAAATGGAAGAGAACATGATGGGTCACATAGCCTTTGTTTGCTATCATAGATATGAAAAGGAATTTGTCCCCAACAACAATAGTAACAGAGGAACG
AATGGGGGGAGCAACTCTACCACAACTAATAATGGCAAGAATGCTCCAACAACTATGATGGCTACACAAAATAGTAATCCTTTCATGACTAATACTGATGGTGTGCTCGA
CTCAAGCTGGTATGTAGATAGTGCTTCCAACCATGTTACAACCGAATACAATAACCTAAGCAATCTGATGGAGTATGGAGATAATCACGTTTACATTGAATTTCATGATA
ATTGTTGCCTTGTAAAGGACAAGGGTACAAGTCGAGTAATTTCGAAGGGAATTCTTAAGGATGGGTTGTACCAACTGGAAGATATTGCTGCTATCAAGAGTCTTGAAGTT
GCTAAAGAGTCAAAGACGAATCAGTTTAATTCTGGAATTAAAACTATACAAACAGACAATGGTGGAGAGTACATTAAAATTCATCAACTATGCTCTCAATTGGGTATTCA
AACCCATATGTCATGCCCTCACACATCAGAACAAAATGGCCAAGCTAAGCGGAAACACAGACGTGTTGTTGAAATAGGCCTTACTCTACCATTGCAATTCTACTGGGATG
CTTTTACCACAGCAACACAGTTGCTCAATTGGCGGCCAACTCTAGTCCTTGCAGGTAAGTTTCTAATGGAGAGGGCTGCCGAAAGGAAACTCCATGACGAAGAACTCGAA
TTTCATCAAAAACTCGAGCTTGCTCCGGCTATCGGGCTGTCGGGTGAGTTTTGGAGTGGTTGTGTGTGCGGTGTGGCTTGGGCCGTGGCCGGAAGGAAGAACCATTTGAG
GTTATTTTGA
Protein sequence
MANANSSANNGARNFSYLFGQIECPPMFVNPPASEIGAKVMSGASSSQVTSSEQQSGLSLALVVDQLLLGWLYNSMTLEIATQVMGYEKPEDLLEAIQKLFDVQSRAEED
FLRQTFQHTRKVLLGLVEDYNPVVAILQGKPEITWLEMQTKLLTYKRRLDYQNVVCSSGASKNPTRGGFNPNASKGDEEMEENMMGHIAFVCYHRYEKEFVPNNNSNRGT
NGGSNSTTTNNGKNAPTTMMATQNSNPFMTNTDGVLDSSWYVDSASNHVTTEYNNLSNLMEYGDNHVYIEFHDNCCLVKDKGTSRVISKGILKDGLYQLEDIAAIKSLEV
AKESKTNQFNSGIKTIQTDNGGEYIKIHQLCSQLGIQTHMSCPHTSEQNGQAKRKHRRVVEIGLTLPLQFYWDAFTTATQLLNWRPTLVLAGKFLMERAAERKLHDEELE
FHQKLELAPAIGLSGEFWSGCVCGVAWAVAGRKNHLRLF
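
The protein sequence above can be cross-checked against the CDS by translating it codon by codon. The sketch below is a minimal illustration that assumes the standard nuclear genetic code (NCBI translation table 1); the function name translate is hypothetical and not part of any CuGenDB tooling. It is applied here only to the first 18 and the last 18 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGCCAACGCCAATTCC"))  # first six codons of the CDS
print(translate("CATTTGAGGTTATTTTGA"))  # last six codons, ending in TGA
```

The two fragments translate to MANANS and HLRLF, which agree with the start and end of the protein sequence shown above.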