; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G12800 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G12800
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionRNA-directed DNA polymerase
Genome locationChr7:11096861..11098309
RNA-Seq ExpressionCSPI07G12800
SyntenyCSPI07G12800
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR041373 - Reverse transcriptase, RNase H-like domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAA7044073.1 unnamed protein product [Microthlaspi erraticum]1.3e-5030.4Show/hide
Query:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSLK-----------------
        +LAL DF + F+V  DA G GIGAVLSQ   P+ +FSEKLSEAR++WSTY+QE Y++ RAL+ WEHYL+ ++FILF +H +LK                 
Subjt:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSLK-----------------

Query:  ------------AGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW-------------------------------------------
                    +G+ N VADALSR+ +LL  L  ++V F+ L  LY+ + +F ++W                                           
Subjt:  ------------AGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW-------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------K
                                                                                                           K
Subjt:  ---------------------------------------------------------------------------------------------------K

Query:  TSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT------------------------------NLVLAQAEFSYNHMKN
        T+DA  I  LFF+EVV+LHG+PK+I+SDRD  FLSHFW TLWR F TT K S+T                                   +A+      KN
Subjt:  TSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT------------------------------NLVLAQAEFSYNHMKN

Query:  QTTGDKE-----FKEGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEY
        +   DK      FKEG+ VM+ LRK R P+G Y KLQ +K  P+ +++K  DN Y +DLP++++I+  FNVADI EY
Subjt:  QTTGDKE-----FKEGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEY

KAA0054760.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]7.4e-5136.26Show/hide
Query:  VVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSLKAGSTNVVADALSRKANLLTILKSQVVAFDS
        +VDACG GIGAVLS+  H IEYFSEKLS +R+ WSTYE ELY+LVRALK          F    + ++ K                              
Subjt:  VVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSLKAGSTNVVADALSRKANLLTILKSQVVAFDS

Query:  LPTLYKDNPDFGQIWKTSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT------------------------------
                       KT+DA+YI NLFF+EVV++H +PKSIVSDRDV FLSHFW+TLW+KF+TT  ++T                               
Subjt:  LPTLYKDNPDFGQIWKTSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT------------------------------

Query:  -NLVLAQAEFSYNHMKNQTTGD-----------------------------------------------------------------KEFKEGELVMIHL
         +L LAQAEF++N+MKN++TG+                                                                  EF +G+LVMIHL
Subjt:  -NLVLAQAEFSYNHMKNQTTGD-----------------------------------------------------------------KEFKEGELVMIHL

Query:  RKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQLQLSN
        RK R P G YNKL+ K++ P+PI++K+GDN +KI LP  IHI+ +FNVAD+  Y  PD  +L++
Subjt:  RKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQLQLSN

KAG7586297.1 Integrase catalytic core [Arabidopsis thaliana x Arabidopsis arenosa]9.6e-5129.86Show/hide
Query:  ALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL--------------------
        AL DF + F+V  DA G GIGAVLSQ   P+ +FSEKLSEAR+KWSTY+QE Y++ RAL+ WEHYL+ ++FILF +H +L                    
Subjt:  ALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL--------------------

Query:  ---------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW---------------------------------------------
                 K+G+ N VADALSR+A+LL  L  ++V F+ +  LY+++ +F ++W                                             
Subjt:  ---------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW---------------------------------------------

Query:  ------------------------------------------------------KTSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKF
                                                              KT+DA  I  LFFKEVV+LHG+PKSI SDRD  FLSHFW TLWR F
Subjt:  ------------------------------------------------------KTSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKF

Query:  NTTFKFSTT-------------------------------NLVLAQAEFSYNHMKNQTTGD---------------------------------------
         T    S+T                               +L L Q EF+YN + +  TG                                        
Subjt:  NTTFKFSTT-------------------------------NLVLAQAEFSYNHMKNQTTGD---------------------------------------

Query:  --------------------------KEFKEGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEY------F
                                  K FKEG+ VM+ LRK R P+G YNKL+ +K  P+ +++K  DN Y + LP+ ++I+  FNVADI EY      +
Subjt:  --------------------------KEFKEGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEY------F

Query:  PPDQLQLSN
        P + L+ S+
Subjt:  PPDQLQLSN

TXG62763.1 hypothetical protein EZV62_009757 [Acer yangbiense]1.6e-5028.25Show/hide
Query:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL------------------
        +LAL  F + FEV  DA G GIGAVLSQ   P+ +FSEKLS+AR+KWSTY+QE Y+++RALK WEHYL+ ++FIL+ +H +L                  
Subjt:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL------------------

Query:  -----------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW-------------------------------------------
                   K+G  N VADALSR+A+LL  ++ +++ F+ L  LY D+ DFG++W                                           
Subjt:  -----------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW-------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------K
                                                                                                           K
Subjt:  ---------------------------------------------------------------------------------------------------K

Query:  TSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT-------------------------------NLVLAQAEFSYNHMK
        TSDA ++  LFF+EVV+LHG+P+SI SDRD  FLSHFW TLWR+ +T  KFS+T                               ++ LAQAEF+YN+  
Subjt:  TSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT-------------------------------NLVLAQAEFSYNHMK

Query:  NQTTG-----------------------------------------------------------------DKEFKEGELVMIHLRKARLPIGKYNKLQSK
        +  TG                                                                 +K F EG+ VM+ LRK R P+G YNKL+ +
Subjt:  NQTTG-----------------------------------------------------------------DKEFKEGELVMIHLRKARLPIGKYNKLQSK

Query:  KLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEY
        K  PY +IKK  +N Y IDLP +++I+  FNVAD++E+
Subjt:  KLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEY

XP_028553745.1 uncharacterized protein LOC114580424 [Dendrobium catenatum]3.2e-5433.66Show/hide
Query:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL------------------
        +LAL DF +PF V  DA   GIGAVL Q   P+E+FSEKL  +R++WS YEQELY++VRALK WEHYLL QDFIL  +H +L                  
Subjt:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL------------------

Query:  -----------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW-------------KTSDAVYITNLFFKEVVQLHGIPKSIVSDR
                   K+G+ N VADALSR+  L+T L++     + L  LY+ + DF + W                  ++   LFFKE+V+LHG+P+S+ SDR
Subjt:  -----------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW-------------KTSDAVYITNLFFKEVVQLHGIPKSIVSDR

Query:  DVNFLSHFWKTLWRKFNTTFKFSTT-----------------NL--------------VLAQAEFSYNHMKNQTTG------------------------
        DV FLSHFW+ LW++F T  + S++                 NL              VL QAEF++N M N++TG                        
Subjt:  DVNFLSHFWKTLWRKFNTTFKFSTT-----------------NL--------------VLAQAEFSYNHMKNQTTG------------------------

Query:  ---------------------------------------DKEFKEGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFN
                                                K F+ G++V + ++K R+P G  NKL  KK  P+ +  +  DN Y IDLP   + +  FN
Subjt:  ---------------------------------------DKEFKEGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFN

Query:  VADIFEYFPPDQL
        + D++ YFPPD++
Subjt:  VADIFEYFPPDQL

TrEMBL top hitse value%identityAlignment
A0A2N9GX40 Uncharacterized protein7.4e-5733.4Show/hide
Query:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSLKAGS--TNVVADALSRKA
        +LAL  F + FEV  DA G GIGAVLSQ   P+ +FSEKLSE+R+KWSTY QE Y++VRALK WEHYL+ ++F+L+ +H +LK  +   N VADALSR+A
Subjt:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSLKAGS--TNVVADALSRKA

Query:  NLLTILKSQVVAFDSLPTLYKDNPDFGQIW----------------------------------------------------------------------
        NLL  L  +VV FD L  LY+++ DF +IW                                                                      
Subjt:  NLLTILKSQVVAFDSLPTLYKDNPDFGQIW----------------------------------------------------------------------

Query:  ----------------------------------KTSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT-----------
                                          KTSDA +I  LFF+EVV+LHG+P+SI SDRD  FLSHFW TLW+ F+T+   STT           
Subjt:  ----------------------------------KTSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT-----------

Query:  ------NLV--------------LAQAEFSYNHMKNQTTGD-----------------------------------------------------------
              NL+              LAQAEF+YN   +  TG                                                            
Subjt:  ------NLV--------------LAQAEFSYNHMKNQTTGD-----------------------------------------------------------

Query:  ------KEFKEGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQL
              K FKEG+ VM+ LRK R P+G YNKL+ KK  PY ++KK  DN Y IDLP+ + I+  FNVAD+++Y   + L
Subjt:  ------KEFKEGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQL

A0A2N9HBL1 Integrase catalytic domain-containing protein2.3e-5834.12Show/hide
Query:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL------------------
        +LAL  F + FE+  DA G GIGAVLSQ   P+ +FSEKLSE+R+KWSTY QE Y++VRALK WEHYL+ ++F+L+ +H +L                  
Subjt:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL------------------

Query:  -----------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW-------------------------------------------
                   K+G  N VADALSR+ANLL  L  +VV FDSL  LY+++ DF +IW                                           
Subjt:  -----------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW-------------------------------------------

Query:  -------------------------------------------KTSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT--
                                                   KTSDA +I  LFF+EVV+LHG+P+SI SDRD  FLSHFW TLW+ F+T+   STT  
Subjt:  -------------------------------------------KTSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT--

Query:  ---------------NLV--------------LAQAEFSYNHMKNQTTGD----------------------------------------------KEFK
                       NL+              LAQAEF+YN   +  TG                                               K FK
Subjt:  ---------------NLV--------------LAQAEFSYNHMKNQTTGD----------------------------------------------KEFK

Query:  EGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQL
        EG+ VM+ LRK R P+G YNKL+ KK  PY ++KK  DN Y IDLP+ + I+  FNV D+++Y   + L
Subjt:  EGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQL

A0A2N9IH25 Reverse transcriptase1.4e-5532.06Show/hide
Query:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL------------------
        +LAL  F + FEV  DA G GIGAVLSQ   P+ +FSEKLSE+R+KWSTY QE Y++VRALK WEHYL+ ++F+L+ +H +L                  
Subjt:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL------------------

Query:  -----------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW-------------------------------------------
                   K+G  N VADALSR+ANLL  L  ++V FD L  LY+++ DF +IW                                           
Subjt:  -----------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW-------------------------------------------

Query:  ------------------------------------------------------KTSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKF
                                                              KTSDA +I  LFF+EVV+LHG+P+SI SDRD  FLSHFW TLW+ F
Subjt:  ------------------------------------------------------KTSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKF

Query:  NTTFKFSTT-----------------NLV--------------LAQAEFSYNHMKNQTTGD---------------------------------------
        +T+   STT                 NL+              LAQAEF+YN   +  TG                                        
Subjt:  NTTFKFSTT-----------------NLV--------------LAQAEFSYNHMKNQTTGD---------------------------------------

Query:  --------------------------KEFKEGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQL
                                  K FKEG+ VM+ LRK R P+G YNKL+ KK  PY ++KK  DN Y IDLP+ + I+  FNVAD+++Y   + L
Subjt:  --------------------------KEFKEGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQL

A0A2N9J550 Uncharacterized protein3.7e-5636.87Show/hide
Query:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSLKAGSTNVVADALSRKANL
        +LAL  F + F+V  D    G+GAVLSQ +  I ++SE LS+A KKWSTYE E Y++ RALK WEHYL       F   L LK+G    V DAL+++A+L
Subjt:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSLKAGSTNVVADALSRKANL

Query:  LTILKSQVVAFDSLPTLYKDNPDFGQIW------KTSDAVYIT------------------------------------------------------NLF
        L  L+++V+ FD L  LY+++ DFG  W      +  + V+I                                                       NLF
Subjt:  LTILKSQVVAFDSLPTLYKDNPDFGQIW------KTSDAVYIT------------------------------------------------------NLF

Query:  FKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT-----------------NLV-----------------------LAQAEFSYNHMKNQ
        FKEVV+LHG+P+SI SDRD  FL HFWKTLW++F+ +  +S+T                 NL+                       L      Y    ++
Subjt:  FKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT-----------------NLV-----------------------LAQAEFSYNHMKNQ

Query:  TTGDKEFKEGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQL
           +K F EG+LVM++L+K R  +G YNKL+ KK  PY IIKK  DN Y +DLP  + I+  FN AD+FEYFPPD+L
Subjt:  TTGDKEFKEGELVMIHLRKARLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQL

A0A6N2LVR1 Uncharacterized protein1.1e-5730.94Show/hide
Query:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL------------------
        +LAL DF + FEV  DA   GIGAVLSQ   P+ ++SEKLSEAR+KWSTYE ELY++ RA+KVWEHYL+ ++FILF +H +L                  
Subjt:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL------------------

Query:  -----------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW-------------------------------------------
                   K+G  N VADALSRK +LLT L+++V+ F+ +  LY  + DFG  W                                           
Subjt:  -----------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNPDFGQIW-------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------K
                                                                                                           K
Subjt:  ---------------------------------------------------------------------------------------------------K

Query:  TSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT-------------------------------NLVLAQAEFSYNHMK
        TSDAV++ NLFFKEVV+LHG+PKSI SDRD  FLSHFW+TLWR+F+TT  FS+T                               +L LAQAEF+YN M 
Subjt:  TSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTT-------------------------------NLVLAQAEFSYNHMK

Query:  NQTTGD-----------------------------------------------------------------KEFKEGELVMIHLRKARLPIGKYNKLQSK
        N++TG                                                                  K FKEG+LVM++LRK R+P G  +KL  K
Subjt:  NQTTGD-----------------------------------------------------------------KEFKEGELVMIHLRKARLPIGKYNKLQSK

Query:  KLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQ
        K  PY I++K  DN Y++DLP  + I+  FNVAD+FEY PPD+
Subjt:  KLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQ

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.61.0e-1042.68Show/hide
Query:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL
        +L + DFT+ F +  DA    +GAVLSQ  HP+ Y S  L+E    +ST E+EL ++V A K + HYLLG+ F +  +H  L
Subjt:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL

P10401 Retrovirus-related Pol polyprotein from transposon gypsy4.7e-0832.28Show/hide
Query:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLG-QDFILFYNHLSL-----------------
        +L   DF +PF++  DA  +GIGAVLSQ   PI   S  L +  + ++T E+EL ++V AL   +++L G ++  +F +H  L                 
Subjt:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLG-QDFILFYNHLSL-----------------

Query:  ------------KAGSTNVVADALSRK
                    K G  N VADALSR+
Subjt:  ------------KAGSTNVVADALSRK

P20825 Retrovirus-related Pol polyprotein from transposon 2971.7e-1039.76Show/hide
Query:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSLK
        +L L DF + F +  DA    +GAVLSQ+ HPI + S  L++    +S  E+EL ++V A K + HYLLG+ F++  +H  L+
Subjt:  MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSLK

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein2.6e-0628.57Show/hide
Query:  FEVVVDACGNGIGAVLSQHTHP------IEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL----------------------
        + +  DA  +GIGAVL +  +       + YFS+ L  A+K +   E EL  +++AL  + + L G+ F L  +H+SL                      
Subjt:  FEVVVDACGNGIGAVLSQHTHP------IEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL----------------------

Query:  -------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNP
                AG  NVVADA+SR    +T   S+ +  +S  + YK +P
Subjt:  -------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNP

Q99315 Transposon Ty3-G Gag-Pol polyprotein2.0e-0628.57Show/hide
Query:  FEVVVDACGNGIGAVLSQHTHP------IEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL----------------------
        + +  DA  +GIGAVL +  +       + YFS+ L  A+K +   E EL  +++AL  + + L G+ F L  +H+SL                      
Subjt:  FEVVVDACGNGIGAVLSQHTHP------IEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSL----------------------

Query:  -------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNP
                AG  NVVADA+SR    +T   S+ +  +S  + YK +P
Subjt:  -------KAGSTNVVADALSRKANLLTILKSQVVAFDSLPTLYKDNP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGCACTATCGGATTTCACACAACCGTTTGAAGTGGTCGTTGATGCATGTGGGAATGGAATAGGTGCAGTCTTATCCCAACACACTCACCCAATTGAATACTTTAG
TGAAAAACTAAGCGAGGCAAGGAAAAAATGGAGCACCTACGAACAAGAACTATACTCTTTGGTCAGAGCTTTAAAGGTATGGGAACACTATCTCTTAGGACAAGACTTTA
TCCTATTCTACAATCATTTATCGCTCAAGGCTGGTAGTACCAATGTAGTAGCAGATGCCCTTAGTAGGAAAGCCAACCTACTTACTATTCTAAAATCTCAAGTAGTTGCA
TTTGATTCCTTGCCCACTCTTTATAAAGACAATCCCGACTTTGGTCAAATTTGGAAAACATCGGATGCCGTTTACATTACTAATCTGTTTTTTAAAGAAGTGGTACAGTT
GCATGGAATCCCCAAGTCCATAGTCTCCGACCGTGACGTCAACTTCCTAAGTCACTTTTGGAAGACACTTTGGAGAAAATTCAACACAACTTTTAAGTTTAGCACCACTA
ACCTTGTCCTAGCCCAAGCAGAATTTTCATACAATCATATGAAAAACCAAACAACAGGAGATAAAGAATTCAAAGAAGGAGAGCTAGTAATGATACACCTTCGAAAGGCA
AGACTACCGATTGGAAAATACAACAAACTACAATCAAAGAAGTTAGAACCGTATCCCATAATCAAGAAATTTGGAGATAACACATACAAGATTGATCTCCCCGATCACAT
ACATATCAATCTTATCTTCAATGTGGCTGATATATTCGAGTATTTCCCTCCTGATCAACTACAACTTTCAAAC
mRNA sequenceShow/hide mRNA sequence
ATGCTTGCACTATCGGATTTCACACAACCGTTTGAAGTGGTCGTTGATGCATGTGGGAATGGAATAGGTGCAGTCTTATCCCAACACACTCACCCAATTGAATACTTTAG
TGAAAAACTAAGCGAGGCAAGGAAAAAATGGAGCACCTACGAACAAGAACTATACTCTTTGGTCAGAGCTTTAAAGGTATGGGAACACTATCTCTTAGGACAAGACTTTA
TCCTATTCTACAATCATTTATCGCTCAAGGCTGGTAGTACCAATGTAGTAGCAGATGCCCTTAGTAGGAAAGCCAACCTACTTACTATTCTAAAATCTCAAGTAGTTGCA
TTTGATTCCTTGCCCACTCTTTATAAAGACAATCCCGACTTTGGTCAAATTTGGAAAACATCGGATGCCGTTTACATTACTAATCTGTTTTTTAAAGAAGTGGTACAGTT
GCATGGAATCCCCAAGTCCATAGTCTCCGACCGTGACGTCAACTTCCTAAGTCACTTTTGGAAGACACTTTGGAGAAAATTCAACACAACTTTTAAGTTTAGCACCACTA
ACCTTGTCCTAGCCCAAGCAGAATTTTCATACAATCATATGAAAAACCAAACAACAGGAGATAAAGAATTCAAAGAAGGAGAGCTAGTAATGATACACCTTCGAAAGGCA
AGACTACCGATTGGAAAATACAACAAACTACAATCAAAGAAGTTAGAACCGTATCCCATAATCAAGAAATTTGGAGATAACACATACAAGATTGATCTCCCCGATCACAT
ACATATCAATCTTATCTTCAATGTGGCTGATATATTCGAGTATTTCCCTCCTGATCAACTACAACTTTCAAAC
Protein sequenceShow/hide protein sequence
MLALSDFTQPFEVVVDACGNGIGAVLSQHTHPIEYFSEKLSEARKKWSTYEQELYSLVRALKVWEHYLLGQDFILFYNHLSLKAGSTNVVADALSRKANLLTILKSQVVA
FDSLPTLYKDNPDFGQIWKTSDAVYITNLFFKEVVQLHGIPKSIVSDRDVNFLSHFWKTLWRKFNTTFKFSTTNLVLAQAEFSYNHMKNQTTGDKEFKEGELVMIHLRKA
RLPIGKYNKLQSKKLEPYPIIKKFGDNTYKIDLPDHIHINLIFNVADIFEYFPPDQLQLSN