; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G000600 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G000600
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCmo_Chr04:331000..331920
RNA-Seq ExpressionCmoCh04G000600
SyntenyCmoCh04G000600
Gene Ontology termsGO:0044238 - primary metabolic process (biological process)
GO:0071704 - organic substance metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN60238.1 hypothetical protein VITISV_032906 [Vitis vinifera]1.6e-6242.74Show/hide
Query:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE
        MARSM+ A+ L N FWAE VAT+VYLLNISPTKAV+NRTPYEAW+GRKP VSHL++FG VAY L     R KLDEKS KCIFIGYC+QSK YKLYNPVS 
Subjt:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE

Query:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISDP
        KI+V R+V+F+E  S  W   ++     VE+S   E   +    PS         + S S+   S SS  + S+ETPPRK++SL DIY + Q  L ++DP
Subjt:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISDP

Query:  MNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK--------------------------------------------------------------
          +EEA EKE+   AM  E+ +++K+ TW++V+LP +K                                                              
Subjt:  MNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK--------------------------------------------------------------

Query:  -------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ
                     L G+L EE+Y + PEGFI  DKE  VY+L+  LYGLKQ
Subjt:  -------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ

CAN72676.1 hypothetical protein VITISV_020406 [Vitis vinifera]8.3e-7253.1Show/hide
Query:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE
        MARSM++A+ L N FWAE VAT+VYLLNISPTKAV+NRTPYEAW+G+KP VSHL++FG VAY L     R KLDEKS KCIFIGYC+QSK YKLYNPVS 
Subjt:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE

Query:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGS-TATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISD
        KI+V R+V+F+E  SW W   ++     VE+S   E   +    PS    A+ + S+S SS +L+S+SS  + S+ETPPRK++SL DIY + Q  L ++D
Subjt:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGS-TATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISD

Query:  PMNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK-------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ
        P  +EEA EKE+   AM  E+ +++K+ TW++V+LP +K             L  +L EE+YV+ PEGFI  DKE  VY+L++ LYGLKQ
Subjt:  PMNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK-------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ

KAG7584961.1 Integrase catalytic core [Arabidopsis thaliana x Arabidopsis arenosa]2.0e-6241.44Show/hide
Query:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE
        MARSML+A+ +P++FW EAVAT+VYLLNISPTKAVMNRTPYEAW GRKP VSHL++FGC AYA+     R KLDEKS KC+FIGYC+QSKAY+LYN  S 
Subjt:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE

Query:  KILVRRDVIFNENVSWDWSK-----------EQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYA
        K+++ R+V+FNE     W++           ++E Q+      P+ E + N S  PS +     +S+ SSS+S  S  S    +   PPRK+KSL ++Y 
Subjt:  KILVRRDVIFNENVSWDWSK-----------EQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYA

Query:  SCQFALTISDPMNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK---------------------------------------------------
        SC FAL  SDP+ + EA    D + AM VEM++++K+ TW++VD+P  K                                                   
Subjt:  SCQFALTISDPMNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK---------------------------------------------------

Query:  ------------------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ
                                L G+L+EE+YV+ PEGF+   KE KVY+LR+ LYGLKQ
Subjt:  ------------------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ

RVW25417.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]7.5e-6543.02Show/hide
Query:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE
        MARSM++A+ L N FWAE VAT+VYLLNISPTKAV+NRTPYEAW+GRKP VSHL++FG VAY L +   R KLDEKS KCIFIGYC+QSK YKLYNPVS 
Subjt:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE

Query:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISDP
        KI+V R+V+F+E  SW W   ++     VE+S   E   +    PS       + + S S+   S+SS  + S+ETPPRK++SL DIY + Q  L ++DP
Subjt:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISDP

Query:  MNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK--------------------------------------------------------------
          +EEA EKE+   AM  E+ +++K+ TW++V+LP +K                                                              
Subjt:  MNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK--------------------------------------------------------------

Query:  -------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ
                     L G+L EE+YV+ PEGFI   KE  VY+L++ LYGLKQ
Subjt:  -------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ

XP_022966695.1 uncharacterized protein LOC111466326 [Cucurbita maxima]1.2e-9489Show/hide
Query:  GYCTQSK--AYKLYNPVSEKILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKY
        G CT     ++ +YNP+SEKILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPS STATQSVSNSSS ASLNSNSSLEELSDETPPRKY
Subjt:  GYCTQSK--AYKLYNPVSEKILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKY

Query:  KSLVDIYASCQFALTISDPMNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEKLYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQTYLRR
        K L DIYASCQFALT+SDPMNYEEATEKEDGKK M VEMQSVKKDGTWDMVDLPN+K YGDLQEE+YVTPPEGFIK+DKETKVYKLRQTLYGLKQTYLRR
Subjt:  KSLVDIYASCQFALTISDPMNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEKLYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQTYLRR

Query:  PCQKKSSVL
         CQKKSSVL
Subjt:  PCQKKSSVL

TrEMBL top hitse value%identityAlignment
A0A0V0IV83 Putative ovule protein (Fragment)2.3e-7547.11Show/hide
Query:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE
        MARSMLQA+ L NQFWAEAVA S+YLLN+SPTK VMN+TPYEAWH RKP+VSHLR+FGCVAYAL   Q RQKLDEKSEKCIFIGYCT SKAY+LYNP + 
Subjt:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE

Query:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDE------------TPPRKYKSLVDIY
        KI VRRDVIF+EN SWDWS++  +Q+  + +   EE          GST   S S SS     +  S   E S +            TP  KY +  ++Y
Subjt:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDE------------TPPRKYKSLVDIY

Query:  ASCQFALTISDPMNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK--------------------------------------------------
        A+CQFA ++SDP+NYEEA EKE+ +K +  EM+S +K+GTW+M++LP  K                                                  
Subjt:  ASCQFALTISDPMNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK--------------------------------------------------

Query:  -------------------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ
                                 L GDLQEE+YVT P+GFIKE  ETKV+KL++TLYGLKQ
Subjt:  -------------------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ

A0A438CQE1 Retrovirus-related Pol polyprotein from transposon TNT 1-943.6e-6543.02Show/hide
Query:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE
        MARSM++A+ L N FWAE VAT+VYLLNISPTKAV+NRTPYEAW+GRKP VSHL++FG VAY L +   R KLDEKS KCIFIGYC+QSK YKLYNPVS 
Subjt:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE

Query:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISDP
        KI+V R+V+F+E  SW W   ++     VE+S   E   +    PS       + + S S+   S+SS  + S+ETPPRK++SL DIY + Q  L ++DP
Subjt:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISDP

Query:  MNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK--------------------------------------------------------------
          +EEA EKE+   AM  E+ +++K+ TW++V+LP +K                                                              
Subjt:  MNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK--------------------------------------------------------------

Query:  -------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ
                     L G+L EE+YV+ PEGFI   KE  VY+L++ LYGLKQ
Subjt:  -------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ

A0A6J1HSC0 uncharacterized protein LOC1114663265.7e-9589Show/hide
Query:  GYCTQSK--AYKLYNPVSEKILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKY
        G CT     ++ +YNP+SEKILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPS STATQSVSNSSS ASLNSNSSLEELSDETPPRKY
Subjt:  GYCTQSK--AYKLYNPVSEKILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKY

Query:  KSLVDIYASCQFALTISDPMNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEKLYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQTYLRR
        K L DIYASCQFALT+SDPMNYEEATEKEDGKK M VEMQSVKKDGTWDMVDLPN+K YGDLQEE+YVTPPEGFIK+DKETKVYKLRQTLYGLKQTYLRR
Subjt:  KSLVDIYASCQFALTISDPMNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEKLYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQTYLRR

Query:  PCQKKSSVL
         CQKKSSVL
Subjt:  PCQKKSSVL

A5AHH2 Uncharacterized protein7.6e-6342.74Show/hide
Query:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE
        MARSM+ A+ L N FWAE VAT+VYLLNISPTKAV+NRTPYEAW+GRKP VSHL++FG VAY L     R KLDEKS KCIFIGYC+QSK YKLYNPVS 
Subjt:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE

Query:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISDP
        KI+V R+V+F+E  S  W   ++     VE+S   E   +    PS         + S S+   S SS  + S+ETPPRK++SL DIY + Q  L ++DP
Subjt:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISDP

Query:  MNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK--------------------------------------------------------------
          +EEA EKE+   AM  E+ +++K+ TW++V+LP +K                                                              
Subjt:  MNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK--------------------------------------------------------------

Query:  -------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ
                     L G+L EE+Y + PEGFI  DKE  VY+L+  LYGLKQ
Subjt:  -------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ

A5BT67 Integrase catalytic domain-containing protein4.0e-7253.1Show/hide
Query:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE
        MARSM++A+ L N FWAE VAT+VYLLNISPTKAV+NRTPYEAW+G+KP VSHL++FG VAY L     R KLDEKS KCIFIGYC+QSK YKLYNPVS 
Subjt:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE

Query:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGS-TATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISD
        KI+V R+V+F+E  SW W   ++     VE+S   E   +    PS    A+ + S+S SS +L+S+SS  + S+ETPPRK++SL DIY + Q  L ++D
Subjt:  KILVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGS-TATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISD

Query:  PMNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK-------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ
        P  +EEA EKE+   AM  E+ +++K+ TW++V+LP +K             L  +L EE+YV+ PEGFI  DKE  VY+L++ LYGLKQ
Subjt:  PMNYEEATEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK-------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-1734.36Show/hide
Query:  ARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVM--NRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQ-KLDEKSEKCIFIGYCTQSKAYKLYNPV
        AR+M+   +L   FW EAV T+ YL+N  P++A++  ++TPYE WH +KP + HLR+FG   Y   H + +Q K D+KS K IF+GY  +   +KL++ V
Subjt:  ARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVM--NRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQ-KLDEKSEKCIFIGYCTQSKAYKLYNPV

Query:  SEKILVRRDVIFN------------ENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTA---TQSVSNSSSSASLN-SNSSLEELSDETP
        +EK +V RDV+ +            E V    SKE E +       PN+  ++  +  P+ S      Q + +S  S + N  N S + +  E P
Subjt:  SEKILVRRDVIFN------------ENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTA---TQSVSNSSSSASLN-SNSSLEELSDETP

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.2e-2729.38Show/hide
Query:  RSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSEKI
        RSML+  +LP  FW EAV T+ YL+N SP+  +    P   W  ++ S SHL++FGC A+A    + R KLD+KS  CIFIGY  +   Y+L++PV +K+
Subjt:  RSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSEKI

Query:  LVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASP-SGSTATQSVSNSSSSAS--LNSNSSLEELSDETP------------PRKYKSLVDI
        +  RDV+F E  S   +     +K+   + PN  T  ++S +P S  + T  VS         +     L+E  +E               R  +  V+ 
Subjt:  LVRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSASP-SGSTATQSVSNSSSSAS--LNSNSSLEELSDETP------------PRKYKSLVDI

Query:  --YASCQFALTISD--PMNYEEA---TEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK------------------------------------------
          Y S ++ L   D  P + +E     EK    KAM  EM+S++K+GT+ +V+LP  K                                          
Subjt:  --YASCQFALTISD--PMNYEEA---TEKEDGKKAMTVEMQSVKKDGTWDMVDLPNEK------------------------------------------

Query:  ---------------------------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ
                                         L+GDL+EEIY+  PEGF    K+  V KL ++LYGLKQ
Subjt:  ---------------------------------LYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ

P92512 Uncharacterized mitochondrial protein AtMg007103.7e-0645Show/hide
Query:  RSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAY
        RSML    LP  F A+A  T+V+++N  P+ A+    P E W    P+ S+LR FGCVAY
Subjt:  RSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.0e-1225.34Show/hide
Query:  SMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSEKIL
        ++L    +P  +W  A A +VYL+N  PT  +   +P++   G  P+   LR+FGC  Y    P  + KLD+KS +C+F+GY     AY   +  + ++ 
Subjt:  SMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSEKIL

Query:  VRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSA--SPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISDPM
        + R V F+EN                 +SP +E R  SS   SP  +  T++    + S S   +++    S   P   +++     ++   + + S P 
Subjt:  VRRDVIFNENVSWDWSKEQEQQKITVEVSPNEETRVNSSA--SPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISDPM

Query:  NYEEATEKEDGKKAMTVEMQS
        + E    +++G +  T   Q+
Subjt:  NYEEATEKEDGKKAMTVEMQS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.5e-0442.86Show/hide
Query:  VKKDGTWDM--VDLPNEKLYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ
        V  D +W +  +D+ N  L G L +++Y++ P GFI +D+   V KLR+ LYGLKQ
Subjt:  VKKDGTWDM--VDLPNEKLYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.6e-1227.04Show/hide
Query:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE
        M  ++L    +P  +W  A + +VYL+N  PT  +  ++P++   G+ P+   L++FGC  Y    P  R KL++KS++C F+GY     AY   +  + 
Subjt:  MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSE

Query:  KILVRRDVIFNE------NVSWDWSKEQEQQKITVE--------------------VSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLE
        ++   R V F+E        ++  S  QEQ+  +                      + P+ +T     +SPS    TQ  S++  S+S++S SS E
Subjt:  KILVRRDVIFNE------NVSWDWSKEQEQQKITVE--------------------VSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.3e-0339.29Show/hide
Query:  VKKDGTWDM--VDLPNEKLYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ
        V  D +W +  +D+ N  L G L +E+Y++ P GF+ +D+   V +LR+ +YGLKQ
Subjt:  VKKDGTWDM--VDLPNEKLYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQ

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.6e-0745Show/hide
Query:  RSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAY
        RSML    LP  F A+A  T+V+++N  P+ A+    P E W    P+ S+LR FGCVAY
Subjt:  RSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAGAAGTATGTTACAAGCAAGGAGACTTCCAAACCAATTTTGGGCCGAAGCCGTAGCAACATCTGTTTATCTATTAAACATCTCACCAACAAAGGCGGTTATGAA
TCGAACTCCCTATGAAGCATGGCATGGAAGGAAACCATCTGTAAGTCATTTACGAATCTTTGGTTGTGTTGCTTATGCTTTGAAACATCCTCAAACTCGTCAAAAGCTTG
ATGAAAAATCTGAAAAATGCATTTTCATTGGCTATTGTACTCAATCAAAGGCATATAAACTATACAACCCCGTTAGTGAAAAGATTTTAGTTAGAAGAGATGTAATATTT
AATGAAAACGTGAGTTGGGATTGGAGTAAAGAACAGGAGCAGCAAAAGATTACAGTTGAAGTTTCCCCAAATGAAGAGACAAGGGTCAACTCCAGTGCATCTCCCAGTGG
TTCCACTGCAACACAAAGTGTGTCAAATTCATCATCTTCAGCATCACTTAATAGCAATTCTTCTCTTGAAGAACTTTCAGATGAGACACCTCCAAGGAAGTATAAATCTT
TAGTTGACATATATGCATCATGTCAGTTTGCTCTTACTATTTCAGACCCTATGAATTATGAAGAAGCAACTGAAAAAGAAGACGGGAAGAAAGCCATGACAGTAGAGATG
CAGTCCGTTAAGAAAGATGGAACATGGGATATGGTTGACCTACCAAACGAAAAGCTTTATGGAGATTTACAAGAAGAGATTTATGTAACACCGCCAGAGGGATTCATAAA
GGAAGACAAAGAAACAAAGGTATACAAGCTGAGACAGACATTGTACGGGTTAAAGCAGACATACTTACGAAGGCCTTGTCAGAAGAAAAGCTCTGTACTTACGTCTACAA
CTTTGAATCATGGGGGAGTGTTGAGATATGATTCAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAGAAGTATGTTACAAGCAAGGAGACTTCCAAACCAATTTTGGGCCGAAGCCGTAGCAACATCTGTTTATCTATTAAACATCTCACCAACAAAGGCGGTTATGAA
TCGAACTCCCTATGAAGCATGGCATGGAAGGAAACCATCTGTAAGTCATTTACGAATCTTTGGTTGTGTTGCTTATGCTTTGAAACATCCTCAAACTCGTCAAAAGCTTG
ATGAAAAATCTGAAAAATGCATTTTCATTGGCTATTGTACTCAATCAAAGGCATATAAACTATACAACCCCGTTAGTGAAAAGATTTTAGTTAGAAGAGATGTAATATTT
AATGAAAACGTGAGTTGGGATTGGAGTAAAGAACAGGAGCAGCAAAAGATTACAGTTGAAGTTTCCCCAAATGAAGAGACAAGGGTCAACTCCAGTGCATCTCCCAGTGG
TTCCACTGCAACACAAAGTGTGTCAAATTCATCATCTTCAGCATCACTTAATAGCAATTCTTCTCTTGAAGAACTTTCAGATGAGACACCTCCAAGGAAGTATAAATCTT
TAGTTGACATATATGCATCATGTCAGTTTGCTCTTACTATTTCAGACCCTATGAATTATGAAGAAGCAACTGAAAAAGAAGACGGGAAGAAAGCCATGACAGTAGAGATG
CAGTCCGTTAAGAAAGATGGAACATGGGATATGGTTGACCTACCAAACGAAAAGCTTTATGGAGATTTACAAGAAGAGATTTATGTAACACCGCCAGAGGGATTCATAAA
GGAAGACAAAGAAACAAAGGTATACAAGCTGAGACAGACATTGTACGGGTTAAAGCAGACATACTTACGAAGGCCTTGTCAGAAGAAAAGCTCTGTACTTACGTCTACAA
CTTTGAATCATGGGGGAGTGTTGAGATATGATTCAAAATGA
Protein sequenceShow/hide protein sequence
MARSMLQARRLPNQFWAEAVATSVYLLNISPTKAVMNRTPYEAWHGRKPSVSHLRIFGCVAYALKHPQTRQKLDEKSEKCIFIGYCTQSKAYKLYNPVSEKILVRRDVIF
NENVSWDWSKEQEQQKITVEVSPNEETRVNSSASPSGSTATQSVSNSSSSASLNSNSSLEELSDETPPRKYKSLVDIYASCQFALTISDPMNYEEATEKEDGKKAMTVEM
QSVKKDGTWDMVDLPNEKLYGDLQEEIYVTPPEGFIKEDKETKVYKLRQTLYGLKQTYLRRPCQKKSSVLTSTTLNHGGVLRYDSK