CuGenDBv2

CmoCh19G004170 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh19G004170
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Transposon Ty1-LR1 Gag-Pol polyprotein
Genome location: Cmo_Chr19:5019009..5021079
RNA-Seq Expression: CmoCh19G004170
Synteny: CmoCh19G004170
Gene Ontology terms: GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains: IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
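The genome location above follows a `SEQID:START..END` pattern. A minimal sketch of splitting such a string and computing the span length is given here. The helper names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("Cmo_Chr19:5019009..5021079")
print(seqid, start, end, span_length(start, end))
# prints: Cmo_Chr19 5019009 5021079 2071
```

Under that assumption the gene spans 2071 bp on Cmo_Chr19. If the coordinates were instead 0-based half-open, the `+ 1` would need to be dropped.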


Homology
GenBank top hits | e value | % identity | Alignment
AQA29583.1 reverse transcriptase [Zea mays] | 2.4e-48 | 41.01
Query:  PIQLKEGRVFAKIGERDEQHEHR--RWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTIALLMAAADLRLYDIHF-----PTTSELS
        PI + E +V   +   +E+      RW+LDT ATNHMT +R  FSELDS +  TVKFGDGS++ IEG+G  +  L +    RL  +++          L 
Subjt:  PIQLKEGRVFAKIGERDEQHEHR--RWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTIALLMAAADLRLYDIHF-----PTTSELS

Query:  QF------VLVSP------------TAKR-------------------QKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLELVHG
        Q       V++               AKR                   +KL++E+MV GLP+I  V+++C+ C++ KQ+R PFP    YRA E LELVHG
Subjt:  QF------VLVSP------------TAKR-------------------QKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLELVHG

Query:  DICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYC
        D+CGPI   TP   + FLLLVDD SRFMWLTLL++K++A   +   + R E EC KK++VLRT+ G EFTS  F ++C
Subjt:  DICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYC

BAF21751.1 Os07g0528100, partial [Oryza sativa Japonica Group] | 6.0e-47 | 38.31
Query:  PAGEPIQLKEGRVFAKIGERDEQHEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTI-----------------------ALL
        P    + L E +V       +E+     W LDT ATNHMT +RSAF++LD+ +  TVKFGDGS+++I GRG  +                        + 
Subjt:  PAGEPIQLKEGRVFAKIGERDEQHEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTI-----------------------ALL

Query:  MAAADLRLYDIHF---------------------PTTSELSQFVLVSP---------TAKR-------------QKLQKEKMVHGLPAIKNVNKLCDGCL
        +   D R YD H                      P+   + +  +  P         TA R             ++L + +MV GLP I +V++LCDGCL
Subjt:  MAAADLRLYDIHF---------------------PTTSELSQFVLVSP---------TAKR-------------QKLQKEKMVHGLPAIKNVNKLCDGCL

Query:  IGKQRRTPFPSQTSYRADEPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSF
         GKQRR PFP +  +RA + LELVHGD+CGPI  ATPG +  FLLLVDD SRFMW+ LL  K EAA A+K+ K   E E  +K+R LRT++G EFTS  F
Subjt:  IGKQRRTPFPSQTSYRADEPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSF

Query:  RKYCDEFG
         ++C + G
Subjt:  RKYCDEFG

BAG94704.1 unnamed protein product [Oryza sativa Japonica Group] | 6.0e-47 | 38.31
Query:  PAGEPIQLKEGRVFAKIGERDEQHEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTI-----------------------ALL
        P    + L E +V       +E+     W LDT ATNHMT +RSAF++LD+ +  TVKFGDGS+++I GRG  +                        + 
Subjt:  PAGEPIQLKEGRVFAKIGERDEQHEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTI-----------------------ALL

Query:  MAAADLRLYDIHF---------------------PTTSELSQFVLVSP---------TAKR-------------QKLQKEKMVHGLPAIKNVNKLCDGCL
        +   D R YD H                      P+   + +  +  P         TA R             ++L + +MV GLP I +V++LCDGCL
Subjt:  MAAADLRLYDIHF---------------------PTTSELSQFVLVSP---------TAKR-------------QKLQKEKMVHGLPAIKNVNKLCDGCL

Query:  IGKQRRTPFPSQTSYRADEPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSF
         GKQRR PFP +  +RA + LELVHGD+CGPI  ATPG +  FLLLVDD SRFMW+ LL  K EAA A+K+ K   E E  +K+R LRT++G EFTS  F
Subjt:  IGKQRRTPFPSQTSYRADEPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSF

Query:  RKYCDEFG
         ++C + G
Subjt:  RKYCDEFG

CAE03692.2 OSJNBb0026E15.10 [Oryza sativa Japonica Group] | 1.5e-50 | 31.11
Query:  LASLSTKRTAQSTWEGIKFRRVGVQRVRESNIEQLQKTI---------------------------------EEEWLARL--------------------
        LA+L+ K TAQ  WE IK RR+GVQRVRE+N +QL++                                   E E + +L                    
Subjt:  LASLSTKRTAQSTWEGIKFRRVGVQRVRESNIEQLQKTI---------------------------------EEEWLARL--------------------

Query:  --KLKLHENTGESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQELMEQSSGNKEKAEKAQSEEDVGERSPEQELMEQSSG-
          +L L E TG       R    K  K+   + R D     G   +  +  L    + +  Q+    SSGN ++  + + ++D G     Q       G 
Subjt:  --KLKLHENTGESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQELMEQSSGNKEKAEKAQSEEDVGERSPEQELMEQSSG-

Query:  -------------------NKEKAEK---AQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEPIQLKEGRVFAKIGERDEQ
                           +K KA++   AQ E++EPAL +    +   D     V    + V P    +      +P GE + + E +VFA++ +  E 
Subjt:  -------------------NKEKAEK---AQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEPIQLKEGRVFAKIGERDEQ

Query:  HEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEG-------------RGVT-------------------------------------
        H+   WILDT ATNHMT +RSAF+ELD+ +  TV+FGDGS++ IEG             RG+                                      
Subjt:  HEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEG-------------RGVT-------------------------------------

Query:  ---IALLMAAADLRLYDIHFPTTSELSQFVLVSPTAKR-------------QKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLEL
           + + +  +D  LY I       +      +  A R             +KL +++MV GLP ++ V ++CDGCL+GKQRR  FP+Q+ YRADE LEL
Subjt:  ---IALLMAAADLRLYDIHFPTTSELSQFVLVSPTAKR-------------QKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLEL

Query:  VHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYCDEFG
        VHGD+CGPI+ ATP     FLLLVDD SR+MWLT++++K EAA A+K  + RAE E  +K+R LR ++G EFTS  F +YC   G
Subjt:  VHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYCDEFG

CAH66352.1 OSIGBa0135C09.3 [Oryza sativa] | 3.4e-50 | 31.5
Query:  LASLSTKRTAQSTWEGIKFRRVGVQRVRESNIEQLQKTI---------------------------------EEEWLARL--------------------
        LA+L+ K TAQ  WE IK RR+GVQRVRE+N +QL++                                   E E + +L                    
Subjt:  LASLSTKRTAQSTWEGIKFRRVGVQRVRESNIEQLQKTI---------------------------------EEEWLARL--------------------

Query:  --KLKLHENTGESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQELMEQSSGNKEKAEKAQSEEDVGERSPEQELMEQSSG-
          +L L E TG       R    K  K+   + R D     G   +  +  L    + +  Q+    S GN ++  + + ++D G     Q       G 
Subjt:  --KLKLHENTGESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQELMEQSSGNKEKAEKAQSEEDVGERSPEQELMEQSSG-

Query:  -------------------NKEKAEK---AQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEPIQLKEGRVFAKIGERDEQ
                           +K KA++   AQ E++EPAL +    +   D     V    + V P    +      +P GE + + E +VFA++ +  E 
Subjt:  -------------------NKEKAEK---AQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEPIQLKEGRVFAKIGERDEQ

Query:  HEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTI-----------------------------------ALLMAAADLRLYDI
        H+   WILDT ATNHMT +RSAF++LD+ +  TV+FGDGS++ IEGRG  +                                    +L+    LR++D 
Subjt:  HEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTI-----------------------------------ALLMAAADLRLYDI

Query:  --HFPTTSELSQ-------------FVLVSPTAK----------------RQKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLEL
          H       S                L + +AK                 +KL +++MV GLP ++ V ++CDGCL+GKQRR  FP+Q+ YRADE LEL
Subjt:  --HFPTTSELSQ-------------FVLVSPTAK----------------RQKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLEL

Query:  VHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYC
        VHGD+CGPI+ ATP     FLLLVDD SR+MWLTL+++K EAA A+K  +  AE E  +K+R LRT++G EFTS  F +YC
Subjt:  VHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYC

TrEMBL top hits | e value | % identity | Alignment
A0A1P8YYM3 Reverse transcriptase | 1.2e-48 | 41.01
Query:  PIQLKEGRVFAKIGERDEQHEHR--RWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTIALLMAAADLRLYDIHF-----PTTSELS
        PI + E +V   +   +E+      RW+LDT ATNHMT +R  FSELDS +  TVKFGDGS++ IEG+G  +  L +    RL  +++          L 
Subjt:  PIQLKEGRVFAKIGERDEQHEHR--RWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTIALLMAAADLRLYDIHF-----PTTSELS

Query:  QF------VLVSP------------TAKR-------------------QKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLELVHG
        Q       V++               AKR                   +KL++E+MV GLP+I  V+++C+ C++ KQ+R PFP    YRA E LELVHG
Subjt:  QF------VLVSP------------TAKR-------------------QKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLELVHG

Query:  DICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYC
        D+CGPI   TP   + FLLLVDD SRFMWLTLL++K++A   +   + R E EC KK++VLRT+ G EFTS  F ++C
Subjt:  DICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYC

A0B9X7 OSIGBa0135C09.3 protein | 1.6e-50 | 31.5
Query:  LASLSTKRTAQSTWEGIKFRRVGVQRVRESNIEQLQKTI---------------------------------EEEWLARL--------------------
        LA+L+ K TAQ  WE IK RR+GVQRVRE+N +QL++                                   E E + +L                    
Subjt:  LASLSTKRTAQSTWEGIKFRRVGVQRVRESNIEQLQKTI---------------------------------EEEWLARL--------------------

Query:  --KLKLHENTGESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQELMEQSSGNKEKAEKAQSEEDVGERSPEQELMEQSSG-
          +L L E TG       R    K  K+   + R D     G   +  +  L    + +  Q+    S GN ++  + + ++D G     Q       G 
Subjt:  --KLKLHENTGESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQELMEQSSGNKEKAEKAQSEEDVGERSPEQELMEQSSG-

Query:  -------------------NKEKAEK---AQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEPIQLKEGRVFAKIGERDEQ
                           +K KA++   AQ E++EPAL +    +   D     V    + V P    +      +P GE + + E +VFA++ +  E 
Subjt:  -------------------NKEKAEK---AQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEPIQLKEGRVFAKIGERDEQ

Query:  HEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTI-----------------------------------ALLMAAADLRLYDI
        H+   WILDT ATNHMT +RSAF++LD+ +  TV+FGDGS++ IEGRG  +                                    +L+    LR++D 
Subjt:  HEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTI-----------------------------------ALLMAAADLRLYDI

Query:  --HFPTTSELSQ-------------FVLVSPTAK----------------RQKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLEL
          H       S                L + +AK                 +KL +++MV GLP ++ V ++CDGCL+GKQRR  FP+Q+ YRADE LEL
Subjt:  --HFPTTSELSQ-------------FVLVSPTAK----------------RQKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLEL

Query:  VHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYC
        VHGD+CGPI+ ATP     FLLLVDD SR+MWLTL+++K EAA A+K  +  AE E  +K+R LRT++G EFTS  F +YC
Subjt:  VHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYC

Q0J5Y3 Os08g0389500 protein | 3.8e-47 | 35.09
Query:  KFRRVGVQRVRESNIEQLQKTI---EEEWLARLKLKLHENTGESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQELMEQSS
        + R V  +R R + +   Q  +   +EEW+A+LK         + G S  KG+  P  S G   ++ GGS+   DR +  S    R+   P        S
Subjt:  KFRRVGVQRVRESNIEQLQKTI---EEEWLARLKLKLHENTGESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQELMEQSS

Query:  GNKEKAEKAQSEEDVGERSPEQELMEQSSGNKEKAEKAQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEPIQLKEGRVFA
           +K +K +     G  + E       S  +++A  AQ EE+E  + MV+         + +V  I     P          + PA E I L E ++F 
Subjt:  GNKEKAEKAQSEEDVGERSPEQELMEQSSGNKEKAEKAQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEPIQLKEGRVFA

Query:  KIGERDEQHEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTIALLMAAADLRLYDI-HFP--TTS------------------
        ++G  +   E  RWILDT ATNHMT  RSAFSEL++ IR TVKFGDGS++ IEGRG  +          L  + H P  TT+                  
Subjt:  KIGERDEQHEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTIALLMAAADLRLYDI-HFP--TTS------------------

Query:  ----------ELSQFVLVSPT-----------------------------------AKRQKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSY
                   L   V+ SP                                       +KL +  MV GLP I +V+++CD CL+GKQRR PFPS+  Y
Subjt:  ----------ELSQFVLVSPT-----------------------------------AKRQKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSY

Query:  RADEPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYCDEFG
        RA E LELVHGDICGP+  ATP    LFLLLVDD SR+MWL LL +K +A+ A+KR    AEAE  +K+R LRT++G EFT+ +F +YC E G
Subjt:  RADEPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYCDEFG

Q7XPB1 OSJNBb0026E15.10 protein | 7.4e-51 | 31.11
Query:  LASLSTKRTAQSTWEGIKFRRVGVQRVRESNIEQLQKTI---------------------------------EEEWLARL--------------------
        LA+L+ K TAQ  WE IK RR+GVQRVRE+N +QL++                                   E E + +L                    
Subjt:  LASLSTKRTAQSTWEGIKFRRVGVQRVRESNIEQLQKTI---------------------------------EEEWLARL--------------------

Query:  --KLKLHENTGESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQELMEQSSGNKEKAEKAQSEEDVGERSPEQELMEQSSG-
          +L L E TG       R    K  K+   + R D     G   +  +  L    + +  Q+    SSGN ++  + + ++D G     Q       G 
Subjt:  --KLKLHENTGESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQELMEQSSGNKEKAEKAQSEEDVGERSPEQELMEQSSG-

Query:  -------------------NKEKAEK---AQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEPIQLKEGRVFAKIGERDEQ
                           +K KA++   AQ E++EPAL +    +   D     V    + V P    +      +P GE + + E +VFA++ +  E 
Subjt:  -------------------NKEKAEK---AQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEPIQLKEGRVFAKIGERDEQ

Query:  HEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEG-------------RGVT-------------------------------------
        H+   WILDT ATNHMT +RSAF+ELD+ +  TV+FGDGS++ IEG             RG+                                      
Subjt:  HEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEG-------------RGVT-------------------------------------

Query:  ---IALLMAAADLRLYDIHFPTTSELSQFVLVSPTAKR-------------QKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLEL
           + + +  +D  LY I       +      +  A R             +KL +++MV GLP ++ V ++CDGCL+GKQRR  FP+Q+ YRADE LEL
Subjt:  ---IALLMAAADLRLYDIHFPTTSELSQFVLVSPTAKR-------------QKLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLEL

Query:  VHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYCDEFG
        VHGD+CGPI+ ATP     FLLLVDD SR+MWLT++++K EAA A+K  + RAE E  +K+R LR ++G EFTS  F +YC   G
Subjt:  VHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYCDEFG

Q7XUD9 OSJNBa0088A01.6 protein | 5.5e-46 | 32.4
Query:  LASLSTKRTAQSTWEGIKFRRVGVQRVRESNIEQLQKTIE----------EEWLARLK--------LKLHENTG--------------------------
        LASL+ K +A+  W+ IK  RVGV RVR+S  + LQK  +          EE+  RL         L +H   G                          
Subjt:  LASLSTKRTAQSTWEGIKFRRVGVQRVRESNIEQLQKTIE----------EEWLARLK--------LKLHENTG--------------------------

Query:  -------ESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQELMEQSSGNKEKAEKAQSEEDVGERSPEQELMEQSSGNKEKA
               E  G       K P KS     +     ++   RW+     E    ++       +  G K+KAE A  + D   R    +  E    N++K 
Subjt:  -------ESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQELMEQSSGNKEKAEKAQSEEDVGERSPEQELMEQSSGNKEKA

Query:  E---------KAQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEP-------IQLKEGRVFAKIGERDEQHEHRRWILDTR
                  K   +   PA     AY    D    E  ++   V     +      A PA  P       I+L E +VFA +   ++Q +   W LDT 
Subjt:  E---------KAQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEP-------IQLKEGRVFAKIGERDEQHEHRRWILDTR

Query:  ATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTI---------------------ALLMA-------AADLRLY----DIHFPTTSEL------
        ATNHMT  R  F+ELD  +R +V+FGDGS++ IEGRG  I                     A L++         D+R+Y     I  P    L      
Subjt:  ATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTI---------------------ALLMA-------AADLRLY----DIHFPTTSEL------

Query:  -SQFVLVSPT--------AKRQ-------------------KLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLELVHGDICGPIKS
         ++  L+  T        AK Q                   +L    MV G+P I +V +LCD C+I KQRRTPFP ++ +RA++ LELVHGD+CGPI  
Subjt:  -SQFVLVSPT--------AKRQ-------------------KLQKEKMVHGLPAIKNVNKLCDGCLIGKQRRTPFPSQTSYRADEPLELVHGDICGPIKS

Query:  ATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYCDEFG
        ATPG K+ FLLL DD SR+MW+TLL  KSEAA+A+KR + RAEAE ++K+R+LRT+ G EFTS  F  YC E G
Subjt:  ATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYCDEFG

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | 9.4e-11 | 28.57
Query:  KRQKLQKEKMVHGLPAIKNVN---KLCDGCLIGKQRRTPFPS-QTSYRADEPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEA
        K  +++++ M      + N+    ++C+ CL GKQ R PF   +       PL +VH D+CGPI   T   K+ F++ VD  + +    L++ KS+    
Subjt:  KRQKLQKEKMVHGLPAIKNVN---KLCDGCLIGKQRRTPFPS-QTSYRADEPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEA

Query:  VKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYCDEFG
         +    ++EA    K+  L  + GRE+ S   R++C + G
Subjt:  VKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYCDEFG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | 1.3e-15 | 35.65
Query:  KLCDGCLIGKQRRTPFPSQTSYRADEPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGR
        K CD CL GKQ R  F + +S R    L+LV+ D+CGP++  + G    F+  +DD SR +W+ +L+ K +  +  ++     E E  +K++ LR++ G 
Subjt:  KLCDGCLIGKQRRTPFPSQTSYRADEPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGR

Query:  EFTSTSFRKYCDEFG
        E+TS  F +YC   G
Subjt:  EFTSTSFRKYCDEFG

P25384 Transposon Ty2-C Gag-Pol polyprotein | 3.5e-05 | 27.43
Query:  CDGCLIGK--QRRTPFPSQTSYRAD-EPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAA--EAVKRIKVRAEAECEKKMRVLRTE
        C  CLIGK  + R    S+  Y+   EP + +H DI GP+        S F+   D+K+RF W+  L  + E +       I    + +   ++ V++ +
Subjt:  CDGCLIGK--QRRTPFPSQTSYRAD-EPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAA--EAVKRIKVRAEAECEKKMRVLRTE

Query:  QGREFTSTSFRKY
        +G E+T+ +  K+
Subjt:  QGREFTSTSFRKY

Q12472 Transposon Ty2-DR1 Gag-Pol polyprotein | 3.5e-05 | 27.43
Query:  CDGCLIGK--QRRTPFPSQTSYRAD-EPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAA--EAVKRIKVRAEAECEKKMRVLRTE
        C  CLIGK  + R    S+  Y+   EP + +H DI GP+        S F+   D+K+RF W+  L  + E +       I    + +   ++ V++ +
Subjt:  CDGCLIGK--QRRTPFPSQTSYRAD-EPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAA--EAVKRIKVRAEAECEKKMRVLRTE

Query:  QGREFTSTSFRKY
        +G E+T+ +  K+
Subjt:  QGREFTSTSFRKY

Q12491 Transposon Ty2-B Gag-Pol polyprotein | 3.5e-05 | 27.43
Query:  CDGCLIGK--QRRTPFPSQTSYRAD-EPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAA--EAVKRIKVRAEAECEKKMRVLRTE
        C  CLIGK  + R    S+  Y+   EP + +H DI GP+        S F+   D+K+RF W+  L  + E +       I    + +   ++ V++ +
Subjt:  CDGCLIGK--QRRTPFPSQTSYRAD-EPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAA--EAVKRIKVRAEAECEKKMRVLRTE

Query:  QGREFTSTSFRKY
        +G E+T+ +  K+
Subjt:  QGREFTSTSFRKY

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGCCGCTGGCGTCTCTCTCCACCAAGCGCACCGCGCAATCGACCTGGGAGGGAATCAAATTCCGTCGGGTTGGTGTGCAGCGAGTGCGGGAATCCAACATCGAGCAGCT
GCAGAAGACTATTGAGGAGGAATGGCTTGCACGCCTAAAGCTAAAGCTCCACGAGAACACCGGGGAGAGCAACGGGCCCTCCAGCCGCAAGGGCAGCAAGAAACCATGGA
AATCTCGCGGGCGCACATGCAGGAAAGATGGGGGATCAAAAGAAGGAGTCGACCGATGGCAGGTCGATTCAGTGCTCGAACTGCGGGAAGAGAGGTCACCTGAGCAAGAA
TTGATGGAGCAAAGCTCAGGGAACAAGGAGAAGGCCGAGAAGGCTCAATCCGAGGAGGACGTGGGAGAGAGGTCACCTGAGCAAGAATTGATGGAGCAAAGCTCAGGGAA
CAAGGAGAAGGCCGAGAAGGCTCAATCCGAAGAGGACGAGCCAGCTCTCTTCATGGTAAGTGCATACATCCCCAATTTCGATTCCAAATCCACGGAGGTGGAGGTAATCG
ACGACGACGTCGAACCCGAGGAAGAGCTCCAACTGGGCGTAGGAAAGGCGGCACCAGCTGGGGAGCCAATTCAATTGAAGGAGGGAAGAGTGTTCGCTAAGATTGGCGAG
AGGGACGAGCAGCACGAGCACCGGCGATGGATCCTCGACACAAGGGCAACAAACCATATGACCAGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGAGATTCGCAGGAC
GGTGAAATTCGGCGATGGCTCCATCATCGAGATCGAAGGGCGTGGTGTCACAATCGCACTTTTAATGGCAGCAGCGGACTTGCGATTGTACGACATTCACTTTCCAACAA
CAAGTGAGCTAAGCCAATTCGTTCTTGTGTCGCCTACGGCCAAGCGACAAAAGCTACAGAAGGAGAAGATGGTGCACGGTTTGCCGGCAATCAAAAACGTGAACAAGCTG
TGCGACGGGTGCCTCATCGGCAAACAGAGGCGCACACCCTTTCCGTCTCAGACATCCTACCGCGCCGACGAGCCGTTGGAGCTTGTACACGGCGATATCTGTGGGCCCAT
CAAGTCGGCGACCCCAGGCTGTAAGAGTCTCTTCCTCCTATTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACCTTGCTGCAAGCGAAAAGTGAGGCGGCAGAGGCGG
TTAAGCGCATTAAAGTACGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTACGCACAGAACAAGGCAGAGAATTCACCTCGACAAGTTTCCGTAAGTACTGCGAC
GAGTTCGGCACCTAA
mRNA sequence
ATGCCGCTGGCGTCTCTCTCCACCAAGCGCACCGCGCAATCGACCTGGGAGGGAATCAAATTCCGTCGGGTTGGTGTGCAGCGAGTGCGGGAATCCAACATCGAGCAGCT
GCAGAAGACTATTGAGGAGGAATGGCTTGCACGCCTAAAGCTAAAGCTCCACGAGAACACCGGGGAGAGCAACGGGCCCTCCAGCCGCAAGGGCAGCAAGAAACCATGGA
AATCTCGCGGGCGCACATGCAGGAAAGATGGGGGATCAAAAGAAGGAGTCGACCGATGGCAGGTCGATTCAGTGCTCGAACTGCGGGAAGAGAGGTCACCTGAGCAAGAA
TTGATGGAGCAAAGCTCAGGGAACAAGGAGAAGGCCGAGAAGGCTCAATCCGAGGAGGACGTGGGAGAGAGGTCACCTGAGCAAGAATTGATGGAGCAAAGCTCAGGGAA
CAAGGAGAAGGCCGAGAAGGCTCAATCCGAAGAGGACGAGCCAGCTCTCTTCATGGTAAGTGCATACATCCCCAATTTCGATTCCAAATCCACGGAGGTGGAGGTAATCG
ACGACGACGTCGAACCCGAGGAAGAGCTCCAACTGGGCGTAGGAAAGGCGGCACCAGCTGGGGAGCCAATTCAATTGAAGGAGGGAAGAGTGTTCGCTAAGATTGGCGAG
AGGGACGAGCAGCACGAGCACCGGCGATGGATCCTCGACACAAGGGCAACAAACCATATGACCAGGGCTAGATCTGCGTTCTCCGAGCTCGACTCGGAGATTCGCAGGAC
GGTGAAATTCGGCGATGGCTCCATCATCGAGATCGAAGGGCGTGGTGTCACAATCGCACTTTTAATGGCAGCAGCGGACTTGCGATTGTACGACATTCACTTTCCAACAA
CAAGTGAGCTAAGCCAATTCGTTCTTGTGTCGCCTACGGCCAAGCGACAAAAGCTACAGAAGGAGAAGATGGTGCACGGTTTGCCGGCAATCAAAAACGTGAACAAGCTG
TGCGACGGGTGCCTCATCGGCAAACAGAGGCGCACACCCTTTCCGTCTCAGACATCCTACCGCGCCGACGAGCCGTTGGAGCTTGTACACGGCGATATCTGTGGGCCCAT
CAAGTCGGCGACCCCAGGCTGTAAGAGTCTCTTCCTCCTATTAGTCGATGACAAAAGCCGCTTCATGTGGCTGACCTTGCTGCAAGCGAAAAGTGAGGCGGCAGAGGCGG
TTAAGCGCATTAAAGTACGAGCGGAGGCCGAATGTGAGAAGAAGATGCGAGTGCTACGCACAGAACAAGGCAGAGAATTCACCTCGACAAGTTTCCGTAAGTACTGCGAC
GAGTTCGGCACCTAA
Protein sequence
MPLASLSTKRTAQSTWEGIKFRRVGVQRVRESNIEQLQKTIEEEWLARLKLKLHENTGESNGPSSRKGSKKPWKSRGRTCRKDGGSKEGVDRWQVDSVLELREERSPEQE
LMEQSSGNKEKAEKAQSEEDVGERSPEQELMEQSSGNKEKAEKAQSEEDEPALFMVSAYIPNFDSKSTEVEVIDDDVEPEEELQLGVGKAAPAGEPIQLKEGRVFAKIGE
RDEQHEHRRWILDTRATNHMTRARSAFSELDSEIRRTVKFGDGSIIEIEGRGVTIALLMAAADLRLYDIHFPTTSELSQFVLVSPTAKRQKLQKEKMVHGLPAIKNVNKL
CDGCLIGKQRRTPFPSQTSYRADEPLELVHGDICGPIKSATPGCKSLFLLLVDDKSRFMWLTLLQAKSEAAEAVKRIKVRAEAECEKKMRVLRTEQGREFTSTSFRKYCD
EFGT
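
The protein sequence above should correspond to a translation of the CDS sequence. A minimal sketch of that check follows, assuming the standard nuclear genetic code (NCBI translation table 1), which this record does not state explicitly. The names `CODON_TABLE` and `translate` are illustrative, and only the first ten codons and the last five codons of the CDS are used as input.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
# ASSUMPTION: the record uses the standard nuclear code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons and last five codons of the CDS listed above.
print(translate("ATGCCGCTGGCGTCTCTCTCCACCAAGCGC"))  # MPLASLSTKR
print(translate("GAGTTCGGCACCTAA"))                 # EFGT
```

Both fragments match the start (`MPLASLSTKR`) and end (`EFGT`) of the listed protein sequence. Passing the full CDS, with its line breaks, to `translate` would extend the same check to the whole sequence.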