; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0094071 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0094071
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr04:6570629..6571348
RNA-Seq ExpressionCmc04g0094071
SyntenyCmc04g0094071
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026117.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.6e-11387.45Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
        ++ ELLQFRRNN+WTLVSKPEGVNVIGTKWIFKNK DE+GC TKNKARLVAQ YTQVEGVDFDETF+PVARL+AIRLLLGISCI KFKLYQMDVKSAFLN
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ
         YLNEEVYVAQ KGFVDS++ KHVYKLNKALYGLKQAPRAWYDRLTVYLRG+GY R EIDK+ FIHRKSDQLLVAQIYVDDI F GFP DLVNNFINIMQ
Subjt:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ

Query:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK
         EFEMS VGELSCFL LQIKQKND IFISQEKY +NMVK
Subjt:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK

KAA0032735.1 gag-pol polyprotein [Cucumis melo var. makuwa]6.6e-12598.27Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
        MQEELLQFRRNN+WTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKAR VAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ
        GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISF GFPQDLVNNFINIMQ
Subjt:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ

Query:  LEFEMSMVGELSCFLELQIKQKNDGIFISQE
        LEFEMSMVGELSCFL LQIKQKNDGIFISQE
Subjt:  LEFEMSMVGELSCFLELQIKQKNDGIFISQE

KAA0053137.1 gag-pol polyprotein [Cucumis melo var. makuwa]7.8e-11890.38Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
        MQEELLQFRRNN+WTL+SKPEGVNVIGTKWIFKNKTDE+GC TKNKARLVAQ YTQVEGVDFDETF+PVARLEAIRLLLGISCI KFKLYQ+DVKS FLN
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ
        GYLNEEVYVAQ KGFVDS++ KHVYKLNKALYGLKQA RAWYDRLTVYLRG+GYSRGEIDK LFIHRKSDQLLVAQIYVDDI F GFP DL+NNFINIMQ
Subjt:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ

Query:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK
         EFEMSMVGELSCFL LQIKQKNDGIFISQEKY RNMVK
Subjt:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK

TYJ98791.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.3e-11789.96Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
        MQEELLQF+RNN+WTLV KPEGVNVIGTKW+FKNKTDE+GC TKNKARLVAQ YTQVEG+DFDETFSPVARLEAIRLLLGISCI KFKLYQMDVKSAFLN
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ
        GYLNEEVYVAQ K FVDS++ KHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDK+LFIHRKSDQLLVAQIYVDDI F GFPQ LVNNFI +MQ
Subjt:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ

Query:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK
         EFEMSMVGELSCFL LQIKQKND IFISQEKY RNMVK
Subjt:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK

TYK11575.1 gag-pol polyprotein [Cucumis melo var. makuwa]7.5e-11387.03Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
        ++ ELLQFRRNN+WTLVSKPEGVNVIGTKWIFKNK DE+GC TKNKARLVAQ YTQVEGVDFDETF+PVARL+AIRLLLGISCI KFKLYQMDVKSAFLN
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ
         YLNEEVYVAQ KGFVDS++ KHVYKLNKALYGLKQAPRAWYDRLT YLRG+GY R EIDK+ FIHRKSDQLLVAQIYVDDI F GFP DLVNNFINIMQ
Subjt:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ

Query:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK
         EFEMS VGELSCFL LQIKQKND IFISQEKY +NMVK
Subjt:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK

TrEMBL top hitse value%identityAlignment
A0A5A7SN07 Gag-pol polyprotein1.3e-11387.45Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
        ++ ELLQFRRNN+WTLVSKPEGVNVIGTKWIFKNK DE+GC TKNKARLVAQ YTQVEGVDFDETF+PVARL+AIRLLLGISCI KFKLYQMDVKSAFLN
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ
         YLNEEVYVAQ KGFVDS++ KHVYKLNKALYGLKQAPRAWYDRLTVYLRG+GY R EIDK+ FIHRKSDQLLVAQIYVDDI F GFP DLVNNFINIMQ
Subjt:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ

Query:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK
         EFEMS VGELSCFL LQIKQKND IFISQEKY +NMVK
Subjt:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK

A0A5D3BI56 Gag-pol polyprotein3.2e-12598.27Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
        MQEELLQFRRNN+WTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKAR VAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ
        GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISF GFPQDLVNNFINIMQ
Subjt:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ

Query:  LEFEMSMVGELSCFLELQIKQKNDGIFISQE
        LEFEMSMVGELSCFL LQIKQKNDGIFISQE
Subjt:  LEFEMSMVGELSCFLELQIKQKNDGIFISQE

A0A5D3BJA9 Gag-pol polyprotein1.1e-11789.96Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
        MQEELLQF+RNN+WTLV KPEGVNVIGTKW+FKNKTDE+GC TKNKARLVAQ YTQVEG+DFDETFSPVARLEAIRLLLGISCI KFKLYQMDVKSAFLN
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ
        GYLNEEVYVAQ K FVDS++ KHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDK+LFIHRKSDQLLVAQIYVDDI F GFPQ LVNNFI +MQ
Subjt:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ

Query:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK
         EFEMSMVGELSCFL LQIKQKND IFISQEKY RNMVK
Subjt:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK

A0A5D3BPB3 Gag-pol polyprotein3.8e-11890.38Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
        MQEELLQFRRNN+WTL+SKPEGVNVIGTKWIFKNKTDE+GC TKNKARLVAQ YTQVEGVDFDETF+PVARLEAIRLLLGISCI KFKLYQ+DVKS FLN
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ
        GYLNEEVYVAQ KGFVDS++ KHVYKLNKALYGLKQA RAWYDRLTVYLRG+GYSRGEIDK LFIHRKSDQLLVAQIYVDDI F GFP DL+NNFINIMQ
Subjt:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ

Query:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK
         EFEMSMVGELSCFL LQIKQKNDGIFISQEKY RNMVK
Subjt:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK

A0A5D3CJ17 Gag-pol polyprotein3.7e-11387.03Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
        ++ ELLQFRRNN+WTLVSKPEGVNVIGTKWIFKNK DE+GC TKNKARLVAQ YTQVEGVDFDETF+PVARL+AIRLLLGISCI KFKLYQMDVKSAFLN
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ
         YLNEEVYVAQ KGFVDS++ KHVYKLNKALYGLKQAPRAWYDRLT YLRG+GY R EIDK+ FIHRKSDQLLVAQIYVDDI F GFP DLVNNFINIMQ
Subjt:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQ

Query:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK
         EFEMS VGELSCFL LQIKQKND IFISQEKY +NMVK
Subjt:  LEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMVK

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.6e-4136.29Show/hide
Query:  ELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLNGYL
        EL   + NN WT+  +PE  N++ ++W+F  K +E G   + KARLVA+ +TQ   +D++ETF+PVAR+ + R +L +   +  K++QMDVK+AFLNG L
Subjt:  ELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLNGYL

Query:  NEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKS--DQLLVAQIYVDDISFRGFPQDLVNNFINIMQL
         EE+Y+   +G   S N  +V KLNKA+YGLKQA R W++     L+   +    +D+ ++I  K   ++ +   +YVDD+         +NNF   +  
Subjt:  NEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKS--DQLLVAQIYVDDISFRGFPQDLVNNFINIMQL

Query:  EFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMV
        +F M+ + E+  F+ ++I+ + D I++SQ  YV+ ++
Subjt:  EFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.1e-3734.71Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
        MQEE+   ++N  + LV  P+G   +  KW+FK K D      + KARLV + + Q +G+DFDE FSPV ++ +IR +L ++     ++ Q+DVK+AFL+
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSD-QLLVAQIYVDDISFRGFPQDLVNNFINIM
        G L EE+Y+ Q +GF  +     V KLNK+LYGLKQAPR WY +   +++ + Y +   D  ++  R S+   ++  +YVDD+   G  + L+      +
Subjt:  GYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSD-QLLVAQIYVDDISFRGFPQDLVNNFINIM

Query:  QLEFEMSMVGELSCFLELQIKQKNDG--IFISQEKYVRNMVK
           F+M  +G     L ++I ++     +++SQEKY+  +++
Subjt:  QLEFEMSMVGELSCFLELQIKQKNDG--IFISQEKYVRNMVK

P25600 Putative transposon Ty5-1 protein YCL074W3.4e-1532.64Show/hide
Query:  MDVKSAFLNGYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDL
        MDV +AFLN  ++E +YV Q  GFV+ +N  +V++L   +YGLKQAP  W + +   L+  G+ R E +  L+    SD  +   +YVDD+        +
Subjt:  MDVKSAFLNGYLNEEVYVAQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDL

Query:  VNNFINIMQLEFEMSMVGELSCFLELQIKQKNDG-IFISQEKYV
         +     +   + M  +G++  FL L I Q ++G I +S + Y+
Subjt:  VNNFINIMQLEFEMSMVGELSCFLELQIKQKNDG-IFISQEKYV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-4238.86Show/hide
Query:  NNLWTLVSKPEG-VNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLNGYLNEEVYV
        N+ W LV  P   V ++G +WIF  K +  G   + KARLVA+ Y Q  G+D+ ETFSPV +  +IR++LG++    + + Q+DV +AFL G L ++VY+
Subjt:  NNLWTLVSKPEG-VNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLNGYLNEEVYV

Query:  AQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQLEFEMSMVG
        +Q  GF+D     +V KL KALYGLKQAPRAWY  L  YL   G+     D SLF+ ++   ++   +YVDDI   G    L++N ++ +   F +    
Subjt:  AQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQLEFEMSMVG

Query:  ELSCFLELQIKQKNDGIFISQEKYVRNMV
        EL  FL ++ K+   G+ +SQ +Y+ +++
Subjt:  ELSCFLELQIKQKNDGIFISQEKYVRNMV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.6e-4137.99Show/hide
Query:  NNLWTLV-SKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLNGYLNEEVYV
        N+ W LV   P  V ++G +WIF  K +  G   + KARLVA+ Y Q  G+D+ ETFSPV +  +IR++LG++    + + Q+DV +AFL G L +EVY+
Subjt:  NNLWTLV-SKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLNGYLNEEVYV

Query:  AQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQLEFEMSMVG
        +Q  GFVD     +V +L KA+YGLKQAPRAWY  L  YL   G+     D SLF+ ++   ++   +YVDDI   G    L+ + ++ +   F +    
Subjt:  AQRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQLEFEMSMVG

Query:  ELSCFLELQIKQKNDGIFISQEKYVRNMV
        +L  FL ++ K+   G+ +SQ +Y  +++
Subjt:  ELSCFLELQIKQKNDGIFISQEKYVRNMV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.6e-3936.78Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN
        M +E+      + W + + P     IG KW++K K +  G   + KARLVA+ YTQ EG+DF ETFSPV +L +++L+L IS I+ F L+Q+D+ +AFLN
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLN

Query:  GYLNEEVYVAQRKGFV----DSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFI
        G L+EE+Y+    G+     DS     V  L K++YGLKQA R W+ + +V L G G+ +   D + F+   +   L   +YVDDI         V+   
Subjt:  GYLNEEVYVAQRKGFV----DSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFI

Query:  NIMQLEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMV
        + ++  F++  +G L  FL L+I +   GI I Q KY  +++
Subjt:  NIMQLEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMV

ATMG00810.1 DNA/RNA polymerases superfamily protein2.8e-0437.1Show/hide
Query:  IYVDDISFRGFPQDLVNNFINIMQLEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMV
        +YVDDI   G    L+N  I  +   F M  +G +  FL +QIK    G+F+SQ KY   ++
Subjt:  IYVDDISFRGFPQDLVNNFINIMQLEFEMSMVGELSCFLELQIKQKNDGIFISQEKYVRNMV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)1.0e-1446.34Show/hide
Query:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGIS
        MQEEL    RN  W LV  P   N++G KW+FK K    G   + KARLVA+ + Q EG+ F ET+SPV R   IR +L ++
Subjt:  MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGAGGAGCTACTGCAATTTAGACGAAACAATCTCTGGACGTTAGTTTCAAAGCCAGAAGGTGTAAACGTTATTGGCACCAAATGGATATTTAAAAATAAGACTGA
TGAAAGTGGATGTGGGACGAAAAATAAAGCCAGATTAGTAGCTCAACGGTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTTCTCCTGTTGCTCGACTTGAAG
CCATTCGACTTTTACTTGGTATATCATGCATACATAAATTTAAATTGTATCAGATGGATGTAAAGAGTGCCTTCTTAAATGGGTACTTGAACGAGGAGGTTTATGTTGCT
CAACGAAAAGGTTTTGTTGATTCCAAGAACCTGAAGCATGTGTATAAGCTCAACAAAGCCTTATATGGACTAAAGCAAGCTCCGAGAGCTTGGTATGACCGACTAACTGT
ATACTTGAGAGGTAAAGGATATTCCAGAGGAGAAATTGATAAGTCTTTGTTCATACACAGGAAATCTGACCAACTGTTGGTGGCTCAAATTTATGTTGATGACATCAGTT
TTAGAGGATTTCCTCAAGATTTAGTAAATAATTTCATTAATATTATGCAGTTAGAATTCGAAATGAGCATGGTTGGAGAGCTTTCATGCTTTCTGGAACTTCAAATTAAG
CAAAAGAATGATGGCATTTTCATATCACAAGAAAAGTATGTCAGGAATATGGTCAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGAGGAGCTACTGCAATTTAGACGAAACAATCTCTGGACGTTAGTTTCAAAGCCAGAAGGTGTAAACGTTATTGGCACCAAATGGATATTTAAAAATAAGACTGA
TGAAAGTGGATGTGGGACGAAAAATAAAGCCAGATTAGTAGCTCAACGGTATACTCAAGTTGAAGGTGTTGACTTTGATGAAACGTTTTCTCCTGTTGCTCGACTTGAAG
CCATTCGACTTTTACTTGGTATATCATGCATACATAAATTTAAATTGTATCAGATGGATGTAAAGAGTGCCTTCTTAAATGGGTACTTGAACGAGGAGGTTTATGTTGCT
CAACGAAAAGGTTTTGTTGATTCCAAGAACCTGAAGCATGTGTATAAGCTCAACAAAGCCTTATATGGACTAAAGCAAGCTCCGAGAGCTTGGTATGACCGACTAACTGT
ATACTTGAGAGGTAAAGGATATTCCAGAGGAGAAATTGATAAGTCTTTGTTCATACACAGGAAATCTGACCAACTGTTGGTGGCTCAAATTTATGTTGATGACATCAGTT
TTAGAGGATTTCCTCAAGATTTAGTAAATAATTTCATTAATATTATGCAGTTAGAATTCGAAATGAGCATGGTTGGAGAGCTTTCATGCTTTCTGGAACTTCAAATTAAG
CAAAAGAATGATGGCATTTTCATATCACAAGAAAAGTATGTCAGGAATATGGTCAAATAA
Protein sequenceShow/hide protein sequence
MQEELLQFRRNNLWTLVSKPEGVNVIGTKWIFKNKTDESGCGTKNKARLVAQRYTQVEGVDFDETFSPVARLEAIRLLLGISCIHKFKLYQMDVKSAFLNGYLNEEVYVA
QRKGFVDSKNLKHVYKLNKALYGLKQAPRAWYDRLTVYLRGKGYSRGEIDKSLFIHRKSDQLLVAQIYVDDISFRGFPQDLVNNFINIMQLEFEMSMVGELSCFLELQIK
QKNDGIFISQEKYVRNMVK