; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc02g0045021 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc02g0045021
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag-pol polyprotein
Genome locationCMiso1.1chr02:8394835..8395599
RNA-Seq ExpressionCmc02g0045021
SyntenyCmc02g0045021
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042297.1 gag-pol polyprotein [Cucumis melo var. makuwa]2.2e-10277.42Show/hide
Query:  LEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCD
        + +A NK TPAATHVKLTKDTEGAKVDHKLYRSIVG +LYLTA+RPDIAYA+GICARYQ DPRITHLEAVKRILKYV+GTSDFGMMYSYDTTPT+VGYCD
Subjt:  LEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCD

Query:  TDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTKHIDIRH
         DW GS DD K+                            EAEYIAAG+GCTQLIWMKNML EYGFDQDTMTLY DNMSAIDISKN VQHSRTKHIDIRH
Subjt:  TDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTKHIDIRH

Query:  HFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT
        HFIRELVEDKVIK D I SNLQL +IFTKPLDASSFEYL  GLGVCRT
Subjt:  HFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT

KAA0055610.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.7e-12690.16Show/hide
Query:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT
        MVKKFGLEQARNK TP ATHVKLTKDTE  + DHKLYRSIVG LLYLTASRP+IAY VGICA YQ DPRITHLEAVKRILKYV+GTSDFGMMYSYDTTP 
Subjt:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT

Query:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK
        LVGYCD DW GSADDRKSTSGG FFLGNNLISWLSKKQNCVSLST EAEYIAAG+GCTQLIWMKNMLHEYGFDQDTMTLY DN SAIDISKN VQHSRTK
Subjt:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK

Query:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT
        HIDIRHHFIRELVEDKVIKLD I SNLQL +IFTKPLDASSFEYLRAGLGVCRT
Subjt:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT

KAA0066740.1 gag-pol polyprotein [Cucumis melo var. makuwa]4.9e-10278Show/hide
Query:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT
        +VKKFGLEQARNK TPA THVKLTKD EGA+VDHKLYRSIVG LLYLTASRPDIAYA+GI ARYQ  PRITHLEA+KRILKYV+ T DFGMMYSYDTTPT
Subjt:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT

Query:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK
        LVGYCD DW GS DDRK                             EAEYIAAG+GCTQLIWMKN+LHEYGFDQDTMTLY +NMSAIDISKNLVQHSRTK
Subjt:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK

Query:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLG
        HIDIRHHFIRE VE+KVIKLD I SNLQLANIFTKPLDASSFEYL AGLG
Subjt:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLG

TYJ97126.1 gag-pol polyprotein [Cucumis melo var. makuwa]1.9e-10678.74Show/hide
Query:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT
        MVKKFGLEQARNK TPAATHVKLTKDT+GAKVDHKLYRSI G LLYLTASRPDIAYA+GI ARYQ +PRITHLEAVKRILKYV+GTSDFGMMYSYDTTPT
Subjt:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT

Query:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK
        +VGYCD DW GSADD K+                            EAEY+AAG+GCTQLIWM+NML EYGFDQDTMTLY DNMSAIDISKN VQHSRTK
Subjt:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK

Query:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT
        HIDIRHHF+RELVEDKVIK D I SNLQLA+IFTKPLDASSFEYL AGLGVCRT
Subjt:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT

TYK23188.1 gag-pol polyprotein [Cucumis melo var. makuwa]5.2e-11286.32Show/hide
Query:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT
        M KKFGLEQARNK TPAATHVKLT+D +GA+VDHKLYRSIV  LLYLTASRPDIAYAVGICARYQ DPRI+HLEAVKRILKYV+GT+DFGMMYSYDTTPT
Subjt:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT

Query:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK
        LVGYCD DW G ADDRKSTSGG FFLGNNLI WLSKKQNCVSLST EAEYI AG+GCTQLIWM+N+L EYGFDQ T+TLY DNMSAIDISKN VQHSR K
Subjt:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK

Query:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFT
        HIDIRHHFIRELVEDKVI+LD I SNLQLA+IFT
Subjt:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFT

TrEMBL top hitse value%identityAlignment
A0A5A7TKS7 Gag-pol polyprotein1.1e-10277.42Show/hide
Query:  LEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCD
        + +A NK TPAATHVKLTKDTEGAKVDHKLYRSIVG +LYLTA+RPDIAYA+GICARYQ DPRITHLEAVKRILKYV+GTSDFGMMYSYDTTPT+VGYCD
Subjt:  LEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCD

Query:  TDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTKHIDIRH
         DW GS DD K+                            EAEYIAAG+GCTQLIWMKNML EYGFDQDTMTLY DNMSAIDISKN VQHSRTKHIDIRH
Subjt:  TDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTKHIDIRH

Query:  HFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT
        HFIRELVEDKVIK D I SNLQL +IFTKPLDASSFEYL  GLGVCRT
Subjt:  HFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT

A0A5D3BDQ5 Gag-pol polyprotein9.3e-10778.74Show/hide
Query:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT
        MVKKFGLEQARNK TPAATHVKLTKDT+GAKVDHKLYRSI G LLYLTASRPDIAYA+GI ARYQ +PRITHLEAVKRILKYV+GTSDFGMMYSYDTTPT
Subjt:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT

Query:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK
        +VGYCD DW GSADD K+                            EAEY+AAG+GCTQLIWM+NML EYGFDQDTMTLY DNMSAIDISKN VQHSRTK
Subjt:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK

Query:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT
        HIDIRHHF+RELVEDKVIK D I SNLQLA+IFTKPLDASSFEYL AGLGVCRT
Subjt:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT

A0A5D3DI97 Gag-pol polyprotein2.5e-11286.32Show/hide
Query:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT
        M KKFGLEQARNK TPAATHVKLT+D +GA+VDHKLYRSIV  LLYLTASRPDIAYAVGICARYQ DPRI+HLEAVKRILKYV+GT+DFGMMYSYDTTPT
Subjt:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT

Query:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK
        LVGYCD DW G ADDRKSTSGG FFLGNNLI WLSKKQNCVSLST EAEYI AG+GCTQLIWM+N+L EYGFDQ T+TLY DNMSAIDISKN VQHSR K
Subjt:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK

Query:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFT
        HIDIRHHFIRELVEDKVI+LD I SNLQLA+IFT
Subjt:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFT

A0A5D3DLL1 Gag-pol polyprotein8.1e-12790.16Show/hide
Query:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT
        MVKKFGLEQARNK TP ATHVKLTKDTE  + DHKLYRSIVG LLYLTASRP+IAY VGICA YQ DPRITHLEAVKRILKYV+GTSDFGMMYSYDTTP 
Subjt:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT

Query:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK
        LVGYCD DW GSADDRKSTSGG FFLGNNLISWLSKKQNCVSLST EAEYIAAG+GCTQLIWMKNMLHEYGFDQDTMTLY DN SAIDISKN VQHSRTK
Subjt:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK

Query:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT
        HIDIRHHFIRELVEDKVIKLD I SNLQL +IFTKPLDASSFEYLRAGLGVCRT
Subjt:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT

A0A5D3DWS6 Gag-pol polyprotein2.4e-10278Show/hide
Query:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT
        +VKKFGLEQARNK TPA THVKLTKD EGA+VDHKLYRSIVG LLYLTASRPDIAYA+GI ARYQ  PRITHLEA+KRILKYV+ T DFGMMYSYDTTPT
Subjt:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT

Query:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK
        LVGYCD DW GS DDRK                             EAEYIAAG+GCTQLIWMKN+LHEYGFDQDTMTLY +NMSAIDISKNLVQHSRTK
Subjt:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTK

Query:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLG
        HIDIRHHFIRE VE+KVIKLD I SNLQLANIFTKPLDASSFEYL AGLG
Subjt:  HIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLG

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.9e-3836.82Show/hide
Query:  MVKKFGLEQARNKWTPAATHV--KLTKDTEGAKVDHKLYRSIVGGLLY-LTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDT
        ++ KF +E      TP  + +  +L    E         RS++G L+Y +  +RPD+  AV I +RY +       + +KR+L+Y+ GT D  +++  + 
Subjt:  MVKKFGLEQARNKWTPAATHV--KLTKDTEGAKVDHKLYRSIVGGLLY-LTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDT

Query:  T--PTLVGYCDTDWVGSADDRKSTSGGYFFLGN-NLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFD-QDTMTLYYDNMSAIDISKNL
             ++GY D+DW GS  DRKST+G  F + + NLI W +K+QN V+ S+ EAEY+A      + +W+K +L       ++ + +Y DN   I I+ N 
Subjt:  T--PTLVGYCDTDWVGSADDRKSTSGGYFFLGN-NLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFD-QDTMTLYYDNMSAIDISKNL

Query:  VQHSRTKHIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGV
          H R KHIDI++HF RE V++ VI L+ I +  QLA+IFTKPL A+ F  LR  LG+
Subjt:  VQHSRTKHIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGV

P0CV72 Secreted RxLR effector protein 1611.7e-2542.19Show/hide
Query:  YRSIVGGLLYL-TASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLS
        Y S VG ++YL   +RPD+A AVG+ +++ +DP  TH +A+KR+L+Y+  T  +G+ ++   T  LVGY D DW G  + R+STSG  F L    +SW S
Subjt:  YRSIVGGLLYL-TASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLS

Query:  KKQNCVSLSTAEAEYIAAGNGCTQLIWM
        KKQ  V+LS+ E EY+A      + +W+
Subjt:  KKQNCVSLSTAEAEYIAAGNGCTQLIWM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.5e-5041.86Show/hide
Query:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHK------LYRSIVGGLLY-LTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMY
        ++++F ++ A+   TP A H+KL+K      V+ K       Y S VG L+Y +  +RPDIA+AVG+ +R+  +P   H EAVK IL+Y+ GT+   + +
Subjt:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHK------LYRSIVGGLLY-LTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMY

Query:  SYDTTPTLVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNL
           + P L GY D D  G  D+RKS++G  F      ISW SK Q CV+LST EAEYIAA     ++IW+K  L E G  Q    +Y D+ SAID+SKN 
Subjt:  SYDTTPTLVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNL

Query:  VQHSRTKHIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGV
        + H+RTKHID+R+H+IRE+V+D+ +K+ +IS+N   A++ TK +  + FE  +  +G+
Subjt:  VQHSRTKHIDIRHHFIRELVEDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE18.5e-4139.58Show/hide
Query:  TPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCDTDWVGSAD
        TP A   KL+  +     D   YR IVG L YL  +RPDI+YAV   +++   P   HL+A+KRIL+Y+ GT + G+      T +L  Y D DW G  D
Subjt:  TPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCDTDWVGSAD

Query:  DRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFD-QDTMTLYYDNMSAIDISKNLVQHSRTKHIDIRHHFIRELV
        D  ST+G   +LG++ ISW SKKQ  V  S+ EAEY +  N  +++ W+ ++L E G        +Y DN+ A  +  N V HSR KHI I +HFIR  V
Subjt:  DRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFD-QDTMTLYYDNMSAIDISKNLVQHSRTKHIDIRHHFIRELV

Query:  EDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCR
        +   +++  +S++ QLA+  TKPL  ++F+   + +GV R
Subjt:  EDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.2e-4139.58Show/hide
Query:  TPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCDTDWVGSAD
        TP AT  KLT  +     D   YR IVG L YL  +RPD++YAV   ++Y   P   H  A+KR+L+Y+ GT D G+      T +L  Y D DW G  D
Subjt:  TPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCDTDWVGSAD

Query:  DRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFD-QDTMTLYYDNMSAIDISKNLVQHSRTKHIDIRHHFIRELV
        D  ST+G   +LG++ ISW SKKQ  V  S+ EAEY +  N  ++L W+ ++L E G        +Y DN+ A  +  N V HSR KHI + +HFIR  V
Subjt:  DRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFD-QDTMTLYYDNMSAIDISKNLVQHSRTKHIDIRHHFIRELV

Query:  EDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCR
        +   +++  +S++ QLA+  TKPL   +F+     +GV +
Subjt:  EDKVIKLDRISSNLQLANIFTKPLDASSFEYLRAGLGVCR

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 88.7e-3337.26Show/hide
Query:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT
        ++ + GL   +    P    V  +  + G  VD K YR ++G L+YL  +R DI++AV   +++   PR+ H +AV +IL Y+ GT   G+ YS      
Subjt:  MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPT

Query:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYY-DNMSAIDISKNLVQHSRT
        L  + D  +    D R+ST+G   FLG +LISW SKKQ  VS S+AEAEY A      +++W+     E        TL + DN +AI I+ N V H RT
Subjt:  LVGYCDTDWVGSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYY-DNMSAIDISKNLVQHSRT

Query:  KHIDIRHHFIRE
        KHI+   H +RE
Subjt:  KHIDIRHHFIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein1.0e-0935.06Show/hide
Query:  LYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCDTDWVGSADDRKSTSG
        +YLT +RPD+ +AV   +++ +  R   ++AV ++L YV GT   G+ YS  +   L  + D+DW    D R+S +G
Subjt:  LYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCDTDWVGSADDRKSTSG

ATMG00810.1 DNA/RNA polymerases superfamily protein1.1e-2440.56Show/hide
Query:  VKLTKDTEGAKV-DHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCDTDWVGSADDRKST
        +KL      AK  D   +RSIVG L YLT +RPDI+YAV I  +   +P +   + +KR+L+YV GT   G+    ++   +  +CD+DW G    R+ST
Subjt:  VKLTKDTEGAKV-DHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCDTDWVGSADDRKST

Query:  SGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIW
        +G   FLG N+ISW +K+Q  VS S+ E EY A      +L W
Subjt:  SGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCAAAAAGTTTGGTTTAGAACAGGCTCGAAATAAGTGGACTCCAGCTGCGACACATGTTAAACTTACAAAAGACACTGAAGGTGCTAAAGTTGATCACAAACTTTA
TAGGAGTATAGTAGGCGGCCTATTATACTTAACAGCAAGTCGACCTGACATAGCTTATGCTGTGGGAATATGTGCTCGTTATCAGACGGATCCCCGCATCACTCACCTAG
AAGCTGTTAAACGAATTCTTAAATACGTTTATGGGACCAGTGACTTTGGAATGATGTATTCCTATGATACCACTCCCACTCTAGTTGGATATTGTGATACTGACTGGGTA
GGTTCAGCTGATGATCGTAAAAGTACGTCTGGAGGTTACTTCTTTTTGGGAAACAATTTAATCTCTTGGTTAAGTAAGAAGCAAAACTGTGTTTCTTTATCTACAGCTGA
AGCTGAATATATAGCTGCAGGTAATGGTTGTACACAATTGATTTGGATGAAAAACATGCTGCATGAATATGGCTTTGATCAGGACACTATGACGTTGTATTATGATAATA
TGAGTGCAATTGATATATCTAAGAATCTCGTTCAACATAGTCGAACTAAGCATATTGATATAAGACACCACTTTATTCGAGAACTAGTTGAAGACAAAGTAATCAAGCTT
GATCGTATTAGTTCAAATTTACAATTAGCCAATATCTTCACTAAACCCCTAGATGCTAGCTCATTCGAATACTTACGTGCTGGTTTAGGTGTGTGTCGCACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTCAAAAAGTTTGGTTTAGAACAGGCTCGAAATAAGTGGACTCCAGCTGCGACACATGTTAAACTTACAAAAGACACTGAAGGTGCTAAAGTTGATCACAAACTTTA
TAGGAGTATAGTAGGCGGCCTATTATACTTAACAGCAAGTCGACCTGACATAGCTTATGCTGTGGGAATATGTGCTCGTTATCAGACGGATCCCCGCATCACTCACCTAG
AAGCTGTTAAACGAATTCTTAAATACGTTTATGGGACCAGTGACTTTGGAATGATGTATTCCTATGATACCACTCCCACTCTAGTTGGATATTGTGATACTGACTGGGTA
GGTTCAGCTGATGATCGTAAAAGTACGTCTGGAGGTTACTTCTTTTTGGGAAACAATTTAATCTCTTGGTTAAGTAAGAAGCAAAACTGTGTTTCTTTATCTACAGCTGA
AGCTGAATATATAGCTGCAGGTAATGGTTGTACACAATTGATTTGGATGAAAAACATGCTGCATGAATATGGCTTTGATCAGGACACTATGACGTTGTATTATGATAATA
TGAGTGCAATTGATATATCTAAGAATCTCGTTCAACATAGTCGAACTAAGCATATTGATATAAGACACCACTTTATTCGAGAACTAGTTGAAGACAAAGTAATCAAGCTT
GATCGTATTAGTTCAAATTTACAATTAGCCAATATCTTCACTAAACCCCTAGATGCTAGCTCATTCGAATACTTACGTGCTGGTTTAGGTGTGTGTCGCACTTAA
Protein sequenceShow/hide protein sequence
MVKKFGLEQARNKWTPAATHVKLTKDTEGAKVDHKLYRSIVGGLLYLTASRPDIAYAVGICARYQTDPRITHLEAVKRILKYVYGTSDFGMMYSYDTTPTLVGYCDTDWV
GSADDRKSTSGGYFFLGNNLISWLSKKQNCVSLSTAEAEYIAAGNGCTQLIWMKNMLHEYGFDQDTMTLYYDNMSAIDISKNLVQHSRTKHIDIRHHFIRELVEDKVIKL
DRISSNLQLANIFTKPLDASSFEYLRAGLGVCRT