; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007534 (gene) of Snake gourd v1 genome

Gene IDTan0007534
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG04:18150410..18151214
RNA-Seq ExpressionTan0007534
SyntenyTan0007534
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.0e-11580.77Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        MKDLGEAQY+LGIQIVRNRKN+TLAMSQ SYIDK+LSRYKMQNSK G LPFRHG+HLSK+QCPKTPQEVEDMR IPY+SAVGSLMY +L TR DICY++G
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV
        IVSRYQSNPG DHWT VK ILKYLRRTRNY LVYG+ DL LTGYTDSDFQ+DKD+RKSTSG              +KQ CIADSTMEAEYVAACEAAKE 
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD
        VWLRKF+TDLEVVPNM+L I L+CDNSGAVANS+EPRSHKRGKH+ERKYHLIREIVHR D
Subjt:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]3.2e-11480.77Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+Q PKTPQEVEDMR IPYASAVGSLMY +L TR DICYA+G
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV
        IVSRYQSNPGLDHWT VK +LKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD
        VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHKRGKH+ERKYHLIREIV R D
Subjt:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa]8.5e-11581.54Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+Q PKTPQEVEDMR IPYASAVGSLMYV+L TR DICYA+G
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV
        IVSRYQSNPGLDHWT VK ILKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD
        VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHKRGKH+ERKYHLIREIV R D
Subjt:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]3.2e-11480.77Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+Q PKTPQEVEDMR IPYASAVGSLMY +L TR DICYA+G
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV
        IVSRYQSNPGLDHWT VK +LKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD
        VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHKRGKH+ERKYHLIREIV R D
Subjt:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD

KAA0061170.1 gag/pol protein [Cucumis melo var. makuwa]6.5e-11581.54Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+Q PKTPQEVEDMR IPYASAVGSLMY +L TR DICYA+G
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV
        IVSRYQSNPGLDHWTTVK ILKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD
        VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHKRGKH+ERKYHLIREIV R D
Subjt:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD

TrEMBL top hitse value%identityAlignment
A0A5A7TKM4 Gag/pol protein4.1e-11581.54Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+Q PKTPQEVEDMR IPYASAVGSLMYV+L TR DICYA+G
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV
        IVSRYQSNPGLDHWT VK ILKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD
        VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHKRGKH+ERKYHLIREIV R D
Subjt:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD

A0A5A7TZD0 Gag/pol protein1.6e-11480.77Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+Q PKTPQEVEDMR IPYASAVGSLMY +L TR DICYA+G
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV
        IVSRYQSNPGLDHWT VK +LKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD
        VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHKRGKH+ERKYHLIREIV R D
Subjt:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD

A0A5A7UYE8 Gag/pol protein1.6e-11480.77Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+Q PKTPQEVEDMR IPYASAVGSLMY +L TR DICYA+G
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV
        IVSRYQSNPGLDHWT VK +LKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD
        VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHKRGKH+ERKYHLIREIV R D
Subjt:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD

A0A5A7V1F5 Gag/pol protein3.2e-11581.54Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        MKDLGEAQYVLGIQI+R+RKN+TLA+SQ +YIDK+L RY MQNSK GLLPFRHGVHLSK+Q PKTPQEVEDMR IPYASAVGSLMY +L TR DICYA+G
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV
        IVSRYQSNPGLDHWTTVK ILKYLRRTR+Y LVYG+ DL LTGYTDSDFQTDKDSRKSTSG              IKQGCIADSTMEAEYVAACEAAKE 
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD
        VWLRKF+ DLEVVPNMNL I L+CDNSGAVANS+EPRSHKRGKH+ERKYHLIREIV R D
Subjt:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD

E2GK51 Gag/pol protein (Fragment)4.9e-11680.77Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        MKDLGEAQY+LGIQIVRNRKN+TLAMSQ SYIDK+LSRYKMQNSK G LPFRHG+HLSK+QCPKTPQEVEDMR IPY+SAVGSLMY +L TR DICY++G
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV
        IVSRYQSNPG DHWT VK ILKYLRRTRNY LVYG+ DL LTGYTDSDFQ+DKD+RKSTSG              +KQ CIADSTMEAEYVAACEAAKE 
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSG--------------IKQGCIADSTMEAEYVAACEAAKEV

Query:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD
        VWLRKF+TDLEVVPNM+L I L+CDNSGAVANS+EPRSHKRGKH+ERKYHLIREIVHR D
Subjt:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIVHRMD

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.9e-3131.94Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVH---LSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICY
        M DL E ++ +GI+I    +   + +SQ++Y+ KILS++ M+N      P    ++   L+ D+   T          P  S +G LMY++L TR D+  
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVH---LSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICY

Query:  AIGIVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLF---LTGYTDSDFQTDKDSRKSTSGI---------------KQGCIADSTMEAEYVAA
        A+ I+SRY S    + W  +K +L+YL+ T +  L++     F   + GY DSD+   +  RKST+G                +Q  +A S+ EAEY+A 
Subjt:  AIGIVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLF---LTGYTDSDFQTDKDSRKSTSGI---------------KQGCIADSTMEAEYVAA

Query:  CEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIV
         EA +E +WL+  +T + +   +   I ++ DN G ++ +  P  HKR KH++ KYH  RE V
Subjt:  CEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIV

P0CV72 Secreted RxLR effector protein 1611.3e-2039.85Show/hide
Query:  MRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVY-GSGDLFLTGYTDSDFQTDKDSRKSTSGI--------
        M+ +PY SAVG++MY+++ TR D+  A+G++S++ S+P   HW  +K +L+YL+ T+ Y L +  +G   L GY+D+D+  D +SR+STSG         
Subjt:  MRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVY-GSGDLFLTGYTDSDFQTDKDSRKSTSGI--------

Query:  ------KQGCIADSTMEAEYVAACEAAKEVVWL
              KQ  +A S+ E EY+A  EA +E VWL
Subjt:  ------KQGCIADSTMEAEYVAACEAAKEVVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.0e-5443.75Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        MKDLG AQ +LG++IVR R +R L +SQ  YI+++L R+ M+N+K    P    + LSK  CP T +E  +M  +PY+SAVGSLMY ++ TR DI +A+G
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSGI--------------KQGCIADSTMEAEYVAACEAAKEV
        +VSR+  NPG +HW  VK IL+YLR T    L +G  D  L GYTD+D   D D+RKS++G                Q C+A ST EAEY+AA E  KE+
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSGI--------------KQGCIADSTMEAEYVAACEAAKEV

Query:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIV
        +WL++F+ +L +         ++CD+  A+  S+    H R KH++ +YH IRE+V
Subjt:  VWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIV

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.3e-2232.3Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        +KD  E  Y LGI+    R    L +SQ  YI  +L+R  M  +K    P      LS     K     E      Y   VGSL Y L +TR DI YA+ 
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGD-LFLTGYTDSDFQTDKDSRKSTSGI--------------KQGCIADSTMEAEYVAACEAAKE
         +S++   P  +H   +K IL+YL  T N+ +    G+ L L  Y+D+D+  DKD   ST+G               KQ  +  S+ EAEY +    + E
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGD-LFLTGYTDSDFQTDKDSRKSTSGI--------------KQGCIADSTMEAEYVAACEAAKE

Query:  VVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIV
        + W+   +T+L +   +     ++CDN GA      P  H R KH+   YH IR  V
Subjt:  VVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.7e-2330.74Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        +K+  +  Y LGI+    R  + L +SQ  Y   +L+R  M  +K    P      L+     K P   E      Y   VGSL Y L +TR D+ YA+ 
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGD-LFLTGYTDSDFQTDKDSRKSTSGI--------------KQGCIADSTMEAEYVAACEAAKE
         +S+Y   P  DHW  +K +L+YL  T ++ +    G+ L L  Y+D+D+  D D   ST+G               KQ  +  S+ EAEY +    + E
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGSGD-LFLTGYTDSDFQTDKDSRKSTSGI--------------KQGCIADSTMEAEYVAACEAAKE

Query:  VVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIV
        + W+   +T+L +   ++    ++CDN GA      P  H R KH+   YH IR  V
Subjt:  VVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIREIV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.8e-2230.2Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG
        ++DLG  +Y LG++I R+     + + Q  Y   +L    +   K   +P    V  S      +  +  D +   Y   +G LMY L  TR DI +A+ 
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGS-GDLFLTGYTDSDFQTDKDSRKSTSGI--------------KQGCIADSTMEAEYVAACEAAKE
         +S++   P L H   V  IL Y++ T    L Y S  ++ L  ++D+ FQ+ KD+R+ST+G               KQ  ++ S+ EAEY A   A  E
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRRTRNYSLVYGS-GDLFLTGYTDSDFQTDKDSRKSTSGI--------------KQGCIADSTMEAEYVAACEAAKE

Query:  VVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIRE
        ++WL +F  +L++   ++    LFCDN+ A+  +     H+R KH+E   H +RE
Subjt:  VVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSREPRSHKRGKHMERKYHLIRE

ATMG00810.1 DNA/RNA polymerases superfamily protein2.4e-1431.22Show/hide
Query:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSK--NGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYA
        MKDLG   Y LGIQI  +     L +SQT Y ++IL+   M + K  +  LP +    +S  + P    +  D R     S VG+L Y+ L TR DI YA
Subjt:  MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSK--NGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYA

Query:  IGIVSRYQSNPGLDHWTTVKAILKYLRRTRNYSL-VYGSGDLFLTGYTDSDFQTDKDSRKSTSGI--------------KQGCIADSTMEAEYVAACEAA
        + IV +    P L  +  +K +L+Y++ T  + L ++ +  L +  + DSD+     +R+ST+G               +Q  ++ S+ E EY A    A
Subjt:  IGIVSRYQSNPGLDHWTTVKAILKYLRRTRNYSL-VYGSGDLFLTGYTDSDFQTDKDSRKSTSGI--------------KQGCIADSTMEAEYVAACEAA

Query:  KEVVW
         E+ W
Subjt:  KEVVW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGATTTGGGAGAAGCTCAGTATGTTCTAGGTATCCAGATTGTCCGGAACCGGAAGAACAGAACGTTGGCCATGTCTCAAACGTCTTATATTGATAAGATATTGTC
TAGATATAAGATGCAGAACTCCAAGAACGGCTTACTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAGTGTCCTAAGACTCCTCAAGAGGTTGAGGATATGAGAT
GTATCCCCTATGCTTCAGCTGTAGGGAGCCTGATGTATGTCCTGTTGTATACTAGGTCTGACATCTGTTATGCAATAGGGATTGTAAGTAGGTATCAATCCAATCCAGGA
TTAGATCACTGGACAACCGTAAAGGCAATCCTCAAGTATCTTAGGAGAACGAGGAACTACAGCTTAGTGTATGGAAGTGGGGATTTGTTCCTTACAGGATACACAGATTC
TGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGGCATCAAGCAGGGATGCATCGCTGATTCCACTATGGAAGCAGAGTACGTTGCGGCTTGTGAAGCTGCAA
AGGAAGTTGTTTGGCTTAGAAAGTTCATAACCGATTTGGAAGTTGTTCCAAATATGAATTTGTCGATCGCACTGTTTTGTGACAACAGTGGTGCAGTAGCCAACTCCAGA
GAGCCTCGGAGCCATAAGAGAGGCAAACACATGGAGCGGAAGTATCACCTAATACGGGAGATTGTGCACCGCATGGACGCGTGGCGATCCACGCGCAGATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGATTTGGGAGAAGCTCAGTATGTTCTAGGTATCCAGATTGTCCGGAACCGGAAGAACAGAACGTTGGCCATGTCTCAAACGTCTTATATTGATAAGATATTGTC
TAGATATAAGATGCAGAACTCCAAGAACGGCTTACTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAGTGTCCTAAGACTCCTCAAGAGGTTGAGGATATGAGAT
GTATCCCCTATGCTTCAGCTGTAGGGAGCCTGATGTATGTCCTGTTGTATACTAGGTCTGACATCTGTTATGCAATAGGGATTGTAAGTAGGTATCAATCCAATCCAGGA
TTAGATCACTGGACAACCGTAAAGGCAATCCTCAAGTATCTTAGGAGAACGAGGAACTACAGCTTAGTGTATGGAAGTGGGGATTTGTTCCTTACAGGATACACAGATTC
TGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGGCATCAAGCAGGGATGCATCGCTGATTCCACTATGGAAGCAGAGTACGTTGCGGCTTGTGAAGCTGCAA
AGGAAGTTGTTTGGCTTAGAAAGTTCATAACCGATTTGGAAGTTGTTCCAAATATGAATTTGTCGATCGCACTGTTTTGTGACAACAGTGGTGCAGTAGCCAACTCCAGA
GAGCCTCGGAGCCATAAGAGAGGCAAACACATGGAGCGGAAGTATCACCTAATACGGGAGATTGTGCACCGCATGGACGCGTGGCGATCCACGCGCAGATAG
Protein sequenceShow/hide protein sequence
MKDLGEAQYVLGIQIVRNRKNRTLAMSQTSYIDKILSRYKMQNSKNGLLPFRHGVHLSKDQCPKTPQEVEDMRCIPYASAVGSLMYVLLYTRSDICYAIGIVSRYQSNPG
LDHWTTVKAILKYLRRTRNYSLVYGSGDLFLTGYTDSDFQTDKDSRKSTSGIKQGCIADSTMEAEYVAACEAAKEVVWLRKFITDLEVVPNMNLSIALFCDNSGAVANSR
EPRSHKRGKHMERKYHLIREIVHRMDAWRSTRR