CuGenDBv2

Tan0018521 (gene) of Snake gourd v1 genome

Gene ID: Tan0018521
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG07:48967936..48968835
RNA-Seq Expression: Tan0018521
Synteny: Tan0018521
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
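The genome location field packs a sequence name and a coordinate range into one string. A minimal Python sketch of splitting it apart is shown here; the helper names `parse_location` and `span_length` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG07:48967936..48968835")
print(seqid, span_length(start, end))  # LG07 900
```

Under that assumption the gene spans 900 bp on LG07; with 0-based, half-open coordinates the same string would describe 899 bp.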


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 2.8e-124; identity: 76.95%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        MKDLGEAQYVLGIQI+ +RKN+TLA+SQA+YIDK+L RY M NSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMYAMLCTRPDICYAVG
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------
        IVSRYQSNPGLDHWT VK +LKYLR TR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTS S F LNGG VVWRSIK                       
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------

Query:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR
            KF+ D EVVPNMNLPITL+C+N+GAVANS+EPRSHKRGKHIERKYHLIREIV  GDV +T+IASEHN+ADPFTK LTAKVFEGHLESLGLR
Subjt:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 1.4e-123; identity: 76.61%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        MKDLGE QYVLGIQI+ +RKN+TLA+SQA+YIDK+L RY M NSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMYAMLCTRPDICYAVG
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSI------------------------
        IVSRYQSNPGLDHWT VK ILKYLR TR+Y LVYG+ DLILT YT+SDFQTDKDSRKSTS S F LNGG VVWRSI                        
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSI------------------------

Query:  ---KKFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR
           KKF+ D EVVPNMNLPITL+C+N+GAVANS+EPRSHKRGKHIERKYHLIREIV  GDV +T+IASEHN+ADPFTK LTAKVFEGHLESLGLR
Subjt:  ---KKFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 6.3e-124; identity: 76.95%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        MKDLGEAQYVLGIQI+ +RKN+TLA+SQA+YIDK+L RY M NSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMY MLCTRPDICYAVG
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------
        IVSRYQSNPGLDHWT VK ILKYLR TR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTS S F LNGG VVWRSIK                       
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------

Query:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR
            KF+ D EVVPNMNLPITL+C+N+GAVANS+EPRSHKRGKHIERKYHLIREIV  GDV +T+IASEHN+ADPFTK LTAKVFEGHLESLGLR
Subjt:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 2.8e-124; identity: 76.95%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        MKDLGEAQYVLGIQI+ +RKN+TLA+SQA+YIDK+L RY M NSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMYAMLCTRPDICYAVG
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------
        IVSRYQSNPGLDHWT VK +LKYLR TR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTS S F LNGG VVWRSIK                       
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------

Query:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR
            KF+ D EVVPNMNLPITL+C+N+GAVANS+EPRSHKRGKHIERKYHLIREIV  GDV +T+IASEHN+ADPFTK LTAKVFEGHLESLGLR
Subjt:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR

KAA0061170.1 gag/pol protein [Cucumis melo var. makuwa] (e-value: 6.3e-124; identity: 77.29%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        MKDLGEAQYVLGIQI+ +RKN+TLA+SQA+YIDK+L RY M NSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMYAMLCTRPDICYAVG
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------
        IVSRYQSNPGLDHWTTVK ILKYLR TR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTS S F LN G VVWRSIK                       
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------

Query:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR
            KF+ D EVVPNMNLPITL+C+N+GAVANS+EPRSHKRGKHIERKYHLIREIV  GDV +T+IASEHN+ADPFTK LTAKVFEGHLESLGLR
Subjt:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7T2V9 Gag/pol protein (e-value: 6.8e-124; identity: 76.61%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        MKDLGE QYVLGIQI+ +RKN+TLA+SQA+YIDK+L RY M NSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMYAMLCTRPDICYAVG
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSI------------------------
        IVSRYQSNPGLDHWT VK ILKYLR TR+Y LVYG+ DLILT YT+SDFQTDKDSRKSTS S F LNGG VVWRSI                        
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSI------------------------

Query:  ---KKFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR
           KKF+ D EVVPNMNLPITL+C+N+GAVANS+EPRSHKRGKHIERKYHLIREIV  GDV +T+IASEHN+ADPFTK LTAKVFEGHLESLGLR
Subjt:  ---KKFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR

A0A5A7TKM4 Gag/pol protein (e-value: 3.1e-124; identity: 76.95%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        MKDLGEAQYVLGIQI+ +RKN+TLA+SQA+YIDK+L RY M NSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMY MLCTRPDICYAVG
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------
        IVSRYQSNPGLDHWT VK ILKYLR TR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTS S F LNGG VVWRSIK                       
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------

Query:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR
            KF+ D EVVPNMNLPITL+C+N+GAVANS+EPRSHKRGKHIERKYHLIREIV  GDV +T+IASEHN+ADPFTK LTAKVFEGHLESLGLR
Subjt:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR

A0A5A7TZD0 Gag/pol protein (e-value: 1.4e-124; identity: 76.95%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        MKDLGEAQYVLGIQI+ +RKN+TLA+SQA+YIDK+L RY M NSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMYAMLCTRPDICYAVG
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------
        IVSRYQSNPGLDHWT VK +LKYLR TR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTS S F LNGG VVWRSIK                       
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------

Query:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR
            KF+ D EVVPNMNLPITL+C+N+GAVANS+EPRSHKRGKHIERKYHLIREIV  GDV +T+IASEHN+ADPFTK LTAKVFEGHLESLGLR
Subjt:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR

A0A5A7UYE8 Gag/pol protein (e-value: 1.4e-124; identity: 76.95%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        MKDLGEAQYVLGIQI+ +RKN+TLA+SQA+YIDK+L RY M NSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMYAMLCTRPDICYAVG
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------
        IVSRYQSNPGLDHWT VK +LKYLR TR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTS S F LNGG VVWRSIK                       
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------

Query:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR
            KF+ D EVVPNMNLPITL+C+N+GAVANS+EPRSHKRGKHIERKYHLIREIV  GDV +T+IASEHN+ADPFTK LTAKVFEGHLESLGLR
Subjt:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR

A0A5A7V1F5 Gag/pol protein (e-value: 3.1e-124; identity: 77.29%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        MKDLGEAQYVLGIQI+ +RKN+TLA+SQA+YIDK+L RY M NSKK LLPFRHGVHLSK+Q PKTPQEVEDMRRIPYAS VGSLMYAMLCTRPDICYAVG
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------
        IVSRYQSNPGLDHWTTVK ILKYLR TR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTS S F LN G VVWRSIK                       
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIK-----------------------

Query:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR
            KF+ D EVVPNMNLPITL+C+N+GAVANS+EPRSHKRGKHIERKYHLIREIV  GDV +T+IASEHN+ADPFTK LTAKVFEGHLESLGLR
Subjt:  ----KFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLR

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein (e-value: 2.5e-30; identity: 30.77%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVH---LSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICY
        M DL E ++ +GI+I    +   + +SQ++Y+ K+LS++ M N      P    ++   L+ D+   T          P  S++G LMY MLCTRPD+  
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVH---LSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICY

Query:  AVGIVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSG---DLILTAYTDSDFQTDKDSRKSTSESAF-ILNGGVVVWRSIK-----------KFMTD
        AV I+SRY S    + W  +K +L+YL+ T +  L++      +  +  Y DSD+   +  RKST+   F + +  ++ W + +           ++M  
Subjt:  AVGIVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSG---DLILTAYTDSDFQTDKDSRKSTSESAF-ILNGGVVVWRSIK-----------KFMTD

Query:  FEVVP------------NMNL--PITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGL
        FE V             N+ L  PI ++ +N G ++ +  P  HKR KHI+ KYH  RE V +  + +  I +E+ +AD FTK L A  F    + LGL
Subjt:  FEVVP------------NMNL--PITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGL

P0CV72 Secreted RxLR effector protein 161 (e-value: 2.8e-21; identity: 45.37%)
Query:  MRRIPYASVVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVY-GSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGV
        M+ +PY S VG++MY M+ TRPD+  AVG++S++ S+P   HW  +K +L+YL+ T+ Y L +  +G   L  Y+D+D+  D +SR+STS   F LNGG 
Subjt:  MRRIPYASVVGSLMYAMLCTRPDICYAVGIVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVY-GSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGV

Query:  VVWRSIKK
        V WRS K+
Subjt:  VVWRSIKK

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e-value: 4.7e-53; identity: 38.44%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        MKDLG AQ +LG++IV  R +R L +SQ  YI+++L R+ M N+K    P    + LSK  CP T +E  +M ++PY+S VGSLMYAM+CTRPDI +AVG
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRS-------------------------
        +VSR+  NPG +HW  VK IL+YLR T    L +G  D IL  YTD+D   D D+RKS++   F  +GG + W+S                         
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRS-------------------------

Query:  --IKKFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGL
          +K+F+ +  +         ++C++  A+  S+    H R KHI+ +YH IRE+V    + + +I++  N AD  TK +    FE   E +G+
Subjt:  --IKKFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e-value: 5.4e-25; identity: 29.97%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        +KD  E  Y LGI+    R    L +SQ  YI  +L+R  M  +K    P      LS     K     E      Y  +VGSL Y +  TRPDI YAV 
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGD-LILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIKK-------FMTDFEVVPN----
         +S++   P  +H   +K IL+YL  T N+ +    G+ L L AY+D+D+  DKD   ST+     L    + W S K+          ++  V N    
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGD-LILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIKK-------FMTDFEVVPN----

Query:  --------------MNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLRVLP
                      +  P  ++C+N GA      P  H R KHI   YH IR  V  G + +  +++   +AD  TK L+   F+     +G+  +P
Subjt:  --------------MNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLRVLP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 (e-value: 2.4e-25; identity: 28.62%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        +K+  +  Y LGI+    R  + L +SQ  Y   +L+R  M  +K    P      L+     K P   E      Y  +VGSL Y +  TRPD+ YAV 
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGD-LILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIKK-------FMTDFEVVPN----
         +S+Y   P  DHW  +K +L+YL  T ++ +    G+ L L AY+D+D+  D D   ST+     L    + W S K+          ++  V N    
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGSGD-LILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIKK-------FMTDFEVVPN----

Query:  --------------MNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLRVLP
                      ++ P  ++C+N GA      P  H R KHI   YH IR  V  G + +  +++   +AD  TK L+   F+     +G+  +P
Subjt:  --------------MNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLRVLP

Arabidopsis top hits (e-value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e-value: 1.3e-18; identity: 27.14%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG
        ++DLG  +Y LG++I   R    + + Q  Y   +L    +   K + +P    V  S      +  +  D +   Y  ++G LMY  + TR DI +AV 
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVG

Query:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGS-GDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIKK----------------FMTD-
         +S++   P L H   V  IL Y++ T    L Y S  ++ L  ++D+ FQ+ KD+R+ST+     L   ++ W+S K+                F TD 
Subjt:  IVSRYQSNPGLDHWTTVKAILKYLRITRNYSLVYGS-GDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIKK----------------FMTD-

Query:  --------FEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALT
                 E+   ++ P  LFC+N  A+  +     H+R KHIE   H +RE   +   T++     ++  D FT+ L+
Subjt:  --------FEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIERKYHLIREIVHHGDVTITQIASEHNVADPFTKALT

ATMG00810.1 DNA/RNA polymerases superfamily protein (e-value: 7.5e-14; identity: 32.39%)
Query:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSK--KALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYA
        MKDLG   Y LGIQI  +     L +SQ  Y +++L+   M + K     LP +    +S  + P    +  D R     S+VG+L Y  L TRPDI YA
Subjt:  MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSK--KALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYA

Query:  VGIVSRYQSNPGLDHWTTVKAILKYLRITRNYSL-VYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVW
        V IV +    P L  +  +K +L+Y++ T  + L ++ +  L + A+ DSD+     +R+ST+     L   ++ W
Subjt:  VGIVSRYQSNPGLDHWTTVKAILKYLRITRNYSL-VYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVW
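
The % identity values in the hit tables above can be related to the alignment rows. In BLAST-style output the middle row shows a letter where query and subject residues are identical, `+` for a conservative substitution, and a space for a mismatch or gap. The Python sketch below counts identical columns over the full alignment length; the function name `percent_identity` is hypothetical, and whether CuGenDB computes its figures exactly this way is an assumption.

```python
def percent_identity(midline_rows):
    """Percent identity from BLAST-style midline rows.

    Letters mark identical columns; '+' and spaces do not count.
    The denominator is the full alignment length, gap columns
    included, so rows must keep their trailing spaces.
    """
    columns = "".join(midline_rows)
    identical = sum(1 for ch in columns if ch.isalpha())
    return 100.0 * identical / len(columns)


# Toy midline: 7 identical columns out of 10.
print(percent_identity(["MK+L E", "AQ Y"]))  # 70.0
```

As a rough consistency check, the first GenBank alignment spans 295 columns (100 + 100 + 95), and 227 identical columns out of 295 would round to the reported 76.95.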


Sequences
CDS sequence
ATGAAAGATTTGGGAGAGGCACAGTATGTTCTAGGTATCCAGATTGTCTGGAACCGAAAGAATAGAACGCTAGCCATGTCTCAAGCGTCTTATATTGACAAGATGTTGTC
TAGATATAAGATGTGGAACTCCAAGAAGGCCTTGCTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAATGTCCTAAGACGCCTCAAGAGGTTGAGGATATGAGAC
GAATCCCCTATGCTTCAGTTGTAGGGAGCCTGATGTATGCCATGTTGTGTACTAGGCCCGACATCTGTTATGCAGTTGGGATTGTCAGTAGGTATCAATCAAATCCAGGA
TTAGATCACTGGACAACCGTAAAGGCAATCCTCAAGTATCTTAGGATAACGAGGAACTACAGCCTTGTGTATGGGAGTGGGGATTTGATCCTTACGGCATACACAGATTC
TGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGAGTCAGCCTTCATTCTAAATGGAGGAGTTGTAGTGTGGCGAAGCATCAAGAAGTTCATGACGGATTTTG
AAGTTGTTCCAAATATGAACTTGCCAATCACACTGTTTTGTGAGAACAATGGTGCAGTAGCCAACTCGAGAGAACCTCGGAGTCATAAAAGGGGCAAGCATATAGAGCGT
AAGTATCACCTGATACGGGAGATTGTGCACCACGGAGACGTGACAATCACGCAGATAGCTTCGGAGCACAACGTTGCTGATCCATTTACAAAGGCCCTCACGGCTAAGGT
TTTTGAGGGTCACCTAGAGAGTCTAGGTCTTCGAGTGCTTCCTAACTAG
mRNA sequence
ATGAAAGATTTGGGAGAGGCACAGTATGTTCTAGGTATCCAGATTGTCTGGAACCGAAAGAATAGAACGCTAGCCATGTCTCAAGCGTCTTATATTGACAAGATGTTGTC
TAGATATAAGATGTGGAACTCCAAGAAGGCCTTGCTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAATGTCCTAAGACGCCTCAAGAGGTTGAGGATATGAGAC
GAATCCCCTATGCTTCAGTTGTAGGGAGCCTGATGTATGCCATGTTGTGTACTAGGCCCGACATCTGTTATGCAGTTGGGATTGTCAGTAGGTATCAATCAAATCCAGGA
TTAGATCACTGGACAACCGTAAAGGCAATCCTCAAGTATCTTAGGATAACGAGGAACTACAGCCTTGTGTATGGGAGTGGGGATTTGATCCTTACGGCATACACAGATTC
TGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGAGTCAGCCTTCATTCTAAATGGAGGAGTTGTAGTGTGGCGAAGCATCAAGAAGTTCATGACGGATTTTG
AAGTTGTTCCAAATATGAACTTGCCAATCACACTGTTTTGTGAGAACAATGGTGCAGTAGCCAACTCGAGAGAACCTCGGAGTCATAAAAGGGGCAAGCATATAGAGCGT
AAGTATCACCTGATACGGGAGATTGTGCACCACGGAGACGTGACAATCACGCAGATAGCTTCGGAGCACAACGTTGCTGATCCATTTACAAAGGCCCTCACGGCTAAGGT
TTTTGAGGGTCACCTAGAGAGTCTAGGTCTTCGAGTGCTTCCTAACTAG
Protein sequence
MKDLGEAQYVLGIQIVWNRKNRTLAMSQASYIDKMLSRYKMWNSKKALLPFRHGVHLSKDQCPKTPQEVEDMRRIPYASVVGSLMYAMLCTRPDICYAVGIVSRYQSNPG
LDHWTTVKAILKYLRITRNYSLVYGSGDLILTAYTDSDFQTDKDSRKSTSESAFILNGGVVVWRSIKKFMTDFEVVPNMNLPITLFCENNGAVANSREPRSHKRGKHIER
KYHLIREIVHHGDVTITQIASEHNVADPFTKALTAKVFEGHLESLGLRVLPN