; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004450 (gene) of Snake gourd v1 genome

Gene IDTan0004450
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG06:41888037..41888937
RNA-Seq ExpressionTan0004450
SyntenyTan0004450
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]5.3e-11574.24Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------
        MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+Q PK PQEVEDMRRIPYAS          C  PD      
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------

Query:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA
                           +LKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LNGG VVWRSIKQGCI DSTMEA+YVAACEAAKEA
Subjt:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA

Query:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR
        VWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+KR KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Subjt:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]8.4e-11373.22Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------
        MKDLGE QYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+Q PK PQEVEDMRRIPYAS          C  PD      
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------

Query:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA
                           ILKYLRRTR+Y LVYG+ DLILT YT+SDFQTDKDSRKSTS S F LNGG VVWRSIKQGCI DSTMEA+YVAACEAAKEA
Subjt:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA

Query:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR
        VWL+KF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+KR KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Subjt:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR

KAA0042496.1 gag/pol protein [Cucumis melo var. makuwa]4.0e-11574.58Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------
        MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+Q PK PQEVEDMRRIPYAS          C  PD      
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------

Query:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA
                           ILKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LNGG VVWRSIKQGCI DSTMEA+YVAACEAAKEA
Subjt:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA

Query:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR
        VWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+KR KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Subjt:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]5.3e-11574.24Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------
        MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+Q PK PQEVEDMRRIPYAS          C  PD      
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------

Query:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA
                           +LKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LNGG VVWRSIKQGCI DSTMEA+YVAACEAAKEA
Subjt:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA

Query:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR
        VWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+KR KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Subjt:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR

KAA0061170.1 gag/pol protein [Cucumis melo var. makuwa]4.4e-11474.24Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------
        MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+Q PK PQEVEDMRRIPYAS          C  PD      
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------

Query:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA
                           ILKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LN G VVWRSIKQGCI DSTMEA+YVAACEAAKEA
Subjt:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA

Query:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR
        VWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+KR KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Subjt:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein4.1e-11373.22Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------
        MKDLGE QYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+Q PK PQEVEDMRRIPYAS          C  PD      
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------

Query:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA
                           ILKYLRRTR+Y LVYG+ DLILT YT+SDFQTDKDSRKSTS S F LNGG VVWRSIKQGCI DSTMEA+YVAACEAAKEA
Subjt:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA

Query:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR
        VWL+KF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+KR KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Subjt:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR

A0A5A7TKM4 Gag/pol protein1.9e-11574.58Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------
        MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+Q PK PQEVEDMRRIPYAS          C  PD      
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------

Query:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA
                           ILKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LNGG VVWRSIKQGCI DSTMEA+YVAACEAAKEA
Subjt:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA

Query:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR
        VWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+KR KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Subjt:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR

A0A5A7TZD0 Gag/pol protein2.5e-11574.24Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------
        MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+Q PK PQEVEDMRRIPYAS          C  PD      
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------

Query:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA
                           +LKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LNGG VVWRSIKQGCI DSTMEA+YVAACEAAKEA
Subjt:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA

Query:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR
        VWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+KR KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Subjt:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR

A0A5A7UYE8 Gag/pol protein2.5e-11574.24Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------
        MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+Q PK PQEVEDMRRIPYAS          C  PD      
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------

Query:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA
                           +LKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LNGG VVWRSIKQGCI DSTMEA+YVAACEAAKEA
Subjt:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA

Query:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR
        VWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+KR KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Subjt:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR

A0A5A7V1F5 Gag/pol protein2.2e-11474.24Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------
        MKDLGEAQYVLGIQI+ +RKN+ LA+SQA+YIDK+L RY MQNSKKGLLPFRHGVHLSK+Q PK PQEVEDMRRIPYAS          C  PD      
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPD------

Query:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA
                           ILKYLRRTR+Y LVYG+ DLILT YTDSDFQTDKDSRKSTSGS F LN G VVWRSIKQGCI DSTMEA+YVAACEAAKEA
Subjt:  ------------------AILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA

Query:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR
        VWLRKF+ DLEVVPNMNLPITL+CDN+GAVANS+E RS+KR KHIERKYHLIREIV RGDV VT+IASEHN+ DPFTK L AKVFEGHLESLGLR
Subjt:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLR

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.9e-2327.15Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHL----SKDQC-------------------PKMPQEVEDMRRIPY
        M DL E ++ +GI+I    +   + +SQ++Y+ K+LS++ M+N      P    ++     S + C                   P +   V  + R   
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHL----SKDQC-------------------PKMPQEVEDMRRIPY

Query:  ASNC---REPDAILKYLRRTRNYSLVYGSG---DLILTRYTDSDFQTDKDSRKSTSGSAF-ILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEAVWL
         +N    +    +L+YL+ T +  L++      +  +  Y DSD+   +  RKST+G  F + +   + W + +Q  +  S+ EA+Y+A  EA +EA+WL
Subjt:  ASNC---REPDAILKYLRRTRNYSLVYGSG---DLILTRYTDSDFQTDKDSRKSTSGSAF-ILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEAVWL

Query:  RKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGL
        +  +T + +   +  PI ++ DN G ++ +     +KR KHI+ KYH  RE V    + +  I +E+ + D FTK L A  F    + LGL
Subjt:  RKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGL

P0CV72 Secreted RxLR effector protein 1611.3e-1247.06Show/hide
Query:  ILKYLRRTRNYSLVY-GSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEAVWL
        +L+YL+ T+ Y L +  +G   L  Y+D+D+  D +SR+STSG  F LNGG V WRS KQ  +  S+ E +Y+A  EA +EAVWL
Subjt:  ILKYLRRTRNYSLVY-GSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-4235.03Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPDA-----
        MKDLG AQ +LG++IV  R +R L +SQ  YI+++L R+ M+N+K    P    + LSK  CP   +E  +M ++PY+S          C  PD      
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASN---------CREPDA-----

Query:  -------------------ILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA
                           IL+YLR T    L +G  D IL  YTD+D   D D+RKS++G  F  +GG + W+S  Q C+  ST EA+Y+AA E  KE 
Subjt:  -------------------ILKYLRRTRNYSLVYGSGDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEA

Query:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGL
        +WL++F+ +L +         ++CD+  A+  S+    + R KHI+ +YH IRE+V    + V +I++  N  D  TK +    FE   E +G+
Subjt:  VWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGL

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.2e-2228.33Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVE--------------------------DMRR
        +KD  E  Y LGI+    R    L +SQ  YI  +L+R  M  +K    P      LS     K+    E                              
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVE--------------------------DMRR

Query:  IPYASNCREPDAILKYLRRTRNYSLVYGSGD-LILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEAVWLRKF
        +P   + +    IL+YL  T N+ +    G+ L L  Y+D+D+  DKD   ST+G    L   P+ W S KQ  +V S+ EA+Y +    + E  W+   
Subjt:  IPYASNCREPDAILKYLRRTRNYSLVYGSGD-LILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEAVWLRKF

Query:  MTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGL-RVPP
        +T+L +   +  P  ++CDN GA         + R KHI   YH IR  V  G + V  +++   + D  TK L    F+     +G+ RVPP
Subjt:  MTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGL-RVPP

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-1925.94Show/hide
Query:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVE--------------------------DMRR
        +K+  +  Y LGI+    R  + L +SQ  Y   +L+R  M  +K    P      L+     K+P   E                              
Subjt:  MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVE--------------------------DMRR

Query:  IPYASNCREPDAILKYLRRTRNYSLVYGSGD-LILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEAVWLRKF
        +P   +      +L+YL  T ++ +    G+ L L  Y+D+D+  D D   ST+G    L   P+ W S KQ  +V S+ EA+Y +    + E  W+   
Subjt:  IPYASNCREPDAILKYLRRTRNYSLVYGSGD-LILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEAVWLRKF

Query:  MTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLG-LRVPP
        +T+L +   ++ P  ++CDN GA         + R KHI   YH IR  V  G + V  +++   + D  TK L    F+     +G ++VPP
Subjt:  MTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIREIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLG-LRVPP

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.1e-1736.03Show/hide
Query:  ILKYLRRTRNYSLVYGS-GDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNL
        IL Y++ T    L Y S  ++ L  ++D+ FQ+ KD+R+ST+G    L    + W+S KQ  +  S+ EA+Y A   A  E +WL +F  +L++   ++ 
Subjt:  ILKYLRRTRNYSLVYGS-GDLILTRYTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNL

Query:  PITLFCDNTGAVANSRELRSNKRDKHIERKYHLIRE
        P  LFCDNT A+  +     ++R KHIE   H +RE
Subjt:  PITLFCDNTGAVANSRELRSNKRDKHIERKYHLIRE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAGATTTGGGAGAGGCACAATATGTTCTAGGTATCCAGATTGTCCCGAACCGCAAGAACAGAATGCTAGCCATGTCTCAGGCATCTTACATTGACAAGATGTTGTC
TAGGTATAAGATGCAGAACTCCAAGAAGGGCTTGCTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAATGTCCTAAGATGCCTCAAGAGGTTGAGGATATGAGAC
GAATCCCCTATGCTTCAAATTGTAGGGAGCCTGATGCAATCCTCAAGTATCTTAGGAGAACGAGGAACTATAGCCTTGTGTATGGGAGTGGGGATTTGATCCTTACGAGA
TACACAGATTCTGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGGGTCAGCCTTCATTCTAAATGGAGGACCTGTAGTGTGGCGAAGCATCAAACAGGGATG
CATCGTTGATTCCACGATGGAAGCCAAGTATGTTGCGGCTTGTGAAGCTGCAAAGGAAGCTGTTTGGCTCAGGAAGTTCATGACGGATTTAGAAGTTGTTCCAAATATGA
ACTTACCGATCACGTTGTTCTGTGACAACACTGGTGCAGTAGCCAACTCGAGAGAACTTCGGAGTAATAAAAGGGACAAGCATATAGAGCGTAAGTATCACTTGATACGG
GAGATTGTGCACCGTGGAGACGTGACAGTCACGCAGATAGCTTCGGAGCACAACGTTGTTGATCCATTTACAAAGGCCCTTATGGCTAAGGTGTTTGAGGGTCACCTAGA
GAGTCTAGGTCTTCGAGTGCCTCCTGACTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAGATTTGGGAGAGGCACAATATGTTCTAGGTATCCAGATTGTCCCGAACCGCAAGAACAGAATGCTAGCCATGTCTCAGGCATCTTACATTGACAAGATGTTGTC
TAGGTATAAGATGCAGAACTCCAAGAAGGGCTTGCTGCCTTTCAGGCATGGGGTTCACCTGTCTAAGGATCAATGTCCTAAGATGCCTCAAGAGGTTGAGGATATGAGAC
GAATCCCCTATGCTTCAAATTGTAGGGAGCCTGATGCAATCCTCAAGTATCTTAGGAGAACGAGGAACTATAGCCTTGTGTATGGGAGTGGGGATTTGATCCTTACGAGA
TACACAGATTCTGACTTTCAGACCGATAAGGATTCTAGGAAATCCACTTCGGGGTCAGCCTTCATTCTAAATGGAGGACCTGTAGTGTGGCGAAGCATCAAACAGGGATG
CATCGTTGATTCCACGATGGAAGCCAAGTATGTTGCGGCTTGTGAAGCTGCAAAGGAAGCTGTTTGGCTCAGGAAGTTCATGACGGATTTAGAAGTTGTTCCAAATATGA
ACTTACCGATCACGTTGTTCTGTGACAACACTGGTGCAGTAGCCAACTCGAGAGAACTTCGGAGTAATAAAAGGGACAAGCATATAGAGCGTAAGTATCACTTGATACGG
GAGATTGTGCACCGTGGAGACGTGACAGTCACGCAGATAGCTTCGGAGCACAACGTTGTTGATCCATTTACAAAGGCCCTTATGGCTAAGGTGTTTGAGGGTCACCTAGA
GAGTCTAGGTCTTCGAGTGCCTCCTGACTAG
Protein sequenceShow/hide protein sequence
MKDLGEAQYVLGIQIVPNRKNRMLAMSQASYIDKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKMPQEVEDMRRIPYASNCREPDAILKYLRRTRNYSLVYGSGDLILTR
YTDSDFQTDKDSRKSTSGSAFILNGGPVVWRSIKQGCIVDSTMEAKYVAACEAAKEAVWLRKFMTDLEVVPNMNLPITLFCDNTGAVANSRELRSNKRDKHIERKYHLIR
EIVHRGDVTVTQIASEHNVVDPFTKALMAKVFEGHLESLGLRVPPD