CuGenDBv2

Tan0019987 (gene) of Snake gourd v1 genome

Gene ID: Tan0019987
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG04:59249522..59250508
RNA-Seq Expression: Tan0019987
Synteny: Tan0019987
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
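
The genome location in this record follows a `SEQID:START..END` pattern. As a minimal sketch of how such a string could be split and the span length computed, the following Python snippet may help. The function names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption that the record itself does not state.

```python
import re

def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumes 1-based, inclusive coordinates (an assumption, not confirmed by the record).
    return end - start + 1

seqid, start, end = parse_location("LG04:59249522..59250508")
print(seqid, start, end, span_length(start, end))
```

Under that assumption, the gene spans 987 positions on LG04.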


Homology
GenBank top hits (columns: e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]; e value: 1.1e-121; identity: 72.64%
Query:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL
        M QP+GFIAQ QEQ+VC               W  RFD AI SYGF+QNV+EPCVYKKIVN+ VAFLIL VDDILLIG +V  LTD+K+WL +Q+QMKDL
Subjt:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL

Query:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR
        GEAQY L IQIVRNRKN+TLAMS+ASYI+K+LSRYKMQNSKKG LPFRHG+HLSK+QCPKTP+EVEDMR IPY+SA+ S+MY MLCTR +ICY+VGIV+R
Subjt:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR

Query:  YQSNPGLDHWAAVKAIL-------NGDLIL---NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR
        YQSNPG DHW AVK IL       N  L+    +LILTGYTD DFQ+DKD+RKSTSGS F LN GAVVWRS KQ CIADSTMEAEYVAACEAAKEAVWLR
Subjt:  YQSNPGLDHWAAVKAIL-------NGDLIL---NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR

Query:  KMMLDLEVVPNMNLPITL
        K + DLEVVPNM+LPITL
Subjt:  KMMLDLEVVPNMNLPITL

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 5.3e-124; identity: 73.27%
Query:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL
        M QP+GFI QGQEQ+VC               W  RFD AI SYGFDQNV+EPCVYKKI    VAFL+L VDDILLIG +VG LTD+K WLA+Q+QMKDL
Subjt:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL

Query:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR
        GEAQY L IQI+R+RKN+TLA+S+A+YI+K+L RY MQNSKKGLLPFRHGVHLSK+Q PKTP+EVEDMRRIPYASA+ S+MY MLCTR +ICYAVGIV+R
Subjt:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR

Query:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR
        YQSNPGLDHW AVK +L       D +L     +LILTGYTD DFQTDKDSRKSTSGS F LN GAVVWRS KQGCIADSTMEAEYVAACEAAKEAVWLR
Subjt:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR

Query:  KMMLDLEVVPNMNLPITL
        K + DLEVVPNMNLPITL
Subjt:  KMMLDLEVVPNMNLPITL

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 8.5e-122; identity: 72.33%
Query:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL
        M QP+GFI QGQEQ+VC               W  RFD AI SYGFDQNV+EPCVYKKI    VAFL+L VDDILLIG +VG LTD+K WLA+Q+QMKDL
Subjt:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL

Query:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR
        GE QY L IQI+R+RKN+TLA+S+A+YI+K+L RY MQNSKKGLLPFRHGVHLSK+Q PKTP+EVEDMRRIPYASA+ S+MY MLCTR +ICYAVGIV+R
Subjt:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR

Query:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR
        YQSNPGLDHW AVK IL       D +L     +LILTGYT+ DFQTDKDSRKSTS S F LN GAVVWRS KQGCIADSTMEAEYVAACEAAKEAVWL+
Subjt:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR

Query:  KMMLDLEVVPNMNLPITL
        K + DLEVVPNMNLPITL
Subjt:  KMMLDLEVVPNMNLPITL

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 5.3e-124; identity: 73.27%
Query:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL
        M QP+GFI QGQEQ+VC               W  RFD AI SYGFDQNV+EPCVYKKI    VAFL+L VDDILLIG +VG LTD+K WLA+Q+QMKDL
Subjt:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL

Query:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR
        GEAQY L IQI+R+RKN+TLA+S+A+YI+K+L RY MQNSKKGLLPFRHGVHLSK+Q PKTP+EVEDMRRIPYASA+ S+MY MLCTR +ICYAVGIV+R
Subjt:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR

Query:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR
        YQSNPGLDHW AVK +L       D +L     +LILTGYTD DFQTDKDSRKSTSGS F LN GAVVWRS KQGCIADSTMEAEYVAACEAAKEAVWLR
Subjt:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR

Query:  KMMLDLEVVPNMNLPITL
        K + DLEVVPNMNLPITL
Subjt:  KMMLDLEVVPNMNLPITL

KAA0061170.1 gag/pol protein [Cucumis melo var. makuwa]; e value: 2.2e-122; identity: 72.64%
Query:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL
        M QP+GFI Q QEQ+VC               W  RFD AI SYGFDQNV+EPCVYKKI    VAFL+L VDDILLIG + G LTD+K WLA+Q+QMKDL
Subjt:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL

Query:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR
        GEAQY L IQI+R+RKN+TLA+S+A+YI+K+L RY MQNSKKGLLPFRHGVHLSK+Q PKTP+EVEDMRRIPYASA+ S+MY MLCTR +ICYAVGIV+R
Subjt:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR

Query:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR
        YQSNPGLDHW  VK IL       D +L     +LILTGYTD DFQTDKDSRKSTSGS F LN GAVVWRS KQGCIADSTMEAEYVAACEAAKEAVWLR
Subjt:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR

Query:  KMMLDLEVVPNMNLPITL
        K + DLEVVPNMNLPITL
Subjt:  KMMLDLEVVPNMNLPITL

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7T2V9 Gag/pol protein; e value: 4.1e-122; identity: 72.33%
Query:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL
        M QP+GFI QGQEQ+VC               W  RFD AI SYGFDQNV+EPCVYKKI    VAFL+L VDDILLIG +VG LTD+K WLA+Q+QMKDL
Subjt:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL

Query:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR
        GE QY L IQI+R+RKN+TLA+S+A+YI+K+L RY MQNSKKGLLPFRHGVHLSK+Q PKTP+EVEDMRRIPYASA+ S+MY MLCTR +ICYAVGIV+R
Subjt:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR

Query:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR
        YQSNPGLDHW AVK IL       D +L     +LILTGYT+ DFQTDKDSRKSTS S F LN GAVVWRS KQGCIADSTMEAEYVAACEAAKEAVWL+
Subjt:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR

Query:  KMMLDLEVVPNMNLPITL
        K + DLEVVPNMNLPITL
Subjt:  KMMLDLEVVPNMNLPITL

A0A5A7TZD0 Gag/pol protein; e value: 2.6e-124; identity: 73.27%
Query:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL
        M QP+GFI QGQEQ+VC               W  RFD AI SYGFDQNV+EPCVYKKI    VAFL+L VDDILLIG +VG LTD+K WLA+Q+QMKDL
Subjt:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL

Query:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR
        GEAQY L IQI+R+RKN+TLA+S+A+YI+K+L RY MQNSKKGLLPFRHGVHLSK+Q PKTP+EVEDMRRIPYASA+ S+MY MLCTR +ICYAVGIV+R
Subjt:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR

Query:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR
        YQSNPGLDHW AVK +L       D +L     +LILTGYTD DFQTDKDSRKSTSGS F LN GAVVWRS KQGCIADSTMEAEYVAACEAAKEAVWLR
Subjt:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR

Query:  KMMLDLEVVPNMNLPITL
        K + DLEVVPNMNLPITL
Subjt:  KMMLDLEVVPNMNLPITL

A0A5A7UYE8 Gag/pol protein; e value: 2.6e-124; identity: 73.27%
Query:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL
        M QP+GFI QGQEQ+VC               W  RFD AI SYGFDQNV+EPCVYKKI    VAFL+L VDDILLIG +VG LTD+K WLA+Q+QMKDL
Subjt:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL

Query:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR
        GEAQY L IQI+R+RKN+TLA+S+A+YI+K+L RY MQNSKKGLLPFRHGVHLSK+Q PKTP+EVEDMRRIPYASA+ S+MY MLCTR +ICYAVGIV+R
Subjt:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR

Query:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR
        YQSNPGLDHW AVK +L       D +L     +LILTGYTD DFQTDKDSRKSTSGS F LN GAVVWRS KQGCIADSTMEAEYVAACEAAKEAVWLR
Subjt:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR

Query:  KMMLDLEVVPNMNLPITL
        K + DLEVVPNMNLPITL
Subjt:  KMMLDLEVVPNMNLPITL

A0A5A7V1F5 Gag/pol protein; e value: 1.1e-122; identity: 72.64%
Query:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL
        M QP+GFI Q QEQ+VC               W  RFD AI SYGFDQNV+EPCVYKKI    VAFL+L VDDILLIG + G LTD+K WLA+Q+QMKDL
Subjt:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL

Query:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR
        GEAQY L IQI+R+RKN+TLA+S+A+YI+K+L RY MQNSKKGLLPFRHGVHLSK+Q PKTP+EVEDMRRIPYASA+ S+MY MLCTR +ICYAVGIV+R
Subjt:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR

Query:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR
        YQSNPGLDHW  VK IL       D +L     +LILTGYTD DFQTDKDSRKSTSGS F LN GAVVWRS KQGCIADSTMEAEYVAACEAAKEAVWLR
Subjt:  YQSNPGLDHWAAVKAIL-----NGDLIL-----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR

Query:  KMMLDLEVVPNMNLPITL
        K + DLEVVPNMNLPITL
Subjt:  KMMLDLEVVPNMNLPITL

E2GK51 Gag/pol protein (Fragment); e value: 5.4e-122; identity: 72.64%
Query:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL
        M QP+GFIAQ QEQ+VC               W  RFD AI SYGF+QNV+EPCVYKKIVN+ VAFLIL VDDILLIG +V  LTD+K+WL +Q+QMKDL
Subjt:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL

Query:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR
        GEAQY L IQIVRNRKN+TLAMS+ASYI+K+LSRYKMQNSKKG LPFRHG+HLSK+QCPKTP+EVEDMR IPY+SA+ S+MY MLCTR +ICY+VGIV+R
Subjt:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR

Query:  YQSNPGLDHWAAVKAIL-------NGDLIL---NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR
        YQSNPG DHW AVK IL       N  L+    +LILTGYTD DFQ+DKD+RKSTSGS F LN GAVVWRS KQ CIADSTMEAEYVAACEAAKEAVWLR
Subjt:  YQSNPGLDHWAAVKAIL-------NGDLIL---NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLR

Query:  KMMLDLEVVPNMNLPITL
        K + DLEVVPNM+LPITL
Subjt:  KMMLDLEVVPNMNLPITL

SwissProt top hits (columns: e value, % identity, alignment)
P04146 Copia protein; e value: 4.8e-27; identity: 27.53%
Query:  CWLRRFDEAINSYGFDQNVNEPCVY---KKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDLGEAQYTLSIQIVRNRKNRTLAMSKASYI
        CW   F++A+    F  +  + C+Y   K  +N  + +++L VDD+++   ++  + + K +L  +++M DL E ++ + I+I    +   + +S+++Y+
Subjt:  CWLRRFDEAINSYGFDQNVNEPCVY---KKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDLGEAQYTLSIQIVRNRKNRTLAMSKASYI

Query:  NKMLSRYKMQNSKKGLLPFRHGVH---LSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTRYQSNPGLDHWAAVKAI---LNGDLI
         K+LS++ M+N      P    ++   L+ D+   T          P  S I  +MY+MLCTR ++  AV I++RY S    + W  +K +   L G + 
Subjt:  NKMLSRYKMQNSKKGLLPFRHGVH---LSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTRYQSNPGLDHWAAVKAI---LNGDLI

Query:  LNLI----------LTGYTDYDFQTDKDSRKSTSGSAF-ILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLRKMMLDLEV
        + LI          + GY D D+   +  RKST+G  F + +   + W + +Q  +A S+ EAEY+A  EA +EA+WL+ ++  + +
Subjt:  LNLI----------LTGYTDYDFQTDKDSRKSTSGSAF-ILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLRKMMLDLEV

P0CV72 Secreted RxLR effector protein 161; e value: 1.2e-20; identity: 41.35%
Query:  MRRIPYASAIRSMMYVMLCTRHNICYAVGIVTRYQSNPGLDHWAAVKAILNGDLILNLI-----------LTGYTDYDFQTDKDSRKSTSGSAFILNAGA
        M+ +PY SA+ ++MY+M+ TR ++  AVG+++++ S+P   HW A+K +L                    L GY+D D+  D +SR+STSG  F LN G 
Subjt:  MRRIPYASAIRSMMYVMLCTRHNICYAVGIVTRYQSNPGLDHWAAVKAILNGDLILNLI-----------LTGYTDYDFQTDKDSRKSTSGSAFILNAGA

Query:  VVWRSFKQGCIADSTMEAEYVAACEAAKEAVWL
        V WRS KQ  +A S+ E EY+A  EA +EAVWL
Subjt:  VVWRSFKQGCIADSTMEAEYVAACEAAKEAVWL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value: 2.1e-59; identity: 40.39%
Query:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVY-KKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKD
        M+QP+GF   G++  VC               W  +FD  + S  + +  ++PCVY K+   N    L+L VDD+L++G + GL+  +K  L+  + MKD
Subjt:  MDQPKGFIAQGQEQRVC---------------WLRRFDEAINSYGFDQNVNEPCVY-KKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKD

Query:  LGEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVT
        LG AQ  L ++IVR R +R L +S+  YI ++L R+ M+N+K    P    + LSK  CP T  E  +M ++PY+SA+ S+MY M+CTR +I +AVG+V+
Subjt:  LGEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVT

Query:  RYQSNPGLDHWAAVKAIL------NGDLIL----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWL
        R+  NPG +HW AVK IL       GD +     + IL GYTD D   D D+RKS++G  F  + GA+ W+S  Q C+A ST EAEY+AA E  KE +WL
Subjt:  RYQSNPGLDHWAAVKAIL------NGDLIL----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWL

Query:  RKMMLDL
        ++ + +L
Subjt:  RKMMLDL

P92519 Uncharacterized mitochondrial protein AtMg00810; e value: 4.4e-12; identity: 29.24%
Query:  FLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDLGEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSK--KGLLPFRHGVHLSKDQCPKTPR
        +L+L VDDILL G    LL  +   L+S + MKDLG   Y L IQI  +     L +S+  Y  ++L+   M + K     LP +    +S  + P    
Subjt:  FLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDLGEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSK--KGLLPFRHGVHLSKDQCPKTPR

Query:  EVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTRYQSNPGLDHWAAVKAIL---NGDLILNLIL--------TGYTDYDFQTDKDSRKSTSGSAFIL
        +  D R     S + ++ Y+ L TR +I YAV IV +    P L  +  +K +L    G +   L +          + D D+     +R+ST+G    L
Subjt:  EVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTRYQSNPGLDHWAAVKAIL---NGDLILNLIL--------TGYTDYDFQTDKDSRKSTSGSAFIL

Query:  NAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVW
            + W + +Q  ++ S+ E EY A    A E  W
Subjt:  NAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVW

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value: 8.2e-19; identity: 27.51%
Query:  MDQPKGFIAQGQEQRVCWLRR------------FDEAIN---SYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL
        M QP GFI + +   VC LR+            + E  N   + GF  +V++  ++      ++ ++++ VDDIL+ G +  LL +  + L+ ++ +KD 
Subjt:  MDQPKGFIAQGQEQRVCWLRR------------FDEAIN---SYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDL

Query:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR
         E  Y L I+    R    L +S+  YI  +L+R  M  +K    P      LS     K     E      Y   + S+ Y+   TR +I YAV  +++
Subjt:  GEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTR

Query:  YQSNPGLDHWAAVKAIL-------NGDLIL----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWL
        +   P  +H  A+K IL       N  + L     L L  Y+D D+  DKD   ST+G    L    + W S KQ  +  S+ EAEY +    + E  W+
Subjt:  YQSNPGLDHWAAVKAIL-------NGDLIL----NLILTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWL

Query:  RKMMLDLEV
          ++ +L +
Subjt:  RKMMLDLEV

Arabidopsis top hits (columns: e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8; e value: 1.7e-19; identity: 26.35%
Query:  WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDLGEAQYTLSIQIVRNRKNRTLAMSKASYINKML
        W  +F   +  +GF Q+ ++   + KI       +++ VDDI++       + ++K  L S ++++DLG  +Y L ++I R+     +   K  Y   +L
Subjt:  WLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDLGEAQYTLSIQIVRNRKNRTLAMSKASYINKML

Query:  SRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTRYQSNPGLDHWAAVKAILN---GDLILNLI---
            +   K   +P    V  S      +  +  D +   Y   I  +MY+ + TR +I +AV  ++++   P L H  AV  IL+   G +   L    
Subjt:  SRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTRYQSNPGLDHWAAVKAILN---GDLILNLI---

Query:  -----LTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLRKMMLDLEV
             L  ++D  FQ+ KD+R+ST+G    L    + W+S KQ  ++ S+ EAEY A   A  E +WL +   +L++
Subjt:  -----LTGYTDYDFQTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLRKMMLDLEV

ATMG00810.1 DNA/RNA polymerases superfamily protein; e value: 3.1e-13; identity: 29.24%
Query:  FLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDLGEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSK--KGLLPFRHGVHLSKDQCPKTPR
        +L+L VDDILL G    LL  +   L+S + MKDLG   Y L IQI  +     L +S+  Y  ++L+   M + K     LP +    +S  + P    
Subjt:  FLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDLGEAQYTLSIQIVRNRKNRTLAMSKASYINKMLSRYKMQNSK--KGLLPFRHGVHLSKDQCPKTPR

Query:  EVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTRYQSNPGLDHWAAVKAIL---NGDLILNLIL--------TGYTDYDFQTDKDSRKSTSGSAFIL
        +  D R     S + ++ Y+ L TR +I YAV IV +    P L  +  +K +L    G +   L +          + D D+     +R+ST+G    L
Subjt:  EVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTRYQSNPGLDHWAAVKAIL---NGDLILNLIL--------TGYTDYDFQTDKDSRKSTSGSAFIL

Query:  NAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVW
            + W + +Q  ++ S+ E EY A    A E  W
Subjt:  NAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVW


Sequences
CDS sequence
ATGGACCAACCCAAGGGGTTCATTGCCCAAGGCCAAGAGCAAAGAGTTTGCTGGCTTCGAAGGTTTGATGAGGCGATCAATTCTTATGGCTTTGATCAAAATGTTAACGA
GCCTTGTGTCTACAAGAAAATCGTTAACAACACTGTCGCATTTCTAATATTGTGTGTGGATGATATCCTTCTCATTGGGATTGAGGTAGGACTTCTTACTGACATTAAGG
AATGGTTGGCTTCGCAATACCAAATGAAAGATTTGGGAGAGGCACAGTATACTCTAAGTATCCAGATAGTCCGAAACCGGAAGAATAGAACGCTAGCCATGTCTAAGGCG
TCTTATATTAACAAGATGTTGTCTAGATATAAGATGCAGAACTCCAAGAAGGGTTTGCTGCCTTTCAGGCATGGGGTTCACTTGTCTAAGGATCAATGTCCTAAGACACC
TCGAGAGGTTGAGGATATGAGACGAATCCCCTATGCTTCAGCTATAAGGAGCATGATGTATGTCATGTTGTGTACTAGGCACAACATCTGTTATGCAGTTGGGATTGTCA
CTAGGTATCAATCCAATCCAGGATTAGATCACTGGGCAGCCGTAAAGGCAATCCTCAATGGGGATTTGATCCTTAATTTGATCCTTACGGGATACACAGATTATGACTTT
CAGACCGATAAGGATTCTAGGAAATCCACTTCGGGGTCAGCCTTCATTCTAAATGCAGGAGCTGTAGTGTGGCGAAGCTTCAAGCAGGGATGTATCGCTGATTCCACGAT
GGAAGCCGAATATGTTGCGGCTTGTGAAGCTGCAAAGGAAGCTGTTTGGCTTAGGAAGATGATGCTAGATTTGGAAGTTGTTCCAAATATGAACTTGCCAATCACGTTGG
TGACAACAGTGGTGCAGTAG
mRNA sequence
ATGGACCAACCCAAGGGGTTCATTGCCCAAGGCCAAGAGCAAAGAGTTTGCTGGCTTCGAAGGTTTGATGAGGCGATCAATTCTTATGGCTTTGATCAAAATGTTAACGA
GCCTTGTGTCTACAAGAAAATCGTTAACAACACTGTCGCATTTCTAATATTGTGTGTGGATGATATCCTTCTCATTGGGATTGAGGTAGGACTTCTTACTGACATTAAGG
AATGGTTGGCTTCGCAATACCAAATGAAAGATTTGGGAGAGGCACAGTATACTCTAAGTATCCAGATAGTCCGAAACCGGAAGAATAGAACGCTAGCCATGTCTAAGGCG
TCTTATATTAACAAGATGTTGTCTAGATATAAGATGCAGAACTCCAAGAAGGGTTTGCTGCCTTTCAGGCATGGGGTTCACTTGTCTAAGGATCAATGTCCTAAGACACC
TCGAGAGGTTGAGGATATGAGACGAATCCCCTATGCTTCAGCTATAAGGAGCATGATGTATGTCATGTTGTGTACTAGGCACAACATCTGTTATGCAGTTGGGATTGTCA
CTAGGTATCAATCCAATCCAGGATTAGATCACTGGGCAGCCGTAAAGGCAATCCTCAATGGGGATTTGATCCTTAATTTGATCCTTACGGGATACACAGATTATGACTTT
CAGACCGATAAGGATTCTAGGAAATCCACTTCGGGGTCAGCCTTCATTCTAAATGCAGGAGCTGTAGTGTGGCGAAGCTTCAAGCAGGGATGTATCGCTGATTCCACGAT
GGAAGCCGAATATGTTGCGGCTTGTGAAGCTGCAAAGGAAGCTGTTTGGCTTAGGAAGATGATGCTAGATTTGGAAGTTGTTCCAAATATGAACTTGCCAATCACGTTGG
TGACAACAGTGGTGCAGTAG
Protein sequence
MDQPKGFIAQGQEQRVCWLRRFDEAINSYGFDQNVNEPCVYKKIVNNTVAFLILCVDDILLIGIEVGLLTDIKEWLASQYQMKDLGEAQYTLSIQIVRNRKNRTLAMSKA
SYINKMLSRYKMQNSKKGLLPFRHGVHLSKDQCPKTPREVEDMRRIPYASAIRSMMYVMLCTRHNICYAVGIVTRYQSNPGLDHWAAVKAILNGDLILNLILTGYTDYDF
QTDKDSRKSTSGSAFILNAGAVVWRSFKQGCIADSTMEAEYVAACEAAKEAVWLRKMMLDLEVVPNMNLPITLVTTVVQ
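
The protein sequence above should correspond to a translation of the CDS. As a hedged illustration of how that relationship could be checked, the Python sketch below translates a nucleotide string with the standard genetic code. The helper name `translate` and the choice to stop at the first stop codon are assumptions made for this example, not part of the database record.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 51 nucleotides of the CDS listed in this record.
print(translate("ATGGACCAACCCAAGGGGTTCATTGCCCAAGGCCAAGAGCAAAGAGTTTGC"))
# MDQPKGFIAQGQEQRVC
```

Passing the full CDS, with its line breaks left in, to `translate` allows a direct comparison against the protein sequence listed in the record.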