; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019656 (gene) of Snake gourd v1 genome

Gene IDTan0019656
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG05:26696569..26698422
RNA-Seq ExpressionTan0019656
SyntenyTan0019656
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.1e-20061.26Show/hide
Query:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI
        MVRSMMSY QL  SFW YA+ET + ILN VPSKSV ETP+ELWKGRK SL++FRIWGCPAHVLV NPKKLEP S+LC FVGYPKE+RGGLFY PQENKV 
Subjt:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI

Query:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA
        +STNATF+EEDH RNH+PRSK+VL E     T   D+  +S++V  +A+ S QSH SQ L +PRRSGRVV QP+RYLGL ETQ+II DDGVEDPL+YKQA
Subjt:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA

Query:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE
        M DVD+DQWIKA++LEMES+ FNSVW LVD    V+ IGCKWIYKRKRD  GKVQ FKARLVAKG+TQ+EGVDYEETFS V MLKSIR LLSI  FY+YE
Subjt:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE

Query:  IWQMDVKTAFLNG---------------------------------------------------------------------------------------
        IWQMDVKTAFLNG                                                                                       
Subjt:  IWQMDVKTAFLNG---------------------------------------------------------------------------------------

Query:  --YLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL
          YLTD+K WL TQFQMKDLGEAQY+LGIQI R  KNKTLA+SQASYIDK+LSRY MQNSK+G LPFRHGIHLSKEQCP+TPQEVEDMR IPY+S +GSL
Subjt:  --YLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL

Query:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS
        M                                        RTR+Y LVYG KDLIL GYTDSDFQ+DKD+RKSTSGSVFTLNGGA+VWR +KQ CIADS
Subjt:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS

Query:  TMEAEYVATCEAAKEVV
        TMEAEYVA CEAAKE V
Subjt:  TMEAEYVATCEAAKEVV

KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]2.4e-22267.42Show/hide
Query:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI
        MVRSMMSY QL +SFW YAVET V ILN VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ PQEN+V 
Subjt:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI

Query:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA
        +STNATF+EEDHMRNHKPRSKLVL+EATDE TRVVD+ G SSRVD   +TS QSHPSQSL MPRRSGRVVSQP+RYLGL ETQV+I DDGVEDPLSYKQA
Subjt:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA

Query:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE
        M DVDKDQW+KA+DLEMES+ FNSVWELVD  EGV+ IGCKWIYKRKRD+ GKVQ FKARLVAKG+TQREGVDYEETFS V MLKSIR LLSI  FYDYE
Subjt:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE

Query:  IWQMDVKTAFLN----------------------------------------------------------------------------------------
        IWQMDVKTAFLN                                                                                        
Subjt:  IWQMDVKTAFLN----------------------------------------------------------------------------------------

Query:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL
         GYLTD+K WLA QFQMKDLGEAQYVLGIQI R  KNKTLALSQA+YIDK+L RYSMQNSK+GLLPFRHG+HLSKEQ P+TPQEVEDMRRIPYAS +GSL
Subjt:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL

Query:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS
        M                                        RTRDY LVYG KDLIL GYTDSDFQTDKDSRKSTSGSVFTLNGGA+VWR IKQ CIADS
Subjt:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS

Query:  TMEAEYVATCEAAKEVV
        TMEAEYVA CEAAKE V
Subjt:  TMEAEYVATCEAAKEVV

KAA0033121.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-21465.32Show/hide
Query:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI
        MVRSMMSY QL +SFW YAVET V ILN V SKSVSETPFELW+GRKPSL HF+I GCPAHVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ PQ+N+V+
Subjt:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI

Query:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA
        +STNATF+EEDHMR+HKP++KLVLNEA DE TRVVD+ G SSRV+   +TS QSHPSQSL MPRRSGR+VSQP+RYLGL ETQV+I DDGVEDPLSY QA
Subjt:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA

Query:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE
        M DVDKDQW+KA+DLEMES+ FN +WELVD  EGV+ IGCKWIYKRKRD+ GKVQ FKARLVAKG+TQREGVDYEETFS V MLKSIR LLSI  FYDYE
Subjt:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE

Query:  IWQMDVKTAFLN----------------------------------------------------------------------------------------
        IW+MDV TAFLN                                                                                        
Subjt:  IWQMDVKTAFLN----------------------------------------------------------------------------------------

Query:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL
         GYLTD+K WLA QFQMKDLGEAQYVLGIQI R  KNKTLALSQA+YIDKML RYSMQNSK+GLLPFRHG+HLSKEQCP+TPQEVEDMRRIPYAS +GSL
Subjt:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL

Query:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS
        M                                        RTRDY LVYG KDLIL GYTDSDFQT+KDSRKSTS SVFTLNGGAIVWR IKQ CIADS
Subjt:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS

Query:  TMEAEYVATCEAAKEVV
        TMEAEYVA CEAAKE V
Subjt:  TMEAEYVATCEAAKEVV

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]5.5e-21966.45Show/hide
Query:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI
        MVRSMMSY QL +SFW YAVET V ILN VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ P+EN+V 
Subjt:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI

Query:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA
        +STNATF+EEDHMRNHKPRSKLVL+EATDE TRVVD+ G SSRVD   +TS QSHPSQSL MPRRSGRVVSQP+RYLGL ETQV+I DDGVEDPLSYKQA
Subjt:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA

Query:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE
        M DVDKDQW+KA+DLEMES+ FNSVWELVD  EGV+ IGCKWIYKRKRD+ GKVQ FKARLVAKG+T++EGVDYEETFS V MLKSIR LLSI  FYDYE
Subjt:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE

Query:  IWQMDVKTAFLN----------------------------------------------------------------------------------------
        IWQMDVKTAFLN                                                                                        
Subjt:  IWQMDVKTAFLN----------------------------------------------------------------------------------------

Query:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL
         GYLTD+K WLA QFQMKDLGE QYVLGIQI R  KNKTLALSQA+YIDK+L RYSMQNSK+GLLPFRHG+HLSKEQ P+TPQEVEDMRRIPYAS +GSL
Subjt:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL

Query:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS
        M                                        RTRDY LVYG KDLIL GYT+SDFQTDKDSRKSTS SVFTLNGGA+VWR IKQ CIADS
Subjt:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS

Query:  TMEAEYVATCEAAKEVV
        TMEAEYVA CEAAKE V
Subjt:  TMEAEYVATCEAAKEVV

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]2.4e-22267.42Show/hide
Query:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI
        MVRSMMSY QL +SFW YAVET V ILN VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ PQEN+V 
Subjt:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI

Query:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA
        +STNATF+EEDHMRNHKPRSKLVL+EATDE TRVVD+ G SSRVD   +TS QSHPSQSL MPRRSGRVVSQP+RYLGL ETQV+I DDGVEDPLSYKQA
Subjt:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA

Query:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE
        M DVDKDQW+KA+DLEMES+ FNSVWELVD  EGV+ IGCKWIYKRKRD+ GKVQ FKARLVAKG+TQREGVDYEETFS V MLKSIR LLSI  FYDYE
Subjt:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE

Query:  IWQMDVKTAFLN----------------------------------------------------------------------------------------
        IWQMDVKTAFLN                                                                                        
Subjt:  IWQMDVKTAFLN----------------------------------------------------------------------------------------

Query:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL
         GYLTD+K WLA QFQMKDLGEAQYVLGIQI R  KNKTLALSQA+YIDK+L RYSMQNSK+GLLPFRHG+HLSKEQ P+TPQEVEDMRRIPYAS +GSL
Subjt:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL

Query:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS
        M                                        RTRDY LVYG KDLIL GYTDSDFQTDKDSRKSTSGSVFTLNGGA+VWR IKQ CIADS
Subjt:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS

Query:  TMEAEYVATCEAAKEVV
        TMEAEYVA CEAAKE V
Subjt:  TMEAEYVATCEAAKEVV

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein2.7e-21966.45Show/hide
Query:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI
        MVRSMMSY QL +SFW YAVET V ILN VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ P+EN+V 
Subjt:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI

Query:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA
        +STNATF+EEDHMRNHKPRSKLVL+EATDE TRVVD+ G SSRVD   +TS QSHPSQSL MPRRSGRVVSQP+RYLGL ETQV+I DDGVEDPLSYKQA
Subjt:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA

Query:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE
        M DVDKDQW+KA+DLEMES+ FNSVWELVD  EGV+ IGCKWIYKRKRD+ GKVQ FKARLVAKG+T++EGVDYEETFS V MLKSIR LLSI  FYDYE
Subjt:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE

Query:  IWQMDVKTAFLN----------------------------------------------------------------------------------------
        IWQMDVKTAFLN                                                                                        
Subjt:  IWQMDVKTAFLN----------------------------------------------------------------------------------------

Query:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL
         GYLTD+K WLA QFQMKDLGE QYVLGIQI R  KNKTLALSQA+YIDK+L RYSMQNSK+GLLPFRHG+HLSKEQ P+TPQEVEDMRRIPYAS +GSL
Subjt:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL

Query:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS
        M                                        RTRDY LVYG KDLIL GYT+SDFQTDKDSRKSTS SVFTLNGGA+VWR IKQ CIADS
Subjt:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS

Query:  TMEAEYVATCEAAKEVV
        TMEAEYVA CEAAKE V
Subjt:  TMEAEYVATCEAAKEVV

A0A5A7TZD0 Gag/pol protein1.2e-22267.42Show/hide
Query:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI
        MVRSMMSY QL +SFW YAVET V ILN VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ PQEN+V 
Subjt:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI

Query:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA
        +STNATF+EEDHMRNHKPRSKLVL+EATDE TRVVD+ G SSRVD   +TS QSHPSQSL MPRRSGRVVSQP+RYLGL ETQV+I DDGVEDPLSYKQA
Subjt:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA

Query:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE
        M DVDKDQW+KA+DLEMES+ FNSVWELVD  EGV+ IGCKWIYKRKRD+ GKVQ FKARLVAKG+TQREGVDYEETFS V MLKSIR LLSI  FYDYE
Subjt:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE

Query:  IWQMDVKTAFLN----------------------------------------------------------------------------------------
        IWQMDVKTAFLN                                                                                        
Subjt:  IWQMDVKTAFLN----------------------------------------------------------------------------------------

Query:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL
         GYLTD+K WLA QFQMKDLGEAQYVLGIQI R  KNKTLALSQA+YIDK+L RYSMQNSK+GLLPFRHG+HLSKEQ P+TPQEVEDMRRIPYAS +GSL
Subjt:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL

Query:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS
        M                                        RTRDY LVYG KDLIL GYTDSDFQTDKDSRKSTSGSVFTLNGGA+VWR IKQ CIADS
Subjt:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS

Query:  TMEAEYVATCEAAKEVV
        TMEAEYVA CEAAKE V
Subjt:  TMEAEYVATCEAAKEVV

A0A5A7UYE8 Gag/pol protein1.2e-22267.42Show/hide
Query:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI
        MVRSMMSY QL +SFW YAVET V ILN VPSKSVSETPFELW+GRKPSL HFRIWGCPAHVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ PQEN+V 
Subjt:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI

Query:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA
        +STNATF+EEDHMRNHKPRSKLVL+EATDE TRVVD+ G SSRVD   +TS QSHPSQSL MPRRSGRVVSQP+RYLGL ETQV+I DDGVEDPLSYKQA
Subjt:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA

Query:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE
        M DVDKDQW+KA+DLEMES+ FNSVWELVD  EGV+ IGCKWIYKRKRD+ GKVQ FKARLVAKG+TQREGVDYEETFS V MLKSIR LLSI  FYDYE
Subjt:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE

Query:  IWQMDVKTAFLN----------------------------------------------------------------------------------------
        IWQMDVKTAFLN                                                                                        
Subjt:  IWQMDVKTAFLN----------------------------------------------------------------------------------------

Query:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL
         GYLTD+K WLA QFQMKDLGEAQYVLGIQI R  KNKTLALSQA+YIDK+L RYSMQNSK+GLLPFRHG+HLSKEQ P+TPQEVEDMRRIPYAS +GSL
Subjt:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL

Query:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS
        M                                        RTRDY LVYG KDLIL GYTDSDFQTDKDSRKSTSGSVFTLNGGA+VWR IKQ CIADS
Subjt:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS

Query:  TMEAEYVATCEAAKEVV
        TMEAEYVA CEAAKE V
Subjt:  TMEAEYVATCEAAKEVV

A0A5D3CZY3 Gag/pol protein6.8e-21565.32Show/hide
Query:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI
        MVRSMMSY QL +SFW YAVET V ILN V SKSVSETPFELW+GRKPSL HF+I GCPAHVLVTNPKKLEP SRLCQFVGYPKETRGGLF+ PQ+N+V+
Subjt:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI

Query:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA
        +STNATF+EEDHMR+HKP++KLVLNEA DE TRVVD+ G SSRV+   +TS QSHPSQSL MPRRSGR+VSQP+RYLGL ETQV+I DDGVEDPLSY QA
Subjt:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA

Query:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE
        M DVDKDQW+KA+DLEMES+ FN +WELVD  EGV+ IGCKWIYKRKRD+ GKVQ FKARLVAKG+TQREGVDYEETFS V MLKSIR LLSI  FYDYE
Subjt:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE

Query:  IWQMDVKTAFLN----------------------------------------------------------------------------------------
        IW+MDV TAFLN                                                                                        
Subjt:  IWQMDVKTAFLN----------------------------------------------------------------------------------------

Query:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL
         GYLTD+K WLA QFQMKDLGEAQYVLGIQI R  KNKTLALSQA+YIDKML RYSMQNSK+GLLPFRHG+HLSKEQCP+TPQEVEDMRRIPYAS +GSL
Subjt:  -GYLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL

Query:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS
        M                                        RTRDY LVYG KDLIL GYTDSDFQT+KDSRKSTS SVFTLNGGAIVWR IKQ CIADS
Subjt:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS

Query:  TMEAEYVATCEAAKEVV
        TMEAEYVA CEAAKE V
Subjt:  TMEAEYVATCEAAKEVV

E2GK51 Gag/pol protein (Fragment)5.6e-20161.26Show/hide
Query:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI
        MVRSMMSY QL  SFW YA+ET + ILN VPSKSV ETP+ELWKGRK SL++FRIWGCPAHVLV NPKKLEP S+LC FVGYPKE+RGGLFY PQENKV 
Subjt:  MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVI

Query:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA
        +STNATF+EEDH RNH+PRSK+VL E     T   D+  +S++V  +A+ S QSH SQ L +PRRSGRVV QP+RYLGL ETQ+II DDGVEDPL+YKQA
Subjt:  ISTNATFMEEDHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQA

Query:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE
        M DVD+DQWIKA++LEMES+ FNSVW LVD    V+ IGCKWIYKRKRD  GKVQ FKARLVAKG+TQ+EGVDYEETFS V MLKSIR LLSI  FY+YE
Subjt:  MYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYE

Query:  IWQMDVKTAFLNG---------------------------------------------------------------------------------------
        IWQMDVKTAFLNG                                                                                       
Subjt:  IWQMDVKTAFLNG---------------------------------------------------------------------------------------

Query:  --YLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL
          YLTD+K WL TQFQMKDLGEAQY+LGIQI R  KNKTLA+SQASYIDK+LSRY MQNSK+G LPFRHGIHLSKEQCP+TPQEVEDMR IPY+S +GSL
Subjt:  --YLTDIKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSL

Query:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS
        M                                        RTR+Y LVYG KDLIL GYTDSDFQ+DKD+RKSTSGSVFTLNGGA+VWR +KQ CIADS
Subjt:  M----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADS

Query:  TMEAEYVATCEAAKEVV
        TMEAEYVA CEAAKE V
Subjt:  TMEAEYVATCEAAKEVV

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.4e-3823.61Show/hide
Query:  RSMMSYVQLSASFWRYAVETTVQILNTVPSKSV---SETPFELWKGRKPSLQHFRIWGCPAHVLVTNPK-KLEPHSRLCQFVGY----------------
        R+M+S  +L  SFW  AV T   ++N +PS+++   S+TP+E+W  +KP L+H R++G   +V + N + K +  S    FVGY                
Subjt:  RSMMSYVQLSASFWRYAVETTVQILNTVPSKSV---SETPFELWKGRKPSLQHFRIWGCPAHVLVTNPK-KLEPHSRLCQFVGY----------------

Query:  ---------------------------PKETRGGLFYAPQENKVIIST----------NATFMEE-------------------------------DHMR
                                    KE+    F  P +++ II T          N  F+++                                 ++
Subjt:  ---------------------------PKETRGGLFYAPQENKVIIST----------NATFMEE-------------------------------DHMR

Query:  NHKPRSKLVLNEA--TDEPTRVVDQAGTSSRVDGRASTSSQ-------SHPSQSLGMP---RRSGRVVSQPDRYLGLAE---TQVIISDDGV--EDPLSY
        + K  +K  LNE+        + +  G+ +  + R S +++        +P+++ G+    RRS R+ ++P       +    +V+++   +  + P S+
Subjt:  NHKPRSKLVLNEA--TDEPTRVVDQAGTSSRVDGRASTSSQ-------SHPSQSLGMP---RRSGRVVSQPDRYLGLAE---TQVIISDDGV--EDPLSY

Query:  KQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFY
         +  Y  DK  W +AI+ E+ +   N+ W +  + E   ++  +W++  K + +G    +KARLVA+GFTQ+  +DYEETF+ V  + S R +LS+V+ Y
Subjt:  KQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFY

Query:  DYEIWQMDVKTAFLNG----------------------------------------------------------------------------YLTDI---
        + ++ QMDVKTAFLNG                                                                            Y+ D+   
Subjt:  DYEIWQMDVKTAFLNG----------------------------------------------------------------------------YLTDI---

Query:  ----------KNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHL----SKEQC---------------
                  K +L  +F+M DL E ++ +GI+I  + ++K + LSQ++Y+ K+LS+++M+N      P    I+     S E C               
Subjt:  ----------KNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHL----SKEQC---------------

Query:  ----PETPQEVEDMRRIPYASTIGS------------LMRTRDYTLVYGTKDLI----LIGYTDSDFQTDKDSRKSTSGSVFTL-NGGAIVWRIIKQRCI
            P+    V  + R  Y+S   S            L  T D  L++  K+L     +IGY DSD+   +  RKST+G +F + +   I W   +Q  +
Subjt:  ----PETPQEVEDMRRIPYASTIGS------------LMRTRDYTLVYGTKDLI----LIGYTDSDFQTDKDSRKSTSGSVFTL-NGGAIVWRIIKQRCI

Query:  ADSTMEAEYVATCEAAKEVV
        A S+ EAEY+A  EA +E +
Subjt:  ADSTMEAEYVATCEAAKEVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.6e-6730.6Show/hide
Query:  VRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVS-ETPFELWKGRKPSLQHFRIWGCP--AHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENK
        VRSM+   +L  SFW  AV+T   ++N  PS  ++ E P  +W  ++ S  H +++GC   AHV      KL+  S  C F+GY  E  G   + P + K
Subjt:  VRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVS-ETPFELWKGRKPSLQHFRIWGCP--AHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENK

Query:  VIISTNATFMEEDHMRNHKPRSKLVLN--------------------EATDEPTRVVDQAG----TSSRVDGRASTSSQSHPSQSLGMP---RRSGRVVS
        VI S +  F E + +R     S+ V N                      TDE +   +Q G       ++D         HP+Q        RRS R   
Subjt:  VIISTNATFMEEDHMRNHKPRSKLVLN--------------------EATDEPTRVVDQAG----TSSRVDGRASTSSQSHPSQSLGMP---RRSGRVVS

Query:  QPDRYLGLAETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREG
        +  RY   +   V+ISDD   +P S K+ +   +K+Q +KA+  EMES+  N  ++LV+  +G R + CKW++K K+D   K+  +KARLV KGF Q++G
Subjt:  QPDRYLGLAETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREG

Query:  VDYEETFSLVGMLKSIRKLLSIVVFYDYEIWQMDVKTAFLNG----------------------------------------------------------
        +D++E FS V  + SIR +LS+    D E+ Q+DVKTAFL+G                                                          
Subjt:  VDYEETFSLVGMLKSIRKLLSIVVFYDYEIWQMDVKTAFLNG----------------------------------------------------------

Query:  -------------------YLTD-------------IKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHG
                           Y+ D             +K  L+  F MKDLG AQ +LG++I R   ++ L LSQ  YI+++L R++M+N+K    P    
Subjt:  -------------------YLTD-------------IKNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHG

Query:  IHLSKEQCPETPQEVEDMRRIPYASTIGSLM----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKD
        + LSK+ CP T +E  +M ++PY+S +GSLM                                         T    L +G  D IL GYTD+D   D D
Subjt:  IHLSKEQCPETPQEVEDMRRIPYASTIGSLM----------------------------------------RTRDYTLVYGTKDLILIGYTDSDFQTDKD

Query:  SRKSTSGSVFTLNGGAIVWRIIKQRCIADSTMEAEYVATCEAAKEVV
        +RKS++G +FT +GGAI W+   Q+C+A ST EAEY+A  E  KE++
Subjt:  SRKSTSGSVFTLNGGAIVWRIIKQRCIADSTMEAEYVATCEAAKEVV

P92520 Uncharacterized mitochondrial protein AtMg008201.2e-1135.42Show/hide
Query:  KQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSI
        K  ++ +    W +A+  E++++  N  W LV       ++GCKW++K K  + G +   KARLVAKGF Q EG+ + ET+S V    +IR +L++
Subjt:  KQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-2023.85Show/hide
Query:  SQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEG-VRLIGCKWIYKRKRDAIGKVQ
        + S+G   ++G +   P       +  + +S     +P +  QA+ D   ++W  A+  E+ +   N  W+LV      V ++GC+WI+ +K ++ G + 
Subjt:  SQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEG-VRLIGCKWIYKRKRDAIGKVQ

Query:  AFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYEIWQMDVKTAFLNGYLTD--------------------------------------
         +KARLVAKG+ QR G+DY ETFS V    SIR +L + V   + I Q+DV  AFL G LTD                                      
Subjt:  AFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYEIWQMDVKTAFLNGYLTD--------------------------------------

Query:  -IKNWLAT--------------------------------------------------QFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRY
         ++N+L T                                                  +F +KD  E  Y LGI+  R+     L LSQ  YI  +L+R 
Subjt:  -IKNWLAT--------------------------------------------------QFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRY

Query:  SMQNSK--------------------------RGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSLMRTRDYTLVYGTKD----------LILIG
        +M  +K                          RG++     +  ++         +     +P    + +L R   Y  + GT +          L L  
Subjt:  SMQNSK--------------------------RGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSLMRTRDYTLVYGTKD----------LILIG

Query:  YTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADSTMEAEYVATCEAAKEV
        Y+D+D+  DKD   ST+G +  L    I W   KQ+ +  S+ EAEY +    + E+
Subjt:  YTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIKQRCIADSTMEAEYVATCEAAKEV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.3e-1842.86Show/hide
Query:  DPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELV-DQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLL
        +P +  QAM D   D+W +A+  E+ +   N  W+LV      V ++GC+WI+ +K ++ G +  +KARLVAKG+ QR G+DY ETFS V    SIR +L
Subjt:  DPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELV-DQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLL

Query:  SIVVFYDYEIWQMDVKTAFLNGYLTD
         + V   + I Q+DV  AFL G LTD
Subjt:  SIVVFYDYEIWQMDVKTAFLNGYLTD

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.2e-2322.88Show/hide
Query:  EDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLL
        ++P +Y +A   +    W  A+D E+ +++    WE+       + IGCKW+YK K ++ G ++ +KARLVAKG+TQ+EG+D+ ETFS V  L S++ +L
Subjt:  EDPLSYKQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLL

Query:  SIVVFYDYEIWQMDVKTAFLNG------------------------------------------------------------------------------
        +I   Y++ + Q+D+  AFLNG                                                                              
Subjt:  SIVVFYDYEIWQMDVKTAFLNG------------------------------------------------------------------------------

Query:  --YLTDI-------------KNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE-------
          Y+ DI             K+ L + F+++DLG  +Y LG++I R A    + + Q  Y   +L    +   K   +P    +  S     +       
Subjt:  --YLTDI-------------KNWLATQFQMKDLGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPE-------

Query:  -------------------TPQEVEDMRRIPYASTIGSLMRTRDYT-------LVYGTK-DLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIK
                              ++      P  +   ++M+   Y        L Y ++ ++ L  ++D+ FQ+ KD+R+ST+G    L    I W+  K
Subjt:  -------------------TPQEVEDMRRIPYASTIGSLMRTRDYT-------LVYGTK-DLILIGYTDSDFQTDKDSRKSTSGSVFTLNGGAIVWRIIK

Query:  QRCIADSTMEAEYVATCEAAKEVV
        Q+ ++ S+ EAEY A   A  E++
Subjt:  QRCIADSTMEAEYVATCEAAKEVV

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.7e-1335.42Show/hide
Query:  KQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSI
        K  ++ +    W +A+  E++++  N  W LV       ++GCKW++K K  + G +   KARLVAKGF Q EG+ + ET+S V    +IR +L++
Subjt:  KQAMYDVDKDQWIKAIDLEMESIDFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGTTCTATGATGAGCTATGTTCAATTGTCTGCCTCGTTTTGGAGATACGCAGTAGAGACTACAGTTCAAATCTTGAACACTGTTCCATCAAAGAGTGTTTCAGA
AACACCTTTTGAATTGTGGAAGGGGCGTAAACCTAGTTTACAACACTTCAGGATTTGGGGTTGTCCGGCACATGTGCTAGTGACAAACCCAAAGAAACTGGAACCTCATT
CAAGATTATGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGGTGGTCTTTTCTACGCTCCACAAGAAAACAAGGTGATTATATCGACAAACGCCACTTTCATGGAAGAA
GATCACATGAGGAACCATAAACCGCGTAGTAAATTAGTGTTAAATGAAGCTACAGATGAACCAACAAGAGTTGTTGATCAAGCTGGAACTTCATCAAGAGTTGATGGAAG
AGCCAGCACCTCAAGTCAGTCTCATCCTTCTCAATCGTTGGGAATGCCTCGACGCAGTGGGAGGGTTGTTTCCCAACCTGACCGTTACTTGGGTTTAGCTGAAACTCAAG
TTATCATATCTGATGACGGCGTAGAAGATCCATTGTCTTATAAACAGGCAATGTATGACGTAGACAAGGACCAATGGATCAAAGCCATAGACCTTGAAATGGAGTCAATA
GACTTCAATTCAGTATGGGAACTTGTAGACCAACTTGAAGGGGTTAGACTCATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATGCAATCGGAAAGGTACAGGCCTT
TAAAGCTAGACTTGTAGCAAAGGGTTTTACCCAAAGGGAAGGAGTTGACTATGAAGAAACTTTTTCCCTTGTTGGTATGCTGAAGTCCATAAGGAAACTCTTGTCCATAG
TCGTGTTTTATGATTATGAAATCTGGCAAATGGACGTCAAGACTGCCTTTTTGAATGGATACCTAACTGACATAAAGAATTGGCTGGCGACCCAATTCCAAATGAAAGAT
TTGGGAGAGGCGCAATATGTTCTTGGGATTCAGATCTTCAGAATCGCAAAGAACAAAACGCTAGCTCTGTCTCAAGCCTCTTATATCGACAAAATGTTGTCCCGATATTC
GATGCAGAATTCCAAGAGGGGCTTATTACCCTTCAGGCATGGAATTCATCTGTCTAAGGAACAGTGTCCTGAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCT
ATGCCTCTACAATAGGTAGCTTAATGAGAACGAGGGACTATACGCTTGTGTATGGGACTAAGGATTTGATCCTTATAGGATACACTGATTCTGATTTTCAGACCGATAAG
GATTCTCGTAAATCCACATCGGGATCAGTTTTCACCCTTAACGGGGGAGCTATAGTATGGCGAATCATCAAGCAAAGATGCATCGCTGACTCCACAATGGAGGCAGAGTA
TGTCGCTACTTGTGAAGCAGCTAAAGAGGTTGTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGTTCTATGATGAGCTATGTTCAATTGTCTGCCTCGTTTTGGAGATACGCAGTAGAGACTACAGTTCAAATCTTGAACACTGTTCCATCAAAGAGTGTTTCAGA
AACACCTTTTGAATTGTGGAAGGGGCGTAAACCTAGTTTACAACACTTCAGGATTTGGGGTTGTCCGGCACATGTGCTAGTGACAAACCCAAAGAAACTGGAACCTCATT
CAAGATTATGCCAATTTGTTGGCTATCCCAAAGAAACGAGAGGTGGTCTTTTCTACGCTCCACAAGAAAACAAGGTGATTATATCGACAAACGCCACTTTCATGGAAGAA
GATCACATGAGGAACCATAAACCGCGTAGTAAATTAGTGTTAAATGAAGCTACAGATGAACCAACAAGAGTTGTTGATCAAGCTGGAACTTCATCAAGAGTTGATGGAAG
AGCCAGCACCTCAAGTCAGTCTCATCCTTCTCAATCGTTGGGAATGCCTCGACGCAGTGGGAGGGTTGTTTCCCAACCTGACCGTTACTTGGGTTTAGCTGAAACTCAAG
TTATCATATCTGATGACGGCGTAGAAGATCCATTGTCTTATAAACAGGCAATGTATGACGTAGACAAGGACCAATGGATCAAAGCCATAGACCTTGAAATGGAGTCAATA
GACTTCAATTCAGTATGGGAACTTGTAGACCAACTTGAAGGGGTTAGACTCATAGGGTGTAAATGGATCTATAAGAGAAAGAGAGATGCAATCGGAAAGGTACAGGCCTT
TAAAGCTAGACTTGTAGCAAAGGGTTTTACCCAAAGGGAAGGAGTTGACTATGAAGAAACTTTTTCCCTTGTTGGTATGCTGAAGTCCATAAGGAAACTCTTGTCCATAG
TCGTGTTTTATGATTATGAAATCTGGCAAATGGACGTCAAGACTGCCTTTTTGAATGGATACCTAACTGACATAAAGAATTGGCTGGCGACCCAATTCCAAATGAAAGAT
TTGGGAGAGGCGCAATATGTTCTTGGGATTCAGATCTTCAGAATCGCAAAGAACAAAACGCTAGCTCTGTCTCAAGCCTCTTATATCGACAAAATGTTGTCCCGATATTC
GATGCAGAATTCCAAGAGGGGCTTATTACCCTTCAGGCATGGAATTCATCTGTCTAAGGAACAGTGTCCTGAGACACCTCAAGAAGTTGAGGATATGAGACGTATTCCCT
ATGCCTCTACAATAGGTAGCTTAATGAGAACGAGGGACTATACGCTTGTGTATGGGACTAAGGATTTGATCCTTATAGGATACACTGATTCTGATTTTCAGACCGATAAG
GATTCTCGTAAATCCACATCGGGATCAGTTTTCACCCTTAACGGGGGAGCTATAGTATGGCGAATCATCAAGCAAAGATGCATCGCTGACTCCACAATGGAGGCAGAGTA
TGTCGCTACTTGTGAAGCAGCTAAAGAGGTTGTTTGA
Protein sequenceShow/hide protein sequence
MVRSMMSYVQLSASFWRYAVETTVQILNTVPSKSVSETPFELWKGRKPSLQHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFYAPQENKVIISTNATFMEE
DHMRNHKPRSKLVLNEATDEPTRVVDQAGTSSRVDGRASTSSQSHPSQSLGMPRRSGRVVSQPDRYLGLAETQVIISDDGVEDPLSYKQAMYDVDKDQWIKAIDLEMESI
DFNSVWELVDQLEGVRLIGCKWIYKRKRDAIGKVQAFKARLVAKGFTQREGVDYEETFSLVGMLKSIRKLLSIVVFYDYEIWQMDVKTAFLNGYLTDIKNWLATQFQMKD
LGEAQYVLGIQIFRIAKNKTLALSQASYIDKMLSRYSMQNSKRGLLPFRHGIHLSKEQCPETPQEVEDMRRIPYASTIGSLMRTRDYTLVYGTKDLILIGYTDSDFQTDK
DSRKSTSGSVFTLNGGAIVWRIIKQRCIADSTMEAEYVATCEAAKEVV