; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019997 (gene) of Snake gourd v1 genome

Gene IDTan0019997
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG06:41889012..41890447
RNA-Seq ExpressionTan0019997
SyntenyTan0019997
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-20774.32Show/hide
Query:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT
        MNVKAR G+EYF+SFIDDYSRYGY+YLM   SE LEKFKE+KTEV+NLL K IK LRSDRGGEY+D +FQDYMI+HGI  QLS PG PQQNGVSERRNRT
Subjt:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT

Query:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN
         LDMVRSMMSYA+LP SFWGYAVETAV+ILN VPSKSV ETPFELW GRK +L HFRIW CP HVLV+NPKKLEPRS+LC FVG PKETRGG F+DP++N
Subjt:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN

Query:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD
        RV VSTN TFLEEDH+R+H P+SK+VL+E      RVVD    S  V + ++S Q   SQ L MPRRSGRVV QP RY+GL ETQVV PDD  EDPL+Y 
Subjt:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD

Query:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD
        QAM DVDKD+W+KAMD EMESMYFNSVWELVD P+GVKPIGCKWIYK KR   GKV               EGVDYEETFSPVAM+KSIRILL+I  +YD
Subjt:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD

Query:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK
        YE+WQ+DVKTAFLNGNL+E I+M QP+GFI QGQEQKVC+L RSIYGLKQASRSWNIRFD AIKSYGFDQN+DEPCVYK
Subjt:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK

KAA0033121.1 gag/pol protein [Cucumis melo var. makuwa]4.4e-20272.44Show/hide
Query:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT
        MNVKAR G+EYF+SFIDDYSRYGY+YLM   SE LEKFKE+K EV+NLL K IK LRSDRGGEY+D +FQDYMI+HGI +QLS PG PQQNGV ERRNRT
Subjt:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT

Query:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN
         LDMVRSMMSYA+LP SFWGYAVETAV+ILN V SKSV ETPFELW GRK +L HF+I  CP HVLV+NPKKLEPRS+LC FVG PKETRGG F+DP+ N
Subjt:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN

Query:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD
        RVLVSTN TFLEEDH+RDH P++K+VLNE      RVVD    S  V + ++S Q   SQ L MPRRSGR+V QP RY+GL ETQVV PDD  EDPL+Y+
Subjt:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD

Query:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD
        QAM DVDKD+W+KAMD EMESMYFN +WELVD P+GVKPIGCKWIYK KR   GKV               EGVDYEETFSPVAM+KSIRILL+I  +YD
Subjt:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD

Query:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK
        YE+W++DV TAFLNGNL+E I+M QP+GFI QGQEQKVC+L RSIYGLKQASRSWNIRFD AIKSYGF+QN+DEPCVYK
Subjt:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]2.5e-20573.9Show/hide
Query:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT
        MNVKAR  +EYF+SFIDDYSRYGY+YLM   SE LEKFKE+KTEV+NLL K IK  RSDRGGEY+D  FQDYMI+HGI  QLS PG PQQNGVSERRNRT
Subjt:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT

Query:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN
         LDMVRSMMSYA+LP SFWGYAVETAV+ILN VPSKSV ETPFELW GRK +L HFRIW CP HVLV+NPKKLEPRS+LC FVG PKETRGG F+DPK+N
Subjt:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN

Query:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD
        RV VSTN TFLEEDH+R+H P+SK+VL+E      RVVD    S  V + ++S Q   SQ L MPRRSGRVV QP RY+GL ETQVV PDD  EDPL+Y 
Subjt:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD

Query:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD
        QAM DVDKD+W+KAMD EMESMYFNSVWELVD P+GVKPIGCKWIYK KR   GKV               EGVDYEETFS VAM+KSIRILL+I  +YD
Subjt:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD

Query:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK
        YE+WQ+DVKTAFLNGNL+E I+M QP+GFI QGQEQKVC+L RSIYGLKQASRSWNIRFD AIKSYGFDQN+DEPCVYK
Subjt:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-20774.32Show/hide
Query:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT
        MNVKAR G+EYF+SFIDDYSRYGY+YLM   SE LEKFKE+KTEV+NLL K IK LRSDRGGEY+D +FQDYMI+HGI  QLS PG PQQNGVSERRNRT
Subjt:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT

Query:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN
         LDMVRSMMSYA+LP SFWGYAVETAV+ILN VPSKSV ETPFELW GRK +L HFRIW CP HVLV+NPKKLEPRS+LC FVG PKETRGG F+DP++N
Subjt:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN

Query:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD
        RV VSTN TFLEEDH+R+H P+SK+VL+E      RVVD    S  V + ++S Q   SQ L MPRRSGRVV QP RY+GL ETQVV PDD  EDPL+Y 
Subjt:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD

Query:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD
        QAM DVDKD+W+KAMD EMESMYFNSVWELVD P+GVKPIGCKWIYK KR   GKV               EGVDYEETFSPVAM+KSIRILL+I  +YD
Subjt:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD

Query:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK
        YE+WQ+DVKTAFLNGNL+E I+M QP+GFI QGQEQKVC+L RSIYGLKQASRSWNIRFD AIKSYGFDQN+DEPCVYK
Subjt:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK

TYK15984.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-20473.28Show/hide
Query:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT
        MNVKAR  +EYF+SFIDDYSRYGY+YLM   SE LEKFKE+K EV+NLL K IK LRSDRGGEY+D +FQDYMI+HGI  QLS PG PQQNGVSERRNRT
Subjt:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT

Query:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN
         LDMV SMMSYA+LP SFWGYAVETAV+ILN VPSKSV + PFELW GRK +L HFRIW CP HVLV+NPKKLEPRS+LC FVG PKETRGG F+DP++N
Subjt:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN

Query:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD
        RV VSTN TFLEEDH+R+H P+SK+VL+E      RVVD    S  V + ++S Q   SQ L MPRRSGRVV QP RY+GL ETQVV PDD  EDPL+Y 
Subjt:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD

Query:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD
        QAM DVDKD+W+KAMD EMESMYFNSVWELVD P+GVKPIGCKWIYK KR   GKV               EGVDYEETFSPVAM+KSIRILL+I  +YD
Subjt:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD

Query:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK
        YE+WQ+DVKTAFLNGNL+E I+M QP+GFI QGQEQKVC+L RSIYGLKQASRSWNIRFD AIKSYGFDQN+DEPCVYK
Subjt:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein1.2e-20573.9Show/hide
Query:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT
        MNVKAR  +EYF+SFIDDYSRYGY+YLM   SE LEKFKE+KTEV+NLL K IK  RSDRGGEY+D  FQDYMI+HGI  QLS PG PQQNGVSERRNRT
Subjt:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT

Query:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN
         LDMVRSMMSYA+LP SFWGYAVETAV+ILN VPSKSV ETPFELW GRK +L HFRIW CP HVLV+NPKKLEPRS+LC FVG PKETRGG F+DPK+N
Subjt:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN

Query:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD
        RV VSTN TFLEEDH+R+H P+SK+VL+E      RVVD    S  V + ++S Q   SQ L MPRRSGRVV QP RY+GL ETQVV PDD  EDPL+Y 
Subjt:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD

Query:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD
        QAM DVDKD+W+KAMD EMESMYFNSVWELVD P+GVKPIGCKWIYK KR   GKV               EGVDYEETFS VAM+KSIRILL+I  +YD
Subjt:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD

Query:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK
        YE+WQ+DVKTAFLNGNL+E I+M QP+GFI QGQEQKVC+L RSIYGLKQASRSWNIRFD AIKSYGFDQN+DEPCVYK
Subjt:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK

A0A5A7TZD0 Gag/pol protein5.8e-20874.32Show/hide
Query:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT
        MNVKAR G+EYF+SFIDDYSRYGY+YLM   SE LEKFKE+KTEV+NLL K IK LRSDRGGEY+D +FQDYMI+HGI  QLS PG PQQNGVSERRNRT
Subjt:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT

Query:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN
         LDMVRSMMSYA+LP SFWGYAVETAV+ILN VPSKSV ETPFELW GRK +L HFRIW CP HVLV+NPKKLEPRS+LC FVG PKETRGG F+DP++N
Subjt:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN

Query:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD
        RV VSTN TFLEEDH+R+H P+SK+VL+E      RVVD    S  V + ++S Q   SQ L MPRRSGRVV QP RY+GL ETQVV PDD  EDPL+Y 
Subjt:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD

Query:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD
        QAM DVDKD+W+KAMD EMESMYFNSVWELVD P+GVKPIGCKWIYK KR   GKV               EGVDYEETFSPVAM+KSIRILL+I  +YD
Subjt:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD

Query:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK
        YE+WQ+DVKTAFLNGNL+E I+M QP+GFI QGQEQKVC+L RSIYGLKQASRSWNIRFD AIKSYGFDQN+DEPCVYK
Subjt:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK

A0A5A7UYE8 Gag/pol protein5.8e-20874.32Show/hide
Query:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT
        MNVKAR G+EYF+SFIDDYSRYGY+YLM   SE LEKFKE+KTEV+NLL K IK LRSDRGGEY+D +FQDYMI+HGI  QLS PG PQQNGVSERRNRT
Subjt:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT

Query:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN
         LDMVRSMMSYA+LP SFWGYAVETAV+ILN VPSKSV ETPFELW GRK +L HFRIW CP HVLV+NPKKLEPRS+LC FVG PKETRGG F+DP++N
Subjt:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN

Query:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD
        RV VSTN TFLEEDH+R+H P+SK+VL+E      RVVD    S  V + ++S Q   SQ L MPRRSGRVV QP RY+GL ETQVV PDD  EDPL+Y 
Subjt:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD

Query:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD
        QAM DVDKD+W+KAMD EMESMYFNSVWELVD P+GVKPIGCKWIYK KR   GKV               EGVDYEETFSPVAM+KSIRILL+I  +YD
Subjt:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD

Query:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK
        YE+WQ+DVKTAFLNGNL+E I+M QP+GFI QGQEQKVC+L RSIYGLKQASRSWNIRFD AIKSYGFDQN+DEPCVYK
Subjt:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK

A0A5D3CYF4 Gag/pol protein6.0e-20573.28Show/hide
Query:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT
        MNVKAR  +EYF+SFIDDYSRYGY+YLM   SE LEKFKE+K EV+NLL K IK LRSDRGGEY+D +FQDYMI+HGI  QLS PG PQQNGVSERRNRT
Subjt:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT

Query:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN
         LDMV SMMSYA+LP SFWGYAVETAV+ILN VPSKSV + PFELW GRK +L HFRIW CP HVLV+NPKKLEPRS+LC FVG PKETRGG F+DP++N
Subjt:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN

Query:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD
        RV VSTN TFLEEDH+R+H P+SK+VL+E      RVVD    S  V + ++S Q   SQ L MPRRSGRVV QP RY+GL ETQVV PDD  EDPL+Y 
Subjt:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD

Query:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD
        QAM DVDKD+W+KAMD EMESMYFNSVWELVD P+GVKPIGCKWIYK KR   GKV               EGVDYEETFSPVAM+KSIRILL+I  +YD
Subjt:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD

Query:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK
        YE+WQ+DVKTAFLNGNL+E I+M QP+GFI QGQEQKVC+L RSIYGLKQASRSWNIRFD AIKSYGFDQN+DEPCVYK
Subjt:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK

A0A5D3CZY3 Gag/pol protein2.1e-20272.44Show/hide
Query:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT
        MNVKAR G+EYF+SFIDDYSRYGY+YLM   SE LEKFKE+K EV+NLL K IK LRSDRGGEY+D +FQDYMI+HGI +QLS PG PQQNGV ERRNRT
Subjt:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT

Query:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN
         LDMVRSMMSYA+LP SFWGYAVETAV+ILN V SKSV ETPFELW GRK +L HF+I  CP HVLV+NPKKLEPRS+LC FVG PKETRGG F+DP+ N
Subjt:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDN

Query:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD
        RVLVSTN TFLEEDH+RDH P++K+VLNE      RVVD    S  V + ++S Q   SQ L MPRRSGR+V QP RY+GL ETQVV PDD  EDPL+Y+
Subjt:  RVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVR-SQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYD

Query:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD
        QAM DVDKD+W+KAMD EMESMYFN +WELVD P+GVKPIGCKWIYK KR   GKV               EGVDYEETFSPVAM+KSIRILL+I  +YD
Subjt:  QAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAITAYYD

Query:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK
        YE+W++DV TAFLNGNL+E I+M QP+GFI QGQEQKVC+L RSIYGLKQASRSWNIRFD AIKSYGF+QN+DEPCVYK
Subjt:  YEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVYK

SwissProt top hitse value%identityAlignment
P04146 Copia protein6.5e-5527.21Show/hide
Query:  YFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRTFLDMVRSMMS
        YFV F+D ++ Y   YL+   S+    F++F  + +      +  L  D G EY+  + + + +  GI+  L+ P  PQ NGVSER  RT  +  R+M+S
Subjt:  YFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRTFLDMVRSMMS

Query:  YARLPDSFWGYAVETAVYILNTVPSKSVCE---TPFELWHGRKINLRHFRIWRCPTHVLVSNPK-KLEPRSKLCLFVGCPKETRGGFFYD----------
         A+L  SFWG AV TA Y++N +PS+++ +   TP+E+WH +K  L+H R++    +V + N + K + +S   +FVG   E  G   +D          
Subjt:  YARLPDSFWGYAVETAVYILNTVPSKSVCE---TPFELWHGRKINLRHFRIWRCPTHVLVSNPK-KLEPRSKLCLFVGCPKETRGGFFYD----------

Query:  ---------------------------------PKDNRVLVST----------NTTFLEE-------------------------------DHIRDHLPK
                                         P D+R ++ T          N  FL++                                 ++D    
Subjt:  ---------------------------------PKDNRVLVST----------NTTFLEE-------------------------------DHIRDHLPK

Query:  SKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVRSQE------LEMP----------RRSGRVVRQP-----ERYMGLAETQVVTPDDDCEDPLTYDQA
        +K  LNE  S   +  D  + SK   +P+ S +  + E      ++ P          RRS R+  +P     E    L +  +       + P ++D+ 
Subjt:  SKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVRSQE------LEMP----------RRSGRVVRQP-----ERYMGLAETQVVTPDDDCEDPLTYDQA

Query:  MADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCK----------------RGVDGKVEGVDYEETFSPVAMVKSIRILLAITAYYDY
            DK  W +A++ E+ +   N+ W +  +P+    +  +W++  K                RG   K + +DYEETF+PVA + S R +L++   Y+ 
Subjt:  MADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCK----------------RGVDGKVEGVDYEETFSPVAMVKSIRILLAITAYYDY

Query:  EVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVY
        +V Q+DVKTAFLNG L E IYM  P+G  +      VC+L ++IYGLKQA+R W   F++A+K   F  +  + C+Y
Subjt:  EVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVY

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.1e-8635.49Show/hide
Query:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT
        M +++  G +YFV+FIDD SR  ++Y++    +  + F++F   V+   G+ +K LRSD GGEY   +F++Y   HGI  + + PG PQ NGV+ER NRT
Subjt:  MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRT

Query:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVC-ETPFELWHGRKINLRHFRIWRCP--THVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDP
         ++ VRSM+  A+LP SFWG AV+TA Y++N  PS  +  E P  +W  ++++  H +++ C    HV      KL+ +S  C+F+G   E  G   +DP
Subjt:  FLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSVC-ETPFELWHGRKINLRHFRIWRCP--THVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDP

Query:  KDNRVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVRSQELEMP------------------------------RRS
           +V+ S +  F  E  +R     S+ V    N  +   V   STS +     S++   S++ E P                              RRS
Subjt:  KDNRVLVSTNTTFLEEDHIRDHLPKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVRSQELEMP------------------------------RRS

Query:  GRVVRQPERYMGLAETQVVTPDDDCEDPLTYDQAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV------------
         R   +  RY     T+ V   DD  +P +  + ++  +K++ +KAM +EMES+  N  ++LV+ P G +P+ CKW++K K+  D K+            
Subjt:  GRVVRQPERYMGLAETQVVTPDDDCEDPLTYDQAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV------------

Query:  ---EGVDYEETFSPVAMVKSIRILLAITAYYDYEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGF
           +G+D++E FSPV  + SIR +L++ A  D EV Q+DVKTAFL+G+L+E IYM+QP+GF V G++  VC+L +S+YGLKQA R W ++FD  +KS  +
Subjt:  ---EGVDYEETFSPVAMVKSIRILLAITAYYDYEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGF

Query:  DQNIDEPCVY
         +   +PCVY
Subjt:  DQNIDEPCVY

Q12491 Transposon Ty2-B Gag-Pol polyprotein1.1e-1426.32Show/hide
Query:  YFVSFIDDYSRYGYIYLMH--RNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRTFLDMVRSM
        YF+SF D+ +R+ ++Y +H  R    L  F      +KN     +  ++ DRG EY +     +  + GIT   +T    + +GV+ER NRT L+  R++
Subjt:  YFVSFIDDYSRYGYIYLMH--RNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRTFLDMVRSM

Query:  MSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSN---PKKLEPRSKLCLFVGCPKETRGGFFYDPKDNRVLVS
        +  + LP+  W  AVE +  I N++ S    ++  +  H     L    I      V+V+N     K+ PR      +   + + G   Y P   + + +
Subjt:  MSYARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSN---PKKLEPRSKLCLFVGCPKETRGGFFYDPKDNRVLVS

Query:  TNTTFLEED
        TN   L+++
Subjt:  TNTTFLEED

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE14.2e-4624.17Show/hide
Query:  YEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRTFLDMVRSM
        Y Y+V F+D ++RY ++Y + + S+  E F  FK  ++N     I T  SD GGE++     +Y   HGI+   S P  P+ NG+SER++R  ++   ++
Subjt:  YEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRTFLDMVRSM

Query:  MSYARLPDSFWGYAVETAVYILNTVPSKSV-CETPFELWHGRKINLRHFRIWRCPTHVLVS--NPKKLEPRSKLCLFVGCPKETRGGFFYDPKDNRVLVS
        +S+A +P ++W YA   AVY++N +P+  +  E+PF+   G   N    R++ C  +  +   N  KL+ +S+ C+F+G             + +R+ +S
Subjt:  MSYARLPDSFWGYAVETAVYILNTVPSKSV-CETPFELWHGRKINLRHFRIWRCPTHVLVS--NPKKLEPRSKLCLFVGCPKETRGGFFYDPKDNRVLVS

Query:  TNTTFLE---------------------------------------------------------------------------------------------
         +  F E                                                                                             
Subjt:  TNTTFLE---------------------------------------------------------------------------------------------

Query:  -------EDHIRDHLPKSKIVLNETNSSLARVVDGAST----SKSVVDPSSSSQVRSQE-------LEMPRRSGRVVRQPER------YMGL-AETQVVT
               +   + H  ++    N TN S +++    ST    S S   P++S+   S         +  P    ++V    +       MG  A+  ++ 
Subjt:  -------EDHIRDHLPKSKIVLNETNSSLARVVDGAST----SKSVVDPSSSSQVRSQE-------LEMPRRSGRVVRQPER------YMGL-AETQVVT

Query:  PDDDC---------EDPLTYDQAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDG-VKPIGCKWIYKCKRGVDGKVE---------------GVDYEE
        P+             +P T  QA+ D   + W  AM  E+ +   N  W+LV  P   V  +GC+WI+  K   DG +                G+DY E
Subjt:  PDDDC---------EDPLTYDQAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDG-VKPIGCKWIYKCKRGVDGKVE---------------GVDYEE

Query:  TFSPVAMVKSIRILLAITAYYDYEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVY
        TFSPV    SIRI+L +     + + Q+DV  AFL G L + +YM QP GFI + +   VC+L++++YGLKQA R+W +     + + GF  ++ +  ++
Subjt:  TFSPVAMVKSIRILLAITAYYDYEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDEPCVY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.4e-4825.66Show/hide
Query:  YEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRTFLDMVRSM
        Y Y+V F+D ++RY ++Y + + S+  + F  FK+ V+N     I TL SD GGE++    +DY+  HGI+   S P  P+ NG+SER++R  ++M  ++
Subjt:  YEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRTFLDMVRSM

Query:  MSYARLPDSFWGYAVETAVYILNTVPSKSV-CETPFELWHGRKINLRHFRIWRCPTHVLVS--NPKKLEPRSKLCLFVG---------CPKETRGGFF--
        +S+A +P ++W YA   AVY++N +P+  +  ++PF+   G+  N    +++ C  +  +   N  KLE +SK C F+G         C     G  +  
Subjt:  MSYARLPDSFWGYAVETAVYILNTVPSKSV-CETPFELWHGRKINLRHFRIWRCPTHVLVS--NPKKLEPRSKLCLFVG---------CPKETRGGFF--

Query:  -YDPKDNRVLVSTNTTF---LEEDHIRDH---------LPKSKIVL------------------NETNSSLARVVDGASTSKSVVDPSSS----------
         +   D R    + T F     ++   D          LP + +VL                  + +     +V      S S+  PSSS          
Subjt:  -YDPKDNRVLVSTNTTF---LEEDHIRDH---------LPKSKIVL------------------NETNSSLARVVDGASTSKSVVDPSSS----------

Query:  -----------SQVRSQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDC--------------------------------------------------
                   S   S  L  P  +      P +   L ++ + +P                                                      
Subjt:  -----------SQVRSQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDC--------------------------------------------------

Query:  ------------------EDPLTYDQAMADVDKDEWIKAMDQEMESMYFNSVWELV-DQPDGVKPIGCKWIYKCKRGVDGKVE---------------GV
                           +P T  QAM D   D W +AM  E+ +   N  W+LV   P  V  +GC+WI+  K   DG +                G+
Subjt:  ------------------EDPLTYDQAMADVDKDEWIKAMDQEMESMYFNSVWELV-DQPDGVKPIGCKWIYKCKRGVDGKVE---------------GV

Query:  DYEETFSPVAMVKSIRILLAITAYYDYEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDE
        DY ETFSPV    SIRI+L +     + + Q+DV  AFL G L + +YM QP GF+ + +   VCRL+++IYGLKQA R+W +     + + GF  +I +
Subjt:  DYEETFSPVAMVKSIRILLAITAYYDYEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQNIDE

Query:  PCVY
          ++
Subjt:  PCVY

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.2e-3441.21Show/hide
Query:  EDPLTYDQAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILL
        ++P TY++A   +    W  AMD E+ +M     WE+   P   KPIGCKW+YK K   DG +               EG+D+ ETFSPV  + S++++L
Subjt:  EDPLTYDQAMADVDKDEWIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILL

Query:  AITAYYDYEVWQIDVKTAFLNGNLDEPIYMDQPKGFIV-QGQE---QKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQN
        AI+A Y++ + Q+D+  AFLNG+LDE IYM  P G+   QG       VC L++SIYGLKQASR W ++F   +  +GF Q+
Subjt:  AITAYYDYEVWQIDVKTAFLNGNLDEPIYMDQPKGFIV-QGQE---QKVCRLQRSIYGLKQASRSWNIRFDEAIKSYGFDQN

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.7e-0634.15Show/hide
Query:  NRTFLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSV-CETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSK
        NRT ++ VRSM+    LP +F   A  TAV+I+N  PS ++    P E+W        + R + C  ++   +  KL+PR+K
Subjt:  NRTFLDMVRSMMSYARLPDSFWGYAVETAVYILNTVPSKSV-CETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.0e-0831.76Show/hide
Query:  WIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAI
        W +AM +E++++  N  W LV  P     +GCKW++K K   DG +               EG+ + ET+SPV    +IR +L +
Subjt:  WIKAMDQEMESMYFNSVWELVDQPDGVKPIGCKWIYKCKRGVDGKV---------------EGVDYEETFSPVAMVKSIRILLAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGTTAAAGCACGAAGAGGTTACGAATACTTCGTATCTTTCATAGATGACTATTCAAGGTATGGGTACATTTACCTAATGCATAGGAACTCTGAAACACTTGAAAA
GTTCAAGGAGTTCAAGACTGAGGTTAAGAACCTGTTAGGTAAAACGATAAAAACACTTCGATCGGATCGAGGTGGAGAGTATATAGACACTAAATTCCAGGACTATATGA
TAGATCATGGAATTACGTTGCAACTCTCAACACCTGGTATGCCACAGCAGAATGGTGTATCAGAGAGGAGAAACAGAACCTTTTTGGACATGGTTCGGTCGATGATGAGC
TATGCTCGTCTCCCTGATTCTTTTTGGGGTTACGCAGTGGAGACTGCAGTTTACATTTTGAACACCGTTCCGTCGAAGAGTGTTTGTGAAACACCTTTCGAACTCTGGCA
TGGTCGTAAAATCAATTTACGTCATTTCAGAATTTGGAGATGCCCGACCCATGTGTTGGTGTCAAACCCGAAGAAGTTGGAACCTCGTTCAAAGTTGTGCCTCTTTGTAG
GTTGCCCAAAAGAGACTAGAGGTGGTTTCTTTTATGATCCTAAGGATAATAGGGTGCTTGTGTCGACAAACACCACTTTCCTTGAGGAAGACCACATTAGGGATCACTTA
CCAAAGAGTAAAATTGTGTTGAATGAAACAAACAGTTCATTAGCAAGAGTTGTTGATGGTGCTAGTACATCAAAAAGTGTTGTTGATCCAAGCTCGTCTAGTCAAGTCCG
TTCCCAAGAGTTGGAAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCCTGAACGTTACATGGGTTTAGCTGAAACCCAAGTTGTCACTCCTGATGATGACTGCGAGG
ATCCATTGACCTATGATCAGGCAATGGCAGACGTTGACAAAGACGAATGGATTAAAGCCATGGACCAGGAAATGGAGTCGATGTACTTCAATTCCGTCTGGGAGCTTGTG
GACCAACCGGATGGTGTTAAACCTATTGGTTGCAAGTGGATCTACAAGTGTAAACGTGGTGTAGATGGGAAGGTGGAAGGGGTTGACTATGAGGAAACCTTTTCACCTGT
TGCCATGGTAAAGTCGATTCGAATCCTTCTGGCCATTACCGCATATTATGACTATGAGGTATGGCAGATAGATGTCAAGACCGCCTTTCTGAATGGCAACCTTGACGAGC
CCATCTACATGGACCAGCCAAAGGGGTTCATTGTCCAAGGCCAAGAGCAAAAGGTTTGCCGTCTTCAAAGGTCTATTTATGGACTGAAACAAGCCTCGAGGTCTTGGAAT
ATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTGATCAAAATATTGACGAGCCTTGTGTTTACAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATGTTAAAGCACGAAGAGGTTACGAATACTTCGTATCTTTCATAGATGACTATTCAAGGTATGGGTACATTTACCTAATGCATAGGAACTCTGAAACACTTGAAAA
GTTCAAGGAGTTCAAGACTGAGGTTAAGAACCTGTTAGGTAAAACGATAAAAACACTTCGATCGGATCGAGGTGGAGAGTATATAGACACTAAATTCCAGGACTATATGA
TAGATCATGGAATTACGTTGCAACTCTCAACACCTGGTATGCCACAGCAGAATGGTGTATCAGAGAGGAGAAACAGAACCTTTTTGGACATGGTTCGGTCGATGATGAGC
TATGCTCGTCTCCCTGATTCTTTTTGGGGTTACGCAGTGGAGACTGCAGTTTACATTTTGAACACCGTTCCGTCGAAGAGTGTTTGTGAAACACCTTTCGAACTCTGGCA
TGGTCGTAAAATCAATTTACGTCATTTCAGAATTTGGAGATGCCCGACCCATGTGTTGGTGTCAAACCCGAAGAAGTTGGAACCTCGTTCAAAGTTGTGCCTCTTTGTAG
GTTGCCCAAAAGAGACTAGAGGTGGTTTCTTTTATGATCCTAAGGATAATAGGGTGCTTGTGTCGACAAACACCACTTTCCTTGAGGAAGACCACATTAGGGATCACTTA
CCAAAGAGTAAAATTGTGTTGAATGAAACAAACAGTTCATTAGCAAGAGTTGTTGATGGTGCTAGTACATCAAAAAGTGTTGTTGATCCAAGCTCGTCTAGTCAAGTCCG
TTCCCAAGAGTTGGAAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCCTGAACGTTACATGGGTTTAGCTGAAACCCAAGTTGTCACTCCTGATGATGACTGCGAGG
ATCCATTGACCTATGATCAGGCAATGGCAGACGTTGACAAAGACGAATGGATTAAAGCCATGGACCAGGAAATGGAGTCGATGTACTTCAATTCCGTCTGGGAGCTTGTG
GACCAACCGGATGGTGTTAAACCTATTGGTTGCAAGTGGATCTACAAGTGTAAACGTGGTGTAGATGGGAAGGTGGAAGGGGTTGACTATGAGGAAACCTTTTCACCTGT
TGCCATGGTAAAGTCGATTCGAATCCTTCTGGCCATTACCGCATATTATGACTATGAGGTATGGCAGATAGATGTCAAGACCGCCTTTCTGAATGGCAACCTTGACGAGC
CCATCTACATGGACCAGCCAAAGGGGTTCATTGTCCAAGGCCAAGAGCAAAAGGTTTGCCGTCTTCAAAGGTCTATTTATGGACTGAAACAAGCCTCGAGGTCTTGGAAT
ATAAGGTTTGATGAGGCGATCAAATCTTATGGCTTTGATCAAAATATTGACGAGCCTTGTGTTTACAAGTAA
Protein sequenceShow/hide protein sequence
MNVKARRGYEYFVSFIDDYSRYGYIYLMHRNSETLEKFKEFKTEVKNLLGKTIKTLRSDRGGEYIDTKFQDYMIDHGITLQLSTPGMPQQNGVSERRNRTFLDMVRSMMS
YARLPDSFWGYAVETAVYILNTVPSKSVCETPFELWHGRKINLRHFRIWRCPTHVLVSNPKKLEPRSKLCLFVGCPKETRGGFFYDPKDNRVLVSTNTTFLEEDHIRDHL
PKSKIVLNETNSSLARVVDGASTSKSVVDPSSSSQVRSQELEMPRRSGRVVRQPERYMGLAETQVVTPDDDCEDPLTYDQAMADVDKDEWIKAMDQEMESMYFNSVWELV
DQPDGVKPIGCKWIYKCKRGVDGKVEGVDYEETFSPVAMVKSIRILLAITAYYDYEVWQIDVKTAFLNGNLDEPIYMDQPKGFIVQGQEQKVCRLQRSIYGLKQASRSWN
IRFDEAIKSYGFDQNIDEPCVYK