CuGenDBv2

Tan0002103 (gene) of Snake gourd v1 genome

Gene ID: Tan0002103
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG09:44449215..44451187
RNA-Seq Expression: Tan0002103
Synteny: Tan0002103
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
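
The genome location listed above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the length of the gene span could be computed as follows. The names `parse_location` and `span_length` are hypothetical helpers for illustration and are not part of any CuGenDB tooling.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'sequence:start..end' string into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq_id = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seq_id, start, end


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    return end - start + 1


seq_id, start, end = parse_location("LG09:44449215..44451187")
print(seq_id, start, end, span_length(start, end))
```

If the coordinates turned out to be 0-based or half-open, only `span_length` would need to change.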


Homology
GenBank top hits (e-value, % identity, alignment)
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]; e-value: 1.4e-210; % identity: 60.03
Query:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH
        MD  FQDYMIE GI SQLSAPG PQ+NG+SERRNRTLLDMVRSMMSYA+LP SFWGYAVETAV+ILNNVPSKSV ETPFELW GRK SL HFRIWG P H
Subjt:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH

Query:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELG
        VLV+NPKKLEPRS+LC FVGYPKETRGGLF+DP+ENRV VSTNATFLEEDH+R+H PRSK+VL+E  ++     +D    S+ V + +TS Q   SQ L 
Subjt:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELG

Query:  MPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKR-------------
        MPRRSGRVV QP+RY+GL ET VV PDD  EDPL+Y  AM DVDKD+W+KAMD EMESMYFN + ELVD P+GVKPIGCKWIYKRKR             
Subjt:  MPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAI
                 GVD           KSI IL +I  +YDYE+WQMDVKTAFLNG L+E+I+M QP+GFI QGQEQKVC+L+RSIYGLKQASRSWNI FD AI
Subjt:  ---------GVDG----------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAI

Query:  KSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL----------------------------------
        KSYGFDQNV+EPCVYKK     VAFLV+YVD ILL GN+V +LTDVK WLA+QFQMKDLGEAQYVL                                  
Subjt:  KSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL----------------------------------

Query:  ------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYT
                                            GS MY MLCTRPDICYAVGIV RYQ N GLDH   VK +LKYLRRTR+Y LVYG  DLILTGYT
Subjt:  ------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYT

Query:  ----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK
                  R ST   +  L       +  WRS      ++GCI DSTMEAEYVA  ++ ++
Subjt:  ----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]; e-value: 4.8e-211; % identity: 60.33
Query:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH
        MD  FQDYMIE GI SQLSAPG PQ+NG+SERRNRTLLDMVRSMMSYA+LP SFWGYAVETAV+ILNNVPSKSV ETPFELW GRK SL HFRIWG P H
Subjt:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH

Query:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELG
        VLV+NPKKLEPRS+LC FVGYPKETRGGLF+DPKENRV VSTNATFLEEDH+R+H PRSK+VL+E  ++     +D    S+ V + +TS Q   SQ L 
Subjt:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELG

Query:  MPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKR-------------
        MPRRSGRVV QP+RY+GL ET VV PDD  EDPL+Y  AM DVDKD+W+KAMD EMESMYFN + ELVD P+GVKPIGCKWIYKRKR             
Subjt:  MPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAI
                 GVD           KSI IL +I  +YDYE+WQMDVKTAFLNG L+E+I+M QP+GFI QGQEQKVC+L+RSIYGLKQASRSWNI FD AI
Subjt:  ---------GVDG----------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAI

Query:  KSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL----------------------------------
        KSYGFDQNV+EPCVYKK     VAFLV+YVD ILL GN+V +LTDVK WLA+QFQMKDLGE QYVL                                  
Subjt:  KSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL----------------------------------

Query:  ------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYT
                                            GS MY MLCTRPDICYAVGIV RYQ N GLDH   VK ILKYLRRTR+Y LVYG  DLILTGYT
Subjt:  ------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYT

Query:  ----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK
                  R ST R +  L       +  WRS      ++GCI DSTMEAEYVA  ++ ++
Subjt:  ----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK

KAA0047792.1 gag/pol protein [Cucumis melo var. makuwa]; e-value: 2.7e-206; % identity: 58.11
Query:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH
        MD +FQ+Y++E GI SQLSAPG PQ+NG+SERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAVYILN VPSKSV ETP +LW+GRKGSL HFRIWG P H
Subjt:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH

Query:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKE---EFIDGASTSTSVVDPSTSSQI-RSQ
        VL +NPKKLEPRSKLCLFVGYPK TRGG FYDPK+N+V VSTNATFLEEDH+R+H PRSKIVLNE+ ++  E     ++  S  T VV   +S++  + Q
Subjt:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKE---EFIDGASTSTSVVDPSTSSQI-RSQ

Query:  ELGMPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKRGVDG------
         L  PRRSGRV   P RYM L ET  V  D D EDPLT+  AM DVDKDEWIKAM+ E+ESMYFN + +LVDQPDGVKPIGCKWIYKRKRG DG      
Subjt:  ELGMPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKRGVDG------

Query:  --------------------------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFD
                                  KSI IL +I AY+DYE+WQMDVKTAFLNG L+ETIYM QP+GFI  GQEQK+C+L+RSIYGLKQASRSWNI FD
Subjt:  --------------------------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFD

Query:  EAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL-------------------------------
         AIKSYGFDQ V+EPCVYK+ ++K+VAFLV+YVD ILL GN++  LTD+K+WLA+QFQMKDLGEAQ+VL                               
Subjt:  EAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL-------------------------------

Query:  ---------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILT
                                               GS MY MLCTRPDICYAVGIV RYQ N GL H   VK ILKYLRRTR+Y LVYG  DLILT
Subjt:  ---------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILT

Query:  GYT----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK
        GYT          R ST   + IL       +  WRS      ++GCI DSTMEAEYVA  ++ ++
Subjt:  GYT----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK

KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]; e-value: 1.4e-210; % identity: 60.03
Query:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH
        MD  FQDYMIE GI SQLSAPG PQ+NG+SERRNRTLLDMVRSMMSYA+LP SFWGYAVETAV+ILNNVPSKSV ETPFELW GRK SL HFRIWG P H
Subjt:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH

Query:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELG
        VLV+NPKKLEPRS+LC FVGYPKETRGGLF+DP+ENRV VSTNATFLEEDH+R+H PRSK+VL+E  ++     +D    S+ V + +TS Q   SQ L 
Subjt:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELG

Query:  MPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKR-------------
        MPRRSGRVV QP+RY+GL ET VV PDD  EDPL+Y  AM DVDKD+W+KAMD EMESMYFN + ELVD P+GVKPIGCKWIYKRKR             
Subjt:  MPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAI
                 GVD           KSI IL +I  +YDYE+WQMDVKTAFLNG L+E+I+M QP+GFI QGQEQKVC+L+RSIYGLKQASRSWNI FD AI
Subjt:  ---------GVDG----------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAI

Query:  KSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL----------------------------------
        KSYGFDQNV+EPCVYKK     VAFLV+YVD ILL GN+V +LTDVK WLA+QFQMKDLGEAQYVL                                  
Subjt:  KSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL----------------------------------

Query:  ------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYT
                                            GS MY MLCTRPDICYAVGIV RYQ N GLDH   VK +LKYLRRTR+Y LVYG  DLILTGYT
Subjt:  ------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYT

Query:  ----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK
                  R ST   +  L       +  WRS      ++GCI DSTMEAEYVA  ++ ++
Subjt:  ----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK

KAA0062410.1 gag/pol protein [Cucumis melo var. makuwa]; e-value: 2.1e-206; % identity: 63.19
Query:  EFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTHVLV
        +F++Y  E      L  PG PQ+NG+SERRNRTLLDMVRSMMSYA+LP SFWGYAV+TAV+ILNNVPSKSV ETPFELW GRK SL HFRIWG P HVLV
Subjt:  EFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTHVLV

Query:  SNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELGMPR
        +NPKKLEPRS+LC F+GYPKETRGGLF+DP+ENRV + TNATFLEEDH+RDH P+SK+VLNE   D     +D    S+ V + +TS Q   SQ L MPR
Subjt:  SNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELGMPR

Query:  RSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKRGVDGKSIEILSAIVAY
        RSGR V QP+ Y+GL ET VV PDD  EDPL+Y  A  DVDKD+W+KAMD +MESMYFN M ELVD P+GVKPIGCKWIYKRK+   GKSI IL +I  +
Subjt:  RSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKRGVDGKSIEILSAIVAY

Query:  YDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILL
        YDYE+WQMDVKTAFLN  L+E+I+M QP+GFI QGQEQKVC+L++SIYGLKQASRSWNI FD AIKSYGFDQNV+EPCVYKK     VAFLV+YVD ILL
Subjt:  YDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILL

Query:  NGNEVEFLTDVKRWLASQFQMKDLGEAQYVL-------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDH
         GN+V +LTDVK WLA+QFQMKDLGE QYVL                                     GS MY MLCTRPDICYAVGIV RYQ N GLDH
Subjt:  NGNEVEFLTDVKRWLASQFQMKDLGEAQYVL-------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDH

Query:  RDTVKAILKYLRRTRNYNLVYGGGDLILTGYT----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK
          TVK I KYLRRTR+Y LVY   DLILTGYT          R ST R +  L       +  WRS      ++GCI DSTMEAEYVA  ++ ++
Subjt:  RDTVKAILKYLRRTRNYNLVYGGGDLILTGYT----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK
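
Each alignment block above carries a middle line in which a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, assuming that convention holds for this page, the counts behind a percent-identity figure could be tallied as below. The helper name `midline_stats` is hypothetical, and the figures reported on the page may be computed over a different alignment length, so this is illustrative only.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment row.

    Assumes the BLAST-style convention: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query row")
    # Trailing spaces are often stripped from the midline; pad them back.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# First 30 columns of the first GenBank alignment shown above.
query = "MDTEFQDYMIERGITSQLSAPGMPQKNGIS"
midline = "MD  FQDYMIE GI SQLSAPG PQ+NG+S"

identities, positives, columns = midline_stats(query, midline)
print(f"identities {identities}/{columns} = {100 * identities / columns:.2f}%")
print(f"positives  {positives}/{columns} = {100 * positives / columns:.2f}%")
```

Summing the three counts across all rows of a hit gives the totals for the whole alignment.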

TrEMBL top hits (e-value, % identity, alignment)
A0A5A7T2V9 Gag/pol protein; e-value: 2.3e-211; % identity: 60.33
Query:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH
        MD  FQDYMIE GI SQLSAPG PQ+NG+SERRNRTLLDMVRSMMSYA+LP SFWGYAVETAV+ILNNVPSKSV ETPFELW GRK SL HFRIWG P H
Subjt:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH

Query:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELG
        VLV+NPKKLEPRS+LC FVGYPKETRGGLF+DPKENRV VSTNATFLEEDH+R+H PRSK+VL+E  ++     +D    S+ V + +TS Q   SQ L 
Subjt:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELG

Query:  MPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKR-------------
        MPRRSGRVV QP+RY+GL ET VV PDD  EDPL+Y  AM DVDKD+W+KAMD EMESMYFN + ELVD P+GVKPIGCKWIYKRKR             
Subjt:  MPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAI
                 GVD           KSI IL +I  +YDYE+WQMDVKTAFLNG L+E+I+M QP+GFI QGQEQKVC+L+RSIYGLKQASRSWNI FD AI
Subjt:  ---------GVDG----------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAI

Query:  KSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL----------------------------------
        KSYGFDQNV+EPCVYKK     VAFLV+YVD ILL GN+V +LTDVK WLA+QFQMKDLGE QYVL                                  
Subjt:  KSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL----------------------------------

Query:  ------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYT
                                            GS MY MLCTRPDICYAVGIV RYQ N GLDH   VK ILKYLRRTR+Y LVYG  DLILTGYT
Subjt:  ------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYT

Query:  ----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK
                  R ST R +  L       +  WRS      ++GCI DSTMEAEYVA  ++ ++
Subjt:  ----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK

A0A5A7TWB9 Gag/pol protein; e-value: 1.3e-206; % identity: 58.11
Query:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH
        MD +FQ+Y++E GI SQLSAPG PQ+NG+SERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAVYILN VPSKSV ETP +LW+GRKGSL HFRIWG P H
Subjt:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH

Query:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKE---EFIDGASTSTSVVDPSTSSQI-RSQ
        VL +NPKKLEPRSKLCLFVGYPK TRGG FYDPK+N+V VSTNATFLEEDH+R+H PRSKIVLNE+ ++  E     ++  S  T VV   +S++  + Q
Subjt:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKE---EFIDGASTSTSVVDPSTSSQI-RSQ

Query:  ELGMPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKRGVDG------
         L  PRRSGRV   P RYM L ET  V  D D EDPLT+  AM DVDKDEWIKAM+ E+ESMYFN + +LVDQPDGVKPIGCKWIYKRKRG DG      
Subjt:  ELGMPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKRGVDG------

Query:  --------------------------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFD
                                  KSI IL +I AY+DYE+WQMDVKTAFLNG L+ETIYM QP+GFI  GQEQK+C+L+RSIYGLKQASRSWNI FD
Subjt:  --------------------------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFD

Query:  EAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL-------------------------------
         AIKSYGFDQ V+EPCVYK+ ++K+VAFLV+YVD ILL GN++  LTD+K+WLA+QFQMKDLGEAQ+VL                               
Subjt:  EAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL-------------------------------

Query:  ---------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILT
                                               GS MY MLCTRPDICYAVGIV RYQ N GL H   VK ILKYLRRTR+Y LVYG  DLILT
Subjt:  ---------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILT

Query:  GYT----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK
        GYT          R ST   + IL       +  WRS      ++GCI DSTMEAEYVA  ++ ++
Subjt:  GYT----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK

A0A5A7TZD0 Gag/pol protein; e-value: 6.8e-211; % identity: 60.03
Query:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH
        MD  FQDYMIE GI SQLSAPG PQ+NG+SERRNRTLLDMVRSMMSYA+LP SFWGYAVETAV+ILNNVPSKSV ETPFELW GRK SL HFRIWG P H
Subjt:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH

Query:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELG
        VLV+NPKKLEPRS+LC FVGYPKETRGGLF+DP+ENRV VSTNATFLEEDH+R+H PRSK+VL+E  ++     +D    S+ V + +TS Q   SQ L 
Subjt:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELG

Query:  MPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKR-------------
        MPRRSGRVV QP+RY+GL ET VV PDD  EDPL+Y  AM DVDKD+W+KAMD EMESMYFN + ELVD P+GVKPIGCKWIYKRKR             
Subjt:  MPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAI
                 GVD           KSI IL +I  +YDYE+WQMDVKTAFLNG L+E+I+M QP+GFI QGQEQKVC+L+RSIYGLKQASRSWNI FD AI
Subjt:  ---------GVDG----------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAI

Query:  KSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL----------------------------------
        KSYGFDQNV+EPCVYKK     VAFLV+YVD ILL GN+V +LTDVK WLA+QFQMKDLGEAQYVL                                  
Subjt:  KSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL----------------------------------

Query:  ------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYT
                                            GS MY MLCTRPDICYAVGIV RYQ N GLDH   VK +LKYLRRTR+Y LVYG  DLILTGYT
Subjt:  ------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYT

Query:  ----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK
                  R ST   +  L       +  WRS      ++GCI DSTMEAEYVA  ++ ++
Subjt:  ----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK

A0A5A7UYE8 Gag/pol protein; e-value: 6.8e-211; % identity: 60.03
Query:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH
        MD  FQDYMIE GI SQLSAPG PQ+NG+SERRNRTLLDMVRSMMSYA+LP SFWGYAVETAV+ILNNVPSKSV ETPFELW GRK SL HFRIWG P H
Subjt:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTH

Query:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELG
        VLV+NPKKLEPRS+LC FVGYPKETRGGLF+DP+ENRV VSTNATFLEEDH+R+H PRSK+VL+E  ++     +D    S+ V + +TS Q   SQ L 
Subjt:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELG

Query:  MPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKR-------------
        MPRRSGRVV QP+RY+GL ET VV PDD  EDPL+Y  AM DVDKD+W+KAMD EMESMYFN + ELVD P+GVKPIGCKWIYKRKR             
Subjt:  MPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKR-------------

Query:  ---------GVDG----------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAI
                 GVD           KSI IL +I  +YDYE+WQMDVKTAFLNG L+E+I+M QP+GFI QGQEQKVC+L+RSIYGLKQASRSWNI FD AI
Subjt:  ---------GVDG----------KSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAI

Query:  KSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL----------------------------------
        KSYGFDQNV+EPCVYKK     VAFLV+YVD ILL GN+V +LTDVK WLA+QFQMKDLGEAQYVL                                  
Subjt:  KSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL----------------------------------

Query:  ------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYT
                                            GS MY MLCTRPDICYAVGIV RYQ N GLDH   VK +LKYLRRTR+Y LVYG  DLILTGYT
Subjt:  ------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYT

Query:  ----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK
                  R ST   +  L       +  WRS      ++GCI DSTMEAEYVA  ++ ++
Subjt:  ----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK

A0A5D3DS50 Gag/pol protein; e-value: 1.0e-206; % identity: 63.19
Query:  EFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTHVLV
        +F++Y  E      L  PG PQ+NG+SERRNRTLLDMVRSMMSYA+LP SFWGYAV+TAV+ILNNVPSKSV ETPFELW GRK SL HFRIWG P HVLV
Subjt:  EFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTHVLV

Query:  SNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELGMPR
        +NPKKLEPRS+LC F+GYPKETRGGLF+DP+ENRV + TNATFLEEDH+RDH P+SK+VLNE   D     +D    S+ V + +TS Q   SQ L MPR
Subjt:  SNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIR-SQELGMPR

Query:  RSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKRGVDGKSIEILSAIVAY
        RSGR V QP+ Y+GL ET VV PDD  EDPL+Y  A  DVDKD+W+KAMD +MESMYFN M ELVD P+GVKPIGCKWIYKRK+   GKSI IL +I  +
Subjt:  RSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKRGVDGKSIEILSAIVAY

Query:  YDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILL
        YDYE+WQMDVKTAFLN  L+E+I+M QP+GFI QGQEQKVC+L++SIYGLKQASRSWNI FD AIKSYGFDQNV+EPCVYKK     VAFLV+YVD ILL
Subjt:  YDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILL

Query:  NGNEVEFLTDVKRWLASQFQMKDLGEAQYVL-------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDH
         GN+V +LTDVK WLA+QFQMKDLGE QYVL                                     GS MY MLCTRPDICYAVGIV RYQ N GLDH
Subjt:  NGNEVEFLTDVKRWLASQFQMKDLGEAQYVL-------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDH

Query:  RDTVKAILKYLRRTRNYNLVYGGGDLILTGYT----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK
          TVK I KYLRRTR+Y LVY   DLILTGYT          R ST R +  L       +  WRS      ++GCI DSTMEAEYVA  ++ ++
Subjt:  RDTVKAILKYLRRTRNYNLVYGGGDLILTGYT----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK

SwissProt top hits (e-value, % identity, alignment)
P04146 Copia protein; e-value: 4.0e-51; % identity: 22.64
Query:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCE---TPFELWSGRKGSLHHFRIWGY
        +  E + + +++GI+  L+ P  PQ NG+SER  RT+ +  R+M+S A+L  SFWG AV TA Y++N +PS+++ +   TP+E+W  +K  L H R++G 
Subjt:  MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCE---TPFELWSGRKGSLHHFRIWGY

Query:  PTHVLVSNPK-KLEPRSKLCLFVGYPKETRGGLFYD-------------------------------------------PKENRVLVST----------N
          +V + N + K + +S   +FVGY  E  G   +D                                           P ++R ++ T          N
Subjt:  PTHVLVSNPK-KLEPRSKLCLFVGYPKETRGGLFYD-------------------------------------------PKENRVLVST----------N

Query:  ATFLEE-------------------------------DHVRDHLPRSKIVLNEMDEDIKEEFID---GASTSTSVVDPSTSSQIRSQELGMP--------
          FL++                                 ++D    +K  LNE  +  +++ ++   G+       +  T+  ++   +  P        
Subjt:  ATFLEE-------------------------------DHVRDHLPRSKIVLNEMDEDIKEEFID---GASTSTSVVDPSTSSQIRSQELGMP--------

Query:  --RRSGRVVRQP-----DRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRK---------
          RRS R+  +P     +    L +  + A     + P ++D      DK  W +A++ E+ +   N    +  +P+    +  +W++  K         
Subjt:  --RRSGRVVRQP-----DRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRK---------

Query:  -------RGVDGK----------------SIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIM
               RG   K                S   + ++V  Y+ +V QMDVKTAFLNG L E IYM  P+G         VC+L+++IYGLKQA+R W  +
Subjt:  -------RGVDGK----------------SIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIM

Query:  FDEAIKSYGFDQNVEEPCVY---KKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQY----------------------------
        F++A+K   F  +  + C+Y   K  +++ + ++++YVD +++   ++  + + KR+L  +F+M DL E ++                            
Subjt:  FDEAIKSYGFDQNVEEPCVY---KKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQY----------------------------

Query:  ---------------------------------VLGSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYTR
                                         ++G  MY+MLCTRPD+  AV I+ RY      +    +K +L+YL+ T +  L++      L    +
Subjt:  ---------------------------------VLGSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYTR

Query:  ISTFRPIRILGNPLWSPS--------FEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRKLFGL
        I  +      G+ +   S        F++        ++  +  S+ EAEY+AL ++ R+   L
Subjt:  ISTFRPIRILGNPLWSPS--------FEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRKLFGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e-value: 2.7e-84; % identity: 30.62
Query:  EFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVC-ETPFELWSGRKGSLHHFRIWGYP--TH
        EF++Y    GI  + + PG PQ NG++ER NRT+++ VRSM+  A+LP SFWG AV+TA Y++N  PS  +  E P  +W+ ++ S  H +++G     H
Subjt:  EFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVC-ETPFELWSGRKGSLHHFRIWGYP--TH

Query:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIRSQELG-
        V      KL+ +S  C+F+GY  E  G   +DP + +V+ S +  F  E  VR           +M E +K   I    T  S  +  TS++  + E+  
Subjt:  VLVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIRSQELG-

Query:  MPRRSGRVVRQPDRY-MGLAE-----------------------------TSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQ
           + G V+ Q ++   G+ E                             T  V   DD  +P +    +   +K++ +KAM +EMES+  N   +LV+ 
Subjt:  MPRRSGRVVRQPDRY-MGLAE-----------------------------TSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQ

Query:  PDGVKPIGCKWIYKRKRGVDGK--------------------------------SIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQG
        P G +P+ CKW++K K+  D K                                SI  + ++ A  D EV Q+DVKTAFL+G L+E IYM+QP+GF   G
Subjt:  PDGVKPIGCKWIYKRKRGVDGK--------------------------------SIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQG

Query:  QEQKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVY-KKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL---
        ++  VC+L++S+YGLKQA R W + FD  +KS  + +   +PCVY K+  +     L++YVD +L+ G +   +  +K  L+  F MKDLG AQ +L   
Subjt:  QEQKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVY-KKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL---

Query:  -------------------------------------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHR
                                                                           GS MY M+CTRPDI +AVG+V R+  N G +H 
Subjt:  -------------------------------------------------------------------GSPMYVMLCTRPDICYAVGIVGRYQPNQGLDHR

Query:  DTVKAILKYLRRTRNYNLVYGGGDLILTGYTRISTFRPI-RILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRKLFGL
        + VK IL+YLR T    L +GG D IL GYT       I     +  +  +F   + S     + C+  ST EAEY+A  ++ +++  L
Subjt:  DTVKAILKYLRRTRNYNLVYGGGDLILTGYTRISTFRPI-RILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRKLFGL

P25600 Putative transposon Ty5-1 protein YCL074W; e-value: 4.5e-18; % identity: 26.69
Query:  MDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEF
        MDV TAFLN  +DE IY+ QP GF+ +     V  L+  +YGLKQA   WN   +  +K  GF ++  E  +Y ++      ++ +YVD +L+     + 
Subjt:  MDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEF

Query:  LTDVKRWLASQFQMKDLGEAQYVLG-----------------------------------------SPMY-------------------VMLCT---RPD
           VK+ L   + MKDLG+    LG                                          P++                   ++ C    RPD
Subjt:  LTDVKRWLASQFQMKDLGEAQYVLG-----------------------------------------SPMY-------------------VMLCT---RPD

Query:  ICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVY-GGGDLILTGY
        I Y V ++ R+       H ++ + +L+YL  TR+  L Y  G  L LT Y
Subjt:  ICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVY-GGGDLILTGY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e-value: 3.5e-39; % identity: 22.11
Query:  DYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSV-CETPFELWSGRKGSLHHFRIWG---YPTHVL
        +Y  + GI+   S P  P+ NG+SER++R +++   +++S+A +P ++W YA   AVY++N +P+  +  E+PF+   G   +    R++G   YP  + 
Subjt:  DYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSV-CETPFELWSGRKGSLHHFRIWG---YPTHVL

Query:  VSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLE---------------EDHVRDH----------------LP--------------
          N  KL+ +S+ C+F+GY       L    + +R+ +S +  F E               ++  R+                 LP              
Subjt:  VSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLE---------------EDHVRDH----------------LP--------------

Query:  --------RSKIVLNEMDEDIKEEFIDG-------------------------ASTSTSVVDPSTSSQIR-SQELGMPRRSGRVVRQPDRYMGLAETSVV
                 S++  + +D      F                            +S +TS  +P+  S  + +Q L  P +S      P      + TS  
Subjt:  --------RSKIVLNEMDEDIKEEFIDG-------------------------ASTSTSVVDPSTSSQIR-SQELGMPRRSGRVVRQPDRYMGLAETSVV

Query:  AP--------------DDDCEDPLTYDHAM------------------------------VDVDKDE-WIKAMDQEMESMYFNCMRELVDQPDG-VKPIG
         P              +++ + PL   H+M                              +   KDE W  AM  E+ +   N   +LV  P   V  +G
Subjt:  AP--------------DDDCEDPLTYDHAM------------------------------VDVDKDE-WIKAMDQEMESMYFNCMRELVDQPDG-VKPIG

Query:  CKWIYKRKRGVDGK--------------------------------SIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRL
        C+WI+ +K   DG                                 SI I+  +     + + Q+DV  AFL G L + +YM QP GFI + +   VC+L
Subjt:  CKWIYKRKRGVDGK--------------------------------SIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIAQGQEQKVCRL

Query:  HRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVLG-----------
         +++YGLKQA R+W +     + + GF  +V +  ++     K++ ++++YVD IL+ GN+   L +    L+ +F +KD  E  Y LG           
Subjt:  HRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVLG-----------

Query:  ----------------------------SP----------------------MYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNL
                                    SP                      +  +  TRPDI YAV  + ++      +H   +K IL+YL  T N+ +
Subjt:  ----------------------------SP----------------------MYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNL

Query:  -VYGGGDLILTGYT----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRKL
         +  G  L L  Y+           +ST   I  LG+        W S    K Q+G ++ S+ EAEY ++  +  ++
Subjt:  -VYGGGDLILTGYT----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRKL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e-value: 2.3e-38; % identity: 22.53
Query:  QDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSV-CETPFELWSGRKGSLHHFRIWG---YPTHV
        +DY+ + GI+   S P  P+ NG+SER++R +++M  +++S+A +P ++W YA   AVY++N +P+  +  ++PF+   G+  +    +++G   YP  +
Subjt:  QDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSV-CETPFELWSGRKGSLHHFRIWG---YPTHV

Query:  LVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEE---------------------------------------------DHV----
           N  KLE +SK C F+GY       L       R+  S +  F E                                               H+    
Subjt:  LVSNPKKLEPRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEE---------------------------------------------DHV----

Query:  ---------------RDHLPRSKI---------------------------------VLNEMDED-----------------IKEEFIDGASTSTSVVD-
                         +LP S I                                 +LN  + +                 I    I   STS S  + 
Subjt:  ---------------RDHLPRSKI---------------------------------VLNEMDED-----------------IKEEFIDGASTSTSVVD-

Query:  PSTSS----------------QIRSQ----ELGMPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMREL
        PS+SS                Q+ +Q       M  R+   +R+P++    A TS+ A      +P T   AM D   D W +AM  E+ +   N   +L
Subjt:  PSTSS----------------QIRSQ----ELGMPRRSGRVVRQPDRYMGLAETSVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMREL

Query:  V-DQPDGVKPIGCKWIYKRKRGVDGK--------------------------------SIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGF
        V   P  V  +GC+WI+ +K   DG                                 SI I+  +     + + Q+DV  AFL G L + +YM QP GF
Subjt:  V-DQPDGVKPIGCKWIYKRKRGVDGK--------------------------------SIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGF

Query:  IAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL
        + + +   VCRL ++IYGLKQA R+W +     + + GF  ++ +  ++     +++ ++++YVD IL+ GN+   L      L+ +F +K+  +  Y L
Subjt:  IAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVL

Query:  G---------------------------------------SP----------------------MYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAI
        G                                       SP                      +  +  TRPD+ YAV  + +Y      DH + +K +
Subjt:  G---------------------------------------SP----------------------MYVMLCTRPDICYAVGIVGRYQPNQGLDHRDTVKAI

Query:  LKYLRRTRNYNL-VYGGGDLILTGYT----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRKL
        L+YL  T ++ + +  G  L L  Y+           +ST   I  LG+        W S    K Q+G ++ S+ EAEY ++  +  +L
Subjt:  LKYLRRTRNYNL-VYGGGDLILTGYT----------RISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRKL

Arabidopsis top hits | e value | % identity | Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 | 1.1e-30 | 32.22
Query:  EDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKRGVDG--------------------------------KSIEILS
        ++P TY+ A   +    W  AMD E+ +M      E+   P   KPIGCKW+YK K   DG                                 S++++ 
Subjt:  EDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKRGVDG--------------------------------KSIEILS

Query:  AIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIA-QGQE---QKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFL
        AI A Y++ + Q+D+  AFLNG LDE IYM  P G+ A QG       VC L +SIYGLKQASR W + F   +  +GF Q+  +   + K        +
Subjt:  AIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQPKGFIA-QGQE---QKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFL

Query:  VMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVLG-----SPMYVMLCTRP---DICYAVGIVG
        ++YVD I++  N    + ++K  L S F+++DLG  +Y LG     S   + +C R    D+    G++G
Subjt:  VMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVLG-----SPMYVMLCTRP---DICYAVGIVG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein | 3.1e-06 | 34.15
Query:  NRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSV-CETPFELWSGRKGSLHHFRIWGYPTHVLVSNPKKLEPRSK
        NRT+++ VRSM+    LP +F   A  TAV+I+N  PS ++    P E+W     +  + R +G   ++   +  KL+PR+K
Subjt:  NRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSV-CETPFELWSGRKGSLHHFRIWGYPTHVLVSNPKKLEPRSK


Sequences
CDS sequence
ATGGACACTGAATTCCAGGACTATATGATAGAACGCGGAATCACGTCCCAACTCTCAGCACCTGGTATGCCACAGAAGAATGGTATATCAGAGAGGAGAAACAGAACCTT
GTTGGACATGGTTCGGTCGATGATGAGCTATGCTCGTCTCCCTGATTCTTTTTGGGGTTACGCAGTGGAGACTGCGGTTTATATTTTGAACAACGTTCCGTCGAAGAGTG
TTTGTGAAACACCTTTCGAGCTCTGGAGTGGACGTAAAGGCAGTTTACATCACTTTAGAATTTGGGGATACCCGACCCACGTGTTGGTGTCAAACCCGAAAAAGTTGGAA
CCCCGTTCAAAATTGTGCCTATTCGTAGGTTACCCTAAAGAGACTAGGGGTGGTCTATTTTACGATCCTAAGGAAAATAGGGTGCTTGTGTCGACAAACGCCACTTTCCT
AGAAGAAGACCACGTCAGGGATCATTTACCAAGGAGTAAAATTGTGTTAAATGAAATGGATGAAGACATCAAAGAAGAGTTCATTGATGGGGCTAGTACGTCAACAAGTG
TTGTTGATCCTAGCACGTCTAGTCAAATCCGTTCCCAAGAGTTGGGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCCTGATCGCTACATGGGTTTAGCTGAAACC
TCAGTTGTCGCTCCTGATGATGACTGTGAGGATCCATTGACCTATGATCATGCAATGGTTGATGTTGACAAAGACGAATGGATTAAAGCTATGGATCAGGAAATGGAGTC
TATGTACTTCAATTGCATGCGGGAGCTTGTGGATCAACCGGATGGGGTAAAACCTATTGGTTGCAAATGGATCTACAAGCGTAAGCGTGGCGTAGATGGGAAGTCGATCG
AGATTCTTTCCGCCATTGTTGCGTATTATGACTACGAGGTATGGCAGATGGACGTCAAGACAGCCTTTCTGAATGGCAAACTTGATGAGACCATCTACATGGACCAGCCC
AAAGGGTTCATTGCCCAAGGCCAAGAGCAAAAGGTTTGTCGGCTTCATAGGTCCATTTATGGGCTGAAACAAGCTTCGAGGTCTTGGAACATAATGTTTGATGAGGCGAT
CAAATCTTATGGCTTTGATCAAAATGTCGAAGAGCCTTGTGTCTACAAGAAAACCGTTGACAAGACTGTCGCATTTTTGGTGATGTATGTGGACGCTATTCTTCTCAATG
GGAATGAGGTAGAATTTCTTACTGACGTTAAAAGGTGGCTAGCTTCGCAATTCCAAATGAAAGATTTGGGAGAAGCTCAGTACGTTCTAGGGAGCCCGATGTATGTCATG
TTGTGTACTAGGCCCGACATCTGTTATGCAGTAGGAATTGTCGGTAGATATCAGCCCAATCAAGGATTAGATCACCGGGACACCGTGAAGGCAATCCTCAAGTATCTTAG
GAGAACGAGGAACTACAACTTAGTGTATGGCGGTGGGGATTTGATCCTCACGGGATACACACGGATTTCGACTTTTCGACCGATAAGGATTCTAGGAAATCCACTTTGGT
CACCTTCATTCGAATGGAGGAGTTGTAGTATGGCGAAGCATCAGCGAGGATGCATCATCGATTCCACTATGGAAGCGGAGTATGTTGCGCTTGTGAAGTCGCAAAGGAAG
TTGTTTGGCTTAGGAAGTTCATGA
mRNA sequence
ATGGACACTGAATTCCAGGACTATATGATAGAACGCGGAATCACGTCCCAACTCTCAGCACCTGGTATGCCACAGAAGAATGGTATATCAGAGAGGAGAAACAGAACCTT
GTTGGACATGGTTCGGTCGATGATGAGCTATGCTCGTCTCCCTGATTCTTTTTGGGGTTACGCAGTGGAGACTGCGGTTTATATTTTGAACAACGTTCCGTCGAAGAGTG
TTTGTGAAACACCTTTCGAGCTCTGGAGTGGACGTAAAGGCAGTTTACATCACTTTAGAATTTGGGGATACCCGACCCACGTGTTGGTGTCAAACCCGAAAAAGTTGGAA
CCCCGTTCAAAATTGTGCCTATTCGTAGGTTACCCTAAAGAGACTAGGGGTGGTCTATTTTACGATCCTAAGGAAAATAGGGTGCTTGTGTCGACAAACGCCACTTTCCT
AGAAGAAGACCACGTCAGGGATCATTTACCAAGGAGTAAAATTGTGTTAAATGAAATGGATGAAGACATCAAAGAAGAGTTCATTGATGGGGCTAGTACGTCAACAAGTG
TTGTTGATCCTAGCACGTCTAGTCAAATCCGTTCCCAAGAGTTGGGAATGCCTCGACGTAGTGGGAGGGTTGTGAGACAGCCTGATCGCTACATGGGTTTAGCTGAAACC
TCAGTTGTCGCTCCTGATGATGACTGTGAGGATCCATTGACCTATGATCATGCAATGGTTGATGTTGACAAAGACGAATGGATTAAAGCTATGGATCAGGAAATGGAGTC
TATGTACTTCAATTGCATGCGGGAGCTTGTGGATCAACCGGATGGGGTAAAACCTATTGGTTGCAAATGGATCTACAAGCGTAAGCGTGGCGTAGATGGGAAGTCGATCG
AGATTCTTTCCGCCATTGTTGCGTATTATGACTACGAGGTATGGCAGATGGACGTCAAGACAGCCTTTCTGAATGGCAAACTTGATGAGACCATCTACATGGACCAGCCC
AAAGGGTTCATTGCCCAAGGCCAAGAGCAAAAGGTTTGTCGGCTTCATAGGTCCATTTATGGGCTGAAACAAGCTTCGAGGTCTTGGAACATAATGTTTGATGAGGCGAT
CAAATCTTATGGCTTTGATCAAAATGTCGAAGAGCCTTGTGTCTACAAGAAAACCGTTGACAAGACTGTCGCATTTTTGGTGATGTATGTGGACGCTATTCTTCTCAATG
GGAATGAGGTAGAATTTCTTACTGACGTTAAAAGGTGGCTAGCTTCGCAATTCCAAATGAAAGATTTGGGAGAAGCTCAGTACGTTCTAGGGAGCCCGATGTATGTCATG
TTGTGTACTAGGCCCGACATCTGTTATGCAGTAGGAATTGTCGGTAGATATCAGCCCAATCAAGGATTAGATCACCGGGACACCGTGAAGGCAATCCTCAAGTATCTTAG
GAGAACGAGGAACTACAACTTAGTGTATGGCGGTGGGGATTTGATCCTCACGGGATACACACGGATTTCGACTTTTCGACCGATAAGGATTCTAGGAAATCCACTTTGGT
CACCTTCATTCGAATGGAGGAGTTGTAGTATGGCGAAGCATCAGCGAGGATGCATCATCGATTCCACTATGGAAGCGGAGTATGTTGCGCTTGTGAAGTCGCAAAGGAAG
TTGTTTGGCTTAGGAAGTTCATGA
Protein sequence
MDTEFQDYMIERGITSQLSAPGMPQKNGISERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGYPTHVLVSNPKKLE
PRSKLCLFVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKIVLNEMDEDIKEEFIDGASTSTSVVDPSTSSQIRSQELGMPRRSGRVVRQPDRYMGLAET
SVVAPDDDCEDPLTYDHAMVDVDKDEWIKAMDQEMESMYFNCMRELVDQPDGVKPIGCKWIYKRKRGVDGKSIEILSAIVAYYDYEVWQMDVKTAFLNGKLDETIYMDQP
KGFIAQGQEQKVCRLHRSIYGLKQASRSWNIMFDEAIKSYGFDQNVEEPCVYKKTVDKTVAFLVMYVDAILLNGNEVEFLTDVKRWLASQFQMKDLGEAQYVLGSPMYVM
LCTRPDICYAVGIVGRYQPNQGLDHRDTVKAILKYLRRTRNYNLVYGGGDLILTGYTRISTFRPIRILGNPLWSPSFEWRSCSMAKHQRGCIIDSTMEAEYVALVKSQRK
LFGLGSS