; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007622 (gene) of Snake gourd v1 genome

Gene IDTan0007622
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG10:40465609..40467609
RNA-Seq ExpressionTan0007622
SyntenyTan0007622
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033228.1 gag/pol protein [Cucumis melo var. makuwa]2.7e-13157.14Show/hide
Query:  AKEIMESLREIHDDPLPSGRDERASIDELSQNFQSLHRVKESEANVAYRSYHRGSTSGTKRVAPSRLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFH
        A++IM+SLR++   P         SI           ++K+ +ANVAY      S+SG++++       K++  +GK   +AA NGK   ++A KG CFH
Subjt:  AKEIMESLREIHDDPLPSGRDERASIDELSQNFQSLHRVKESEANVAYRSYHRGSTSGTKRVAPSRLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFH

Query:  CNGGGHWKRNCPKFLAERKNQG------------------------------------HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSG
        CN   HWK NCPK+L ++K +G                                    HINL+RIE+LVK+GLLNE E++SLP CESCLEGKMTKRPF+ 
Subjt:  CNGGGHWKRNCPKFLAERKNQG------------------------------------HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSG

Query:  KGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDR---------------------
        KGYRAKEPLEL+H DL GPMN KARGG+EYF+SFIDDYS YGY+YLM  K E LEKFKEYK EVENLL K +K LRSDR                     
Subjt:  KGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDR---------------------

Query:  --APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCL
          APG PQQN VS RRNRTLLDMV SMMSYA+LP SFWGYAVET V+ILNNVPSKSV ETPFELW GRK SL HF+IWGC  HVLV+NPKKLEPR  LC 
Subjt:  --APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCL

Query:  FVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNE
        FVGYPKETRGGLF+DP+EN+V VS NA F+EEDH+RDH PRSK+VLNE
Subjt:  FVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNE

KAA0060254.1 gag/pol protein [Cucumis melo var. makuwa]9.3e-14066.08Show/hide
Query:  ASIDELSQNFQSLHRVKESEANVA--YRSYHRGSTSGTKRVAPSRLKGKKRMKRG----KTDRVAAHNGKKVKEIAEKGNCFHCNGGGHWKRNCPKFLAE
        A IDE SQ        ++ EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K  A KG CFH N  GHWKRNCPK+LAE
Subjt:  ASIDELSQNFQSLHRVKESEANVA--YRSYHRGSTSGTKRVAPSRLKGKKRMKRG----KTDRVAAHNGKKVKEIAEKGNCFHCNGGGHWKRNCPKFLAE

Query:  RK--NQGHINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKF
        +K   QGHINLNRIE+LVK+G+L+E EENSLP+CESCLEGKMTKRPF+GKG+RAKEPLELVHSDL GPMN KARG +EYF++F DDYSRYGY+YLM  K 
Subjt:  RK--NQGHINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKF

Query:  ETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNN
        E LEKFKEYK EVEN L K++KT RSDR                       APG PQQNGVS RRNRTLLDMVRSM+SYA LP+SFWGYAV+TAVYILN 
Subjt:  ETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNN

Query:  VPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNEM
        VPSKSV ETP +LW+GRKGSL HFRIWGCP HVL +NPKKLEPR KLCLFVGYPK TRGG FYD ++N+V V TNA F+E+DH+R+H PRSKIVLN++
Subjt:  VPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNEM

TYJ96910.1 gag/pol protein [Cucumis melo var. makuwa]3.1e-13562.89Show/hide
Query:  QNFQSLHRVK--ESEANVA--YRSYHRGSTSGTKRVAPS----RLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFHCNGGGHWKRNCPKFLAERKNQG
        Q F+SL ++K  + EANVA   R +HRGSTSGTK +  S    + K  K  ++ K +  AA   KK K  A KG  FHCN  GHWKRNCPK+LAE+K   
Subjt:  QNFQSLHRVK--ESEANVA--YRSYHRGSTSGTKRVAPS----RLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFHCNGGGHWKRNCPKFLAERKNQG

Query:  ------------------------HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSF
                                HINLNRIE+LV++GLL+E EEN LPVCESCLEGKMTKRPF+GKG+RAKEPLELVHSDL GPMN KARGG+EYF++F
Subjt:  ------------------------HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSF

Query:  IDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLP
         DDYSRYGY+YLM  K E LEKFKEYK EVEN L K++KT RSDR                       APG PQQNGVS RRN+TLLDMV SMMSYA LP
Subjt:  IDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLP

Query:  DSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDH
        +SFWGYAV+TAVYILN VPSKSV ETP +LW+GRKGSLHHFRI GCP HVL  N KKLEPR KLCLFVGY K +RGG FYDP++N+VLVSTNA F+EEDH
Subjt:  DSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDH

Query:  VRDHLPRSKIVLNEM
        +R+H PRSKIVLNE+
Subjt:  VRDHLPRSKIVLNEM

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-14267.09Show/hide
Query:  ASIDELSQNFQSLHRVKESEANVA--YRSYHRGSTSGTKRVAPSRLKGKKRMKRG----KTDRVAAHNGKKVKEIAEKGNCFHCNGGGHWKRNCPKFLAE
        A IDE SQ        ++ EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K  A KG CFH N  GHWKRNCPK+LAE
Subjt:  ASIDELSQNFQSLHRVKESEANVA--YRSYHRGSTSGTKRVAPSRLKGKKRMKRG----KTDRVAAHNGKKVKEIAEKGNCFHCNGGGHWKRNCPKFLAE

Query:  RK--NQGHINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKF
        +K   QGHINLNRIE+LVK+G+L+E EENSLP+CESCLEGKMTKRPF+GKG+RAKEPLELVHSDL GPMN KARG +EYF++F DDYSRYGY+YLM  K 
Subjt:  RK--NQGHINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKF

Query:  ETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNN
        E LEKFKEYK EVEN L K++KT RSDR                       APG PQQNGVS RRNRTLLDMVRSM+SYA LP+SFWGYAV+TAVYILN 
Subjt:  ETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNN

Query:  VPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNEM
        VPSKSV ETP +LW+GRKGSL HFRIWGCP HVL +NPKKLEPR KLCLFVGYPK TRGG FYDP++N+V VSTNA F+EEDH+R+H PRSKIVLNE+
Subjt:  VPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNEM

TYJ98809.1 gag/pol protein [Cucumis melo var. makuwa]2.7e-13157.14Show/hide
Query:  AKEIMESLREIHDDPLPSGRDERASIDELSQNFQSLHRVKESEANVAYRSYHRGSTSGTKRVAPSRLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFH
        A++IM+SLR++   P         SI           ++K+ +ANVAY      S+SG++++       K++  +GK   +AA NGK   ++A KG CFH
Subjt:  AKEIMESLREIHDDPLPSGRDERASIDELSQNFQSLHRVKESEANVAYRSYHRGSTSGTKRVAPSRLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFH

Query:  CNGGGHWKRNCPKFLAERKNQG------------------------------------HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSG
        CN   HWK NCPK+L ++K +G                                    HINL+RIE+LVK+GLLNE E++SLP CESCLEGKMTKRPF+ 
Subjt:  CNGGGHWKRNCPKFLAERKNQG------------------------------------HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSG

Query:  KGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDR---------------------
        KGYRAKEPLEL+H DL GPMN KARGG+EYF+SFIDDYS YGY+YLM  K E LEKFKEYK EVENLL K +K LRSDR                     
Subjt:  KGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDR---------------------

Query:  --APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCL
          APG PQQN VS RRNRTLLDMV SMMSYA+LP SFWGYAVET V+ILNNVPSKSV ETPFELW GRK SL HF+IWGC  HVLV+NPKKLEPR  LC 
Subjt:  --APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCL

Query:  FVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNE
        FVGYPKETRGGLF+DP+EN+V VS NA F+EEDH+RDH PRSK+VLNE
Subjt:  FVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNE

TrEMBL top hitse value%identityAlignment
A0A5A7SRR6 Gag/pol protein1.3e-13157.14Show/hide
Query:  AKEIMESLREIHDDPLPSGRDERASIDELSQNFQSLHRVKESEANVAYRSYHRGSTSGTKRVAPSRLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFH
        A++IM+SLR++   P         SI           ++K+ +ANVAY      S+SG++++       K++  +GK   +AA NGK   ++A KG CFH
Subjt:  AKEIMESLREIHDDPLPSGRDERASIDELSQNFQSLHRVKESEANVAYRSYHRGSTSGTKRVAPSRLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFH

Query:  CNGGGHWKRNCPKFLAERKNQG------------------------------------HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSG
        CN   HWK NCPK+L ++K +G                                    HINL+RIE+LVK+GLLNE E++SLP CESCLEGKMTKRPF+ 
Subjt:  CNGGGHWKRNCPKFLAERKNQG------------------------------------HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSG

Query:  KGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDR---------------------
        KGYRAKEPLEL+H DL GPMN KARGG+EYF+SFIDDYS YGY+YLM  K E LEKFKEYK EVENLL K +K LRSDR                     
Subjt:  KGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDR---------------------

Query:  --APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCL
          APG PQQN VS RRNRTLLDMV SMMSYA+LP SFWGYAVET V+ILNNVPSKSV ETPFELW GRK SL HF+IWGC  HVLV+NPKKLEPR  LC 
Subjt:  --APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCL

Query:  FVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNE
        FVGYPKETRGGLF+DP+EN+V VS NA F+EEDH+RDH PRSK+VLNE
Subjt:  FVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNE

A0A5A7UYX7 Gag/pol protein4.5e-14066.08Show/hide
Query:  ASIDELSQNFQSLHRVKESEANVA--YRSYHRGSTSGTKRVAPSRLKGKKRMKRG----KTDRVAAHNGKKVKEIAEKGNCFHCNGGGHWKRNCPKFLAE
        A IDE SQ        ++ EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K  A KG CFH N  GHWKRNCPK+LAE
Subjt:  ASIDELSQNFQSLHRVKESEANVA--YRSYHRGSTSGTKRVAPSRLKGKKRMKRG----KTDRVAAHNGKKVKEIAEKGNCFHCNGGGHWKRNCPKFLAE

Query:  RK--NQGHINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKF
        +K   QGHINLNRIE+LVK+G+L+E EENSLP+CESCLEGKMTKRPF+GKG+RAKEPLELVHSDL GPMN KARG +EYF++F DDYSRYGY+YLM  K 
Subjt:  RK--NQGHINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKF

Query:  ETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNN
        E LEKFKEYK EVEN L K++KT RSDR                       APG PQQNGVS RRNRTLLDMVRSM+SYA LP+SFWGYAV+TAVYILN 
Subjt:  ETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNN

Query:  VPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNEM
        VPSKSV ETP +LW+GRKGSL HFRIWGCP HVL +NPKKLEPR KLCLFVGYPK TRGG FYD ++N+V V TNA F+E+DH+R+H PRSKIVLN++
Subjt:  VPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNEM

A0A5D3BAN6 Gag/pol protein1.5e-13562.89Show/hide
Query:  QNFQSLHRVK--ESEANVA--YRSYHRGSTSGTKRVAPS----RLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFHCNGGGHWKRNCPKFLAERKNQG
        Q F+SL ++K  + EANVA   R +HRGSTSGTK +  S    + K  K  ++ K +  AA   KK K  A KG  FHCN  GHWKRNCPK+LAE+K   
Subjt:  QNFQSLHRVK--ESEANVA--YRSYHRGSTSGTKRVAPS----RLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFHCNGGGHWKRNCPKFLAERKNQG

Query:  ------------------------HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSF
                                HINLNRIE+LV++GLL+E EEN LPVCESCLEGKMTKRPF+GKG+RAKEPLELVHSDL GPMN KARGG+EYF++F
Subjt:  ------------------------HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSF

Query:  IDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLP
         DDYSRYGY+YLM  K E LEKFKEYK EVEN L K++KT RSDR                       APG PQQNGVS RRN+TLLDMV SMMSYA LP
Subjt:  IDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLP

Query:  DSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDH
        +SFWGYAV+TAVYILN VPSKSV ETP +LW+GRKGSLHHFRI GCP HVL  N KKLEPR KLCLFVGY K +RGG FYDP++N+VLVSTNA F+EEDH
Subjt:  DSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDH

Query:  VRDHLPRSKIVLNEM
        +R+H PRSKIVLNE+
Subjt:  VRDHLPRSKIVLNEM

A0A5D3BHG7 Gag/pol protein7.5e-14367.09Show/hide
Query:  ASIDELSQNFQSLHRVKESEANVA--YRSYHRGSTSGTKRVAPSRLKGKKRMKRG----KTDRVAAHNGKKVKEIAEKGNCFHCNGGGHWKRNCPKFLAE
        A IDE SQ        ++ EANVA   R +HRGSTSGTK +  S    K + K+G    K +  AA   KK K  A KG CFH N  GHWKRNCPK+LAE
Subjt:  ASIDELSQNFQSLHRVKESEANVA--YRSYHRGSTSGTKRVAPSRLKGKKRMKRG----KTDRVAAHNGKKVKEIAEKGNCFHCNGGGHWKRNCPKFLAE

Query:  RK--NQGHINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKF
        +K   QGHINLNRIE+LVK+G+L+E EENSLP+CESCLEGKMTKRPF+GKG+RAKEPLELVHSDL GPMN KARG +EYF++F DDYSRYGY+YLM  K 
Subjt:  RK--NQGHINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKF

Query:  ETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNN
        E LEKFKEYK EVEN L K++KT RSDR                       APG PQQNGVS RRNRTLLDMVRSM+SYA LP+SFWGYAV+TAVYILN 
Subjt:  ETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNN

Query:  VPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNEM
        VPSKSV ETP +LW+GRKGSL HFRIWGCP HVL +NPKKLEPR KLCLFVGYPK TRGG FYDP++N+V VSTNA F+EEDH+R+H PRSKIVLNE+
Subjt:  VPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNEM

A0A5D3BIJ7 Gag/pol protein1.3e-13157.14Show/hide
Query:  AKEIMESLREIHDDPLPSGRDERASIDELSQNFQSLHRVKESEANVAYRSYHRGSTSGTKRVAPSRLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFH
        A++IM+SLR++   P         SI           ++K+ +ANVAY      S+SG++++       K++  +GK   +AA NGK   ++A KG CFH
Subjt:  AKEIMESLREIHDDPLPSGRDERASIDELSQNFQSLHRVKESEANVAYRSYHRGSTSGTKRVAPSRLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFH

Query:  CNGGGHWKRNCPKFLAERKNQG------------------------------------HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSG
        CN   HWK NCPK+L ++K +G                                    HINL+RIE+LVK+GLLNE E++SLP CESCLEGKMTKRPF+ 
Subjt:  CNGGGHWKRNCPKFLAERKNQG------------------------------------HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSG

Query:  KGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDR---------------------
        KGYRAKEPLEL+H DL GPMN KARGG+EYF+SFIDDYS YGY+YLM  K E LEKFKEYK EVENLL K +K LRSDR                     
Subjt:  KGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDR---------------------

Query:  --APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCL
          APG PQQN VS RRNRTLLDMV SMMSYA+LP SFWGYAVET V+ILNNVPSKSV ETPFELW GRK SL HF+IWGC  HVLV+NPKKLEPR  LC 
Subjt:  --APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLKLCL

Query:  FVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNE
        FVGYPKETRGGLF+DP+EN+V VS NA F+EEDH+RDH PRSK+VLNE
Subjt:  FVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNE

SwissProt top hitse value%identityAlignment
P04146 Copia protein5.7e-3128.43Show/hide
Query:  GHINLNRIEKLVKSGLLNEFE-----ENSLPVCESCLEGKMTKRPFSGKGYRA--KEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKK
        GHI+  ++ ++ +  + ++       E S  +CE CL GK  + PF     +   K PL +VHSD+ GP+         YFV F+D ++ Y   YL+  K
Subjt:  GHINLNRIEKLVKSGLLNEFE-----ENSLPVCESCLEGKMTKRPFSGKGYRA--KEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKK

Query:  FETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILN
         +    F+++  + E      +  L  D                         P  PQ NGVS R  RT+ +  R+M+S A+L  SFWG AV TA Y++N
Subjt:  FETLEKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILN

Query:  NVPSKSVCE---TPFELWSGRKGSLHHFRIWGCPTHVLVSNPK-KLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIV
         +PS+++ +   TP+E+W  +K  L H R++G   +V + N + K + +    +FVGY  E  G   +D    + +V+ + +  E + V     + + V
Subjt:  NVPSKSVCE---TPFELWSGRKGSLHHFRIWGCPTHVLVSNPK-KLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.0e-4434.46Show/hide
Query:  KNQGHINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETL
        K  GH++   ++ L K  L++  +  ++  C+ CL GK  +  F     R    L+LV+SD+ GPM  ++ GG +YFV+FIDD SR  ++Y++  K +  
Subjt:  KNQGHINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETL

Query:  EKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPS
        + F+++   VE   G+ LK LRSD                         PG PQ NGV+ R NRT+++ VRSM+  A+LP SFWG AV+TA Y++N  PS
Subjt:  EKFKEYKTEVENLLGKSLKTLRSDR-----------------------APGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPS

Query:  KSVC-ETPFELWSGRKGSLHHFRIWGCP--THVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLN
          +  E P  +W+ ++ S  H +++GC    HV      KL+ +   C+F+GY  E  G   +DP + +V+ S + +F  E  VR     S+ V N
Subjt:  KSVC-ETPFELWSGRKGSLHHFRIWGCP--THVLVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLN

Q12491 Transposon Ty2-B Gag-Pol polyprotein1.1e-1524.32Show/hide
Query:  GHINLNRIEKLVKSGLLNEFEENSLP-------VCESCLEGKMTKRPFSGKGYRAK-----EPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIY
        GH N   I+K +K   +   +E+ +         C  CL GK TK     KG R K     EP + +H+D++GP++   +    YF+SF D+ +R+ ++Y
Subjt:  GHINLNRIEKLVKSGLLNEFEENSLP-------VCESCLEGKMTKRPFSGKGYRAK-----EPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIY

Query:  LMHKKFE--TLEKFKEYKTEVENLLGKSLKTLRSDRAPGMPQQ-----------------------NGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVE
         +H + E   L  F      ++N     +  ++ DR      +                       +GV+ R NRTLL+  R+++  + LP+  W  AVE
Subjt:  LMHKKFE--TLEKFKEYKTEVENLLGKSLKTLRSDRAPGMPQQ-----------------------NGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVE

Query:  TAVYILNNVPSKSVCETPFELWSGRKG-SLHHFRIWGCPTHVLVSNP-KKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEED
         +  I N++ S    ++  +  +G  G  +     +G P  V   NP  K+ PR      +   + + G + Y P   + + +TN + ++++
Subjt:  TAVYILNNVPSKSVCETPFELWSGRKG-SLHHFRIWGCPTHVLVSNP-KKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEED

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE19.4e-2627.02Show/hide
Query:  CESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDRA----
        C  CL  K  K PFS     +  PLE ++SD++      +   Y Y+V F+D ++RY ++Y + +K +  E F  +K  +EN     + T  SD      
Subjt:  CESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETLEKFKEYKTEVENLLGKSLKTLRSDRA----

Query:  -----------------PGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSV-CETPFELWSGRKGSLHHFRIWGCPTHVL
                         P  P+ NG+S R++R +++   +++S+A +P ++W YA   AVY++N +P+  +  E+PF+   G   +    R++GC  +  
Subjt:  -----------------PGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSV-CETPFELWSGRKGSLHHFRIWGCPTHVL

Query:  VS--NPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIE
        +   N  KL+ + + C+F+GY       L    + +R+ +S +  F E
Subjt:  VS--NPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.1e-2527.31Show/hide
Query:  GHINLNRIEKLVKSGLLNEFE-ENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETLEK
        GH +L  +  ++ +  L      + L  C  C   K  K PFS     + +PLE ++SD++      +   Y Y+V F+D ++RY ++Y + +K +  + 
Subjt:  GHINLNRIEKLVKSGLLNEFE-ENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKFETLEK

Query:  FKEYKTEVENLLGKSLKTLRSDRA---------------------PGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSV-
        F  +K+ VEN     + TL SD                       P  P+ NG+S R++R +++M  +++S+A +P ++W YA   AVY++N +P+  + 
Subjt:  FKEYKTEVENLLGKSLKTLRSDRA---------------------PGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSV-

Query:  CETPFELWSGRKGSLHHFRIWGCPTHVLVS--NPKKLEPRLKLCLFVGY
         ++PF+   G+  +    +++GC  +  +   N  KLE + K C F+GY
Subjt:  CETPFELWSGRKGSLHHFRIWGCPTHVLVS--NPKKLEPRLKLCLFVGY

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein1.0e-0641.67Show/hide
Query:  HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYG
        H++   +E LVK G L+  + +SL  CE C+ GK  +  FS   +  K PL+ VHSDL+G
Subjt:  HINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.5e-0735.37Show/hide
Query:  NRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSV-CETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLK
        NRT+++ VRSM+    LP +F   A  TAV+I+N  PS ++    P E+W     +  + R +GC  ++   +  KL+PR K
Subjt:  NRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSV-CETPFELWSGRKGSLHHFRIWGCPTHVLVSNPKKLEPRLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGATGGTTTCGCTAAGGAGATCATGGAGTCCTTGCGAGAAATACATGATGACCCACTTCCATCTGGCCGAGATGAACGGGCTTCGATCGACGAGTTGAGCCAGAA
TTTTCAATCCTTGCACAGGGTCAAGGAATCTGAGGCAAATGTTGCCTACAGGTCTTATCACAGGGGTTCGACCTCTGGGACGAAACGTGTTGCTCCTTCACGCCTGAAAG
GGAAGAAGAGGATGAAGAGGGGTAAAACTGACCGTGTTGCCGCCCACAATGGCAAGAAGGTCAAGGAGATTGCAGAGAAAGGAAACTGTTTCCACTGCAATGGGGGTGGT
CACTGGAAGAGAAACTGTCCCAAATTCCTGGCCGAGAGGAAGAATCAAGGCCACATTAATCTCAATAGGATTGAGAAACTAGTGAAGAGTGGACTTCTAAACGAGTTTGA
AGAAAACTCTTTGCCGGTGTGTGAGTCATGCCTTGAGGGCAAAATGACCAAACGTCCTTTTAGTGGAAAAGGATATAGAGCCAAAGAGCCTCTTGAGTTAGTACATTCTG
ACCTCTATGGTCCGATGAATGCTAAAGCTCGGGGCGGTTATGAGTATTTCGTGTCTTTCATAGACGATTACTCCAGATATGGGTATATTTACCTAATGCACAAGAAGTTT
GAAACTCTTGAAAAATTCAAGGAGTACAAGACTGAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATAGAGCGCCTGGTATGCCGCAGCAGAATGG
TGTATCGGGGAGGAGAAACAGAACCTTGTTGGACATGGTTCGGTCGATGATGAGCTATGCTCGTCTCCCTGATTCTTTTTGGGGGTACGCAGTGGAGACTGCGGTTTATA
TTTTGAACAACGTTCCATCGAAGAGTGTTTGTGAAACACCTTTCGAGCTCTGGAGTGGACGTAAAGGCAGTTTACATCACTTTAGAATTTGGGGATGCCCGACCCACGTG
TTGGTGTCAAACCCGAAAAAGCTGGAACCCCGTTTGAAATTGTGCCTATTCGTAGGTTACCCTAAAGAGACTAGGGGTGGTCTCTTTTACGATCCTAGAGAAAATAGGGT
GCTTGTGTCGACAAACGCCATTTTCATAGAGGAAGACCACGTCAGGGATCATTTACCAAGGAGTAAAATTGTATTAAATGAAATGGATGCGATACATCGCCAGAGTTCTT
GA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGATGGTTTCGCTAAGGAGATCATGGAGTCCTTGCGAGAAATACATGATGACCCACTTCCATCTGGCCGAGATGAACGGGCTTCGATCGACGAGTTGAGCCAGAA
TTTTCAATCCTTGCACAGGGTCAAGGAATCTGAGGCAAATGTTGCCTACAGGTCTTATCACAGGGGTTCGACCTCTGGGACGAAACGTGTTGCTCCTTCACGCCTGAAAG
GGAAGAAGAGGATGAAGAGGGGTAAAACTGACCGTGTTGCCGCCCACAATGGCAAGAAGGTCAAGGAGATTGCAGAGAAAGGAAACTGTTTCCACTGCAATGGGGGTGGT
CACTGGAAGAGAAACTGTCCCAAATTCCTGGCCGAGAGGAAGAATCAAGGCCACATTAATCTCAATAGGATTGAGAAACTAGTGAAGAGTGGACTTCTAAACGAGTTTGA
AGAAAACTCTTTGCCGGTGTGTGAGTCATGCCTTGAGGGCAAAATGACCAAACGTCCTTTTAGTGGAAAAGGATATAGAGCCAAAGAGCCTCTTGAGTTAGTACATTCTG
ACCTCTATGGTCCGATGAATGCTAAAGCTCGGGGCGGTTATGAGTATTTCGTGTCTTTCATAGACGATTACTCCAGATATGGGTATATTTACCTAATGCACAAGAAGTTT
GAAACTCTTGAAAAATTCAAGGAGTACAAGACTGAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATAGAGCGCCTGGTATGCCGCAGCAGAATGG
TGTATCGGGGAGGAGAAACAGAACCTTGTTGGACATGGTTCGGTCGATGATGAGCTATGCTCGTCTCCCTGATTCTTTTTGGGGGTACGCAGTGGAGACTGCGGTTTATA
TTTTGAACAACGTTCCATCGAAGAGTGTTTGTGAAACACCTTTCGAGCTCTGGAGTGGACGTAAAGGCAGTTTACATCACTTTAGAATTTGGGGATGCCCGACCCACGTG
TTGGTGTCAAACCCGAAAAAGCTGGAACCCCGTTTGAAATTGTGCCTATTCGTAGGTTACCCTAAAGAGACTAGGGGTGGTCTCTTTTACGATCCTAGAGAAAATAGGGT
GCTTGTGTCGACAAACGCCATTTTCATAGAGGAAGACCACGTCAGGGATCATTTACCAAGGAGTAAAATTGTATTAAATGAAATGGATGCGATACATCGCCAGAGTTCTT
GA
Protein sequenceShow/hide protein sequence
MSDGFAKEIMESLREIHDDPLPSGRDERASIDELSQNFQSLHRVKESEANVAYRSYHRGSTSGTKRVAPSRLKGKKRMKRGKTDRVAAHNGKKVKEIAEKGNCFHCNGGG
HWKRNCPKFLAERKNQGHINLNRIEKLVKSGLLNEFEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLYGPMNAKARGGYEYFVSFIDDYSRYGYIYLMHKKF
ETLEKFKEYKTEVENLLGKSLKTLRSDRAPGMPQQNGVSGRRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKSVCETPFELWSGRKGSLHHFRIWGCPTHV
LVSNPKKLEPRLKLCLFVGYPKETRGGLFYDPRENRVLVSTNAIFIEEDHVRDHLPRSKIVLNEMDAIHRQSS