CuGenDBv2

Tan0006349 (gene) of Snake gourd v1 genome

Gene ID: Tan0006349
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG11:26712161..26713606
RNA-Seq Expression: Tan0006349
Synteny: Tan0006349
Gene Ontology terms:
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
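
The genome location above uses a `seqid:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which is common for this notation but is not stated on this page. The function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG11:26712161..26713606")
print(seqid, start, end, span_length(start, end))
# prints: LG11 26712161 26713606 1446
```

If the coordinates were 0-based and half-open instead, the length would be `end - start`, so the convention should be checked against the database documentation before relying on the number.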


Homology
GenBank top hits (e value, %identity, alignment)
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]; e value 9.5e-132; %identity 54.88
Query:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG
        MFGQ S+    D+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA  DE+SQ                            T++LN            G
Subjt:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG

Query:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---
        Q  EANVA+ +  +H GST GTKS+       K + K+G    K + AAA+  KK K  A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV   
Subjt:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---

Query:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA
                                           GE+T+RVG+G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+ CL+E   S++F +NK 
Subjt:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA

Query:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP
        FI   G  ICSA LENNLYVL+  + K++LN E+FKTA T+ K+ KISPKEN HLWHLRL HIN+NRIE+LVK+GLL+ELEENSLPVCESCLEGKMTK P
Subjt:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP

Query:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR
        F+GKG+RAKEPLELVHSDLCGPMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +F+
Subjt:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]; e value 9.5e-132; %identity 54.67
Query:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG
        MFGQ S+    D+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA  DE+SQ                            T++LN            G
Subjt:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG

Query:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---
        Q  EANVA+ +  +H GST GTKS+       K + K+G    K + AAA+  KK K  A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV   
Subjt:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---

Query:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA
                                           GE+T+RVG+G +VSA A+G ++L+    +LLL+N+Y+VP   RNL+S+ CL+E   S++F +NK 
Subjt:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA

Query:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP
        FI   G  ICSA LENNLYVL+  + K++LN E+FKTA T+ K+ KISPKEN HLWHLRL HIN+NRIE+LVK+GLL+ELEENSLPVCESCLEGKMTK P
Subjt:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP

Query:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR
        F+GKG+RAKEPLELVHS+LCGPMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +F+
Subjt:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]; e value 9.5e-132; %identity 54.88
Query:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG
        MFGQ S+    D+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA  DE+SQ                            T++LN            G
Subjt:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG

Query:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---
        Q  EANVA+ +  +H GST GTKS+       K + K+G    K + AAA+  KK K  A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV   
Subjt:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---

Query:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA
                                           GE+T+RVG+G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+ CL+E   S++F +NK 
Subjt:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA

Query:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP
        FI   G  ICSA LENNLYVL+  + K++LN E+FKTA T+ K+ KISPKEN HLWHLRL HIN+NRIE+LVK+GLL+ELEENSLPVCESCLEGKMTK P
Subjt:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP

Query:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR
        F+GKG+RAKEPLELVHSDLCGPMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +F+
Subjt:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]; e value 9.5e-132; %identity 54.88
Query:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG
        MFGQ S+    D+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA  DE+SQ                            T++LN            G
Subjt:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG

Query:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---
        Q  EANVA+ +  +H GST GTKS+       K + K+G    K + AAA+  KK K  A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV   
Subjt:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---

Query:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA
                                           GE+T+RVG+G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+ CL+E   S++F +NK 
Subjt:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA

Query:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP
        FI   G  ICSA LENNLYVL+  + K++LN E+FKTA T+ K+ KISPKEN HLWHLRL HIN+NRIE+LVK+GLL+ELEENSLPVCESCLEGKMTK P
Subjt:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP

Query:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR
        F+GKG+RAKEPLELVHSDLCGPMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +F+
Subjt:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]; e value 9.5e-132; %identity 54.88
Query:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG
        MFGQ S+    D+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA  DE+SQ                            T++LN            G
Subjt:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG

Query:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---
        Q  EANVA+ +  +H GST GTKS+       K + K+G    K + AAA+  KK K  A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV   
Subjt:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---

Query:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA
                                           GE+T+RVG+G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+ CL+E   S++F +NK 
Subjt:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA

Query:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP
        FI   G  ICSA LENNLYVL+  + K++LN E+FKTA T+ K+ KISPKEN HLWHLRL HIN+NRIE+LVK+GLL+ELEENSLPVCESCLEGKMTK P
Subjt:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP

Query:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR
        F+GKG+RAKEPLELVHSDLCGPMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +F+
Subjt:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR

TrEMBL top hits (e value, %identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value 4.6e-132; %identity 54.88
Query:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG
        MFGQ S+    D+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA  DE+SQ                            T++LN            G
Subjt:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG

Query:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---
        Q  EANVA+ +  +H GST GTKS+       K + K+G    K + AAA+  KK K  A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV   
Subjt:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---

Query:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA
                                           GE+T+RVG+G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+ CL+E   S++F +NK 
Subjt:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA

Query:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP
        FI   G  ICSA LENNLYVL+  + K++LN E+FKTA T+ K+ KISPKEN HLWHLRL HIN+NRIE+LVK+GLL+ELEENSLPVCESCLEGKMTK P
Subjt:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP

Query:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR
        F+GKG+RAKEPLELVHSDLCGPMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +F+
Subjt:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR

A0A5A7TU93 Gag/pol protein; e value 4.6e-132; %identity 54.67
Query:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG
        MFGQ S+    D+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA  DE+SQ                            T++LN            G
Subjt:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG

Query:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---
        Q  EANVA+ +  +H GST GTKS+       K + K+G    K + AAA+  KK K  A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV   
Subjt:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---

Query:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA
                                           GE+T+RVG+G +VSA A+G ++L+    +LLL+N+Y+VP   RNL+S+ CL+E   S++F +NK 
Subjt:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA

Query:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP
        FI   G  ICSA LENNLYVL+  + K++LN E+FKTA T+ K+ KISPKEN HLWHLRL HIN+NRIE+LVK+GLL+ELEENSLPVCESCLEGKMTK P
Subjt:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP

Query:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR
        F+GKG+RAKEPLELVHS+LCGPMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +F+
Subjt:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR

A0A5A7TWB9 Gag/pol protein; e value 4.6e-132; %identity 54.88
Query:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG
        MFGQ S+    D+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA  DE+SQ                            T++LN            G
Subjt:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG

Query:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---
        Q  EANVA+ +  +H GST GTKS+       K + K+G    K + AAA+  KK K  A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV   
Subjt:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---

Query:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA
                                           GE+T+RVG+G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+ CL+E   S++F +NK 
Subjt:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA

Query:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP
        FI   G  ICSA LENNLYVL+  + K++LN E+FKTA T+ K+ KISPKEN HLWHLRL HIN+NRIE+LVK+GLL+ELEENSLPVCESCLEGKMTK P
Subjt:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP

Query:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR
        F+GKG+RAKEPLELVHSDLCGPMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +F+
Subjt:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR

A0A5A7V4M1 Gag/pol protein; e value 4.6e-132; %identity 54.88
Query:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG
        MFGQ S+    D+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA  DE+SQ                            T++LN            G
Subjt:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG

Query:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---
        Q  EANVA+ +  +H GST GTKS+       K + K+G    K + AAA+  KK K  A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV   
Subjt:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---

Query:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA
                                           GE+T+RVG+G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+ CL+E   S++F +NK 
Subjt:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA

Query:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP
        FI   G  ICSA LENNLYVL+  + K++LN E+FKTA T+ K+ KISPKEN HLWHLRL HIN+NRIE+LVK+GLL+ELEENSLPVCESCLEGKMTK P
Subjt:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP

Query:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR
        F+GKG+RAKEPLELVHSDLCGPMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +F+
Subjt:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR

A0A5D3CPJ6 Gag/pol protein; e value 4.6e-132; %identity 54.88
Query:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG
        MFGQ S+    D+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA  DE+SQ                            T++LN            G
Subjt:  MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGA-FDESSQ----------------------------TSILN-----------SG

Query:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---
        Q  EANVA+ +  +H GST GTKS+       K + K+G    K + AAA+  KK K  A KG CFHCN  GHWKRNCPK+LAE+K   QGK DLLV   
Subjt:  QGIEANVASVS--YHGGSTCGTKSVAPLRPKGKKRMKRG----KTDRAAAQKGKKVKEVAEKGKCFHCNGSGHWKRNCPKFLAERK--NQGKCDLLV---

Query:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA
                                           GE+T+RVG+G +VSA A+G ++L     +LLL+N+Y+VP   RNL+S+ CL+E   S++F +NK 
Subjt:  -----------------------------------GEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKA

Query:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP
        FI   G  ICSA LENNLYVL+  + K++LN E+FKTA T+ K+ KISPKEN HLWHLRL HIN+NRIE+LVK+GLL+ELEENSLPVCESCLEGKMTK P
Subjt:  FISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCP

Query:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR
        F+GKG+RAKEPLELVHSDLCGPMNVKARGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +F+
Subjt:  FSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR

SwissProt top hits (e value, %identity, alignment)
P04146 Copia protein; e value 2.3e-19; %identity 30.47
Query:  GELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKAFISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKK
        GE + A   G V+L    +  L D L+       NL+S+  L E  +SI F+ +   IS           +N L V+K + + + + V  F+      K 
Subjt:  GELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKAFISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKK

Query:  AKISPKENVHLWHLRLDHIN------VNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCPFSGKGYRA--KEPLELVHSDLCGPMNVKARGGYEYFVS
             K N  LWH R  HI+      + R        LLN L E S  +CE CL GK  + PF     +   K PL +VHSD+CGP+         YFV 
Subjt:  AKISPKENVHLWHLRLDHIN------VNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCPFSGKGYRA--KEPLELVHSDLCGPMNVKARGGYEYFVS

Query:  FIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR
        F+D ++ Y   YL+  KS+    F+++  + E      +  L  D G EY+  E R
Subjt:  FIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94; e value 6.8e-24; %identity 27.66
Query:  PLRPKGKKRMKRGKTDRAAAQKGKK--VKEVAEKGKCFHCNG-SGHW-KRNCPKFLAERKNQGKCDLLVGEV-TLRVGSGELVSAAAIGTVKLHFG-GKY
        P + KG+   ++   + AA  +     V  + E+ +C H +G    W         A       C  + G+  T+++G+      A IG + +    G  
Subjt:  PLRPKGKKRMKRGKTDRAAAQKGKK--VKEVAEKGKCFHCNG-SGHW-KRNCPKFLAERKNQGKCDLLVGEV-TLRVGSGELVSAAAIGTVKLHFG-GKY

Query:  LLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKAFISFKGNYICSASLENNLYVLKPNSIKSILNVELFKT-AETRTKKAKISPKE-NVHLWHLRLDH
        L+L ++  VP    NL+S    ++     S+  N+ +   KG+ + +               K +    L++T AE    +   +  E +V LWH R+ H
Subjt:  LLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKAFISFKGNYICSASLENNLYVLKPNSIKSILNVELFKT-AETRTKKAKISPKE-NVHLWHLRLDH

Query:  INVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCPFSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKE
        ++   ++ L K  L++  +  ++  C+ CL GK  +  F     R    L+LV+SD+CGPM +++ GG +YFV+FIDD SR  ++Y++  K +  + F++
Subjt:  INVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCPFSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKE

Query:  YKTEVENLLGKSLKTLRSDRGGEYMDTEF
        +   VE   G+ LK LRSD GGEY   EF
Subjt:  YKTEVENLLGKSLKTLRSDRGGEYMDTEF

Q12491 Transposon Ty2-B Gag-Pol polyprotein; e value 1.0e-11; %identity 26.51
Query:  AIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKAFISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKE
        AIG +  +F             P    +L+S+S L    I+  F  N    S  G  +       + Y L   S K ++   + K       K+K   K 
Subjt:  AIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKAFISFKGNYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKE

Query:  NVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLP-------VCESCLEGKMTKCPFSGKGYRAK-----EPLELVHSDLCGPMNVKARGGYEYFVSFID
           L H  L H N   I+K +K   +  L+E+ +         C  CL GK TK     KG R K     EP + +H+D+ GP++   +    YF+SF D
Subjt:  NVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLP-------VCESCLEGKMTKCPFSGKGYRAK-----EPLELVHSDLCGPMNVKARGGYEYFVSFID

Query:  DYSRYGYIYLMHKKSE--TLEKFKEYKTEVENLLGKSLKTLRSDRGGEY
        + +R+ ++Y +H + E   L  F      ++N     +  ++ DRG EY
Subjt:  DYSRYGYIYLMHKKSE--TLEKFKEYKTEVENLLGKSLKTLRSDRGGEY

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1; e value 2.6e-15; %identity 27.49
Query:  VGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKAFISFK----GNYICSASLENNLYVLKPNSIKSILNVELFKT
        V  G  +  +  G+  L    + L L N+  VP   +NL+S+  L  +   +S E   A    K    G  +     ++ LY      I S   V LF  
Subjt:  VGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKAFISFK----GNYICSASLENNLYVLKPNSIKSILNVELFKT

Query:  AETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELE-ENSLPVCESCLEGKMTKCPFSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVS
               A  S K     WH RL H   + +  ++ +  L+ L   +    C  CL  K  K PFS     +  PLE ++SD+     + +   Y Y+V 
Subjt:  AETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELE-ENSLPVCESCLEGKMTKCPFSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVS

Query:  FIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYM
        F+D ++RY ++Y + +KS+  E F  +K  +EN     + T  SD GGE++
Subjt:  FIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2; e value 1.1e-13; %identity 26.29
Query:  VGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKAFISFK----GNYICSASLENNLYVLKPNSIKSILNVELFKT
        +  G  +     G+  L    + L L+ +  VP   +NL+S+  L  +   +S E   A    K    G  +     ++ LY      I S   V +F  
Subjt:  VGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKAFISFK----GNYICSASLENNLYVLKPNSIKSILNVELFKT

Query:  AETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELE-ENSLPVCESCLEGKMTKCPFSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVS
               A    K     WH RL H ++  +  ++ +  L  L   + L  C  C   K  K PFS     + +PLE ++SD+     + +   Y Y+V 
Subjt:  AETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELE-ENSLPVCESCLEGKMTKCPFSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVS

Query:  FIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYM
        F+D ++RY ++Y + +KS+  + F  +K+ VEN     + TL SD GGE++
Subjt:  FIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYM

Arabidopsis top hits (e value, %identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein; e value 3.0e-11; %identity 35.85
Query:  VLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCPFSGKGYRAKEPLELVHSDL
        +LK N   S+  ++   + ET       + K+   LWH RL H++   +E LVK G L+  + +SL  CE C+ GK  +  FS   +  K PL+ VHSDL
Subjt:  VLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCPFSGKGYRAKEPLELVHSDL

Query:  CGPMNV
         G  +V
Subjt:  CGPMNV
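
The %identity values listed with each hit can be related to the middle line of each alignment, where a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below assumes the usual BLAST convention that %identity is the number of identical positions divided by the number of aligned columns. That convention is not confirmed by this page. The helper name `percent_identity` is hypothetical. Under this assumption, the Arabidopsis hit appears to have 38 identical positions over 106 aligned columns, which is consistent with the listed 35.85.

```python
def percent_identity(midline: str, aligned_columns: int) -> float:
    """Percent identity from a BLAST-style midline.

    Letters in the midline are counted as identities; '+' and spaces are not.
    aligned_columns is the full alignment length, including gap columns.
    """
    if aligned_columns <= 0:
        raise ValueError("aligned_columns must be positive")
    identities = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identities / aligned_columns, 2)


# First 12 columns of the first GenBank alignment:
# query   MFGQRSFFSPQD
# midline MFGQ S+    D
print(percent_identity("MFGQ S+    D", 12))  # prints 50.0

# Counts read off the Arabidopsis alignment (38 identities, 106 columns):
print(round(100.0 * 38 / 106, 2))  # prints 35.85
```

The alignment length is passed in separately because leading whitespace in the midline is easily lost when the page is copied as text.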


Sequences
CDS sequence
ATGTTTGGACAACGATCCTTTTTCAGTCCGCAAGACTCGCTCAAATACATTTTCAACGCTCGGATGAAAGAGGGGTCGTCTGTCCGTGAACATGTTCTAGACATGATGAC
CCGCTTTAATCTGGCTGAGATGAATGGGGCTTTTGACGAGTCGAGCCAGACTTCAATCCTGAACAGTGGTCAAGGAATCGAGGCAAATGTTGCCTCTGTGTCTTATCACG
GGGGTTCGACCTGTGGGACAAAATCTGTTGCTCCTTTACGCCCGAAAGGGAAGAAGAGGATGAAGAGGGGTAAAACTGACCGAGCTGCCGCCCAGAAGGGCAAGAAGGTC
AAGGAGGTTGCAGAGAAAGGAAAGTGTTTTCACTGCAATGGAAGCGGACACTGGAAGAGAAACTGTCCCAAATTCCTAGCCGAGAGGAAGAACCAAGGTAAATGTGATTT
ACTTGTGGGTGAGGTGACTCTACGGGTTGGATCCGGGGAGCTTGTCTCTGCTGCAGCAATCGGCACAGTGAAGCTGCATTTTGGCGGGAAGTACTTATTGTTAGACAATT
TGTACATAGTTCCAGGGTTTACTAGAAACCTTGTTTCTATTTCCTGCTTAATTGAACACTGTATTTCAATTTCTTTTGAATTAAATAAAGCGTTTATTTCCTTCAAAGGG
AATTATATTTGTTCAGCTTCACTTGAAAATAATCTGTATGTTTTAAAACCCAATTCGATTAAAAGTATTTTGAATGTTGAATTGTTTAAAACTGCGGAAACACGAACTAA
GAAAGCAAAAATTTCTCCAAAAGAAAATGTTCATCTTTGGCATCTACGGTTAGACCACATTAATGTCAATAGGATTGAGAAACTAGTGAAGAGTGGACTTCTAAACGAGT
TGGAAGAAAACTCTTTGCCAGTATGTGAGTCATGCCTTGAAGGCAAAATGACCAAATGTCCTTTTAGTGGAAAAGGATATAGAGCCAAAGAGCCGCTCGAGTTAGTACAT
TCTGACCTCTGTGGTCCGATGAATGTTAAAGCTCGGGGCGGTTATGAGTACTTCGTGTCTTTCATAGACGATTACTCCAGATATGGGTATATTTACCTAATGCATAAGAA
GTCTGAAACTCTTGAAAAGTTTAAGGAGTATAAGACTGAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATCGAGGTGGAGAGTACATGGACACCG
AATTCCGGACTATATGA
mRNA sequence
ATGTTTGGACAACGATCCTTTTTCAGTCCGCAAGACTCGCTCAAATACATTTTCAACGCTCGGATGAAAGAGGGGTCGTCTGTCCGTGAACATGTTCTAGACATGATGAC
CCGCTTTAATCTGGCTGAGATGAATGGGGCTTTTGACGAGTCGAGCCAGACTTCAATCCTGAACAGTGGTCAAGGAATCGAGGCAAATGTTGCCTCTGTGTCTTATCACG
GGGGTTCGACCTGTGGGACAAAATCTGTTGCTCCTTTACGCCCGAAAGGGAAGAAGAGGATGAAGAGGGGTAAAACTGACCGAGCTGCCGCCCAGAAGGGCAAGAAGGTC
AAGGAGGTTGCAGAGAAAGGAAAGTGTTTTCACTGCAATGGAAGCGGACACTGGAAGAGAAACTGTCCCAAATTCCTAGCCGAGAGGAAGAACCAAGGTAAATGTGATTT
ACTTGTGGGTGAGGTGACTCTACGGGTTGGATCCGGGGAGCTTGTCTCTGCTGCAGCAATCGGCACAGTGAAGCTGCATTTTGGCGGGAAGTACTTATTGTTAGACAATT
TGTACATAGTTCCAGGGTTTACTAGAAACCTTGTTTCTATTTCCTGCTTAATTGAACACTGTATTTCAATTTCTTTTGAATTAAATAAAGCGTTTATTTCCTTCAAAGGG
AATTATATTTGTTCAGCTTCACTTGAAAATAATCTGTATGTTTTAAAACCCAATTCGATTAAAAGTATTTTGAATGTTGAATTGTTTAAAACTGCGGAAACACGAACTAA
GAAAGCAAAAATTTCTCCAAAAGAAAATGTTCATCTTTGGCATCTACGGTTAGACCACATTAATGTCAATAGGATTGAGAAACTAGTGAAGAGTGGACTTCTAAACGAGT
TGGAAGAAAACTCTTTGCCAGTATGTGAGTCATGCCTTGAAGGCAAAATGACCAAATGTCCTTTTAGTGGAAAAGGATATAGAGCCAAAGAGCCGCTCGAGTTAGTACAT
TCTGACCTCTGTGGTCCGATGAATGTTAAAGCTCGGGGCGGTTATGAGTACTTCGTGTCTTTCATAGACGATTACTCCAGATATGGGTATATTTACCTAATGCATAAGAA
GTCTGAAACTCTTGAAAAGTTTAAGGAGTATAAGACTGAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATCGAGGTGGAGAGTACATGGACACCG
AATTCCGGACTATATGA
Protein sequence
MFGQRSFFSPQDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGAFDESSQTSILNSGQGIEANVASVSYHGGSTCGTKSVAPLRPKGKKRMKRGKTDRAAAQKGKKV
KEVAEKGKCFHCNGSGHWKRNCPKFLAERKNQGKCDLLVGEVTLRVGSGELVSAAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISCLIEHCISISFELNKAFISFKG
NYICSASLENNLYVLKPNSIKSILNVELFKTAETRTKKAKISPKENVHLWHLRLDHINVNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKCPFSGKGYRAKEPLELVH
SDLCGPMNVKARGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFRTI
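
The protein sequence above corresponds to a codon-by-codon translation of the CDS. The following is a minimal sketch of that relationship, assuming the standard genetic code (the page does not state which translation table was used). The names `CODON_TABLE` and `translate` are illustrative, and only short fragments of the CDS are used here rather than the full sequence.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varies fastest). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First six codons of the CDS listed above:
print(translate("ATGTTTGGACAACGATCC"))  # prints MFGQRS

# Last nine codons of the CDS, including the TGA stop:
print(translate("ATGGACACCGAATTCCGGACTATATGA"))  # prints MDTEFRTI
```

Both fragments reproduce the start (`MFGQRS`) and the end (`MDTEFRTI`) of the protein sequence shown above. Pasting the full CDS into `translate` would be the natural way to check the whole record.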