CuGenDBv2

Tan0000815 (gene) of Snake gourd v1 genome

Gene ID: Tan0000815
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG08:49547515..49548957
RNA-Seq Expression: Tan0000815
Synteny: Tan0000815
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
  IPR001584 - Integrase, catalytic core
  IPR012337 - Ribonuclease H-like superfamily
  IPR025724 - GAG-pre-integrase domain
  IPR036397 - Ribonuclease H superfamily
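
The genome location field can be split into a sequence ID and coordinates to get the length of the gene span. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state the convention); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

seqid, start, end = parse_location("LG08:49547515..49548957")
# Assumption: both endpoints are included in the feature (1-based, inclusive).
span = end - start + 1
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption, the span on LG08 is 1443 bp.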


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 6.0e-134 | % identity: 56.15
Query:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG
        M EG+SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE   E F    SN VMNKI+YTL TLLNELQ F+SL ++K  + EANVA   R +HRGSTSG
Subjt:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG

Query:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG
        TK +  S       + KG                                           KK+ K+GK D                     A  H    
Subjt:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG

Query:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP
         +GI+S + L  GE+T+RVG+G +VS  A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ  S++F  +K FI   G  ICSA LE+NLYVL+ 
Subjt:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP

Query:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM
         + K++LNTE+FKTA T+ K+ K+SPK+NAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLELVHSDLCGPM
Subjt:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM

Query:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        NVKARGG+EYF++F +DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI SQLSA G PQ
Subjt:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 2.1e-134 | % identity: 56.15
Query:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG
        M EG+SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE   E F    SN VMNKI+YTL TLLNELQ F+SL ++K  + EANVA   R +HRGSTSG
Subjt:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG

Query:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG
        TK +  S       + KG                                           KK+ K+GK D                     A  H    
Subjt:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG

Query:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP
         +GI+S Q L  GE+T+RVG+G +VS  A+G ++L+    +LLL+N+Y+VP   RNL+S+  L+EQ  S++F  +K FI   G  ICSA LE+NLYVL+ 
Subjt:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP

Query:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM
         + K++LNTE+FKTA T+ K+ K+SPK+NAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLELVHS+LCGPM
Subjt:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM

Query:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        NVKARGG+EYF++F +DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI SQLSA G PQ
Subjt:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 6.0e-134 | % identity: 56.15
Query:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG
        M EG+SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE   E F    SN VMNKI+YTL TLLNELQ F+SL ++K  + EANVA   R +HRGSTSG
Subjt:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG

Query:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG
        TK +  S       + KG                                           KK+ K+GK D                     A  H    
Subjt:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG

Query:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP
         +GI+S + L  GE+T+RVG+G +VS  A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ  S++F  +K FI   G  ICSA LE+NLYVL+ 
Subjt:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP

Query:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM
         + K++LNTE+FKTA T+ K+ K+SPK+NAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLELVHSDLCGPM
Subjt:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM

Query:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        NVKARGG+EYF++F +DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI SQLSA G PQ
Subjt:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 6.0e-134 | % identity: 56.15
Query:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG
        M EG+SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE   E F    SN VMNKI+YTL TLLNELQ F+SL ++K  + EANVA   R +HRGSTSG
Subjt:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG

Query:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG
        TK +  S       + KG                                           KK+ K+GK D                     A  H    
Subjt:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG

Query:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP
         +GI+S + L  GE+T+RVG+G +VS  A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ  S++F  +K FI   G  ICSA LE+NLYVL+ 
Subjt:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP

Query:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM
         + K++LNTE+FKTA T+ K+ K+SPK+NAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLELVHSDLCGPM
Subjt:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM

Query:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        NVKARGG+EYF++F +DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI SQLSA G PQ
Subjt:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 6.0e-134 | % identity: 56.15
Query:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG
        M EG+SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE   E F    SN VMNKI+YTL TLLNELQ F+SL ++K  + EANVA   R +HRGSTSG
Subjt:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG

Query:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG
        TK +  S       + KG                                           KK+ K+GK D                     A  H    
Subjt:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG

Query:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP
         +GI+S + L  GE+T+RVG+G +VS  A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ  S++F  +K FI   G  ICSA LE+NLYVL+ 
Subjt:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP

Query:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM
         + K++LNTE+FKTA T+ K+ K+SPK+NAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLELVHSDLCGPM
Subjt:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM

Query:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        NVKARGG+EYF++F +DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI SQLSA G PQ
Subjt:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein | e value: 2.9e-134 | % identity: 56.15
Query:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG
        M EG+SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE   E F    SN VMNKI+YTL TLLNELQ F+SL ++K  + EANVA   R +HRGSTSG
Subjt:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG

Query:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG
        TK +  S       + KG                                           KK+ K+GK D                     A  H    
Subjt:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG

Query:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP
         +GI+S + L  GE+T+RVG+G +VS  A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ  S++F  +K FI   G  ICSA LE+NLYVL+ 
Subjt:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP

Query:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM
         + K++LNTE+FKTA T+ K+ K+SPK+NAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLELVHSDLCGPM
Subjt:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM

Query:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        NVKARGG+EYF++F +DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI SQLSA G PQ
Subjt:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

A0A5A7TU93 Gag/pol protein | e value: 1.0e-134 | % identity: 56.15
Query:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG
        M EG+SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE   E F    SN VMNKI+YTL TLLNELQ F+SL ++K  + EANVA   R +HRGSTSG
Subjt:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG

Query:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG
        TK +  S       + KG                                           KK+ K+GK D                     A  H    
Subjt:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG

Query:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP
         +GI+S Q L  GE+T+RVG+G +VS  A+G ++L+    +LLL+N+Y+VP   RNL+S+  L+EQ  S++F  +K FI   G  ICSA LE+NLYVL+ 
Subjt:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP

Query:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM
         + K++LNTE+FKTA T+ K+ K+SPK+NAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLELVHS+LCGPM
Subjt:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM

Query:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        NVKARGG+EYF++F +DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI SQLSA G PQ
Subjt:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

A0A5A7TWB9 Gag/pol protein | e value: 2.9e-134 | % identity: 56.15
Query:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG
        M EG+SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE   E F    SN VMNKI+YTL TLLNELQ F+SL ++K  + EANVA   R +HRGSTSG
Subjt:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG

Query:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG
        TK +  S       + KG                                           KK+ K+GK D                     A  H    
Subjt:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG

Query:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP
         +GI+S + L  GE+T+RVG+G +VS  A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ  S++F  +K FI   G  ICSA LE+NLYVL+ 
Subjt:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP

Query:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM
         + K++LNTE+FKTA T+ K+ K+SPK+NAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLELVHSDLCGPM
Subjt:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM

Query:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        NVKARGG+EYF++F +DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI SQLSA G PQ
Subjt:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

A0A5A7V4M1 Gag/pol protein | e value: 2.9e-134 | % identity: 56.15
Query:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG
        M EG+SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE   E F    SN VMNKI+YTL TLLNELQ F+SL ++K  + EANVA   R +HRGSTSG
Subjt:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG

Query:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG
        TK +  S       + KG                                           KK+ K+GK D                     A  H    
Subjt:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG

Query:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP
         +GI+S + L  GE+T+RVG+G +VS  A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ  S++F  +K FI   G  ICSA LE+NLYVL+ 
Subjt:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP

Query:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM
         + K++LNTE+FKTA T+ K+ K+SPK+NAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLELVHSDLCGPM
Subjt:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM

Query:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        NVKARGG+EYF++F +DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI SQLSA G PQ
Subjt:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

A0A5D3CPJ6 Gag/pol protein | e value: 2.9e-134 | % identity: 56.15
Query:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG
        M EG+SVREHVL+MM HFN+AEMNGA IDE+SQVSFILE   E F    SN VMNKI+YTL TLLNELQ F+SL ++K  + EANVA   R +HRGSTSG
Subjt:  MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVK--ESEANVA--YRSYHRGSTSG

Query:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG
        TK +  S       + KG                                           KK+ K+GK D                     A  H    
Subjt:  TKPVAPS-------RPKG-------------------------------------------KKRMKRGKTD--------------------RAAPH--KG

Query:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP
         +GI+S + L  GE+T+RVG+G +VS  A+G ++L     +LLL+N+Y+VP   RNL+S+  L+EQ  S++F  +K FI   G  ICSA LE+NLYVL+ 
Subjt:  KKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKP

Query:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM
         + K++LNTE+FKTA T+ K+ K+SPK+NAHLWHLRLGHINLNRIE+LVK+GLL+ELEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLELVHSDLCGPM
Subjt:  NSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPM

Query:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        NVKARGG+EYF++F +DYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI SQLSA G PQ
Subjt:  NVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein | e value: 3.2e-21 | % identity: 29.09
Query:  GELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKPNSVKSVLNTELFKTAETRTKK
        GE +     G V+L    +  L D L+       NL+S+  L E  +S+ F+ S   IS  G           L V+K + + + +    F+      K 
Subjt:  GELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKPNSVKSVLNTELFKTAETRTKK

Query:  AKVSPKDNAHLWHLRLGHIN------LNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRA--KEPLELVHSDLCGPMNVKARGGYEYFVS
             K+N  LWH R GHI+      + R        LLN L E S  +CE CL GK  + PF     +   K PL +VHSD+CGP+         YFV 
Subjt:  AKVSPKDNAHLWHLRLGHIN------LNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRA--KEPLELVHSDLCGPMNVKARGGYEYFVS

Query:  FIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        F++ ++ Y   YL+  KS+    F+++  + E   +  +  L  D G EY+  E + + ++ GI   L+    PQ
Subjt:  FIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.4e-29 | % identity: 30.43
Query:  TLRVGSGELVSVAAIGTVKLHFG-GKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKPNSVKSVLNTELFKT
        T+++G+     +A IG + +    G  L+L ++  VP    NL+S  +L        F + K  ++ KG+L+ +               K V    L++T
Subjt:  TLRVGSGELVSVAAIGTVKLHFG-GKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKPNSVKSVLNTELFKT

Query:  -AETRTKKAKVSPKD-NAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFV
         AE    +   +  + +  LWH R+GH++   ++ L K  L++  +  ++  C+ CL GK  +  F     R    L+LV+SD+CGPM +++ GG +YFV
Subjt:  -AETRTKKAKVSPKD-NAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFV

Query:  SFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        +FI+D SR  ++Y++  K +  + F+++   VE    + LK LRSD GGEY   EF++Y   HGI  + +  G PQ
Subjt:  SFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

Q12491 Transposon Ty2-B Gag-Pol polyprotein | e value: 9.9e-15 | % identity: 25.47
Query:  VSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKPNSVKSVLNTELFKTAETRTKKAKV
        + + AIG +  +F             P    +L+S+S L  Q I+  F  +    S  G ++       + Y L   S K ++ + + K       K+K 
Subjt:  VSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKPNSVKSVLNTELFKTAETRTKKAKV

Query:  SPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLP-------VCESCLEGKMTKRPFSGKGYRAK-----EPLELVHSDLCGPMNVKARGGYEYFV
          K    L H  LGH N   I+K +K   +  L+E+ +         C  CL GK TK     KG R K     EP + +H+D+ GP++   +    YF+
Subjt:  SPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLP-------VCESCLEGKMTKRPFSGKGYRAK-----EPLELVHSDLCGPMNVKARGGYEYFV

Query:  SFIEDYSRYGYIYLMH--RKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGI
        SF ++ +R+ ++Y +H  R+   L  F      ++N  +  +  ++ DRG EY +     +    GI
Subjt:  SFIEDYSRYGYIYLMH--RKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGI

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.6e-17 | % identity: 26.67
Query:  DRAAPHKGKKGINSL---QPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSA
        D  A H      N+L   QP   G+  + V  G  + ++  G+  L    + L L N+  VP   +NL+S+  L      VS E   A    K       
Subjt:  DRAAPHKGKKGINSL---QPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSA

Query:  SLEHNLYVLKPNSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELE-ENSLPVCESCLEGKMTKRPFSGKGYRAKEP
         L   + +L+    K  L      +++  +  A  S K     WH RLGH   + +  ++ +  L+ L   +    C  CL  K  K PFS     +  P
Subjt:  SLEHNLYVLKPNSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELE-ENSLPVCESCLEGKMTKRPFSGKGYRAKEP

Query:  LELVHSDLCGPMNVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
        LE ++SD+     + +   Y Y+V F++ ++RY ++Y + +KS+  E F  +K  +EN     + T  SD GGE++     +Y  +HGI    S    P+
Subjt:  LELVHSDLCGPMNVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.8e-16 | % identity: 25.77
Query:  TKPVAPSRPKGKKRM------KRGKTDRAAPHKGKKGINSL---QPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSL
        T P  P +P+    +           D  A H      N+L   QP   G+  + +  G  + +   G+  L    + L L+ +  VP   +NL+S+  L
Subjt:  TKPVAPSRPKGKKRM------KRGKTDRAAPHKGKKGINSL---QPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSL

Query:  IEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKPNSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELE-ENSL
              VS E   A    K        L   + +L+    K  L      +++  +  A    K     WH RLGH +L  +  ++ +  L  L   + L
Subjt:  IEQCISVSFESSKAFISFKGNLICSASLEHNLYVLKPNSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELE-ENSL

Query:  PVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGE
          C  C   K  K PFS     + +PLE ++SD+     + +   Y Y+V F++ ++RY ++Y + +KS+  + F  +K+ VEN     + TL SD GGE
Subjt:  PVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPMNVKARGGYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGE

Query:  YMDTEFQDYMIEHGIPSQLSARGMPQ
        ++    +DY+ +HGI    S    P+
Subjt:  YMDTEFQDYMIEHGIPSQLSARGMPQ

Arabidopsis top hits (e value, % identity, alignment)
ATMG00300.1 Gag-Pol-related retrotransposon family protein | e value: 8.1e-12 | % identity: 39.33
Query:  TAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPMNV
        + ET       + KD   LWH RL H++   +E LVK G L+  + +SL  CE C+ GK  +  FS   +  K PL+ VHSDL G  +V
Subjt:  TAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPMNV
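
The % identity column can be cross-checked against an alignment's middle line. The sketch below assumes the middle line follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and that identity is computed over the full alignment length. `percent_identity` is a hypothetical helper, and the two strings are copied from the Arabidopsis alignment above.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter."""
    # Assumption: letters in the midline mark identical residues;
    # '+' and spaces are not counted as identities.
    identities = sum(ch.isalpha() for ch in midline)
    # The aligned query line (gaps included) gives the alignment length.
    return round(100 * identities / len(query), 2)

QUERY = ("TAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKM"
         "TKRPFSGKGYRAKEPLELVHSDLCGPMNV")
MIDLINE = ("+ ET       + KD   LWH RL H++   +E LVK G L+  + +SL  CE C+ GK"
           "  +  FS   +  K PL+ VHSDL G  +V")

identity = percent_identity(QUERY, MIDLINE)
print(identity)
```

Here 35 identical positions over 89 aligned columns give 39.33, which agrees with the value listed for ATMG00300.1.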


Sequences
CDS sequence
ATGAAGGAGGGGTCGTCTGTCCGTGAACATGTTCTAGACATGATGACCCACTTTAATCTTGCTGAGATGAACGGGGCATCAATCGATGAGTCGAGCCAAGTCAGTTTTAT
TTTGGAAGACTCTTCTGAAGAGTTTCCTTCAGTTTCTAGCAATGTTGTTATGAACAAAATTAGCTACACTCTGGCTACCCTCCTCAATGAGCTACAGAATTTCCAGTCCT
TGAACAGGGTCAAGGAATCTGAGGCAAATGTTGCCTACAGGTCTTATCACAGGGGTTCGACCTCTGGGACGAAACCTGTTGCTCCTTCACGCCCGAAAGGAAAAAAGAGG
ATGAAGAGGGGTAAAACTGACCGAGCTGCCCCCCACAAGGGCAAGAAGGGGATTAATTCTTTGCAGCCGCTGCGAGAGGGTGAGGTGACTCTACGGGTTGGATCCGGGGA
GCTTGTGTCTGTTGCAGCGATCGGTACGGTGAAGCTACATTTTGGCGGGAAGTACTTATTATTAGATAATTTGTACATTGTACCAGGGTTTACTAGAAACCTTGTTTCCA
TTTCCTCCCTAATTGAACAATGTATATCTGTTTCCTTTGAATCTAGTAAAGCATTTATTTCTTTCAAAGGCAATCTTATTTGTTCTGCTTCACTTGAGCATAATCTGTAT
GTTTTGAAACCTAATTCGGTCAAAAGTGTTTTGAATACTGAATTGTTTAAAACTGCAGAAACACGAACTAAGAAAGCGAAAGTTTCTCCTAAAGATAATGCCCATCTTTG
GCATCTACGGTTAGGCCACATTAATCTCAATAGGATTGAGAAACTAGTGAAGAGTGGACTTCTAAACGAGTTGGAAGAAAACTCTTTGCCGGTGTGTGAGTCATGCCTTG
AGGGCAAGATGACCAAACGTCCTTTTAGTGGAAAAGGATATAGAGCCAAAGAGCCTCTTGAGTTAGTACATTCTGACCTCTGTGGTCCTATGAATGTTAAAGCTCGGGGT
GGTTATGAGTACTTCGTGTCTTTCATAGAGGATTACTCGAGGTATGGGTATATTTACCTAATGCATAGGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGA
GGTTGAGAACCTCTTAGATAAATCGCTTAAAACACTTCGATCGGATCGAGGTGGAGAGTACATGGACACTGAATTCCAGGACTATATGATAGAACACGGAATTCCGTCTC
AACTCTCAGCGCGTGGTATGCCACAATAG
mRNA sequence
ATGAAGGAGGGGTCGTCTGTCCGTGAACATGTTCTAGACATGATGACCCACTTTAATCTTGCTGAGATGAACGGGGCATCAATCGATGAGTCGAGCCAAGTCAGTTTTAT
TTTGGAAGACTCTTCTGAAGAGTTTCCTTCAGTTTCTAGCAATGTTGTTATGAACAAAATTAGCTACACTCTGGCTACCCTCCTCAATGAGCTACAGAATTTCCAGTCCT
TGAACAGGGTCAAGGAATCTGAGGCAAATGTTGCCTACAGGTCTTATCACAGGGGTTCGACCTCTGGGACGAAACCTGTTGCTCCTTCACGCCCGAAAGGAAAAAAGAGG
ATGAAGAGGGGTAAAACTGACCGAGCTGCCCCCCACAAGGGCAAGAAGGGGATTAATTCTTTGCAGCCGCTGCGAGAGGGTGAGGTGACTCTACGGGTTGGATCCGGGGA
GCTTGTGTCTGTTGCAGCGATCGGTACGGTGAAGCTACATTTTGGCGGGAAGTACTTATTATTAGATAATTTGTACATTGTACCAGGGTTTACTAGAAACCTTGTTTCCA
TTTCCTCCCTAATTGAACAATGTATATCTGTTTCCTTTGAATCTAGTAAAGCATTTATTTCTTTCAAAGGCAATCTTATTTGTTCTGCTTCACTTGAGCATAATCTGTAT
GTTTTGAAACCTAATTCGGTCAAAAGTGTTTTGAATACTGAATTGTTTAAAACTGCAGAAACACGAACTAAGAAAGCGAAAGTTTCTCCTAAAGATAATGCCCATCTTTG
GCATCTACGGTTAGGCCACATTAATCTCAATAGGATTGAGAAACTAGTGAAGAGTGGACTTCTAAACGAGTTGGAAGAAAACTCTTTGCCGGTGTGTGAGTCATGCCTTG
AGGGCAAGATGACCAAACGTCCTTTTAGTGGAAAAGGATATAGAGCCAAAGAGCCTCTTGAGTTAGTACATTCTGACCTCTGTGGTCCTATGAATGTTAAAGCTCGGGGT
GGTTATGAGTACTTCGTGTCTTTCATAGAGGATTACTCGAGGTATGGGTATATTTACCTAATGCATAGGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGA
GGTTGAGAACCTCTTAGATAAATCGCTTAAAACACTTCGATCGGATCGAGGTGGAGAGTACATGGACACTGAATTCCAGGACTATATGATAGAACACGGAATTCCGTCTC
AACTCTCAGCGCGTGGTATGCCACAATAG
Protein sequence
MKEGSSVREHVLDMMTHFNLAEMNGASIDESSQVSFILEDSSEEFPSVSSNVVMNKISYTLATLLNELQNFQSLNRVKESEANVAYRSYHRGSTSGTKPVAPSRPKGKKR
MKRGKTDRAAPHKGKKGINSLQPLREGEVTLRVGSGELVSVAAIGTVKLHFGGKYLLLDNLYIVPGFTRNLVSISSLIEQCISVSFESSKAFISFKGNLICSASLEHNLY
VLKPNSVKSVLNTELFKTAETRTKKAKVSPKDNAHLWHLRLGHINLNRIEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELVHSDLCGPMNVKARG
GYEYFVSFIEDYSRYGYIYLMHRKSETLEKFKEYKTEVENLLDKSLKTLRSDRGGEYMDTEFQDYMIEHGIPSQLSARGMPQ
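
As a consistency check between the CDS and protein sequences, the CDS can be translated with the standard genetic code. This is a minimal sketch, assuming the standard nuclear code applies. `translate` is a hypothetical helper written here without external libraries, and only the first 30 and last 21 nucleotides of the CDS listed above are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds: str) -> str:
    """Translate an in-frame DNA sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))

cds_start = "ATGAAGGAGGGGTCGTCTGTCCGTGAACAT"  # first 30 nt of the CDS
cds_end = "GCGCGTGGTATGCCACAATAG"             # last 21 nt of the CDS

print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues (MKEGSSVREH) and the last six residues plus the stop codon (ARGMPQ*) of the protein sequence above.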