CuGenDBv2

Tan0000075 (gene) of Snake gourd v1 genome

Gene ID: Tan0000075
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG07:62828014..62830620
RNA-Seq Expression: Tan0000075
Synteny: Tan0000075
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
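
The genome location above follows a `seqid:start..end` pattern. As a minimal sketch, the string can be split into its parts and the span length computed. The function names `parse_location` and `span_length` are illustrative, and the sketch assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: 1-based, inclusive coordinates (not stated on the page).
    return end - start + 1


seqid, start, end = parse_location("LG07:62828014..62830620")
print(seqid, start, end, span_length(start, end))  # LG07 62828014 62830620 2607
```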


Homology
GenBank top hits    e value    %identity    Alignment
KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa]    3.3e-218    61.27
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE
        MSS+IIALL  D+L G+NY +WK+ +N ILV  D+ FVL EECP  P   AS+ VRD YDRW +AN+KA+++I+ASMSD+L+KKHE+MV+A++IM+SL+E
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE

Query:  MFGQQSFQVR------HDSLKYIFN-------ARMKEGS-------------------------SVREHVLDMMTRF--NLAEMNGAS--IDESSQGIDS
        MFGQ S Q++      H   +++ +        + KEG                          +V EH      ++     E  GA+  +  S Q   S
Subjt:  MFGQQSFQVR------HDSLKYIFN-------ARMKEGS-------------------------SVREHVLDMMTRF--NLAEMNGAS--IDESSQGIDS

Query:  WQPLREDEVTLRVGSGELVSAAAIG----------TVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGY
        ++ L + E+TL+VG+G+++SA A+G           + +LVK+GLLN+L++ SLP CESCLEGKMTKRPF+GKGYRAKEPLELIHSDLC PMNVK RGG+
Subjt:  WQPLREDEVTLRVGSGELVSAAAIG----------TVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGY

Query:  EYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMM
        EYF+SFIDDYSRYGY+YLM  KSE LEKFKEYKTEVENLL K +K LRSDRGGEYMD  FQDYMIEHGI SQLS PG PQQNGVSERRNRTLLDMVRSMM
Subjt:  EYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMM

Query:  SYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFHKCR
        SYA+LP SFWGYAVETAV+ILNNVPSK+V ETPFELW GRK SL HFRIWGCPAHVLV+N KKLEPRSRLC FVG PKETRGGLF+DP+ENR        
Subjt:  SYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFHKCR

Query:  FPEEDHIGDHLPRSKIVLNEMGDYIAKEFFDGTSASMAFIDPSASSQIR-SQELGMPRRSGRTVRQPDRYKGLAETSVVAANDDCEDPLTYNQAMVDVDK
        F EEDH+ +H PRSK+VL+E  D  +    D    S    + + S Q   SQ L MPRRSGR V QP+RY GL ET VV  +D  EDPL+Y QAM DVDK
Subjt:  FPEEDHIGDHLPRSKIVLNEMGDYIAKEFFDGTSASMAFIDPSASSQIR-SQELGMPRRSGRTVRQPDRYKGLAETSVVAANDDCEDPLTYNQAMVDVDK

Query:  DEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV
        D+W+KAMD EMESMYFNSVWELVD P GVKPIGCKWIYKRKR   GKV
Subjt:  DEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV

KAA0060254.1 gag/pol protein [Cucumis melo var. makuwa]    5.5e-221    58.89
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE
        M++A + +LA+DKL G+NY SWKN IN +L+ DD+KFVL EECPQ+P + A++ VR+ Y+RW + NEK + YI+AS+S+VLAKKHE M++A+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE

Query:  MFGQQSFQVRHDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESSQG-------IDSWQPLREDEVT----------------LRVGSGELV
        MFGQ S+Q+ HD+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA IDE+SQG         S +       +                 + G G   
Subjt:  MFGQQSFQVRHDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESSQG-------IDSWQPLREDEVT----------------LRVGSGELV

Query:  SAAAIGT-------------------------------------------VEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHS
        + AA  T                                           +E+LVK+G+L+ELEENSLP+CESCLEGKMTKRPF+GKG+RAKEPLEL+HS
Subjt:  SAAAIGT-------------------------------------------VEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHS

Query:  DLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSE
        DLC PMNVK RG +EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E  I SQLS PG PQQNGVSE
Subjt:  DLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSE

Query:  RRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFY
        RRNRTLLDMVRSM+SYA LP+SFWGYAV+TAVYILN VPSK+V ETP +LW+GRKGSL HFRIWGCPAHVL +N KKLEPRS+LCLFVG PK TRGG FY
Subjt:  RRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFY

Query:  DPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAK---EFFDGTSASMAFIDPSASSQ-IRSQELGMPRRSGRTVRQPDRYKGLAETSVVAAND
        D K+N+        F E+DHI +H PRSKIVLN++   I +      + +SA    +   +S++  + Q L  PRRSGR    P RY  L ET  V ++ 
Subjt:  DPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAK---EFFDGTSASMAFIDPSASSQ-IRSQELGMPRRSGRTVRQPDRYKGLAETSVVAAND

Query:  DCEDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGK
        D EDPLT+ +AM DVDKDEWIKAM+ E+ESMYFNSVW+LVDQP+GVKPIGCKWIYKRKRG DGK
Subjt:  DCEDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGK

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa]    1.3e-222    59.25
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE
        M++A + +LA+DKL G+NY SWKN IN +L+ DD+KFVL EECPQ+P + A++ VR+ Y+RW + NEK + YI+AS+S+VLAKKHE M++A+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE

Query:  MFGQQSFQVRHDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESSQG-------IDSWQPLREDEVT----------------LRVGSGELV
        MFGQ S+Q+ HD+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA IDE+SQG         S +       +                 + G G   
Subjt:  MFGQQSFQVRHDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESSQG-------IDSWQPLREDEVT----------------LRVGSGELV

Query:  SAAAIGT-------------------------------------------VEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHS
        + AA  T                                           +E+LVK+G+L+ELEENSLP+CESCLEGKMTKRPF+GKG+RAKEPLEL+HS
Subjt:  SAAAIGT-------------------------------------------VEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHS

Query:  DLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSE
        DLC PMNVK RG +EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E  I SQLS PG PQQNGVSE
Subjt:  DLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSE

Query:  RRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFY
        RRNRTLLDMVRSM+SYA LP+SFWGYAV+TAVYILN VPSK+V ETP +LW+GRKGSL HFRIWGCPAHVL +N KKLEPRS+LCLFVG PK TRGG FY
Subjt:  RRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFY

Query:  DPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAK---EFFDGTSASMAFIDPSASSQI-RSQELGMPRRSGRTVRQPDRYKGLAETSVVAAND
        DPK+N+        F EEDHI +H PRSKIVLNE+     +      +  SA    +   +S++  + Q L  PRRSGR    P RY  L ET  V ++ 
Subjt:  DPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAK---EFFDGTSASMAFIDPSASSQI-RSQELGMPRRSGRTVRQPDRYKGLAETSVVAAND

Query:  DCEDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV
        D EDPLT+ +AM DVDKDEWIKAM+ E+ESMYFNSVW+LVDQP+GVKPIGCKWIYKRKRG DGKV
Subjt:  DCEDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa]    3.3e-218    61.27
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE
        MSS+IIALL  D+L G+NY +WK+ +N ILV  D+ FVL EECP  P   AS+ VRD YDRW +AN+KA+++I+ASMSD+L+KKHE+MV+A++IM+SL+E
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE

Query:  MFGQQSFQVR------HDSLKYIFN-------ARMKEGS-------------------------SVREHVLDMMTRF--NLAEMNGAS--IDESSQGIDS
        MFGQ S Q++      H   +++ +        + KEG                          +V EH      ++     E  GA+  +  S Q   S
Subjt:  MFGQQSFQVR------HDSLKYIFN-------ARMKEGS-------------------------SVREHVLDMMTRF--NLAEMNGAS--IDESSQGIDS

Query:  WQPLREDEVTLRVGSGELVSAAAIG----------TVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGY
        ++ L + E+TL+VG+G+++SA A+G           + +LVK+GLLN+L++ SLP CESCLEGKMTKRPF+GKGYRAKEPLELIHSDLC PMNVK RGG+
Subjt:  WQPLREDEVTLRVGSGELVSAAAIG----------TVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGY

Query:  EYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMM
        EYF+SFIDDYSRYGY+YLM  KSE LEKFKEYKTEVENLL K +K LRSDRGGEYMD  FQDYMIEHGI SQLS PG PQQNGVSERRNRTLLDMVRSMM
Subjt:  EYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMM

Query:  SYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFHKCR
        SYA+LP SFWGYAVETAV+ILNNVPSK+V ETPFELW GRK SL HFRIWGCPAHVLV+N KKLEPRSRLC FVG PKETRGGLF+DP+ENR        
Subjt:  SYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFHKCR

Query:  FPEEDHIGDHLPRSKIVLNEMGDYIAKEFFDGTSASMAFIDPSASSQIR-SQELGMPRRSGRTVRQPDRYKGLAETSVVAANDDCEDPLTYNQAMVDVDK
        F EEDH+ +H PRSK+VL+E  D  +    D    S    + + S Q   SQ L MPRRSGR V QP+RY GL ET VV  +D  EDPL+Y QAM DVDK
Subjt:  FPEEDHIGDHLPRSKIVLNEMGDYIAKEFFDGTSASMAFIDPSASSQIR-SQELGMPRRSGRTVRQPDRYKGLAETSVVAANDDCEDPLTYNQAMVDVDK

Query:  DEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV
        D+W+KAMD EMESMYFNSVWELVD P GVKPIGCKWIYKRKR   GKV
Subjt:  DEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]    2.3e-211    46.82
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+L+ DD++FVL EECPQ+P + A+R VR+ Y+RW +ANEKA+ YI+AS+S+VLAKKHE M++A+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE

Query:  MFGQQSFQVRHDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESS-----------------------------------------------
        MFGQ S+Q++HD+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA IDE+S                                               
Subjt:  MFGQQSFQVRHDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESS-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------QGIDSWQPLREDEVTLRVGSGELVSAAAIG------------------------------------------------
                              QGI SW+ L   E+T+RVG+G +VSA A+G                                                
Subjt:  ----------------------QGIDSWQPLREDEVTLRVGSGELVSAAAIG------------------------------------------------

Query:  ----------------------------------------------------------------TVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFS
                                                                         +E+LVK+GLL+ELEENSLPVCESCLEGKMTKRPF+
Subjt:  ----------------------------------------------------------------TVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFS

Query:  GKGYRAKEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS
        GKG+RAKEPLEL+HSDLC PMNVK RGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI S
Subjt:  GKGYRAKEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS

Query:  QLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLC
        QLS PG PQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAVYILN VPSK+V ETP +LW+GRKGSL HFRIWGCPAHVL +N KKLEPRS+LC
Subjt:  QLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLC

Query:  LFVGCPKETRGGLFYDPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAK---EFFDGTSASMAFIDPSASSQI-RSQELGMPRRSGRTVRQPD
        LFVG PK TRGG FYDPK+N+        F EEDHI +H PRSKIVLNE+     +      +  SA    +   +S++  + Q L  PRRSGR    P 
Subjt:  LFVGCPKETRGGLFYDPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAK---EFFDGTSASMAFIDPSASSQI-RSQELGMPRRSGRTVRQPD

Query:  RYKGLAETSVVAANDDCEDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV
        RY  L ET  V ++ D EDPLT+ +AM DVDKDEWIKAM+ E+ESMYFNSVW+LVDQP+GVKPIGCKWIYKRKRG DGKV
Subjt:  RYKGLAETSVVAANDDCEDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV

TrEMBL top hits    e value    %identity    Alignment
A0A5A7SMH8 Gag/pol protein    1.1e-211    46.82
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE
        M+SA + +LA+DKL G+NY SWKN INT+L+ DD++FVL EECPQ+P + A+R VR+ Y+RW +ANEKA+ YI+AS+S+VLAKKHE M++A+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE

Query:  MFGQQSFQVRHDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESS-----------------------------------------------
        MFGQ S+Q++HD+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA IDE+S                                               
Subjt:  MFGQQSFQVRHDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESS-----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------QGIDSWQPLREDEVTLRVGSGELVSAAAIG------------------------------------------------
                              QGI SW+ L   E+T+RVG+G +VSA A+G                                                
Subjt:  ----------------------QGIDSWQPLREDEVTLRVGSGELVSAAAIG------------------------------------------------

Query:  ----------------------------------------------------------------TVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFS
                                                                         +E+LVK+GLL+ELEENSLPVCESCLEGKMTKRPF+
Subjt:  ----------------------------------------------------------------TVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFS

Query:  GKGYRAKEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS
        GKG+RAKEPLEL+HSDLC PMNVK RGG+EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E GI S
Subjt:  GKGYRAKEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITS

Query:  QLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLC
        QLS PG PQQNGVSERRNRTLLDMVRSMMSYA LP+SFWGYAV+TAVYILN VPSK+V ETP +LW+GRKGSL HFRIWGCPAHVL +N KKLEPRS+LC
Subjt:  QLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLC

Query:  LFVGCPKETRGGLFYDPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAK---EFFDGTSASMAFIDPSASSQI-RSQELGMPRRSGRTVRQPD
        LFVG PK TRGG FYDPK+N+        F EEDHI +H PRSKIVLNE+     +      +  SA    +   +S++  + Q L  PRRSGR    P 
Subjt:  LFVGCPKETRGGLFYDPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAK---EFFDGTSASMAFIDPSASSQI-RSQELGMPRRSGRTVRQPD

Query:  RYKGLAETSVVAANDDCEDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV
        RY  L ET  V ++ D EDPLT+ +AM DVDKDEWIKAM+ E+ESMYFNSVW+LVDQP+GVKPIGCKWIYKRKRG DGKV
Subjt:  RYKGLAETSVVAANDDCEDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV

A0A5A7UYE8 Gag/pol protein    1.6e-218    61.27
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE
        MSS+IIALL  D+L G+NY +WK+ +N ILV  D+ FVL EECP  P   AS+ VRD YDRW +AN+KA+++I+ASMSD+L+KKHE+MV+A++IM+SL+E
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE

Query:  MFGQQSFQVR------HDSLKYIFN-------ARMKEGS-------------------------SVREHVLDMMTRF--NLAEMNGAS--IDESSQGIDS
        MFGQ S Q++      H   +++ +        + KEG                          +V EH      ++     E  GA+  +  S Q   S
Subjt:  MFGQQSFQVR------HDSLKYIFN-------ARMKEGS-------------------------SVREHVLDMMTRF--NLAEMNGAS--IDESSQGIDS

Query:  WQPLREDEVTLRVGSGELVSAAAIG----------TVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGY
        ++ L + E+TL+VG+G+++SA A+G           + +LVK+GLLN+L++ SLP CESCLEGKMTKRPF+GKGYRAKEPLELIHSDLC PMNVK RGG+
Subjt:  WQPLREDEVTLRVGSGELVSAAAIG----------TVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGY

Query:  EYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMM
        EYF+SFIDDYSRYGY+YLM  KSE LEKFKEYKTEVENLL K +K LRSDRGGEYMD  FQDYMIEHGI SQLS PG PQQNGVSERRNRTLLDMVRSMM
Subjt:  EYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMM

Query:  SYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFHKCR
        SYA+LP SFWGYAVETAV+ILNNVPSK+V ETPFELW GRK SL HFRIWGCPAHVLV+N KKLEPRSRLC FVG PKETRGGLF+DP+ENR        
Subjt:  SYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFHKCR

Query:  FPEEDHIGDHLPRSKIVLNEMGDYIAKEFFDGTSASMAFIDPSASSQIR-SQELGMPRRSGRTVRQPDRYKGLAETSVVAANDDCEDPLTYNQAMVDVDK
        F EEDH+ +H PRSK+VL+E  D  +    D    S    + + S Q   SQ L MPRRSGR V QP+RY GL ET VV  +D  EDPL+Y QAM DVDK
Subjt:  FPEEDHIGDHLPRSKIVLNEMGDYIAKEFFDGTSASMAFIDPSASSQIR-SQELGMPRRSGRTVRQPDRYKGLAETSVVAANDDCEDPLTYNQAMVDVDK

Query:  DEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV
        D+W+KAMD EMESMYFNSVWELVD P GVKPIGCKWIYKRKR   GKV
Subjt:  DEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV

A0A5A7UYX7 Gag/pol protein    2.6e-221    58.89
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE
        M++A + +LA+DKL G+NY SWKN IN +L+ DD+KFVL EECPQ+P + A++ VR+ Y+RW + NEK + YI+AS+S+VLAKKHE M++A+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE

Query:  MFGQQSFQVRHDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESSQG-------IDSWQPLREDEVT----------------LRVGSGELV
        MFGQ S+Q+ HD+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA IDE+SQG         S +       +                 + G G   
Subjt:  MFGQQSFQVRHDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESSQG-------IDSWQPLREDEVT----------------LRVGSGELV

Query:  SAAAIGT-------------------------------------------VEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHS
        + AA  T                                           +E+LVK+G+L+ELEENSLP+CESCLEGKMTKRPF+GKG+RAKEPLEL+HS
Subjt:  SAAAIGT-------------------------------------------VEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHS

Query:  DLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSE
        DLC PMNVK RG +EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E  I SQLS PG PQQNGVSE
Subjt:  DLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSE

Query:  RRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFY
        RRNRTLLDMVRSM+SYA LP+SFWGYAV+TAVYILN VPSK+V ETP +LW+GRKGSL HFRIWGCPAHVL +N KKLEPRS+LCLFVG PK TRGG FY
Subjt:  RRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFY

Query:  DPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAK---EFFDGTSASMAFIDPSASSQ-IRSQELGMPRRSGRTVRQPDRYKGLAETSVVAAND
        D K+N+        F E+DHI +H PRSKIVLN++   I +      + +SA    +   +S++  + Q L  PRRSGR    P RY  L ET  V ++ 
Subjt:  DPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAK---EFFDGTSASMAFIDPSASSQ-IRSQELGMPRRSGRTVRQPDRYKGLAETSVVAAND

Query:  DCEDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGK
        D EDPLT+ +AM DVDKDEWIKAM+ E+ESMYFNSVW+LVDQP+GVKPIGCKWIYKRKRG DGK
Subjt:  DCEDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGK

A0A5D3BHG7 Gag/pol protein    6.3e-223    59.25
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE
        M++A + +LA+DKL G+NY SWKN IN +L+ DD+KFVL EECPQ+P + A++ VR+ Y+RW + NEK + YI+AS+S+VLAKKHE M++A+EIM+SLQE
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE

Query:  MFGQQSFQVRHDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESSQG-------IDSWQPLREDEVT----------------LRVGSGELV
        MFGQ S+Q+ HD+LKYI+NARM EG+SVREHVL+MM  FN+AEMNGA IDE+SQG         S +       +                 + G G   
Subjt:  MFGQQSFQVRHDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESSQG-------IDSWQPLREDEVT----------------LRVGSGELV

Query:  SAAAIGT-------------------------------------------VEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHS
        + AA  T                                           +E+LVK+G+L+ELEENSLP+CESCLEGKMTKRPF+GKG+RAKEPLEL+HS
Subjt:  SAAAIGT-------------------------------------------VEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHS

Query:  DLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSE
        DLC PMNVK RG +EYF++F DDYSRYGY+YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E  I SQLS PG PQQNGVSE
Subjt:  DLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSE

Query:  RRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFY
        RRNRTLLDMVRSM+SYA LP+SFWGYAV+TAVYILN VPSK+V ETP +LW+GRKGSL HFRIWGCPAHVL +N KKLEPRS+LCLFVG PK TRGG FY
Subjt:  RRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFY

Query:  DPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAK---EFFDGTSASMAFIDPSASSQI-RSQELGMPRRSGRTVRQPDRYKGLAETSVVAAND
        DPK+N+        F EEDHI +H PRSKIVLNE+     +      +  SA    +   +S++  + Q L  PRRSGR    P RY  L ET  V ++ 
Subjt:  DPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAK---EFFDGTSASMAFIDPSASSQI-RSQELGMPRRSGRTVRQPDRYKGLAETSVVAAND

Query:  DCEDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV
        D EDPLT+ +AM DVDKDEWIKAM+ E+ESMYFNSVW+LVDQP+GVKPIGCKWIYKRKRG DGKV
Subjt:  DCEDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV

A0A5D3BUN8 Gag/pol protein    1.6e-218    61.27
Query:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE
        MSS+IIALL  D+L G+NY +WK+ +N ILV  D+ FVL EECP  P   AS+ VRD YDRW +AN+KA+++I+ASMSD+L+KKHE+MV+A++IM+SL+E
Subjt:  MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQE

Query:  MFGQQSFQVR------HDSLKYIFN-------ARMKEGS-------------------------SVREHVLDMMTRF--NLAEMNGAS--IDESSQGIDS
        MFGQ S Q++      H   +++ +        + KEG                          +V EH      ++     E  GA+  +  S Q   S
Subjt:  MFGQQSFQVR------HDSLKYIFN-------ARMKEGS-------------------------SVREHVLDMMTRF--NLAEMNGAS--IDESSQGIDS

Query:  WQPLREDEVTLRVGSGELVSAAAIG----------TVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGY
        ++ L + E+TL+VG+G+++SA A+G           + +LVK+GLLN+L++ SLP CESCLEGKMTKRPF+GKGYRAKEPLELIHSDLC PMNVK RGG+
Subjt:  WQPLREDEVTLRVGSGELVSAAAIG----------TVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGY

Query:  EYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMM
        EYF+SFIDDYSRYGY+YLM  KSE LEKFKEYKTEVENLL K +K LRSDRGGEYMD  FQDYMIEHGI SQLS PG PQQNGVSERRNRTLLDMVRSMM
Subjt:  EYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMM

Query:  SYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFHKCR
        SYA+LP SFWGYAVETAV+ILNNVPSK+V ETPFELW GRK SL HFRIWGCPAHVLV+N KKLEPRSRLC FVG PKETRGGLF+DP+ENR        
Subjt:  SYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFHKCR

Query:  FPEEDHIGDHLPRSKIVLNEMGDYIAKEFFDGTSASMAFIDPSASSQIR-SQELGMPRRSGRTVRQPDRYKGLAETSVVAANDDCEDPLTYNQAMVDVDK
        F EEDH+ +H PRSK+VL+E  D  +    D    S    + + S Q   SQ L MPRRSGR V QP+RY GL ET VV  +D  EDPL+Y QAM DVDK
Subjt:  FPEEDHIGDHLPRSKIVLNEMGDYIAKEFFDGTSASMAFIDPSASSQIR-SQELGMPRRSGRTVRQPDRYKGLAETSVVAANDDCEDPLTYNQAMVDVDK

Query:  DEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV
        D+W+KAMD EMESMYFNSVWELVD P GVKPIGCKWIYKRKR   GKV
Subjt:  DEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV

SwissProt top hits    e value    %identity    Alignment
P04146 Copia protein    1.5e-40    37.13
Query:  LLNELEENSLPVCESCLEGKMTKRPFSGKGYRA--KEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGK
        LLN L E S  +CE CL GK  + PF     +   K PL ++HSD+C P+         YFV F+D ++ Y   YL+  KS+    F+++  + E     
Subjt:  LLNELEENSLPVCESCLEGKMTKRPFSGKGYRA--KEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGK

Query:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCE---TPFELWSG
         +  L  D G EY+  E + + ++ GI+  L+VP  PQ NGVSER  RT+ +  R+M+S A+L  SFWG AV TA Y++N +PS+ + +   TP+E+W  
Subjt:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCE---TPFELWSG

Query:  RKGSLHHFRIWGCPAHVLVSNLK-KLEPRSRLCLFVG
        +K  L H R++G   +V + N + K + +S   +FVG
Subjt:  RKGSLHHFRIWGCPAHVLVSNLK-KLEPRSRLCLFVG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94    1.5e-64    32.88
Query:  LVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENL
        L K  L++  +  ++  C+ CL GK  +  F     R    L+L++SD+C PM ++  GG +YFV+FIDD SR  ++Y++  K +  + F+++   VE  
Subjt:  LVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENL

Query:  LGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVC-ETPFELWS
         G+ LK LRSD GGEY   EF++Y   HGI  + +VPG PQ NGV+ER NRT+++ VRSM+  A+LP SFWG AV+TA Y++N  PS  +  E P  +W+
Subjt:  LGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVC-ETPFELWS

Query:  GRKGSLHHFRIWGCP--AHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLN-----------------
         ++ S  H +++GC   AHV      KL+ +S  C+F+G   E  G   +DP + + +      F  E  +      S+ V N                 
Subjt:  GRKGSLHHFRIWGCP--AHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFHKCRFPEEDHIGDHLPRSKIVLN-----------------

Query:  ----------EMGDYIAKEFFDGTSASMAFIDPSASSQIRSQELGMPRRSGRTVRQPDRYKGLAETSVVAANDDCEDPLTYNQAMVDVDKDEWIKAMDQE
                  E G+   +    G        +    +Q   Q   + RRS R   +  RY     T  V  +DD  +P +  + +   +K++ +KAM +E
Subjt:  ----------EMGDYIAKEFFDGTSASMAFIDPSASSQIRSQELGMPRRSGRTVRQPDRYKGLAETSVVAANDDCEDPLTYNQAMVDVDKDEWIKAMDQE

Query:  MESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV
        MES+  N  ++LV+ P G +P+ CKW++K K+  D K+
Subjt:  MESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV

Q12491 Transposon Ty2-B Gag-Pol polyprotein    3.9e-20    28.64
Query:  LVSAAAIGTVEKLVKSGLLNELEENSLP-------VCESCLEGKMTKRPFSGKGYRAK-----EPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGY
        ++  A   +++K +K   +  L+E+ +         C  CL GK TK     KG R K     EP + +H+D+  P++  P+    YF+SF D+ +R+ +
Subjt:  LVSAAAIGTVEKLVKSGLLNELEENSLP-------VCESCLEGKMTKRPFSGKGYRAK-----EPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGY

Query:  IYLMHKKSE--TLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYA
        +Y +H + E   L  F      ++N     +  ++ DRG EY +     +    GIT+  +     + +GV+ER NRTLL+  R+++  + LP+  W  A
Subjt:  IYLMHKKSE--TLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYA

Query:  VETAVYILNNVPS
        VE +  I N++ S
Subjt:  VETAVYILNNVPS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1    1.3e-34    31.97
Query:  CESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYM
        C  CL  K  K PFS     +  PLE I+SD+ +   +     Y Y+V F+D ++RY ++Y + +KS+  E F  +K  +EN     + T  SD GGE++
Subjt:  CESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYM

Query:  DTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNV-CETPFELWSGRKGSLHHFRIWGCPAH
             +Y  +HGI+   S P  P+ NG+SER++R +++   +++S+A +P ++W YA   AVY++N +P+  +  E+PF+   G   +    R++GC  +
Subjt:  DTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNV-CETPFELWSGRKGSLHHFRIWGCPAH

Query:  VLVS--NLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFH
          +   N  KL+ +SR C+F+G          Y   ++  LC H
Subjt:  VLVS--NLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFH

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2    1.5e-35    31.95
Query:  LLNELEEN-SLPV---------CESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKT
        +LN +  N SLPV         C  C   K  K PFS     + +PLE I+SD+ +   +     Y Y+V F+D ++RY ++Y + +KS+  + F  +K+
Subjt:  LLNELEEN-SLPV---------CESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKT

Query:  EVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNV-CETP
         VEN     + TL SD GGE++    +DY+ +HGI+   S P  P+ NG+SER++R +++M  +++S+A +P ++W YA   AVY++N +P+  +  ++P
Subjt:  EVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQNGVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNV-CETP

Query:  FELWSGRKGSLHHFRIWGCPAHVLVS--NLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFH
        F+   G+  +    +++GC  +  +   N  KLE +S+ C F+G          Y   ++  LC H
Subjt:  FELWSGRKGSLHHFRIWGCPAHVLVS--NLKKLEPRSRLCLFVGCPKETRGGLFYDPKENRCLCFH

Arabidopsis top hits    e value    %identity    Alignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8    1.7e-07    41.27
Query:  EDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV
        ++P TYN+A   +    W  AMD E+ +M     WE+   P   KPIGCKW+YK K   DG +
Subjt:  EDPLTYNQAMVDVDKDEWIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV

ATMG00300.1 Gag-Pol-related retrotransposon family protein    6.2e-05    42.31
Query:  VEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDL
        +E LVK G L+  + +SL  CE C+ GK  +  FS   +  K PL+ +HSDL
Subjt:  VEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDL

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein    1.7e-07    35.37
Query:  NRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNV-CETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSR
        NRT+++ VRSM+    LP +F   A  TAV+I+N  PS  +    P E+W     +  + R +GC A++      KL+PR++
Subjt:  NRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNV-CETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSR

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)    2.4e-04    38.64
Query:  WIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDG
        W +AM +E++++  N  W LV  P     +GCKW++K K   DG
Subjt:  WIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDG
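
The %identity values listed with each hit can be related to the alignment midline (the unlabelled line between Query and Subjt). The sketch below assumes the usual BLAST-style convention: a letter in the midline marks an identical column, `+` a positive-scoring substitution, and a space a mismatch or gap. The function name `percent_identity` is illustrative, not part of the database.

```python
def percent_identity(query_aligned: str, midline: str) -> float:
    """Percent identity of one alignment, read from its midline.

    Assumes BLAST-style midline notation: letters mark identical columns.
    The alignment length is the aligned query length, gap columns included.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(query_aligned), 2)


# ATMG00820.1 alignment, with the 8-character line prefix removed.
query = "WIKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDG"
midline = "W +AM +E++++  N  W LV  P     +GCKW++K K   DG"
print(percent_identity(query, midline))  # 38.64, matching the listed %identity
```

For hits whose alignment spans several blocks, the Query lines and the midlines would each be concatenated across blocks before calling the function.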


Sequences
CDS sequence
ATGTCTAGCGCTATTATAGCTTTACTTGCTTCCGATAAGTTAGTGGGAGATAATTACCAGAGCTGGAAAAACAACATCAATACAATTTTGGTGACTGACGACATAAAGTT
CGTGTTGTCTGAGGAGTGTCCTCAGATGCCGGGCTCGACCGCATCGCGTGTCGTTCGCGATACGTACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGGTCTATATAA
TTGCCAGCATGTCTGATGTTTTGGCAAAGAAGCATGAGCTGATGGTTTCTGCTAAGGAAATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAGTCCTTTCAGGTCCGG
CATGACTCGCTCAAATACATTTTCAACGCTCGGATGAAAGAGGGGTCGTCTGTCCGTGAACATGTTCTAGACATGATGACCCGCTTTAATCTGGCTGAGATGAATGGGGC
TTCGATCGATGAGTCGAGTCAGGGGATTGATTCCTGGCAGCCGCTGCGAGAGGATGAGGTGACTCTACGGGTTGGATCCGGGGAGCTTGTCTCTGCTGCAGCAATCGGCA
CGGTTGAGAAATTGGTGAAGAGTGGACTTCTAAACGAGTTGGAAGAAAACTCTTTACCAGTATGTGAGTCATGCCTTGAAGGCAAAATGACCAAACGTCCTTTTAGTGGA
AAAGGATATAGAGCCAAAGAGCCTCTTGAGTTAATACATTCTGACCTATGTGCTCCGATGAATGTTAAACCTCGGGGCGGTTATGAGTACTTCGTGTCTTTCATAGACGA
TTACTCCAGATATGGGTATATTTACCTAATGCATAAGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTATAAGACTGAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAA
CACTTCGATCAGATCGAGGTGGAGAGTACATGGACACTGAATTTCAGGACTATATGATAGAACACGGAATTACGTCCCAACTCTCAGTGCCTGGTATGCCACAGCAGAAT
GGTGTATCGGAGAGGAGAAACAGAACCTTGTTGGACATGGTTCGATCGATGATGAGCTATGCTCGTCTCCCTGATTCCTTTTGGGGATACGCAGTAGAGACTGCGGTTTA
TATTTTGAACAATGTTCCCTCAAAGAATGTTTGTGAAACACCTTTCGAGCTCTGGAGTGGACGTAAAGGCAGTTTACATCACTTCAGGATTTGGGGATGCCCAGCCCATG
TGTTGGTGTCAAACCTGAAGAAGTTGGAACCCCGTTCAAGATTATGCCTATTCGTAGGTTGCCCTAAAGAGACTAGGGGTGGTCTGTTCTACGATCCCAAGGAAAACAGG
TGCTTGTGTTTCCACAAATGTCGTTTTCCTGAGGAAGACCACATCGGGGATCATTTACCTAGAAGCAAAATTGTATTGAACGAAATGGGCGATTACATCGCCAAAGAGTT
CTTTGATGGAACTAGTGCGTCGATGGCGTTCATCGACCCTAGCGCGTCTAGTCAAATCCGCTCCCAAGAGTTGGGAATGCCTCGACGTAGTGGGAGGACTGTGAGACAGC
CCGATCGCTATAAGGGTTTAGCTGAAACCTCAGTTGTCGCTGCTAACGATGATTGTGAGGATCCATTGACCTATAATCAGGCAATGGTTGATGTTGATAAAGACGAGTGG
ATTAAAGCTATGGACCAAGAAATGGAGTCTATGTACTTCAATTCTGTCTGGGAGCTTGTGGATCAACCGAATGGGGTAAAACCTATTGGTTGTAAATGGATCTACAAGCG
TAAACGTGGCGTAGATGGGAAGGTGTAA
mRNA sequence
ATGTCTAGCGCTATTATAGCTTTACTTGCTTCCGATAAGTTAGTGGGAGATAATTACCAGAGCTGGAAAAACAACATCAATACAATTTTGGTGACTGACGACATAAAGTT
CGTGTTGTCTGAGGAGTGTCCTCAGATGCCGGGCTCGACCGCATCGCGTGTCGTTCGCGATACGTACGATCGATGGATCAGGGCCAATGAAAAGGCCAAGGTCTATATAA
TTGCCAGCATGTCTGATGTTTTGGCAAAGAAGCATGAGCTGATGGTTTCTGCTAAGGAAATCATGGAGTCCTTGCAGGAAATGTTTGGACAACAGTCCTTTCAGGTCCGG
CATGACTCGCTCAAATACATTTTCAACGCTCGGATGAAAGAGGGGTCGTCTGTCCGTGAACATGTTCTAGACATGATGACCCGCTTTAATCTGGCTGAGATGAATGGGGC
TTCGATCGATGAGTCGAGTCAGGGGATTGATTCCTGGCAGCCGCTGCGAGAGGATGAGGTGACTCTACGGGTTGGATCCGGGGAGCTTGTCTCTGCTGCAGCAATCGGCA
CGGTTGAGAAATTGGTGAAGAGTGGACTTCTAAACGAGTTGGAAGAAAACTCTTTACCAGTATGTGAGTCATGCCTTGAAGGCAAAATGACCAAACGTCCTTTTAGTGGA
AAAGGATATAGAGCCAAAGAGCCTCTTGAGTTAATACATTCTGACCTATGTGCTCCGATGAATGTTAAACCTCGGGGCGGTTATGAGTACTTCGTGTCTTTCATAGACGA
TTACTCCAGATATGGGTATATTTACCTAATGCATAAGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTATAAGACTGAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAA
CACTTCGATCAGATCGAGGTGGAGAGTACATGGACACTGAATTTCAGGACTATATGATAGAACACGGAATTACGTCCCAACTCTCAGTGCCTGGTATGCCACAGCAGAAT
GGTGTATCGGAGAGGAGAAACAGAACCTTGTTGGACATGGTTCGATCGATGATGAGCTATGCTCGTCTCCCTGATTCCTTTTGGGGATACGCAGTAGAGACTGCGGTTTA
TATTTTGAACAATGTTCCCTCAAAGAATGTTTGTGAAACACCTTTCGAGCTCTGGAGTGGACGTAAAGGCAGTTTACATCACTTCAGGATTTGGGGATGCCCAGCCCATG
TGTTGGTGTCAAACCTGAAGAAGTTGGAACCCCGTTCAAGATTATGCCTATTCGTAGGTTGCCCTAAAGAGACTAGGGGTGGTCTGTTCTACGATCCCAAGGAAAACAGG
TGCTTGTGTTTCCACAAATGTCGTTTTCCTGAGGAAGACCACATCGGGGATCATTTACCTAGAAGCAAAATTGTATTGAACGAAATGGGCGATTACATCGCCAAAGAGTT
CTTTGATGGAACTAGTGCGTCGATGGCGTTCATCGACCCTAGCGCGTCTAGTCAAATCCGCTCCCAAGAGTTGGGAATGCCTCGACGTAGTGGGAGGACTGTGAGACAGC
CCGATCGCTATAAGGGTTTAGCTGAAACCTCAGTTGTCGCTGCTAACGATGATTGTGAGGATCCATTGACCTATAATCAGGCAATGGTTGATGTTGATAAAGACGAGTGG
ATTAAAGCTATGGACCAAGAAATGGAGTCTATGTACTTCAATTCTGTCTGGGAGCTTGTGGATCAACCGAATGGGGTAAAACCTATTGGTTGTAAATGGATCTACAAGCG
TAAACGTGGCGTAGATGGGAAGGTGTAA
Protein sequence
MSSAIIALLASDKLVGDNYQSWKNNINTILVTDDIKFVLSEECPQMPGSTASRVVRDTYDRWIRANEKAKVYIIASMSDVLAKKHELMVSAKEIMESLQEMFGQQSFQVR
HDSLKYIFNARMKEGSSVREHVLDMMTRFNLAEMNGASIDESSQGIDSWQPLREDEVTLRVGSGELVSAAAIGTVEKLVKSGLLNELEENSLPVCESCLEGKMTKRPFSG
KGYRAKEPLELIHSDLCAPMNVKPRGGYEYFVSFIDDYSRYGYIYLMHKKSETLEKFKEYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSVPGMPQQN
GVSERRNRTLLDMVRSMMSYARLPDSFWGYAVETAVYILNNVPSKNVCETPFELWSGRKGSLHHFRIWGCPAHVLVSNLKKLEPRSRLCLFVGCPKETRGGLFYDPKENR
CLCFHKCRFPEEDHIGDHLPRSKIVLNEMGDYIAKEFFDGTSASMAFIDPSASSQIRSQELGMPRRSGRTVRQPDRYKGLAETSVVAANDDCEDPLTYNQAMVDVDKDEW
IKAMDQEMESMYFNSVWELVDQPNGVKPIGCKWIYKRKRGVDGKV
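
As a consistency check, the CDS can be translated in frame and compared with the protein sequence. This is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (assumed), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS above.
print(translate("ATGTCTAGCGCTATTATAGCTTTACTTGCT"))  # MSSAIIALLA
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence, joined into one string, gives a quick check that the two records agree.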