; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003773 (gene) of Snake gourd v1 genome

Gene IDTan0003773
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG04:50982593..50984186
RNA-Seq ExpressionTan0003773
SyntenyTan0003773
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-18764.44Show/hide
Query:  WKRKCPKFLAERK-NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVS
        WK  CPK+L ++K  +GK DLLV ETCLVE+  +AWILDSGATNHVCSS Q   S++ L + E+TL+VG G+++SA A+   KL F  K++ L+NLYIV 
Subjt:  WKRKCPKFLAERK-NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVS

Query:  GFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKS
           RNLVS+SCLIEH  S++F +N+AFI   G  ICSA LENNLYVL+PN  K+VLN E+F+TA T+ K+ ++SP  N +LWHLRLGHINL+R  +LVK+
Subjt:  GFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKS

Query:  GLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKS
        GLLN L++ SLP CESCLEGKMTKRPF+GKGYRAKEPLELIHSDLCGPMNVKARGG+EYF+SFI DYSRYGY+YLM  KSE LEKFK YKTEVENLL K 
Subjt:  GLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKS

Query:  LKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKCS
        +K LRSDRGGEYMD  FQDYMIEHGI SQLSAP                   VRS                E  ++ILNNVPSKSV ETPFELW GRK S
Subjt:  LKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKCS

Query:  LHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDRTSARVADGASTSTSVVDPST
        L  FRIWGCP HVLV+NPKKLEPRS+LC +VGYPKETRGGLF+DP+ENRV VSTNATFLEEDH+R+H PRSKLVL+E    S RV D    S+ V + +T
Subjt:  LHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDRTSARVADGASTSTSVVDPST

Query:  SSQIR-SQELGIPRRSGRVVRRP
        S Q   SQ L +PRRSGRVV +P
Subjt:  SSQIR-SQELGIPRRSGRVVRRP

KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-18562.95Show/hide
Query:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV
        WKR CPK+LAE+K   QGK DLLV ETCLVE+ DSAWI+DSGATNHVCSSFQGI SW+ L  GE+T+RVG G +VSA A+  ++L     +LLL+N+Y+V
Subjt:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV

Query:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK
            RNL+S+ CL+E   S++F +NK FI   G  ICSA LENNLYVL+  + K++LNTE+FKTA T+ K+ K+SPKEN HLWHLRLGHINLNR E+LVK
Subjt:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK

Query:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK
        +GLL++LEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLEL+HSDLCGPMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFK YK EVEN L K
Subjt:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK

Query:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC
        ++KT RSDRGGEYMD +FQ+Y++E GI SQLSAP                   VRS                +  +YILN VPSKSV ETP +LW+GRK 
Subjt:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC

Query:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV
        SL  FRIWGCP HVL +NPKKLEPRSKLCL+VGYPK TRGG FYDPK+N+V VSTNATFLEEDH+R+H PRSK+VLNE+ +     S RV +  S  T V
Subjt:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV

Query:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI
        V   +S++  + Q L  PRRSGRV   PI
Subjt:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI

KAA0048404.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-18562.95Show/hide
Query:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV
        WKR CPK+LAE+K   QGK DLLV ETCLVE+ DSAWI+DSGATNHVCSSFQGI SW+ L  GE+T+RVG G +VSA A+  ++L     +LLL+N+Y+V
Subjt:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV

Query:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK
            RNL+S+ CL+E   S++F +NK FI   G  ICSA LENNLYVL+  + K++LNTE+FKTA T+ K+ K+SPKEN HLWHLRLGHINLNR E+LVK
Subjt:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK

Query:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK
        +GLL++LEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLEL+HSDLCGPMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFK YK EVEN L K
Subjt:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK

Query:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC
        ++KT RSDRGGEYMD +FQ+Y++E GI SQLSAP                   VRS                +  +YILN VPSKSV ETP +LW+GRK 
Subjt:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC

Query:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV
        SL  FRIWGCP HVL +NPKKLEPRSKLCL+VGYPK TRGG FYDPK+N+V VSTNATFLEEDH+R+H PRSK+VLNE+ +     S RV +  S  T V
Subjt:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV

Query:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI
        V   +S++  + Q L  PRRSGRV   PI
Subjt:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-18562.95Show/hide
Query:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV
        WKR CPK+LAE+K   QGK DLLV ETCLVE+ DSAWI+DSGATNHVCSSFQGI SW+ L  GE+T+RVG G +VSA A+  ++L     +LLL+N+Y+V
Subjt:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV

Query:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK
            RNL+S+ CL+E   S++F +NK FI   G  ICSA LENNLYVL+  + K++LNTE+FKTA T+ K+ K+SPKEN HLWHLRLGHINLNR E+LVK
Subjt:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK

Query:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK
        +GLL++LEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLEL+HSDLCGPMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFK YK EVEN L K
Subjt:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK

Query:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC
        ++KT RSDRGGEYMD +FQ+Y++E GI SQLSAP                   VRS                +  +YILN VPSKSV ETP +LW+GRK 
Subjt:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC

Query:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV
        SL  FRIWGCP HVL +NPKKLEPRSKLCL+VGYPK TRGG FYDPK+N+V VSTNATFLEEDH+R+H PRSK+VLNE+ +     S RV +  S  T V
Subjt:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV

Query:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI
        V   +S++  + Q L  PRRSGRV   PI
Subjt:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-18562.95Show/hide
Query:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV
        WKR CPK+LAE+K   QGK DLLV ETCLVE+ DSAWI+DSGATNHVCSSFQGI SW+ L  GE+T+RVG G +VSA A+  ++L     +LLL+N+Y+V
Subjt:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV

Query:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK
            RNL+S+ CL+E   S++F +NK FI   G  ICSA LENNLYVL+  + K++LNTE+FKTA T+ K+ K+SPKEN HLWHLRLGHINLNR E+LVK
Subjt:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK

Query:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK
        +GLL++LEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLEL+HSDLCGPMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFK YK EVEN L K
Subjt:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK

Query:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC
        ++KT RSDRGGEYMD +FQ+Y++E GI SQLSAP                   VRS                +  +YILN VPSKSV ETP +LW+GRK 
Subjt:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC

Query:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV
        SL  FRIWGCP HVL +NPKKLEPRSKLCL+VGYPK TRGG FYDPK+N+V VSTNATFLEEDH+R+H PRSK+VLNE+ +     S RV +  S  T V
Subjt:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV

Query:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI
        V   +S++  + Q L  PRRSGRV   PI
Subjt:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein6.7e-18662.95Show/hide
Query:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV
        WKR CPK+LAE+K   QGK DLLV ETCLVE+ DSAWI+DSGATNHVCSSFQGI SW+ L  GE+T+RVG G +VSA A+  ++L     +LLL+N+Y+V
Subjt:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV

Query:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK
            RNL+S+ CL+E   S++F +NK FI   G  ICSA LENNLYVL+  + K++LNTE+FKTA T+ K+ K+SPKEN HLWHLRLGHINLNR E+LVK
Subjt:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK

Query:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK
        +GLL++LEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLEL+HSDLCGPMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFK YK EVEN L K
Subjt:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK

Query:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC
        ++KT RSDRGGEYMD +FQ+Y++E GI SQLSAP                   VRS                +  +YILN VPSKSV ETP +LW+GRK 
Subjt:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC

Query:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV
        SL  FRIWGCP HVL +NPKKLEPRSKLCL+VGYPK TRGG FYDPK+N+V VSTNATFLEEDH+R+H PRSK+VLNE+ +     S RV +  S  T V
Subjt:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV

Query:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI
        V   +S++  + Q L  PRRSGRV   PI
Subjt:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI

A0A5A7TZD0 Gag/pol protein7.2e-18864.44Show/hide
Query:  WKRKCPKFLAERK-NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVS
        WK  CPK+L ++K  +GK DLLV ETCLVE+  +AWILDSGATNHVCSS Q   S++ L + E+TL+VG G+++SA A+   KL F  K++ L+NLYIV 
Subjt:  WKRKCPKFLAERK-NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVS

Query:  GFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKS
           RNLVS+SCLIEH  S++F +N+AFI   G  ICSA LENNLYVL+PN  K+VLN E+F+TA T+ K+ ++SP  N +LWHLRLGHINL+R  +LVK+
Subjt:  GFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKS

Query:  GLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKS
        GLLN L++ SLP CESCLEGKMTKRPF+GKGYRAKEPLELIHSDLCGPMNVKARGG+EYF+SFI DYSRYGY+YLM  KSE LEKFK YKTEVENLL K 
Subjt:  GLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKS

Query:  LKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKCS
        +K LRSDRGGEYMD  FQDYMIEHGI SQLSAP                   VRS                E  ++ILNNVPSKSV ETPFELW GRK S
Subjt:  LKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKCS

Query:  LHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDRTSARVADGASTSTSVVDPST
        L  FRIWGCP HVLV+NPKKLEPRS+LC +VGYPKETRGGLF+DP+ENRV VSTNATFLEEDH+R+H PRSKLVL+E    S RV D    S+ V + +T
Subjt:  LHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDRTSARVADGASTSTSVVDPST

Query:  SSQIR-SQELGIPRRSGRVVRRP
        S Q   SQ L +PRRSGRVV +P
Subjt:  SSQIR-SQELGIPRRSGRVVRRP

A0A5A7V4M1 Gag/pol protein6.7e-18662.95Show/hide
Query:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV
        WKR CPK+LAE+K   QGK DLLV ETCLVE+ DSAWI+DSGATNHVCSSFQGI SW+ L  GE+T+RVG G +VSA A+  ++L     +LLL+N+Y+V
Subjt:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV

Query:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK
            RNL+S+ CL+E   S++F +NK FI   G  ICSA LENNLYVL+  + K++LNTE+FKTA T+ K+ K+SPKEN HLWHLRLGHINLNR E+LVK
Subjt:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK

Query:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK
        +GLL++LEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLEL+HSDLCGPMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFK YK EVEN L K
Subjt:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK

Query:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC
        ++KT RSDRGGEYMD +FQ+Y++E GI SQLSAP                   VRS                +  +YILN VPSKSV ETP +LW+GRK 
Subjt:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC

Query:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV
        SL  FRIWGCP HVL +NPKKLEPRSKLCL+VGYPK TRGG FYDPK+N+V VSTNATFLEEDH+R+H PRSK+VLNE+ +     S RV +  S  T V
Subjt:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV

Query:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI
        V   +S++  + Q L  PRRSGRV   PI
Subjt:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI

A0A5D3CPJ6 Gag/pol protein6.7e-18662.95Show/hide
Query:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV
        WKR CPK+LAE+K   QGK DLLV ETCLVE+ DSAWI+DSGATNHVCSSFQGI SW+ L  GE+T+RVG G +VSA A+  ++L     +LLL+N+Y+V
Subjt:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV

Query:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK
            RNL+S+ CL+E   S++F +NK FI   G  ICSA LENNLYVL+  + K++LNTE+FKTA T+ K+ K+SPKEN HLWHLRLGHINLNR E+LVK
Subjt:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK

Query:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK
        +GLL++LEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLEL+HSDLCGPMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFK YK EVEN L K
Subjt:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK

Query:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC
        ++KT RSDRGGEYMD +FQ+Y++E GI SQLSAP                   VRS                +  +YILN VPSKSV ETP +LW+GRK 
Subjt:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC

Query:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV
        SL  FRIWGCP HVL +NPKKLEPRSKLCL+VGYPK TRGG FYDPK+N+V VSTNATFLEEDH+R+H PRSK+VLNE+ +     S RV +  S  T V
Subjt:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV

Query:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI
        V   +S++  + Q L  PRRSGRV   PI
Subjt:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI

A0A5D3DS88 Gag/pol protein6.7e-18662.95Show/hide
Query:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV
        WKR CPK+LAE+K   QGK DLLV ETCLVE+ DSAWI+DSGATNHVCSSFQGI SW+ L  GE+T+RVG G +VSA A+  ++L     +LLL+N+Y+V
Subjt:  WKRKCPKFLAERK--NQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIV

Query:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK
            RNL+S+ CL+E   S++F +NK FI   G  ICSA LENNLYVL+  + K++LNTE+FKTA T+ K+ K+SPKEN HLWHLRLGHINLNR E+LVK
Subjt:  SGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVK

Query:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK
        +GLL++LEENSLPVCESCLEGKMTKRPF+GKG+RAKEPLEL+HSDLCGPMNVKARGG+EYF++F  DYSRYGY+YLM  KSE LEKFK YK EVEN L K
Subjt:  SGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGK

Query:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC
        ++KT RSDRGGEYMD +FQ+Y++E GI SQLSAP                   VRS                +  +YILN VPSKSV ETP +LW+GRK 
Subjt:  SLKTLRSDRGGEYMDTEFQDYMIEHGITSQLSAP-------------------VRS---------------GELRIYILNNVPSKSVCETPFELWSGRKC

Query:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV
        SL  FRIWGCP HVL +NPKKLEPRSKLCL+VGYPK TRGG FYDPK+N+V VSTNATFLEEDH+R+H PRSK+VLNE+ +     S RV +  S  T V
Subjt:  SLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDR----TSARVADGASTSTSV

Query:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI
        V   +S++  + Q L  PRRSGRV   PI
Subjt:  VDPSTSSQI-RSQELGIPRRSGRVVRRPI

SwissProt top hitse value%identityAlignment
P04146 Copia protein7.0e-3125.05Show/hide
Query:  WILDSGATNHVCSS----FQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVSGFTRNLVSISCLIEHCISVSFELNKAFISFK
        ++LDSGA++H+ +        ++   PL+          GE + A     V+L    +  L D L+       NL+S+  L E  +S+ F+ +   IS  
Subjt:  WILDSGATNHVCSS----FQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVSGFTRNLVSISCLIEHCISVSFELNKAFISFK

Query:  GNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHIN------LNRNEKLVKSGLLNDLEENSLPVCESCLEGKMTKR
                 +N L V+K + + + +    F+      K      K N  LWH R GHI+      + R        LLN+L E S  +CE CL GK  + 
Subjt:  GNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHIN------LNRNEKLVKSGLLNDLEENSLPVCESCLEGKMTKR

Query:  PFSGKGYRA--KEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIE
        PF     +   K PL ++HSD+CGP+         YFV F+  ++ Y   YL+  KS+    F+ +  + E      +  L  D G EY+  E + + ++
Subjt:  PFSGKGYRA--KEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIE

Query:  HGITSQLSAP------------VRS-------------------GELRI---YILNNVPSKSVCE---TPFELWSGRKCSLHQFRIWGCPTHVLVSNPK-
         GI+  L+ P            +R+                   GE  +   Y++N +PS+++ +   TP+E+W  +K  L   R++G   +V + N + 
Subjt:  HGITSQLSAP------------VRS-------------------GELRI---YILNNVPSKSVCE---TPFELWSGRKCSLHQFRIWGCPTHVLVSNPK-

Query:  KLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLV
        K + +S   ++VGY  E  G   +D    + +V+ +    E + V     + + V
Subjt:  KLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.5e-3826.31Show/hide
Query:  ETCL-VESSDSAWILDSGATNHVCSSFQGIDSWQPLRE-------GEV-TLRVGYGELVSAAAIDTVKLHFS-GKYLLLDNLYIVSGFTRNLVSISCLIE
        E C+ +   +S W++D+ A++H            P+R+       G+  T+++G       A I  + +  + G  L+L ++  V     NL+S    ++
Subjt:  ETCL-VESSDSAWILDSGATNHVCSSFQGIDSWQPLRE-------GEV-TLRVGYGELVSAAAIDTVKLHFS-GKYLLLDNLYIVSGFTRNLVSISCLIE

Query:  HCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKT-AETRTKKAKVSPKE-NVHLWHLRLGHINLNRNEKLVKSGLLNDLEENSLP
             S+  N+ +   KG+++ +               K V    L++T AE    +   +  E +V LWH R+GH++    + L K  L++  +  ++ 
Subjt:  HCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKT-AETRTKKAKVSPKE-NVHLWHLRLGHINLNRNEKLVKSGLLNDLEENSLP

Query:  VCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKSLKTLRSDRGGEY
         C+ CL GK  +  F     R    L+L++SD+CGPM +++ GG +YFV+FI D SR  ++Y++  K +  + F+ +   VE   G+ LK LRSD GGEY
Subjt:  VCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKSLKTLRSDRGGEY

Query:  MDTEFQDYMIEHGITSQLSAP-------------------VRS------------GE---LRIYILNNVPSKSVC-ETPFELWSGRKCSLHQFRIWGCP-
           EF++Y   HGI  + + P                   VRS            GE      Y++N  PS  +  E P  +W+ ++ S    +++GC  
Subjt:  MDTEFQDYMIEHGITSQLSAP-------------------VRS------------GE---LRIYILNNVPSKSVC-ETPFELWSGRKCSLHQFRIWGCP-

Query:  -THVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDRTSARVADGASTSTSVVDPSTSSQIRSQ
          HV      KL+ +S  C+++GY  E  G   +DP + +V+ S +  F  E  VR     S+ V N +      +   ++  TS    ST+ ++  Q
Subjt:  -THVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRSKLVLNEMDRTSARVADGASTSTSVVDPSTSSQIRSQ

Q12491 Transposon Ty2-B Gag-Pol polyprotein1.0e-1324.18Show/hide
Query:  ILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVSGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVIC
        ++DSGA+  +  S   +    P    E+ +     + +   AI  +  +F                  +L+S+S L    I+  F  N    S  G V+ 
Subjt:  ILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVSGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVIC

Query:  SASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKSGLLNDLEENSLP-------VCESCLEGKMTKRPFSG
              + Y L   S K ++ + + K       K+K   K    L H  LGH N    +K +K   +  L+E+ +         C  CL GK TK     
Subjt:  SASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKSGLLNDLEENSLP-------VCESCLEGKMTKRPFSG

Query:  KGYRAK-----EPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSE--TLEKFKGYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMI
        KG R K     EP + +H+D+ GP++   +    YF+SF  + +R+ ++Y +H + E   L  F      ++N     +  ++ DRG EY +     +  
Subjt:  KGYRAK-----EPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSE--TLEKFKGYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMI

Query:  EHGITS
          GIT+
Subjt:  EHGITS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-2925.63Show/hide
Query:  SDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVSGFTRNLVSISCLIEHCISVSFELNKAFISFK
        S + W+LDSGAT+H+ S F  +   QP   G+  + V  G  +  +   +  L    + L L N+  V    +NL+S+  L  +   VS E   A    K
Subjt:  SDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVSGFTRNLVSISCLIEHCISVSFELNKAFISFK

Query:  ----GNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKSGLLNDLE-ENSLPVCESCLEGKMTKRP
            G  +     ++ LY      I S     LF         A  S K     WH RLGH   +    ++ +  L+ L   +    C  CL  K  K P
Subjt:  ----GNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKSGLLNDLE-ENSLPVCESCLEGKMTKRP

Query:  FSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGI
        FS     +  PLE I+SD+     + +   Y Y+V F+  ++RY ++Y + +KS+  E F  +K  +EN     + T  SD GGE++     +Y  +HGI
Subjt:  FSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGI

Query:  TSQLSAP----------------VRSG------------------ELRIYILNNVPSKSV-CETPFELWSGRKCSLHQFRIWGCPTHVLVS--NPKKLEP
        +   S P                V +G                   + +Y++N +P+  +  E+PF+   G   +  + R++GC  +  +   N  KL+ 
Subjt:  TSQLSAP----------------VRSG------------------ELRIYILNNVPSKSV-CETPFELWSGRKCSLHQFRIWGCPTHVLVS--NPKKLEP

Query:  RSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLE
        +S+ C+++GY       L    + +R+ +S +  F E
Subjt:  RSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLE

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-2824.71Show/hide
Query:  WILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVSGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVI
        W+LDSGAT+H+ S F  +   QP   G+  + +  G  +      +  L  S + L L+ +  V    +NL+S+  L           N+  + F     
Subjt:  WILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVSGFTRNLVSISCLIEHCISVSFELNKAFISFKGNVI

Query:  CSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKSGLLNDLE-ENSLPVCESCLEGKMTKRPFSGKGYRA
            L   + +L+    K  L      +++  +  A    K     WH RLGH +L     ++ +  L  L   + L  C  C   K  K PFS     +
Subjt:  CSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKSGLLNDLE-ENSLPVCESCLEGKMTKRPFSGKGYRA

Query:  KEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGI---TSQLS
         +PLE I+SD+     + +   Y Y+V F+  ++RY ++Y + +KS+  + F  +K+ VEN     + TL SD GGE++    +DY+ +HGI   TS   
Subjt:  KEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGI---TSQLS

Query:  APVRSG-------------------------------ELRIYILNNVPSKSV-CETPFELWSGRKCSLHQFRIWGCPTHVLVS--NPKKLEPRSKLCLYV
         P  +G                                + +Y++N +P+  +  ++PF+   G+  +  + +++GC  +  +   N  KLE +SK C ++
Subjt:  APVRSG-------------------------------ELRIYILNNVPSKSV-CETPFELWSGRKCSLHQFRIWGCPTHVLVS--NPKKLEPRSKLCLYV

Query:  GYPKETRGGLFYDPKENRVLVSTNATFLE
        GY       L       R+  S +  F E
Subjt:  GYPKETRGGLFYDPKENRVLVSTNATFLE

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein1.1e-1037.08Show/hide
Query:  TAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKSGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNV
        + ET       + K+   LWH RL H++    E LVK G L+  + +SL  CE C+ GK  +  FS   +  K PL+ +HSDL G  +V
Subjt:  TAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKSGLLNDLEENSLPVCESCLEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGCGACTTTGGAAGAGGAAGTGTCCCAAATTCCTGGCCGAGAGGAAGAATCAAGGTAAATGTGATTTACTTGTGACTGAAACATGTTTAGTGGAGAGTAGTGACTC
TGCCTGGATATTGGATTCGGGCGCCACTAACCACGTTTGTTCTTCTTTTCAGGGGATTGATTCCTGGCAGCCGCTGCGAGAGGGTGAGGTGACTCTACGGGTTGGATACG
GGGAGCTTGTCTCTGCTGCAGCGATCGACACGGTGAAGCTACACTTTAGCGGGAAGTACTTATTGTTGGACAATTTGTACATAGTCTCAGGGTTTACTAGAAATCTTGTT
TCAATTTCCTGCTTAATTGAACATTGTATTTCGGTTTCTTTTGAATTAAATAAAGCGTTTATTTCTTTCAAAGGGAATGTTATTTGTTCAGCTTCACTCGAAAATAATCT
GTATGTTTTGAAACCTAATTCGATCAAAAGTGTTTTGAATACTGAATTGTTTAAAACTGCAGAAACACGAACTAAGAAAGCAAAAGTTTCTCCAAAAGAAAATGTCCATC
TTTGGCATCTACGGTTAGGCCACATTAATCTCAATAGGAATGAGAAACTAGTGAAGAGTGGACTTCTAAACGATTTGGAAGAAAACTCTTTGCCGGTGTGTGAGTCATGC
CTTGAGGGTAAGATGACCAAACGTCCTTTTAGTGGAAAAGGATATAGAGCCAAAGAGCCTCTTGAGTTAATACATTCTGACCTCTGTGGTCCGATGAATGTTAAAGCTCG
AGGCGGTTATGAGTATTTCGTATCTTTCATAGCCGATTACTCCAGATATGGGTATATTTACCTAATGCACAAAAAGTCTGAAACTCTTGAAAAGTTCAAGGGGTACAAGA
CTGAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATCGAGGTGGAGAGTACATGGACACTGAATTCCAGGACTATATGATAGAACACGGAATTACG
TCTCAACTCTCAGCACCTGTACGCAGTGGAGAGCTGCGCATTTATATTTTGAACAACGTTCCGTCGAAGAGTGTTTGTGAAACACCTTTCGAGCTCTGGAGTGGACGTAA
ATGCAGTTTACATCAGTTCAGAATTTGGGGATGCCCGACTCATGTGTTGGTGTCAAACCCGAAAAAGTTGGAACCTCGTTCGAAATTGTGCCTATACGTAGGTTACCCTA
AAGAGACTAGAGGTGGTCTGTTTTACGATCCTAAGGAAAATAGAGTGCTTGTGTCAACAAACGCCACTTTCCTTGAGGAAGACCATGTCAGGGATCATTTACCAAGAAGT
AAACTTGTGTTGAATGAAATGGACCGTACATCAGCAAGAGTTGCTGATGGGGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCACGTCTAGTCAAATCCGTTCCCAAGA
GTTGGGAATACCTCGACGTAGTGGGAGGGTCGTGAGGCGACCGATCGCTTCATGGGTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGCGACTTTGGAAGAGGAAGTGTCCCAAATTCCTGGCCGAGAGGAAGAATCAAGGTAAATGTGATTTACTTGTGACTGAAACATGTTTAGTGGAGAGTAGTGACTC
TGCCTGGATATTGGATTCGGGCGCCACTAACCACGTTTGTTCTTCTTTTCAGGGGATTGATTCCTGGCAGCCGCTGCGAGAGGGTGAGGTGACTCTACGGGTTGGATACG
GGGAGCTTGTCTCTGCTGCAGCGATCGACACGGTGAAGCTACACTTTAGCGGGAAGTACTTATTGTTGGACAATTTGTACATAGTCTCAGGGTTTACTAGAAATCTTGTT
TCAATTTCCTGCTTAATTGAACATTGTATTTCGGTTTCTTTTGAATTAAATAAAGCGTTTATTTCTTTCAAAGGGAATGTTATTTGTTCAGCTTCACTCGAAAATAATCT
GTATGTTTTGAAACCTAATTCGATCAAAAGTGTTTTGAATACTGAATTGTTTAAAACTGCAGAAACACGAACTAAGAAAGCAAAAGTTTCTCCAAAAGAAAATGTCCATC
TTTGGCATCTACGGTTAGGCCACATTAATCTCAATAGGAATGAGAAACTAGTGAAGAGTGGACTTCTAAACGATTTGGAAGAAAACTCTTTGCCGGTGTGTGAGTCATGC
CTTGAGGGTAAGATGACCAAACGTCCTTTTAGTGGAAAAGGATATAGAGCCAAAGAGCCTCTTGAGTTAATACATTCTGACCTCTGTGGTCCGATGAATGTTAAAGCTCG
AGGCGGTTATGAGTATTTCGTATCTTTCATAGCCGATTACTCCAGATATGGGTATATTTACCTAATGCACAAAAAGTCTGAAACTCTTGAAAAGTTCAAGGGGTACAAGA
CTGAGGTTGAGAACCTCTTAGGTAAATCGCTTAAAACACTTCGATCGGATCGAGGTGGAGAGTACATGGACACTGAATTCCAGGACTATATGATAGAACACGGAATTACG
TCTCAACTCTCAGCACCTGTACGCAGTGGAGAGCTGCGCATTTATATTTTGAACAACGTTCCGTCGAAGAGTGTTTGTGAAACACCTTTCGAGCTCTGGAGTGGACGTAA
ATGCAGTTTACATCAGTTCAGAATTTGGGGATGCCCGACTCATGTGTTGGTGTCAAACCCGAAAAAGTTGGAACCTCGTTCGAAATTGTGCCTATACGTAGGTTACCCTA
AAGAGACTAGAGGTGGTCTGTTTTACGATCCTAAGGAAAATAGAGTGCTTGTGTCAACAAACGCCACTTTCCTTGAGGAAGACCATGTCAGGGATCATTTACCAAGAAGT
AAACTTGTGTTGAATGAAATGGACCGTACATCAGCAAGAGTTGCTGATGGGGCTAGTACGTCAACAAGTGTTGTTGATCCTAGCACGTCTAGTCAAATCCGTTCCCAAGA
GTTGGGAATACCTCGACGTAGTGGGAGGGTCGTGAGGCGACCGATCGCTTCATGGGTTTAG
Protein sequenceShow/hide protein sequence
MGRLWKRKCPKFLAERKNQGKCDLLVTETCLVESSDSAWILDSGATNHVCSSFQGIDSWQPLREGEVTLRVGYGELVSAAAIDTVKLHFSGKYLLLDNLYIVSGFTRNLV
SISCLIEHCISVSFELNKAFISFKGNVICSASLENNLYVLKPNSIKSVLNTELFKTAETRTKKAKVSPKENVHLWHLRLGHINLNRNEKLVKSGLLNDLEENSLPVCESC
LEGKMTKRPFSGKGYRAKEPLELIHSDLCGPMNVKARGGYEYFVSFIADYSRYGYIYLMHKKSETLEKFKGYKTEVENLLGKSLKTLRSDRGGEYMDTEFQDYMIEHGIT
SQLSAPVRSGELRIYILNNVPSKSVCETPFELWSGRKCSLHQFRIWGCPTHVLVSNPKKLEPRSKLCLYVGYPKETRGGLFYDPKENRVLVSTNATFLEEDHVRDHLPRS
KLVLNEMDRTSARVADGASTSTSVVDPSTSSQIRSQELGIPRRSGRVVRRPIASWV