CuGenDBv2

Tan0017081 (gene) of Snake gourd v1 genome

Gene ID: Tan0017081
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG08:54722820..54724767
RNA-Seq Expression: Tan0017081
Synteny: Tan0017081
Gene Ontology terms:
    GO:0015074 - DNA integration (biological process)
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR001584 - Integrase, catalytic core
    IPR001878 - Zinc finger, CCHC-type
    IPR012337 - Ribonuclease H-like superfamily
    IPR025724 - GAG-pre-integrase domain
    IPR036397 - Ribonuclease H superfamily
    IPR036875 - Zinc finger, CCHC-type superfamily
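
The genome location string in this record can be split into a sequence ID and coordinates for downstream use. The following is a minimal Python sketch, not part of CuGenDB: the helper name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts.

    Hypothetical helper for illustration; the format is inferred from
    the single location string shown in this record.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location format: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG08:54722820..54724767")

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end - start + 1
print(seqid, start, end, span)  # LG08 54722820 54724767 1948
```

Under that assumption the gene spans 1948 bp on LG08.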


Homology
GenBank top hits | e value | % identity | Alignment
KAA0059226.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.9e-128 | % identity: 56.02
Query:  PSPPKGKKKMKR----GKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKKNQ--------------------------------------
        PSP   +K  KR    GK    A +   K K VA K KCFHCN   HWK NC K+L +KK +                                      
Subjt:  PSPPKGKKKMKR----GKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKKNQ--------------------------------------

Query:  -------GHINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKK
               GHINL+RI +LV +GLLN+L++   P CESCLEGKMTK PF+GKGYRA+EPLELIHSDLCGPMN+KARGG+EYF+SFID YSRYG+ YLM  K
Subjt:  -------GHINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKK

Query:  SETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN
        SE LEKFKEYKTEVENLL K +K  RSDRGGEYMD  FQDYMIEH I SQLSAPG PQQNG+SERRNRTLLDMVRS MSYA+LP SFWGYAVETAV+ILN
Subjt:  SETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN

Query:  NNLGMPDPCVGVKPEKV--------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLN
        N   +P   V   P ++              G P  +L                + YPKETRGGL F P++NRV VSTNATFLEE+H+R+H PRSK+VL+
Subjt:  NNLGMPDPCVGVKPEKV--------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLN

Query:  EIDSTSARVADGASTSTSVVDPITSSQIR-SQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE
        E    S RV D    S+ V +  TS Q   SQ L MP+RSGRVV QP+RY+GL ET VV PDD  EDPL+Y QAM DVDKD+
Subjt:  EIDSTSARVADGASTSTSVVDPITSSQIR-SQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE

KAA0060254.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 2.0e-130 | % identity: 58.12
Query:  EAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKK--NQGHINLNRIEKL
        + EANVA   R +HRG TSGTK +  S    K K K+G    K + AAA+  KK+K  A KG CFH N  GHWKRNC K+L EKK   QGHINLNRIE+L
Subjt:  EAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKK--NQGHINLNRIEKL

Query:  VNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLL
        V +G+L+ELEEN  P+CESCLEGKMTK PF+GKG+RA+EPLEL+HSDLCGPMN+KARG +EYF++F D YSRYG+ YLM  KSE LEKFKEYK EVEN L
Subjt:  VNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLL

Query:  GKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV-
         K++KT RSDRGGEYMD +FQ+Y++E EI SQLSAPG PQQNG+SERRNRTLLDMVRS +SYA LP+SFWGYAV+TAVYILN    +P   V   P K+ 
Subjt:  GKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV-

Query:  -------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLN----EIDSTSARVADGAS
                     G P  +L                + YPK TRGG  +  KDN+V V TNATFLE++HIR+H PRSK+VLN    EI   S RV + +S
Subjt:  -------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLN----EIDSTSARVADGAS

Query:  TSTSVVDPITSSQ-IRSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE
          T VV   +S++  + Q L  P+RSGRV   P RYM L ET  V  D D EDPLT+ +AM DVDKDE
Subjt:  TSTSVVDPITSSQ-IRSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE

TYJ96910.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.5e-138 | % identity: 56.48
Query:  MNTINYSLTTLLNELQTFQSLMRIRTSEAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHW
        MN I+Y+LTTLLNELQTF+SLM+I+  + EANVA   R +HRG TSGTK +  S    K K  +G    K + AAA+  KK K  A KG  FHCN  GHW
Subjt:  MNTINYSLTTLLNELQTFQSLMRIRTSEAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHW

Query:  KRNCSKFLGEKKNQG------------------------HINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGP
        KRNC K+L EKK                           HINLNRIE+LV +GLL+ELEEN+ PVCESCLEGKMTK PF+GKG+RA+EPLEL+HSDLCGP
Subjt:  KRNCSKFLGEKKNQG------------------------HINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGP

Query:  MNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRT
        MN+KARGG+EYF++F D YSRYG+ YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNG+SERRN+T
Subjt:  MNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRT

Query:  LLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV--------------GTPFEILPIR---------------YPKETRGGLCFYP
        LLDMV S MSYA LP+SFWGYAV+TAVYILN    +P   V   P K+              G P  +L I                Y K +RGG  + P
Subjt:  LLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV--------------GTPFEILPIR---------------YPKETRGGLCFYP

Query:  KDNRVLVSTNATFLEENHIRDHLPRSKMVLNEIDS----TSARVADGASTSTSVVDPITSSQI-RSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDC
        KDN+VLVSTNATFLEE+HIR+H PRSK+VLNE+ +     S RV +  S  TSVV   +S++  + Q L  P+RSGRV   P RYM L ET  V  D D 
Subjt:  KDNRVLVSTNATFLEENHIRDHLPRSKMVLNEIDS----TSARVADGASTSTSVVDPITSSQI-RSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDC

Query:  EDPLTYDQAMVDVDKDE
        EDPLT+ +AM DVDKDE
Subjt:  EDPLTYDQAMVDVDKDE

TYJ97618.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 9.5e-133 | % identity: 58.55
Query:  EAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKK--NQGHINLNRIEKL
        + EANVA   R +HRG TSGTK +  S    K K K+G    K + AAA+  KK+K  A KG CFH N  GHWKRNC K+L EKK   QGHINLNRIE+L
Subjt:  EAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKK--NQGHINLNRIEKL

Query:  VNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLL
        V +G+L+ELEEN  P+CESCLEGKMTK PF+GKG+RA+EPLEL+HSDLCGPMN+KARG +EYF++F D YSRYG+ YLM  KSE LEKFKEYK EVEN L
Subjt:  VNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLL

Query:  GKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV-
         K++KT RSDRGGEYMD +FQ+Y++E EI SQLSAPG PQQNG+SERRNRTLLDMVRS +SYA LP+SFWGYAV+TAVYILN    +P   V   P K+ 
Subjt:  GKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV-

Query:  -------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLNEID----STSARVADGAS
                     G P  +L                + YPK TRGG  + PKDN+V VSTNATFLEE+HIR+H PRSK+VLNE+       S RV +  S
Subjt:  -------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLNEID----STSARVADGAS

Query:  TSTSVVDPITSSQI-RSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE
          T VV   +S++  + Q L  P+RSGRV   P RYM L ET  V  D D EDPLT+ +AM DVDKDE
Subjt:  TSTSVVDPITSSQI-RSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE

TYK02840.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 1.9e-128 | % identity: 56.02
Query:  PSPPKGKKKMKR----GKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKKNQ--------------------------------------
        PSP   +K  KR    GK    A +   K K VA K KCFHCN   HWK NC K+L +KK +                                      
Subjt:  PSPPKGKKKMKR----GKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKKNQ--------------------------------------

Query:  -------GHINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKK
               GHINL+RI +LV +GLLN+L++   P CESCLEGKMTK PF+GKGYRA+EPLELIHSDLCGPMN+KARGG+EYF+SFID YSRYG+ YLM  K
Subjt:  -------GHINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKK

Query:  SETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN
        SE LEKFKEYKTEVENLL K +K  RSDRGGEYMD  FQDYMIEH I SQLSAPG PQQNG+SERRNRTLLDMVRS MSYA+LP SFWGYAVETAV+ILN
Subjt:  SETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN

Query:  NNLGMPDPCVGVKPEKV--------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLN
        N   +P   V   P ++              G P  +L                + YPKETRGGL F P++NRV VSTNATFLEE+H+R+H PRSK+VL+
Subjt:  NNLGMPDPCVGVKPEKV--------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLN

Query:  EIDSTSARVADGASTSTSVVDPITSSQIR-SQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE
        E    S RV D    S+ V +  TS Q   SQ L MP+RSGRVV QP+RY+GL ET VV PDD  EDPL+Y QAM DVDKD+
Subjt:  EIDSTSARVADGASTSTSVVDPITSSQIR-SQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE

TrEMBL top hits | e value | % identity | Alignment
A0A5A7UYE8 Gag/pol protein | e value: 9.0e-129 | % identity: 56.02
Query:  PSPPKGKKKMKR----GKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKKNQ--------------------------------------
        PSP   +K  KR    GK    A +   K K VA K KCFHCN   HWK NC K+L +KK +                                      
Subjt:  PSPPKGKKKMKR----GKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKKNQ--------------------------------------

Query:  -------GHINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKK
               GHINL+RI +LV +GLLN+L++   P CESCLEGKMTK PF+GKGYRA+EPLELIHSDLCGPMN+KARGG+EYF+SFID YSRYG+ YLM  K
Subjt:  -------GHINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKK

Query:  SETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN
        SE LEKFKEYKTEVENLL K +K  RSDRGGEYMD  FQDYMIEH I SQLSAPG PQQNG+SERRNRTLLDMVRS MSYA+LP SFWGYAVETAV+ILN
Subjt:  SETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN

Query:  NNLGMPDPCVGVKPEKV--------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLN
        N   +P   V   P ++              G P  +L                + YPKETRGGL F P++NRV VSTNATFLEE+H+R+H PRSK+VL+
Subjt:  NNLGMPDPCVGVKPEKV--------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLN

Query:  EIDSTSARVADGASTSTSVVDPITSSQIR-SQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE
        E    S RV D    S+ V +  TS Q   SQ L MP+RSGRVV QP+RY+GL ET VV PDD  EDPL+Y QAM DVDKD+
Subjt:  EIDSTSARVADGASTSTSVVDPITSSQIR-SQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE

A0A5A7UYX7 Gag/pol protein | e value: 9.6e-131 | % identity: 58.12
Query:  EAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKK--NQGHINLNRIEKL
        + EANVA   R +HRG TSGTK +  S    K K K+G    K + AAA+  KK+K  A KG CFH N  GHWKRNC K+L EKK   QGHINLNRIE+L
Subjt:  EAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKK--NQGHINLNRIEKL

Query:  VNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLL
        V +G+L+ELEEN  P+CESCLEGKMTK PF+GKG+RA+EPLEL+HSDLCGPMN+KARG +EYF++F D YSRYG+ YLM  KSE LEKFKEYK EVEN L
Subjt:  VNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLL

Query:  GKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV-
         K++KT RSDRGGEYMD +FQ+Y++E EI SQLSAPG PQQNG+SERRNRTLLDMVRS +SYA LP+SFWGYAV+TAVYILN    +P   V   P K+ 
Subjt:  GKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV-

Query:  -------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLN----EIDSTSARVADGAS
                     G P  +L                + YPK TRGG  +  KDN+V V TNATFLE++HIR+H PRSK+VLN    EI   S RV + +S
Subjt:  -------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLN----EIDSTSARVADGAS

Query:  TSTSVVDPITSSQ-IRSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE
          T VV   +S++  + Q L  P+RSGRV   P RYM L ET  V  D D EDPLT+ +AM DVDKDE
Subjt:  TSTSVVDPITSSQ-IRSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE

A0A5D3BAN6 Gag/pol protein | e value: 7.3e-139 | % identity: 56.48
Query:  MNTINYSLTTLLNELQTFQSLMRIRTSEAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHW
        MN I+Y+LTTLLNELQTF+SLM+I+  + EANVA   R +HRG TSGTK +  S    K K  +G    K + AAA+  KK K  A KG  FHCN  GHW
Subjt:  MNTINYSLTTLLNELQTFQSLMRIRTSEAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHW

Query:  KRNCSKFLGEKKNQG------------------------HINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGP
        KRNC K+L EKK                           HINLNRIE+LV +GLL+ELEEN+ PVCESCLEGKMTK PF+GKG+RA+EPLEL+HSDLCGP
Subjt:  KRNCSKFLGEKKNQG------------------------HINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGP

Query:  MNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRT
        MN+KARGG+EYF++F D YSRYG+ YLM  KSE LEKFKEYK EVEN L K++KT RSDRGGEYMD +FQ+Y++E  I SQLSAPG PQQNG+SERRN+T
Subjt:  MNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRT

Query:  LLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV--------------GTPFEILPIR---------------YPKETRGGLCFYP
        LLDMV S MSYA LP+SFWGYAV+TAVYILN    +P   V   P K+              G P  +L I                Y K +RGG  + P
Subjt:  LLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV--------------GTPFEILPIR---------------YPKETRGGLCFYP

Query:  KDNRVLVSTNATFLEENHIRDHLPRSKMVLNEIDS----TSARVADGASTSTSVVDPITSSQI-RSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDC
        KDN+VLVSTNATFLEE+HIR+H PRSK+VLNE+ +     S RV +  S  TSVV   +S++  + Q L  P+RSGRV   P RYM L ET  V  D D 
Subjt:  KDNRVLVSTNATFLEENHIRDHLPRSKMVLNEIDS----TSARVADGASTSTSVVDPITSSQI-RSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDC

Query:  EDPLTYDQAMVDVDKDE
        EDPLT+ +AM DVDKDE
Subjt:  EDPLTYDQAMVDVDKDE

A0A5D3BHG7 Gag/pol protein | e value: 4.6e-133 | % identity: 58.55
Query:  EAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKK--NQGHINLNRIEKL
        + EANVA   R +HRG TSGTK +  S    K K K+G    K + AAA+  KK+K  A KG CFH N  GHWKRNC K+L EKK   QGHINLNRIE+L
Subjt:  EAEANVA--IRSYHRGWTSGTKPVAPSPPKGKKKMKRG----KTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKK--NQGHINLNRIEKL

Query:  VNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLL
        V +G+L+ELEEN  P+CESCLEGKMTK PF+GKG+RA+EPLEL+HSDLCGPMN+KARG +EYF++F D YSRYG+ YLM  KSE LEKFKEYK EVEN L
Subjt:  VNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLL

Query:  GKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV-
         K++KT RSDRGGEYMD +FQ+Y++E EI SQLSAPG PQQNG+SERRNRTLLDMVRS +SYA LP+SFWGYAV+TAVYILN    +P   V   P K+ 
Subjt:  GKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKV-

Query:  -------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLNEID----STSARVADGAS
                     G P  +L                + YPK TRGG  + PKDN+V VSTNATFLEE+HIR+H PRSK+VLNE+       S RV +  S
Subjt:  -------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLNEID----STSARVADGAS

Query:  TSTSVVDPITSSQI-RSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE
          T VV   +S++  + Q L  P+RSGRV   P RYM L ET  V  D D EDPLT+ +AM DVDKDE
Subjt:  TSTSVVDPITSSQI-RSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE

A0A5D3BUN8 Gag/pol protein | e value: 9.0e-129 | % identity: 56.02
Query:  PSPPKGKKKMKR----GKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKKNQ--------------------------------------
        PSP   +K  KR    GK    A +   K K VA K KCFHCN   HWK NC K+L +KK +                                      
Subjt:  PSPPKGKKKMKR----GKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKKNQ--------------------------------------

Query:  -------GHINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKK
               GHINL+RI +LV +GLLN+L++   P CESCLEGKMTK PF+GKGYRA+EPLELIHSDLCGPMN+KARGG+EYF+SFID YSRYG+ YLM  K
Subjt:  -------GHINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKK

Query:  SETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN
        SE LEKFKEYKTEVENLL K +K  RSDRGGEYMD  FQDYMIEH I SQLSAPG PQQNG+SERRNRTLLDMVRS MSYA+LP SFWGYAVETAV+ILN
Subjt:  SETLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN

Query:  NNLGMPDPCVGVKPEKV--------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLN
        N   +P   V   P ++              G P  +L                + YPKETRGGL F P++NRV VSTNATFLEE+H+R+H PRSK+VL+
Subjt:  NNLGMPDPCVGVKPEKV--------------GTPFEILP---------------IRYPKETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLN

Query:  EIDSTSARVADGASTSTSVVDPITSSQIR-SQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE
        E    S RV D    S+ V +  TS Q   SQ L MP+RSGRVV QP+RY+GL ET VV PDD  EDPL+Y QAM DVDKD+
Subjt:  EIDSTSARVADGASTSTSVVDPITSSQIR-SQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPLTYDQAMVDVDKDE

SwissProt top hits | e value | % identity | Alignment
P04146 Copia protein | e value: 9.5e-27 | % identity: 34.2
Query:  INLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRA--EEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKF
        + + R     +  LLN LE +   +CE CL GK  + PF     +   + PL ++HSD+CGP+         YFV F+D ++ Y  TYL+  KS+    F
Subjt:  INLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRA--EEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKF

Query:  KEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN
        +++  + E      +     D G EY+  E + + ++  I+  L+ P  PQ NG+SER  RT+ +  R+ +S A+L  SFWG AV TA Y++N
Subjt:  KEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 | e value: 2.2e-39 | % identity: 38.36
Query:  KNQGHINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETL
        K  GH++   ++ L    L++  +      C+ CL GK  +  F     R    L+L++SD+CGPM I++ GG +YFV+FID  SR    Y++  K +  
Subjt:  KNQGHINLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETL

Query:  EKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLG
        + F+++   VE   G+ LK  RSD GGEY   EF++Y   H I  + + PG PQ NG++ER NRT+++ VRS +  A+LP SFWG AV+TA Y++N +  
Subjt:  EKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLG

Query:  MPDPCVGVKPEKVGTPFEI
        +  P     PE+V T  E+
Subjt:  MPDPCVGVKPEKVGTPFEI

Q12491 Transposon Ty2-B Gag-Pol polyprotein | e value: 4.4e-16 | % identity: 28.37
Query:  GHINLNRIEKLVNSGLLNELEEN-------FSPVCESCLEGKMTKCPFSGKGYRAE-----EPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTY
        GH N   I+K +    +  L+E+        +  C  CL GK TK     KG R +     EP + +H+D+ GP++   +    YF+SF D  +R+   Y
Subjt:  GHINLNRIEKLVNSGLLNELEEN-------FSPVCESCLEGKMTKCPFSGKGYRAE-----EPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTY

Query:  LMHKKSE--TLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVE
         +H + E   L  F      ++N     +   + DRG EY +     +     IT+  +     + +G++ER NRTLL+  R+ +  + LP+  W  AVE
Subjt:  LMHKKSE--TLEKFKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVE

Query:  TAVYILNN
         +  I N+
Subjt:  TAVYILNN

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 | e value: 1.5e-24 | % identity: 31.92
Query:  LNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYK
        LN +    +  +LN   +  S  C  CL  K  K PFS     +  PLE I+SD+     I +   Y Y+V F+D+++RY   Y + +KS+  E F  +K
Subjt:  LNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYK

Query:  TEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVG
          +EN     + T  SD GGE++     +Y  +H I+   S P  P+ NG+SER++R +++   + +S+A +P ++W YA   AVY++N    +P P + 
Subjt:  TEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVG

Query:  VKPEKVGTPFEIL
            ++ +PF+ L
Subjt:  VKPEKVGTPFEIL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE2 | e value: 1.6e-26 | % identity: 31.65
Query:  GHINLNRIEKLVNSGLLNELEENFSPV-CESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEK
        GH +L  +  ++++  L  L  +   + C  C   K  K PFS     + +PLE I+SD+     I +   Y Y+V F+D+++RY   Y + +KS+  + 
Subjt:  GHINLNRIEKLVNSGLLNELEENFSPV-CESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEK

Query:  FKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMP
        F  +K+ VEN     + T  SD GGE++    +DY+ +H I+   S P  P+ NG+SER++R +++M  + +S+A +P ++W YA   AVY++N    +P
Subjt:  FKEYKTEVENLLGKSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMP

Query:  DPCVGVKPEKVGTPFEIL---PIRYPKETRGGLCFYP
         P +     ++ +PF+ L   P  Y K    G   YP
Subjt:  DPCVGVKPEKVGTPFEIL---PIRYPKETRGGLCFYP

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGAACACGATAAACTACTCACTGACAACTCTTCTTAACGAGCTACAAACCTTCCAGTCCTTGATGAGGATCAGGACGTCGGAAGCTGAGGCAAACGTTGCCATTAGGTC
TTATCACAGGGGTTGGACCTCTGGGACAAAGCCTGTAGCTCCTTCACCCCCGAAAGGGAAGAAAAAGATGAAGAGGGGTAAAACTGATCGAGCTGCAGCCCAAAAGGGCA
AGAAGACCAAGGAAGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATGGGGGCGGACACTGGAAGAGGAACTGCTCCAAATTCCTAGGCGAGAAAAAGAATCAAGGCCAC
ATTAATCTCAATAGGATTGAGAAACTAGTGAATAGTGGACTTCTAAACGAGTTGGAAGAAAACTTTTCACCGGTGTGTGAGTCATGCCTTGAAGGCAAAATGACCAAATG
TCCTTTTAGTGGAAAAGGATATAGAGCAGAGGAGCCCCTTGAGCTAATACACTCTGACCTCTGTGGTCCGATGAATATTAAAGCACGAGGTGGTTATGAATACTTCGTGT
CTTTCATAGATTATTACTCGAGGTATGGGCATACTTACCTAATGCATAAGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGAGGTTGAGAACCTGTTAGGT
AAATCGCTTAAAACACATCGATCGGATCGAGGTGGAGAGTATATGGACACTGAATTCCAAGACTATATGATAGAACACGAAATTACGTCCCAACTCTCAGCACCTGGTAT
GCCACAACAGAATGGCATATCGGAGAGGAGAAACAGAACCTTGTTGGACATGGTTCGGTCGACGATGAGCTATGCTCGTCTCCCTGATTCCTTTTGGGGTTACGCAGTGG
AGACTGCGGTTTACATTTTGAACAACAATTTAGGGATGCCCGACCCATGTGTTGGTGTCAAACCCGAAAAAGTTGGAACCCCGTTCGAAATTTTGCCTATTCGTTACCCA
AAAGAGACCAGAGGTGGTCTGTGTTTTTATCCTAAGGATAATAGGGTGCTTGTGTCGACAAACGCCACTTTCCTTGAGGAAAATCATATCAGGGATCATTTACCAAGGAG
TAAAATGGTGTTAAATGAAATCGATAGTACGTCAGCAAGAGTTGCTGATGGGGCTAGTACATCAACAAGTGTTGTTGATCCTATAACGTCTAGTCAAATTCGTTCCCAAG
AGTTGGGAATGCCTCAACGTAGTGGGAGGGTTGTGAGACAGCCTGATCGTTACATGGGTTTAGCTGAAACCCCAGTTGTCACTCCTGATGATGACTGCGAGGATCCATTG
ACCTATGATCAGGCAATGGTAGATGTTGACAAAGACGAATAG
mRNA sequence
ATGAACACGATAAACTACTCACTGACAACTCTTCTTAACGAGCTACAAACCTTCCAGTCCTTGATGAGGATCAGGACGTCGGAAGCTGAGGCAAACGTTGCCATTAGGTC
TTATCACAGGGGTTGGACCTCTGGGACAAAGCCTGTAGCTCCTTCACCCCCGAAAGGGAAGAAAAAGATGAAGAGGGGTAAAACTGATCGAGCTGCAGCCCAAAAGGGCA
AGAAGACCAAGGAAGTTGCAGAGAAAGGAAAGTGTTTCCACTGCAATGGGGGCGGACACTGGAAGAGGAACTGCTCCAAATTCCTAGGCGAGAAAAAGAATCAAGGCCAC
ATTAATCTCAATAGGATTGAGAAACTAGTGAATAGTGGACTTCTAAACGAGTTGGAAGAAAACTTTTCACCGGTGTGTGAGTCATGCCTTGAAGGCAAAATGACCAAATG
TCCTTTTAGTGGAAAAGGATATAGAGCAGAGGAGCCCCTTGAGCTAATACACTCTGACCTCTGTGGTCCGATGAATATTAAAGCACGAGGTGGTTATGAATACTTCGTGT
CTTTCATAGATTATTACTCGAGGTATGGGCATACTTACCTAATGCATAAGAAGTCTGAAACTCTTGAAAAGTTCAAGGAGTACAAGACTGAGGTTGAGAACCTGTTAGGT
AAATCGCTTAAAACACATCGATCGGATCGAGGTGGAGAGTATATGGACACTGAATTCCAAGACTATATGATAGAACACGAAATTACGTCCCAACTCTCAGCACCTGGTAT
GCCACAACAGAATGGCATATCGGAGAGGAGAAACAGAACCTTGTTGGACATGGTTCGGTCGACGATGAGCTATGCTCGTCTCCCTGATTCCTTTTGGGGTTACGCAGTGG
AGACTGCGGTTTACATTTTGAACAACAATTTAGGGATGCCCGACCCATGTGTTGGTGTCAAACCCGAAAAAGTTGGAACCCCGTTCGAAATTTTGCCTATTCGTTACCCA
AAAGAGACCAGAGGTGGTCTGTGTTTTTATCCTAAGGATAATAGGGTGCTTGTGTCGACAAACGCCACTTTCCTTGAGGAAAATCATATCAGGGATCATTTACCAAGGAG
TAAAATGGTGTTAAATGAAATCGATAGTACGTCAGCAAGAGTTGCTGATGGGGCTAGTACATCAACAAGTGTTGTTGATCCTATAACGTCTAGTCAAATTCGTTCCCAAG
AGTTGGGAATGCCTCAACGTAGTGGGAGGGTTGTGAGACAGCCTGATCGTTACATGGGTTTAGCTGAAACCCCAGTTGTCACTCCTGATGATGACTGCGAGGATCCATTG
ACCTATGATCAGGCAATGGTAGATGTTGACAAAGACGAATAG
Protein sequence
MNTINYSLTTLLNELQTFQSLMRIRTSEAEANVAIRSYHRGWTSGTKPVAPSPPKGKKKMKRGKTDRAAAQKGKKTKEVAEKGKCFHCNGGGHWKRNCSKFLGEKKNQGH
INLNRIEKLVNSGLLNELEENFSPVCESCLEGKMTKCPFSGKGYRAEEPLELIHSDLCGPMNIKARGGYEYFVSFIDYYSRYGHTYLMHKKSETLEKFKEYKTEVENLLG
KSLKTHRSDRGGEYMDTEFQDYMIEHEITSQLSAPGMPQQNGISERRNRTLLDMVRSTMSYARLPDSFWGYAVETAVYILNNNLGMPDPCVGVKPEKVGTPFEILPIRYP
KETRGGLCFYPKDNRVLVSTNATFLEENHIRDHLPRSKMVLNEIDSTSARVADGASTSTSVVDPITSSQIRSQELGMPQRSGRVVRQPDRYMGLAETPVVTPDDDCEDPL
TYDQAMVDVDKDE
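
The CDS and protein sequences listed in this record can be cross-checked by translating the CDS. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1); the record does not say which table was used. The function name `translate` is illustrative and not part of any CuGenDB tool. To stay short, it translates only the first 30 and last 15 nucleotides of the CDS as copied from this record.

```python
# Standard genetic code (assumed: NCBI translation table 1), in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS shown in this record.
cds_start = "ATGAACACGATAAACTACTCACTGACAACT"
cds_end = "GACAAAGACGAATAG"

print(translate(cds_start))  # MNTINYSLTT
print(translate(cds_end))    # DKDE
```

Both fragments agree with the start (MNTINYSLTT) and end (DKDE) of the protein sequence listed here. Passing the full CDS to `translate` in the same way would allow a complete comparison.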