; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003803 (gene) of Snake gourd v1 genome

Gene IDTan0003803
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG05:25572066..25574048
RNA-Seq ExpressionTan0003803
SyntenyTan0003803
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026233.1 gag/pol protein [Cucumis melo var. makuwa]2.5e-12657.02Show/hide
Query:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG
        M VHFN+A  N A IDE+SQVS ILE+LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SL+KI+G KGEANVA  +R +HRGSTSGTK + SSS N + 
Subjt:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG

Query:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK--------------------------------------------
        KK   KK  +G +A+   A+  KK K    KG CF CN++GHWKRNCPKY+A+KKK                                            
Subjt:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK--------------------------------------------

Query:  ---EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYA
            DDYSRYG +YLM HKSEALEKFKE+K EVEN L K +KT RSDR                       AP TPQQNGVSERRNRTLLDMVRSMMSYA
Subjt:  ---EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYA

Query:  HLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK
        HLP+SFWGYAV+T VYILN VPSKSVSETP +LW G KGSLRHFRIWGCPAHVL  NPKKL+ R KLCL VGY K TRGG FYDPK++KVFVSTNATFL+
Subjt:  HLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK

Query:  EDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQESPSQELSMPRRSWRV
        EDHIR+HKP+ K+VL+ L     + + +     S    VV    S++    Q L  PRRS RV
Subjt:  EDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQESPSQELSMPRRSWRV

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]3.8e-13057.67Show/hide
Query:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG
        M VHFN+AE NGA IDE+SQVS ILE+LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SL+KI+G KGEANVA  +R +HRGSTSGTK + SSS N + 
Subjt:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG

Query:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK--------------------------------------------
        KK   KK  +G +A+   A+  KK K    KG CFHCN++GHWKRNCPKY+AEKKK                                            
Subjt:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK--------------------------------------------

Query:  ---EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYA
            DDYSRYG +YLM HKSEALEKFKE+K EVEN L K +KT RSDR                        PGTPQQNGVS+RRNRTLLDMVRSMMSY 
Subjt:  ---EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYA

Query:  HLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK
        HLP+SFWGYAV+T VYILN VPSKSVS+TP +LW GRKGSLRHFRIWGCPAHVL  NPKKL+ R KLCL VGY K TRGG FYDPK++KVFVSTNATFL+
Subjt:  HLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK

Query:  EDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQESPSQELSMPRRSWRV
        EDHIR+HKP+ K+VL+ L     + + +     S  T VV    S++    Q L  PRRS RV
Subjt:  EDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQESPSQELSMPRRSWRV

KAA0058447.1 gag/pol protein [Cucumis melo var. makuwa]3.9e-12756.59Show/hide
Query:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG
        M VHFN+ E NGA IDE+SQVS ILE+LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SL+KI+G KGEANVA  +R +H+GSTSGTK + SSS N   
Subjt:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG

Query:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK--------------------------------------------
        KK   KK  +G +A+   A+  KK K    KG CFHCN++GHWKRNCPKY+AEKKK                                            
Subjt:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK--------------------------------------------

Query:  ---EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYA
            DDYSRYG +YLM HK EALEKFKE+K +VEN L K +KT RSDR                       APGTPQQNGVSERRNRTLLDMV+SM+SYA
Subjt:  ---EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYA

Query:  HLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK
        HLP+SFWGYAV+T VYILN VPSKS SETP +LW G KGSLRHFRIWGCPAHVL  NPKKL+   KLCL VGY K TRGG FYDPK++KVFVSTNATFL+
Subjt:  HLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK

Query:  EDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQESPSQELSMPRRSWRV
        EDHIR+HKP+ K+VL+ L  +  + + +     S  T VV    S++    Q L  PRRS RV
Subjt:  EDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQESPSQELSMPRRSWRV

KAA0066490.1 gag/pol protein [Cucumis melo var. makuwa]7.7e-12360.58Show/hide
Query:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG
        M VHF++AE NGA IDE+SQVS ILE+LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SL+KI+G KGEANVA  +R +HRGSTSGTK + SSS N + 
Subjt:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG

Query:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKKEDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSD
        KK   KK  +G +A+   A+  KK K    KG CFHCN++GHWKRNCPKY+AEKKK                 +ALEKFKE+K EVEN L K +KT RSD
Subjt:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKKEDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSD

Query:  R-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIW
        R                       APGTPQQNGVSERRNRTL DMVRSMMSYAHLP+SFWGYAV+T VYILN VPSKSV ETP +LW GRKGSLRHFRIW
Subjt:  R-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIW

Query:  GCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLKEDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQ
        GC AHVL  NPKKL+ R KLCL VGY K TRGG FYDPK++KVFVSTNATFL+EDHIR+HKP+ K+VL+ L     + + +     S    VV    S++
Subjt:  GCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLKEDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQ

Query:  ESPSQELSMPRRSWRV
            Q L  PRRS RV
Subjt:  ESPSQELSMPRRSWRV

TYJ96910.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-11750.48Show/hide
Query:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG
        M VHFN+AE NGA IDE+S+VS IL++LP+SFLQFRSNVVMNK+ + LT+LLNELQTF+SL+KI+G KGEANVA  +R +HRGSTSGTK + SSS N + 
Subjt:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG

Query:  KKKNMKKVSKGKQADKV---VAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK-----------------------------------------
        KK      +KG Q +KV    A+  KK K    KG  FHCN++GHWKRNCPKY+AEKKK                                         
Subjt:  KKKNMKKVSKGKQADKV---VAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK-----------------------------------------

Query:  -------------------------------------------------------------EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVK
                                                                      DDYSRYG +YLM HKSEALEKFKE+K EVEN L K +K
Subjt:  -------------------------------------------------------------EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVK

Query:  TLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLR
        T RSDR                       APGTPQQNGVSERRN+TLLDMV SMMSYAHLP+SFWGYAV+T VYILN VPSKSVSETP +LW GRKGSL 
Subjt:  TLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLR

Query:  HFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLKEDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDT
        HFRI GCPAHVL  N KKL+ R KLCL VGY K +RGG FYDPK++KV VSTNATFL+EDHIR+HKP+ K+VL+ L     + + +     S  T+VV  
Subjt:  HFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLKEDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDT

Query:  SLSSQESPSQELSMPRRSWRV
          S++    Q L  PRRS RV
Subjt:  SLSSQESPSQELSMPRRSWRV

TrEMBL top hitse value%identityAlignment
A0A5A7SNP8 Gag/pol protein1.2e-12657.02Show/hide
Query:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG
        M VHFN+A  N A IDE+SQVS ILE+LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SL+KI+G KGEANVA  +R +HRGSTSGTK + SSS N + 
Subjt:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG

Query:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK--------------------------------------------
        KK   KK  +G +A+   A+  KK K    KG CF CN++GHWKRNCPKY+A+KKK                                            
Subjt:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK--------------------------------------------

Query:  ---EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYA
            DDYSRYG +YLM HKSEALEKFKE+K EVEN L K +KT RSDR                       AP TPQQNGVSERRNRTLLDMVRSMMSYA
Subjt:  ---EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYA

Query:  HLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK
        HLP+SFWGYAV+T VYILN VPSKSVSETP +LW G KGSLRHFRIWGCPAHVL  NPKKL+ R KLCL VGY K TRGG FYDPK++KVFVSTNATFL+
Subjt:  HLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK

Query:  EDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQESPSQELSMPRRSWRV
        EDHIR+HKP+ K+VL+ L     + + +     S    VV    S++    Q L  PRRS RV
Subjt:  EDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQESPSQELSMPRRSWRV

A0A5A7U869 Gag/pol protein1.8e-13057.67Show/hide
Query:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG
        M VHFN+AE NGA IDE+SQVS ILE+LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SL+KI+G KGEANVA  +R +HRGSTSGTK + SSS N + 
Subjt:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG

Query:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK--------------------------------------------
        KK   KK  +G +A+   A+  KK K    KG CFHCN++GHWKRNCPKY+AEKKK                                            
Subjt:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK--------------------------------------------

Query:  ---EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYA
            DDYSRYG +YLM HKSEALEKFKE+K EVEN L K +KT RSDR                        PGTPQQNGVS+RRNRTLLDMVRSMMSY 
Subjt:  ---EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYA

Query:  HLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK
        HLP+SFWGYAV+T VYILN VPSKSVS+TP +LW GRKGSLRHFRIWGCPAHVL  NPKKL+ R KLCL VGY K TRGG FYDPK++KVFVSTNATFL+
Subjt:  HLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK

Query:  EDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQESPSQELSMPRRSWRV
        EDHIR+HKP+ K+VL+ L     + + +     S  T VV    S++    Q L  PRRS RV
Subjt:  EDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQESPSQELSMPRRSWRV

A0A5A7UUN7 Gag/pol protein1.9e-12756.59Show/hide
Query:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG
        M VHFN+ E NGA IDE+SQVS ILE+LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SL+KI+G KGEANVA  +R +H+GSTSGTK + SSS N   
Subjt:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG

Query:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK--------------------------------------------
        KK   KK  +G +A+   A+  KK K    KG CFHCN++GHWKRNCPKY+AEKKK                                            
Subjt:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK--------------------------------------------

Query:  ---EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYA
            DDYSRYG +YLM HK EALEKFKE+K +VEN L K +KT RSDR                       APGTPQQNGVSERRNRTLLDMV+SM+SYA
Subjt:  ---EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYA

Query:  HLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK
        HLP+SFWGYAV+T VYILN VPSKS SETP +LW G KGSLRHFRIWGCPAHVL  NPKKL+   KLCL VGY K TRGG FYDPK++KVFVSTNATFL+
Subjt:  HLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK

Query:  EDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQESPSQELSMPRRSWRV
        EDHIR+HKP+ K+VL+ L  +  + + +     S  T VV    S++    Q L  PRRS RV
Subjt:  EDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQESPSQELSMPRRSWRV

A0A5A7VH46 Gag/pol protein3.7e-12360.58Show/hide
Query:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG
        M VHF++AE NGA IDE+SQVS ILE+LP+SFLQFRSN VMNK+ + LT+LLNELQTF+SL+KI+G KGEANVA  +R +HRGSTSGTK + SSS N + 
Subjt:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG

Query:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKKEDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSD
        KK   KK  +G +A+   A+  KK K    KG CFHCN++GHWKRNCPKY+AEKKK                 +ALEKFKE+K EVEN L K +KT RSD
Subjt:  KKKNMKKVSKGKQADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKKEDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSD

Query:  R-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIW
        R                       APGTPQQNGVSERRNRTL DMVRSMMSYAHLP+SFWGYAV+T VYILN VPSKSV ETP +LW GRKGSLRHFRIW
Subjt:  R-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIW

Query:  GCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLKEDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQ
        GC AHVL  NPKKL+ R KLCL VGY K TRGG FYDPK++KVFVSTNATFL+EDHIR+HKP+ K+VL+ L     + + +     S    VV    S++
Subjt:  GCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLKEDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDTSLSSQ

Query:  ESPSQELSMPRRSWRV
            Q L  PRRS RV
Subjt:  ESPSQELSMPRRSWRV

A0A5D3BAN6 Gag/pol protein8.0e-11850.48Show/hide
Query:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG
        M VHFN+AE NGA IDE+S+VS IL++LP+SFLQFRSNVVMNK+ + LT+LLNELQTF+SL+KI+G KGEANVA  +R +HRGSTSGTK + SSS N + 
Subjt:  MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKL-FNLTSLLNELQTFQSLIKIQGSKGEANVA--SRSYHRGSTSGTKIVASSSSNLRG

Query:  KKKNMKKVSKGKQADKV---VAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK-----------------------------------------
        KK      +KG Q +KV    A+  KK K    KG  FHCN++GHWKRNCPKY+AEKKK                                         
Subjt:  KKKNMKKVSKGKQADKV---VAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKK-----------------------------------------

Query:  -------------------------------------------------------------EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVK
                                                                      DDYSRYG +YLM HKSEALEKFKE+K EVEN L K +K
Subjt:  -------------------------------------------------------------EDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVK

Query:  TLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLR
        T RSDR                       APGTPQQNGVSERRN+TLLDMV SMMSYAHLP+SFWGYAV+T VYILN VPSKSVSETP +LW GRKGSL 
Subjt:  TLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLR

Query:  HFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLKEDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDT
        HFRI GCPAHVL  N KKL+ R KLCL VGY K +RGG FYDPK++KV VSTNATFL+EDHIR+HKP+ K+VL+ L     + + +     S  T+VV  
Subjt:  HFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLKEDHIRDHKPKCKVVLSALDGRLAKVANK---NTSTSTTVVDT

Query:  SLSSQESPSQELSMPRRSWRV
          S++    Q L  PRRS RV
Subjt:  SLSSQESPSQELSMPRRSWRV

SwissProt top hitse value%identityAlignment
P04146 Copia protein3.0e-2129.91Show/hide
Query:  DDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYAHLPS
        D ++ Y   YL+ +KS+    F++F  + E     +V  L  D                         P TPQ NGVSER  RT+ +  R+M+S A L  
Subjt:  DDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYAHLPS

Query:  SFWGYAVETPVYILNSVPSKSV---SETPFELWKGRKGSLRHFRIWGCPAHVLVTNPK-KLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK
        SFWG AV T  Y++N +PS+++   S+TP+E+W  +K  L+H R++G   +V + N + K D +    + VGY  E  G   +D   +K  V+ +    +
Subjt:  SFWGYAVETPVYILNSVPSKSV---SETPFELWKGRKGSLRHFRIWGCPAHVLVTNPK-KLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLK

Query:  EDHIRDHKPKCKVV
         + +     K + V
Subjt:  EDHIRDHKPKCKVV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.3e-2832.79Show/hide
Query:  DDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYAHLPS
        DD SR   +Y++  K +  + F++F   VE + G+++K LRSD                         PGTPQ NGV+ER NRT+++ VRSM+  A LP 
Subjt:  DDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRNRTLLDMVRSMMSYAHLPS

Query:  SFWGYAVETPVYILNSVPSKSVS-ETPFELWKGRKGSLRHFRIWGCP--AHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLKE
        SFWG AV+T  Y++N  PS  ++ E P  +W  ++ S  H +++GC   AHV      KLD +   C+ +GY  E  G   +DP + KV  S +  F +E
Subjt:  SFWGYAVETPVYILNSVPSKSVS-ETPFELWKGRKGSLRHFRIWGCP--AHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLKE

Query:  DHIR---DHKPKCKVVLSALDGRLAKVANKNTSTSTTVVDTSLSSQE
          +R   D   K K  +      +   +N  TS  +T  + S   ++
Subjt:  DHIR---DHKPKCKVVLSALDGRLAKVANKNTSTSTTVVDTSLSSQE

Q12337 Transposon Ty2-GR1 Gag-Pol polyprotein1.8e-0523.56Show/hide
Query:  HWKRNCPKYIAEKKKEDDYSRYGPIYLMHHKSE--ALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRN
        H  ++ P Y       D+ +R+  +Y +H + E   L  F      ++NQ   RV  ++ DR                            + +GV+ER N
Subjt:  HWKRNCPKYIAEKKKEDDYSRYGPIYLMHHKSE--ALEKFKEFKGEVENQLGKRVKTLRSDR-----------------------APGTPQQNGVSERRN

Query:  RTLLDMVRSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCP-------AHVLVTNPKKLDSRLKLCLLVGY----QK
        RTLL+  R+++  + LP+  W  AVE    I NS+ S           K RK + +H  + G            ++ N    DS++    + GY     +
Subjt:  RTLLDMVRSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCP-------AHVLVTNPKKLDSRLKLCLLVGY----QK

Query:  ETRGGLFYDPKEDKVFVSTNATFLK
         + G + Y P   K   +TN   L+
Subjt:  ETRGGLFYDPKEDKVFVSTNATFLK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE17.0e-1826.67Show/hide
Query:  DDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDRA---------------------PGTPQQNGVSERRNRTLLDMVRSMMSYAHLPSSF
        D ++RY  +Y +  KS+  E F  FK  +EN+   R+ T  SD                       P TP+ NG+SER++R +++   +++S+A +P ++
Subjt:  DDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDRA---------------------PGTPQQNGVSERRNRTLLDMVRSMMSYAHLPSSF

Query:  WGYAVETPVYILNSVPSKSVS-ETPFELWKGRKGSLRHFRIWGCPAHVLVT--NPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATF
        W YA    VY++N +P+  +  E+PF+   G   +    R++GC  +  +   N  KLD + + C+ +GY       L    +  ++++S +  F
Subjt:  WGYAVETPVYILNSVPSKSVS-ETPFELWKGRKGSLRHFRIWGCPAHVLVT--NPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATF

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.6e-1726.67Show/hide
Query:  DDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDRA---------------------PGTPQQNGVSERRNRTLLDMVRSMMSYAHLPSSF
        D ++RY  +Y +  KS+  + F  FK  VEN+   R+ TL SD                       P TP+ NG+SER++R +++M  +++S+A +P ++
Subjt:  DDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDRA---------------------PGTPQQNGVSERRNRTLLDMVRSMMSYAHLPSSF

Query:  WGYAVETPVYILNSVPSKSVS-ETPFELWKGRKGSLRHFRIWGCPAHVLVT--NPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATF
        W YA    VY++N +P+  +  ++PF+   G+  +    +++GC  +  +   N  KL+ + K C  +GY       L       +++ S +  F
Subjt:  WGYAVETPVYILNSVPSKSVS-ETPFELWKGRKGSLRHFRIWGCPAHVLVT--NPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATF

Arabidopsis top hitse value%identityAlignment
ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein3.7e-0634.78Show/hide
Query:  NRTLLDMVRSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVS-ETPFELWKGRKGSLRHFRIWGCPAHV
        NRT+++ VRSM+    LP +F   A  T V+I+N  PS +++   P E+W     +  + R +GC A++
Subjt:  NRTLLDMVRSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVS-ETPFELWKGRKGSLRHFRIWGCPAHV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTACACTTTAACATCGCGGAGTCGAATGGTGCTTCTATCGATGAGTCGAGCCAGGTCAGCATCATTCTGGAAACCCTTCCAGATAGTTTCCTACAGTTTAGAAG
TAATGTTGTTATGAACAAGCTTTTTAATCTTACCTCCCTTCTAAATGAACTCCAGACCTTTCAGTCTTTGATCAAAATTCAGGGATCGAAAGGTGAGGCCAATGTTGCCA
GTAGGAGTTATCACAGAGGTTCGACCTCTGGGACAAAAATAGTGGCTTCATCTTCTTCTAACCTGAGAGGTAAGAAGAAGAACATGAAGAAGGTTAGTAAAGGAAAACAG
GCTGATAAAGTTGTTGCCCAGAAGGGCAAGAAAGTTAAAGACGTTGTTGATAAAGGAAAGTGTTTCCACTGCAACAAGGACGGGCATTGGAAACGGAACTGTCCGAAGTA
CATTGCAGAAAAGAAGAAGGAAGATGATTATTCTAGATATGGGCCTATTTACCTAATGCACCACAAGTCTGAAGCACTTGAAAAGTTCAAGGAATTCAAGGGTGAGGTTG
AAAATCAATTAGGTAAAAGAGTTAAAACACTTCGATCAGATCGAGCCCCCGGCACTCCACAACAAAATGGTGTATCGGAAAGGAGAAATAGAACCTTGTTGGACATGGTT
CGATCAATGATGAGTTATGCTCATCTCCCTAGTTCTTTTTGGGGTTATGCAGTGGAGACTCCAGTATACATTTTGAATAGTGTGCCCTCCAAAAGTGTTTCTGAAACACC
TTTTGAACTCTGGAAAGGACGTAAAGGTAGTTTACGTCACTTTAGAATATGGGGTTGTCCAGCACATGTGCTTGTGACAAATCCAAAGAAGTTGGATTCACGTTTAAAGT
TGTGCCTATTAGTAGGATACCAAAAAGAAACAAGAGGTGGTTTATTCTATGATCCTAAGGAAGATAAGGTCTTTGTGTCAACAAATGCCACTTTCTTGAAGGAGGACCAC
ATAAGGGACCACAAACCAAAATGTAAGGTAGTGTTGAGTGCGTTAGACGGGAGATTAGCAAAAGTTGCTAATAAGAATACTAGTACGTCAACAACAGTTGTTGATACTAG
TTTGTCTAGTCAAGAGAGTCCATCTCAAGAGTTGAGTATGCCTCGACGTAGTTGGAGGGTTGTGATATAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTACACTTTAACATCGCGGAGTCGAATGGTGCTTCTATCGATGAGTCGAGCCAGGTCAGCATCATTCTGGAAACCCTTCCAGATAGTTTCCTACAGTTTAGAAG
TAATGTTGTTATGAACAAGCTTTTTAATCTTACCTCCCTTCTAAATGAACTCCAGACCTTTCAGTCTTTGATCAAAATTCAGGGATCGAAAGGTGAGGCCAATGTTGCCA
GTAGGAGTTATCACAGAGGTTCGACCTCTGGGACAAAAATAGTGGCTTCATCTTCTTCTAACCTGAGAGGTAAGAAGAAGAACATGAAGAAGGTTAGTAAAGGAAAACAG
GCTGATAAAGTTGTTGCCCAGAAGGGCAAGAAAGTTAAAGACGTTGTTGATAAAGGAAAGTGTTTCCACTGCAACAAGGACGGGCATTGGAAACGGAACTGTCCGAAGTA
CATTGCAGAAAAGAAGAAGGAAGATGATTATTCTAGATATGGGCCTATTTACCTAATGCACCACAAGTCTGAAGCACTTGAAAAGTTCAAGGAATTCAAGGGTGAGGTTG
AAAATCAATTAGGTAAAAGAGTTAAAACACTTCGATCAGATCGAGCCCCCGGCACTCCACAACAAAATGGTGTATCGGAAAGGAGAAATAGAACCTTGTTGGACATGGTT
CGATCAATGATGAGTTATGCTCATCTCCCTAGTTCTTTTTGGGGTTATGCAGTGGAGACTCCAGTATACATTTTGAATAGTGTGCCCTCCAAAAGTGTTTCTGAAACACC
TTTTGAACTCTGGAAAGGACGTAAAGGTAGTTTACGTCACTTTAGAATATGGGGTTGTCCAGCACATGTGCTTGTGACAAATCCAAAGAAGTTGGATTCACGTTTAAAGT
TGTGCCTATTAGTAGGATACCAAAAAGAAACAAGAGGTGGTTTATTCTATGATCCTAAGGAAGATAAGGTCTTTGTGTCAACAAATGCCACTTTCTTGAAGGAGGACCAC
ATAAGGGACCACAAACCAAAATGTAAGGTAGTGTTGAGTGCGTTAGACGGGAGATTAGCAAAAGTTGCTAATAAGAATACTAGTACGTCAACAACAGTTGTTGATACTAG
TTTGTCTAGTCAAGAGAGTCCATCTCAAGAGTTGAGTATGCCTCGACGTAGTTGGAGGGTTGTGATATAG
Protein sequenceShow/hide protein sequence
MKVHFNIAESNGASIDESSQVSIILETLPDSFLQFRSNVVMNKLFNLTSLLNELQTFQSLIKIQGSKGEANVASRSYHRGSTSGTKIVASSSSNLRGKKKNMKKVSKGKQ
ADKVVAQKGKKVKDVVDKGKCFHCNKDGHWKRNCPKYIAEKKKEDDYSRYGPIYLMHHKSEALEKFKEFKGEVENQLGKRVKTLRSDRAPGTPQQNGVSERRNRTLLDMV
RSMMSYAHLPSSFWGYAVETPVYILNSVPSKSVSETPFELWKGRKGSLRHFRIWGCPAHVLVTNPKKLDSRLKLCLLVGYQKETRGGLFYDPKEDKVFVSTNATFLKEDH
IRDHKPKCKVVLSALDGRLAKVANKNTSTSTTVVDTSLSSQESPSQELSMPRRSWRVVI