; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc08g0227541 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc08g0227541
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr08:20700710..20702197
RNA-Seq ExpressionCmc08g0227541
SyntenyCmc08g0227541
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]8.7e-27394.95Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI KNGVHICSAKL NNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQ ISPNNN+YL HLRLGHINLDRI RLVKNGL+NKL+D SLPPCESCLEGKMTKRPF GKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKILRSDRG EYMDLRFQDYMIEHGIQSQLSAPGTPQQNGV+ERRNRTLLDMVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH
        SSFWGYAVETAVHILNNVP KSVSET FELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP SRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE+H
Subjt:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH

Query:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ
        MR+HKPRSKLVL+EATDESTRV+DEVGPSSR+DETTTSGQSHPSQSLR+PRRSGR+VSQPNRYL LTETQVVIPDD VEDPLSYKQAMNDVDKDQ
Subjt:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-26993.94Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI KNGVHICSAKL NNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQ ISPNNN+YL HLRLGHINLDRI RLVK+GL+NKL+D SLPPCESCLEGKMTKRPF GKGYRAKEPLELIHSDLCGPMNVKARG FEYFISF
Subjt:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKI RSDRG EYMDL FQDYMIEHGIQSQLSAPGTPQQNGV+ERRNRTLLDMVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH
        SSFWGYAVETAVHILNNVP KSVSET FELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP SRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFLEE+H
Subjt:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH

Query:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ
        MR+HKPRSKLVL+EATDESTRV+DEVGPSSR+DETTTSGQSHPSQSLR+PRRSGR+VSQPNRYL LTETQVVIPDD VEDPLSYKQAMNDVDKDQ
Subjt:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa]4.1e-25490.72Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT
        M LKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSI+FSMNEAFISKNGVHICS KL +NLYVL+PNE KAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQ IS NNN+YL HLRLGHINLDRI RLVKNGL+NKLEDDSLPPCESCLEGKMTKRPF GKGYRAKEPLELIHSDLCGPMNVKA GGFEYFISF
Subjt:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP
        IDDYS YGYLYL+EHKSEALEKFKEYK EVENLLSKKIKILRSDRG EYMDLRFQDYMIEHGIQSQLSAPGTPQQNGV+ERRNRTLLDMV SMMSY QLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH
        SSFWGYAVETAVHILNNVP K+V ET FELWRGRKPSLSHFRIW CP HVLVTNPKKLEP SRLCQFVGYPKETRGGLFFDPQENRVFVSTNATF EE+H
Subjt:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH

Query:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYK
        MRDHKPR KLVL+EATDESTRV+DEVGPSSR+DETTTSGQSHPSQSLR+PRRSGR +SQ  RYL LTET VVIPD  VEDPL YK
Subjt:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYK

KAA0065386.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-21779.6Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAK                                                                 PNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQ ISPNN +YL HLRLGHINLD+I RLVKNGL+NKLEDDSLPPCES LEGKMTKRPF GKGYRAKEPLELIHSDL GPMNVKAR GFEYFISF
Subjt:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEK KEY+ EVENLLS+KIKILRSDRG EYMDLRFQDYMIEHGIQSQLSA GTPQQNGV+ERRNRTLLDMVRSMMSYAQ P
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH
        SSFWGYAVETAVHILNNVP KSVSE  FELWRGRKPSLSHFRIWGCP H+LVTNPKKLEP SRLCQFVGYPK+TRGGLFFDPQENRVFVSTNATFLEE+H
Subjt:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH

Query:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ
        MRDHKPRSKLVLNEAT+ESTRV+DEVGPSSR+DETTTSGQSHPSQ LR+PR SGRIVS+PNRYL LTETQVVIPDD VEDPLSYKQAMNDVDKDQ
Subjt:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa]4.1e-22282.22Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT
        MTL VGTGDVISARAVGD KLFFG KFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEAFISKNG     AKL +NLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQ ISPNNN+YL HLRL HINLDRI RLVKNGL+NKL+DDSLPPCESCLEGKMTKRPF GK YRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP
        IDDYSRYGYLYLMEHK EALEKFKEYK EVENLLSKKIKILRSDRG EYMDLRFQDYMIEHGIQSQLSAPGTPQQNGV+ERRNRTLLDMVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH
        SSFWG                                                 PKKLEP SRLCQFVGYPKE RGGLFFDPQENRVFVSTN TFLEE+ 
Subjt:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH

Query:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ
        MRDHKPRSKLVL EATDESTRV+DEV PSSR+DETTTSGQSHPSQSLR+PRRSGRIVSQP RYL LTETQVVIPDD VEDPLSYKQ MNDVDK+Q
Subjt:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein5.7e-27093.94Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI KNGVHICSAKL NNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQ ISPNNN+YL HLRLGHINLDRI RLVK+GL+NKL+D SLPPCESCLEGKMTKRPF GKGYRAKEPLELIHSDLCGPMNVKARG FEYFISF
Subjt:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKI RSDRG EYMDL FQDYMIEHGIQSQLSAPGTPQQNGV+ERRNRTLLDMVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH
        SSFWGYAVETAVHILNNVP KSVSET FELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP SRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFLEE+H
Subjt:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH

Query:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ
        MR+HKPRSKLVL+EATDESTRV+DEVGPSSR+DETTTSGQSHPSQSLR+PRRSGR+VSQPNRYL LTETQVVIPDD VEDPLSYKQAMNDVDKDQ
Subjt:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ

A0A5A7TZD0 Gag/pol protein4.2e-27394.95Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFI KNGVHICSAKL NNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQ ISPNNN+YL HLRLGHINLDRI RLVKNGL+NKL+D SLPPCESCLEGKMTKRPF GKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEKFKEYK EVENLLSKKIKILRSDRG EYMDLRFQDYMIEHGIQSQLSAPGTPQQNGV+ERRNRTLLDMVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH
        SSFWGYAVETAVHILNNVP KSVSET FELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEP SRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEE+H
Subjt:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH

Query:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ
        MR+HKPRSKLVL+EATDESTRV+DEVGPSSR+DETTTSGQSHPSQSLR+PRRSGR+VSQPNRYL LTETQVVIPDD VEDPLSYKQAMNDVDKDQ
Subjt:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ

A0A5A7VGC7 Gag/pol protein5.1e-21879.6Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAK                                                                 PNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQ ISPNN +YL HLRLGHINLD+I RLVKNGL+NKLEDDSLPPCES LEGKMTKRPF GKGYRAKEPLELIHSDL GPMNVKAR GFEYFISF
Subjt:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEK KEY+ EVENLLS+KIKILRSDRG EYMDLRFQDYMIEHGIQSQLSA GTPQQNGV+ERRNRTLLDMVRSMMSYAQ P
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH
        SSFWGYAVETAVHILNNVP KSVSE  FELWRGRKPSLSHFRIWGCP H+LVTNPKKLEP SRLCQFVGYPK+TRGGLFFDPQENRVFVSTNATFLEE+H
Subjt:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH

Query:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ
        MRDHKPRSKLVLNEAT+ESTRV+DEVGPSSR+DETTTSGQSHPSQ LR+PR SGRIVS+PNRYL LTETQVVIPDD VEDPLSYKQAMNDVDKDQ
Subjt:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ

A0A5A7VJG3 Gag/pol protein2.0e-22282.22Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT
        MTL VGTGDVISARAVGD KLFFG KFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEAFISKNG     AKL +NLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQ ISPNNN+YL HLRL HINLDRI RLVKNGL+NKL+DDSLPPCESCLEGKMTKRPF GK YRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP
        IDDYSRYGYLYLMEHK EALEKFKEYK EVENLLSKKIKILRSDRG EYMDLRFQDYMIEHGIQSQLSAPGTPQQNGV+ERRNRTLLDMVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH
        SSFWG                                                 PKKLEP SRLCQFVGYPKE RGGLFFDPQENRVFVSTN TFLEE+ 
Subjt:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH

Query:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ
        MRDHKPRSKLVL EATDESTRV+DEV PSSR+DETTTSGQSHPSQSLR+PRRSGRIVSQP RYL LTETQVVIPDD VEDPLSYKQ MNDVDK+Q
Subjt:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ

A0A5D3BNE1 Gag/pol protein2.0e-25490.72Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT
        M LKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSI+FSMNEAFISKNGVHICS KL +NLYVL+PNE KAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQ IS NNN+YL HLRLGHINLDRI RLVKNGL+NKLEDDSLPPCESCLEGKMTKRPF GKGYRAKEPLELIHSDLCGPMNVKA GGFEYFISF
Subjt:  ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP
        IDDYS YGYLYL+EHKSEALEKFKEYK EVENLLSKKIKILRSDRG EYMDLRFQDYMIEHGIQSQLSAPGTPQQNGV+ERRNRTLLDMV SMMSY QLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH
        SSFWGYAVETAVHILNNVP K+V ET FELWRGRKPSLSHFRIW CP HVLVTNPKKLEP SRLCQFVGYPKETRGGLFFDPQENRVFVSTNATF EE+H
Subjt:  SSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENH

Query:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYK
        MRDHKPR KLVL+EATDESTRV+DEVGPSSR+DETTTSGQSHPSQSLR+PRRSGR +SQ  RYL LTET VVIPD  VEDPL YK
Subjt:  MRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.8e-4731.27Show/hide
Query:  LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHIC-SAKLGNNLYVLRPNEAKAVLNHEMFRTANTQNKRQGISPNNNSYLRHLRLGHINL
        LE++    +   NL+SV  L E   SI F  +   ISKNG+ +  ++ + NN+          V+N + + + N ++K       NN  L H R GHI+ 
Subjt:  LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHIC-SAKLGNNLYVLRPNEAKAVLNHEMFRTANTQNKRQGISPNNNSYLRHLRLGHINL

Query:  DRIVRLVK------NGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
         +++ + +        L+N LE  S   CE CL GK  + PF     +   K PL ++HSD+CGP+         YF+ F+D ++ Y   YL+++KS+  
Subjt:  DRIVRLVK------NGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL

Query:  EKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPL
          F+++ A+ E   + K+  L  D G EY+    + + ++ GI   L+ P TPQ NGV+ER  RT+ +  R+M+S A+L  SFWG AV TA +++N +P 
Subjt:  EKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPL

Query:  KSV---SETTFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENHMRDHKPRSKLVLNEAT
        +++   S+T +E+W  +KP L H R++G   +V + N + K +  S    FVGY  E  G   +D    +  V+ +    E N +     + + V  + +
Subjt:  KSV---SETTFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENHMRDHKPRSKLVLNEAT

Query:  DES
         ES
Subjt:  DES

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.4e-5730.93Show/hide
Query:  TLKVGTGDVISARAVGDAKLFFG-NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT
        T+K+G         +GD  +       + L+++  VP ++ NL+S   L    Y   F+  +  ++K  + I                AK V    ++RT
Subjt:  TLKVGTGDVISARAVGDAKLFFG-NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRT

Query:  --ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFI
             Q +        +  L H R+GH++   +  L K  LI+  +  ++ PC+ CL GK  +  F     R    L+L++SD+CGPM +++ GG +YF+
Subjt:  --ANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFI

Query:  SFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQ
        +FIDD SR  ++Y+++ K +  + F+++ A VE    +K+K LRSD G EY    F++Y   HGI+ + + PGTPQ NGV ER NRT+++ VRSM+  A+
Subjt:  SFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQ

Query:  LPSSFWGYAVETAVHILNNVPLKSVS-ETTFELWRGRKPSLSHFRIWGCP--AHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATF
        LP SFWG AV+TA +++N  P   ++ E    +W  ++ S SH +++GC   AHV      KL+  S  C F+GY  E  G   +DP + +V  S +  F
Subjt:  LPSSFWGYAVETAVHILNNVPLKSVS-ETTFELWRGRKPSLSHFRIWGCP--AHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATF

Query:  LEENHMR-----DHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQ-------------------SHPSQ---SLRIPRRSGRIVSQPNRYLSLTE
          E+ +R       K ++ ++ N  T  ST   +     S  DE +  G+                    HP+Q     +  RRS R   +  RY S TE
Subjt:  LEENHMR-----DHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQ-------------------SHPSQ---SLRIPRRSGRIVSQPNRYLSLTE

Query:  TQVVIPDDVVEDPLSYKQAMNDVDKDQ
          V+I DD   +P S K+ ++  +K+Q
Subjt:  TQVVIPDDVVEDPLSYKQAMNDVDKDQ

Q12491 Transposon Ty2-B Gag-Pol polyprotein2.2e-2423.96Show/hide
Query:  ISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRTANTQNKRQGI
        I   A+G+    F N           P I  +L+S+S L     +  F+ N    S   V     K G+  ++   ++   + +H    T N  NK + +
Subjt:  ISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRTANTQNKRQGI

Query:  SPNNNSY-LRHLRLGHINLDRIVRLVKNGLINKLEDDSLP-------PCESCLEGKMTKRPFAGKGYRAK-----EPLELIHSDLCGPMNVKARGGFEYF
          N   Y L H  LGH N   I + +K   +  L++  +         C  CL GK TK     KG R K     EP + +H+D+ GP++   +    YF
Subjt:  SPNNNSY-LRHLRLGHINLDRIVRLVKNGLINKLEDDSLP-------PCESCLEGKMTKRPFAGKGYRAK-----EPLELIHSDLCGPMNVKARGGFEYF

Query:  ISFIDDYSRYGYLYLMEHKSE--ALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMS
        ISF D+ +R+ ++Y +  + E   L  F    A ++N  + ++ +++ DRG EY +     +    GI +  +     + +GV ER NRTLL+  R+++ 
Subjt:  ISFIDDYSRYGYLYLMEHKSE--ALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMS

Query:  YAQLPSSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNP-KKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT
         + LP+  W  AVE +  I N++      ++  +        ++    +G P  V   NP  K+ P       +   + + G + + P   +   +TN  
Subjt:  YAQLPSSFWGYAVETAVHILNNVPLKSVSETTFELWRGRKPSLSHFRIWGCPAHVLVTNP-KKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT

Query:  FLEENHMR-DHKPRSKLV----LNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMN
         L++N  + D      L     LN  T  +   I++       D+ T S   + S+   I   S  +V+          +Q + P  +  +P+   +A+ 
Subjt:  FLEENHMR-DHKPRSKLV----LNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMN

Query:  DVDKD
        +VD D
Subjt:  DVDKD

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.9e-3725.81Show/hide
Query:  VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMF
        V  G  I     G   L   ++ + L N+  VP I +NL+SV  L          +  +F + +      GV +   K  + LY      ++ V    +F
Subjt:  VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMF

Query:  RTANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLE-DDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYF
         + +++         ++S+  H RLGH     +  ++ N  ++ L        C  CL  K  K PF+     +  PLE I+SD+     + +   + Y+
Subjt:  RTANTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLE-DDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYF

Query:  ISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYA
        + F+D ++RY +LY ++ KS+  E F  +K  +EN    +I    SD G E++ L   +Y  +HGI    S P TP+ NG++ER++R +++   +++S+A
Subjt:  ISFIDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYA

Query:  QLPSSFWGYAVETAVHILNNVPLKSVS-ETTFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT
         +P ++W YA   AV+++N +P   +  E+ F+   G  P+    R++GC  +  +   N  KL+  SR C F+GY       L    Q +R+++S +  
Subjt:  QLPSSFWGYAVETAVHILNNVPLKSVS-ETTFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT

Query:  FLEE-----------NHMRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLR
        F E            + +++ +  S  V +  T   TR      PS        +  S PS   R
Subjt:  FLEE-----------NHMRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQSHPSQSLR

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.8e-3928.25Show/hide
Query:  VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIS--KNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRTA
        +  G  I     G A L   ++ + L  +  VP I +NL+SV  L   +  S+ F      +     GV +   K  + LY      ++AV    MF + 
Subjt:  VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIS--KNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRTA

Query:  NTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLE-DDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
         ++         ++S+  H RLGH +L  +  ++ N  +  L     L  C  C   K  K PF+     + +PLE I+SD+     + +   + Y++ F
Subjt:  NTQNKRQGISPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLE-DDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP
        +D ++RY +LY ++ KS+  + F  +K+ VEN    +I  L SD G E++ LR  DY+ +HGI    S P TP+ NG++ER++R +++M  +++S+A +P
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPLKSVS-ETTFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLE
         ++W YA   AV+++N +P   +  ++ F+   G+ P+    +++GC  +  +   N  KLE  S+ C F+GY       L       R++ S +  F E
Subjt:  SSFWGYAVETAVHILNNVPLKSVS-ETTFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLE

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein2.1e-0636.62Show/hide
Query:  LRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNV
        L H RL H++   +  LVK G ++  +  SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V
Subjt:  LRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.0e-0835.37Show/hide
Query:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPLKSVS-ETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSR
        NRT+++ VRSM+    LP +F   A  TAVHI+N  P  +++     E+W    P+ S+ R +GC A++   +  KL+P ++
Subjt:  NRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPLKSVS-ETTFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTATATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTACATATTTGTT
CGGCTAAGCTTGGAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAGGAATT
TCTCCAAATAACAATAGCTATCTTCGGCATTTAAGATTAGGTCACATAAATCTTGATCGGATCGTGAGATTGGTAAAGAATGGACTTATAAACAAGTTAGAAGATGATTC
ATTACCTCCATGTGAATCTTGTCTTGAAGGAAAAATGACAAAGAGACCTTTTGCTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTTTGTG
GTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCAAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTT
GAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAAATTTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGAAGAGTACATGGATTTGAGATTCCAGGACTA
TATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTAACAGAAAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCAATGA
TGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTTGAAGAGTGTTTCTGAAACAACTTTCGAGTTA
TGGCGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACATGTGTTAGTGACAAATCCCAAGAAGTTGGAACCTCATTCAAGGTTATGCCAATT
TGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTCGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAAAACCACATGAGAGATC
ATAAACCACGAAGCAAATTAGTATTAAATGAAGCTACTGATGAATCAACAAGGGTTATTGATGAAGTTGGTCCCTCATCAAGAATTGATGAAACCACCACATCAGGTCAA
TCTCATCCTTCTCAATCGTTGAGAATCCCTCGACGCAGTGGGAGGATTGTATCACAACCTAACCGTTATTTGAGTTTAACTGAAACTCAGGTTGTCATACCAGATGATGT
TGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTATATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTCTAAGAATGGTGTACATATTTGTT
CGGCTAAGCTTGGAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAGGAATT
TCTCCAAATAACAATAGCTATCTTCGGCATTTAAGATTAGGTCACATAAATCTTGATCGGATCGTGAGATTGGTAAAGAATGGACTTATAAACAAGTTAGAAGATGATTC
ATTACCTCCATGTGAATCTTGTCTTGAAGGAAAAATGACAAAGAGACCTTTTGCTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTTTGTG
GTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCAAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTT
GAAAAGTTCAAGGAGTATAAGGCTGAAGTTGAAAATTTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGAAGAGTACATGGATTTGAGATTCCAGGACTA
TATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTAACAGAAAGGAGAAATAGAACCTTGTTAGACATGGTTCGTTCAATGA
TGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTTGAAGAGTGTTTCTGAAACAACTTTCGAGTTA
TGGCGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACATGTGTTAGTGACAAATCCCAAGAAGTTGGAACCTCATTCAAGGTTATGCCAATT
TGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTCGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAAAACCACATGAGAGATC
ATAAACCACGAAGCAAATTAGTATTAAATGAAGCTACTGATGAATCAACAAGGGTTATTGATGAAGTTGGTCCCTCATCAAGAATTGATGAAACCACCACATCAGGTCAA
TCTCATCCTTCTCAATCGTTGAGAATCCCTCGACGCAGTGGGAGGATTGTATCACAACCTAACCGTTATTTGAGTTTAACTGAAACTCAGGTTGTCATACCAGATGATGT
TGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATAG
Protein sequenceShow/hide protein sequence
MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFISKNGVHICSAKLGNNLYVLRPNEAKAVLNHEMFRTANTQNKRQGI
SPNNNSYLRHLRLGHINLDRIVRLVKNGLINKLEDDSLPPCESCLEGKMTKRPFAGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
EKFKEYKAEVENLLSKKIKILRSDRGEEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVTERRNRTLLDMVRSMMSYAQLPSSFWGYAVETAVHILNNVPLKSVSETTFEL
WRGRKPSLSHFRIWGCPAHVLVTNPKKLEPHSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEENHMRDHKPRSKLVLNEATDESTRVIDEVGPSSRIDETTTSGQ
SHPSQSLRIPRRSGRIVSQPNRYLSLTETQVVIPDDVVEDPLSYKQAMNDVDKDQ