; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc04g0102931 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc04g0102931
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionGag/pol protein
Genome locationCMiso1.1chr04:19822756..19824468
RNA-Seq ExpressionCmc04g0102931
SyntenyCmc04g0102931
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025945.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0099.28Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLL+MVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
        MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM

Query:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL
        DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQ +G+
Subjt:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL

KAA0035907.1 gag/pol protein [Cucumis melo var. makuwa]0.0e+0098.21Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVK+GLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARG FEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLL+MVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
        MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM

Query:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL
        DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT+ +G+
Subjt:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL

KAA0060534.1 gag/pol protein [Cucumis melo var. makuwa]8.5e-26493.81Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        M LKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSI+FSMNEAFI KNGVHICS KLE+NLYVL+PNE KAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRIS NNNTYLWHLRLGHINLDRIGRLVKNGLLNKL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKA GGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP
        IDDYS YGYLYL+EHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLL+MV SMMSY QLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWGYAVETAVHILNNVPSK+V ETPFELWRGRKPSLSHFRIW CP HVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATF EEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYK
        MR+HKPR KLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGR +SQ  RYLGLTET VVIPD GVEDPL YK
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYK

KAA0065386.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-25482.99Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAK                                                                 PNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNN TYLWHLRLGHINLD+IGRLVKNGLLNKL+D SLPPCES LEGKMTKRPF GKGYRAKEPLELIHSDL GPMNVKAR GFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEK KEY+TEVENLLS+KIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSA GTPQQNGVSERRNRTLL+MVRSMMSYAQ P
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWGYAVETAVHILNNVPSKSVSE PFELWRGRKPSLSHFRIWGCP H+LVTNPKKLEPRSRLCQFVGYPK+TRGGLFFDPQENRVFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
        MR+HKPRSKLVL+EAT+ESTRVVDEVGPSSRVDETTTSGQSHPSQ LRMPR SGR+VS+PNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM

Query:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKV
        DLEMESMYFNSVWELVDLP+GVKPIGCKWIYKRKR+SAGKV
Subjt:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKV

KAA0067938.1 gag/pol protein [Cucumis melo var. makuwa]1.2e-25785.37Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTL VGTGDVISARAVGD KLFFG KFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEAFI KNG     AKLE+NLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRL HINLDRIGRLVKNGLLNKLKD SLPPCESCLEGKMTKRPFTGK YRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP
        IDDYSRYGYLYLMEHK EALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLL+MVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWG                                                 PKKLEPRSRLCQFVGYPKE RGGLFFDPQENRVFVSTN TFLEED 
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
        MR+HKPRSKLVL EATDESTRVVDEV PSSRVDETTTSGQSHPSQSLRMPRRSGR+VSQP RYLGLTETQVVIPDDGVEDPLSYKQ MNDVDK+QWVKAM
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM

Query:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK
        DLE+ESMYFNSVWEL DL EGVKPIGCKWIYKRKRDS GK
Subjt:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK

TrEMBL top hitse value%identityAlignment
A0A5A7T2V9 Gag/pol protein0.0e+0098.21Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVK+GLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARG FEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKI RSDRGGEYMDL FQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLL+MVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDP+ENRVFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
        MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM

Query:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL
        DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYT+ +G+
Subjt:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL

A0A5A7TZD0 Gag/pol protein0.0e+0099.28Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLL+MVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
        MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM

Query:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL
        DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQ +G+
Subjt:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL

A0A5A7VGC7 Gag/pol protein1.3e-25482.99Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTLKVGTGDVISARAVGDAK                                                                 PNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNN TYLWHLRLGHINLD+IGRLVKNGLLNKL+D SLPPCES LEGKMTKRPF GKGYRAKEPLELIHSDL GPMNVKAR GFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP
        IDDYSRYGYLYLMEHKSEALEK KEY+TEVENLLS+KIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSA GTPQQNGVSERRNRTLL+MVRSMMSYAQ P
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWGYAVETAVHILNNVPSKSVSE PFELWRGRKPSLSHFRIWGCP H+LVTNPKKLEPRSRLCQFVGYPK+TRGGLFFDPQENRVFVSTNATFLEEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
        MR+HKPRSKLVL+EAT+ESTRVVDEVGPSSRVDETTTSGQSHPSQ LRMPR SGR+VS+PNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM

Query:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKV
        DLEMESMYFNSVWELVDLP+GVKPIGCKWIYKRKR+SAGKV
Subjt:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKV

A0A5A7VJG3 Gag/pol protein5.8e-25885.37Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        MTL VGTGDVISARAVGD KLFFG KFMFLENLYIVPKIKRNLV VSCLIEHMYSINFSMNEAFI KNG     AKLE+NLYVLRPNEAKAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRISPNNNTYLWHLRL HINLDRIGRLVKNGLLNKLKD SLPPCESCLEGKMTKRPFTGK YRAKEPLELIHSDLCGPMNVKARGGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP
        IDDYSRYGYLYLMEHK EALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLL+MVRSMMSYAQLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWG                                                 PKKLEPRSRLCQFVGYPKE RGGLFFDPQENRVFVSTN TFLEED 
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM
        MR+HKPRSKLVL EATDESTRVVDEV PSSRVDETTTSGQSHPSQSLRMPRRSGR+VSQP RYLGLTETQVVIPDDGVEDPLSYKQ MNDVDK+QWVKAM
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAM

Query:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK
        DLE+ESMYFNSVWEL DL EGVKPIGCKWIYKRKRDS GK
Subjt:  DLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGK

A0A5D3BNE1 Gag/pol protein4.1e-26493.81Show/hide
Query:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        M LKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSI+FSMNEAFI KNGVHICS KLE+NLYVL+PNE KAVLNHEMFRT
Subjt:  MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
        ANTQNKRQRIS NNNTYLWHLRLGHINLDRIGRLVKNGLLNKL+D SLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKA GGFEYFISF
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP
        IDDYS YGYLYL+EHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLL+MV SMMSY QLP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH
        SSFWGYAVETAVHILNNVPSK+V ETPFELWRGRKPSLSHFRIW CP HVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATF EEDH
Subjt:  SSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDH

Query:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYK
        MR+HKPR KLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGR +SQ  RYLGLTET VVIPD GVEDPL YK
Subjt:  MRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQSHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYK

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.5e-5326.05Show/hide
Query:  LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHIC-SAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHIN-
        LE++    +   NL+SV  L E   SI F  +   I KNG+ +  ++ + NN+          V+N + + + N ++K       NN  LWH R GHI+ 
Subjt:  LENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHIC-SAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRISPNNNTYLWHLRLGHIN-

Query:  -----LDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
             + R        LLN L ++S   CE CL GK  + PF     +   K PL ++HSD+CGP+         YF+ F+D ++ Y   YL+++KS+  
Subjt:  -----LDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRA--KEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL

Query:  EKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLPSSFWGYAVETAVHILNNVPS
          F+++  + E   + K+  L  D G EY+    + + ++ GI   L+ P TPQ NGVSER  RT+    R+M+S A+L  SFWG AV TA +++N +PS
Subjt:  EKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLPSSFWGYAVETAVHILNNVPS

Query:  KSV---SETPFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEAT
        +++   S+TP+E+W  +KP L H R++G   +V + N + K + +S    FVGY  E  G   +D    +  V+ +    E + + +   + + V  + +
Subjt:  KSV---SETPFELWRGRKPSLSHFRIWGCPAHVLVTNPK-KLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEAT

Query:  DE---------STRVVDEVGP--SSRVDETTTSGQSHPSQSLRMPRRSGRVV--------------------SQPNRYL---------------------
         E         S +++    P  S   D       S  S++   P  S +++                     + N+Y                      
Subjt:  DE---------STRVVDEVGP--SSRVDETTTSGQSHPSQSLRMPRRSGRVV--------------------SQPNRYL---------------------

Query:  ----GLTETQVVIPDDGVEDP---------------------LSYKQ--------------AMNDV-----------DKDQWVKAMDLEMESMYFNSVWE
              +ET   + + G+++P                     +SY +                NDV           DK  W +A++ E+ +   N+ W 
Subjt:  ----GLTETQVVIPDDGVEDP---------------------LSYKQ--------------AMNDV-----------DKDQWVKAMDLEMESMYFNSVWE

Query:  LVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQ
        +   PE    +  +W++  K +  G    +KARLVA+G+TQ
Subjt:  LVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQ

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.9e-8132.65Show/hide
Query:  TLKVGTGDVISARAVGDAKLFFG-NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        T+K+G         +GD  +       + L+++  VP ++ NL+S   L    Y   F+  +  + K  + I        LY       +  LN      
Subjt:  TLKVGTGDVISARAVGDAKLFFG-NKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF
              +  IS +    LWH R+GH++   +  L K  L++  K  ++ PC+ CL GK  +  F     R    L+L++SD+CGPM +++ GG +YF++F
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISF

Query:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP
        IDD SR  ++Y+++ K +  + F+++   VE    +K+K LRSD GGEY    F++Y   HGI+ + + PGTPQ NGV+ER NRT++  VRSM+  A+LP
Subjt:  IDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLP

Query:  SSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLE
         SFWG AV+TA +++N  PS  ++ E P  +W  ++ S SH +++GC   AHV      KL+ +S  C F+GY  E  G   +DP + +V  S +  F E
Subjt:  SSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCP--AHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLE

Query:  EDHM----RNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQ-------------------SHPSQSLRMP---RRSGRVVSQPNRYLGLTETQV
         +       + K ++ ++ +  T  ST   +     S  DE +  G+                    HP+Q        RRS R   +  RY   +   V
Subjt:  EDHM----RNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQ-------------------SHPSQSLRMP---RRSGRVVSQPNRYLGLTETQV

Query:  VIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL
        +I DD   +P S K+ ++  +K+Q +KAM  EMES+  N  ++LV+LP+G +P+ CKW++K K+D   K+  +KARLV KG+ Q KG+
Subjt:  VIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL

Q12491 Transposon Ty2-B Gag-Pol polyprotein1.1e-2425.49Show/hide
Query:  ISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRI
        I   A+G+    F N           P I  +L+S+S L     +  F+ N      +G  +       + Y L  ++   + +H    T N  NK +  
Subjt:  ISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRI

Query:  SPNNNTY-LWHLRLGHINLDRIGRLVKNGLLNKLKD-------VSLPPCESCLEGKMTKRPFTGKGYRAK-----EPLELIHSDLCGPMNVKARGGFEYF
        S N   Y L H  LGH N   I + +K   +  LK+        S   C  CL GK TK     KG R K     EP + +H+D+ GP++   +    YF
Subjt:  SPNNNTY-LWHLRLGHINLDRIGRLVKNGLLNKLKD-------VSLPPCESCLEGKMTKRPFTGKGYRAK-----EPLELIHSDLCGPMNVKARGGFEYF

Query:  ISFIDDYSRYGYLYLMEHKSE--ALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMS
        ISF D+ +R+ ++Y +  + E   L  F      ++N  + ++ +++ DRG EY +     +    GI +  +     + +GV+ER NRTLLN  R+++ 
Subjt:  ISFIDDYSRYGYLYLMEHKSE--ALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMS

Query:  YAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNP-KKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT
         + LP+  W  AVE +  I N++ S    ++  +        ++    +G P  V   NP  K+ PR      +   + + G + + P   +   +TN  
Subjt:  YAQLPSSFWGYAVETAVHILNNVPSKSVSETPFELWRGRKPSLSHFRIWGCPAHVLVTNP-KKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT

Query:  FLEEDHMR
         L+++  +
Subjt:  FLEEDHMR

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.2e-4824.38Show/hide
Query:  VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMF
        V  G  I     G   L   ++ + L N+  VP I +NL+SV  L          +  +F + +      GV +   K ++ LY     E     +  + 
Subjt:  VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE------HMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMF

Query:  RTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYF
          A+  +K    S       WH RLGH     +  ++ N  L+ L        C  CL  K  K PF+     +  PLE I+SD+     + +   + Y+
Subjt:  RTANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYF

Query:  ISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYA
        + F+D ++RY +LY ++ KS+  E F  +K  +EN    +I    SD GGE++ L   +Y  +HGI    S P TP+ NG+SER++R ++    +++S+A
Subjt:  ISFIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYA

Query:  QLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT
         +P ++W YA   AV+++N +P+  +  E+PF+   G  P+    R++GC  +  +   N  KL+ +SR C F+GY       L    Q +R+++S +  
Subjt:  QLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNAT

Query:  FLEE-----------DHMRNHKPRSKLVLSEATDESTR---------------------------------------------------VVDEVGPSSRV
        F E              ++  +  S  V S  T   TR                                                      + GP    
Subjt:  FLEE-----------DHMRNHKPRSKLVLSEATDESTR---------------------------------------------------VVDEVGPSSRV

Query:  DETTTSGQSHPS-----------------QSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMND-------------------------
          T T  Q+H S                 QSL  P +S      P      + T    P   +  P    Q +N+                         
Subjt:  DETTTSGQSHPS-----------------QSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMND-------------------------

Query:  -------------------VDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL
                           +  ++W  AM  E+ +   N  W+LV  P   V  +GC+WI+ +K +S G +  +KARLVAKGY Q  GL
Subjt:  -------------------VDKDQWVKAMDLEMESMYFNSVWELVDLPEG-VKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE26.5e-4925.76Show/hide
Query:  VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIYKN---GVHICSAKLENNLYVLRPNEAKAVLNHEMFRT
        +  G  I     G A L   ++ + L  +  VP I +NL+SV  L   +  S+ F    +F  K+   GV +   K ++ LY      ++AV    MF  
Subjt:  VGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIE-HMYSINFSMNEAFIYKN---GVHICSAKLENNLYVLRPNEAKAVLNHEMFRT

Query:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFIS
        A+  +K    S       WH RLGH +L  +  ++ N  L  L     L  C  C   K  K PF+     + +PLE I+SD+     + +   + Y++ 
Subjt:  ANTQNKRQRISPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLK-DVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFIS

Query:  FIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQL
        F+D ++RY +LY ++ KS+  + F  +K+ VEN    +I  L SD GGE++ LR  DY+ +HGI    S P TP+ NG+SER++R ++ M  +++S+A +
Subjt:  FIDDYSRYGYLYLMEHKSEALEKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQL

Query:  PSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFL
        P ++W YA   AV+++N +P+  +  ++PF+   G+ P+    +++GC  +  +   N  KLE +S+ C F+GY       L       R++ S +  F 
Subjt:  PSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVT--NPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFL

Query:  EE----------------------DHMRNHK--PRSKLVL------------------SEATDESTRVVDEVGPSSRVDETTTSGQSHPSQS--------
        E                        +  +H   P + LVL                  S +   +T+V     PSS +   ++S  + PS +        
Subjt:  EE----------------------DHMRNHK--PRSKLVL------------------SEATDESTRVVDEVGPSSRVDETTTSGQSHPSQS--------

Query:  ------------LRMPRRSGRVVSQPNRYLGLTETQVVIP----------------------------------------------------DDGVEDP-
                    L  P  +    + PN+   L ++ +  P                                                     DG+  P 
Subjt:  ------------LRMPRRSGRVVSQPNRYLGLTETQVVIP----------------------------------------------------DDGVEDP-

Query:  --LSY----------KQAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL
           SY          + A+  +  D+W +AM  E+ +   N  W+LV   P  V  +GC+WI+ +K +S G +  +KARLVAKGY Q  GL
Subjt:  --LSY----------KQAMNDVDKDQWVKAMDLEMESMYFNSVWELV-DLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 84.1e-1445.68Show/hide
Query:  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL
        ++P +Y +A   +    W  AMD E+ +M     WE+  LP   KPIGCKW+YK K +S G ++ +KARLVAKGYTQ +G+
Subjt:  EDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVAKGYTQGKGL

ATMG00300.1 Gag-Pol-related retrotransposon family protein6.8e-0940Show/hide
Query:  NNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV
        + T LWH RL H++   +  LVK G L+  K  SL  CE C+ GK  +  F+   +  K PL+ +HSDL G  +V
Subjt:  NNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.0e-0939.02Show/hide
Query:  NRTLLNMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSR
        NRT++  VRSM+    LP +F   A  TAVHI+N  PS +++   P E+W    P+ S+ R +GC A++   +  KL+PR++
Subjt:  NRTLLNMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVS-ETPFELWRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSR

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)4.0e-0934.23Show/hide
Query:  MPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARL
        M  RS   +++ N    LT T  +      ++P S   A+ D     W +AM  E++++  N  W LV  P     +GCKW++K K  S G +   KARL
Subjt:  MPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARL

Query:  VAKGYTQGKGL
        VAKG+ Q +G+
Subjt:  VAKGYTQGKGL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTATAAGAATGGTGTACATATTTGTT
CAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATT
TCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAAAAGATGTTTC
ATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTG
GTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTT
GAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTTCAGGACTA
TATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAGAGGAGAAATAGAACCTTGTTAAACATGGTTCGTTCAATGA
TGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTGTTTCTGAAACACCTTTCGAGCTA
TGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTGGAACCTCGTTCAAGGTTATGCCAATT
TGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAAGACCACATGAGAAATC
ATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTTGATGAAACCACCACATCAGGTCAA
TCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAACTCAAGTTGTCATACCAGATGATGG
TGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTGTGGG
AGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTTAAAGCTAGACTTGTGGCA
AAAGGGTATACCCAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGACACTCAAGGTTGGAACGGGAGATGTCATTTCAGCTCGTGCAGTGGGAGATGCTAAGTTGTTTTTCGGAAATAAATTCATGTTTTTGGAAAACTTGTACATAGTTCC
TAAAATTAAAAGGAACTTAGTTTCCGTTTCTTGTCTTATTGAACATATGTACTCAATTAATTTTTCTATGAATGAAGCGTTCATTTATAAGAATGGTGTACATATTTGTT
CAGCTAAGCTTGAAAACAACTTGTATGTATTAAGACCTAATGAAGCAAAAGCAGTTTTAAATCATGAGATGTTTAGAACTGCTAATACTCAAAATAAAAGGCAAAGAATT
TCTCCAAATAACAATACCTATCTTTGGCATTTAAGATTAGGTCACATAAATCTCGATCGGATCGGGAGATTGGTAAAGAATGGACTTCTAAACAAGTTAAAAGATGTTTC
ATTACCTCCATGTGAATCTTGTCTTGAAGGTAAAATGACAAAGAGACCTTTTACTGGAAAAGGTTATAGAGCCAAAGAGCCTTTAGAACTTATACATTCAGACCTCTGTG
GTCCGATGAATGTAAAAGCTAGAGGGGGTTTTGAATACTTCATCTCTTTTATAGATGATTATTCTAGGTATGGTTATTTATACTTAATGGAGCATAAGTCTGAAGCTCTT
GAAAAGTTCAAGGAGTATAAGACTGAAGTTGAAAATCTATTAAGTAAAAAGATTAAAATACTTCGATCTGATCGAGGTGGAGAGTACATGGATTTGAGATTTCAGGACTA
TATGATAGAACATGGAATCCAATCCCAACTCTCAGCACCTGGTACACCTCAACAAAATGGTGTATCAGAGAGGAGAAATAGAACCTTGTTAAACATGGTTCGTTCAATGA
TGAGTTACGCTCAATTGCCTAGCTCGTTTTGGGGGTATGCAGTAGAGACTGCAGTTCATATCTTGAACAATGTTCCCTCGAAGAGTGTTTCTGAAACACCTTTCGAGCTA
TGGAGAGGACGTAAACCTAGTTTAAGTCATTTCAGAATTTGGGGTTGTCCAGCACACGTATTAGTGACAAATCCCAAGAAGTTGGAACCTCGTTCAAGGTTATGCCAATT
TGTTGGTTACCCTAAAGAGACGAGAGGTGGTCTATTCTTTGATCCACAAGAAAATAGAGTGTTTGTATCGACAAATGCTACTTTCTTGGAAGAAGACCACATGAGAAATC
ATAAACCACGAAGCAAATTAGTATTAAGTGAAGCTACTGATGAATCAACAAGGGTTGTTGATGAAGTTGGTCCCTCATCAAGGGTTGATGAAACCACCACATCAGGTCAA
TCTCATCCTTCTCAATCGTTGAGAATGCCTCGACGCAGTGGGAGGGTTGTATCACAACCTAACCGCTATTTGGGTTTAACTGAAACTCAAGTTGTCATACCAGATGATGG
TGTTGAGGATCCATTGTCCTATAAACAGGCAATGAATGATGTAGATAAGGACCAATGGGTCAAAGCCATGGACCTTGAAATGGAGTCTATGTACTTCAATTCAGTGTGGG
AGCTTGTAGATCTACCTGAAGGGGTAAAACCTATAGGGTGCAAATGGATCTATAAGAGAAAGAGAGATTCAGCTGGGAAGGTACAGACCTTTAAAGCTAGACTTGTGGCA
AAAGGGTATACCCAAGGGAAGGGGTTGACTATGAGGAAACTTTCTCTCCTGTTGCTATGTTAA
Protein sequenceShow/hide protein sequence
MTLKVGTGDVISARAVGDAKLFFGNKFMFLENLYIVPKIKRNLVSVSCLIEHMYSINFSMNEAFIYKNGVHICSAKLENNLYVLRPNEAKAVLNHEMFRTANTQNKRQRI
SPNNNTYLWHLRLGHINLDRIGRLVKNGLLNKLKDVSLPPCESCLEGKMTKRPFTGKGYRAKEPLELIHSDLCGPMNVKARGGFEYFISFIDDYSRYGYLYLMEHKSEAL
EKFKEYKTEVENLLSKKIKILRSDRGGEYMDLRFQDYMIEHGIQSQLSAPGTPQQNGVSERRNRTLLNMVRSMMSYAQLPSSFWGYAVETAVHILNNVPSKSVSETPFEL
WRGRKPSLSHFRIWGCPAHVLVTNPKKLEPRSRLCQFVGYPKETRGGLFFDPQENRVFVSTNATFLEEDHMRNHKPRSKLVLSEATDESTRVVDEVGPSSRVDETTTSGQ
SHPSQSLRMPRRSGRVVSQPNRYLGLTETQVVIPDDGVEDPLSYKQAMNDVDKDQWVKAMDLEMESMYFNSVWELVDLPEGVKPIGCKWIYKRKRDSAGKVQTFKARLVA
KGYTQGKGLTMRKLSLLLLC