CuGenDBv2

Pay0009940 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0009940
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr09:14024969..14027500
RNA-Seq Expression: Pay0009940
Synteny: Pay0009940
Gene Ontology terms: GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
InterPro domains: IPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily
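
The genome location in this record is written as `chromosome:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the page does not state the convention), the string can be split into its parts and the span length derived. The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(text):
    """Split a 'chr09:14024969..14027500' style string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("chr09:14024969..14027500")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

If the coordinates were instead 0-based or half-open, the span would differ by one, so the convention should be confirmed against the database documentation before relying on the number.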


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa] (e value: 1.1e-250; % identity: 77.74)
Query:  NSW--DYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF----------------------------------------------VVMGDFNAIRVHFEA
        N W  +YSCSYSNSGVGRIWV+WKK RFSF T V+DE+F                                              VVMGDFNAIRVH EA
Subjt:  NSW--DYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF----------------------------------------------VVMGDFNAIRVHFEA

Query:  FGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGISDHSPILFYPSFQQNSR-----------
        FGGSPIQGEME+FDLA RDADLVEPSVQGNWFTWTSKV GSGMLRRLDR+LVNDEWLSAWPT+ +NVLPWGISDHSPILFYPSFQ NSR           
Subjt:  FGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGISDHSPILFYPSFQQNSR-----------

Query:  -DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKS
         +PSFIEVVARMW RHEGVS LVSLMRNL +LKP LRR+FGRHI+SL+EEVHIAKE MD AQREVE NP+SDVLSRQ  LATE FWTAVRLEEASLRQKS
Subjt:  -DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKS

Query:  RIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVL
        ++RWL LGDQNTAFFHR VRSR+SRNSLLSLVD+DGSRVSSHDGV Q+AVNYF NSLGSQEIGYREL P+IDDIVQF+WSEECCQALQ+PISREEVRRVL
Subjt:  RIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVL

Query:  FSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQ
        FSMDSGKAPGPDGFSVGF+KGAWSVV EDFC+ VLHFFETCYLPIGVNAT ITLIPK  GAE++E+FRPISCCNV+YKCISKILADRLR+WLPSFI SNQ
Subjt:  FSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQ

Query:  SAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLLLLL
        SAFIPGRSII+NILLCQELV GYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLL+ +
Subjt:  SAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLLLLL

KAA0059841.1 reverse transcriptase [Cucumis melo var. makuwa] (e value: 5.6e-268; % identity: 71.53)
Query:  MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVV
        MELWTEA LA+VAS VGKPI+LDL TKE  RLSYARVCV+LEG  NM A+ITV+L GVDFNVS+NYEWKP+KCNLCCAFGHSG KC RSVESK IQEEVV
Subjt:  MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVV

Query:  SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG
         KII GK +G+DS+PC +VVLESFKQLEE EIRSSPNRH S                                               D DKWAL++IEG
Subjt:  SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG

Query:  SPPPLQVDEGT-----VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF---------------------------------
        SPPPLQVDEGT     V  S          NS DY   YSNSGVG+IWV+WKKNRFSFST VVDEQF                                 
Subjt:  SPPPLQVDEGT-----VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF---------------------------------

Query:  -------------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGI
                     VVMGDFNAIRVH EAFGGSPIQGEMEDFDLA RDADLVEPSVQGNWFTWTSKVHGSGMLRRLDR+LVND WLSAWPT+LVNVLPWGI
Subjt:  -------------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGI

Query:  SDHSPILFYPSFQQNSR------------DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSD
        SDHSPILFYPSFQ NS+            DPSFIEVVARMW RHEGVSPLVSLMRNL +LKPTLRR+FGRHIQSL+EEVHIAKE MDRAQR+VE N MSD
Subjt:  SDHSPILFYPSFQQNSR------------DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSD

Query:  VLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVID
        VLSRQ  LATE FWTAVRLEEASLRQKSRIRWL+LGDQNT FFHR VRSRMSRNSLLSLVD+DGSRVSSHDGV QLAVNYFRNSLGSQEIGYREL PVID
Subjt:  VLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVID

Query:  DIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNAT
        DI+QF+WSEECCQALQ+PISREEVRRVLFSMDSGKAPGPDGFSVG FKG WSVV EDFCDVVLHFFETCYLP+GVNAT
Subjt:  DIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNAT

KAA0062888.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa] (e value: 0.0e+00; % identity: 87.26)
Query:  MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVV
        MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCA GHSGGKCPRSVESKIIQEEVV
Subjt:  MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVV

Query:  SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG
        SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG
Subjt:  SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG

Query:  SPPPLQVDEGT-----------------------------VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF---------
        SPPPLQVDEGT                             VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF         
Subjt:  SPPPLQVDEGT-----------------------------VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF---------

Query:  -------------------------------------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLD
                                             VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLD
Subjt:  -------------------------------------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLD

Query:  RILVNDEWLSAWPTLLVNVLPWGISDHSPILFYPSFQQNSRDPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDR
        RILVNDEWLSAWPTLLV                   Q    DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDR
Subjt:  RILVNDEWLSAWPTLLVNVLPWGISDHSPILFYPSFQQNSRDPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDR

Query:  AQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQ
        AQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQ
Subjt:  AQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQ

Query:  EIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRG
        EIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRG
Subjt:  EIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRG

Query:  AEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGL
        AEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFL G+
Subjt:  AEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGL

TYK28099.1 uncharacterized protein E5676_scaffold1467G00020 [Cucumis melo var. makuwa] (e value: 4.1e-247; % identity: 62.2)
Query:  KPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVVSKIIPGKGEGVDSEPCA
        KPI+LD ATK+  RLSYARVCV+LEG  NM AEITV+LRGVDFNVSVNYEWKP+KCNLCCAFGHS  KC RSVESK IQEEVV      KG+ VD E C 
Subjt:  KPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVVSKIIPGKGEGVDSEPCA

Query:  KVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEGSPPPLQVDEGT------
        +VVLESFKQ+E+GEIRSSPNRH S VEK VGK DEFTLVTRKKSELVS+RDRGKS+ V MPNSFGSLLEVGD DKWAL++IEGS PPLQVDEGT      
Subjt:  KVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEGSPPPLQVDEGT------

Query:  --------------VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF-----------------------------------
                      VRE NFD VSRRF NSWDYSCSYSNSGVGRIWV+WKKNRFSFST V+DEQF                                   
Subjt:  --------------VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF-----------------------------------

Query:  -----------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGISD
                   VVM DFNAIRVH EAF GSPIQGEMEDF+LA RDADLVEPSVQGNWFTWTSKV GSGMLRRLDR+LVND+WLS WPT+LVNVLPWGISD
Subjt:  -----------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGISD

Query:  HSPILFYPSFQQNSR------------DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSDVL
        H PILFYPSFQ +++            DPSFIEVV RMW RHEGVSPLV LMRNL  LKP LRRRFGRHI+ L+EEV I KE MD AQRE          
Subjt:  HSPILFYPSFQQNSR------------DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSDVL

Query:  SRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDI
                                                                                 +AVNYFRNSLGSQEIGYREL PVIDDI
Subjt:  SRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDI

Query:  VQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCN
        VQF+WSEECCQALQ+PISREEVRRVLFSMDSGKAPGPDGFSV                            +G+NAT ITLIPK  GAE++E+F PISC N
Subjt:  VQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCN

Query:  VIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELV
        V+YKCISKILADRLRVWLPSFI SNQSAFIPGRSII+NILLCQEL+
Subjt:  VIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELV

XP_008466769.1 PREDICTED: uncharacterized protein LOC103504100 [Cucumis melo] (e value: 8.2e-296; % identity: 77.29)
Query:  MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVV
        MELWTEA LA+VAS VGKPI+LDL TKE  RLSYARVCV+LEG  NM A+ITV+L GVDFNVS+NYEWKP+KCNLCCAFGHSG KC RSVESK IQEEVV
Subjt:  MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVV

Query:  SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG
         KII GK +G+DS+PC +VVLESFKQLEE EIRSSPNRH S VE+  GK DEFTLVTRKKSELVSVRDRGKS+ V MPNSFGSLLEVGD DKWAL++IEG
Subjt:  SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG

Query:  SPPPLQVDEGT-----VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF---------------------------------
        SPPPLQVDEGT     V  S          NS DY   YSNSGVG+IWV+WKKNRFSFST VVDEQF                                 
Subjt:  SPPPLQVDEGT-----VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF---------------------------------

Query:  -------------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGI
                     VVMGDFNAIRVH EAFGGSPIQGEMEDFDLA RDADLVEPSVQGNWFTWTSKVHGSGMLRRLDR+LVND WLSAWPT+LVNVLPWGI
Subjt:  -------------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGI

Query:  SDHSPILFYPSFQQNSR------------DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSD
        SDHSPILFYPSFQ NS+            DPSFIEVVARMW RHEGVSPLVSLMRNL +LKPTLRR+FGRHIQSL+EEVHIAKE MDRAQR+VE N MSD
Subjt:  SDHSPILFYPSFQQNSR------------DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSD

Query:  VLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVID
        VLSRQ  LATE FWTAVRLEEASLRQKSRIRWL+LGDQNT FFHR VRSRMSRNSLLSLVD+DGSRVSSHDGV QLAVNYFRNSLGSQEIGYREL PVID
Subjt:  VLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVID

Query:  DIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNAT
        DI+QF+WSEECCQALQ+PISREEVRRVLFSMDSGKAPGPDGFSVG FKG WSVV EDFCDVVLHFFETCYLP+GVNAT
Subjt:  DIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNAT

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3CRZ6 uncharacterized protein LOC103504100 (e value: 4.0e-296; % identity: 77.29)
Query:  MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVV
        MELWTEA LA+VAS VGKPI+LDL TKE  RLSYARVCV+LEG  NM A+ITV+L GVDFNVS+NYEWKP+KCNLCCAFGHSG KC RSVESK IQEEVV
Subjt:  MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVV

Query:  SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG
         KII GK +G+DS+PC +VVLESFKQLEE EIRSSPNRH S VE+  GK DEFTLVTRKKSELVSVRDRGKS+ V MPNSFGSLLEVGD DKWAL++IEG
Subjt:  SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG

Query:  SPPPLQVDEGT-----VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF---------------------------------
        SPPPLQVDEGT     V  S          NS DY   YSNSGVG+IWV+WKKNRFSFST VVDEQF                                 
Subjt:  SPPPLQVDEGT-----VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF---------------------------------

Query:  -------------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGI
                     VVMGDFNAIRVH EAFGGSPIQGEMEDFDLA RDADLVEPSVQGNWFTWTSKVHGSGMLRRLDR+LVND WLSAWPT+LVNVLPWGI
Subjt:  -------------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGI

Query:  SDHSPILFYPSFQQNSR------------DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSD
        SDHSPILFYPSFQ NS+            DPSFIEVVARMW RHEGVSPLVSLMRNL +LKPTLRR+FGRHIQSL+EEVHIAKE MDRAQR+VE N MSD
Subjt:  SDHSPILFYPSFQQNSR------------DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSD

Query:  VLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVID
        VLSRQ  LATE FWTAVRLEEASLRQKSRIRWL+LGDQNT FFHR VRSRMSRNSLLSLVD+DGSRVSSHDGV QLAVNYFRNSLGSQEIGYREL PVID
Subjt:  VLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVID

Query:  DIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNAT
        DI+QF+WSEECCQALQ+PISREEVRRVLFSMDSGKAPGPDGFSVG FKG WSVV EDFCDVVLHFFETCYLP+GVNAT
Subjt:  DIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNAT

A0A5A7SPE5 Reverse transcriptase domain-containing protein (e value: 2.0e-247; % identity: 62.2)
Query:  KPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVVSKIIPGKGEGVDSEPCA
        KPI+LD ATK+  RLSYARVCV+LEG  NM AEITV+LRGVDFNVSVNYEWKP+KCNLCCAFGHS  KC RSVESK IQEEVV      KG+ VD E C 
Subjt:  KPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVVSKIIPGKGEGVDSEPCA

Query:  KVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEGSPPPLQVDEGT------
        +VVLESFKQ+E+GEIRSSPNRH S VEK VGK DEFTLVTRKKSELVS+RDRGKS+ V MPNSFGSLLEVGD DKWAL++IEGS PPLQVDEGT      
Subjt:  KVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEGSPPPLQVDEGT------

Query:  --------------VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF-----------------------------------
                      VRE NFD VSRRF NSWDYSCSYSNSGVGRIWV+WKKNRFSFST V+DEQF                                   
Subjt:  --------------VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF-----------------------------------

Query:  -----------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGISD
                   VVM DFNAIRVH EAF GSPIQGEMEDF+LA RDADLVEPSVQGNWFTWTSKV GSGMLRRLDR+LVND+WLS WPT+LVNVLPWGISD
Subjt:  -----------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGISD

Query:  HSPILFYPSFQQNSR------------DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSDVL
        H PILFYPSFQ +++            DPSFIEVV RMW RHEGVSPLV LMRNL  LKP LRRRFGRHI+ L+EEV I KE MD AQRE          
Subjt:  HSPILFYPSFQQNSR------------DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSDVL

Query:  SRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDI
                                                                                 +AVNYFRNSLGSQEIGYREL PVIDDI
Subjt:  SRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDI

Query:  VQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCN
        VQF+WSEECCQALQ+PISREEVRRVLFSMDSGKAPGPDGFSV                            +G+NAT ITLIPK  GAE++E+F PISC N
Subjt:  VQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCN

Query:  VIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELV
        V+YKCISKILADRLRVWLPSFI SNQSAFIPGRSII+NILLCQEL+
Subjt:  VIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELV

A0A5A7TZS0 Reverse transcriptase domain-containing protein (e value: 5.1e-251; % identity: 77.74)
Query:  NSW--DYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF----------------------------------------------VVMGDFNAIRVHFEA
        N W  +YSCSYSNSGVGRIWV+WKK RFSF T V+DE+F                                              VVMGDFNAIRVH EA
Subjt:  NSW--DYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF----------------------------------------------VVMGDFNAIRVHFEA

Query:  FGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGISDHSPILFYPSFQQNSR-----------
        FGGSPIQGEME+FDLA RDADLVEPSVQGNWFTWTSKV GSGMLRRLDR+LVNDEWLSAWPT+ +NVLPWGISDHSPILFYPSFQ NSR           
Subjt:  FGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGISDHSPILFYPSFQQNSR-----------

Query:  -DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKS
         +PSFIEVVARMW RHEGVS LVSLMRNL +LKP LRR+FGRHI+SL+EEVHIAKE MD AQREVE NP+SDVLSRQ  LATE FWTAVRLEEASLRQKS
Subjt:  -DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKS

Query:  RIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVL
        ++RWL LGDQNTAFFHR VRSR+SRNSLLSLVD+DGSRVSSHDGV Q+AVNYF NSLGSQEIGYREL P+IDDIVQF+WSEECCQALQ+PISREEVRRVL
Subjt:  RIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVL

Query:  FSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQ
        FSMDSGKAPGPDGFSVGF+KGAWSVV EDFC+ VLHFFETCYLPIGVNAT ITLIPK  GAE++E+FRPISCCNV+YKCISKILADRLR+WLPSFI SNQ
Subjt:  FSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQ

Query:  SAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLLLLL
        SAFIPGRSII+NILLCQELV GYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLL+ +
Subjt:  SAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLLLLL

A0A5A7V275 Reverse transcriptase (e value: 2.7e-268; % identity: 71.53)
Query:  MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVV
        MELWTEA LA+VAS VGKPI+LDL TKE  RLSYARVCV+LEG  NM A+ITV+L GVDFNVS+NYEWKP+KCNLCCAFGHSG KC RSVESK IQEEVV
Subjt:  MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVV

Query:  SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG
         KII GK +G+DS+PC +VVLESFKQLEE EIRSSPNRH S                                               D DKWAL++IEG
Subjt:  SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG

Query:  SPPPLQVDEGT-----VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF---------------------------------
        SPPPLQVDEGT     V  S          NS DY   YSNSGVG+IWV+WKKNRFSFST VVDEQF                                 
Subjt:  SPPPLQVDEGT-----VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF---------------------------------

Query:  -------------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGI
                     VVMGDFNAIRVH EAFGGSPIQGEMEDFDLA RDADLVEPSVQGNWFTWTSKVHGSGMLRRLDR+LVND WLSAWPT+LVNVLPWGI
Subjt:  -------------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGI

Query:  SDHSPILFYPSFQQNSR------------DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSD
        SDHSPILFYPSFQ NS+            DPSFIEVVARMW RHEGVSPLVSLMRNL +LKPTLRR+FGRHIQSL+EEVHIAKE MDRAQR+VE N MSD
Subjt:  SDHSPILFYPSFQQNSR------------DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSD

Query:  VLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVID
        VLSRQ  LATE FWTAVRLEEASLRQKSRIRWL+LGDQNT FFHR VRSRMSRNSLLSLVD+DGSRVSSHDGV QLAVNYFRNSLGSQEIGYREL PVID
Subjt:  VLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVID

Query:  DIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNAT
        DI+QF+WSEECCQALQ+PISREEVRRVLFSMDSGKAPGPDGFSVG FKG WSVV EDFCDVVLHFFETCYLP+GVNAT
Subjt:  DIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNAT

A0A5A7V5J2 Non-LTR retroelement reverse transcriptase-like protein (e value: 0.0e+00; % identity: 87.26)
Query:  MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVV
        MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCA GHSGGKCPRSVESKIIQEEVV
Subjt:  MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVV

Query:  SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG
        SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG
Subjt:  SKIIPGKGEGVDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEG

Query:  SPPPLQVDEGT-----------------------------VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF---------
        SPPPLQVDEGT                             VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF         
Subjt:  SPPPLQVDEGT-----------------------------VRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQF---------

Query:  -------------------------------------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLD
                                             VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLD
Subjt:  -------------------------------------VVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLD

Query:  RILVNDEWLSAWPTLLVNVLPWGISDHSPILFYPSFQQNSRDPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDR
        RILVNDEWLSAWPTLLV                   Q    DPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDR
Subjt:  RILVNDEWLSAWPTLLVNVLPWGISDHSPILFYPSFQQNSRDPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDR

Query:  AQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQ
        AQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQ
Subjt:  AQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQ

Query:  EIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRG
        EIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRG
Subjt:  EIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRG

Query:  AEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGL
        AEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFL G+
Subjt:  AEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGL

SwissProt top hits (columns: e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 3.4e-18; % identity: 24.69)
Query:  RPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSV
        R ++ +  +N + ++ +  G   +    +      Y+++   ++     E+   +D     R ++E  ++L  PI+  E+  ++ S+ + K+PGPDGF+ 
Subjt:  RPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSV

Query:  GFFKGAWSVVEEDFCDVVLHFFETC----YLPIGVNATVITLIPK-RRGAEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIID
         F++      +E+    +L  F++      LP       I LIPK  R   + E FRPIS  N+  K ++KILA+R++  +   I  +Q  FIPG     
Subjt:  GFFKGAWSVVEEDFCDVVLHFFETC----YLPIGVNATVITLIPK-RRGAEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIID

Query:  NILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLL
        NI     +++  +    K    + +D +KA+D +   F+   L
Subjt:  NILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLL

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 6.4e-17; % identity: 31.79)
Query:  LQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETC----YLPIGVNATVITLIPK-RRGAEQMEEFRPISCCNVIYKCIS
        L  PIS +E+  V+ S+ + K+PGPDGFS  F++      +ED   ++   F        LP       ITLIPK ++   ++E FRPIS  N+  K ++
Subjt:  LQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETC----YLPIGVNATVITLIPK-RRGAEQMEEFRPISCCNVIYKCIS

Query:  KILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLL
        KILA+R++  + + I  +Q  FIPG     NI     ++   +    K    + +D +KA+D +   F+  +L
Subjt:  KILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLL

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.1e-27; % identity: 23.73)
Query:  FTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGISDHSPILFYPSFQQNSRDPS---FIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRF
        FT+     G     R+DRI ++   +S   +  + + P+  SDH+ +    S   +    +   F   +    G  + V       R  Q+   TL + +
Subjt:  FTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGISDHSPILFYPSFQQNSRDPS---FIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRF

Query:  --GR-HIQSLNEEVHIAKENMDRAQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQ-----------KSRIRWLELGDQNTAFFHRPVRSRMSRN
          G+ H++ L +E   +      A+ E  +  + D+  R  G   +A        + +LR            +SR++ L   D+ + FF+   + + +R 
Subjt:  --GR-HIQSLNEEVHIAKENMDRAQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQ-----------KSRIRWLELGDQNTAFFHRPVRSRMSRN

Query:  SLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDIVQFRW------SEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFK
         +  L   DG+ +   + +   A ++++N             P+  D  +  W      SE   + L+ PI+ +E+ + L  M   K+PG DG ++ FF+
Subjt:  SLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDIVQFRW------SEECCQALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFK

Query:  GAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELV
          W  +  DF  V+   F+   LP+     V++L+PK+     ++ +RP+S  +  YK ++K ++ RL+  L   I  +QS  +PGR+I DN+ L ++L+
Subjt:  GAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCISKILADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELV

Query:  EGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLL
          +   +G     L +D +KA+D V+  +L G L
Subjt:  EGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLL

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM (e value: 2.5e-05; % identity: 25.75)
Query:  ISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCISKILADRLRV
        I+ +++R    S+ S  +PGPDG +    K A  V       ++        LP  +       IPK   A++ ++FRPIS  +V+ + ++ ILA RL  
Subjt:  ISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCISKILADRLRV

Query:  ---WLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLL
           W P      Q  F+P     DN  +   ++   H +  +      +D+ KA+DS++   ++  L
Subjt:  ---WLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLL

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) (e value: 9.6e-05; % identity: 28.12)
Query:  PISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCISKILADRLR
        PI+REE++  +       APG DG +V       + +  +F  V LH     ++P    A   TLIPK    E    +RPI+  + + + + +ILA RL 
Subjt:  PISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCISKILADRLR

Query:  VWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKV---DLQKAYDSVN
          +             G + ID  L+   L++ Y  +  + R T  V   D++KA+D+V+
Subjt:  VWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKV---DLQKAYDSVN

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G40390.1 DNAse I-like superfamily protein (e value: 4.3e-16; % identity: 29.22)
Query:  RFSFSTCVVDEQFVVMGDFNAIRVHFEAFGGSP----IQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVL
        R S S+ + +  ++V+GDFN I    E +   P    +QG +ED     RD+DLV+   +G  +TW++    + +LR+LDR +VN  WL+ +PT      
Subjt:  RFSFSTCVVDEQFVVMGDFNAIRVHFEAFGGSP----IQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVL

Query:  PWGISDHSPILF-------------YPSFQQNSRDPSFIEVVARMWGRHEGV-SPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVE
        P   SDH+  +              +  F   S  P FI  +   W +   V S + SL   L+  K   R         LN      +      Q ++ 
Subjt:  PWGISDHSPILF-------------YPSFQQNSRDPSFIEVVARMWGRHEGV-SPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVE

Query:  HNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGD
         NP SD L R   +A + +       E+  +QKSRI+WL+ GD
Subjt:  HNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGD

AT1G43760.1 DNAse I-like superfamily protein (e value: 1.2e-58; % identity: 31.49)
Query:  LTVIEGSPPPLQVDEGTVRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQFVVMGDFNAIRV---HFEAFGGS-PIQGEMEDF
        ++ + GSP  L V  G+  ESN   +      SW    +Y  S +GRIW+VW  +         D+  +++GDF+ I     H+     S P++G +E+F
Subjt:  LTVIEGSPPPLQVDEGTVRESNFDYVSRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQFVVMGDFNAIRV---HFEAFGGS-PIQGEMEDF

Query:  DLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGISDHSPILF------------YPSFQQNSRDPSFIEVVARMW
            RD+DLV+   +G  +TW++    + ++R+LDR + N +W S++P+ +      G+SDHSP +             +  F   S  P+F+  +   W
Subjt:  DLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVNDEWLSAWPTLLVNVLPWGISDHSPILF------------YPSFQQNSRDPSFIEVVARMW

Query:  GRHEGV-SPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNT
             V S + SL  +L+  K   +    +   ++  +   A ++++  Q ++  NP SD L R   +A + +       E+  RQKSRI+WL+ GD NT
Subjt:  GRHEGV-SPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSDVLSRQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNT

Query:  AFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGS-QEIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGP
         FFH+ + +  ++N +  L   D  RV +   V ++ V Y+ + LGS  +I   +    I DI  FR ++     L    S +E+   +F+M   KAPGP
Subjt:  AFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGS-QEIGYRELFPVIDDIVQFRWSEECCQALQIPISREEVRRVLFSMDSGKAPGP

Query:  DGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCIS
        D F+  FF  +W VV++     V  FF T +L    NAT ITLIPK  G +Q+  FRP+SCC V+YK I+
Subjt:  DGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCIS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 3.9e-09; % identity: 41.1)
Query:  LADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSG-KPRCTLKVDLQKAYDSVNWDFLFGLLL
        + +RL+  + + IG  Q++FIPGR   DNI+  QE V       G K    LK+DL+KAYD + WD+L   L+
Subjt:  LADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSG-KPRCTLKVDLQKAYDSVNWDFLFGLLL
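
The % identity listed for each hit appears consistent with counting the identical residues (the letters on the middle line of an alignment) and dividing by the number of alignment columns. The sketch below applies that reading to the AT4G20520.1 alignment. This convention is an assumption, since the page does not say how the value is computed, and `percent_identity` is a hypothetical helper name.

```python
def percent_identity(query_row, midline):
    """Letters on the midline mark identical residues; '+' and spaces do not count."""
    identical = sum(ch.isalpha() for ch in midline)
    columns = len(query_row)  # alignment columns, gap characters included
    return 100.0 * identical / columns

# Query row and middle line of the AT4G20520.1 alignment, without the row labels.
query_row = "LADRLRVWLPSFIGSNQSAFIPGRSIIDNILLCQELVEGYHLNSG-KPRCTLKVDLQKAYDSVNWDFLFGLLL"
midline = "+ +RL+  + + IG  Q++FIPGR   DNI+  QE V       G K    LK+DL+KAYD + WD+L   L+"

print(round(percent_identity(query_row, midline), 2))
```

Because only letters are counted, the result does not depend on the exact spacing of the middle line, which is easily damaged when the page is copied as plain text.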


Sequences
CDS sequence
ATGGAGTTATGGACAGAAGCATGGTTGGCTATTGTAGCGAGTGTTGTGGGTAAGCCTATAACTTTAGATTTGGCCACTAAGGAGCATAGTAGGCTTTCCTACGCTCGTGT
TTGTGTCAAATTAGAGGGAGAGTTCAATATGCGTGCTGAAATTACTGTTAATCTCAGAGGAGTGGATTTTAATGTTTCAGTAAATTATGAGTGGAAGCCTCAGAAGTGTA
ATTTGTGTTGTGCATTTGGGCACTCTGGTGGCAAGTGTCCTAGAAGTGTGGAGAGTAAAATCATACAGGAGGAGGTTGTGAGCAAGATTATTCCTGGAAAAGGGGAGGGA
GTAGACAGTGAACCTTGTGCGAAAGTTGTTCTAGAATCATTCAAACAGTTGGAGGAAGGGGAAATTAGGAGTTCTCCAAATAGACATGGTAGTCACGTGGAGAAGGAGGT
GGGAAAACGAGATGAATTTACCCTTGTAACTCGCAAGAAGAGTGAGTTGGTATCTGTTAGAGATCGTGGTAAGAGTATAGTGGTGGCTATGCCAAACTCTTTTGGTAGTC
TTTTGGAGGTGGGTGACGTTGACAAGTGGGCTTTAACAGTAATAGAGGGCTCACCACCGCCCTTACAGGTGGATGAAGGTACTGTTCGCGAAAGTAATTTTGATTATGTT
TCTAGAAGATTTGGTAATTCTTGGGATTACTCTTGCAGTTACAGTAATAGTGGTGTTGGTCGGATTTGGGTGGTGTGGAAGAAGAATCGCTTTTCTTTCTCTACTTGTGT
GGTGGATGAGCAGTTTGTTGTCATGGGAGATTTTAATGCTATTAGAGTTCATTTTGAAGCATTTGGGGGATCTCCTATTCAGGGTGAGATGGAGGATTTTGATCTGGCTA
CACGTGATGCTGATTTAGTAGAGCCTTCGGTGCAGGGTAACTGGTTCACTTGGACTAGTAAGGTGCATGGGTCTGGGATGTTGCGACGACTGGATCGTATTTTAGTGAAT
GATGAATGGCTATCTGCATGGCCTACTTTATTGGTAAATGTGCTACCATGGGGTATTTCCGACCATTCTCCTATTTTATTCTATCCTAGCTTTCAGCAAAATAGTAGAGA
TCCTTCATTTATTGAGGTGGTTGCTAGGATGTGGGGTCGGCATGAGGGTGTCTCTCCGCTAGTGAGCCTCATGAGAAATCTTCAAAATCTCAAACCTACCCTCCGTAGGC
GTTTTGGTAGGCACATCCAGAGCCTAAATGAGGAGGTGCACATTGCAAAGGAGAACATGGATAGGGCTCAGAGAGAGGTAGAACATAATCCTATGTCGGATGTTTTGAGT
CGCCAAGTAGGCCTTGCTACTGAGGCTTTTTGGACAGCAGTTAGATTGGAGGAAGCCTCTCTTCGTCAGAAATCCAGAATTCGATGGTTAGAACTTGGTGATCAGAATAC
GGCCTTTTTCCATCGACCTGTTCGTTCTCGTATGAGTCGTAATTCTCTGCTTTCTTTAGTAGATGCGGATGGCTCAAGGGTTTCTTCACATGATGGGGTGGTTCAGCTGG
CAGTTAATTATTTTCGTAATAGTTTGGGATCCCAGGAGATTGGTTATAGAGAGTTGTTCCCCGTTATTGATGATATTGTTCAGTTTCGGTGGTCTGAGGAGTGTTGTCAG
GCGTTACAGATACCTATTAGCCGGGAGGAAGTTAGGAGGGTCTTATTCTCTATGGATAGTGGAAAGGCTCCTGGTCCTGATGGGTTCTCTGTAGGATTCTTCAAAGGTGC
CTGGTCTGTGGTTGAGGAAGATTTTTGTGATGTTGTTTTGCATTTCTTTGAGACTTGTTATCTTCCAATAGGAGTTAATGCTACTGTTATTACCCTCATCCCTAAGCGTC
GTGGGGCTGAGCAGATGGAGGAATTTCGGCCTATTTCTTGTTGTAATGTGATATACAAATGCATTTCTAAGATTTTGGCTGATAGGCTTCGTGTGTGGCTTCCTTCTTTT
ATCGGTAGTAACCAGTCTGCTTTTATCCCTGGGAGGAGTATTATTGATAACATTCTGCTGTGTCAGGAACTGGTCGAGGGTTATCATCTTAATTCTGGTAAACCTCGATG
TACTTTGAAAGTTGATCTTCAAAAAGCCTATGACTCTGTTAATTGGGATTTTCTGTTTGGATTGTTATTGCTATTAGTACTCCTTTGA
mRNA sequence
ATGGAGTTATGGACAGAAGCATGGTTGGCTATTGTAGCGAGTGTTGTGGGTAAGCCTATAACTTTAGATTTGGCCACTAAGGAGCATAGTAGGCTTTCCTACGCTCGTGT
TTGTGTCAAATTAGAGGGAGAGTTCAATATGCGTGCTGAAATTACTGTTAATCTCAGAGGAGTGGATTTTAATGTTTCAGTAAATTATGAGTGGAAGCCTCAGAAGTGTA
ATTTGTGTTGTGCATTTGGGCACTCTGGTGGCAAGTGTCCTAGAAGTGTGGAGAGTAAAATCATACAGGAGGAGGTTGTGAGCAAGATTATTCCTGGAAAAGGGGAGGGA
GTAGACAGTGAACCTTGTGCGAAAGTTGTTCTAGAATCATTCAAACAGTTGGAGGAAGGGGAAATTAGGAGTTCTCCAAATAGACATGGTAGTCACGTGGAGAAGGAGGT
GGGAAAACGAGATGAATTTACCCTTGTAACTCGCAAGAAGAGTGAGTTGGTATCTGTTAGAGATCGTGGTAAGAGTATAGTGGTGGCTATGCCAAACTCTTTTGGTAGTC
TTTTGGAGGTGGGTGACGTTGACAAGTGGGCTTTAACAGTAATAGAGGGCTCACCACCGCCCTTACAGGTGGATGAAGGTACTGTTCGCGAAAGTAATTTTGATTATGTT
TCTAGAAGATTTGGTAATTCTTGGGATTACTCTTGCAGTTACAGTAATAGTGGTGTTGGTCGGATTTGGGTGGTGTGGAAGAAGAATCGCTTTTCTTTCTCTACTTGTGT
GGTGGATGAGCAGTTTGTTGTCATGGGAGATTTTAATGCTATTAGAGTTCATTTTGAAGCATTTGGGGGATCTCCTATTCAGGGTGAGATGGAGGATTTTGATCTGGCTA
CACGTGATGCTGATTTAGTAGAGCCTTCGGTGCAGGGTAACTGGTTCACTTGGACTAGTAAGGTGCATGGGTCTGGGATGTTGCGACGACTGGATCGTATTTTAGTGAAT
GATGAATGGCTATCTGCATGGCCTACTTTATTGGTAAATGTGCTACCATGGGGTATTTCCGACCATTCTCCTATTTTATTCTATCCTAGCTTTCAGCAAAATAGTAGAGA
TCCTTCATTTATTGAGGTGGTTGCTAGGATGTGGGGTCGGCATGAGGGTGTCTCTCCGCTAGTGAGCCTCATGAGAAATCTTCAAAATCTCAAACCTACCCTCCGTAGGC
GTTTTGGTAGGCACATCCAGAGCCTAAATGAGGAGGTGCACATTGCAAAGGAGAACATGGATAGGGCTCAGAGAGAGGTAGAACATAATCCTATGTCGGATGTTTTGAGT
CGCCAAGTAGGCCTTGCTACTGAGGCTTTTTGGACAGCAGTTAGATTGGAGGAAGCCTCTCTTCGTCAGAAATCCAGAATTCGATGGTTAGAACTTGGTGATCAGAATAC
GGCCTTTTTCCATCGACCTGTTCGTTCTCGTATGAGTCGTAATTCTCTGCTTTCTTTAGTAGATGCGGATGGCTCAAGGGTTTCTTCACATGATGGGGTGGTTCAGCTGG
CAGTTAATTATTTTCGTAATAGTTTGGGATCCCAGGAGATTGGTTATAGAGAGTTGTTCCCCGTTATTGATGATATTGTTCAGTTTCGGTGGTCTGAGGAGTGTTGTCAG
GCGTTACAGATACCTATTAGCCGGGAGGAAGTTAGGAGGGTCTTATTCTCTATGGATAGTGGAAAGGCTCCTGGTCCTGATGGGTTCTCTGTAGGATTCTTCAAAGGTGC
CTGGTCTGTGGTTGAGGAAGATTTTTGTGATGTTGTTTTGCATTTCTTTGAGACTTGTTATCTTCCAATAGGAGTTAATGCTACTGTTATTACCCTCATCCCTAAGCGTC
GTGGGGCTGAGCAGATGGAGGAATTTCGGCCTATTTCTTGTTGTAATGTGATATACAAATGCATTTCTAAGATTTTGGCTGATAGGCTTCGTGTGTGGCTTCCTTCTTTT
ATCGGTAGTAACCAGTCTGCTTTTATCCCTGGGAGGAGTATTATTGATAACATTCTGCTGTGTCAGGAACTGGTCGAGGGTTATCATCTTAATTCTGGTAAACCTCGATG
TACTTTGAAAGTTGATCTTCAAAAAGCCTATGACTCTGTTAATTGGGATTTTCTGTTTGGATTGTTATTGCTATTAGTACTCCTTTGA
Protein sequence
MELWTEAWLAIVASVVGKPITLDLATKEHSRLSYARVCVKLEGEFNMRAEITVNLRGVDFNVSVNYEWKPQKCNLCCAFGHSGGKCPRSVESKIIQEEVVSKIIPGKGEG
VDSEPCAKVVLESFKQLEEGEIRSSPNRHGSHVEKEVGKRDEFTLVTRKKSELVSVRDRGKSIVVAMPNSFGSLLEVGDVDKWALTVIEGSPPPLQVDEGTVRESNFDYV
SRRFGNSWDYSCSYSNSGVGRIWVVWKKNRFSFSTCVVDEQFVVMGDFNAIRVHFEAFGGSPIQGEMEDFDLATRDADLVEPSVQGNWFTWTSKVHGSGMLRRLDRILVN
DEWLSAWPTLLVNVLPWGISDHSPILFYPSFQQNSRDPSFIEVVARMWGRHEGVSPLVSLMRNLQNLKPTLRRRFGRHIQSLNEEVHIAKENMDRAQREVEHNPMSDVLS
RQVGLATEAFWTAVRLEEASLRQKSRIRWLELGDQNTAFFHRPVRSRMSRNSLLSLVDADGSRVSSHDGVVQLAVNYFRNSLGSQEIGYRELFPVIDDIVQFRWSEECCQ
ALQIPISREEVRRVLFSMDSGKAPGPDGFSVGFFKGAWSVVEEDFCDVVLHFFETCYLPIGVNATVITLIPKRRGAEQMEEFRPISCCNVIYKCISKILADRLRVWLPSF
IGSNQSAFIPGRSIIDNILLCQELVEGYHLNSGKPRCTLKVDLQKAYDSVNWDFLFGLLLLLVLL