; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh17G007710 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh17G007710
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionTransposable element protein
Genome locationCma_Chr17:6193303..6205457
RNA-Seq ExpressionCmaCh17G007710
SyntenyCmaCh17G007710
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016788 - hydrolase activity, acting on ester bonds (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH09562.1 transposable element gene [Prunus dulcis]6.9e-27057.74Show/hide
Query:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P+     + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M KS EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E +GGVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKN
        RR ++Y+L G+T+IG  AV  + + D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGT+IH TKGILDYVHTDVWGP+KN
Subjt:  RRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKN

Query:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE
         S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+TL+E
Subjt:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE

Query:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI
        KVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+  KI
Subjt:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI

Query:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDT
        V SRDVTFDE+ M  Q K E   EA K +   ++V      V P        +   D  E+ E   EE++   E  ++ +SIA  R RR IRKP RF D 
Subjt:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDT

Query:  IAYAFSII-DGVPN
        +AYA  +I D +P+
Subjt:  IAYAFSII-DGVPN

BBH10183.1 transposable element gene [Prunus dulcis]6.9e-27057.74Show/hide
Query:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P+     + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M KS EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E +GGVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKN
        RR ++Y+L G+T+IG  AV  + + D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGT+IH TKGILDYVHTDVWGP+KN
Subjt:  RRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKN

Query:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE
         S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+TL+E
Subjt:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE

Query:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI
        KVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+  KI
Subjt:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI

Query:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDT
        V SRDVTFDE+ M  Q K E   EA K +   ++V      V P        +   D  E+ E   EE++   E  ++ +SIA  R RR IRKP RF D 
Subjt:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDT

Query:  IAYAFSII-DGVPN
        +AYA  +I D +P+
Subjt:  IAYAFSII-DGVPN

BBN69445.1 hypothetical protein Prudu_948S000300 [Prunus dulcis]3.4e-26957.77Show/hide
Query:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P+     + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M KS EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E +GGVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHVAV---ASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGP
        RR ++Y+L G+T+IG  AV   A +   D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGT+IH TKGILDYVHTDVWGP
Subjt:  RRGSIYYLNGTTIIGHVAV---ASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGP

Query:  TKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQT
        +KN S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+T
Subjt:  TKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQT

Query:  LIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPET
        L+EKVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+ 
Subjt:  LIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPET

Query:  SKIVNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARF
         KIV SRDVTFDE+ M  Q K E   EA K +   ++V      V P    I   +   D  E+ E   EE++   E  ++ +SIA  R RR IRKP RF
Subjt:  SKIVNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARF

Query:  DDTIAYAFSII-DGVPN
         D +AYA  +I D +P+
Subjt:  DDTIAYAFSII-DGVPN

BBN69488.1 transposable element gene [Prunus dulcis]9.9e-26957.46Show/hide
Query:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P+     + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M KS EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E + GVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKN
        RR ++Y+L G+T+IG  AV  + + D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGT+IH TKGILDYVHTDVWGP+KN
Subjt:  RRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKN

Query:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE
         S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+TL+E
Subjt:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE

Query:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI
        KVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+  KI
Subjt:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI

Query:  VNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQV-NNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDTIAY
        V SRDVTFDE+ M  Q   N  E   +  K + S   V   + P+  V ++  D     +   EE++   E  ++ +SIA  R RR IRKP RF D +AY
Subjt:  VNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQV-NNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDTIAY

Query:  AFSII-DGVPN
        A  +I D +P+
Subjt:  AFSII-DGVPN

PSS35035.1 Endonuclease [Actinidia chinensis var. chinensis]1.3e-30062.93Show/hide
Query:  QVVSEPSCEPSRRGSRVWSSN--LEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKEL
        ++ S  +      G R   SN   EVEKFDGTNNFG+W+ EV D+L+ QDLD++L E  PEDM+E +W KLNRQACG IR CL KDQKY  MK T AKEL
Subjt:  QVVSEPSCEPSRRGSRVWSSN--LEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKEL

Query:  WDKLEAKYMKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEM
        WDKLE KYM KSVEN+LYLKKKLFRF +K+G SM EHL+ FNKI+ DL NLDV+IDDEDKALLLLNSLP++YE + TTLL+G   I F DVSNAL+NNE+
Subjt:  WDKLEAKYMKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEM

Query:  RKKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSE
        RKK+++ ++ SSSN LTA GRTS  K     + R KSRG+S   R+L K+ECAYC QKGHW+KDC  ++ K  K+N+  D     D A T+SLS S + E
Subjt:  RKKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSE

Query:  WILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAI
        WILDSGCSYHMCP+++ F  F+E +GGVV MGND+ CK MGIG++++KMF G ++ L DVRYVPDLKKNLISLG  D+ GY+IILEGG LKV RGALV +
Subjt:  WILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAI

Query:  TGTRRGSIYYLNGTTIIGHVAVA-SSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWG
         GTRRG++Y+L+G+T+ G VA++ SS E D SKLWHMRLGH GEKALQTLV QGVLKGA TG+I+FCEHC+ GKQ  VKFGT+IHQT+GILDYVH+DVWG
Subjt:  TGTRRGSIYYLNGTTIIGHVAVA-SSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWG

Query:  PTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQ
        P+KN SLGG R+FVTF+DD+S+RVW+Y MRHK+EVL+IFL+WKKM+E QT +KIK+LRSDNGGEY  DPF  VC  EGIVRHFT+ G PQQNGVAERMN+
Subjt:  PTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQ

Query:  TLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPE
        TL++KVRC++  AGLSKAFWAEA++YA HL NRLP +   G+TP+EVWSG   +DYD LH+FGCPAY+HV + KLDPRAKKA F+GFS GVKGYRLWCPE
Subjt:  TLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPE

Query:  TSKIVNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDT
        T KIVNSRDVTFDES M     + N   +   +  + SP V   T + +D  +   D ++      E QS      +  ESIA +RP+RVIR+PAR+ DT
Subjt:  TSKIVNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDT

Query:  IAYAFSIIDGVP
        +AYA  +I+GVP
Subjt:  IAYAFSIIDGVP

TrEMBL top hitse value%identityAlignment
A0A2R6RYD4 Endonuclease6.2e-30162.93Show/hide
Query:  QVVSEPSCEPSRRGSRVWSSN--LEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKEL
        ++ S  +      G R   SN   EVEKFDGTNNFG+W+ EV D+L+ QDLD++L E  PEDM+E +W KLNRQACG IR CL KDQKY  MK T AKEL
Subjt:  QVVSEPSCEPSRRGSRVWSSN--LEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKEL

Query:  WDKLEAKYMKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEM
        WDKLE KYM KSVEN+LYLKKKLFRF +K+G SM EHL+ FNKI+ DL NLDV+IDDEDKALLLLNSLP++YE + TTLL+G   I F DVSNAL+NNE+
Subjt:  WDKLEAKYMKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEM

Query:  RKKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSE
        RKK+++ ++ SSSN LTA GRTS  K     + R KSRG+S   R+L K+ECAYC QKGHW+KDC  ++ K  K+N+  D     D A T+SLS S + E
Subjt:  RKKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSE

Query:  WILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAI
        WILDSGCSYHMCP+++ F  F+E +GGVV MGND+ CK MGIG++++KMF G ++ L DVRYVPDLKKNLISLG  D+ GY+IILEGG LKV RGALV +
Subjt:  WILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAI

Query:  TGTRRGSIYYLNGTTIIGHVAVA-SSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWG
         GTRRG++Y+L+G+T+ G VA++ SS E D SKLWHMRLGH GEKALQTLV QGVLKGA TG+I+FCEHC+ GKQ  VKFGT+IHQT+GILDYVH+DVWG
Subjt:  TGTRRGSIYYLNGTTIIGHVAVA-SSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWG

Query:  PTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQ
        P+KN SLGG R+FVTF+DD+S+RVW+Y MRHK+EVL+IFL+WKKM+E QT +KIK+LRSDNGGEY  DPF  VC  EGIVRHFT+ G PQQNGVAERMN+
Subjt:  PTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQ

Query:  TLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPE
        TL++KVRC++  AGLSKAFWAEA++YA HL NRLP +   G+TP+EVWSG   +DYD LH+FGCPAY+HV + KLDPRAKKA F+GFS GVKGYRLWCPE
Subjt:  TLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPE

Query:  TSKIVNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDT
        T KIVNSRDVTFDES M     + N   +   +  + SP V   T + +D  +   D ++      E QS      +  ESIA +RP+RVIR+PAR+ DT
Subjt:  TSKIVNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDT

Query:  IAYAFSIIDGVP
        +AYA  +I+GVP
Subjt:  IAYAFSIIDGVP

A0A4Y1RZX4 Transposable element protein3.3e-27057.74Show/hide
Query:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P+     + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M KS EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E +GGVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKN
        RR ++Y+L G+T+IG  AV  + + D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGT+IH TKGILDYVHTDVWGP+KN
Subjt:  RRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKN

Query:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE
         S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+TL+E
Subjt:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE

Query:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI
        KVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+  KI
Subjt:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI

Query:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDT
        V SRDVTFDE+ M  Q K E   EA K +   ++V      V P        +   D  E+ E   EE++   E  ++ +SIA  R RR IRKP RF D 
Subjt:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDT

Query:  IAYAFSII-DGVPN
        +AYA  +I D +P+
Subjt:  IAYAFSII-DGVPN

A0A4Y1S2D4 Transposable element protein3.3e-27057.74Show/hide
Query:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P+     + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M KS EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E +GGVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKN
        RR ++Y+L G+T+IG  AV  + + D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGT+IH TKGILDYVHTDVWGP+KN
Subjt:  RRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKN

Query:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE
         S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+TL+E
Subjt:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE

Query:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI
        KVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+  KI
Subjt:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI

Query:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDT
        V SRDVTFDE+ M  Q K E   EA K +   ++V      V P        +   D  E+ E   EE++   E  ++ +SIA  R RR IRKP RF D 
Subjt:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDT

Query:  IAYAFSII-DGVPN
        +AYA  +I D +P+
Subjt:  IAYAFSII-DGVPN

A0A5H2XN74 Transposable element protein4.8e-26957.46Show/hide
Query:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P+     + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M KS EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E + GVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKN
        RR ++Y+L G+T+IG  AV  + + D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGT+IH TKGILDYVHTDVWGP+KN
Subjt:  RRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKN

Query:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE
         S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+TL+E
Subjt:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE

Query:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI
        KVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+  KI
Subjt:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI

Query:  VNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQV-NNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDTIAY
        V SRDVTFDE+ M  Q   N  E   +  K + S   V   + P+  V ++  D     +   EE++   E  ++ +SIA  R RR IRKP RF D +AY
Subjt:  VNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQV-NNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDTIAY

Query:  AFSII-DGVPN
        A  +I D +P+
Subjt:  AFSII-DGVPN

A0A5H2XTW6 Uncharacterized protein1.7e-26957.77Show/hide
Query:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P+     + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M KS EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E +GGVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHVAV---ASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGP
        RR ++Y+L G+T+IG  AV   A +   D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGT+IH TKGILDYVHTDVWGP
Subjt:  RRGSIYYLNGTTIIGHVAV---ASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGP

Query:  TKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQT
        +KN S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+T
Subjt:  TKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQT

Query:  LIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPET
        L+EKVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+ 
Subjt:  LIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPET

Query:  SKIVNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARF
         KIV SRDVTFDE+ M  Q K E   EA K +   ++V      V P    I   +   D  E+ E   EE++   E  ++ +SIA  R RR IRKP RF
Subjt:  SKIVNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARF

Query:  DDTIAYAFSII-DGVPN
         D +AYA  +I D +P+
Subjt:  DDTIAYAFSII-DGVPN

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.9e-6828.04Show/hide
Query:  VEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKSVENKLYLKKKLFR
        ++ FDG   + +W+  +  LL  QD+   +   MP ++ ++ W K  R A   I   L             A+++ + L+A Y +KS+ ++L L+K+L  
Subjt:  VEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKSVENKLYLKKKLFR

Query:  FDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTL-LHGITNIDFEDVSNALINNEMRKKEKETYQSSSSNVLTA--HGRT
              +S+  H   F+++I++LL    KI++ DK   LL +LP  Y+ ++T +      N+    V N L++ E+  K K  +  +S  V+ A  H   
Subjt:  FDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTL-LHGITNIDFEDVSNALINNEMRKKEKETYQSSSSNVLTA--HGRT

Query:  SNWK----KNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSE--------WILDSGCSYH
        + +K    KN   +P+   +G S       K +C +C ++GH KKDC   K   +  N   +    T  +  I+  V + +         ++LDSG S H
Subjt:  SNWK----KNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSE--------WILDSGCSYH

Query:  MCPNKEMFLDFKE----FNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGTRRG
        +  ++ ++ D  E        V   G        GI    +++ +     L+DV +  +   NL+S+  L  +G  I  +   + +++  L+ +  +   
Subjt:  MCPNKEMFLDFKE----FNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGTRRG

Query:  SIYYLNGTTIIGHVAVA-SSKEQDISKLWHMRLGHAGE------KALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQT--KGILDYVHTDV
            LN   +I   A + ++K ++  +LWH R GH  +      K      +Q +L        + CE C+ GKQ  + F     +T  K  L  VH+DV
Subjt:  SIYYLNGTTIIGHVAVA-SSKEQDISKLWHMRLGHAGE------KALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQT--KGILDYVHTDV

Query:  WGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERM
         GP   V+L  K +FV F+D ++     Y +++K++V  +F ++    E   + K+  L  DNG EY  +   + C  +GI  H TVP  PQ NGV+ERM
Subjt:  WGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERM

Query:  NQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSG--NGGRTPLEVWSGSPVSDYDK-LHVFGCPAYYHVTD--SKLDPRAKKAKFMGFSKGVKG
         +T+ EK R ++S A L K+FW EA+  A +L+NR+P     +  +TP E+W       Y K L VFG   Y H+ +   K D ++ K+ F+G+     G
Subjt:  NQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSG--NGGRTPLEVWSGSPVSDYDK-LHVFGCPAYYHVTD--SKLDPRAKKAKFMGFSKGVKG

Query:  YRLWCPETSKIVNSRDVTFDESGM
        ++LW     K + +RDV  DE+ M
Subjt:  YRLWCPETSKIVNSRDVTFDESGM

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-948.8e-14338.77Show/hide
Query:  EVEKFDGTNNFGVWRGEVSDLLVMQDLD--VSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKSVENKLYLKKK
        EV KF+G N F  W+  + DLL+ Q L   + +  + P+ M   +W  L+ +A   IR  L  D     +    A+ +W +LE+ YM K++ NKLYLKK+
Subjt:  EVEKFDGTNNFGVWRGEVSDLLVMQDLD--VSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKSVENKLYLKKK

Query:  LFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETYQSSSSNVLTAHGRT
        L+     EG +   HL+ FN +IT L NL VKI++EDKA+LLLNSLP SY+ + TT+LHG T I+ +DV++AL+ NE  +K+ E    +    L   GR 
Subjt:  LFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETYQSSSSNVLTAHGRT

Query:  SNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDC-SILKEKGSKSNVVRDDDT----DTDDALTISLSVSQ--------TSEWILDSGCSY
         +++++     R  +RG+S N  K     C  C Q GH+K+DC +  K KG  S    DD+T      +D + + ++  +         SEW++D+  S+
Subjt:  SNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDC-SILKEKGSKSNVVRDDDT----DTDDALTISLSVSQ--------TSEWILDSGCSY

Query:  HMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGTRRGSIY
        H  P +++F  +   + G V MGN S  KI GIG + IK   G    LKDVR+VPDL+ NLIS   LD  GY         ++ +G+LV   G  RG++Y
Subjt:  HMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGTRRGSIY

Query:  YLNGTTIIGHVAVASSKEQDIS-KLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKNVSLGG
          N     G +  A   + +IS  LWH R+GH  EK LQ L  + ++  A    +  C++C+FGKQ  V F TS  +   ILD V++DV GP +  S+GG
Subjt:  YLNGTTIIGHVAVASSKEQDIS-KLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKNVSLGG

Query:  KRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIEKVRCI
         ++FVTFIDD S+++W+Y ++ K++V Q+F ++  +VE +T +K+KRLRSDNGGEYT   F + C   GI    TVPG PQ NGVAERMN+T++EKVR +
Subjt:  KRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIEKVRCI

Query:  LSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTD---SKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVN
        L  A L K+FW EA+  A +L+NR P        P  VW+   VS Y  L VFGC A+ HV     +KLD ++    F+G+     GYRLW P   K++ 
Subjt:  LSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTD---SKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVN

Query:  SRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAP--TEEPIDQVNNNFDVLEQKEQS---LEEQSLVNERVEEPE
        SRDV F ES     ++    +  ++V+  +    V  P  +  P    +   +V EQ EQ    +E+   ++E VEE E
Subjt:  SRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAP--TEEPIDQVNNNFDVLEQKEQS---LEEQSLVNERVEEPE

Q12501 Transposon Ty2-OR2 Gag-Pol polyprotein7.2e-2822.22Show/hide
Query:  INNEMRKKEKETYQS---SSSNVLTAHGRTSNWKKNECGEPRLKSR---GRSDNWRKLDKNECAYCRQKGHWKKDCSIL----KEKGSKSNVVRDDDTDT
        +N   + K+   Y++   +S N       T N+ +    +PR         S  + +++ +          +  D + L    ++K SK     D + + 
Subjt:  INNEMRKKEKETYQS---SSSNVLTAHGRTSNWKKNECGEPRLKSR---GRSDNWRKLDKNECAYCRQKGHWKKDCSIL----KEKGSKSNVVRDDDTDT

Query:  DDALTISLSVSQTSEWILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIIL
         D L I    SQT   +  +   +H  PN E+           +         I  IG++     +G    +K + + P++  +L+SL  L         
Subjt:  DDALTISLSVSQTSEWILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIIL

Query:  EGGNLKVARGALVAITGTRRGSIYYLNGTTII-GHVAVAS----SKEQDISK----LWHMRLGHAGEKALQTLVNQGVLKGATTGRIDF-------CEHC
            L+ + G ++A    + G  Y+L+   +I  H++  +    +K + ++K    L H  LGHA  +++Q  + +  +       I++       C  C
Subjt:  EGGNLKVARGALVAITGTRRGSIYYLNGTTII-GHVAVAS----SKEQDISK----LWHMRLGHAGEKALQTLVNQGVLKGATTGRIDF-------CEHC

Query:  IFG---KQKNVKFGTSIHQ-TKGILDYVHTDVWGPTKNVSLGGKRWFVTFIDDYSKRVWMYPM--RHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGE
        + G   K ++VK     +Q +     Y+HTD++GP  ++      +F++F D+ ++  W+YP+  R +  +L +F      ++NQ + ++  ++ D G E
Subjt:  IFG---KQKNVKFGTSIHQ-TKGILDYVHTDVWGPTKNVSLGGKRWFVTFIDDYSKRVWMYPM--RHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGE

Query:  YTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGC
        YT     K   + GI   +T     + +GVAER+N+TL+   R +L  +GL    W  A+ ++  + N L VS    ++  +  +G    D   +  FG 
Subjt:  YTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGC

Query:  PAYY--HVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVNSRDVTFDESGMFLQKIENNDEALKQ--VEKVVFSPDV------------VAPTEEP
        P     H  DSK+ PR      +  S+   GY ++ P   K V++ +            ++NN   L Q   + + F  D+               TE+ 
Subjt:  PAYY--HVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVNSRDVTFDESGMFLQKIENNDEALKQ--VEKVVFSPDV------------VAPTEEP

Query:  IDQVNNNFDVLEQKEQSLEEQSLVNERVEE---PESIAKNRPRRVIRKPARFDDTIA
         DQ N   D   Q E  +    LVN+   +   P  + K  P + +R P   D  I+
Subjt:  IDQVNNNFDVLEQKEQSLEEQSLVNERVEE---PESIAKNRPRRVIRKPARFDDTIA

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.9e-5224.41Show/hide
Query:  WTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNK-IIT---DLLNLDVKIDDEDKALL
        W + ++     +   +    +    + T A ++W+ L   Y   S  +   L+ +L     K+     + +DD+ + ++T    L  L   +D +++   
Subjt:  WTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNK-IIT---DLLNLDVKIDDEDKALL

Query:  LLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMR----------------KKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGR---SDNW
        +L +LPE Y+ V+  +    T     ++   L+N+E +                   + T  ++++N    + R  N   N   +P  +S      ++N 
Subjt:  LLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMR----------------KKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGR---SDNW

Query:  RKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVS---QTSEWILDSGCSYHMCPNKEMFLDFKEFNGG-VVYMGNDSTCKIM
         K    +C  C  +GH  K CS L+   S  N  +     T      +L++     ++ W+LDSG ++H+  +       + + GG  V + + ST  I 
Subjt:  RKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVS---QTSEWILDSGCSYHMCPNKEMFLDFKEFNGG-VVYMGNDSTCKIM

Query:  GIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISL-GVLDASGYRIILEGGNLKV--ARGALVAITGTRRGSIYY--LNGTTIIGHVAVASSKEQDISKLWH
          GS  +      +  L ++ YVP++ KNLIS+  + +A+G  +     + +V      +  + G  +  +Y   +  +  +   A  SSK    S  WH
Subjt:  GIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISL-GVLDASGYRIILEGGNLKV--ARGALVAITGTRRGSIYY--LNGTTIIGHVAVASSKEQDISKLWH

Query:  MRLGHAGEKALQTLVNQGVLKGAT-TGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEV
         RLGH     L ++++   L     + +   C  C+  K   V F  S   +   L+Y+++DVW  +  +S    R++V F+D +++  W+YP++ K++V
Subjt:  MRLGHAGEKALQTLVNQGVLKGAT-TGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEV

Query:  LQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLP
         + F+ +K ++EN+   +I    SDNGGE+      +     GI    + P  P+ NG++ER ++ ++E    +LS A + K +W  A + AV+L+NRLP
Subjt:  LQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLP

Query:  VSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHV---TDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVNSRDVTFDES----GMFLQKIENNDEA
               +P +   G+   +YDKL VFGC  Y  +      KLD ++++  F+G+S     Y     +TS++  SR V FDE+      +L  +    E 
Subjt:  VSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHV---TDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVNSRDVTFDES----GMFLQKIENNDEA

Query:  LKQVEKVVFSPDVVAPTEEPI
         ++    V+SP    PT  P+
Subjt:  LKQVEKVVFSPDVVAPTEEPI

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE29.3e-5225.81Show/hide
Query:  WTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNS
        W + ++     I   +    +    + T A ++W+ L   Y   S  +   L + + RFD                    L  L   +D +++   +L +
Subjt:  WTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNS

Query:  LPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETYQ--SSSSNVLTAHGRTSNWKKNECGEPR--LKSRGRSDNWR-------------KLDKN
        LP+ Y+ V+  +    T     ++   LIN E +     + +    ++NV+T     +N  +N  G+ R    +  RS++W+             K    
Subjt:  LPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETYQ--SSSSNVLTAHGRTSNWKKNECGEPR--LKSRGRSDNWR-------------KLDKN

Query:  ECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVS---QTSEWILDSGCSYHMCPNKEMFLDFKEFNGG-VVYMGNDSTCKIMGIGSVQ
         C  C  +GH  K C  L +  S +N  +     T      +L+V+     + W+LDSG ++H+  +       + + GG  V + + ST  I   GS  
Subjt:  ECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVS---QTSEWILDSGCSYHMCPNKEMFLDFKEFNGG-VVYMGNDSTCKIMGIGSVQ

Query:  IKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILE----GGNLKVARGALVAITGTRRGSIYY--LNGTTIIGHVAVASSKEQDISKLWHMRLGH
        +      +  L  V YVP++ KNLIS+  L  +  R+ +E       +K     +  + G  +  +Y   +  +  +   A   SK    S  WH RLGH
Subjt:  IKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILE----GGNLKVARGALVAITGTRRGSIYY--LNGTTIIGHVAVASSKEQDISKLWHMRLGH

Query:  AGEKALQTLVNQGVLKGAT-TGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFL
             L ++++   L     + ++  C  C   K   V F  S   +   L+Y+++DVW  +  +S+   R++V F+D +++  W+YP++ K++V   F+
Subjt:  AGEKALQTLVNQGVLKGAT-TGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDYVHTDVWGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFL

Query:  EWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFT-VPGKPQQNGVAERMNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGN
         +K +VEN+   +I  L SDNGGE+     L+    +  + HFT  P  P+ NG++ER ++ ++E    +LS A + K +W  A S AV+L+NRLP    
Subjt:  EWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFT-VPGKPQQNGVAERMNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGN

Query:  GGRTPLEVWSGSPVSDYDKLHVFGCPAYYHV---TDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVNSRDVTFDE
          ++P +   G P  +Y+KL VFGC  Y  +      KL+ ++K+  FMG+S     Y      T ++  SR V FDE
Subjt:  GGRTPLEVWSGSPVSDYDKLHVFGCPAYYHV---TDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVNSRDVTFDE

Arabidopsis top hitse value%identityAlignment
AT1G15215.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors9.0e-1058.82Show/hide
Query:  ATDLSRLAFKAFSSIDHAWYDIASFLSYRVNCHGELEARVRYTGFGKDEDE
        A+DL+ LAF+A S+ D+AWYD++SFL+YRV   GELE RVR++GF    DE
Subjt:  ATDLSRLAFKAFSSIDHAWYDIASFLSYRVNCHGELEARVRYTGFGKDEDE

AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors9.0e-1058.82Show/hide
Query:  ATDLSRLAFKAFSSIDHAWYDIASFLSYRVNCHGELEARVRYTGFGKDEDE
        A+DL+ LAF+A S+ D+AWYD++SFL+YRV   GELE RVR++GF    DE
Subjt:  ATDLSRLAFKAFSSIDHAWYDIASFLSYRVNCHGELEARVRYTGFGKDEDE

AT1G15215.2 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors1.8e-0242Show/hide
Query:  YTTDDDLTEIVEIDNILKDSEDRTLGQVFFQDFALHFSCSPRRARKSPVT
        Y T+  L+EIV+++N+ K+  D++L + F Q  A  FSCS  R  KS +T
Subjt:  YTTDDDLTEIVEIDNILKDSEDRTLGQVFFQDFALHFSCSPRRARKSPVT

AT1G15215.3 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors9.0e-1058.82Show/hide
Query:  ATDLSRLAFKAFSSIDHAWYDIASFLSYRVNCHGELEARVRYTGFGKDEDE
        A+DL+ LAF+A S+ D+AWYD++SFL+YRV   GELE RVR++GF    DE
Subjt:  ATDLSRLAFKAFSSIDHAWYDIASFLSYRVNCHGELEARVRYTGFGKDEDE

AT1G15215.3 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors1.8e-0242Show/hide
Query:  YTTDDDLTEIVEIDNILKDSEDRTLGQVFFQDFALHFSCSPRRARKSPVT
        Y T+  L+EIV+++N+ K+  D++L + F Q  A  FSCS  R  KS +T
Subjt:  YTTDDDLTEIVEIDNILKDSEDRTLGQVFFQDFALHFSCSPRRARKSPVT

ATMG00300.1 Gag-Pol-related retrotransposon family protein4.3e-2040.18Show/hide
Query:  GNLKVARGALVAITGTRRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTK
        G LKV +G    + G R  S+Y L G+   G   +A +  +D ++LWH RL H  ++ ++ LV +G L  +    + FCE CI+GK   V F T  H TK
Subjt:  GNLKVARGALVAITGTRRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTK

Query:  GILDYVHTDVWG
          LDYVH+D+WG
Subjt:  GILDYVHTDVWG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.5e-1547.19Show/hide
Query:  MNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMG
        MN+T+IEKVR +L + GL K F A+A + AVH++N+ P +      P EVW  S V  Y  L  FGC AY H  + KL PRAKK +  G
Subjt:  MNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTTGCAGAAAAAATGTGCGCAATCCATGCACCCAATATGTGACGCTTACAATTACTTATACAACTGATGATGATTTGACTGAGATTGTGGAGATTGATAATATCTT
AAAGGACTCTGAAGATCGAACACTTGGTCAAGTGTTTTTCCAGGATTTCGCACTTCATTTCAGTTGCTCCCCTAGGCGCGCTAGAAAATCTCCCGTCACTGCGGTACAGG
GGAACTTCTCTCCGAACCACGTAAATATCTTAGTGCCTCTTTGTGTAAAATCGTTCATTGGTGTTGTGAGTACCTTTGTTCCGCTTTCGGCGCACAACAAGTGGTATCAG
AGCCAAGTTGTGGTATTTTGGGGTATTGTGAAAGCTTGGGCCTCATCGACCATTAGACAACCATCACGACGTGGATCGAGAGTGTGGAGTTCCAATCTCGAAGTAGAAAA
ATTTGATGGAACCAATAATTTTGGTGTTTGGCGAGGCGAAGTTAGCGATTTATTGGTCATGCAAGACCTCGATGTCTCTCTCCAAGAGGAAATGCCTGAAGACATGACAG
AAGCAGAATGGACGAAATTAAATCGGCAAGCCTGTGGAATTATTCGATCCTGTCTTGGCAAGGATCAGAAATATCCATTTATGAAGGTTACCATGGCGAAAGAATTATGG
GACAAGCTTGAAGCGAAATATATGAAGAAGAGTGTGGAAAATAAACTCTATTTGAAGAAGAAACTCTTCCGTTTTGATTACAAGGAAGGCATCTCAATGGCTGAGCATTT
GGATGATTTTAACAAAATCATCACTGATTTGCTCAATCTTGATGTTAAGATTGATGACGAGGACAAGGCGTTGTTGTTATTAAACTCCTTGCCGGAATCCTACGAGTTTG
TAGTAACGACCCTACTACATGGAATCACTAATATTGATTTTGAAGATGTTTCAAATGCCTTAATAAACAATGAGATGCGAAAGAAGGAAAAGGAGACATATCAAAGCTCT
AGCTCAAATGTTCTCACTGCTCATGGAAGGACTTCCAATTGGAAAAAGAACGAATGTGGAGAGCCTCGACTAAAGTCGAGAGGGGAACTTCTCTCCGAACCACGTAAATA
TCTTAGTGCCTCTTTGTGTAAAATCGTTCATTGGTGTTGTGAGTACCTTTGTTCCGCTTTCGGCGCACAACAAGTGGTATCAGAGCCAAGTTGTGAACCATCACGACGTG
GATCGAGAGTGTGGAGTTCCAATCTCGAAGTAGAAAAATTTGATGGAACCAATAATTTTGGTGTTTGGCGAGGCGAAGTTAGCGATTTATTGGTCATGCAAGACCTCGAT
GTCTCTCTCCAAGAGGAAATGCCTGAAGACATGACAGAAGCAGAATGGACGAAATTAAATCGGCAAGCCTGTGGAATTATTCGATCCTGTCTTGGCAAGGATCAGAAATA
TCCATTTATGAAGGTTACCATGGCGAAAGAATTATGGGACAAGCTTGAAGCGAAATATATGAAGAAGAGTGTGGAAAATAAACTCTATTTGAAGAAGAAACTCTTCCGTT
TTGATTACAAGGAAGGCATCTCAATGGCTGAGCATTTGGATGATTTTAACAAAATCATCACTGATTTGCTCAATCTTGATGTTAAGATTGATGACGAGGACAAGGCGTTG
TTGTTATTAAACTCCTTGCCGGAATCCTACGAGTTTGTAGTAACGACCCTACTACATGGAATCACTAATATTGATTTTGAAGATGTTTCAAATGCCTTAATAAACAATGA
GATGCGAAAGAAGGAAAAGGAGACATATCAAAGCTCTAGCTCAAATGTTCTCACTGCTCATGGAAGGACTTCCAATTGGAAAAAGAACGAATGTGGAGAGCCTCGACTAA
AGTCGAGAGGTAGATCTGATAATTGGAGAAAACTTGATAAAAATGAGTGTGCATATTGTCGACAGAAAGGCCATTGGAAAAAAGATTGTTCGATCTTAAAAGAGAAGGGG
TCGAAGTCCAATGTGGTTAGAGATGACGACACTGATACAGATGATGCCTTGACAATCTCCCTATCAGTCAGCCAAACCAGTGAATGGATTCTTGATTCTGGGTGCTCCTA
TCATATGTGTCCCAACAAAGAGATGTTTTTGGACTTCAAGGAGTTCAATGGTGGAGTTGTCTATATGGGAAATGATAGTACTTGTAAGATAATGGGAATCGGCTCAGTAC
AAATCAAGATGTTTGACGGGGTTATACGAAAACTCAAAGATGTGAGGTATGTCCCTGACTTGAAGAAAAATTTGATTTCACTTGGCGTTTTGGATGCGAGTGGTTATCGC
ATCATTTTGGAAGGAGGGAATTTGAAAGTAGCTCGTGGAGCTTTGGTGGCAATTACAGGAACTAGAAGAGGTAGCATATACTACCTCAACGGAACCACAATAATTGGGCA
TGTTGCTGTGGCAAGTTCGAAAGAACAAGACATATCGAAATTATGGCACATGCGACTCGGGCATGCTGGTGAGAAGGCGCTTCAGACACTGGTGAACCAAGGAGTTCTTA
AGGGTGCCACAACTGGAAGAATTGATTTCTGTGAACATTGTATATTTGGCAAACAAAAAAATGTGAAATTTGGTACATCTATACACCAAACAAAGGGCATTTTGGATTAT
GTTCACACTGATGTATGGGGTCCCACAAAGAATGTCTCATTGGGAGGAAAGAGGTGGTTCGTCACCTTTATTGATGACTACTCGAAGAGAGTTTGGATGTATCCCATGAG
GCACAAAAATGAGGTTCTCCAAATCTTCCTAGAGTGGAAGAAAATGGTAGAGAATCAGACGGACAAGAAAATCAAAAGGCTGAGATCAGACAACGGGGGAGAATATACTT
ATGATCCATTCCTTAAAGTATGCCGAGATGAAGGGATCGTTCGACACTTCACTGTTCCTGGTAAGCCACAACAAAATGGAGTTGCTGAGAGAATGAACCAGACATTAATA
GAGAAGGTTCGATGCATATTGTCTCAAGCAGGATTGAGTAAGGCATTTTGGGCTGAGGCTCTCAGTTATGCAGTTCACTTGGTGAATCGTTTACCTGTTTCTGGAAACGG
TGGGAGAACTCCGCTTGAGGTATGGTCAGGTAGTCCTGTTAGTGATTATGATAAATTACATGTGTTTGGGTGTCCTGCTTATTATCATGTGACAGACTCAAAGCTGGATC
CTAGAGCAAAAAAAGCCAAGTTTATGGGCTTTAGCAAAGGTGTGAAGGGCTATAGGTTATGGTGTCCAGAAACAAGTAAGATTGTTAATAGTCGAGATGTGACATTCGAT
GAGTCTGGAATGTTTTTGCAGAAAATTGAAAATAATGACGAGGCATTGAAGCAGGTGGAGAAGGTGGTGTTCTCTCCTGATGTGGTTGCTCCTACTGAAGAACCTATTGA
CCAGGTAAATAATAACTTTGATGTCTTAGAACAGAAAGAGCAAAGTCTGGAGGAGCAAAGCCTTGTAAATGAAAGAGTGGAGGAACCTGAGTCTATCGCCAAGAATAGAC
CACGAAGGGTAATTCGAAAACCTGCAAGGTTTGATGATACGATAGCATATGCTTTCTCTATAATTGATGGAGTTCCCAACTTACGCAATGCAACTGATCTTTCAAGATTA
GCATTCAAAGCCTTTTCGTCAATAGACCATGCATGGTATGACATTGCTTCGTTCCTCAGTTACCGAGTTAATTGCCATGGAGAACTAGAAGCACGAGTTCGATATACTGG
CTTCGGAAAGGATGAGGATGAGTGTGAAAGTTGGAGATCTTGTGTTATGTTACCGGGTACATTTATGTTACACAGAAAAAGAAAAAAGAAACCAACGAGAACTGGAGAGA
GGAACCAAACAGAAAAGAAAAAAAAAAAGTGCAAAAAATGCCAAAAGGCGAAACCTAAACCAGATTCTATAAACCAACCTACCACGGGGAGGAATATAAGTACTATAATT
GTCTGGTTACCAGGAAAGCTTCATATCGAACGGAAAGACATGATCATGCGCTCTACCTCAATGCATACGTTCTTGAGATTCTGCCTTATTTGCTTAGGAAAAAGTAAATT
TGTGGAGATTGTTCTGCTGGGATTTAGAGTACAATGTAAATGGAATCCCGAACCTTCATTCCTCAGAAGGTACATTTACAAGCTGTTGCTTGGGATTTACCGTAAAAAAT
TGGGTAACTGTCAGAGCTACAAACTGAACCGAAAATACCTCATTCTGTTAGTAATTATTATGCAACCTGTCAGCGCACAGAAGGATATATGTTCTCGTTTAGGAAAAGGG
TGA
mRNA sequenceShow/hide mRNA sequence
ATGCGTTGCAGAAAAAATGTGCGCAATCCATGCACCCAATATGTGACGCTTACAATTACTTATACAACTGATGATGATTTGACTGAGATTGTGGAGATTGATAATATCTT
AAAGGACTCTGAAGATCGAACACTTGGTCAAGTGTTTTTCCAGGATTTCGCACTTCATTTCAGTTGCTCCCCTAGGCGCGCTAGAAAATCTCCCGTCACTGCGGTACAGG
GGAACTTCTCTCCGAACCACGTAAATATCTTAGTGCCTCTTTGTGTAAAATCGTTCATTGGTGTTGTGAGTACCTTTGTTCCGCTTTCGGCGCACAACAAGTGGTATCAG
AGCCAAGTTGTGGTATTTTGGGGTATTGTGAAAGCTTGGGCCTCATCGACCATTAGACAACCATCACGACGTGGATCGAGAGTGTGGAGTTCCAATCTCGAAGTAGAAAA
ATTTGATGGAACCAATAATTTTGGTGTTTGGCGAGGCGAAGTTAGCGATTTATTGGTCATGCAAGACCTCGATGTCTCTCTCCAAGAGGAAATGCCTGAAGACATGACAG
AAGCAGAATGGACGAAATTAAATCGGCAAGCCTGTGGAATTATTCGATCCTGTCTTGGCAAGGATCAGAAATATCCATTTATGAAGGTTACCATGGCGAAAGAATTATGG
GACAAGCTTGAAGCGAAATATATGAAGAAGAGTGTGGAAAATAAACTCTATTTGAAGAAGAAACTCTTCCGTTTTGATTACAAGGAAGGCATCTCAATGGCTGAGCATTT
GGATGATTTTAACAAAATCATCACTGATTTGCTCAATCTTGATGTTAAGATTGATGACGAGGACAAGGCGTTGTTGTTATTAAACTCCTTGCCGGAATCCTACGAGTTTG
TAGTAACGACCCTACTACATGGAATCACTAATATTGATTTTGAAGATGTTTCAAATGCCTTAATAAACAATGAGATGCGAAAGAAGGAAAAGGAGACATATCAAAGCTCT
AGCTCAAATGTTCTCACTGCTCATGGAAGGACTTCCAATTGGAAAAAGAACGAATGTGGAGAGCCTCGACTAAAGTCGAGAGGGGAACTTCTCTCCGAACCACGTAAATA
TCTTAGTGCCTCTTTGTGTAAAATCGTTCATTGGTGTTGTGAGTACCTTTGTTCCGCTTTCGGCGCACAACAAGTGGTATCAGAGCCAAGTTGTGAACCATCACGACGTG
GATCGAGAGTGTGGAGTTCCAATCTCGAAGTAGAAAAATTTGATGGAACCAATAATTTTGGTGTTTGGCGAGGCGAAGTTAGCGATTTATTGGTCATGCAAGACCTCGAT
GTCTCTCTCCAAGAGGAAATGCCTGAAGACATGACAGAAGCAGAATGGACGAAATTAAATCGGCAAGCCTGTGGAATTATTCGATCCTGTCTTGGCAAGGATCAGAAATA
TCCATTTATGAAGGTTACCATGGCGAAAGAATTATGGGACAAGCTTGAAGCGAAATATATGAAGAAGAGTGTGGAAAATAAACTCTATTTGAAGAAGAAACTCTTCCGTT
TTGATTACAAGGAAGGCATCTCAATGGCTGAGCATTTGGATGATTTTAACAAAATCATCACTGATTTGCTCAATCTTGATGTTAAGATTGATGACGAGGACAAGGCGTTG
TTGTTATTAAACTCCTTGCCGGAATCCTACGAGTTTGTAGTAACGACCCTACTACATGGAATCACTAATATTGATTTTGAAGATGTTTCAAATGCCTTAATAAACAATGA
GATGCGAAAGAAGGAAAAGGAGACATATCAAAGCTCTAGCTCAAATGTTCTCACTGCTCATGGAAGGACTTCCAATTGGAAAAAGAACGAATGTGGAGAGCCTCGACTAA
AGTCGAGAGGTAGATCTGATAATTGGAGAAAACTTGATAAAAATGAGTGTGCATATTGTCGACAGAAAGGCCATTGGAAAAAAGATTGTTCGATCTTAAAAGAGAAGGGG
TCGAAGTCCAATGTGGTTAGAGATGACGACACTGATACAGATGATGCCTTGACAATCTCCCTATCAGTCAGCCAAACCAGTGAATGGATTCTTGATTCTGGGTGCTCCTA
TCATATGTGTCCCAACAAAGAGATGTTTTTGGACTTCAAGGAGTTCAATGGTGGAGTTGTCTATATGGGAAATGATAGTACTTGTAAGATAATGGGAATCGGCTCAGTAC
AAATCAAGATGTTTGACGGGGTTATACGAAAACTCAAAGATGTGAGGTATGTCCCTGACTTGAAGAAAAATTTGATTTCACTTGGCGTTTTGGATGCGAGTGGTTATCGC
ATCATTTTGGAAGGAGGGAATTTGAAAGTAGCTCGTGGAGCTTTGGTGGCAATTACAGGAACTAGAAGAGGTAGCATATACTACCTCAACGGAACCACAATAATTGGGCA
TGTTGCTGTGGCAAGTTCGAAAGAACAAGACATATCGAAATTATGGCACATGCGACTCGGGCATGCTGGTGAGAAGGCGCTTCAGACACTGGTGAACCAAGGAGTTCTTA
AGGGTGCCACAACTGGAAGAATTGATTTCTGTGAACATTGTATATTTGGCAAACAAAAAAATGTGAAATTTGGTACATCTATACACCAAACAAAGGGCATTTTGGATTAT
GTTCACACTGATGTATGGGGTCCCACAAAGAATGTCTCATTGGGAGGAAAGAGGTGGTTCGTCACCTTTATTGATGACTACTCGAAGAGAGTTTGGATGTATCCCATGAG
GCACAAAAATGAGGTTCTCCAAATCTTCCTAGAGTGGAAGAAAATGGTAGAGAATCAGACGGACAAGAAAATCAAAAGGCTGAGATCAGACAACGGGGGAGAATATACTT
ATGATCCATTCCTTAAAGTATGCCGAGATGAAGGGATCGTTCGACACTTCACTGTTCCTGGTAAGCCACAACAAAATGGAGTTGCTGAGAGAATGAACCAGACATTAATA
GAGAAGGTTCGATGCATATTGTCTCAAGCAGGATTGAGTAAGGCATTTTGGGCTGAGGCTCTCAGTTATGCAGTTCACTTGGTGAATCGTTTACCTGTTTCTGGAAACGG
TGGGAGAACTCCGCTTGAGGTATGGTCAGGTAGTCCTGTTAGTGATTATGATAAATTACATGTGTTTGGGTGTCCTGCTTATTATCATGTGACAGACTCAAAGCTGGATC
CTAGAGCAAAAAAAGCCAAGTTTATGGGCTTTAGCAAAGGTGTGAAGGGCTATAGGTTATGGTGTCCAGAAACAAGTAAGATTGTTAATAGTCGAGATGTGACATTCGAT
GAGTCTGGAATGTTTTTGCAGAAAATTGAAAATAATGACGAGGCATTGAAGCAGGTGGAGAAGGTGGTGTTCTCTCCTGATGTGGTTGCTCCTACTGAAGAACCTATTGA
CCAGGTAAATAATAACTTTGATGTCTTAGAACAGAAAGAGCAAAGTCTGGAGGAGCAAAGCCTTGTAAATGAAAGAGTGGAGGAACCTGAGTCTATCGCCAAGAATAGAC
CACGAAGGGTAATTCGAAAACCTGCAAGGTTTGATGATACGATAGCATATGCTTTCTCTATAATTGATGGAGTTCCCAACTTACGCAATGCAACTGATCTTTCAAGATTA
GCATTCAAAGCCTTTTCGTCAATAGACCATGCATGGTATGACATTGCTTCGTTCCTCAGTTACCGAGTTAATTGCCATGGAGAACTAGAAGCACGAGTTCGATATACTGG
CTTCGGAAAGGATGAGGATGAGTGTGAAAGTTGGAGATCTTGTGTTATGTTACCGGGTACATTTATGTTACACAGAAAAAGAAAAAAGAAACCAACGAGAACTGGAGAGA
GGAACCAAACAGAAAAGAAAAAAAAAAAGTGCAAAAAATGCCAAAAGGCGAAACCTAAACCAGATTCTATAAACCAACCTACCACGGGGAGGAATATAAGTACTATAATT
GTCTGGTTACCAGGAAAGCTTCATATCGAACGGAAAGACATGATCATGCGCTCTACCTCAATGCATACGTTCTTGAGATTCTGCCTTATTTGCTTAGGAAAAAGTAAATT
TGTGGAGATTGTTCTGCTGGGATTTAGAGTACAATGTAAATGGAATCCCGAACCTTCATTCCTCAGAAGGTACATTTACAAGCTGTTGCTTGGGATTTACCGTAAAAAAT
TGGGTAACTGTCAGAGCTACAAACTGAACCGAAAATACCTCATTCTGTTAGTAATTATTATGCAACCTGTCAGCGCACAGAAGGATATATGTTCTCGTTTAGGAAAAGGG
TGATCCAGCCATTAAAACTGCTTGGAGCCAGGGGGCACTTGAAAAGTGGTTCAATTTTTCAACTTAGACCCTCAAATTGAAATGCTGGGGATCAAACTTCATAGAGGCTT
TCGCCTTGGCACGCTGAGAACGAGGGCTGCTTGGTCACTGGTTGCTGCAGTTGCAGTGATTCGATACCCACAGTCCCATCGGCGATGAATACCTTCACTAGGGTATAAGA
AATCCAACTCGACCACCTGAAGTTTAAGAAAAACGGTCAGGCATGCCCATACGAACCTTCTTCTTTGGTTGTTCATCGTCTTCCTCCTCCAAAGTCAATCTGCCTCGCTT
ATGCCCCACTTGACAAATGAGCTTCAGAAGATGGAAATTCAGTAATAAGATGTCAAACCTCGAAGTTAACTAATTGGGGCTAATTGGGGCGGGGGTCACAAATTTCTATC
TGAGCATCATACCTTTTGAGCACCATCTGTATTTAT
Protein sequenceShow/hide protein sequence
MRCRKNVRNPCTQYVTLTITYTTDDDLTEIVEIDNILKDSEDRTLGQVFFQDFALHFSCSPRRARKSPVTAVQGNFSPNHVNILVPLCVKSFIGVVSTFVPLSAHNKWYQ
SQVVVFWGIVKAWASSTIRQPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELW
DKLEAKYMKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETYQSS
SSNVLTAHGRTSNWKKNECGEPRLKSRGELLSEPRKYLSASLCKIVHWCCEYLCSAFGAQQVVSEPSCEPSRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLD
VSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKSVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKAL
LLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKG
SKSNVVRDDDTDTDDALTISLSVSQTSEWILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYR
IILEGGNLKVARGALVAITGTRRGSIYYLNGTTIIGHVAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTSIHQTKGILDY
VHTDVWGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLI
EKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVNSRDVTFD
ESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARFDDTIAYAFSIIDGVPNLRNATDLSRL
AFKAFSSIDHAWYDIASFLSYRVNCHGELEARVRYTGFGKDEDECESWRSCVMLPGTFMLHRKRKKKPTRTGERNQTEKKKKKCKKCQKAKPKPDSINQPTTGRNISTII
VWLPGKLHIERKDMIMRSTSMHTFLRFCLICLGKSKFVEIVLLGFRVQCKWNPEPSFLRRYIYKLLLGIYRKKLGNCQSYKLNRKYLILLVIIMQPVSAQKDICSRLGKG