; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh10G005960 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh10G005960
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionTransposable element protein
Genome locationCma_Chr10:2748129..2756919
RNA-Seq ExpressionCmaCh10G005960
SyntenyCmaCh10G005960
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016740 - transferase activity (molecular function)
GO:0016788 - hydrolase activity, acting on ester bonds (molecular function)
InterPro domainsIPR000086 - NUDIX hydrolase domain
IPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR015797 - NUDIX hydrolase-like domain superfamily
IPR025724 - GAG-pre-integrase domain
IPR031804 - Domain of unknown function DUF4743
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BBH09562.1 transposable element gene [Prunus dulcis]1.5e-26558.09Show/hide
Query:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P      + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M K  EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E +GGVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKN
        RR ++Y+L G+T+IG AAV  + + D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGTAIH TKGILDYVHTDVWGP+KN
Subjt:  RRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKN

Query:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE
         S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+TL+E
Subjt:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE

Query:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI
        KVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+  KI
Subjt:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI

Query:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY
        V SRDVTFDE+ M  Q K E   EA K +   ++V      V P        +   D  E+ E   EE++   E  ++ +SIA  R RR IRKP R+
Subjt:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY

BBH10183.1 transposable element gene [Prunus dulcis]1.5e-26558.09Show/hide
Query:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P      + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M K  EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E +GGVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKN
        RR ++Y+L G+T+IG AAV  + + D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGTAIH TKGILDYVHTDVWGP+KN
Subjt:  RRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKN

Query:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE
         S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+TL+E
Subjt:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE

Query:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI
        KVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+  KI
Subjt:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI

Query:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY
        V SRDVTFDE+ M  Q K E   EA K +   ++V      V P        +   D  E+ E   EE++   E  ++ +SIA  R RR IRKP R+
Subjt:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY

BBN69445.1 hypothetical protein Prudu_948S000300 [Prunus dulcis]7.3e-26558.13Show/hide
Query:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P      + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M K  EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E +GGVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHAAV---ASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGP
        RR ++Y+L G+T+IG AAV   A +   D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGTAIH TKGILDYVHTDVWGP
Subjt:  RRGSIYYLNGTTIIGHAAV---ASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGP

Query:  TKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQT
        +KN S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+T
Subjt:  TKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQT

Query:  LIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPET
        L+EKVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+ 
Subjt:  LIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPET

Query:  SKIVNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY
         KIV SRDVTFDE+ M  Q K E   EA K +   ++V      V P    I   +   D  E+ E   EE++   E  ++ +SIA  R RR IRKP R+
Subjt:  SKIVNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY

BBN69488.1 transposable element gene [Prunus dulcis]2.1e-26457.81Show/hide
Query:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P      + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M K  EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E + GVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKN
        RR ++Y+L G+T+IG AAV  + + D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGTAIH TKGILDYVHTDVWGP+KN
Subjt:  RRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKN

Query:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE
         S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+TL+E
Subjt:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE

Query:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI
        KVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+  KI
Subjt:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI

Query:  VNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQV-NNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY
        V SRDVTFDE+ M  Q   N  E   +  K + S   V   + P+  V ++  D     +   EE++   E  ++ +SIA  R RR IRKP R+
Subjt:  VNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQV-NNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY

PSS35035.1 Endonuclease [Actinidia chinensis var. chinensis]7.5e-29462.99Show/hide
Query:  QVVSEPSCEPXRRGSRVWSSN--LEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKEL
        ++ S  +      G R   SN   EVEKFDGTNNFG+W+ EV D+L+ QDLD++L E  PEDM+E +W KLNRQACG IR CL KDQKY  MK T AKEL
Subjt:  QVVSEPSCEPXRRGSRVWSSN--LEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKEL

Query:  WDKLEAKYMKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEM
        WDKLE KYM K VEN+LYLKKKLFRF +K+G SM EHL+ FNKI+ DL NLDV+IDDEDKALLLLNSLP++YE + TTLL+G   I F DVSNAL+NNE+
Subjt:  WDKLEAKYMKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEM

Query:  RKKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSE
        RKK+++ ++ SSSN LTA GRTS  K     + R KSRG+S   R+L K+ECAYC QKGHW+KDC  ++ K  K+N+  D     D A T+SLS S + E
Subjt:  RKKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSE

Query:  WILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAI
        WILDSGCSYHMCP+++ F  F+E +GGVV MGND+ CK MGIG++++KMF G ++ L DVRYVPDLKKNLISLG  D+ GY+IILEGG LKV RGALV +
Subjt:  WILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAI

Query:  TGTRRGSIYYLNGTTIIGHAAVA-SSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWG
         GTRRG++Y+L+G+T+ G  A++ SS E D SKLWHMRLGH GEKALQTLV QGVLKGA TG+I+FCEHC+ GKQ  VKFGTAIHQT+GILDYVH+DVWG
Subjt:  TGTRRGSIYYLNGTTIIGHAAVA-SSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWG

Query:  PTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQ
        P+KN SLGG R+FVTF+DD+S+RVW+Y MRHK+EVL+IFL+WKKM+E QT +KIK+LRSDNGGEY  DPF  VC  EGIVRHFT+ G PQQNGVAERMN+
Subjt:  PTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQ

Query:  TLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPE
        TL++KVRC++  AGLSKAFWAEA++YA HL NRLP +   G+TP+EVWSG   +DYD LH+FGCPAY+HV + KLDPRAKKA F+GFS GVKGYRLWCPE
Subjt:  TLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPE

Query:  TSKIVNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY
        T KIVNSRDVTFDES M     + N   +   +  + SP V   T + +D  +   D ++      E QS      +  ESIA +RP+RVIR+PARY
Subjt:  TSKIVNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY

TrEMBL top hitse value%identityAlignment
A0A2R6RYD4 Endonuclease3.6e-29462.99Show/hide
Query:  QVVSEPSCEPXRRGSRVWSSN--LEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKEL
        ++ S  +      G R   SN   EVEKFDGTNNFG+W+ EV D+L+ QDLD++L E  PEDM+E +W KLNRQACG IR CL KDQKY  MK T AKEL
Subjt:  QVVSEPSCEPXRRGSRVWSSN--LEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKEL

Query:  WDKLEAKYMKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEM
        WDKLE KYM K VEN+LYLKKKLFRF +K+G SM EHL+ FNKI+ DL NLDV+IDDEDKALLLLNSLP++YE + TTLL+G   I F DVSNAL+NNE+
Subjt:  WDKLEAKYMKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEM

Query:  RKKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSE
        RKK+++ ++ SSSN LTA GRTS  K     + R KSRG+S   R+L K+ECAYC QKGHW+KDC  ++ K  K+N+  D     D A T+SLS S + E
Subjt:  RKKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSE

Query:  WILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAI
        WILDSGCSYHMCP+++ F  F+E +GGVV MGND+ CK MGIG++++KMF G ++ L DVRYVPDLKKNLISLG  D+ GY+IILEGG LKV RGALV +
Subjt:  WILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAI

Query:  TGTRRGSIYYLNGTTIIGHAAVA-SSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWG
         GTRRG++Y+L+G+T+ G  A++ SS E D SKLWHMRLGH GEKALQTLV QGVLKGA TG+I+FCEHC+ GKQ  VKFGTAIHQT+GILDYVH+DVWG
Subjt:  TGTRRGSIYYLNGTTIIGHAAVA-SSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWG

Query:  PTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQ
        P+KN SLGG R+FVTF+DD+S+RVW+Y MRHK+EVL+IFL+WKKM+E QT +KIK+LRSDNGGEY  DPF  VC  EGIVRHFT+ G PQQNGVAERMN+
Subjt:  PTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQ

Query:  TLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPE
        TL++KVRC++  AGLSKAFWAEA++YA HL NRLP +   G+TP+EVWSG   +DYD LH+FGCPAY+HV + KLDPRAKKA F+GFS GVKGYRLWCPE
Subjt:  TLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPE

Query:  TSKIVNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY
        T KIVNSRDVTFDES M     + N   +   +  + SP V   T + +D  +   D ++      E QS      +  ESIA +RP+RVIR+PARY
Subjt:  TSKIVNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY

A0A4Y1RZX4 Transposable element protein7.1e-26658.09Show/hide
Query:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P      + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M K  EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E +GGVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKN
        RR ++Y+L G+T+IG AAV  + + D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGTAIH TKGILDYVHTDVWGP+KN
Subjt:  RRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKN

Query:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE
         S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+TL+E
Subjt:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE

Query:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI
        KVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+  KI
Subjt:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI

Query:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY
        V SRDVTFDE+ M  Q K E   EA K +   ++V      V P        +   D  E+ E   EE++   E  ++ +SIA  R RR IRKP R+
Subjt:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY

A0A4Y1S2D4 Transposable element protein7.1e-26658.09Show/hide
Query:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P      + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M K  EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E +GGVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKN
        RR ++Y+L G+T+IG AAV  + + D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGTAIH TKGILDYVHTDVWGP+KN
Subjt:  RRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKN

Query:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE
         S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+TL+E
Subjt:  VSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIE

Query:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI
        KVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+  KI
Subjt:  KVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKI

Query:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY
        V SRDVTFDE+ M  Q K E   EA K +   ++V      V P        +   D  E+ E   EE++   E  ++ +SIA  R RR IRKP R+
Subjt:  VNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY

A0A5B7BZJ7 Putative copia LTR rider2.2e-26761.7Show/hide
Query:  GAQQVVSEPSCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQ-EEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAK
        G     S P        + V ++  EVEKFDG+ NF +W+ EV D+L  Q+LD++L  E  P+D++E +W K+NRQACG IR CL KDQKY  M+ T A+
Subjt:  GAQQVVSEPSCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQ-EEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAK

Query:  ELWDKLEAKYMKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINN
         LW KLE K+M K +EN+LYLKKKLF+F YK+G SM +HLD+FNK++ DL NLD +I +EDKALLLLNSLP+SY+ + TTLL+G   I F+DVSNAL+NN
Subjt:  ELWDKLEAKYMKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINN

Query:  EMRKKEKETYQSSSSNVLTAHGRTSNWKKNECG---EPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSK---SNVVRDDDTDTDDALTIS
        E RK+++  ++ + S  LT  GR+    +   G   + R +SRG S   R+  K+EC  C +KGHWKKDC   K+KG K   +N+V  D+ + + A T++
Subjt:  EMRKKEKETYQSSSSNVLTAHGRTSNWKKNECG---EPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSK---SNVVRDDDTDTDDALTIS

Query:  LSVSQTSEWILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKV
         S S + EWILDSGCSYHMCP+++ F   +EF+GGVV +GN++ C   G+G +++KM +G I+ L++VRYVPDLKKNLISLGVLD+ GYRII+EGG LKV
Subjt:  LSVSQTSEWILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKV

Query:  ARGALVAITGTRRGSIYYLNGTTIIGHAAVASS---KEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGI
         RGA VA+ GTR+G++Y L+G T+ G A V+ S   +E D SKLWHMRLGHAGEKALQ LV QGVLKGA TG++ FCEHC+ GKQ  VKFGTA+H+T+GI
Subjt:  ARGALVAITGTRRGSIYYLNGTTIIGHAAVASS---KEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGI

Query:  LDYVHTDVWGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQ
        LDYVHTDVWGPTK  SLGGK WFVTFIDD+S+RVW+Y MRHK+EVL IFL WKKM+E QT +KI+RLRSDNGGEY  DPF  +CRDEGIVRHFTV G PQ
Subjt:  LDYVHTDVWGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQ

Query:  QNGVAERMNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKG
        QNGVAERMN+TL+EKVRC+LS AGL KAFWAEA++YA H+VNRLP + N G+TPLEVWSGS  +DYD+LHVFGCPAY+HVT++KLDPRAKKA F+GFS G
Subjt:  QNGVAERMNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKG

Query:  VKGYRLWCPETSKIVNSRDVTFDESGMFLQK
        VKGYRLWCP T KIV SRDVTF+E+ M  QK
Subjt:  VKGYRLWCPETSKIVNSRDVTFDESGMFLQK

A0A5H2XTW6 Uncharacterized protein3.5e-26558.13Show/hide
Query:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY
        S  P      + S+ LEV+KF+G+NNFG+W+ EV D+L  Q+LD+ L E+ PED+ + +WT++N  AC  IRS L K+ KYP+MK T AK+LW KLE KY
Subjt:  SCEPXRRGSRVWSSNLEVEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKY

Query:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY
        M K  EN+L+LKK+LFRF Y+ GISM EHL+D+NKI+ DL NLDV I DEDKAL LLNSLP+ Y+ + TTLL+G + +  ++VS AL+N+E RKKE++T 
Subjt:  MKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETY

Query:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL
        Q+S +  L A GRT   K  + G+ R KSRG+        K+ECA+CRQKGHWKKDC  LK K     GS++NV +  D D + AL  S +   ++EWIL
Subjt:  QSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEK-----GSKSNVVRDDDTDTDDALTISLSVSQTSEWIL

Query:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT
        DSGC+YHMCP +E F  F+E +GGVV MGN++ CK  GIG + +KM DG +R+L DVRYVPD+KKNLISLG L++ G +I +EGG LK   GALV + GT
Subjt:  DSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGT

Query:  RRGSIYYLNGTTIIGHAAV---ASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGP
        RR ++Y+L G+T+IG AAV   A +   D ++LWHMRLGHAGEKALQ LV QG+LKGA   +++FCEHC+ GKQ  VKFGTAIH TKGILDYVHTDVWGP
Subjt:  RRGSIYYLNGTTIIGHAAV---ASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGP

Query:  TKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQT
        +KN S GG  +FV+F+DD+S+R+W+Y M+ K+EVL+IFL+WKKM+E Q+ +KIK LRSDNGGEY  DPFLKVC+DEGIVRHFTV   PQQNGVAERMN+T
Subjt:  TKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQT

Query:  LIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPET
        L+EKVRC+LS AGL KAFWAEA++YA HL+NRLP + N G+TP+EVWSG P +DY  LH+FGCPAYYHV +SKLDPRAKKA FMGFS GVKGYRLWCP+ 
Subjt:  LIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMGFSKGVKGYRLWCPET

Query:  SKIVNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY
         KIV SRDVTFDE+ M  Q K E   EA K +   ++V      V P    I   +   D  E+ E   EE++   E  ++ +SIA  R RR IRKP R+
Subjt:  SKIVNSRDVTFDESGMFLQ-KIENNDEALKQV---EKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNERVEEPESIAKNRPRRVIRKPARY

SwissProt top hitse value%identityAlignment
P04146 Copia protein4.5e-6827.9Show/hide
Query:  VEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKRVENKLYLKKKLFR
        ++ FDG   + +W+  +  LL  QD+   +   MP ++ ++ W K  R A   I   L             A+++ + L+A Y +K + ++L L+K+L  
Subjt:  VEKFDGTNNFGVWRGEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKRVENKLYLKKKLFR

Query:  FDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTL-LHGITNIDFEDVSNALINNEMRKKEKETYQSSSSNVLTA--HGRT
              +S+  H   F+++I++LL    KI++ DK   LL +LP  Y+ ++T +      N+    V N L++ E+  K K  +  +S  V+ A  H   
Subjt:  FDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTL-LHGITNIDFEDVSNALINNEMRKKEKETYQSSSSNVLTA--HGRT

Query:  SNWK----KNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSE--------WILDSGCSYH
        + +K    KN   +P+   +G S       K +C +C ++GH KKDC   K   +  N   +    T  +  I+  V + +         ++LDSG S H
Subjt:  SNWK----KNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSE--------WILDSGCSYH

Query:  MCPNKEMFLDFKE----FNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGTRRG
        +  ++ ++ D  E        V   G        GI    +++ +     L+DV +  +   NL+S+  L  +G  I  +   + +++  L+ +  +   
Subjt:  MCPNKEMFLDFKE----FNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGTRRG

Query:  SIYYLNGTTIIGHAAVA-SSKEQDISKLWHMRLGHAGE------KALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQT--KGILDYVHTDV
            LN   +I   A + ++K ++  +LWH R GH  +      K      +Q +L        + CE C+ GKQ  + F     +T  K  L  VH+DV
Subjt:  SIYYLNGTTIIGHAAVA-SSKEQDISKLWHMRLGHAGE------KALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQT--KGILDYVHTDV

Query:  WGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERM
         GP   V+L  K +FV F+D ++     Y +++K++V  +F ++    E   + K+  L  DNG EY  +   + C  +GI  H TVP  PQ NGV+ERM
Subjt:  WGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERM

Query:  NQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSG--NGGRTPLEVWSGSPVSDYDK-LHVFGCPAYYHVTD--SKLDPRAKKAKFMGFSKGVKG
         +T+ EK R ++S A L K+FW EA+  A +L+NR+P     +  +TP E+W       Y K L VFG   Y H+ +   K D ++ K+ F+G+     G
Subjt:  NQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSG--NGGRTPLEVWSGSPVSDYDK-LHVFGCPAYYHVTD--SKLDPRAKKAKFMGFSKGVKG

Query:  YRLWCPETSKIVNSRDVTFDESGM
        ++LW     K + +RDV  DE+ M
Subjt:  YRLWCPETSKIVNSRDVTFDESGM

P0C026 Nudix hydrolase 24, chloroplastic2.7e-9752.08Show/hide
Query:  FHNFSIVSSL-YNASKISRTTYPMLSTFRPAVTRSISVAAGNFSWDDVVHISLPSSFQDDPSDLTGYFEKVKLCNRGSDTQSEFLPFVIDDLIVGYVHHG
        F + +++ +L + +S  SR+  P L   R     ++ ++  +F+W+DV   S       + SDLTG+ EKV  CNRG +   EF+PFVI++ IVGY+H G
Subjt:  FHNFSIVSSL-YNASKISRTTYPMLSTFRPAVTRSISVAAGNFSWDDVVHISLPSSFQDDPSDLTGYFEKVKLCNRGSDTQSEFLPFVIDDLIVGYVHHG

Query:  FAKHLKQYPNVFIFPRDDCGQFGAYLTLHESLKTPEDRTHAVGDVVKCLG-KEVIPGTRNELYPVTSSFGASNFFSLERAAAPYFGIKVYGVHMNGYVEK
        F K+L+ + ++F F      Q+G ++TL+  L  PE+RT AV  V+K LG K +IPG RNELYPV  SF A  FFS+ERAAAPYFG+K Y +H+NGYVE+
Subjt:  FAKHLKQYPNVFIFPRDDCGQFGAYLTLHESLKTPEDRTHAVGDVVKCLG-KEVIPGTRNELYPVTSSFGASNFFSLERAAAPYFGIKVYGVHMNGYVEK

Query:  EGEKFLWVAKRSQTKPTFPGMLDHLVAGGLPQGIPCGENVMKECEEEAGIPRSISKGAIPVGAISYTDIKGFSYKRDVQFCYDLKLPESFIPENQDGEVE
        +G+KFLW+ KRS  K T+PG LDHLVAGGLP GI   EN++KECEEEAGI + ++  AI VG +SY DI  + + RDV FCYDL+LP+ F+P NQDGEV+
Subjt:  EGEKFLWVAKRSQTKPTFPGMLDHLVAGGLPQGIPCGENVMKECEEEAGIPRSISKGAIPVGAISYTDIKGFSYKRDVQFCYDLKLPESFIPENQDGEVE

Query:  GFMLLLVTNVANVIRRTQLFKPNCTLVIIDFLFRHG
         F L+ V  VANV+R+T  FK +C+LVIIDFLFRHG
Subjt:  GFMLLLVTNVANVIRRTQLFKPNCTLVIIDFLFRHG

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-14238.64Show/hide
Query:  EVEKFDGTNNFGVWRGEVSDLLVMQDLD--VSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKRVENKLYLKKK
        EV KF+G N F  W+  + DLL+ Q L   + +  + P+ M   +W  L+ +A   IR  L  D     +    A+ +W +LE+ YM K + NKLYLKK+
Subjt:  EVEKFDGTNNFGVWRGEVSDLLVMQDLD--VSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKRVENKLYLKKK

Query:  LFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETYQSSSSNVLTAHGRT
        L+     EG +   HL+ FN +IT L NL VKI++EDKA+LLLNSLP SY+ + TT+LHG T I+ +DV++AL+ NE  +K+ E    +    L   GR 
Subjt:  LFRFDYKEGISMAEHLDDFNKIITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETYQSSSSNVLTAHGRT

Query:  SNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDC-SILKEKGSKSNVVRDDDT----DTDDALTISLSVSQ--------TSEWILDSGCSY
         +++++     R  +RG+S N  K     C  C Q GH+K+DC +  K KG  S    DD+T      +D + + ++  +         SEW++D+  S+
Subjt:  SNWKKNECGEPRLKSRGRSDNWRKLDKNECAYCRQKGHWKKDC-SILKEKGSKSNVVRDDDT----DTDDALTISLSVSQ--------TSEWILDSGCSY

Query:  HMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGTRRGSIY
        H  P +++F  +   + G V MGN S  KI GIG + IK   G    LKDVR+VPDL+ NLIS   LD  GY         ++ +G+LV   G  RG++Y
Subjt:  HMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGTRRGSIY

Query:  YLNGTTIIGHAAVASSKEQDIS-KLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKNVSLGG
          N     G    A   + +IS  LWH R+GH  EK LQ L  + ++  A    +  C++C+FGKQ  V F T+  +   ILD V++DV GP +  S+GG
Subjt:  YLNGTTIIGHAAVASSKEQDIS-KLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKNVSLGG

Query:  KRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIEKVRCI
         ++FVTFIDD S+++W+Y ++ K++V Q+F ++  +VE +T +K+KRLRSDNGGEYT   F + C   GI    TVPG PQ NGVAERMN+T++EKVR +
Subjt:  KRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIEKVRCI

Query:  LSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTD---SKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVN
        L  A L K+FW EA+  A +L+NR P        P  VW+   VS Y  L VFGC A+ HV     +KLD ++    F+G+     GYRLW P   K++ 
Subjt:  LSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTD---SKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVN

Query:  SRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAP--TEEPIDQVNNNFDVLEQKEQS---LEEQSLVNERVEEPE
        SRDV F ES     ++    +  ++V+  +    V  P  +  P    +   +V EQ EQ    +E+   ++E VEE E
Subjt:  SRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAP--TEEPIDQVNNNFDVLEQKEQS---LEEQSLVNERVEEPE

Q8VXZ0 Nudix hydrolase 20, chloroplastic7.9e-10554.78Show/hide
Query:  SIVSSLYNASKISRTTYPMLSTFRPAV-------TRSISVA-----AGNFSWDDVVHISLPSSFQDDPSDLTGYFEKVKLCNRGSDTQSEFLPFVIDDLI
        ++ +SL+++  I+R   P+L     ++       +R++S A     + +F+WDDV+        + + SDLTG+FEK+  CNRGS+   EF+PFVI++ I
Subjt:  SIVSSLYNASKISRTTYPMLSTFRPAV-------TRSISVA-----AGNFSWDDVVHISLPSSFQDDPSDLTGYFEKVKLCNRGSDTQSEFLPFVIDDLI

Query:  VGYVHHGFAKHLKQYPNVFIFPRD-DC-GQFGAYLTLHESLKTPEDRTHAVGDVVKCLG-KEVIPGTRNELYPVTSSFGASNFFSLERAAAPYFGIKVYG
        VGY+H  F ++L+++ ++F F ++  C  +   Y+TL+  L+ PEDRT AV DV+K LG K +IPG RNELYPV  SF A  FFSLERAAAPYFGIK YG
Subjt:  VGYVHHGFAKHLKQYPNVFIFPRD-DC-GQFGAYLTLHESLKTPEDRTHAVGDVVKCLG-KEVIPGTRNELYPVTSSFGASNFFSLERAAAPYFGIKVYG

Query:  VHMNGYVEKEGEKFLWVAKRSQTKPTFPGMLDHLVAGGLPQGIPCGENVMKECEEEAGIPRSISKGAIPVGAISYTDIKGFSYKRDVQFCYDLKLPESFI
        VHMNGYVE++G+K LW+ KRS +K T+PGMLDHLVAGGLP GI CG N++KECEEEAGI R+I+  AI VGA+SY DI  + +KRDV FCYDL+LPE F+
Subjt:  VHMNGYVEKEGEKFLWVAKRSQTKPTFPGMLDHLVAGGLPQGIPCGENVMKECEEEAGIPRSISKGAIPVGAISYTDIKGFSYKRDVQFCYDLKLPESFI

Query:  PENQDGEVEGFMLLLVTNVANVIRRTQLFKPNCTLVIIDFLFRHG
        P+NQDGEVE F L+ V  VA+VI++T  FK NC+LVIIDFLFRHG
Subjt:  PENQDGEVEGFMLLLVTNVANVIRRTQLFKPNCTLVIIDFLFRHG

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.0e-5123.92Show/hide
Query:  WTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNK-IIT---DLLNLDVKIDDEDKALL
        W + ++     +   +    +    + T A ++W+ L     +K   N  Y      R   K+     + +DD+ + ++T    L  L   +D +++   
Subjt:  WTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNK-IIT---DLLNLDVKIDDEDKALL

Query:  LLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMR----------------KKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGR---SDNW
        +L +LPE Y+ V+  +    T     ++   L+N+E +                   + T  ++++N    + R  N   N   +P  +S      ++N 
Subjt:  LLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMR----------------KKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGR---SDNW

Query:  RKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVS---QTSEWILDSGCSYHMCPNKEMFLDFKEFNGG-VVYMGNDSTCKIM
         K    +C  C  +GH  K CS L+   S  N  +     T      +L++     ++ W+LDSG ++H+  +       + + GG  V + + ST  I 
Subjt:  RKLDKNECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVS---QTSEWILDSGCSYHMCPNKEMFLDFKEFNGG-VVYMGNDSTCKIM

Query:  GIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISL-GVLDASGYRIILEGGNLKV--ARGALVAITGTRRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMR
          GS  +      +  L ++ YVP++ KNLIS+  + +A+G  +     + +V      +  + G  +  +Y     +    +  AS   +     WH R
Subjt:  GIGSVQIKMFDGVIRKLKDVRYVPDLKKNLISL-GVLDASGYRIILEGGNLKV--ARGALVAITGTRRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMR

Query:  LGHAGEKALQTLVNQGVLKGAT-TGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQ
        LGH     L ++++   L     + +   C  C+  K   V F  +   +   L+Y+++DVW  +  +S    R++V F+D +++  W+YP++ K++V +
Subjt:  LGHAGEKALQTLVNQGVLKGAT-TGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQ

Query:  IFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVS
         F+ +K ++EN+   +I    SDNGGE+      +     GI    + P  P+ NG++ER ++ ++E    +LS A + K +W  A + AV+L+NRLP  
Subjt:  IFLEWKKMVENQTDKKIKRLRSDNGGEYTYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVS

Query:  GNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHV---TDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVNSRDVTFDES----GMFLQKIENNDEALK
             +P +   G+   +YDKL VFGC  Y  +      KLD ++++  F+G+S     Y     +TS++  SR V FDE+      +L  +    E  +
Subjt:  GNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHV---TDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVNSRDVTFDES----GMFLQKIENNDEALK

Query:  QVEKVVFSPDVVAPTEEPI
        +    V+SP    PT  P+
Subjt:  QVEKVVFSPDVVAPTEEPI

Arabidopsis top hitse value%identityAlignment
AT5G19460.1 nudix hydrolase homolog 205.6e-10654.78Show/hide
Query:  SIVSSLYNASKISRTTYPMLSTFRPAV-------TRSISVA-----AGNFSWDDVVHISLPSSFQDDPSDLTGYFEKVKLCNRGSDTQSEFLPFVIDDLI
        ++ +SL+++  I+R   P+L     ++       +R++S A     + +F+WDDV+        + + SDLTG+FEK+  CNRGS+   EF+PFVI++ I
Subjt:  SIVSSLYNASKISRTTYPMLSTFRPAV-------TRSISVA-----AGNFSWDDVVHISLPSSFQDDPSDLTGYFEKVKLCNRGSDTQSEFLPFVIDDLI

Query:  VGYVHHGFAKHLKQYPNVFIFPRD-DC-GQFGAYLTLHESLKTPEDRTHAVGDVVKCLG-KEVIPGTRNELYPVTSSFGASNFFSLERAAAPYFGIKVYG
        VGY+H  F ++L+++ ++F F ++  C  +   Y+TL+  L+ PEDRT AV DV+K LG K +IPG RNELYPV  SF A  FFSLERAAAPYFGIK YG
Subjt:  VGYVHHGFAKHLKQYPNVFIFPRD-DC-GQFGAYLTLHESLKTPEDRTHAVGDVVKCLG-KEVIPGTRNELYPVTSSFGASNFFSLERAAAPYFGIKVYG

Query:  VHMNGYVEKEGEKFLWVAKRSQTKPTFPGMLDHLVAGGLPQGIPCGENVMKECEEEAGIPRSISKGAIPVGAISYTDIKGFSYKRDVQFCYDLKLPESFI
        VHMNGYVE++G+K LW+ KRS +K T+PGMLDHLVAGGLP GI CG N++KECEEEAGI R+I+  AI VGA+SY DI  + +KRDV FCYDL+LPE F+
Subjt:  VHMNGYVEKEGEKFLWVAKRSQTKPTFPGMLDHLVAGGLPQGIPCGENVMKECEEEAGIPRSISKGAIPVGAISYTDIKGFSYKRDVQFCYDLKLPESFI

Query:  PENQDGEVEGFMLLLVTNVANVIRRTQLFKPNCTLVIIDFLFRHG
        P+NQDGEVE F L+ V  VA+VI++T  FK NC+LVIIDFLFRHG
Subjt:  PENQDGEVEGFMLLLVTNVANVIRRTQLFKPNCTLVIIDFLFRHG

AT5G19470.1 nudix hydrolase homolog 241.9e-9852.08Show/hide
Query:  FHNFSIVSSL-YNASKISRTTYPMLSTFRPAVTRSISVAAGNFSWDDVVHISLPSSFQDDPSDLTGYFEKVKLCNRGSDTQSEFLPFVIDDLIVGYVHHG
        F + +++ +L + +S  SR+  P L   R     ++ ++  +F+W+DV   S       + SDLTG+ EKV  CNRG +   EF+PFVI++ IVGY+H G
Subjt:  FHNFSIVSSL-YNASKISRTTYPMLSTFRPAVTRSISVAAGNFSWDDVVHISLPSSFQDDPSDLTGYFEKVKLCNRGSDTQSEFLPFVIDDLIVGYVHHG

Query:  FAKHLKQYPNVFIFPRDDCGQFGAYLTLHESLKTPEDRTHAVGDVVKCLG-KEVIPGTRNELYPVTSSFGASNFFSLERAAAPYFGIKVYGVHMNGYVEK
        F K+L+ + ++F F      Q+G ++TL+  L  PE+RT AV  V+K LG K +IPG RNELYPV  SF A  FFS+ERAAAPYFG+K Y +H+NGYVE+
Subjt:  FAKHLKQYPNVFIFPRDDCGQFGAYLTLHESLKTPEDRTHAVGDVVKCLG-KEVIPGTRNELYPVTSSFGASNFFSLERAAAPYFGIKVYGVHMNGYVEK

Query:  EGEKFLWVAKRSQTKPTFPGMLDHLVAGGLPQGIPCGENVMKECEEEAGIPRSISKGAIPVGAISYTDIKGFSYKRDVQFCYDLKLPESFIPENQDGEVE
        +G+KFLW+ KRS  K T+PG LDHLVAGGLP GI   EN++KECEEEAGI + ++  AI VG +SY DI  + + RDV FCYDL+LP+ F+P NQDGEV+
Subjt:  EGEKFLWVAKRSQTKPTFPGMLDHLVAGGLPQGIPCGENVMKECEEEAGIPRSISKGAIPVGAISYTDIKGFSYKRDVQFCYDLKLPESFIPENQDGEVE

Query:  GFMLLLVTNVANVIRRTQLFKPNCTLVIIDFLFRHG
         F L+ V  VANV+R+T  FK +C+LVIIDFLFRHG
Subjt:  GFMLLLVTNVANVIRRTQLFKPNCTLVIIDFLFRHG

AT5G19470.2 nudix hydrolase homolog 242.2e-9450.89Show/hide
Query:  FHNFSIVSSL-YNASKISRTTYPMLSTFRPAVTRSISVAAGNFSWDDVVHISLPSSFQDDPSDLTGYFEKVKLCNRGSDTQSEFLPFVIDDLIVGYVHHG
        F + +++ +L + +S  SR+  P L   R     ++ ++  +F+W+DV   S       + SDLTG+ EKV  CNRG +   EF+PFVI++ IVGY+H G
Subjt:  FHNFSIVSSL-YNASKISRTTYPMLSTFRPAVTRSISVAAGNFSWDDVVHISLPSSFQDDPSDLTGYFEKVKLCNRGSDTQSEFLPFVIDDLIVGYVHHG

Query:  FAKHLKQYPNVFIFPRDDCGQFGAYLTLHESLKTPEDRTHAVGDVVKCLG-KEVIPGTRNELYPVTSSFGASNFFSLERAAAPYFGIKVYGVHMNGYVEK
        F K+L+ + ++F F      Q+G ++TL+  L  PE+RT AV  V+K LG K +IPG RNELYPV  SF A  FFS+ERAAAPYFG+K Y +H+NGYVE+
Subjt:  FAKHLKQYPNVFIFPRDDCGQFGAYLTLHESLKTPEDRTHAVGDVVKCLG-KEVIPGTRNELYPVTSSFGASNFFSLERAAAPYFGIKVYGVHMNGYVEK

Query:  EGEKFLWVAKRSQTKPTFPGMLDHLVAGGLPQGIPCGENVMKECEEEAGIPRSISKGAIPVGAISYTDIKGFSYKRDVQFCYDLKLPESFIPENQDGEVE
        +G+KFLW+ KRS  K T+PG LDHLVAGGL        N++KECEEEAGI + ++  AI VG +SY DI  + + RDV FCYDL+LP+ F+P NQDGEV+
Subjt:  EGEKFLWVAKRSQTKPTFPGMLDHLVAGGLPQGIPCGENVMKECEEEAGIPRSISKGAIPVGAISYTDIKGFSYKRDVQFCYDLKLPESFIPENQDGEVE

Query:  GFMLLLVTNVANVIRRTQLFKPNCTLVIIDFLFRHG
         F L+ V  VANV+R+T  FK +C+LVIIDFLFRHG
Subjt:  GFMLLLVTNVANVIRRTQLFKPNCTLVIIDFLFRHG

ATMG00300.1 Gag-Pol-related retrotransposon family protein1.6e-2040.18Show/hide
Query:  GNLKVARGALVAITGTRRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTK
        G LKV +G    + G R  S+Y L G+   G + +A +  +D ++LWH RL H  ++ ++ LV +G L  +    + FCE CI+GK   V F T  H TK
Subjt:  GNLKVARGALVAITGTRRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGATTGRIDFCEHCIFGKQKNVKFGTAIHQTK

Query:  GILDYVHTDVWG
          LDYVH+D+WG
Subjt:  GILDYVHTDVWG

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein2.1e-1547.19Show/hide
Query:  MNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMG
        MN+T+IEKVR +L + GL K F A+A + AVH++N+ P +      P EVW  S V  Y  L  FGC AY H  + KL PRAKK +  G
Subjt:  MNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHVTDSKLDPRAKKAKFMG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCTGCAAATCTCTCGTTGCACACAGATTTCATAACTTCTCCATTGTCTCTTCCCTTTACAACGCTTCGAAGATTTCCCGGACGACGTATCCAATGTTGTCT
ACATTTCGGCCAGCAGTAACCCGCTCGATATCGGTCGCCGCCGGAAACTTCTCTTGGGACGACGTTGTGCACATTTCTTTACCATCAAGCTTTCAGGACGATCCT
TCAGATCTGACGGGCTATTTCGAGAAGGTCAAGCTCTGCAATCGTGGATCAGATACGCAATCTGAATTTCTCCCCTTTGTCATCGACGACCTAATTGTTGGCTAC
GTGCACCACGGTTTCGCTAAACACCTGAAGCAGTATCCGAATGTCTTTATCTTCCCCCGAGATGATTGTGGTCAATTTGGCGCTTACTTAACCTTGCATGAATCG
TTGAAGACACCTGAAGATAGGACGCATGCAGTTGGAGATGTAGTTAAGTGTTTGGGTAAAGAAGTCATTCCAGGTACACGGAATGAGCTATACCCTGTTACTTCG
TCCTTTGGTGCGTCAAACTTCTTCTCACTCGAACGAGCTGCGGCCCCTTATTTTGGAATAAAGGTTTATGGAGTTCATATGAATGGTTATGTTGAGAAGGAGGGA
GAAAAGTTTTTGTGGGTTGCGAAAAGGAGTCAAACAAAGCCAACTTTTCCTGGAATGTTGGATCATTTAGTGGCTGGAGGGCTGCCTCAGGGCATCCCTTGTGGG
GAGAATGTGATGAAGGAATGTGAGGAGGAAGCAGGCATACCGAGATCAATATCTAAAGGGGCCATACCAGTTGGTGCCATTTCATATACAGACATCAAAGGTTTT
AGTTACAAGAGAGATGTTCAATTCTGTTACGATTTAAAGCTTCCTGAAAGCTTTATACCGGAGAATCAAGATGGGGAGGTTGAAGGATTCATGTTGTTGCTTGTG
ACAAATGTCGCAAATGTGATTCGGAGGACACAGTTATTCAAGCCTAATTGTACTCTTGTCATTATCGACTTCCTATTTCGACACGGTGATTGGTTCGTGGATGTA
GGGGAACTTCTCTCCGAACCACGTAAATATCTTAGTGCCTCATTGTGTAAAATCGTTCATTGGTGTTGTGAGTACCTTTGTTCCGCTTTCGGCGCACAACAAGTG
GTATCAGAGCCAAGTTGTGAACCANCACGACGTGGATCGAGAGTGTGGAGTTCCAATCTCGAAGTAGAAAAATTTGATGGAACCAATAATTTTGGTGTTTGGCGA
GGCGAAGTTAGCGATTTATTGGTCATGCAAGACCTCGATGTCTCTCTCCAAGAGGAAATGCCTGAAGACATGACAGAAGCAGAATGGACGAAATTAAATCGGCAA
GCCTGTGGAATTATTCGATCCTGTCTTGGCAAGGATCAGAAATATCCATTTATGAAGGTTACCATGGCGAAAGAATTATGGGACAAGCTTGAAGCGAAATATATG
AAGAAGAGAGTGGAAAATAAACTCTATTTGAAGAAGAAACTCTTCCGTTTTGATTACAAGGAAGGCATCTCAATGGCTGAGCATTTGGATGATTTTAACAAAATC
ATCACTGATTTGCTCAATCTTGATGTTAAGATTGATGACGAGGACAAGGCGTTGTTGTTATTAAACTCCTTGCCGGAATCCTACGAGTTTGTAGTAACGACCCTA
CTACATGGAATCACTAATATTGATTTTGAAGATGTTTCAAATGCCTTAATAAACAATGAGATGCGAAAGAAGGAAAAGGAGACGTATCAAAGCTCTAGCTCAAAT
GTTCTCACTGCTCATGGAAGGACTTCCAATTGGAAAAAGAACGAATGTGGAGAGCCTCGACTAAAGTCGAGAGGTAGATCTGATAATTGGAGAAAACTTGATAAA
AATGAGTGTGCATATTGTCGACAGAAAGGCCATTGGAAAAAAGATTGTTCGATCTTAAAAGAGAAGGGGTCGAAGTCCAATGTGGTTAGAGATGACGACACTGAT
ACAGATGATGCCTTGACAATCTCCCTATCAGTCAGCCAAACCAGTGAATGGATTCTTGATTCTGGGTGCTCCTATCATATGTGTCCCAACAAAGAGATGTTTTTG
GACTTCAAGGAGTTCAATGGTGGAGTTGTCTATATGGGAAATGATAGTACTTGTAAGATAATGGGAATCGGCTCAGTACAAATCAAGATGTTTGACGGGGTTATA
CGAAAACTCAAAGATGTGAGGTATGTCCCTGACTTGAAGAAAAATTTGATTTCACTTGGCGTTTTGGATGCGAGTGGTTATCGCATCATTTTGGAAGGAGGGAAT
TTGAAAGTAGCTCGTGGAGCTTTGGTGGCAATTACAGGAACTAGAAGAGGTAGCATATACTACCTCAACGGAACCACAATAATTGGGCATGCTGCTGTGGCAAGT
TCGAAAGAACAAGACATATCGAAATTATGGCACATGCGACTCGGGCATGCTGGTGAGAAGGCGCTTCAGACACTGGTGAACCAAGGAGTTCTTAAGGGTGCCACA
ACTGGAAGAATTGATTTCTGTGAACATTGTATATTTGGCAAACAAAAAAATGTGAAGTTTGGTACAGCTATACACCAAACAAAGGGCATTTTGGATTATGTTCAC
ACTGATGTATGGGGTCCCACAAAGAATGTCTCATTGGGAGGAAAGAGGTGGTTCGTCACCTTTATTGATGACTACTCGAAGAGAGTTTGGATGTATCCCATGAGG
CACAAAAACGAGGTTCTCCAAATCTTCCTAGAGTGGAAGAAAATGGTAGAGAATCAGACGGACAAGAAAATCAAAAGGCTGAGATCAGACAACGGGGGAGAATAT
ACTTATGATCCATTCCTTAAAGTATGCCGAGATGAAGGGATCGTTCGACACTTCACTGTTCCTGGTAAGCCACAACAAAATGGAGTTGCTGAGAGAATGAACCAG
ACATTAATAGAGAAGGTTCGATGCATATTGTCTCAAGCAGGATTGAGTAAGGCATTTTGGGCTGAGGCTCTCAGTTATGCAGTTCACTTGGTGAATCGTTTACCT
GTTTCTGGAAACGGTGGGAGAACTCCGCTTGAGGTATGGTCAGGTAGTCCTGTTAGTGATTATGATAAATTACATGTGTTTGGGTGTCCTGCTTATTATCATGTG
ACAGACTCAAAGCTGGATCCTAGAGCAAAAAAAGCCAAGTTTATGGGCTTTAGCAAAGGTGTGAAGGGCTATAGGTTATGGTGTCCAGAAACAAGTAAGATTGTT
AATAGTCGAGATGTGACATTCGATGAGTCTGGAATGTTTTTGCAGAAAATTGAAAATAATGACGAGGCATTGAAGCAGGTGGAGAAGGTGGTGTTCTCTCCTGAT
GTGGTTGCTCCTACTGAAGAACCTATTGACCAGGTAAATAATAACTTTGATGTCTTAGAACAGAAAGAGCAAAGTCTGGAGGAGCAAAGCCTTGTAAATGAAAGA
GTGGAGGAACCTGAGTCTATCGCCAAGAATAGACCACGAAGGGTAATTCGAAAACCTGCAAGGTACATTAATCCCGAAAGTTCGGGGTACTTGGAGCTACTGCAG
AGTTTGAGAAGTGGGACCTGCTTCTGA
mRNA sequenceShow/hide mRNA sequence
GGTCTTGTTGTCGTCCGGTCAGTGTTCCTTTTATGGTTATTTCCTCTGGTAATTCTGGCGAAGTTAGCTCCAATCCAATTCCATGGCCTGCAAATCTCTCGTTGC
ACACAGATTTCATAACTTCTCCATTGTCTCTTCCCTTTACAACGCTTCGAAGATTTCCCGGACGACGTATCCAATGTTGTCTACATTTCGGCCAGCAGTAACCCG
CTCGATATCGGTCGCCGCCGGAAACTTCTCTTGGGACGACGTTGTGCACATTTCTTTACCATCAAGCTTTCAGGACGATCCTTCAGATCTGACGGGCTATTTCGA
GAAGGTCAAGCTCTGCAATCGTGGATCAGATACGCAATCTGAATTTCTCCCCTTTGTCATCGACGACCTAATTGTTGGCTACGTGCACCACGGTTTCGCTAAACA
CCTGAAGCAGTATCCGAATGTCTTTATCTTCCCCCGAGATGATTGTGGTCAATTTGGCGCTTACTTAACCTTGCATGAATCGTTGAAGACACCTGAAGATAGGAC
GCATGCAGTTGGAGATGTAGTTAAGTGTTTGGGTAAAGAAGTCATTCCAGGTACACGGAATGAGCTATACCCTGTTACTTCGTCCTTTGGTGCGTCAAACTTCTT
CTCACTCGAACGAGCTGCGGCCCCTTATTTTGGAATAAAGGTTTATGGAGTTCATATGAATGGTTATGTTGAGAAGGAGGGAGAAAAGTTTTTGTGGGTTGCGAA
AAGGAGTCAAACAAAGCCAACTTTTCCTGGAATGTTGGATCATTTAGTGGCTGGAGGGCTGCCTCAGGGCATCCCTTGTGGGGAGAATGTGATGAAGGAATGTGA
GGAGGAAGCAGGCATACCGAGATCAATATCTAAAGGGGCCATACCAGTTGGTGCCATTTCATATACAGACATCAAAGGTTTTAGTTACAAGAGAGATGTTCAATT
CTGTTACGATTTAAAGCTTCCTGAAAGCTTTATACCGGAGAATCAAGATGGGGAGGTTGAAGGATTCATGTTGTTGCTTGTGACAAATGTCGCAAATGTGATTCG
GAGGACACAGTTATTCAAGCCTAATTGTACTCTTGTCATTATCGACTTCCTATTTCGACACGGTGATTGGTTCGTGGATGTAGGGGAACTTCTCTCCGAACCACG
TAAATATCTTAGTGCCTCATTGTGTAAAATCGTTCATTGGTGTTGTGAGTACCTTTGTTCCGCTTTCGGCGCACAACAAGTGGTATCAGAGCCAAGTTGTGAACC
ANCACGACGTGGATCGAGAGTGTGGAGTTCCAATCTCGAAGTAGAAAAATTTGATGGAACCAATAATTTTGGTGTTTGGCGAGGCGAAGTTAGCGATTTATTGGT
CATGCAAGACCTCGATGTCTCTCTCCAAGAGGAAATGCCTGAAGACATGACAGAAGCAGAATGGACGAAATTAAATCGGCAAGCCTGTGGAATTATTCGATCCTG
TCTTGGCAAGGATCAGAAATATCCATTTATGAAGGTTACCATGGCGAAAGAATTATGGGACAAGCTTGAAGCGAAATATATGAAGAAGAGAGTGGAAAATAAACT
CTATTTGAAGAAGAAACTCTTCCGTTTTGATTACAAGGAAGGCATCTCAATGGCTGAGCATTTGGATGATTTTAACAAAATCATCACTGATTTGCTCAATCTTGA
TGTTAAGATTGATGACGAGGACAAGGCGTTGTTGTTATTAAACTCCTTGCCGGAATCCTACGAGTTTGTAGTAACGACCCTACTACATGGAATCACTAATATTGA
TTTTGAAGATGTTTCAAATGCCTTAATAAACAATGAGATGCGAAAGAAGGAAAAGGAGACGTATCAAAGCTCTAGCTCAAATGTTCTCACTGCTCATGGAAGGAC
TTCCAATTGGAAAAAGAACGAATGTGGAGAGCCTCGACTAAAGTCGAGAGGTAGATCTGATAATTGGAGAAAACTTGATAAAAATGAGTGTGCATATTGTCGACA
GAAAGGCCATTGGAAAAAAGATTGTTCGATCTTAAAAGAGAAGGGGTCGAAGTCCAATGTGGTTAGAGATGACGACACTGATACAGATGATGCCTTGACAATCTC
CCTATCAGTCAGCCAAACCAGTGAATGGATTCTTGATTCTGGGTGCTCCTATCATATGTGTCCCAACAAAGAGATGTTTTTGGACTTCAAGGAGTTCAATGGTGG
AGTTGTCTATATGGGAAATGATAGTACTTGTAAGATAATGGGAATCGGCTCAGTACAAATCAAGATGTTTGACGGGGTTATACGAAAACTCAAAGATGTGAGGTA
TGTCCCTGACTTGAAGAAAAATTTGATTTCACTTGGCGTTTTGGATGCGAGTGGTTATCGCATCATTTTGGAAGGAGGGAATTTGAAAGTAGCTCGTGGAGCTTT
GGTGGCAATTACAGGAACTAGAAGAGGTAGCATATACTACCTCAACGGAACCACAATAATTGGGCATGCTGCTGTGGCAAGTTCGAAAGAACAAGACATATCGAA
ATTATGGCACATGCGACTCGGGCATGCTGGTGAGAAGGCGCTTCAGACACTGGTGAACCAAGGAGTTCTTAAGGGTGCCACAACTGGAAGAATTGATTTCTGTGA
ACATTGTATATTTGGCAAACAAAAAAATGTGAAGTTTGGTACAGCTATACACCAAACAAAGGGCATTTTGGATTATGTTCACACTGATGTATGGGGTCCCACAAA
GAATGTCTCATTGGGAGGAAAGAGGTGGTTCGTCACCTTTATTGATGACTACTCGAAGAGAGTTTGGATGTATCCCATGAGGCACAAAAACGAGGTTCTCCAAAT
CTTCCTAGAGTGGAAGAAAATGGTAGAGAATCAGACGGACAAGAAAATCAAAAGGCTGAGATCAGACAACGGGGGAGAATATACTTATGATCCATTCCTTAAAGT
ATGCCGAGATGAAGGGATCGTTCGACACTTCACTGTTCCTGGTAAGCCACAACAAAATGGAGTTGCTGAGAGAATGAACCAGACATTAATAGAGAAGGTTCGATG
CATATTGTCTCAAGCAGGATTGAGTAAGGCATTTTGGGCTGAGGCTCTCAGTTATGCAGTTCACTTGGTGAATCGTTTACCTGTTTCTGGAAACGGTGGGAGAAC
TCCGCTTGAGGTATGGTCAGGTAGTCCTGTTAGTGATTATGATAAATTACATGTGTTTGGGTGTCCTGCTTATTATCATGTGACAGACTCAAAGCTGGATCCTAG
AGCAAAAAAAGCCAAGTTTATGGGCTTTAGCAAAGGTGTGAAGGGCTATAGGTTATGGTGTCCAGAAACAAGTAAGATTGTTAATAGTCGAGATGTGACATTCGA
TGAGTCTGGAATGTTTTTGCAGAAAATTGAAAATAATGACGAGGCATTGAAGCAGGTGGAGAAGGTGGTGTTCTCTCCTGATGTGGTTGCTCCTACTGAAGAACC
TATTGACCAGGTAAATAATAACTTTGATGTCTTAGAACAGAAAGAGCAAAGTCTGGAGGAGCAAAGCCTTGTAAATGAAAGAGTGGAGGAACCTGAGTCTATCGC
CAAGAATAGACCACGAAGGGTAATTCGAAAACCTGCAAGGTACATTAATCCCGAAAGTTCGGGGTACTTGGAGCTACTGCAGAGTTTGAGAAGTGGGACCTGCTT
CTGAATAAAACAATTCGCCCAATACGTTGATGATAGAAAAATCAGGTTAATTTGTTTCAACATCGAATCTGGATTGTTACATACTGTTTATTGTTTTTGCAGAAC
TTTCTTGTTCATTAGATGTATTTGTGAAGCCAAACAAACTCTGTTTGGTACATCTATTAAATGAAAACACATTAACAAAAGTACACCTGAAAGCCCACTACATGG
AAACAATTTGAATTTCATGAACTCCCCAGCCACTATAAACACACTACACTTCACACTGACTTGTATTATTCACTTGCGTTTCTAACCCAAGATTTGACTGAGAAC
TATTGAATATTCAAAAGCCCAGGATTTCACCAAGAACTTGAAGCAACTGGAGGTGTGCAGTTAAGTGAGGGAAGTGTCCATTAGTGTTAATAACTCTCACTGTGC
AACTGCCCTTGATTTGCTTCTGCATAAAGATGGCAACCGAGGCAGGGGCGGCAACGTCGGTTTTAGTGTGTAGGATGGTGCATGGAATGTCGACCTTTTCAAGAA
TCTCCCTCTCATCCACTCTGAACACTGTCTTGGCCAAGGACACTGCAACTTCAGGCCTCATCTCATGCAAGCACTTCCGCAACCTGATCAGTGACAGAGGATCGG
CTTCGTCTACAACCAGAGAAGAAAACTGTGTGGCCCAATTTTCGTAGTTGGATTCCATATTTGCTATCATTTGCTCGATGTCCGATTGGTCGAAGCCACCTTCAT
AGTCTTCTGTATTTAGATACCTAACACAATGATCAATATGTTGTTAGAACTTAGGAACTTAAGAGCTGAAAGAATTACATTTCAGAACCGTTTTTAAGCAACTTG
TAGAAATAAACTGTATTAATAAAAAATAATAAAA
Protein sequenceShow/hide protein sequence
MACKSLVAHRFHNFSIVSSLYNASKISRTTYPMLSTFRPAVTRSISVAAGNFSWDDVVHISLPSSFQDDPSDLTGYFEKVKLCNRGSDTQSEFLPFVIDDLIVGY
VHHGFAKHLKQYPNVFIFPRDDCGQFGAYLTLHESLKTPEDRTHAVGDVVKCLGKEVIPGTRNELYPVTSSFGASNFFSLERAAAPYFGIKVYGVHMNGYVEKEG
EKFLWVAKRSQTKPTFPGMLDHLVAGGLPQGIPCGENVMKECEEEAGIPRSISKGAIPVGAISYTDIKGFSYKRDVQFCYDLKLPESFIPENQDGEVEGFMLLLV
TNVANVIRRTQLFKPNCTLVIIDFLFRHGDWFVDVGELLSEPRKYLSASLCKIVHWCCEYLCSAFGAQQVVSEPSCEPXRRGSRVWSSNLEVEKFDGTNNFGVWR
GEVSDLLVMQDLDVSLQEEMPEDMTEAEWTKLNRQACGIIRSCLGKDQKYPFMKVTMAKELWDKLEAKYMKKRVENKLYLKKKLFRFDYKEGISMAEHLDDFNKI
ITDLLNLDVKIDDEDKALLLLNSLPESYEFVVTTLLHGITNIDFEDVSNALINNEMRKKEKETYQSSSSNVLTAHGRTSNWKKNECGEPRLKSRGRSDNWRKLDK
NECAYCRQKGHWKKDCSILKEKGSKSNVVRDDDTDTDDALTISLSVSQTSEWILDSGCSYHMCPNKEMFLDFKEFNGGVVYMGNDSTCKIMGIGSVQIKMFDGVI
RKLKDVRYVPDLKKNLISLGVLDASGYRIILEGGNLKVARGALVAITGTRRGSIYYLNGTTIIGHAAVASSKEQDISKLWHMRLGHAGEKALQTLVNQGVLKGAT
TGRIDFCEHCIFGKQKNVKFGTAIHQTKGILDYVHTDVWGPTKNVSLGGKRWFVTFIDDYSKRVWMYPMRHKNEVLQIFLEWKKMVENQTDKKIKRLRSDNGGEY
TYDPFLKVCRDEGIVRHFTVPGKPQQNGVAERMNQTLIEKVRCILSQAGLSKAFWAEALSYAVHLVNRLPVSGNGGRTPLEVWSGSPVSDYDKLHVFGCPAYYHV
TDSKLDPRAKKAKFMGFSKGVKGYRLWCPETSKIVNSRDVTFDESGMFLQKIENNDEALKQVEKVVFSPDVVAPTEEPIDQVNNNFDVLEQKEQSLEEQSLVNER
VEEPESIAKNRPRRVIRKPARYINPESSGYLELLQSLRSGTCF