; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G11960 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G11960
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionIntegrase
Genome locationChr4:10265451..10267951
RNA-Seq ExpressionCSPI04G11960
SyntenyCSPI04G11960
Gene Ontology termsGO:0006396 - RNA processing (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR012337 - Ribonuclease H-like superfamily
IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0038926.1 integrase [Cucumis melo var. makuwa]0.0e+0065.22Show/hide
Query:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K+K
Subjt:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK

Query:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH
        +LRSDRGGEYI F +F KE GI HQ T R T  QNGVAERKNR IME+ARSMLKAK L ++FWGDAV C +Y+LNRA TKSV G+TP E W G KP+VSH
Subjt:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ R                                                       ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS++E +PR+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAIRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP GY + GEE KV +LKKALYGLKQAP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP

Query:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL
        RAWYSRID+FFLK GFRRCPYEHALY KED+ G FLI+ LYVDDL+FTGN   + ++F+ SMK EFEM++MGL+HYFLGIEV Q + EI I Q+KYA DL
Subjt:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL

Query:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL
        LKKF+MENA P +TPM+  LKL K D+ EA D ++YRSLVGSLMYLT TR DI+F+VS+LSRFMT+PKRSHWEAGKRVLRYILGT++ GI+YK+  ++V+
Subjt:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL

Query:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK
         G+ DSDWGGN+DD KSTSGYVF++GSG  SW SKKQ VVALSTTEAEYISL+ A CQALWLR +L ELKC Q+  T++FCDN S+I+LSKNPVFHGRSK
Subjt:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK

Query:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV
        HI IKYHFIR+L+KDGEV ++YCKTQDQVAD+FTKALK D F+K + KLGV +V
Subjt:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV

KAA0048003.1 integrase [Cucumis melo var. makuwa]0.0e+0065.22Show/hide
Query:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K+K
Subjt:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK

Query:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH
        +LRSDRGGEYI F +F KE GI HQ T R T  QNGVAERKNR IME+ARSMLKAK L ++FWGDAV C +Y+LNRA TKSV G+TP E W G KP+VSH
Subjt:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ R                                                       ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS++E +PR+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAIRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP GY + GEE KV +LKKALYGLKQAP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP

Query:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL
        RAWYSRID+FFLK GFRRCPYEHALY KED+ G FLI+ LYVDDL+FTGN   + ++F+ SMK EFEM++MGL+HYFLGIEV Q + EI I Q+KYA DL
Subjt:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL

Query:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL
        LKKF+MENA P +TPM+  LKL K D+ EA D ++YRSLVGSLMYLT TR DI+F+VS+LSRFMT+PKRSHWEAGKRVLRYILGT++ GI+YK+  ++V+
Subjt:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL

Query:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK
         G+ DSDWGGN+DD KSTSGYVF++GSG  SW SKKQ VVALSTTEAEYISL+ A CQALWLR +L ELKC Q+  T++FCDN S+I+LSKNPVFHGRSK
Subjt:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK

Query:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV
        HI IKYHFIR+L+KDGEV ++YCKTQDQVAD+FTKALK D F+K + KLGV +V
Subjt:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV

KAA0057291.1 integrase [Cucumis melo var. makuwa]0.0e+0065.22Show/hide
Query:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K+K
Subjt:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK

Query:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH
        +LRSDRGGEYI F +F KE GI HQ T R T  QNGVAERKNR IME+ARSMLKAK L ++FWGDAV C +Y+LNRA TKSV G+TP E W G KP+VSH
Subjt:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ R                                                       ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS++E +PR+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAIRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP GY + GEE KV +LKKALYGLKQAP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP

Query:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL
        RAWYSRID+FFLK GFRRCPYEHALY KED+ G FLI+ LYVDDL+FTGN   + ++F+ SMK EFEM++MGL+HYFLGIEV Q + EI I Q+KYA DL
Subjt:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL

Query:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL
        LKKF+MENA P +TPM+  LKL K D+ EA D ++YRSLVGSLMYLT TR DI+F+VS+LSRFMT+PKRSHWEAGKRVLRYILGT++ GI+YK+  ++V+
Subjt:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL

Query:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK
         G+ DSDWGGN+DD KSTSGYVF++GSG  SW SKKQ VVALSTTEAEYISL+ A CQALWLR +L ELKC Q+  T++FCDN S+I+LSKNPVFHGRSK
Subjt:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK

Query:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV
        HI IKYHFIR+L+KDGEV ++YCKTQDQVAD+FTKALK D F+K + KLGV +V
Subjt:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV

KAA0060243.1 integrase [Cucumis melo var. makuwa]0.0e+0065.22Show/hide
Query:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K+K
Subjt:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK

Query:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH
        +LRSDRGGEYI F +F KE GI HQ T R T  QNGVAERKNR IME+ARSMLKAK L ++FWGDAV C +Y+LNRA TKSV G+TP E W G KP+VSH
Subjt:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ R                                                       ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS++E +PR+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAIRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP GY + GEE KV +LKKALYGLKQAP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP

Query:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL
        RAWYSRID+FFLK GFRRCPYEHALY KED+ G FLI+ LYVDDL+FTGN   + ++F+ SMK EFEM++MGL+HYFLGIEV Q + EI I Q+KYA DL
Subjt:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL

Query:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL
        LKKF+MENA P +TPM+  LKL K D+ EA D ++YRSLVGSLMYLT TR DI+F+VS+LSRFMT+PKRSHWEAGKRVLRYILGT++ GI+YK+  ++V+
Subjt:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL

Query:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK
         G+ DSDWGGN+DD KSTSGYVF++GSG  SW SKKQ VVALSTTEAEYISL+ A CQALWLR +L ELKC Q+  T++FCDN S+I+LSKNPVFHGRSK
Subjt:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK

Query:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV
        HI IKYHFIR+L+KDGEV ++YCKTQDQVAD+FTKALK D F+K + KLGV +V
Subjt:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV

KAA0060377.1 integrase [Cucumis melo var. makuwa]0.0e+0065.22Show/hide
Query:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K+K
Subjt:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK

Query:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH
        +LRSDRGGEYI F +F KE GI HQ T R T  QNGVAERKNR IME+ARSMLKAK L ++FWGDAV C +Y+LNRA TKSV G+TP E W G KP+VSH
Subjt:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ R                                                       ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS++E +PR+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAIRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP GY + GEE KV +LKKALYGLKQAP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP

Query:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL
        RAWYSRID+FFLK GFRRCPYEHALY KED+ G FLI+ LYVDDL+FTGN   + ++F+ SMK EFEM++MGL+HYFLGIEV Q + EI I Q+KYA DL
Subjt:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL

Query:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL
        LKKF+MENA P +TPM+  LKL K D+ EA D ++YRSLVGSLMYLT TR DI+F+VS+LSRFMT+PKRSHWEAGKRVLRYILGT++ GI+YK+  ++V+
Subjt:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL

Query:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK
         G+ DSDWGGN+DD KSTSGYVF++GSG  SW SKKQ VVALSTTEAEYISL+ A CQALWLR +L ELKC Q+  T++FCDN S+I+LSKNPVFHGRSK
Subjt:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK

Query:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV
        HI IKYHFIR+L+KDGEV ++YCKTQDQVAD+FTKALK D F+K + KLGV +V
Subjt:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV

TrEMBL top hitse value%identityAlignment
A0A5A7TWN2 Integrase0.0e+0065.22Show/hide
Query:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K+K
Subjt:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK

Query:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH
        +LRSDRGGEYI F +F KE GI HQ T R T  QNGVAERKNR IME+ARSMLKAK L ++FWGDAV C +Y+LNRA TKSV G+TP E W G KP+VSH
Subjt:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ R                                                       ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS++E +PR+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAIRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP GY + GEE KV +LKKALYGLKQAP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP

Query:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL
        RAWYSRID+FFLK GFRRCPYEHALY KED+ G FLI+ LYVDDL+FTGN   + ++F+ SMK EFEM++MGL+HYFLGIEV Q + EI I Q+KYA DL
Subjt:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL

Query:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL
        LKKF+MENA P +TPM+  LKL K D+ EA D ++YRSLVGSLMYLT TR DI+F+VS+LSRFMT+PKRSHWEAGKRVLRYILGT++ GI+YK+  ++V+
Subjt:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL

Query:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK
         G+ DSDWGGN+DD KSTSGYVF++GSG  SW SKKQ VVALSTTEAEYISL+ A CQALWLR +L ELKC Q+  T++FCDN S+I+LSKNPVFHGRSK
Subjt:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK

Query:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV
        HI IKYHFIR+L+KDGEV ++YCKTQDQVAD+FTKALK D F+K + KLGV +V
Subjt:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV

A0A5A7UDP7 Integrase0.0e+0065.22Show/hide
Query:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K+K
Subjt:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK

Query:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH
        +LRSDRGGEYI F +F KE GI HQ T R T  QNGVAERKNR IME+ARSMLKAK L ++FWGDAV C +Y+LNRA TKSV G+TP E W G KP+VSH
Subjt:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ R                                                       ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS++E +PR+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAIRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP GY + GEE KV +LKKALYGLKQAP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP

Query:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL
        RAWYSRID+FFLK GFRRCPYEHALY KED+ G FLI+ LYVDDL+FTGN   + ++F+ SMK EFEM++MGL+HYFLGIEV Q + EI I Q+KYA DL
Subjt:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL

Query:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL
        LKKF+MENA P +TPM+  LKL K D+ EA D ++YRSLVGSLMYLT TR DI+F+VS+LSRFMT+PKRSHWEAGKRVLRYILGT++ GI+YK+  ++V+
Subjt:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL

Query:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK
         G+ DSDWGGN+DD KSTSGYVF++GSG  SW SKKQ VVALSTTEAEYISL+ A CQALWLR +L ELKC Q+  T++FCDN S+I+LSKNPVFHGRSK
Subjt:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK

Query:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV
        HI IKYHFIR+L+KDGEV ++YCKTQDQVAD+FTKALK D F+K + KLGV +V
Subjt:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV

A0A5A7UW83 Integrase0.0e+0065.22Show/hide
Query:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K+K
Subjt:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK

Query:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH
        +LRSDRGGEYI F +F KE GI HQ T R T  QNGVAERKNR IME+ARSMLKAK L ++FWGDAV C +Y+LNRA TKSV G+TP E W G KP+VSH
Subjt:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ R                                                       ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS++E +PR+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAIRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP GY + GEE KV +LKKALYGLKQAP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP

Query:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL
        RAWYSRID+FFLK GFRRCPYEHALY KED+ G FLI+ LYVDDL+FTGN   + ++F+ SMK EFEM++MGL+HYFLGIEV Q + EI I Q+KYA DL
Subjt:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL

Query:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL
        LKKF+MENA P +TPM+  LKL K D+ EA D ++YRSLVGSLMYLT TR DI+F+VS+LSRFMT+PKRSHWEAGKRVLRYILGT++ GI+YK+  ++V+
Subjt:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL

Query:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK
         G+ DSDWGGN+DD KSTSGYVF++GSG  SW SKKQ VVALSTTEAEYISL+ A CQALWLR +L ELKC Q+  T++FCDN S+I+LSKNPVFHGRSK
Subjt:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK

Query:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV
        HI IKYHFIR+L+KDGEV ++YCKTQDQVAD+FTKALK D F+K + KLGV +V
Subjt:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV

A0A5A7V0P6 Integrase0.0e+0065.22Show/hide
Query:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K+K
Subjt:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK

Query:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH
        +LRSDRGGEYI F +F KE GI HQ T R T  QNGVAERKNR IME+ARSMLKAK L ++FWGDAV C +Y+LNRA TKSV G+TP E W G KP+VSH
Subjt:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ R                                                       ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS++E +PR+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAIRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP GY + GEE KV +LKKALYGLKQAP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP

Query:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL
        RAWYSRID+FFLK GFRRCPYEHALY KED+ G FLI+ LYVDDL+FTGN   + ++F+ SMK EFEM++MGL+HYFLGIEV Q + EI I Q+KYA DL
Subjt:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL

Query:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL
        LKKF+MENA P +TPM+  LKL K D+ EA D ++YRSLVGSLMYLT TR DI+F+VS+LSRFMT+PKRSHWEAGKRVLRYILGT++ GI+YK+  ++V+
Subjt:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL

Query:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK
         G+ DSDWGGN+DD KSTSGYVF++GSG  SW SKKQ VVALSTTEAEYISL+ A CQALWLR +L ELKC Q+  T++FCDN S+I+LSKNPVFHGRSK
Subjt:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK

Query:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV
        HI IKYHFIR+L+KDGEV ++YCKTQDQVAD+FTKALK D F+K + KLGV +V
Subjt:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV

A0A5D3E3T2 Integrase0.0e+0065.22Show/hide
Query:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK
        + NI  E  +CE C+  KHHR+SFPTG +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K+K
Subjt:  MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK

Query:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH
        +LRSDRGGEYI F +F KE GI HQ T R T  QNGVAERKNR IME+ARSMLKAK L ++FWGDAV C +Y+LNRA TKSV G+TP E W G KP+VSH
Subjt:  SLRSDRGGEYIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSH

Query:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP
        LRVFG IAYSHI ++ R                                                       ++P HV++D  + A++LE    Q + S 
Subjt:  LRVFGCIAYSHISDEKRD------------------------------------------------------QNPLHVDMDGKKDARDLELEVTQPLTSP

Query:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ
        SS  S STS++E +PR+ R+IQEIYNT+ RI D+   +FALFA VDPV F+EAIQDE WK AM+QEIDAIRRNETWEL++LP NK+ALGVKW+YRTKLK 
Subjt:  SS--SHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQ

Query:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP
        +G V+KYKARLVVKGYKQ++GVDYEE+FAPVTR+ET+RL+L+LAA+N WKV+QMDVKSAFLNG+L++EI+V QP GY + GEE KV +LKKALYGLKQAP
Subjt:  NGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAP

Query:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL
        RAWYSRID+FFLK GFRRCPYEHALY KED+ G FLI+ LYVDDL+FTGN   + ++F+ SMK EFEM++MGL+HYFLGIEV Q + EI I Q+KYA DL
Subjt:  RAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDL

Query:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL
        LKKF+MENA P +TPM+  LKL K D+ EA D ++YRSLVGSLMYLT TR DI+F+VS+LSRFMT+PKRSHWEAGKRVLRYILGT++ GI+YK+  ++V+
Subjt:  LKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVL

Query:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK
         G+ DSDWGGN+DD KSTSGYVF++GSG  SW SKKQ VVALSTTEAEYISL+ A CQALWLR +L ELKC Q+  T++FCDN S+I+LSKNPVFHGRSK
Subjt:  VGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSK

Query:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV
        HI IKYHFIR+L+KDGEV ++YCKTQDQVAD+FTKALK D F+K + KLGV +V
Subjt:  HINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.2e-12830.97Show/hide
Query:  MSNIKKEDQLCEACVFGKHHRNSF-PTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKL
        ++N++   ++CE C+ GK  R  F          +PL +VH+D+CGP+   T     YF+ F+D ++     YL+K KS  F  F+ F A  E   NLK+
Subjt:  MSNIKKEDQLCEACVFGKHHRNSF-PTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKL

Query:  KSLRSDRGGEYI--VFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSV--QGITPQETWSGLK
          L  D G EY+      F  + GI +  TV  T   NGV+ER  R I E AR+M+   KL   FWG+AV  A YL+NR  ++++     TP E W   K
Subjt:  KSLRSDRGGEYI--VFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSV--QGITPQETWSGLK

Query:  PTVSHLRVFGCIAYSHI-----------------------------------------------------------------------------------
        P + HLRVFG   Y HI                                                                                   
Subjt:  PTVSHLRVFGCIAYSHI-----------------------------------------------------------------------------------

Query:  -------------SDEKRDQNP----------------------------------LHVDMDGKKDARDLELEVTQPLTSPSSSHSTSDEE-------TT
                      D K  +N                                    +   + KK  RD  L  ++   +P+ S  +   E         
Subjt:  -------------SDEKRDQNP----------------------------------LHVDMDGKKDARDLELEVTQPLTSPSSSHSTSDEE-------TT

Query:  PRKTRNIQEIYNTSRRILDEEHVDF---------------ALFANVDPVYFEEAIQDE--NWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTK
        P K   I+ I   S R+  +  + +                +F +V   + E   +D+  +W++A+N E++A + N TW + K PENK  +  +W++  K
Subjt:  PRKTRNIQEIYNTSRRILDEEHVDF---------------ALFANVDPVYFEEAIQDE--NWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTK

Query:  LKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLK
          + G   +YKARLV +G+ QK+ +DYEE FAPV R+ + R +L+L  + N KVHQMDVK+AFLNG L++EIY+  P G +     + VC+L KA+YGLK
Subjt:  LKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLK

Query:  QAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGN-FLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKY
        QA R W+   +    +  F     +  +Y  +  N N  + + LYVDD++        +  F+  + ++F MT++  + +F+GI ++  +++I + Q  Y
Subjt:  QAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGN-FLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKY

Query:  AKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMY-LTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRN
         K +L KF MEN    STP+   +     +  E  + T  RSL+G LMY +  TR D+  +V++LSR+ +      W+  KRVLRY+ GT+D  + +K+N
Subjt:  AKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMY-LTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRN

Query:  V--DNVLVGYSDSDWGGNIDDFKSTSGYVFNI-GSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKN
        +  +N ++GY DSDW G+  D KST+GY+F +     + W +K+Q+ VA S+TEAEY++L  A  +ALWL+ +L  +    E    ++ DNQ  IS++ N
Subjt:  V--DNVLVGYSDSDWGGNIDDFKSTSGYVFNI-GSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKN

Query:  PVFHGRSKHINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGV
        P  H R+KHI+IKYHF RE +++  + + Y  T++Q+AD+FTK L    F+++++KLG+
Subjt:  PVFHGRSKHINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGV

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-16437.5Show/hide
Query:  CEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLKSLRSDRGGEY
        C+ C+FGK HR SF T  S R    L+LV++D+CGPM   + GGN+YF+TFIDD SRK W+Y+LK K   F+ F+ F A+VE E+  KLK LRSD GGEY
Subjt:  CEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLKSLRSDRGGEY

Query:  I--VFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSHLRVFGCIA
            F ++   +GI+H+KTV  T   NGVAER NR I+E  RSML+  KL   FWG+AV  A YL+NR+ +  +    P+  W+  + + SHL+VFGC A
Subjt:  I--VFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSHLRVFGCIA

Query:  YSHISDEKRDQ-----------------------NPLHVDMDGKKDA--RDLELE-----------------VTQPLTS--PSSSHSTSDE---------
        ++H+  E+R +                       +P+   +   +D   R+ E+                  VT P TS  P+S+ ST+DE         
Subjt:  YSHISDEKRDQ-----------------------NPLHVDMDGKKDA--RDLELE-----------------VTQPLTS--PSSSHSTSDE---------

Query:  ---ETTPRKTRNIQEIYNTSR--------RILDEEHVDFALFANVDPVYFEEAIQDENWKD------------AMNQEIDAIRRNETWELVKLPENKKAL
           E   +    ++E+ + ++        R  +   V+   + + + V   +  + E+ K+            AM +E++++++N T++LV+LP+ K+ L
Subjt:  ---ETTPRKTRNIQEIYNTSR--------RILDEEHVDFALFANVDPVYFEEAIQDENWKD------------AMNQEIDAIRRNETWELVKLPENKKAL

Query:  GVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCR
          KW+++ K   + ++ +YKARLVVKG++QK G+D++E+F+PV ++ ++R +L+LAA  + +V Q+DVK+AFL+G LE+EIY+EQP G+   G+++ VC+
Subjt:  GVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCR

Query:  LKKALYGLKQAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEV--KQGD
        L K+LYGLKQAPR WY + D+F     + +   +  +Y K     NF+I+ LYVDD++  G    +I + +  + K F+M ++G     LG+++  ++  
Subjt:  LKKALYGLKQAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEV--KQGD

Query:  NEIAIFQKKYAKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFD------ATIYRSLVGSLMY-LTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVL
         ++ + Q+KY + +L++F M+NA P STP+   LKLSK       +         Y S VGSLMY +  TR DI  +V ++SRF+ +P + HWEA K +L
Subjt:  NEIAIFQKKYAKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFD------ATIYRSLVGSLMY-LTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVL

Query:  RYILGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIM
        RY+ GT    + +  + D +L GY+D+D  G+ID+ KS++GY+F    GA+SW SK Q  VALSTTEAEYI+ +    + +WL+  L EL   Q K  ++
Subjt:  RYILGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIM

Query:  FCDNQSSISLSKNPVFHGRSKHINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGV
        +CD+QS+I LSKN ++H R+KHI+++YH+IRE++ D  + +    T +  AD+ TK +  + F   KE +G+
Subjt:  FCDNQSSISLSKNPVFHGRSKHINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGV

P25600 Putative transposon Ty5-1 protein YCL074W7.0e-4134.98Show/hide
Query:  MDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNM
        MDV +AFLN  +++ IYV+QPPG+      + V  L   +YGLKQAP  W   I+N   K GF R   EH LY +   +G  + I +YVDDL+    S  
Subjt:  MDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGLKQAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNM

Query:  MIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDN-EIAIFQKKYAKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTT-RL
        + +  ++ + K + M ++G +  FLG+ + Q  N +I +  + Y      + ++       TP+     L +       D T Y+S+VG L++   T R 
Subjt:  MIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDN-EIAIFQKKYAKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTT-RL

Query:  DIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKK-QDVVALSTTEAEYI
        DI + VSLLSRF+  P+  H E+ +RVLRY+  T    + Y+      L  Y D+  G   D   ST GYV  +    V+W+SKK + V+ + +TEAEYI
Subjt:  DIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKK-QDVVALSTTEAEYI

Query:  SLS
        + S
Subjt:  SLS

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.3e-13230.09Show/hide
Query:  CEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLKSLRSDRGGEY
        C  C+  K ++  F +  +  +++PLE +++D+       +H   RY++ F+D ++R TW+Y LK+KS   E F TFK ++EN    ++ +  SD GGE+
Subjt:  CEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLKSLRSDRGGEY

Query:  IVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSHLRVFGCIAY-
        +   ++  ++GI H  +   T   NG++ERK+R I+E   ++L    +   +W  A   A+YL+NR  T  +Q  +P +   G  P    LRVFGC  Y 
Subjt:  IVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSHLRVFGCIAY-

Query:  -------------------------------------------------------------SHISDEKRD-----------------------QNPLHV-
                                                                     S + +++R+                        +P H  
Subjt:  -------------------------------------------------------------SHISDEKRD-----------------------QNPLHV-

Query:  -----------------------------------------------------------DMDGKKDARDLELEVTQPLTSPSSSHSTSDEETTPRKTRN-
                                                                   +        +   ++ Q L++P+ S S+S   TT   + + 
Subjt:  -----------------------------------------------------------DMDGKKDARDLELEVTQPLTSPSSSHSTSDEETTPRKTRN-

Query:  --------------IQEIYNTSRRILDEEH------------------VDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKAL
                      + +I N + +     H                  +  +L A  +P    +A++DE W++AM  EI+A   N TW+LV  P +   +
Subjt:  --------------IQEIYNTSRRILDEEH------------------VDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKKAL

Query:  -GVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVC
         G +WI+  K   +G + +YKARLV KGY Q+ G+DY E F+PV +  ++R++L +A   +W + Q+DV +AFL G L D++Y+ QPPG+      N VC
Subjt:  -GVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVC

Query:  RLKKALYGLKQAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDN
        +L+KALYGLKQAPRAWY  + N+ L  GF     + +L+  +    + + + +YVDD++ TGN   ++    +++ + F + +   LHYFLGIE K+   
Subjt:  RLKKALYGLKQAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDN

Query:  EIAIFQKKYAKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVD
         + + Q++Y  DLL +  M  A P +TPM    KLS +  ++  D T YR +VGSL YL  TR DI ++V+ LS+FM  P   H +A KR+LRY+ GT +
Subjt:  EIAIFQKKYAKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVD

Query:  HGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSI
        HGI  K+     L  YSD+DW G+ DD+ ST+GY+  +G   +SW+SKKQ  V  S+TEAEY S++  S +  W+ ++L EL     +  +++CDN  + 
Subjt:  HGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSI

Query:  SLSKNPVFHGRSKHINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV
         L  NPVFH R KHI I YHFIR  ++ G + + +  T DQ+AD  TK L   +F     K+GV  V
Subjt:  SLSKNPVFHGRSKHINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE22.9e-13531.07Show/hide
Query:  CEACVFGKHHRNSFPTGGSWRASKPLELVHTDL-CGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLKSLRSDRGGE
        C  C   K H+  F +  +  +SKPLE +++D+   P+ +  +   RY++ F+D ++R TW+Y LK+KS   + F  FK++VEN    ++ +L SD GGE
Subjt:  CEACVFGKHHRNSFPTGGSWRASKPLELVHTDL-CGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLKSLRSDRGGE

Query:  YIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSHLRVFGCIAY
        ++V  D+L ++GI H  +   T   NG++ERK+R I+E+  ++L    +   +W  A + A+YL+NR  T  +Q  +P +   G  P    L+VFGC  Y
Subjt:  YIVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSHLRVFGCIAY

Query:  S--------------------------------HI-----------------------------SDEKRDQN-------------PL----------HVD
                                         HI                             S E+R  +             PL          H+D
Subjt:  S--------------------------------HI-----------------------------SDEKRDQN-------------PL----------HVD

Query:  MDGKKDARDLELEVTQ---------PLTSPSSSHSTSDEETTPRKTRNIQEIYNTS--------------------------RRILDEEHVD--------
           +  +    L  TQ          ++SPSSS  T+     P+ T    +  N++                          +  +   H+         
Subjt:  MDGKKDARDLELEVTQ---------PLTSPSSSHSTSDEETTPRKTRNIQEIYNTS--------------------------RRILDEEHVD--------

Query:  -------------------------------------------------------FALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELV-KLP
                                                                +L AN +P    +A++D+ W+ AM  EI+A   N TW+LV   P
Subjt:  -------------------------------------------------------FALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELV-KLP

Query:  ENKKALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGE
         +   +G +WI+  K   +G + +YKARLV KGY Q+ G+DY E F+PV +  ++R++L +A   +W + Q+DV +AFL G L DE+Y+ QPPG+     
Subjt:  ENKKALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGE

Query:  ENKVCRLKKALYGLKQAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEV
         + VCRL+KA+YGLKQAPRAWY  +  + L  GF     + +L+  +    + + + +YVDD++ TGN  ++++   +++ + F +     LHYFLGIE 
Subjt:  ENKVCRLKKALYGLKQAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEV

Query:  KQGDNEIAIFQKKYAKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYI
        K+    + + Q++Y  DLL +  M  A P +TPM    KL+ H  ++  D T YR +VGSL YL  TR D+ ++V+ LS++M  P   HW A KRVLRY+
Subjt:  KQGDNEIAIFQKKYAKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYI

Query:  LGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCD
         GT DHGI  K+     L  YSD+DW G+ DD+ ST+GY+  +G   +SW+SKKQ  V  S+TEAEY S++  S +  W+ ++L EL        +++CD
Subjt:  LGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCD

Query:  NQSSISLSKNPVFHGRSKHINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV
        N  +  L  NPVFH R KHI + YHFIR  ++ G + + +  T DQ+AD  TK L   +F     K+GV +V
Subjt:  NQSSISLSKNPVFHGRSKHINIKYHFIRELIKDGEVYIRYCKTQDQVADVFTKALKTDSFLKMKEKLGVWEV

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 82.3e-10339.25Show/hide
Query:  PSSSHSTSDEETTPRKTRNIQEIYNTSRRILDEEHV-DFALFANVDPVY---------------FEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENK
        P  S  TS   T  RK   +Q+ Y  S   L    +  F  +  V P+Y               + EA +   W  AM+ EI A+    TWE+  LP NK
Subjt:  PSSSHSTSDEETTPRKTRNIQEIYNTSRRILDEEHV-DFALFANVDPVY---------------FEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENK

Query:  KALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGY-AKIGEE-
        K +G KW+Y+ K   +G +++YKARLV KGY Q+ G+D+ E F+PV +L +V+L+LA++A  N+ +HQ+D+ +AFLNG L++EIY++ PPGY A+ G+  
Subjt:  KALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGY-AKIGEE-

Query:  --NKVCRLKKALYGLKQAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIE
          N VC LKK++YGLKQA R W+ +     +  GF +   +H  + K      FL + +YVDD+I   N++  ++E +  +K  F++ ++G L YFLG+E
Subjt:  --NKVCRLKKALYGLKQAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIE

Query:  VKQGDNEIAIFQKKYAKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRY
        + +    I I Q+KYA DLL +  +    P+S PM+  +  S H   +  DA  YR L+G LMYL  TRLDI F+V+ LS+F  +P+ +H +A  ++L Y
Subjt:  VKQGDNEIAIFQKKYAKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRY

Query:  ILGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFC
        I GTV  G+ Y    +  L  +SD+ +    D  +ST+GY   +G+  +SW SKKQ VV+ S+ EAEY +LS A+ + +WL     EL+ P  K T++FC
Subjt:  ILGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFC

Query:  DNQSSISLSKNPVFHGRSKHINIKYHFIRE
        DN ++I ++ N VFH R+KHI    H +RE
Subjt:  DNQSSISLSKNPVFHGRSKHINIKYHFIRE

ATMG00240.1 Gag-Pol-related retrotransposon family protein9.5e-0938.46Show/hide
Query:  MYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGY
        MYLT TR D+ F+V+ LS+F ++ + +  +A  +VL Y+ GTV  G+ Y    D  L  ++DSDW    D  +S +G+
Subjt:  MYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGY

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein1.2e-0838.67Show/hide
Query:  NRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSHLRVFGCIAYSHISDEK
        NR I+E  RSML    L   F  DA   A++++N+  + ++    P E W    PT S+LR FGC+AY H  + K
Subjt:  NRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSHLRVFGCIAYSHISDEK

ATMG00810.1 DNA/RNA polymerases superfamily protein2.8e-3735.4Show/hide
Query:  LIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFDATI
        + + LYVDD++ TG+SN ++      +   F M ++G +HYFLGI++K   + + + Q KYA+ +L    M +  P STP+ L L  S    ++  D + 
Subjt:  LIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDLLKKFKMENAYPASTPMELGLKLSKHDVSEAFDATI

Query:  YRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASK
        +RS+VG+L YLT TR DI ++V+++ + M  P  + ++  KRVLRY+ GT+ HG++  +N    +  + DSDW G     +ST+G+   +G   +SW++K
Subjt:  YRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFKSTSGYVFNIGSGAVSWASK

Query:  KQDVVALSTTEAEYISLSVASCQALW
        +Q  V+ S+TE EY +L++ + +  W
Subjt:  KQDVVALSTTEAEYISLSVASCQALW

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)5.0e-1844.57Show/hide
Query:  AIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALA
        A++D  W  AM +E+DA+ RN+TW LV  P N+  LG KW+++TKL  +G + + KARLV KG+ Q+ G+ + E ++PV R  T+R +L +A
Subjt:  AIQDENWKDAMNQEIDAIRRNETWELVKLPENKKALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAATATTAAAAAGGAAGATCAACTCTGTGAAGCATGTGTTTTCGGAAAGCATCATCGAAATTCATTTCCGACTGGAGGTTCTTGGAGAGCATCAAAACCACTCGA
GCTTGTTCATACAGACTTATGTGGACCTATGAGAACTACTACACATGGAGGTAACCGTTATTTTCTCACATTTATTGATGACTACAGTCGAAAAACATGGATTTATCTAC
TAAAAGAAAAGAGTGCTACTTTCGAATGTTTCAAGACATTCAAAGCAATGGTGGAAAATGAAAGTAACTTGAAATTGAAATCATTGCGTTCGGATCGTGGAGGAGAATAT
ATTGTTTTTGTAGATTTCTTGAAGGAAAATGGAATCAAACATCAGAAGACTGTTCGAAGAACTACTCATCAAAACGGAGTTGCAGAGAGGAAAAATAGAATAATAATGGA
ACTTGCCAGAAGTATGTTGAAGGCAAAGAAGCTTCTTGATCAATTTTGGGGAGACGCAGTAACTTGTGCTATTTATCTTCTAAATAGAGCTTCAACGAAAAGTGTACAAG
GTATTACTCCTCAAGAAACATGGAGCGGATTGAAACCAACCGTTAGTCACCTAAGAGTGTTTGGGTGCATTGCTTACTCTCACATTTCAGATGAGAAAAGAGACCAAAAT
CCATTACATGTTGATATGGATGGAAAAAAAGATGCTCGAGACTTGGAGCTTGAAGTAACTCAACCACTGACTTCACCTTCTTCATCACACTCCACAAGTGATGAAGAAAC
TACTCCAAGGAAGACCAGAAATATTCAAGAGATCTATAATACTTCAAGAAGGATACTAGATGAAGAACATGTTGATTTTGCTTTATTTGCAAATGTTGATCCTGTATACT
TTGAAGAAGCAATTCAAGATGAAAATTGGAAAGATGCAATGAATCAAGAGATTGATGCAATAAGAAGAAACGAGACATGGGAGTTGGTAAAATTACCAGAAAATAAAAAG
GCTCTTGGAGTCAAATGGATCTATAGAACAAAGCTAAAGCAAAACGGAGAAGTGCAAAAATACAAAGCCAGACTCGTTGTAAAAGGTTACAAACAAAAGTTTGGTGTGGA
TTATGAAGAAGTTTTTGCACCGGTAACTCGCTTGGAGACTGTTCGTTTGTTGTTAGCCCTTGCAGCAAAAAATAACTGGAAAGTTCATCAAATGGATGTAAAGTCAGCAT
TCCTAAATGGGTATTTAGAGGATGAAATATATGTTGAGCAACCCCCCGGTTATGCAAAGATTGGAGAAGAAAATAAGGTGTGTCGATTAAAGAAAGCCTTGTATGGGCTA
AAGCAAGCACCAAGGGCTTGGTACAGTCGCATCGACAATTTTTTCTTAAAGGATGGTTTCAGAAGATGTCCATATGAACATGCTCTCTACACCAAAGAAGATGAAAATGG
TAATTTCTTGATAATTTGTCTATATGTTGATGATTTAATATTTACGGGCAACTCAAATATGATGATTGAAGAATTCAGAGAGAGCATGAAAAAGGAATTTGAGATGACTA
ATATGGGTTTACTTCATTATTTTCTTGGTATTGAAGTTAAACAAGGTGATAATGAGATTGCAATTTTCCAAAAGAAGTATGCAAAAGATTTGTTGAAAAAGTTCAAAATG
GAGAATGCTTATCCTGCCAGTACTCCTATGGAATTGGGTTTAAAGTTAAGTAAGCATGATGTTAGTGAAGCTTTTGATGCCACCATTTATAGAAGTTTGGTTGGAAGTTT
AATGTATTTAACTACAACTAGACTTGATATTATGTTCTCGGTCAGTTTATTGAGTAGATTTATGACATCACCAAAGAGAAGTCATTGGGAAGCTGGAAAGAGAGTTCTTA
GATACATTCTTGGAACTGTTGATCATGGAATCCACTATAAAAGGAATGTGGATAATGTTCTTGTTGGCTACAGTGATAGTGATTGGGGAGGAAATATTGATGATTTCAAA
AGTACTTCTGGGTATGTATTTAATATTGGTTCTGGAGCAGTTTCATGGGCATCAAAGAAGCAAGATGTTGTAGCATTGTCCACAACAGAAGCTGAATACATTTCTTTGTC
TGTTGCTAGTTGTCAAGCACTTTGGCTAAGAAATGTACTACATGAATTGAAGTGTCCTCAAGAGAAAGGGACCATCATGTTCTGTGACAATCAATCATCTATTTCACTTT
CGAAGAATCCCGTTTTTCATGGAAGAAGCAAACACATAAACATCAAATATCATTTCATCAGAGAATTGATCAAAGATGGAGAAGTATATATCAGGTATTGCAAGACTCAA
GATCAAGTTGCAGACGTATTCACAAAAGCATTAAAGACAGATTCATTCTTGAAAATGAAAGAGAAGCTCGGAGTTTGGGAAGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCAAATATTAAAAAGGAAGATCAACTCTGTGAAGCATGTGTTTTCGGAAAGCATCATCGAAATTCATTTCCGACTGGAGGTTCTTGGAGAGCATCAAAACCACTCGA
GCTTGTTCATACAGACTTATGTGGACCTATGAGAACTACTACACATGGAGGTAACCGTTATTTTCTCACATTTATTGATGACTACAGTCGAAAAACATGGATTTATCTAC
TAAAAGAAAAGAGTGCTACTTTCGAATGTTTCAAGACATTCAAAGCAATGGTGGAAAATGAAAGTAACTTGAAATTGAAATCATTGCGTTCGGATCGTGGAGGAGAATAT
ATTGTTTTTGTAGATTTCTTGAAGGAAAATGGAATCAAACATCAGAAGACTGTTCGAAGAACTACTCATCAAAACGGAGTTGCAGAGAGGAAAAATAGAATAATAATGGA
ACTTGCCAGAAGTATGTTGAAGGCAAAGAAGCTTCTTGATCAATTTTGGGGAGACGCAGTAACTTGTGCTATTTATCTTCTAAATAGAGCTTCAACGAAAAGTGTACAAG
GTATTACTCCTCAAGAAACATGGAGCGGATTGAAACCAACCGTTAGTCACCTAAGAGTGTTTGGGTGCATTGCTTACTCTCACATTTCAGATGAGAAAAGAGACCAAAAT
CCATTACATGTTGATATGGATGGAAAAAAAGATGCTCGAGACTTGGAGCTTGAAGTAACTCAACCACTGACTTCACCTTCTTCATCACACTCCACAAGTGATGAAGAAAC
TACTCCAAGGAAGACCAGAAATATTCAAGAGATCTATAATACTTCAAGAAGGATACTAGATGAAGAACATGTTGATTTTGCTTTATTTGCAAATGTTGATCCTGTATACT
TTGAAGAAGCAATTCAAGATGAAAATTGGAAAGATGCAATGAATCAAGAGATTGATGCAATAAGAAGAAACGAGACATGGGAGTTGGTAAAATTACCAGAAAATAAAAAG
GCTCTTGGAGTCAAATGGATCTATAGAACAAAGCTAAAGCAAAACGGAGAAGTGCAAAAATACAAAGCCAGACTCGTTGTAAAAGGTTACAAACAAAAGTTTGGTGTGGA
TTATGAAGAAGTTTTTGCACCGGTAACTCGCTTGGAGACTGTTCGTTTGTTGTTAGCCCTTGCAGCAAAAAATAACTGGAAAGTTCATCAAATGGATGTAAAGTCAGCAT
TCCTAAATGGGTATTTAGAGGATGAAATATATGTTGAGCAACCCCCCGGTTATGCAAAGATTGGAGAAGAAAATAAGGTGTGTCGATTAAAGAAAGCCTTGTATGGGCTA
AAGCAAGCACCAAGGGCTTGGTACAGTCGCATCGACAATTTTTTCTTAAAGGATGGTTTCAGAAGATGTCCATATGAACATGCTCTCTACACCAAAGAAGATGAAAATGG
TAATTTCTTGATAATTTGTCTATATGTTGATGATTTAATATTTACGGGCAACTCAAATATGATGATTGAAGAATTCAGAGAGAGCATGAAAAAGGAATTTGAGATGACTA
ATATGGGTTTACTTCATTATTTTCTTGGTATTGAAGTTAAACAAGGTGATAATGAGATTGCAATTTTCCAAAAGAAGTATGCAAAAGATTTGTTGAAAAAGTTCAAAATG
GAGAATGCTTATCCTGCCAGTACTCCTATGGAATTGGGTTTAAAGTTAAGTAAGCATGATGTTAGTGAAGCTTTTGATGCCACCATTTATAGAAGTTTGGTTGGAAGTTT
AATGTATTTAACTACAACTAGACTTGATATTATGTTCTCGGTCAGTTTATTGAGTAGATTTATGACATCACCAAAGAGAAGTCATTGGGAAGCTGGAAAGAGAGTTCTTA
GATACATTCTTGGAACTGTTGATCATGGAATCCACTATAAAAGGAATGTGGATAATGTTCTTGTTGGCTACAGTGATAGTGATTGGGGAGGAAATATTGATGATTTCAAA
AGTACTTCTGGGTATGTATTTAATATTGGTTCTGGAGCAGTTTCATGGGCATCAAAGAAGCAAGATGTTGTAGCATTGTCCACAACAGAAGCTGAATACATTTCTTTGTC
TGTTGCTAGTTGTCAAGCACTTTGGCTAAGAAATGTACTACATGAATTGAAGTGTCCTCAAGAGAAAGGGACCATCATGTTCTGTGACAATCAATCATCTATTTCACTTT
CGAAGAATCCCGTTTTTCATGGAAGAAGCAAACACATAAACATCAAATATCATTTCATCAGAGAATTGATCAAAGATGGAGAAGTATATATCAGGTATTGCAAGACTCAA
GATCAAGTTGCAGACGTATTCACAAAAGCATTAAAGACAGATTCATTCTTGAAAATGAAAGAGAAGCTCGGAGTTTGGGAAGTCTAG
Protein sequenceShow/hide protein sequence
MSNIKKEDQLCEACVFGKHHRNSFPTGGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLKSLRSDRGGEY
IVFVDFLKENGIKHQKTVRRTTHQNGVAERKNRIIMELARSMLKAKKLLDQFWGDAVTCAIYLLNRASTKSVQGITPQETWSGLKPTVSHLRVFGCIAYSHISDEKRDQN
PLHVDMDGKKDARDLELEVTQPLTSPSSSHSTSDEETTPRKTRNIQEIYNTSRRILDEEHVDFALFANVDPVYFEEAIQDENWKDAMNQEIDAIRRNETWELVKLPENKK
ALGVKWIYRTKLKQNGEVQKYKARLVVKGYKQKFGVDYEEVFAPVTRLETVRLLLALAAKNNWKVHQMDVKSAFLNGYLEDEIYVEQPPGYAKIGEENKVCRLKKALYGL
KQAPRAWYSRIDNFFLKDGFRRCPYEHALYTKEDENGNFLIICLYVDDLIFTGNSNMMIEEFRESMKKEFEMTNMGLLHYFLGIEVKQGDNEIAIFQKKYAKDLLKKFKM
ENAYPASTPMELGLKLSKHDVSEAFDATIYRSLVGSLMYLTTTRLDIMFSVSLLSRFMTSPKRSHWEAGKRVLRYILGTVDHGIHYKRNVDNVLVGYSDSDWGGNIDDFK
STSGYVFNIGSGAVSWASKKQDVVALSTTEAEYISLSVASCQALWLRNVLHELKCPQEKGTIMFCDNQSSISLSKNPVFHGRSKHINIKYHFIRELIKDGEVYIRYCKTQ
DQVADVFTKALKTDSFLKMKEKLGVWEV