; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G10110 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G10110
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionIntegrase
Genome locationChr4:8124313..8126510
RNA-Seq ExpressionCSPI04G10110
SyntenyCSPI04G10110
Gene Ontology termsGO:0006397 - mRNA processing (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003729 - mRNA binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001584 - Integrase, catalytic core
IPR001878 - Zinc finger, CCHC-type
IPR012337 - Ribonuclease H-like superfamily
IPR025724 - GAG-pre-integrase domain
IPR036397 - Ribonuclease H superfamily
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051601.1 integrase [Cucumis melo var. makuwa]6.1e-24865.84Show/hide
Query:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG
        AAWD L++ Y+GE+KVK++RLQ LR+EFD I+MK++ETIEEFFNR+L+IVN LRSNGE + DQR+VEKILRSM R++EHIVVAIEESKDLSTLSINSLMG
Subjt:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG

Query:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG
        SLQSHELRLK FD  P EEAF MQ+S+RG S GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQCFNC +YG
Subjt:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG

Query:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK
        HFQA+CW+ K     +   +  EQ   D+ +LFL  +VQ++  E  WYLDSGCSN+MTG   IF++LDES Q+ VKTGDN +L+VKG+G+ILVKTK G K
Subjt:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK

Query:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV
        R+TNV+YVPGLKHNLLS+GQLL RG  V F+ +IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++ D SWLWH R+GHL+F +LS++C+ HMV
Subjt:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV

Query:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK
        RG+ NI  E  +CE C+L KHHR+SFPT  +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K
Subjt:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK

Query:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV
        +K+LRSDRGGEYI F +F+KE GI +Q T R TPQQ GVA RKNR IME+ARSMLK+K LP++FWGDAV C VY+LNRA TKSV G+TP EAW   K +V
Subjt:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV

Query:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV
        SHLRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNPV
Subjt:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV

KAA0060377.1 integrase [Cucumis melo var. makuwa]1.0e-24765.84Show/hide
Query:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG
        AAWD L++ Y+GE+KVK++RLQ LR+EFD I+MK++ETIEEFFNR+L+IVN LRSNGE + DQR+VEKILRSM R++EHIVVAIEESKDLSTLSINSLMG
Subjt:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG

Query:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG
        SLQSHELRLK FD  P EEAF MQ+S+RG S GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQCFNC +YG
Subjt:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG

Query:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK
        HFQA+CW+ K     +   +  EQ   D+ +LFL  +VQ++  E  WYLDSGCSN+MTG   IF++LDES Q+ VKTGDN +L+VKG+G+ILVKTK G K
Subjt:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK

Query:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV
        R+TNV+YVPGLKHNLLS+GQLL RG  V F+ +IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++ D SWLWH R+GHL+F +LS++C+ HMV
Subjt:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV

Query:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK
        RG+ NI  E  +CE C+L KHHR+SFPT  +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K
Subjt:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK

Query:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV
        +K+LRSDRGGEYI F +F+KE GI +Q T R TPQQ GVA RKNR IME+ARSMLK+K LP++FWGDAV C VY+LNRA TKSV G+TP EAW   K +V
Subjt:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV

Query:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV
        SHLRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNPV
Subjt:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV

KAA0060690.1 integrase [Cucumis melo var. makuwa]6.1e-24865.84Show/hide
Query:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG
        AAWD L++ Y+GE+KVK++RLQ LR+EFD I+MK++ETIEEFFNR+L+IVN LRSNGE + DQR+VEKILRSM R++EHIVVAIEESKDLSTLSINSLMG
Subjt:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG

Query:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG
        SLQSHELRLK FD  P EEAF MQ+S+RG S GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQCFNC +YG
Subjt:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG

Query:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK
        HFQA+CW+ K     +   +  EQ   D+ +LFL  +VQ++  E  WYLDSGCSN+MTG   IF++LDES Q+ VKTGDN +L+VKG+G+ILVKTK G K
Subjt:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK

Query:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV
        R+TNV+YVPGLKHNLLS+GQLL RG  V F+ +IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++ D SWLWH R+GHL+F +LS++C+ HMV
Subjt:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV

Query:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK
        RG+ NI  E  +CE C+L KHHR+SFPT  +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K
Subjt:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK

Query:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV
        +K+LRSDRGGEYI F +F+KE GI +Q T R TPQQ GVA RKNR IME+ARSMLK+K LP++FWGDAV C VY+LNRA TKSV G+TP EAW   K +V
Subjt:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV

Query:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV
        SHLRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNPV
Subjt:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV

TYJ95504.1 integrase [Cucumis melo var. makuwa]1.0e-24765.84Show/hide
Query:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG
        AAWD L++ Y+GE+KVK++RLQ LR+EFD I+MK++ETIEEFFNR+L+IVN LRSNGE + DQR+VEKILRSM R++EHIVVAIEESKDLSTLSINSLMG
Subjt:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG

Query:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG
        SLQSHELRLK FD  P EEAF MQ+S+RG S GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQCFNC +YG
Subjt:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG

Query:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK
        HFQA+CW+ K     +   +  EQ   D+ +LFL  +VQ++  E  WYLDSGCSN+MTG   IF++LDES Q+ VKTGDN +L+VKG+G+ILVKTK G K
Subjt:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK

Query:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV
        R+TNV+YVPGLKHNLLS+GQLL RG  V F+ +IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++ D SWLWH R+GHL+F +LS++C+ HMV
Subjt:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV

Query:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK
        RG+ NI  E  +CE C+L KHHR+SFPT  +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K
Subjt:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK

Query:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV
        +K+LRSDRGGEYI F +F+KE GI +Q T R TPQQ GVA RKNR IME+ARSMLK+K LP++FWGDAV C VY+LNRA TKSV G+TP EAW   K +V
Subjt:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV

Query:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV
        SHLRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNPV
Subjt:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV

TYK30104.1 integrase [Cucumis melo var. makuwa]1.0e-24765.84Show/hide
Query:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG
        AAWD L++ Y+GE+KVK++RLQ LR+EFD I+MK++ETIEEFFNR+L+IVN LRSNGE + DQR+VEKILRSM R++EHIVVAIEESKDLSTLSINSLMG
Subjt:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG

Query:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG
        SLQSHELRLK FD  P EEAF MQ+S+RG S GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQCFNC +YG
Subjt:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG

Query:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK
        HFQA+CW+ K     +   +  EQ   D+ +LFL  +VQ++  E  WYLDSGCSN+MTG   IF++LDES Q+ VKTGDN +L+VKG+G+ILVKTK G K
Subjt:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK

Query:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV
        R+TNV+YVPGLKHNLLS+GQLL RG  V F+ +IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++ D SWLWH R+GHL+F +LS++C+ HMV
Subjt:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV

Query:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK
        RG+ NI  E  +CE C+L KHHR+SFPT  +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K
Subjt:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK

Query:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV
        +K+LRSDRGGEYI F +F+KE GI +Q T R TPQQ GVA RKNR IME+ARSMLK+K LP++FWGDAV C VY+LNRA TKSV G+TP EAW   K +V
Subjt:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV

Query:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV
        SHLRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNPV
Subjt:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV

TrEMBL top hitse value%identityAlignment
A0A5A7UDJ2 Integrase3.0e-24865.84Show/hide
Query:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG
        AAWD L++ Y+GE+KVK++RLQ LR+EFD I+MK++ETIEEFFNR+L+IVN LRSNGE + DQR+VEKILRSM R++EHIVVAIEESKDLSTLSINSLMG
Subjt:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG

Query:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG
        SLQSHELRLK FD  P EEAF MQ+S+RG S GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQCFNC +YG
Subjt:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG

Query:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK
        HFQA+CW+ K     +   +  EQ   D+ +LFL  +VQ++  E  WYLDSGCSN+MTG   IF++LDES Q+ VKTGDN +L+VKG+G+ILVKTK G K
Subjt:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK

Query:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV
        R+TNV+YVPGLKHNLLS+GQLL RG  V F+ +IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++ D SWLWH R+GHL+F +LS++C+ HMV
Subjt:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV

Query:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK
        RG+ NI  E  +CE C+L KHHR+SFPT  +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K
Subjt:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK

Query:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV
        +K+LRSDRGGEYI F +F+KE GI +Q T R TPQQ GVA RKNR IME+ARSMLK+K LP++FWGDAV C VY+LNRA TKSV G+TP EAW   K +V
Subjt:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV

Query:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV
        SHLRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNPV
Subjt:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV

A0A5A7V047 Integrase3.0e-24865.84Show/hide
Query:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG
        AAWD L++ Y+GE+KVK++RLQ LR+EFD I+MK++ETIEEFFNR+L+IVN LRSNGE + DQR+VEKILRSM R++EHIVVAIEESKDLSTLSINSLMG
Subjt:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG

Query:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG
        SLQSHELRLK FD  P EEAF MQ+S+RG S GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQCFNC +YG
Subjt:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG

Query:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK
        HFQA+CW+ K     +   +  EQ   D+ +LFL  +VQ++  E  WYLDSGCSN+MTG   IF++LDES Q+ VKTGDN +L+VKG+G+ILVKTK G K
Subjt:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK

Query:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV
        R+TNV+YVPGLKHNLLS+GQLL RG  V F+ +IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++ D SWLWH R+GHL+F +LS++C+ HMV
Subjt:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV

Query:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK
        RG+ NI  E  +CE C+L KHHR+SFPT  +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K
Subjt:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK

Query:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV
        +K+LRSDRGGEYI F +F+KE GI +Q T R TPQQ GVA RKNR IME+ARSMLK+K LP++FWGDAV C VY+LNRA TKSV G+TP EAW   K +V
Subjt:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV

Query:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV
        SHLRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNPV
Subjt:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV

A0A5D3BQ81 Integrase5.1e-24865.84Show/hide
Query:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG
        AAWD L++ Y+GE+KVK++RLQ LR+EFD I+MK++ETIEEFFNR+L+IVN LRSNGE + DQR+VEKILRSM R++EHIVVAIEESKDLSTLSINSLMG
Subjt:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG

Query:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG
        SLQSHELRLK FD  P EEAF MQ+S+RG S GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQCFNC +YG
Subjt:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG

Query:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK
        HFQA+CW+ K     +   +  EQ   D+ +LFL  +VQ++  E  WYLDSGCSN+MTG   IF++LDES Q+ VKTGDN +L+VKG+G+ILVKTK G K
Subjt:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK

Query:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV
        R+TNV+YVPGLKHNLLS+GQLL RG  V F+ +IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++ D SWLWH R+GHL+F +LS++C+ HMV
Subjt:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV

Query:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK
        RG+ NI  E  +CE C+L KHHR+SFPT  +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K
Subjt:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK

Query:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV
        +K+LRSDRGGEYI F +F+KE GI +Q T R TPQQ GVA RKNR IME+ARSMLK+K LP++FWGDAV C VY+LNRA TKSV G+TP EAW   K +V
Subjt:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV

Query:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV
        SHLRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNPV
Subjt:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV

A0A5D3CLV1 Integrase5.1e-24865.84Show/hide
Query:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG
        AAWD L++ Y+GE+KVK++RLQ LR+EFD I+MK++ETIEEFFNR+L+IVN LRSNGE + DQR+VEKILRSM R++EHIVVAIEESKDLSTLSINSLMG
Subjt:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG

Query:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG
        SLQSHELRLK FD  P EEAF MQ+S+RG S GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQCFNC +YG
Subjt:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG

Query:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK
        HFQA+CW+ K     +   +  EQ   D+ +LFL  +VQ++  E  WYLDSGCSN+MTG   IF++LDES Q+ VKTGDN +L+VKG+G+ILVKTK G K
Subjt:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK

Query:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV
        R+TNV+YVPGLKHNLLS+GQLL RG  V F+ +IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++ D SWLWH R+GHL+F +LS++C+ HMV
Subjt:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV

Query:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK
        RG+ NI  E  +CE C+L KHHR+SFPT  +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K
Subjt:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK

Query:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV
        +K+LRSDRGGEYI F +F+KE GI +Q T R TPQQ GVA RKNR IME+ARSMLK+K LP++FWGDAV C VY+LNRA TKSV G+TP EAW   K +V
Subjt:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV

Query:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV
        SHLRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNPV
Subjt:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV

A0A5D3E3T2 Integrase5.1e-24865.84Show/hide
Query:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG
        AAWD L++ Y+GE+KVK++RLQ LR+EFD I+MK++ETIEEFFNR+L+IVN LRSNGE + DQR+VEKILRSM R++EHIVVAIEESKDLSTLSINSLMG
Subjt:  AAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMG

Query:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG
        SLQSHELRLK FD  P EEAF MQ+S+RG S GRRGG G RG GR N    + + S ++Q  S+  RGR S RGRG    + GGRG+FS IQCFNC +YG
Subjt:  SLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRG----RSGGRGDFSHIQCFNCRRYG

Query:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK
        HFQA+CW+ K     +   +  EQ   D+ +LFL  +VQ++  E  WYLDSGCSN+MTG   IF++LDES Q+ VKTGDN +L+VKG+G+ILVKTK G K
Subjt:  HFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAK

Query:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV
        R+TNV+YVPGLKHNLLS+GQLL RG  V F+ +IC I+ + G LI KV+MT NKMFP+   Y ++ CF +++ D SWLWH R+GHL+F +LS++C+ HMV
Subjt:  RITNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMV

Query:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK
        RG+ NI  E  +CE C+L KHHR+SFPT  +WRASKPLEL+HTDLCGPMRTTT+GGNRYF+TFIDD+SRK WIY LKEKS    CFK+FKA  EN+S  K
Subjt:  RGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLK

Query:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV
        +K+LRSDRGGEYI F +F+KE GI +Q T R TPQQ GVA RKNR IME+ARSMLK+K LP++FWGDAV C VY+LNRA TKSV G+TP EAW   K +V
Subjt:  LKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTV

Query:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV
        SHLRVFG IAYSHI ++ RGKLDDKSEKCI VGYSENSKAYRLYNPV
Subjt:  SHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPV

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.1e-5827.06Show/hide
Query:  LAAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLM
        + A   L+NL    E+  L     LR    ++++    ++   F+    ++++L + G  IE+   +  +L ++   Y+ I+ AIE   +   L++  + 
Subjt:  LAAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLM

Query:  GSLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNR-GRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGHF
          L   E+++K   +  S++                           N + +  + +  N  F NR  + +   +G  +         ++C +C R GH 
Subjt:  GSLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNR-GRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGHF

Query:  QADCWS-KKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEI--WYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGA
        + DC+  K++ +N++K      Q+     + F+   V  +S  +   + LDSG S+++       I+ +  + + V+     K+ V  +G  +  TK G 
Subjt:  QADCWS-KKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEI--WYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGA

Query:  KRITN--------VYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTL
         R+ N        V +      NL+SV +L   G  + F  +   I +KNG ++ K       +  + +   +         +   LWH RFGH+S   L
Subjt:  KRITN--------VYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTL

Query:  SHMCQQHMVRGMS---NIKKEDQLCEACVLGKHHRNSF-PTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFK
          + +++M    S   N++   ++CE C+ GK  R  F   +      +PL +VH+D+CGP+   T     YF+ F+D ++     YL+K KS  F  F+
Subjt:  SHMCQQHMVRGMS---NIKKEDQLCEACVLGKHHRNSF-PTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFK

Query:  TFKAMVENESNLKLKSLRSDRGGEYI--VFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSV-
         F A  E   NLK+  L  D G EY+      F  + GI Y  TV  TPQ  GV+ R  R I E AR+M+   KL   FWG+AV    YL+NR  ++++ 
Subjt:  TFKAMVENESNLKLKSLRSDRGGEYI--VFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSV-

Query:  -QGITPQEAWSELKSTVSHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPVKD----ARDLELEVTQPLTSSSSSHST
            TP E W   K  + HLRVFG   Y HI + K+GK DDKS K IFVGY  N   ++L++ V +    ARD+ ++ T  + S +    T
Subjt:  -QGITPQEAWSELKSTVSHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPVKD----ARDLELEVTQPLTSSSSSHST

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.0e-8331.59Show/hide
Query:  WDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMGSL
        W  L++LY  +    L     L+ +   + M +        N    ++ QL + G  IE++     +L S+   Y+++   I   K  +T+ +  +  +L
Subjt:  WDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMGSL

Query:  QSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGHFQADC
          +E   K     P  +   + +  RGRS  R     GR                         RG+S NR + R           C+NC + GHF+ DC
Subjt:  QSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGHFQADC

Query:  WSKKV-------NSNQSKTTLMHEQSNNDQSLLFLTLNVQE-----SSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVK
         + +          N   T  M +  NND  +LF  +N +E     S  E  W +D+  S++ T   D+F          VK G+    ++ G G+I +K
Subjt:  WSKKV-------NSNQSKTTLMHEQSNNDQSLLFLTLNVQE-----SSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVK

Query:  TKMGAKRI-TNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMF--PIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTL
        T +G   +  +V +VP L+ NL+S   L   G++  F +   + R   G L+    +    ++    +IC  +L   +  ++    LWH R GH+S   L
Subjt:  TKMGAKRI-TNVYYVPGLKHNLLSVGQLLIRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMF--PIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTL

Query:  SHMCQQHMVRGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKA
          + ++ ++        +   C+ C+ GK HR SF T  S R    L+LV++D+CGPM   + GGN+YF+TFIDD SRK W+Y+LK K   F+ F+ F A
Subjt:  SHMCQQHMVRGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKA

Query:  MVENESNLKLKSLRSDRGGEYI--VFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITP
        +VE E+  KLK LRSD GGEY    F ++   +GI+++KTV  TPQ  GVA R NR I+E  RSML+  KLP  FWG+AV    YL+NR+ +  +    P
Subjt:  MVENESNLKLKSLRSDRGGEYI--VFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITP

Query:  QEAWSELKSTVSHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPVKD----ARDLELEVTQPLTSSSSSHSTSD
        +  W+  + + SHL+VFGC A++H+  E+R KLDDKS  CIF+GY +    YRL++PVK     +RD+    ++  T++  S    +
Subjt:  QEAWSELKSTVSHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPVKD----ARDLELEVTQPLTSSSSSHSTSD

P92512 Uncharacterized mitochondrial protein AtMg007101.4e-0837.65Show/hide
Query:  NRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTVSHLRVFGCIAYSHISDEKRGKLDDKSEK
        NR I+E  RSML    LP  F  DA    V+++N+  + ++    P E W +   T S+LR FGC+AY H  +   GKL  +++K
Subjt:  NRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTVSHLRVFGCIAYSHISDEKRGKLDDKSEK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE12.0e-4724.57Show/hide
Query:  WDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMGSL
        W+ L+ +Y       + +L+T   ++     K ++TI+++   ++   +QL   G+ ++    VE++L ++   Y+ ++  I       TL+   +   L
Subjt:  WDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMGSL

Query:  QSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGHFQADC
         +HE ++    S           S+R  +       G R N   N   N  S+       +       S    G+    G   H     C +  HF +  
Subjt:  QSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGHFQADC

Query:  WSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGR-NDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAKRITNV
            VNS Q  +     Q   +     L L    SS    W LDSG ++++T   N++ +    +  + V   D   + +   G+  + TK     + N+
Subjt:  WSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGR-NDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAKRITNV

Query:  YYVPGLKHNLLSVGQLL-IRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVND--TSWLWHCRFGHLSFDTLSHMCQQHMVRG
         YVP +  NL+SV +L    G  V F     +++  N  +      T ++++   I   + V      +   T   WH R GH +   L+ +   + +  
Subjt:  YYVPGLKHNLLSVGQLL-IRGHDVIFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVND--TSWLWHCRFGHLSFDTLSHMCQQHMVRG

Query:  MSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK
        + N   +   C  C++ K ++  F ++ +  +++PLE +++D+       +H   RY++ F+D ++R TW+Y LK+KS   E F TFK ++EN    ++ 
Subjt:  MSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLK

Query:  SLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTVSH
        +  SD GGE++   +++ ++GI +  +   TP+  G++ RK+R I+E   ++L    +P  +W  A    VYL+NR  T  +Q  +P +           
Subjt:  SLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTVSH

Query:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAY
        LRVFGC  Y  +    + KLDDKS +C+F+GYS    AY
Subjt:  LRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAY

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE21.1e-4225.7Show/hide
Query:  RVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNG
        R +   +QL   G+ ++    VE++L ++   Y+ ++  I  +KD +  S+  +   L + E +L   +S           ++R  +  R   +  RG+ 
Subjt:  RVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMGSLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNG

Query:  RSNVVTNTESESRDNQFFSNRGRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGHFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWY
        R+       + +R N +  +    RS NR      GR       C  C   GH    C   +++  QS T      S          L V        W 
Subjt:  RSNVVTNTESESRDNQFFSNRGRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGHFQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWY

Query:  LDSGCSNYMTGR-NDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAKRITNVYYVPGLKHNLLSVGQLLIRGH-DVIFKDNICEIRTKNGDLIT
        LDSG ++++T   N++      +  + V   D   + +   G+  + T   +  +  V YVP +  NL+SV +L       V F     +++  N  +  
Subjt:  LDSGCSNYMTGR-NDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAKRITNVYYVPGLKHNLLSVGQLLIRGH-DVIFKDNICEIRTKNGDLIT

Query:  KVRMTHNKMFPIKICYEKLVCF--ETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTD
            T ++++   I   + V          T   WH R GH S   L+ +   H +  + N   +   C  C + K H+  F +  +  +SKPLE +++D
Subjt:  KVRMTHNKMFPIKICYEKLVCF--ETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPLELVHTD

Query:  L-CGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRK
        +   P+ +  +   RY++ F+D ++R TW+Y LK+KS   + F  FK++VEN    ++ +L SD GGE++V  D+  ++GI +  +   TP+  G++ RK
Subjt:  L-CGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRK

Query:  NRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTVSHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAY--
        +R I+E+  ++L    +P  +W  A +  VYL+NR  T  +Q  +P +           L+VFGC  Y  +    R KL+DKS++C F+GYS    AY  
Subjt:  NRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTVSHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAY--

Query:  ------RLYNPVKDARDLEL-EVTQPLTSSSSSHSTSDEKLLQGRP
              RLY     +R ++  E   P ++++   STS E+     P
Subjt:  ------RLYNPVKDARDLEL-EVTQPLTSSSSSHSTSDEKLLQGRP

Arabidopsis top hitse value%identityAlignment
AT3G20980.1 Gag-Pol-related retrotransposon family protein7.0e-0841.11Show/hide
Query:  EEIWYLDSGCSNYMTGRNDIFISLDESHQNVVK--TGDNKKLE---VKGKGNILVKTKMGAKRITNVYYVPGLKHNLLSVGQLLIRGHDV
        E IW + S  SN+MT     F +LD S +  VK  +GD  +     V+G G++   T  G K I NV YVPG++ N LSV QL   G +V
Subjt:  EEIWYLDSGCSNYMTGRNDIFISLDESHQNVVK--TGDNKKLE---VKGKGNILVKTKMGAKRITNVYYVPGLKHNLLSVGQLLIRGHDV

AT3G21000.1 Gag-Pol-related retrotransposon family protein5.6e-0519.51Show/hide
Query:  WDALQNLYEGEEKVKLVRLQ-----TLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINS
        WD L+   +G E+  + RL+      L  + + ++M D E+   + ++ L I+ +L        D  I + +  +++  ++ +   +EE  D+  ++  S
Subjt:  WDALQNLYEGEEKVKLVRLQ-----TLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINS

Query:  LMGSLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGH
        L+          ++ +S+  E  F +    R +S   +                                                     C  C +  H
Subjt:  LMGSLQSHELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGH

Query:  FQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAKR
         Q DC  +     + K     ++   D  L  +     ++  ++IW +       MT     F +LD + +  V T D   L V+GKG++ ++ K G K+
Subjt:  FQADCWSKKVNSNQSKTTLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAKR

Query:  -ITNVYYVPGLKHNLLSVGQLLIRGHDV
         I NV +VPGL  N+LS G+++ + + +
Subjt:  -ITNVYYVPGLKHNLLSVGQLLIRGHDV

ATMG00710.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein9.8e-1037.65Show/hide
Query:  NRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTVSHLRVFGCIAYSHISDEKRGKLDDKSEK
        NR I+E  RSML    LP  F  DA    V+++N+  + ++    P E W +   T S+LR FGC+AY H  +   GKL  +++K
Subjt:  NRIIMEVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTVSHLRVFGCIAYSHISDEKRGKLDDKSEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCAAGATACAATTAGCAGCATGGGATGCATTGCAAAATTTGTATGAAGGAGAAGAAAAGGTAAAATTGGTCCGATTACAAACACTTCGAGCTGAATTTGATACAAT
TCGAATGAAAGATTCTGAAACTATTGAAGAATTTTTCAACCGTGTTCTCTTAATTGTTAATCAATTGAGATCAAATGGAGAAACAATTGAAGATCAAAGGATTGTTGAGA
AGATTCTTAGAAGCATGACTAGAAGATATGAGCATATTGTTGTAGCAATTGAAGAATCCAAAGATTTGTCAACTCTCTCTATAAATAGCTTAATGGGATCTCTTCAATCT
CATGAGCTCAGATTGAAGATGTTTGATTCTACTCCTTCAGAAGAAGCTTTTCATATGCAGTCCTCCTATAGAGGTCGATCCAATGGAAGAAGAGGTGGACGTGGTGGTAG
AGGCAATGGACGATCCAACGTTGTAACAAATACAGAGTCAGAAAGCAGAGACAATCAATTTTTTTCAAATAGAGGACGAGGAAGAAGTTCAAATAGAGGAAGAGGTAGAA
GTGGTGGTCGTGGAGATTTTTCTCACATACAATGTTTCAATTGTAGACGTTATGGACATTTTCAAGCAGACTGTTGGTCTAAGAAGGTTAATTCTAATCAATCAAAAACC
ACACTAATGCATGAGCAATCAAATAATGATCAAAGTCTTCTCTTCCTCACTCTCAACGTTCAAGAATCAAGCACTGAAGAAATATGGTATCTTGATAGTGGTTGTAGTAA
CTACATGACAGGAAGAAATGATATTTTTATATCTTTAGATGAATCTCATCAAAATGTAGTGAAGACTGGGGACAACAAGAAGCTTGAAGTCAAAGGAAAAGGAAATATTC
TTGTCAAGACAAAAATGGGAGCAAAAAGAATTACTAATGTGTATTATGTTCCAGGTCTCAAACACAATCTTTTAAGTGTTGGGCAACTTCTCATAAGAGGACATGATGTT
ATTTTTAAAGATAACATATGTGAGATTAGAACCAAGAATGGAGATCTCATAACGAAGGTTCGTATGACTCACAACAAAATGTTTCCAATTAAAATATGTTATGAGAAGCT
TGTTTGTTTTGAGACTTTAGTAAATGACACCTCATGGTTATGGCATTGTCGATTTGGGCACCTAAGTTTTGACACTTTGTCTCACATGTGTCAACAACATATGGTGAGAG
GAATGTCAAATATTAAAAAGGAAGATCAACTCTGTGAAGCATGTGTTCTCGGAAAGCATCATCGAAATTCATTTCCGACTGAAGGTTCTTGGAGAGCATCAAAACCACTC
GAGCTTGTTCATACAGACTTATGTGGACCTATGAGAACTACTACACATGGAGGTAACCGTTATTTTCTCACATTTATTGATGACTACAGTCGAAAAACATGGATTTATCT
ACTAAAAGAAAAGAGTGCTACTTTCGAATGTTTCAAGACATTCAAAGCAATGGTGGAAAATGAAAGTAACTTGAAATTGAAATCATTGCGTTCGGATCGTGGAGGAGAAT
ATATTGTTTTTGCAGATTTCTGGAAGGAAAATGGAATCAAGTATCAGAAGACTGTTCGAAGAACTCCTCAACAAAAGGGAGTTGCAGTGAGGAAAAATAGAATAATAATG
GAAGTTGCAAGAAGTATGCTGAAGTCAAAGAAGCTTCCTGATCAATTTTGGGGAGACGCAGTAACTTGTGTTGTTTATCTCCTAAATCGAGCTTCAACGAAAAGTGTGCA
AGGTATTACTCCTCAAGAAGCATGGAGCGAATTGAAATCAACTGTTAGTCACCTAAGAGTGTTTGGGTGCATTGCTTACTCTCACATTTCAGATGAGAAAAGAGGTAAGC
TAGATGATAAATCAGAAAAATGCATTTTTGTTGGGTACAGTGAGAACTCAAAGGCTTATAGACTATACAACCCTGTAAAAGATGCTCGAGACTTGGAGCTTGAAGTAACT
CAACCACTGACTTCATCTTCTTCATCACACTCCACAAGTGATGAGAAACTACTCCAAGGAAGACCAGAAATATTCAAGAAATCTATAATATTTCAAGAAGGATACTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCAAGATACAATTAGCAGCATGGGATGCATTGCAAAATTTGTATGAAGGAGAAGAAAAGGTAAAATTGGTCCGATTACAAACACTTCGAGCTGAATTTGATACAAT
TCGAATGAAAGATTCTGAAACTATTGAAGAATTTTTCAACCGTGTTCTCTTAATTGTTAATCAATTGAGATCAAATGGAGAAACAATTGAAGATCAAAGGATTGTTGAGA
AGATTCTTAGAAGCATGACTAGAAGATATGAGCATATTGTTGTAGCAATTGAAGAATCCAAAGATTTGTCAACTCTCTCTATAAATAGCTTAATGGGATCTCTTCAATCT
CATGAGCTCAGATTGAAGATGTTTGATTCTACTCCTTCAGAAGAAGCTTTTCATATGCAGTCCTCCTATAGAGGTCGATCCAATGGAAGAAGAGGTGGACGTGGTGGTAG
AGGCAATGGACGATCCAACGTTGTAACAAATACAGAGTCAGAAAGCAGAGACAATCAATTTTTTTCAAATAGAGGACGAGGAAGAAGTTCAAATAGAGGAAGAGGTAGAA
GTGGTGGTCGTGGAGATTTTTCTCACATACAATGTTTCAATTGTAGACGTTATGGACATTTTCAAGCAGACTGTTGGTCTAAGAAGGTTAATTCTAATCAATCAAAAACC
ACACTAATGCATGAGCAATCAAATAATGATCAAAGTCTTCTCTTCCTCACTCTCAACGTTCAAGAATCAAGCACTGAAGAAATATGGTATCTTGATAGTGGTTGTAGTAA
CTACATGACAGGAAGAAATGATATTTTTATATCTTTAGATGAATCTCATCAAAATGTAGTGAAGACTGGGGACAACAAGAAGCTTGAAGTCAAAGGAAAAGGAAATATTC
TTGTCAAGACAAAAATGGGAGCAAAAAGAATTACTAATGTGTATTATGTTCCAGGTCTCAAACACAATCTTTTAAGTGTTGGGCAACTTCTCATAAGAGGACATGATGTT
ATTTTTAAAGATAACATATGTGAGATTAGAACCAAGAATGGAGATCTCATAACGAAGGTTCGTATGACTCACAACAAAATGTTTCCAATTAAAATATGTTATGAGAAGCT
TGTTTGTTTTGAGACTTTAGTAAATGACACCTCATGGTTATGGCATTGTCGATTTGGGCACCTAAGTTTTGACACTTTGTCTCACATGTGTCAACAACATATGGTGAGAG
GAATGTCAAATATTAAAAAGGAAGATCAACTCTGTGAAGCATGTGTTCTCGGAAAGCATCATCGAAATTCATTTCCGACTGAAGGTTCTTGGAGAGCATCAAAACCACTC
GAGCTTGTTCATACAGACTTATGTGGACCTATGAGAACTACTACACATGGAGGTAACCGTTATTTTCTCACATTTATTGATGACTACAGTCGAAAAACATGGATTTATCT
ACTAAAAGAAAAGAGTGCTACTTTCGAATGTTTCAAGACATTCAAAGCAATGGTGGAAAATGAAAGTAACTTGAAATTGAAATCATTGCGTTCGGATCGTGGAGGAGAAT
ATATTGTTTTTGCAGATTTCTGGAAGGAAAATGGAATCAAGTATCAGAAGACTGTTCGAAGAACTCCTCAACAAAAGGGAGTTGCAGTGAGGAAAAATAGAATAATAATG
GAAGTTGCAAGAAGTATGCTGAAGTCAAAGAAGCTTCCTGATCAATTTTGGGGAGACGCAGTAACTTGTGTTGTTTATCTCCTAAATCGAGCTTCAACGAAAAGTGTGCA
AGGTATTACTCCTCAAGAAGCATGGAGCGAATTGAAATCAACTGTTAGTCACCTAAGAGTGTTTGGGTGCATTGCTTACTCTCACATTTCAGATGAGAAAAGAGGTAAGC
TAGATGATAAATCAGAAAAATGCATTTTTGTTGGGTACAGTGAGAACTCAAAGGCTTATAGACTATACAACCCTGTAAAAGATGCTCGAGACTTGGAGCTTGAAGTAACT
CAACCACTGACTTCATCTTCTTCATCACACTCCACAAGTGATGAGAAACTACTCCAAGGAAGACCAGAAATATTCAAGAAATCTATAATATTTCAAGAAGGATACTAG
Protein sequenceShow/hide protein sequence
MVKIQLAAWDALQNLYEGEEKVKLVRLQTLRAEFDTIRMKDSETIEEFFNRVLLIVNQLRSNGETIEDQRIVEKILRSMTRRYEHIVVAIEESKDLSTLSINSLMGSLQS
HELRLKMFDSTPSEEAFHMQSSYRGRSNGRRGGRGGRGNGRSNVVTNTESESRDNQFFSNRGRGRSSNRGRGRSGGRGDFSHIQCFNCRRYGHFQADCWSKKVNSNQSKT
TLMHEQSNNDQSLLFLTLNVQESSTEEIWYLDSGCSNYMTGRNDIFISLDESHQNVVKTGDNKKLEVKGKGNILVKTKMGAKRITNVYYVPGLKHNLLSVGQLLIRGHDV
IFKDNICEIRTKNGDLITKVRMTHNKMFPIKICYEKLVCFETLVNDTSWLWHCRFGHLSFDTLSHMCQQHMVRGMSNIKKEDQLCEACVLGKHHRNSFPTEGSWRASKPL
ELVHTDLCGPMRTTTHGGNRYFLTFIDDYSRKTWIYLLKEKSATFECFKTFKAMVENESNLKLKSLRSDRGGEYIVFADFWKENGIKYQKTVRRTPQQKGVAVRKNRIIM
EVARSMLKSKKLPDQFWGDAVTCVVYLLNRASTKSVQGITPQEAWSELKSTVSHLRVFGCIAYSHISDEKRGKLDDKSEKCIFVGYSENSKAYRLYNPVKDARDLELEVT
QPLTSSSSSHSTSDEKLLQGRPEIFKKSIIFQEGY