CuGenDBv2

Sed0009568 (gene) of Chayote v1 genome

Gene ID: Sed0009568
Organism: Sechium edule (Chayote v1)
Description: HAT transposon superfamily
Genome location: LG03:8139884..8145294
RNA-Seq Expression: Sed0009568
Synteny: Sed0009568
Gene Ontology terms:
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
IPR003656 - Zinc finger, BED-type
IPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG7015651.1 hypothetical protein SDJN02_23288, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 89.9
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVP DVRDHIQGILSTPKKQRAPKKPK+DMETATNG QHSSS+SGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG

Query:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPSTFP  SPS+QP IDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAE+GVGY+APSYEKLKSTLL KVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS
        KKYRDEWKE GCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLM KYTSLFWS
Subjt:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWILNTMRKFTSGKELIRPR++RFVTNFLSLRS+V LED L+ MF+HSEW SSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA

Query:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        I+S L+LDRFWK+AREAVNISEPL+RILR+VDGDMPAMGY+YEGIERAKVE+K Y NGIEDKYMPIW TIDRRWNLQLHTTLH AAAFLNPSIFYNPNFK
Subjt:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK
        IDLRIRNGFQEAMLKMATTD+DKMEITREHP YVN QGALGTDFA+LGRTIN PGDWWSGYGYEIPTLQR AIRIL+QPCS+YGC +WNWSTF TLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK

Query:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP
        RS  EQEKLNDLVFVQCNLWLQH+ WTRDGKYKPVVFDDIDVSL+WPTE ESSA +LDDSWL+N+ LEC GSP
Subjt:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP

XP_022923437.1 uncharacterized protein LOC111431132 [Cucurbita moschata] | e value: 0.0e+00 | % identity: 90.19
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVP DVRDHIQGILSTPKKQRAPKKPK+DMETATNG QHSSS+SGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG

Query:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPSTFP  SPS+QP IDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAE+GVGY+APSYEKLKSTLL KVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS
        KKYRDEWKE GCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLM KYTSLFWS
Subjt:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWILNTMRKFTSGKELIRPR++RFVTNFLSLRS+V LED L+ MF+HSEW SSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA

Query:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        I+S L+LDRFWK+AREAVNISEPL+RILR+VDGDMPAMGY+YEGIERAKVE+K Y NGIEDKYMPIW TIDRRWNLQLHTTLH AAAFLNPSIFYNPNFK
Subjt:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK
        IDLRIRNGFQEAMLKMATTD+DKMEITREHP YVN QGALGTDFA+LGRTIN PGDWWSGYGYEIPTLQRAAIRIL+QPCS+YGC +WNWSTF TLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK

Query:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP
        RS  EQEKLNDLVFVQCNLWLQH+ WTRDGKYKPVVFDDIDVSL+WPTE ESSA +LDDSWL+N+PLEC GSP
Subjt:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP

XP_023007736.1 uncharacterized protein LOC111500259 [Cucurbita maxima] | e value: 0.0e+00 | % identity: 90.04
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVP DVRD IQGILSTPKKQRAPKKPK+DMETATNG QHSSS+SGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG

Query:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPST PC SPS+QPLIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAE+GVGY+APSY+KLKSTLLDKVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS
        KKYRDEWKE GCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLM KYTSLFWS
Subjt:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWIL+TMRKFTSGKELIRPR++RFVTNFLSLRS+V LED L+ MF+HSEW SSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA

Query:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        I+S L+LDRFWK+AREAVNISEPL+RILR+VDGDMPAMGY+YEGIERAKVE+K Y NGIEDKYMPIW TIDRRWNLQLHTTLH AAAFLNPSIFYNPNFK
Subjt:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK
        IDLRIRNGFQEAMLKMATTD+DKMEITREHP YVN QGALGTDFA+LGRTIN PGDWWSGYGYEIPTLQR AIRIL+QPCS+YGC +WNWSTF TLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK

Query:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP
        RSR EQEKLNDLVFVQCNLWLQH+ WTRDGKYKPVVFDDIDVSL+WPTE ESSA +LDDSWL+N+PLEC GSP
Subjt:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP

XP_038876874.1 uncharacterized protein LOC120069237 isoform X1 [Benincasa hispida] | e value: 0.0e+00 | % identity: 91.98
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVPTDVRDHIQGILSTPKKQ+APKKPK+DMETATNG QHSSS+SGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG

Query:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPSTFPCPSPS+QP IDD QKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAE+GVGYKAPSYEKLKSTLLDKVKGDI NSY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS
        KKY DEWKE GCTILCDSWSDGRTKSFLVISITCSKG LFLKSVD+SG EDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLM KYTSLFWS
Subjt:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHA ILNTMRKFTSGKELIRPR++RFVTNFLSLRS+V+ ED+L+ MF+HSEWLSSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA

Query:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        IISLL+LDRFWK+A+EAVNI+EPL+RILRIVDGDMPAMGY++EGIERAKVEIKTY NGIEDKY+PIW TIDRRWNLQLHTTLH AAAFLNPS+FYNPNFK
Subjt:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK
        IDLRIRNGFQEAMLKMATTD+DKMEITREHP YVNGQGALGTDFA+LGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCS+YGC +WNWSTF TLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK

Query:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP
        RSRAEQEKLNDLVFVQCNLWLQH+C TRDGKYKPVVFDDIDVSL+WPTEFE+SA +LDDSWL+N+PLECRGSP
Subjt:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP

XP_038876877.1 uncharacterized protein LOC120069237 isoform X2 [Benincasa hispida] | e value: 0.0e+00 | % identity: 91.98
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVPTDVRDHIQGILSTPKKQ+APKKPK+DMETATNG QHSSS+SGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG

Query:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPSTFPCPSPS+QP IDD QKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAE+GVGYKAPSYEKLKSTLLDKVKGDI NSY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS
        KKY DEWKE GCTILCDSWSDGRTKSFLVISITCSKG LFLKSVD+SG EDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLM KYTSLFWS
Subjt:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHA ILNTMRKFTSGKELIRPR++RFVTNFLSLRS+V+ ED+L+ MF+HSEWLSSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA

Query:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        IISLL+LDRFWK+A+EAVNI+EPL+RILRIVDGDMPAMGY++EGIERAKVEIKTY NGIEDKY+PIW TIDRRWNLQLHTTLH AAAFLNPS+FYNPNFK
Subjt:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK
        IDLRIRNGFQEAMLKMATTD+DKMEITREHP YVNGQGALGTDFA+LGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCS+YGC +WNWSTF TLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK

Query:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP
        RSRAEQEKLNDLVFVQCNLWLQH+C TRDGKYKPVVFDDIDVSL+WPTEFE+SA +LDDSWL+N+PLECRGSP
Subjt:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP
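
Note on reading the alignments: in standard BLASTP pairwise output, the unlabeled middle line marks each aligned column with the residue letter for an identity, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below is a minimal illustration of how identity could be tallied from such a midline. The function name `midline_stats` is hypothetical and not part of CuGenDB or BLAST, and the sample strings are the first ten columns of the AT1G79740.1 alignment on this page. The percentages reported per hit cover the whole alignment, so a single block will not reproduce them.

```python
def midline_stats(midline: str, aligned_length: int) -> dict:
    """Tally identities and positives from a BLASTP-style midline.

    midline        -- the unlabeled line between Query and Subjt
    aligned_length -- number of alignment columns in the block
    """
    # Trailing spaces are often stripped from dumps, so pad back to full width.
    padded = midline.ljust(aligned_length)[:aligned_length]
    identities = sum(1 for ch in padded if ch.isalpha())  # letter = identical residue
    conservative = padded.count("+")                      # '+' = conservative substitution
    return {
        "identities": identities,
        "positives": identities + conservative,
        "length": aligned_length,
        "percent_identity": round(100.0 * identities / aligned_length, 2),
    }


# First ten columns of the AT1G79740.1 alignment shown on this page.
query = "MVRGRDACWE"
midline = "MVR +D CWE"
stats = midline_stats(midline, len(query))
print(stats)
```

Padding the midline matters because a run of mismatches at the end of a block appears as trailing spaces, which text extraction tends to drop.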

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0L2E4 BED-type domain-containing protein | e value: 0.0e+00 | % identity: 89.3
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVPTDVRDHIQGILSTPKKQ+APKKPK+DMETATNG QHSSS+SGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG

Query:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPST+PC SPS+QP IDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAE+G GYKAPSYEKLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS
        KK+RDEWKE GCTILCDSWSDG+TKSFLVIS+TCSKGTLFLKSVD+SG EDDATYLSDLLETI+LEVGVENVVQ+ITDATASYVYAGRLLM KYTSLFWS
Subjt:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLEDISKIEWV  VLEEAKIITRYIYSHA ILNTMRKFT GKELIRPR++RFVTNFLSLRS+V+LED+L+ MF+HSEWLSSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA

Query:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        IISLL+LDRFWK+A EA+NI EPL+RILRIVDGDMPAMGY++EGIERAKVEIKTY NG EDKYMPIW TIDRRWNLQLHTTLH AAAFLNPS+FYNPNFK
Subjt:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGCK-WNWSTFLTLHSKK
        IDLRIRNGFQEAMLKMATTD+DKMEITREHP YVNGQGALGTDFA+LGRTINAPGDWWSGYGYEIPTLQRAA+RIL+QPCS+YGC  WNWSTF TLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGCK-WNWSTFLTLHSKK

Query:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP
         SRAEQEKL DLVFVQCNLWLQHVC TRD KYKPVVFDD+DVSL+WP+E E SA +LDDSWL+N+PLE RGSP
Subjt:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP

A0A1S3BLP8 uncharacterized protein LOC103490927 | e value: 0.0e+00 | % identity: 89.61
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVPTDVRDHIQGILSTPKKQ+APKKPK+DMETATNG QHSSS+SGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG

Query:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPSTFPC SPS+QP IDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAE+G GYKAPSYEKLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS
        KK RDEWKE GCTILCDSWSDGRTKSFLVIS+TCSKGTLFLKSVD SG EDDATYLSDLLETIVLEVGVENVVQ+ITDATASYVYAGRLLM KYTSLFWS
Subjt:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLEDISKIEWV TVLEEAKIITRYIYSHA ILNTMRKFT GKELIRPR++RFVTNFLSLRS+V+LE++L+ MF+HSEWLSSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA

Query:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        IISLL+LDRFWK+A EA+NI EPL+RILRIVDGDMPAMGY++EGIERAKVEIKTY NG EDKYMPIW TIDRRWNLQLHTTLH AAAFLNPS+FYNPNFK
Subjt:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK
        IDLRIRNGFQEAMLKMATTD+DKMEITREHP YVNGQGALGTDFA+LGRTIN+PGDWWSGYGYEIPTLQRAA+RIL+QPCS+YGC +WNWSTF TLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK

Query:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLE-NMPLECRGSP
        RSRAEQEKL DLVFVQCNLWLQH+C TRD KYKP+VFDDIDVSL+WP+E E SA +LDDSWL+ N+PLECRGSP
Subjt:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLE-NMPLECRGSP

A0A5D3D7G5 HAT transposon superfamily | e value: 0.0e+00 | % identity: 89.61
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVPTDVRDHIQGILSTPKKQ+APKKPK+DMETATNG QHSSS+SGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG

Query:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPSTFPC SPS+QP IDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAE+G GYKAPSYEKLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS
        KK RDEWKE GCTILCDSWSDGRTKSFLVIS+TCSKGTLFLKSVD SG EDDATYLSDLLETIVLEVGVENVVQ+ITDATASYVYAGRLLM KYTSLFWS
Subjt:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLEDISKIEWV TVLEEAKIITRYIYSHA ILNTMRKFT GKELIRPR++RFVTNFLSLRS+V+LE++L+ MF+HSEWLSSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA

Query:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        IISLL+LDRFWK+A EA+NI EPL+RILRIVDGDMPAMGY++EGIERAKVEIKTY NG EDKYMPIW TIDRRWNLQLHTTLH AAAFLNPS+FYNPNFK
Subjt:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK
        IDLRIRNGFQEAMLKMATTD+DKMEITREHP YVNGQGALGTDFA+LGRTIN+PGDWWSGYGYEIPTLQRAA+RIL+QPCS+YGC +WNWSTF TLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK

Query:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLE-NMPLECRGSP
        RSRAEQEKL DLVFVQCNLWLQH+C TRD KYKP+VFDDIDVSL+WP+E E SA +LDDSWL+ N+PLECRGSP
Subjt:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLE-NMPLECRGSP

A0A6J1E9N1 uncharacterized protein LOC111431132 | e value: 0.0e+00 | % identity: 90.19
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVP DVRDHIQGILSTPKKQRAPKKPK+DMETATNG QHSSS+SGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG

Query:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPSTFP  SPS+QP IDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAE+GVGY+APSYEKLKSTLL KVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS
        KKYRDEWKE GCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLM KYTSLFWS
Subjt:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWILNTMRKFTSGKELIRPR++RFVTNFLSLRS+V LED L+ MF+HSEW SSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA

Query:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        I+S L+LDRFWK+AREAVNISEPL+RILR+VDGDMPAMGY+YEGIERAKVE+K Y NGIEDKYMPIW TIDRRWNLQLHTTLH AAAFLNPSIFYNPNFK
Subjt:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK
        IDLRIRNGFQEAMLKMATTD+DKMEITREHP YVN QGALGTDFA+LGRTIN PGDWWSGYGYEIPTLQRAAIRIL+QPCS+YGC +WNWSTF TLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK

Query:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP
        RS  EQEKLNDLVFVQCNLWLQH+ WTRDGKYKPVVFDDIDVSL+WPTE ESSA +LDDSWL+N+PLEC GSP
Subjt:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP

A0A6J1KZI0 uncharacterized protein LOC111500259 | e value: 0.0e+00 | % identity: 90.04
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVP DVRD IQGILSTPKKQRAPKKPK+DMETATNG QHSSS+SGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNG-QHSSSSSGG

Query:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPST PC SPS+QPLIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAE+GVGY+APSY+KLKSTLLDKVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS
        KKYRDEWKE GCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLM KYTSLFWS
Subjt:  KKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWIL+TMRKFTSGKELIRPR++RFVTNFLSLRS+V LED L+ MF+HSEW SSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQA

Query:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        I+S L+LDRFWK+AREAVNISEPL+RILR+VDGDMPAMGY+YEGIERAKVE+K Y NGIEDKYMPIW TIDRRWNLQLHTTLH AAAFLNPSIFYNPNFK
Subjt:  IISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK
        IDLRIRNGFQEAMLKMATTD+DKMEITREHP YVN QGALGTDFA+LGRTIN PGDWWSGYGYEIPTLQR AIRIL+QPCS+YGC +WNWSTF TLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGC-KWNWSTFLTLHSKK

Query:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP
        RSR EQEKLNDLVFVQCNLWLQH+ WTRDGKYKPVVFDDIDVSL+WPTE ESSA +LDDSWL+N+PLEC GSP
Subjt:  RSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLENMPLECRGSP

SwissProt top hits (columns: e value, % identity, alignment)
No hits found

Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G79740.1 hAT transposon superfamily | e value: 2.3e-117 | % identity: 33.28
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNGQHSSSSSGGI
        MVR +D CWE+   +D    KV+C +C R  +GG+ R+K HL+++ +K + PC++V  DV D ++ ILS         K K                   
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNGQHSSSSSGGI

Query:  HHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSYK
                    P + P  +P+S+ +   +    +D  ++ +++FFF N I F+ A+S  Y  M++A+A+ G G+ APS    K+  LD+VK DI    K
Subjt:  HHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSYK

Query:  KYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWSP
            EW   GCTI+ ++W+D ++++ +  S++      F KSVD S    ++  L+DL ++++ ++G E++VQ+I D +  Y      L+  Y ++F SP
Subjt:  KYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWSP

Query:  CVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQAI
        C S C+N +LE+ SK++WV   + +A++I++++Y+++ +L+ +RK T G+++IR  ++R V+NFLSL+SM+  +  L+ MF+  E+ ++  + +P + + 
Subjt:  CVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQAI

Query:  ISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFKI
        +++L  + FW+   E+V ISEP++++LR V    PA+G +YE + +AK  I+TY    E+K+      +D  W   LH+ LHAAAAFLNPSI YNP  K 
Subjt:  ISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYVYEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFKI

Query:  DLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGCKWNWSTFLTLHSKKRS
           ++  F + + K+  T   + +IT +   +   +G  G + A+  R   +PG WW  +G   P LQR AIRIL+Q CS Y  +  WSTF  +H ++R+
Subjt:  DLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGCKWNWSTFLTLHSKKRS

Query:  RAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLE
        + ++E LN L +V  NL L  +      +  P+  +DID+  +W  E E+ +P     WL+
Subjt:  RAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWLE

AT3G17450.1 hAT dimerisation domain-containing protein | e value: 7.3e-95 | % identity: 30.29
Query:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETAT-----------------
        D  WEH +  D  ++KV+CNYC +  SGG+ R K HLA+I   ++ PC   P +V   I+  +   +  +   +P  +M   T                 
Subjt:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETAT-----------------

Query:  -------------NGQHS-----SSSSGGIHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQK---KDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNA
                     NG+ S     S  S  +   S  + +      F  PS S Q  +  +   +   + +    ++ F  H  +P  AA SLY+Q+M+  
Subjt:  -------------NGQHS-----SSSSGGIHHGSSGQNESNCPSTFPCPSPSSQPLIDDAQKQK---KDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNA

Query:  IAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVG
        I  +G G+  PS +     LL +    I++  ++YR  W   GC+I+ D+W++   K  +   ++C +G  F  S+D +   +DA  L   L+ +V ++G
Subjt:  IAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVG

Query:  VENVVQVITDATASYVYAGRLLMNKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMR-KFTSGKELIRPRMSRFVTNFLS
         ENVVQVIT  TA +  AG+LL  K  +L+W+PC  +C   +LED SK+E+V   LE+A+ ITR+IY+  W+LN M+ +FT G +L+RP + R  + F +
Subjt:  VENVVQVITDATASYVYAGRLLMNKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMR-KFTSGKELIRPRMSRFVTNFLS

Query:  LRSMVVLEDSLRLMFSHSEW-LSSIYSRRPDTQAIISLLFLDRFWKNAREAVNISEPLVRILRIVD--GDMPAMGYVYEGIERAKVEIKTYCNGIEDKYM
        L+S++  + SLR +F    W LS   ++  + + +  ++    FWK  +  +   +P+++++ +++  GD  +M Y Y  +  AK+ IK+  +    KY 
Subjt:  LRSMVVLEDSLRLMFSHSEW-LSSIYSRRPDTQAIISLLFLDRFWKNAREAVNISEPLVRILRIVD--GDMPAMGYVYEGIERAKVEIKTYCNGIEDKYM

Query:  PIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYE
        P W  I+ RWN   H  L+ AA F NP+  Y P+F     +  G  E ++++   +  ++    + P Y   +   GTD A+  RT   P  WW  +G  
Subjt:  PIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGYGYE

Query:  IPTLQRAAIRILNQPCSTYGCKWNWSTFLTLHSKKRSRAEQEKLNDLVFVQCNLWLQ
           LQR A+RIL+  CS+ GC+  WS +  ++S+ +S+  ++   DL +V  NL L+
Subjt:  IPTLQRAAIRILNQPCSTYGCKWNWSTFLTLHSKKRSRAEQEKLNDLVFVQCNLWLQ

AT3G22220.1 hAT transposon superfamily | e value: 4.1e-90 | % identity: 30.86
Query:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGIL-STPKKQRAPKK-----------PKMDMETA---
        +D+ W+HC V     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP +VR  +Q  +  T ++QR  +K           P  ++ET    
Subjt:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGIL-STPKKQRAPKK-----------PKMDMETA---

Query:  ---TNGQHSSSSSGGIHHGSSGQN-----------------------ESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYY
            N    S SS  +   S+G+                        + +  +  P    S + ++    K+++      +  F F     F AA S+  
Subjt:  ---TNGQHSSSSSGGIHHGSSGQN-----------------------ESNCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYY

Query:  QEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLET
        Q  ++AI   G G   P++E L+  +L     +++    + +  WK  GC++L    +       L   + C +  +FLKSVD S   D    L +LL+ 
Subjt:  QEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLET

Query:  IVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFV
        +V E+G  NVVQVIT     Y  AG+ LM+ Y SL+W PC ++C+++MLE+  K++W+  ++E+A+ +TR IY+H+ +LN MRKFT G ++++P  +   
Subjt:  IVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRMSRFV

Query:  TNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQAIISLLFLDRFWKNAREAVNISEPLVRILRIVDGD-MPAMGYVYEGIERAKVEIKTYCNGIED
        TNF ++  +  L+  L+ M + SEW    YS+     A+   +  + FWK    A +I+ P++R+LRIV  +  PAMGYVY  + RAK  IKT      +
Subjt:  TNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQAIISLLFLDRFWKNAREAVNISEPLVRILRIVDGD-MPAMGYVYEGIERAKVEIKTYCNGIED

Query:  KYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGY
        +Y+  W  IDR W   L   L+AA  +LNP  FY+ + ++   I     + + K+      +  + ++   Y N  G  G + A+  R    P +WWS Y
Subjt:  KYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDWWSGY

Query:  GYEIPTLQRAAIRILNQPC-STYGCKWNWSTFLTLHSKKRSRAEQEKLNDLVFVQCNLWLQHVC--WTRDGKYKPVVFDDIDVSLDW
        G     L R AIRIL+Q C S+ G   N ++   ++  K S  E+++LNDLVFVQ N+ L+ +    + D    P+   +++V  DW
Subjt:  GYEIPTLQRAAIRILNQPC-STYGCKWNWSTFLTLHSKKRSRAEQEKLNDLVFVQCNLWLQHVC--WTRDGKYKPVVFDDIDVSLDW

AT4G15020.1 hAT transposon superfamily | e value: 7.3e-95 | % identity: 32.03
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGIL-STPKKQRAPKKPKMD------------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +  T ++QR   K   +                  
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGIL-STPKKQRAPKKPKMD------------------

Query:  ----------------------METATNGQHSSSSSGGIHHGSSGQNES----NCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK
                              +   T  +   S      +GS+  N      +  +  P    S + ++  + + +++     +  F F     F A  
Subjt:  ----------------------METATNGQHSSSSSGGIHHGSSGQNES----NCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK

Query:  SLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSD
        S+ +Q M++AIA  G G  AP+++ L+  +L     ++     + +  WK  GC+IL +  +  +    L   + C +  +FLKSVD S     A  L +
Subjt:  SLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSD

Query:  LLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRM
        LL  +V EVG  NVVQVIT     YV AG+ LM  Y SL+W PC ++C++QMLE+  K+ W+   +E+A+ ITR++Y+H+ +LN M KFTSG +++ P  
Subjt:  LLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRM

Query:  SRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQAIISLLFLDRFWKNAREAVNISEPLVRILRIVDGD-MPAMGYVYEGIERAKVEIKTYCN
        S   TNF +L  +  L+ +L+ M + +EW    YS  P +  +++ L  + FWK      +++ PL+R LRIV  +  PAMGYVY  + RAK  IKT+  
Subjt:  SRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQAIISLLFLDRFWKNAREAVNISEPLVRILRIVDGD-MPAMGYVYEGIERAKVEIKTYCN

Query:  GIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDW
          ED Y+  W  IDR W  Q H  L AA  FLNP +FYN N +I   +     + + ++   D+ + +I +E   Y    G  G + A+  R    P +W
Subjt:  GIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDW

Query:  WSGYGYEIPTLQRAAIRILNQPC-STYGCKWNWSTFLTLHSKKRSRAEQEKLNDLVFVQCNLWLQHV-CWTRDGKYKPVVFDDIDVSLDW
        WS YG     L R AIRIL+Q C S+  C+ N      ++  K S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + IDV  +W
Subjt:  WSGYGYEIPTLQRAAIRILNQPC-STYGCKWNWSTFLTLHSKKRSRAEQEKLNDLVFVQCNLWLQHV-CWTRDGKYKPVVFDDIDVSLDW

AT4G15020.2 hAT transposon superfamily  7.3e-95  32.03
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGIL-STPKKQRAPKKPKMD------------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +  T ++QR   K   +                  
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGIL-STPKKQRAPKKPKMD------------------

Query:  ----------------------METATNGQHSSSSSGGIHHGSSGQNES----NCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK
                              +   T  +   S      +GS+  N      +  +  P    S + ++  + + +++     +  F F     F A  
Subjt:  ----------------------METATNGQHSSSSSGGIHHGSSGQNES----NCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK

Query:  SLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSD
        S+ +Q M++AIA  G G  AP+++ L+  +L     ++     + +  WK  GC+IL +  +  +    L   + C +  +FLKSVD S     A  L +
Subjt:  SLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKEAGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSD

Query:  LLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRM
        LL  +V EVG  NVVQVIT     YV AG+ LM  Y SL+W PC ++C++QMLE+  K+ W+   +E+A+ ITR++Y+H+ +LN M KFTSG +++ P  
Subjt:  LLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRM

Query:  SRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQAIISLLFLDRFWKNAREAVNISEPLVRILRIVDGD-MPAMGYVYEGIERAKVEIKTYCN
        S   TNF +L  +  L+ +L+ M + +EW    YS  P +  +++ L  + FWK      +++ PL+R LRIV  +  PAMGYVY  + RAK  IKT+  
Subjt:  SRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQAIISLLFLDRFWKNAREAVNISEPLVRILRIVDGD-MPAMGYVYEGIERAKVEIKTYCN

Query:  GIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDW
          ED Y+  W  IDR W  Q H  L AA  FLNP +FYN N +I   +     + + ++   D+ + +I +E   Y    G  G + A+  R    P +W
Subjt:  GIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTINAPGDW

Query:  WSGYGYEIPTLQRAAIRILNQPC-STYGCKWNWSTFLTLHSKKRSRAEQEKLNDLVFVQCNLWLQHV-CWTRDGKYKPVVFDDIDVSLDW
        WS YG     L R AIRIL+Q C S+  C+ N      ++  K S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + IDV  +W
Subjt:  WSGYGYEIPTLQRAAIRILNQPC-STYGCKWNWSTFLTLHSKKRSRAEQEKLNDLVFVQCNLWLQHV-CWTRDGKYKPVVFDDIDVSLDW


Sequences
CDS sequence
ATGGTTCGAGGAAGGGATGCTTGCTGGGAACATTGTGTCCTTGTTGATGCGACAAGACAGAAGGTTCGTTGTAATTATTGCCAGCGGGAATTCAGTGGAGGTGTATACAG
GATGAAATTTCATTTGGCTCAAATTAAAAACAAGGATATAGTTCCATGTTCTGAAGTCCCAACTGATGTTCGAGACCATATTCAAGGCATCTTAAGTACTCCTAAGAAAC
AGAGAGCACCCAAGAAACCAAAGATGGATATGGAAACTGCGACAAATGGACAACATAGCTCCTCATCAAGTGGTGGCATTCATCATGGATCCAGTGGACAGAATGAAAGC
AACTGCCCGTCAACGTTTCCGTGCCCTTCGCCAAGTTCACAACCACTAATTGATGATGCTCAAAAGCAGAAGAAGGATGAGACTGATAAAAAAGTTGCAATTTTTTTCTT
TCATAATTCTATTCCTTTCAGTGCTGCCAAGTCCTTGTATTATCAGGAAATGGTGAATGCAATTGCAGAATTTGGAGTAGGATACAAAGCACCAAGTTATGAGAAATTAA
AATCTACTCTTTTGGATAAAGTTAAAGGTGACATTCAGAATTCTTACAAAAAATATAGGGATGAATGGAAAGAAGCAGGCTGCACTATCCTGTGTGATAGCTGGTCAGAT
GGAAGGACCAAATCATTTCTAGTCATTTCCATTACATGTTCTAAAGGAACACTGTTTCTGAAGTCAGTCGATGTATCAGGTCGTGAAGATGATGCAACTTACCTTTCTGA
CTTGCTTGAGACTATCGTCCTCGAGGTTGGAGTGGAGAATGTTGTCCAAGTTATCACGGATGCTACTGCCAGTTACGTCTATGCTGGGAGGCTTCTCATGAACAAGTACA
CTTCCTTATTTTGGTCTCCATGTGTTTCTTATTGTGTCAATCAGATGTTGGAGGACATTAGTAAAATCGAGTGGGTCGGTACAGTTTTGGAGGAGGCAAAGATCATAACC
CGCTACATTTATAGTCATGCATGGATTTTGAATACAATGCGAAAATTCACAAGCGGGAAGGAATTGATCAGGCCGAGAATGAGTAGATTTGTGACTAATTTTCTCTCTTT
GAGGTCCATGGTGGTTCTTGAGGATAGTCTCAGGCTTATGTTTTCTCATTCTGAGTGGCTGTCTTCAATATATAGCAGGCGTCCCGACACGCAGGCAATTATTTCCTTGC
TGTTTTTGGATAGGTTTTGGAAGAATGCACGTGAAGCTGTCAACATTTCTGAACCACTTGTTAGAATTCTGAGAATTGTTGATGGAGACATGCCTGCCATGGGCTATGTA
TATGAAGGAATAGAGAGGGCAAAGGTTGAAATCAAAACATATTGCAATGGCATTGAGGATAAATATATGCCTATTTGGGGAACAATTGACAGGAGATGGAATTTGCAGCT
TCACACGACACTGCACGCAGCAGCTGCATTCCTTAACCCTTCCATTTTTTACAATCCGAATTTTAAGATTGATTTGAGAATTAGGAATGGATTTCAGGAAGCCATGTTGA
AGATGGCGACTACGGATAGAGATAAAATGGAGATTACCAGAGAACATCCTGTGTATGTAAATGGACAAGGTGCTCTTGGTACTGACTTTGCTGTCTTGGGGAGAACTATA
AATGCCCCAGGTGATTGGTGGTCCGGGTACGGTTACGAAATCCCAACACTCCAGAGAGCGGCGATACGAATACTAAACCAACCCTGTAGTACTTATGGGTGCAAATGGAA
CTGGAGCACATTCTTAACGTTACATTCAAAGAAGCGTAGTAGAGCCGAACAGGAAAAGTTGAATGATTTAGTGTTTGTACAGTGTAATCTTTGGTTGCAACACGTTTGTT
GGACTCGAGATGGTAAATATAAACCTGTTGTTTTTGATGATATAGATGTAAGTTTAGATTGGCCTACAGAGTTTGAATCCTCAGCTCCTATTTTAGATGATTCATGGTTG
GAGAATATGCCCCTTGAATGTAGAGGCAGCCCATAA
mRNA sequence
ATTTCAATTACTCGGCTCCTTTCCCTTTCCTCCAATCGGCCGTCACTTTCTTCCCGGATTCCCACCGGCCGGAATCTCACCACTCCGCCGTTACGGAACTTCCGGTGAAA
TTAAGATTGGATTTCACATCTTTTCCTTCTCTATAATTCTCTCTCCTCAGGGGCACAAAAGATTGTATACCTCCTGCTATGGTAAAGACATTCTACTGAACAATACCTGC
ACAGATTGTGATACTTCCTACAGAAAATGGTTCGAGGAAGGGATGCTTGCTGGGAACATTGTGTCCTTGTTGATGCGACAAGACAGAAGGTTCGTTGTAATTATTGCCAG
CGGGAATTCAGTGGAGGTGTATACAGGATGAAATTTCATTTGGCTCAAATTAAAAACAAGGATATAGTTCCATGTTCTGAAGTCCCAACTGATGTTCGAGACCATATTCA
AGGCATCTTAAGTACTCCTAAGAAACAGAGAGCACCCAAGAAACCAAAGATGGATATGGAAACTGCGACAAATGGACAACATAGCTCCTCATCAAGTGGTGGCATTCATC
ATGGATCCAGTGGACAGAATGAAAGCAACTGCCCGTCAACGTTTCCGTGCCCTTCGCCAAGTTCACAACCACTAATTGATGATGCTCAAAAGCAGAAGAAGGATGAGACT
GATAAAAAAGTTGCAATTTTTTTCTTTCATAATTCTATTCCTTTCAGTGCTGCCAAGTCCTTGTATTATCAGGAAATGGTGAATGCAATTGCAGAATTTGGAGTAGGATA
CAAAGCACCAAGTTATGAGAAATTAAAATCTACTCTTTTGGATAAAGTTAAAGGTGACATTCAGAATTCTTACAAAAAATATAGGGATGAATGGAAAGAAGCAGGCTGCA
CTATCCTGTGTGATAGCTGGTCAGATGGAAGGACCAAATCATTTCTAGTCATTTCCATTACATGTTCTAAAGGAACACTGTTTCTGAAGTCAGTCGATGTATCAGGTCGT
GAAGATGATGCAACTTACCTTTCTGACTTGCTTGAGACTATCGTCCTCGAGGTTGGAGTGGAGAATGTTGTCCAAGTTATCACGGATGCTACTGCCAGTTACGTCTATGC
TGGGAGGCTTCTCATGAACAAGTACACTTCCTTATTTTGGTCTCCATGTGTTTCTTATTGTGTCAATCAGATGTTGGAGGACATTAGTAAAATCGAGTGGGTCGGTACAG
TTTTGGAGGAGGCAAAGATCATAACCCGCTACATTTATAGTCATGCATGGATTTTGAATACAATGCGAAAATTCACAAGCGGGAAGGAATTGATCAGGCCGAGAATGAGT
AGATTTGTGACTAATTTTCTCTCTTTGAGGTCCATGGTGGTTCTTGAGGATAGTCTCAGGCTTATGTTTTCTCATTCTGAGTGGCTGTCTTCAATATATAGCAGGCGTCC
CGACACGCAGGCAATTATTTCCTTGCTGTTTTTGGATAGGTTTTGGAAGAATGCACGTGAAGCTGTCAACATTTCTGAACCACTTGTTAGAATTCTGAGAATTGTTGATG
GAGACATGCCTGCCATGGGCTATGTATATGAAGGAATAGAGAGGGCAAAGGTTGAAATCAAAACATATTGCAATGGCATTGAGGATAAATATATGCCTATTTGGGGAACA
ATTGACAGGAGATGGAATTTGCAGCTTCACACGACACTGCACGCAGCAGCTGCATTCCTTAACCCTTCCATTTTTTACAATCCGAATTTTAAGATTGATTTGAGAATTAG
GAATGGATTTCAGGAAGCCATGTTGAAGATGGCGACTACGGATAGAGATAAAATGGAGATTACCAGAGAACATCCTGTGTATGTAAATGGACAAGGTGCTCTTGGTACTG
ACTTTGCTGTCTTGGGGAGAACTATAAATGCCCCAGGTGATTGGTGGTCCGGGTACGGTTACGAAATCCCAACACTCCAGAGAGCGGCGATACGAATACTAAACCAACCC
TGTAGTACTTATGGGTGCAAATGGAACTGGAGCACATTCTTAACGTTACATTCAAAGAAGCGTAGTAGAGCCGAACAGGAAAAGTTGAATGATTTAGTGTTTGTACAGTG
TAATCTTTGGTTGCAACACGTTTGTTGGACTCGAGATGGTAAATATAAACCTGTTGTTTTTGATGATATAGATGTAAGTTTAGATTGGCCTACAGAGTTTGAATCCTCAG
CTCCTATTTTAGATGATTCATGGTTGGAGAATATGCCCCTTGAATGTAGAGGCAGCCCATAACATTTAAGGCAGTCAGAAACAGATAGTTCATTTTTTCCCTTGATTGTA
TATTATGTCCAAATATTGCACTCAAGGGTTGATAGATATCAAAGAGAAGTTGTAAATTAGCATTTAACTCAAGTCAAACTCTTGCTGCCTTAAACATGAAAGAACTTAG
Protein sequence
MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKMDMETATNGQHSSSSSGGIHHGSSGQNES
NCPSTFPCPSPSSQPLIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEFGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKEAGCTILCDSWSD
GRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMNKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIIT
RYIYSHAWILNTMRKFTSGKELIRPRMSRFVTNFLSLRSMVVLEDSLRLMFSHSEWLSSIYSRRPDTQAIISLLFLDRFWKNAREAVNISEPLVRILRIVDGDMPAMGYV
YEGIERAKVEIKTYCNGIEDKYMPIWGTIDRRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDRDKMEITREHPVYVNGQGALGTDFAVLGRTI
NAPGDWWSGYGYEIPTLQRAAIRILNQPCSTYGCKWNWSTFLTLHSKKRSRAEQEKLNDLVFVQCNLWLQHVCWTRDGKYKPVVFDDIDVSLDWPTEFESSAPILDDSWL
ENMPLECRGSP