; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0002250 (gene) of Snake gourd v1 genome

Gene IDTan0002250
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionHAT transposon superfamily
Genome locationLG01:22174163..22181784
RNA-Seq ExpressionTan0002250
SyntenyTan0002250
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR003656 - Zinc finger, BED-type
IPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015651.1 hypothetical protein SDJN02_23288, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0092.72Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FP  SPSA+PPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSYEKLKSTLL KVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWILNTMRKFTSGKELIRPRI+RFVTNFLSLRSIV LED LKHMF+HSEW SSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        I+S LYLDRFWKDA EAVNISEPLIRILR+VDGDMPAMGY+YEGIERAK E+K YYNGIEDKYMPIW+TI+RRWNLQLHTTLH AAAFLNPSIFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK
        ID+RIRNGFQEAMLKMATTDKDKMEITREHP YVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQR AIR+LSQPCSSY C RWNWSTFETLHSKK
Subjt:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK

Query:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP
        RS  EQEKLNDLVFVQCNLWLQHI WTRDGKYKPVVFDDIDVSLEWPTE ESSA VLDDSWLDN+ LEC GSP
Subjt:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP

XP_022923437.1 uncharacterized protein LOC111431132 [Cucurbita moschata]0.0e+0093.02Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FP  SPSA+PPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSYEKLKSTLL KVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWILNTMRKFTSGKELIRPRI+RFVTNFLSLRSIV LED LKHMF+HSEW SSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        I+S LYLDRFWKDA EAVNISEPLIRILR+VDGDMPAMGY+YEGIERAK E+K YYNGIEDKYMPIW+TI+RRWNLQLHTTLH AAAFLNPSIFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK
        ID+RIRNGFQEAMLKMATTDKDKMEITREHP YVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQRAAIR+LSQPCSSY C RWNWSTFETLHSKK
Subjt:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK

Query:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP
        RS  EQEKLNDLVFVQCNLWLQHI WTRDGKYKPVVFDDIDVSLEWPTE ESSA VLDDSWLDN+PLEC GSP
Subjt:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP

XP_023007736.1 uncharacterized protein LOC111500259 [Cucurbita maxima]0.0e+0092.57Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRD IQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS  PC SPSA+P IDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSY+KLKSTLLDKVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWIL+TMRKFTSGKELIRPRI+RFVTNFLSLRSIV LED LKHMF+HSEW SSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        I+S LYLDRFWKDA EAVNISEPLIRILR+VDGDMPAMGY+YEGIERAK E+K YYNGIEDKYMPIW+TI+RRWNLQLHTTLH AAAFLNPSIFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK
        ID+RIRNGFQEAMLKMATTDKDKMEITREHP YVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQR AIR+LSQPCSSY C RWNWSTFETLHSKK
Subjt:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK

Query:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP
        RSR EQEKLNDLVFVQCNLWLQHI WTRDGKYKPVVFDDIDVSLEWPTE ESSA VLDDSWLDN+PLEC GSP
Subjt:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP

XP_038876874.1 uncharacterized protein LOC120069237 isoform X1 [Benincasa hispida]0.0e+0094.65Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FPCPSPSA+PPIDD QKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDI NSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKY DEWKETGCTILCDSWSDGRTKSFLVISITCSKG LFLKSVD+SG EDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHA ILNTMRKFTSGKELIRPRI+RFVTNFLSLRSIVI EDNLKHMF+HSEWLSSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        IISLLYLDRFWKDA EAVNI+EPLIRILRIVDGDMPAMGY++EGIERAK EIKTYYNGIEDKY+PIWETI+RRWNLQLHTTLH AAAFLNPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK
        ID+RIRNGFQEAMLKMATTDKDKMEITREHP YVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIR+L+QPCSSY C RWNWSTFETLHSKK
Subjt:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK

Query:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP
        RSR EQEKLNDLVFVQCNLWLQHI  TRDGKYKPVVFDDIDVSLEWPTEFE+SA VLDDSWLDN+PLECRGSP
Subjt:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP

XP_038876877.1 uncharacterized protein LOC120069237 isoform X2 [Benincasa hispida]0.0e+0094.65Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FPCPSPSA+PPIDD QKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDI NSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKY DEWKETGCTILCDSWSDGRTKSFLVISITCSKG LFLKSVD+SG EDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHA ILNTMRKFTSGKELIRPRI+RFVTNFLSLRSIVI EDNLKHMF+HSEWLSSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        IISLLYLDRFWKDA EAVNI+EPLIRILRIVDGDMPAMGY++EGIERAK EIKTYYNGIEDKY+PIWETI+RRWNLQLHTTLH AAAFLNPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK
        ID+RIRNGFQEAMLKMATTDKDKMEITREHP YVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIR+L+QPCSSY C RWNWSTFETLHSKK
Subjt:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK

Query:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP
        RSR EQEKLNDLVFVQCNLWLQHI  TRDGKYKPVVFDDIDVSLEWPTEFE+SA VLDDSWLDN+PLECRGSP
Subjt:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP

TrEMBL top hitse value%identityAlignment
A0A0A0L2E4 BED-type domain-containing protein0.0e+0091.83Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS +PC SPSA+PPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KK+RDEWKETGCTILCDSWSDG+TKSFLVIS+TCSKGTLFLKSVD+SG EDDATYLSDLLETI+LEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLEDISKIEWV  VLEEAKIITRYIYSHA ILNTMRKFT GKELIRPRI+RFVTNFLSLRSIVILEDNLKHMF+HSEWLSSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        IISLLYLDRFWKDAHEA+NI EPLIRILRIVDGDMPAMGY++EGIERAK EIKTYYNG EDKYMPIWETI+RRWNLQLHTTLH AAAFLNPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRCR-WNWSTFETLHSKK
        ID+RIRNGFQEAMLKMATTDKDKMEITREHP YVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAA+R+LSQPCSSY C  WNWSTFETLHSKK
Subjt:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRCR-WNWSTFETLHSKK

Query:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP
         SR EQEKL DLVFVQCNLWLQH+  TRD KYKPVVFDD+DVSLEWP+E E SA VLDDSWLDN+PLE RGSP
Subjt:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP

A0A1S3BLP8 uncharacterized protein LOC1034909270.0e+0092.58Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FPC SPSA+PPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDGRTKSFLVIS+TCSKGTLFLKSVD SG EDDATYLSDLLETIVLEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLEDISKIEWV TVLEEAKIITRYIYSHA ILNTMRKFT GKELIRPRI+RFVTNFLSLRSIVILE+NLKHMF+HSEWLSSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        IISLLYLDRFWKDAHEA+NI EPLIRILRIVDGDMPAMGY++EGIERAK EIKTYYNG EDKYMPIWETI+RRWNLQLHTTLH AAAFLNPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK
        ID+RIRNGFQEAMLKMATTDKDKMEITREHP YVNGQGALGTDFAILGRTIN+PGDWWSGYGYEIPTLQRAA+R+LSQPCSSY C RWNWSTFETLHSKK
Subjt:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK

Query:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLD-NMPLECRGSP
        RSR EQEKL DLVFVQCNLWLQHI  TRD KYKP+VFDDIDVSLEWP+E E SA VLDDSWLD N+PLECRGSP
Subjt:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLD-NMPLECRGSP

A0A5D3D7G5 HAT transposon superfamily0.0e+0092.58Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FPC SPSA+PPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDGRTKSFLVIS+TCSKGTLFLKSVD SG EDDATYLSDLLETIVLEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLEDISKIEWV TVLEEAKIITRYIYSHA ILNTMRKFT GKELIRPRI+RFVTNFLSLRSIVILE+NLKHMF+HSEWLSSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        IISLLYLDRFWKDAHEA+NI EPLIRILRIVDGDMPAMGY++EGIERAK EIKTYYNG EDKYMPIWETI+RRWNLQLHTTLH AAAFLNPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK
        ID+RIRNGFQEAMLKMATTDKDKMEITREHP YVNGQGALGTDFAILGRTIN+PGDWWSGYGYEIPTLQRAA+R+LSQPCSSY C RWNWSTFETLHSKK
Subjt:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK

Query:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLD-NMPLECRGSP
        RSR EQEKL DLVFVQCNLWLQHI  TRD KYKP+VFDDIDVSLEWP+E E SA VLDDSWLD N+PLECRGSP
Subjt:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLD-NMPLECRGSP

A0A6J1E9N1 uncharacterized protein LOC1114311320.0e+0093.02Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FP  SPSA+PPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSYEKLKSTLL KVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWILNTMRKFTSGKELIRPRI+RFVTNFLSLRSIV LED LKHMF+HSEW SSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        I+S LYLDRFWKDA EAVNISEPLIRILR+VDGDMPAMGY+YEGIERAK E+K YYNGIEDKYMPIW+TI+RRWNLQLHTTLH AAAFLNPSIFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK
        ID+RIRNGFQEAMLKMATTDKDKMEITREHP YVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQRAAIR+LSQPCSSY C RWNWSTFETLHSKK
Subjt:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK

Query:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP
        RS  EQEKLNDLVFVQCNLWLQHI WTRDGKYKPVVFDDIDVSLEWPTE ESSA VLDDSWLDN+PLEC GSP
Subjt:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP

A0A6J1KZI0 uncharacterized protein LOC1115002590.0e+0092.57Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRD IQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS  PC SPSA+P IDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSY+KLKSTLLDKVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ISITCSKGTLFLKSV++SGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHAWIL+TMRKFTSGKELIRPRI+RFVTNFLSLRSIV LED LKHMF+HSEW SSIYSRRPD QA
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK
        I+S LYLDRFWKDA EAVNISEPLIRILR+VDGDMPAMGY+YEGIERAK E+K YYNGIEDKYMPIW+TI+RRWNLQLHTTLH AAAFLNPSIFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK
        ID+RIRNGFQEAMLKMATTDKDKMEITREHP YVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQR AIR+LSQPCSSY C RWNWSTFETLHSKK
Subjt:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRC-RWNWSTFETLHSKK

Query:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP
        RSR EQEKLNDLVFVQCNLWLQHI WTRDGKYKPVVFDDIDVSLEWPTE ESSA VLDDSWLDN+PLEC GSP
Subjt:  RSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLDNMPLECRGSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily1.1e-11933.84Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVR +D CWE+   +D    KV+C +C R  +GG+ R+K HL+++ +K + PCA+V  DV D ++ ILS      A   P +                  
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
             + + +   P   P  +P+++     +    +D  ++ +++FFF N I F+ A+S  Y  M++A+A+ G G+ APS    K+  LD+VK DI    
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        K    EW  TGCTI+ ++W+D ++++ +  S++      F KSVD S    ++  L+DL ++++ ++G E++VQ+I D +  Y      L+  Y ++F S
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA
        PC S C+N +LE+ SK++WV   + +A++I++++Y+++ +L+ +RK T G+++IR  ++R V+NFLSL+S++  +  LKHMF+  E+ ++  + +P + +
Subjt:  PCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQA

Query:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK
         +++L  + FW+   E+V ISEP++++LR V    PA+G +YE + +AK  I+TYY   E+K+    + ++  W   LH+ LHAAAAFLNPSI YNP  K
Subjt:  IISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGYVYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFK

Query:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRCRWNWSTFETLHSKKR
            ++  F + + K+  T   + +IT +   +   +G  G + A+  R   +PG WW  +G   P LQR AIR+LSQ CS Y     WSTF+ +H ++R
Subjt:  IDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRCRWNWSTFETLHSKKR

Query:  SRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLD
        ++ ++E LN L +V  NL L  +      +  P+  +DID+  EW  E E+ +P     WLD
Subjt:  SRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSWLD

AT3G17450.1 hAT dimerisation domain-containing protein1.2e-9430.55Show/hide
Query:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETAT-----------------
        D  WEH +  D  ++KV+CNYC +  SGG+ R K HLA+I   ++ PC   P +V   I+  +   +  +   +P  +M   T                 
Subjt:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETAT-----------------

Query:  -------------NGQ--QHSSSASGGIHHGSSGQNESNCPSMFPCPSPSAE------PPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVN
                     NG+  +    +    +  S  + ++    M P  SPS+           +    +KD T   ++ F  H  +P  AA SLY+Q+M+ 
Subjt:  -------------NGQ--QHSSSASGGIHHGSSGQNESNCPSMFPCPSPSAE------PPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVN

Query:  AIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEV
         I  YG G+  PS +     LL +    I++  ++YR  W  TGC+I+ D+W++   K  +   ++C +G  F  S+D +   +DA  L   L+ +V ++
Subjt:  AIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEV

Query:  GVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMR-KFTSGKELIRPRISRFVTNFL
        G ENVVQVIT  TA +  AG+LL  K  +L+W+PC  +C   +LED SK+E+V   LE+A+ ITR+IY+  W+LN M+ +FT G +L+RP + R  + F 
Subjt:  GVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMR-KFTSGKELIRPRISRFVTNFL

Query:  SLRSIVILEDNLKHMFSHSEW-LSSIYSRRPDTQAIISLLYLDRFWKDAHEAVNISEPLIRILRIVD--GDMPAMGYVYEGIERAKAEIKTYYNGIEDKY
        +L+S++  + +L+ +F    W LS   ++  + + +  ++    FWK     +   +P+++++ +++  GD  +M Y Y  +  AK  IK+ ++    KY
Subjt:  SLRSIVILEDNLKHMFSHSEW-LSSIYSRRPDTQAIISLLYLDRFWKDAHEAVNISEPLIRILRIVD--GDMPAMGYVYEGIERAKAEIKTYYNGIEDKY

Query:  MPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGY
         P W  IE RWN   H  L+ AA F NP+  Y P+F     +  G  E ++++   +  ++    + P Y   +   GTD AI  RT   P  WW  +G 
Subjt:  MPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGYGY

Query:  EIPTLQRAAIRMLSQPCSSYRCRWNWSTFETLHSKKRSRGEQEKLNDLVFVQCNLWLQ
            LQR A+R+LS  CSS  C   WS ++ ++S+ +S+  ++   DL +V  NL L+
Subjt:  EIPTLQRAAIRMLSQPCSSYRCRWNWSTFETLHSKKRSRGEQEKLNDLVFVQCNLWLQ

AT3G22220.1 hAT transposon superfamily1.7e-8831.3Show/hide
Query:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGIL-STPKKQRAPKK-----------PKVDMET----
        +D+ W+HC V     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP +VR  +Q  +  T ++QR  +K           P  ++ET    
Subjt:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGIL-STPKKQRAPKK-----------PKVDMET----

Query:  ---ATNG-QQHSSSASGGIHHGSSGQN--------------------ESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYY
             NG +  SS    G   G + Q                     + +  ++ P    S +  +    K+++      +  F F     F AA S+  
Subjt:  ---ATNG-QQHSSSASGGIHHGSSGQN--------------------ESNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYY

Query:  QEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLET
        Q  ++AI   G G   P++E L+  +L     +++    + +  WK TGC++L    +       L   + C +  +FLKSVD S   D    L +LL+ 
Subjt:  QEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLET

Query:  IVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFV
        +V E+G  NVVQVIT     Y  AG+ LM  Y SL+W PC ++C+++MLE+  K++W+  ++E+A+ +TR IY+H+ +LN MRKFT G ++++P  +   
Subjt:  IVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRISRFV

Query:  TNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQAIISLLYLDRFWKDAHEAVNISEPLIRILRIVDGD-MPAMGYVYEGIERAKAEIKTYYNGIED
        TNF ++  I  L+  L+ M + SEW    YS+     A+   +  + FWK    A +I+ P++R+LRIV  +  PAMGYVY  + RAK  IKT      +
Subjt:  TNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQAIISLLYLDRFWKDAHEAVNISEPLIRILRIVDGD-MPAMGYVYEGIERAKAEIKTYYNGIED

Query:  KYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGY
        +Y+  W+ I+R W   L   L+AA  +LNP  FY+ + ++   I     + + K+      +  + ++   Y N  G  G + AI  R    P +WWS Y
Subjt:  KYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDWWSGY

Query:  GYEIPTLQRAAIRMLSQPC-SSYRCRWNWSTFETLHSKKRSRGEQEKLNDLVFVQCNLWLQHI--YWTRDGKYKPVVFDDIDVSLEW
        G     L R AIR+LSQ C SS     N ++   ++  K S  E+++LNDLVFVQ N+ L+ I    + D    P+   +++V  +W
Subjt:  GYEIPTLQRAAIRMLSQPC-SSYRCRWNWSTFETLHSKKRSRGEQEKLNDLVFVQCNLWLQHI--YWTRDGKYKPVVFDDIDVSLEW

AT4G15020.1 hAT transposon superfamily5.4e-9833.62Show/hide
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGIL-STPKKQRAPKKPK--------------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +  T ++QR   K                      
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGIL-STPKKQRAPKKPK--------------------

Query:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK
                        V  E+  +G   Q+   S      +GS+  N      +  ++ P    S +  +  + + +++     +  F F     F A  
Subjt:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK

Query:  SLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSD
        S+ +Q M++AIA  G G  AP+++ L+  +L     ++     + +  WK TGC+IL +  +  +    L   + C +  +FLKSVD S     A  L +
Subjt:  SLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSD

Query:  LLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRI
        LL  +V EVG  NVVQVIT     YV AG+ LM  Y SL+W PC ++C++QMLE+  K+ W+   +E+A+ ITR++Y+H+ +LN M KFTSG +++ P  
Subjt:  LLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRI

Query:  SRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQAIISLLYLDRFWKDAHEAVNISEPLIRILRIVDGD-MPAMGYVYEGIERAKAEIKTYYN
        S   TNF +L  I  L+ NL+ M + +EW    YS  P +  +++ L  + FWK      +++ PL+R LRIV  +  PAMGYVY  + RAK  IKT+  
Subjt:  SRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQAIISLLYLDRFWKDAHEAVNISEPLIRILRIVDGD-MPAMGYVYEGIERAKAEIKTYYN

Query:  GIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDW
          ED Y+  W+ I+R W  Q H  L AA  FLNP +FYN N +I   +     + + ++   DK + +I +E   Y    G  G + AI  R    P +W
Subjt:  GIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDW

Query:  WSGYGYEIPTLQRAAIRMLSQPC-SSYRCRWNWSTFETLHSKKRSRGEQEKLNDLVFVQCNLWLQHI-YWTRDGKYKPVVFDDIDVSLEW
        WS YG     L R AIR+LSQ C SS  CR N    E ++  K S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + IDV  EW
Subjt:  WSGYGYEIPTLQRAAIRMLSQPC-SSYRCRWNWSTFETLHSKKRSRGEQEKLNDLVFVQCNLWLQHI-YWTRDGKYKPVVFDDIDVSLEW

AT4G15020.2 hAT transposon superfamily5.4e-9833.62Show/hide
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGIL-STPKKQRAPKKPK--------------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +  T ++QR   K                      
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGIL-STPKKQRAPKKPK--------------------

Query:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK
                        V  E+  +G   Q+   S      +GS+  N      +  ++ P    S +  +  + + +++     +  F F     F A  
Subjt:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAK

Query:  SLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSD
        S+ +Q M++AIA  G G  AP+++ L+  +L     ++     + +  WK TGC+IL +  +  +    L   + C +  +FLKSVD S     A  L +
Subjt:  SLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSD

Query:  LLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRI
        LL  +V EVG  NVVQVIT     YV AG+ LM  Y SL+W PC ++C++QMLE+  K+ W+   +E+A+ ITR++Y+H+ +LN M KFTSG +++ P  
Subjt:  LLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKIITRYIYSHAWILNTMRKFTSGKELIRPRI

Query:  SRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQAIISLLYLDRFWKDAHEAVNISEPLIRILRIVDGD-MPAMGYVYEGIERAKAEIKTYYN
        S   TNF +L  I  L+ NL+ M + +EW    YS  P +  +++ L  + FWK      +++ PL+R LRIV  +  PAMGYVY  + RAK  IKT+  
Subjt:  SRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQAIISLLYLDRFWKDAHEAVNISEPLIRILRIVDGD-MPAMGYVYEGIERAKAEIKTYYN

Query:  GIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDW
          ED Y+  W+ I+R W  Q H  L AA  FLNP +FYN N +I   +     + + ++   DK + +I +E   Y    G  G + AI  R    P +W
Subjt:  GIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRTINAPGDW

Query:  WSGYGYEIPTLQRAAIRMLSQPC-SSYRCRWNWSTFETLHSKKRSRGEQEKLNDLVFVQCNLWLQHI-YWTRDGKYKPVVFDDIDVSLEW
        WS YG     L R AIR+LSQ C SS  CR N    E ++  K S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + IDV  EW
Subjt:  WSGYGYEIPTLQRAAIRMLSQPC-SSYRCRWNWSTFETLHSKKRSRGEQEKLNDLVFVQCNLWLQHI-YWTRDGKYKPVVFDDIDVSLEW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGAGGAAGGGATGCTTGCTGGGAACATTGTGTCCTTGTTGATGCAACGAGGCAGAAGGTTCGATGTAATTATTGCCAGCGGGAATTCAGTGGAGGTGTATACAG
GATGAAATTTCATTTGGCTCAAATTAAAAACAAAGATATAGTTCCATGTGCTGAAGTTCCGGCCGATGTTCGAGACCATATTCAAGGTATCTTAAGCACTCCTAAGAAAC
AGAGGGCACCCAAGAAACCAAAGGTGGATATGGAAACAGCTACAAATGGACAGCAACATAGCTCCTCTGCTAGTGGTGGCATCCATCATGGATCCAGTGGACAGAATGAA
AGCAACTGCCCTTCGATGTTTCCATGCCCTTCACCAAGTGCAGAACCACCAATTGATGATGCTCAAAAGCAGAAGAAGGATGAGACTGATAAAAAAGTTGCCATTTTTTT
CTTTCATAATTCTATTCCTTTCAGTGCTGCCAAGTCCTTGTATTATCAGGAAATGGTGAATGCAATAGCAGAATATGGAGTAGGATACAAAGCACCAAGTTATGAGAAAT
TGAAATCTACTCTTTTGGATAAAGTGAAAGGTGACATACAGAATTCTTACAAAAAATATAGGGATGAATGGAAAGAAACAGGCTGCACTATCCTGTGTGATAGCTGGTCA
GATGGAAGGACCAAATCATTTCTAGTCATTTCCATTACTTGTTCTAAAGGAACACTGTTCCTGAAGTCAGTCGATGTGTCAGGTCGTGAAGATGATGCAACTTACCTTTC
CGACTTGCTTGAGACTATCGTCCTCGAGGTTGGAGTGGAGAATGTTGTCCAAGTTATTACAGATGCTACTGCCAGTTATGTCTATGCTGGGAGGCTTCTCATGACCAAGT
ACACTTCCTTATTTTGGTCTCCATGTGTTTCTTATTGTGTTAATCAGATGTTGGAGGACATTAGTAAAATCGAGTGGGTCGGTACAGTATTAGAGGAGGCCAAGATCATC
ACCCGCTACATTTATAGTCATGCATGGATTTTAAATACAATGCGAAAATTCACCAGCGGGAAGGAATTGATAAGGCCGAGAATTAGTAGATTTGTGACTAATTTTCTATC
CTTGAGATCCATTGTGATTCTTGAGGACAATCTCAAGCACATGTTTTCTCATTCCGAGTGGCTGTCCTCGATTTATAGCAGACGTCCCGACACGCAGGCAATTATTTCCT
TGCTGTATTTGGATAGATTTTGGAAGGATGCACATGAAGCTGTCAACATTTCTGAACCACTTATTAGAATTCTGAGAATTGTTGATGGAGACATGCCTGCCATGGGCTAT
GTATATGAAGGAATAGAGAGGGCAAAGGCTGAAATAAAAACATATTACAATGGCATTGAGGATAAATATATGCCTATTTGGGAAACAATCGAGCGGAGATGGAACTTGCA
GCTTCACACGACGCTGCACGCAGCAGCAGCGTTCCTTAACCCTTCCATTTTTTACAATCCAAACTTCAAGATTGATATGAGAATTAGGAACGGGTTTCAGGAAGCCATGT
TGAAGATGGCGACGACGGATAAAGATAAAATGGAGATCACCAGAGAACATCCTGTATATGTAAACGGGCAAGGTGCTCTTGGTACTGACTTTGCCATATTGGGGAGAACT
ATAAATGCCCCAGGTGATTGGTGGTCCGGGTACGGTTACGAAATCCCGACGCTCCAGAGAGCGGCAATACGAATGCTAAGCCAACCCTGTAGTTCTTATAGGTGCAGATG
GAACTGGAGCACGTTCGAAACTTTACATTCGAAGAAGCGTAGTCGAGGTGAACAGGAAAAGTTGAATGATTTAGTGTTTGTACAGTGTAATCTTTGGTTGCAACACATTT
ATTGGACTCGGGATGGTAAATATAAACCTGTTGTATTTGATGATATAGATGTAAGTTTAGAATGGCCTACGGAGTTTGAATCCTCAGCTCCTGTTTTAGATGATTCATGG
TTGGATAATATGCCTCTTGAATGTAGAGGTAGCCCTTAA
mRNA sequenceShow/hide mRNA sequence
CTCCAATCGGCCGTCACTTTCTTCCCGAATTCCCACCGGCCGGAATTCACCACTCCAATTTCGGACCTTCCGACGAAACTAAGCTTCGATTTCAGATCTTATCCTTCTTA
CTAATTCTGTCTGCTCAGGGGCACAAATTATTGTATACCTTCTGCTATGGTAAAGACATTCTACTAAATAATAGCTGCACAGATTGTGATACTTTCTATAAACAATGGTT
CGAGGAAGGGATGCTTGCTGGGAACATTGTGTCCTTGTTGATGCAACGAGGCAGAAGGTTCGATGTAATTATTGCCAGCGGGAATTCAGTGGAGGTGTATACAGGATGAA
ATTTCATTTGGCTCAAATTAAAAACAAAGATATAGTTCCATGTGCTGAAGTTCCGGCCGATGTTCGAGACCATATTCAAGGTATCTTAAGCACTCCTAAGAAACAGAGGG
CACCCAAGAAACCAAAGGTGGATATGGAAACAGCTACAAATGGACAGCAACATAGCTCCTCTGCTAGTGGTGGCATCCATCATGGATCCAGTGGACAGAATGAAAGCAAC
TGCCCTTCGATGTTTCCATGCCCTTCACCAAGTGCAGAACCACCAATTGATGATGCTCAAAAGCAGAAGAAGGATGAGACTGATAAAAAAGTTGCCATTTTTTTCTTTCA
TAATTCTATTCCTTTCAGTGCTGCCAAGTCCTTGTATTATCAGGAAATGGTGAATGCAATAGCAGAATATGGAGTAGGATACAAAGCACCAAGTTATGAGAAATTGAAAT
CTACTCTTTTGGATAAAGTGAAAGGTGACATACAGAATTCTTACAAAAAATATAGGGATGAATGGAAAGAAACAGGCTGCACTATCCTGTGTGATAGCTGGTCAGATGGA
AGGACCAAATCATTTCTAGTCATTTCCATTACTTGTTCTAAAGGAACACTGTTCCTGAAGTCAGTCGATGTGTCAGGTCGTGAAGATGATGCAACTTACCTTTCCGACTT
GCTTGAGACTATCGTCCTCGAGGTTGGAGTGGAGAATGTTGTCCAAGTTATTACAGATGCTACTGCCAGTTATGTCTATGCTGGGAGGCTTCTCATGACCAAGTACACTT
CCTTATTTTGGTCTCCATGTGTTTCTTATTGTGTTAATCAGATGTTGGAGGACATTAGTAAAATCGAGTGGGTCGGTACAGTATTAGAGGAGGCCAAGATCATCACCCGC
TACATTTATAGTCATGCATGGATTTTAAATACAATGCGAAAATTCACCAGCGGGAAGGAATTGATAAGGCCGAGAATTAGTAGATTTGTGACTAATTTTCTATCCTTGAG
ATCCATTGTGATTCTTGAGGACAATCTCAAGCACATGTTTTCTCATTCCGAGTGGCTGTCCTCGATTTATAGCAGACGTCCCGACACGCAGGCAATTATTTCCTTGCTGT
ATTTGGATAGATTTTGGAAGGATGCACATGAAGCTGTCAACATTTCTGAACCACTTATTAGAATTCTGAGAATTGTTGATGGAGACATGCCTGCCATGGGCTATGTATAT
GAAGGAATAGAGAGGGCAAAGGCTGAAATAAAAACATATTACAATGGCATTGAGGATAAATATATGCCTATTTGGGAAACAATCGAGCGGAGATGGAACTTGCAGCTTCA
CACGACGCTGCACGCAGCAGCAGCGTTCCTTAACCCTTCCATTTTTTACAATCCAAACTTCAAGATTGATATGAGAATTAGGAACGGGTTTCAGGAAGCCATGTTGAAGA
TGGCGACGACGGATAAAGATAAAATGGAGATCACCAGAGAACATCCTGTATATGTAAACGGGCAAGGTGCTCTTGGTACTGACTTTGCCATATTGGGGAGAACTATAAAT
GCCCCAGGTGATTGGTGGTCCGGGTACGGTTACGAAATCCCGACGCTCCAGAGAGCGGCAATACGAATGCTAAGCCAACCCTGTAGTTCTTATAGGTGCAGATGGAACTG
GAGCACGTTCGAAACTTTACATTCGAAGAAGCGTAGTCGAGGTGAACAGGAAAAGTTGAATGATTTAGTGTTTGTACAGTGTAATCTTTGGTTGCAACACATTTATTGGA
CTCGGGATGGTAAATATAAACCTGTTGTATTTGATGATATAGATGTAAGTTTAGAATGGCCTACGGAGTTTGAATCCTCAGCTCCTGTTTTAGATGATTCATGGTTGGAT
AATATGCCTCTTGAATGTAGAGGTAGCCCTTAATATTTAAGGCAATCAGAAACAGATGGTTCATTATTCCCTTGATTCTATAATTATCTCCAAA
Protein sequenceShow/hide protein sequence
MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPADVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNE
SNCPSMFPCPSPSAEPPIDDAQKQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWS
DGRTKSFLVISITCSKGTLFLKSVDVSGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVGTVLEEAKII
TRYIYSHAWILNTMRKFTSGKELIRPRISRFVTNFLSLRSIVILEDNLKHMFSHSEWLSSIYSRRPDTQAIISLLYLDRFWKDAHEAVNISEPLIRILRIVDGDMPAMGY
VYEGIERAKAEIKTYYNGIEDKYMPIWETIERRWNLQLHTTLHAAAAFLNPSIFYNPNFKIDMRIRNGFQEAMLKMATTDKDKMEITREHPVYVNGQGALGTDFAILGRT
INAPGDWWSGYGYEIPTLQRAAIRMLSQPCSSYRCRWNWSTFETLHSKKRSRGEQEKLNDLVFVQCNLWLQHIYWTRDGKYKPVVFDDIDVSLEWPTEFESSAPVLDDSW
LDNMPLECRGSP