; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh16G010960 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh16G010960
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionHAT transposon superfamily
Genome locationCmo_Chr16:7732864..7735624
RNA-Seq ExpressionCmoCh16G010960
SyntenyCmoCh16G010960
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR003656 - Zinc finger, BED-type
IPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7015651.1 hypothetical protein SDJN02_23288, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0099.7Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
        IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY

Query:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
        PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA

Query:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
        ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
Subjt:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQR AIRILSQPCSSYGCSRWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK

Query:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP
        RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNL LECGGSP
Subjt:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP

XP_022923437.1 uncharacterized protein LOC111431132 [Cucurbita moschata]0.0e+00100Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
        IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY

Query:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
        PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA

Query:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
        ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
Subjt:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK

Query:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP
        RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP
Subjt:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP

XP_023007736.1 uncharacterized protein LOC111500259 [Cucurbita maxima]0.0e+0098.51Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRD IQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
        IHHGSSGQNESNCPST P SSPSAQP IDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSY+KLKSTLL KVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY

Query:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
        PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWIL+TMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA

Query:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
        ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
Subjt:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQR AIRILSQPCSSYGCSRWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK

Query:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP
        RS TEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSA VLDDSWLDNLPLECGGSP
Subjt:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP

XP_038876874.1 uncharacterized protein LOC120069237 isoform X1 [Benincasa hispida]0.0e+0093.31Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
        IHHGSSGQNESNCPSTFP  SPSAQPPIDD QKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSYEKLKSTLL KVKGDI NSY
Subjt:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY

Query:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKY DEWKETGCTILC+SWSDGRTKSFL+ISITCSKG LFLKSV+ISG EDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHA ILNTMRKFTSGKELIRPRITRFVTNFLSLRSIV  ED LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA

Query:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
        I+S LYLDRFWKDA+EAVNI+EPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNGIEDKY+PIW+TIDRRWNLQLHTTLHTAAAFLNPS+FYNPNFK
Subjt:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQRAAIRIL+QPCSSYGCSRWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK

Query:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP
        RS  EQEKLNDLVFVQCNLWLQHI  TRDGKYKPVVFDDIDVSLEWPTE E+SAHVLDDSWLDNLPLEC GSP
Subjt:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP

XP_038876877.1 uncharacterized protein LOC120069237 isoform X2 [Benincasa hispida]0.0e+0093.31Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
        IHHGSSGQNESNCPSTFP  SPSAQPPIDD QKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSYEKLKSTLL KVKGDI NSY
Subjt:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY

Query:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKY DEWKETGCTILC+SWSDGRTKSFL+ISITCSKG LFLKSV+ISG EDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
        PCVSYCVNQMLED+SKIEWVGTVL+EAKII RY+YSHA ILNTMRKFTSGKELIRPRITRFVTNFLSLRSIV  ED LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA

Query:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
        I+S LYLDRFWKDA+EAVNI+EPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNGIEDKY+PIW+TIDRRWNLQLHTTLHTAAAFLNPS+FYNPNFK
Subjt:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQRAAIRIL+QPCSSYGCSRWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK

Query:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP
        RS  EQEKLNDLVFVQCNLWLQHI  TRDGKYKPVVFDDIDVSLEWPTE E+SAHVLDDSWLDNLPLEC GSP
Subjt:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP

TrEMBL top hitse value%identityAlignment
A0A0A0L2E4 BED-type domain-containing protein0.0e+0090.79Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
        IHHGSSGQNESNCPST+P  SPSAQPPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMV+AIAEYG GY+APSYEKLKSTLL KVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY

Query:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KK+RDEWKETGCTILC+SWSDG+TKSFL+IS+TCSKGTLFLKSV+ISG EDDATYLSDLLETI+LEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
        PCVSYCVNQMLED+SKIEWV  VL+EAKII RY+YSHA ILNTMRKFT GKELIRPRITRFVTNFLSLRSIV LED LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA

Query:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
        I+S LYLDRFWKDA EA+NI EPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNG EDKYMPIW+TIDRRWNLQLHTTLHTAAAFLNPS+FYNPNFK
Subjt:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN PGDWWSGYGYEIPTLQRAA+RILSQPCSSYGCS WNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK

Query:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP
         S  EQEKL DLVFVQCNLWLQH+  TRD KYKPVVFDD+DVSLEWP+ELE SAHVLDDSWLDNLPLE  GSP
Subjt:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP

A0A1S3BLP8 uncharacterized protein LOC1034909270.0e+0091.54Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
        IHHGSSGQNESNCPSTFP  SPSAQPPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMV+AIAEYG GY+APSYEKLKSTLL KVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY

Query:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILC+SWSDGRTKSFL+IS+TCSKGTLFLKSV+ SG EDDATYLSDLLETIVLEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
        PCVSYCVNQMLED+SKIEWV TVL+EAKII RY+YSHA ILNTMRKFT GKELIRPRITRFVTNFLSLRSIV LE+ LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA

Query:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
        I+S LYLDRFWKDA EA+NI EPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNG EDKYMPIW+TIDRRWNLQLHTTLHTAAAFLNPS+FYNPNFK
Subjt:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN+PGDWWSGYGYEIPTLQRAA+RILSQPCSSYGCSRWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK

Query:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLD-NLPLECGGSP
        RS  EQEKL DLVFVQCNLWLQHI  TRD KYKP+VFDDIDVSLEWP+ELE SAHVLDDSWLD NLPLEC GSP
Subjt:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLD-NLPLECGGSP

A0A5D3D7G5 HAT transposon superfamily0.0e+0091.54Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC EVP DVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
        IHHGSSGQNESNCPSTFP  SPSAQPPIDDAQKQKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMV+AIAEYG GY+APSYEKLKSTLL KVKGDI +SY
Subjt:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY

Query:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILC+SWSDGRTKSFL+IS+TCSKGTLFLKSV+ SG EDDATYLSDLLETIVLEVGVENVVQ+ITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
        PCVSYCVNQMLED+SKIEWV TVL+EAKII RY+YSHA ILNTMRKFT GKELIRPRITRFVTNFLSLRSIV LE+ LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA

Query:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
        I+S LYLDRFWKDA EA+NI EPLIRILR+VDGDMPAMGYI+EGIERAKVE+K YYNG EDKYMPIW+TIDRRWNLQLHTTLHTAAAFLNPS+FYNPNFK
Subjt:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVN QGALGTDFAILGRTIN+PGDWWSGYGYEIPTLQRAA+RILSQPCSSYGCSRWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK

Query:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLD-NLPLECGGSP
        RS  EQEKL DLVFVQCNLWLQHI  TRD KYKP+VFDDIDVSLEWP+ELE SAHVLDDSWLD NLPLEC GSP
Subjt:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLD-NLPLECGGSP

A0A6J1E9N1 uncharacterized protein LOC1114311320.0e+00100Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
        IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY

Query:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
        PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA

Query:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
        ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
Subjt:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK

Query:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP
        RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP
Subjt:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP

A0A6J1KZI0 uncharacterized protein LOC1115002590.0e+0098.51Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRD IQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
        IHHGSSGQNESNCPST P SSPSAQP IDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSY+KLKSTLL KVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY

Query:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
        PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWIL+TMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
Subjt:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA

Query:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
        ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
Subjt:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
        IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQR AIRILSQPCSSYGCSRWNWSTFETLHSKK
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK

Query:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP
        RS TEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSA VLDDSWLDNLPLECGGSP
Subjt:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLDNLPLECGGSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily3.1e-11735.14Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVR +D CWE+   +D    KV+C +C R  +GG+ R+K HL+++ +K + PCA+V  DV D ++ ILS             D    TN  +     S  
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY
            +S          FPSS P+AQ           D  ++ +++FFF N I F+ A+S  Y  M++A+A+ G G+ APS    K+  L +VK DI    
Subjt:  IHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSY

Query:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS
        K    EW  TGCTI+  +W+D ++++ +  S++      F KSV+ S    ++  L+DL ++++ ++G E++VQ+I D +  Y      L+  Y ++F S
Subjt:  KKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA
        PC S C+N +LE+ SK++WV   + +A++I+++VY+++ +L+ +RK T G+++IR  +TR V+NFLSL+S++  +  LKHMF   E+ ++  + +P + +
Subjt:  PCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQA

Query:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK
         ++ L  + FW+   E+V ISEP++++LR V    PA+G IYE + +AK  ++ YY   E+K+    D +D  W   LH+ LH AAAFLNPSI YNP  K
Subjt:  ILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGYIYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK
            ++  F + + K+  T   + +IT +   +  A+G  G + A+  R   +PG WW  +G   P LQR AIRILSQ CS Y   R  WSTF+ +H ++
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKK

Query:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLD
        R+  ++E LN L +V  NL L  +      +  P+  +DID+  EW  E E+ +      WLD
Subjt:  RSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDSWLD

AT3G17450.1 hAT dimerisation domain-containing protein4.9e-9129.64Show/hide
Query:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETAT-----------------
        D  WEH +  D  ++KV+CNYC +  SGG+ R K HLA+I   ++ PC   P +V   I+  +   +  +   +P  +M   T                 
Subjt:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETAT-----------------

Query:  -------------NGQ----QHSSSASGGIHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQK---KDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNA
                     NG+    +  S  S  +   S  + +      F S S S Q  +  +   +   + +    ++ F  H  +P  AA SLY+Q+M+  
Subjt:  -------------NGQ----QHSSSASGGIHHGSSGQNESNCPSTFPSSSPSAQPPIDDAQKQK---KDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNA

Query:  IAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSYKKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVG
        I  YG G+  PS +     LL +    I++  ++YR  W  TGC+I+ ++W++   K  +   ++C +G  F  S++ +   +DA  L   L+ +V ++G
Subjt:  IAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSYKKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVG

Query:  VENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMR-KFTSGKELIRPRITRFVTNFLS
         ENVVQVIT  TA +  AG+LL  K  +L+W+PC  +C   +LED SK+E+V   L++A+ I R++Y+  W+LN M+ +FT G +L+RP + R  + F +
Subjt:  VENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMR-KFTSGKELIRPRITRFVTNFLS

Query:  LRSIVFLEDGLKHMFAHSEW-QSSIYSRRPDAQAILSFLYLDRFWKDAREAVNISEPLIRILRLVD--GDMPAMGYIYEGIERAKVEVKAYYNGIEDKYM
        L+S++  +  L+ +F    W  S   ++  + + +   +    FWK  +  +   +P+++++ +++  GD  +M Y Y  +  AK+ +K+ ++    KY 
Subjt:  LRSIVFLEDGLKHMFAHSEW-QSSIYSRRPDAQAILSFLYLDRFWKDAREAVNISEPLIRILRLVD--GDMPAMGYIYEGIERAKVEVKAYYNGIEDKYM

Query:  PIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYE
        P W  I+ RWN   H  L+ AA F NP+  Y P+F     +  G  E ++++   +  ++    + P Y  A+   GTD AI  RT   P  WW  +G  
Subjt:  PIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGYGYE

Query:  IPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKKRSTTEQEKLNDLVFVQCNLWLQ
           LQR A+RILS  CSS GC    WS ++ ++S+ +S   ++   DL +V  NL L+
Subjt:  IPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKKRSTTEQEKLNDLVFVQCNLWLQ

AT3G22220.1 hAT transposon superfamily2.3e-8830.71Show/hide
Query:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGIL-STPKKQRAPKK-----------PKVDMET----
        +D+ W+HC V     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP +VR  +Q  +  T ++QR  +K           P  ++ET    
Subjt:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGIL-STPKKQRAPKK-----------PKVDMET----

Query:  ---ATNG-QQHSSSASGGIHHGSSGQN--------------------ESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYY
             NG +  SS    G   G + Q                     + +  +  P +  S +  +    K+++      +  F F     F AA S+  
Subjt:  ---ATNG-QQHSSSASGGIHHGSSGQN--------------------ESNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYY

Query:  QEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSYKKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLET
        Q  ++AI   G G   P++E L+  +L     +++    + +  WK TGC++L    +       L   + C +  +FLKSV+ S   D    L +LL+ 
Subjt:  QEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSYKKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLET

Query:  IVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFV
        +V E+G  NVVQVIT     Y  AG+ LM  Y SL+W PC ++C+++MLE+  K++W+  ++++A+ + R +Y+H+ +LN MRKFT G ++++P  T   
Subjt:  IVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRITRFV

Query:  TNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQAILSFLYLDRFWKDAREAVNISEPLIRILRLVDGD-MPAMGYIYEGIERAKVEVKAYYNGIED
        TNF ++  I  L+  L+ M   SEW    YS+     A+   +  + FWK    A +I+ P++R+LR+V  +  PAMGY+Y  + RAK  +K       +
Subjt:  TNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQAILSFLYLDRFWKDAREAVNISEPLIRILRLVDGD-MPAMGYIYEGIERAKVEVKAYYNGIED

Query:  KYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGY
        +Y+  W  IDR W   L   L+ A  +LNP  FY+ + ++   I     + + K+      +  + ++  +Y NA G  G + AI  R    P +WWS Y
Subjt:  KYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDWWSGY

Query:  GYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKKRSTTEQEKLNDLVFVQCNLWLQHI--RWTRDGKYKPVVFDDIDVSLEW
        G     L R AIRILSQ CSS   S  N ++   ++  K S  E+++LNDLVFVQ N+ L+ I    + D    P+   +++V  +W
Subjt:  GYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKKRSTTEQEKLNDLVFVQCNLWLQHI--RWTRDGKYKPVVFDDIDVSLEW

AT4G15020.1 hAT transposon superfamily5.6e-9532.61Show/hide
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGIL-STPKKQRAPKKPK--------------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +  T ++QR   K                      
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGIL-STPKKQRAPKKPK--------------------

Query:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAK
                        V  E+  +G   Q+   S      +GS+  N      +  +  P +  S +  +  + + +++     +  F F     F A  
Subjt:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAK

Query:  SLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSYKKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSD
        S+ +Q M++AIA  G G  AP+++ L+  +L     ++     + +  WK TGC+IL    +  +    L   + C +  +FLKSV+ S     A  L +
Subjt:  SLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSYKKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSD

Query:  LLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRI
        LL  +V EVG  NVVQVIT     YV AG+ LM  Y SL+W PC ++C++QMLE+  K+ W+   +++A+ I R+VY+H+ +LN M KFTSG +++ P  
Subjt:  LLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRI

Query:  TRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQAILSFLYLDRFWKDAREAVNISEPLIRILRLVDGD-MPAMGYIYEGIERAKVEVKAYYN
        +   TNF +L  I  L+  L+ M   +EW    YS  P    +++ L  + FWK      +++ PL+R LR+V  +  PAMGY+Y  + RAK  +K +  
Subjt:  TRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQAILSFLYLDRFWKDAREAVNISEPLIRILRLVDGD-MPAMGYIYEGIERAKVEVKAYYN

Query:  GIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDW
          ED Y+  W  IDR W  Q H  L  A  FLNP +FYN N +I   +     + + ++   DK + +I +E  +Y  A G  G + AI  R    P +W
Subjt:  GIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDW

Query:  WSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKKRSTTEQEKLNDLVFVQCNLWLQHI-RWTRDGKYKPVVFDDIDVSLEW
        WS YG     L R AIRILSQ CSS    R N    E ++  K S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + IDV  EW
Subjt:  WSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKKRSTTEQEKLNDLVFVQCNLWLQHI-RWTRDGKYKPVVFDDIDVSLEW

AT4G15020.2 hAT transposon superfamily5.6e-9532.61Show/hide
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGIL-STPKKQRAPKKPK--------------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +  T ++QR   K                      
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGIL-STPKKQRAPKKPK--------------------

Query:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAK
                        V  E+  +G   Q+   S      +GS+  N      +  +  P +  S +  +  + + +++     +  F F     F A  
Subjt:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAK

Query:  SLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSYKKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSD
        S+ +Q M++AIA  G G  AP+++ L+  +L     ++     + +  WK TGC+IL    +  +    L   + C +  +FLKSV+ S     A  L +
Subjt:  SLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSYKKYRDEWKETGCTILCNSWSDGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSD

Query:  LLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRI
        LL  +V EVG  NVVQVIT     YV AG+ LM  Y SL+W PC ++C++QMLE+  K+ W+   +++A+ I R+VY+H+ +LN M KFTSG +++ P  
Subjt:  LLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDLSKIEWVGTVLDEAKIIARYVYSHAWILNTMRKFTSGKELIRPRI

Query:  TRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQAILSFLYLDRFWKDAREAVNISEPLIRILRLVDGD-MPAMGYIYEGIERAKVEVKAYYN
        +   TNF +L  I  L+  L+ M   +EW    YS  P    +++ L  + FWK      +++ PL+R LR+V  +  PAMGY+Y  + RAK  +K +  
Subjt:  TRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQAILSFLYLDRFWKDAREAVNISEPLIRILRLVDGD-MPAMGYIYEGIERAKVEVKAYYN

Query:  GIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDW
          ED Y+  W  IDR W  Q H  L  A  FLNP +FYN N +I   +     + + ++   DK + +I +E  +Y  A G  G + AI  R    P +W
Subjt:  GIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRTINTPGDW

Query:  WSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKKRSTTEQEKLNDLVFVQCNLWLQHI-RWTRDGKYKPVVFDDIDVSLEW
        WS YG     L R AIRILSQ CSS    R N    E ++  K S  EQ++L+DLVFVQ N+ L+ +   + D    P+  + IDV  EW
Subjt:  WSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKKRSTTEQEKLNDLVFVQCNLWLQHI-RWTRDGKYKPVVFDDIDVSLEW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGAGGGAGGGACGCTTGCTGGGAACATTGTGTCCTTGTTGATGCGACGAGACAGAAGGTTCGATGCAATTATTGTCAGCGGGAATTCAGTGGAGGCGTATACAG
GATGAAGTTTCATTTAGCTCAAATTAAAAACAAAGATATAGTTCCATGTGCCGAAGTCCCGATCGACGTTCGAGACCATATTCAAGGTATATTAAGCACTCCTAAAAAAC
AAAGGGCACCCAAGAAACCAAAGGTCGATATGGAAACTGCAACAAATGGACAGCAACATAGCTCCTCAGCTAGCGGAGGCATCCATCATGGATCGAGTGGACAAAACGAA
AGCAACTGCCCATCGACATTTCCGTCCTCTTCACCGAGCGCACAACCACCGATCGATGATGCTCAAAAGCAGAAGAAGGATGAAACTGATAAAAAAGTTGCTGTCTTTTT
CTTCCATAACTCTATTCCTTTCAGTGCTGCCAAGTCCTTGTATTATCAGGAAATGGTGAATGCTATAGCAGAATATGGAGTAGGATACCGAGCACCGAGTTACGAGAAAC
TAAAATCGACTCTTTTGGTTAAGGTGAAAGGTGATATACAGAATTCTTACAAAAAGTATCGAGACGAATGGAAAGAAACAGGCTGCACGATCCTATGTAATAGCTGGTCC
GATGGACGGACCAAATCGTTTCTAATCATTTCCATTACGTGTTCAAAAGGAACACTGTTTCTGAAGTCGGTCAATATATCAGGTCGTGAAGATGATGCAACTTACCTGTC
CGACTTGCTCGAGACGATCGTCCTCGAGGTTGGTGTGGAGAATGTTGTCCAAGTTATTACAGATGCTACAGCCAGCTATGTCTATGCTGGTAGGCTTCTGATGACCAAAT
ACACTTCCTTATTTTGGTCTCCATGTGTTTCTTACTGTGTTAATCAGATGTTGGAGGACCTTAGTAAAATCGAGTGGGTCGGTACGGTATTGGATGAGGCAAAGATCATC
GCCCGCTACGTTTATAGTCATGCGTGGATTTTAAACACGATGCGAAAATTCACGAGCGGGAAGGAACTGATCAGGCCAAGAATTACTAGATTTGTGACTAATTTTCTGTC
CTTGAGATCCATTGTGTTTCTTGAGGACGGTCTCAAACACATGTTTGCTCATTCGGAGTGGCAGTCCTCGATTTATAGCAGGCGCCCCGACGCACAGGCGATTCTTTCCT
TTCTGTATTTGGATCGATTTTGGAAGGACGCTCGTGAAGCTGTCAACATTTCTGAGCCACTTATTAGAATTCTCAGGCTTGTTGATGGAGACATGCCTGCCATGGGCTAT
ATATATGAAGGAATCGAGAGGGCGAAGGTCGAAGTCAAAGCGTATTACAATGGCATTGAGGATAAATATATGCCTATTTGGGACACGATCGACCGGAGATGGAACTTGCA
GCTTCACACGACGTTACACACAGCAGCAGCGTTCCTTAACCCGTCGATCTTTTATAATCCGAACTTTAAGATTGATCTGAGAATTAGAAACGGATTTCAGGAAGCTATGT
TGAAGATGGCGACGACGGATAAAGATAAAATGGAGATCACTAGAGAACATCCTGCATATGTAAATGCTCAAGGTGCTCTTGGTACCGACTTCGCTATCTTGGGGAGAACT
ATAAATACCCCAGGGGATTGGTGGTCAGGGTACGGTTACGAAATCCCGACGCTCCAGAGAGCGGCGATACGAATACTAAGCCAACCCTGTAGTTCTTATGGGTGCAGCAG
ATGGAACTGGAGCACGTTCGAAACCTTACATTCGAAGAAGCGTAGTACAACCGAACAGGAAAAGCTGAACGATTTAGTGTTTGTACAGTGCAATCTCTGGTTGCAACATA
TTCGTTGGACTCGGGATGGTAAATATAAACCCGTCGTATTTGATGATATTGATGTGAGTTTAGAATGGCCGACCGAGCTAGAATCATCGGCTCATGTTTTAGATGATTCG
TGGTTGGATAATCTGCCTCTTGAATGTGGAGGCAGCCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGAGGGAGGGACGCTTGCTGGGAACATTGTGTCCTTGTTGATGCGACGAGACAGAAGGTTCGATGCAATTATTGTCAGCGGGAATTCAGTGGAGGCGTATACAG
GATGAAGTTTCATTTAGCTCAAATTAAAAACAAAGATATAGTTCCATGTGCCGAAGTCCCGATCGACGTTCGAGACCATATTCAAGGTATATTAAGCACTCCTAAAAAAC
AAAGGGCACCCAAGAAACCAAAGGTCGATATGGAAACTGCAACAAATGGACAGCAACATAGCTCCTCAGCTAGCGGAGGCATCCATCATGGATCGAGTGGACAAAACGAA
AGCAACTGCCCATCGACATTTCCGTCCTCTTCACCGAGCGCACAACCACCGATCGATGATGCTCAAAAGCAGAAGAAGGATGAAACTGATAAAAAAGTTGCTGTCTTTTT
CTTCCATAACTCTATTCCTTTCAGTGCTGCCAAGTCCTTGTATTATCAGGAAATGGTGAATGCTATAGCAGAATATGGAGTAGGATACCGAGCACCGAGTTACGAGAAAC
TAAAATCGACTCTTTTGGTTAAGGTGAAAGGTGATATACAGAATTCTTACAAAAAGTATCGAGACGAATGGAAAGAAACAGGCTGCACGATCCTATGTAATAGCTGGTCC
GATGGACGGACCAAATCGTTTCTAATCATTTCCATTACGTGTTCAAAAGGAACACTGTTTCTGAAGTCGGTCAATATATCAGGTCGTGAAGATGATGCAACTTACCTGTC
CGACTTGCTCGAGACGATCGTCCTCGAGGTTGGTGTGGAGAATGTTGTCCAAGTTATTACAGATGCTACAGCCAGCTATGTCTATGCTGGTAGGCTTCTGATGACCAAAT
ACACTTCCTTATTTTGGTCTCCATGTGTTTCTTACTGTGTTAATCAGATGTTGGAGGACCTTAGTAAAATCGAGTGGGTCGGTACGGTATTGGATGAGGCAAAGATCATC
GCCCGCTACGTTTATAGTCATGCGTGGATTTTAAACACGATGCGAAAATTCACGAGCGGGAAGGAACTGATCAGGCCAAGAATTACTAGATTTGTGACTAATTTTCTGTC
CTTGAGATCCATTGTGTTTCTTGAGGACGGTCTCAAACACATGTTTGCTCATTCGGAGTGGCAGTCCTCGATTTATAGCAGGCGCCCCGACGCACAGGCGATTCTTTCCT
TTCTGTATTTGGATCGATTTTGGAAGGACGCTCGTGAAGCTGTCAACATTTCTGAGCCACTTATTAGAATTCTCAGGCTTGTTGATGGAGACATGCCTGCCATGGGCTAT
ATATATGAAGGAATCGAGAGGGCGAAGGTCGAAGTCAAAGCGTATTACAATGGCATTGAGGATAAATATATGCCTATTTGGGACACGATCGACCGGAGATGGAACTTGCA
GCTTCACACGACGTTACACACAGCAGCAGCGTTCCTTAACCCGTCGATCTTTTATAATCCGAACTTTAAGATTGATCTGAGAATTAGAAACGGATTTCAGGAAGCTATGT
TGAAGATGGCGACGACGGATAAAGATAAAATGGAGATCACTAGAGAACATCCTGCATATGTAAATGCTCAAGGTGCTCTTGGTACCGACTTCGCTATCTTGGGGAGAACT
ATAAATACCCCAGGGGATTGGTGGTCAGGGTACGGTTACGAAATCCCGACGCTCCAGAGAGCGGCGATACGAATACTAAGCCAACCCTGTAGTTCTTATGGGTGCAGCAG
ATGGAACTGGAGCACGTTCGAAACCTTACATTCGAAGAAGCGTAGTACAACCGAACAGGAAAAGCTGAACGATTTAGTGTTTGTACAGTGCAATCTCTGGTTGCAACATA
TTCGTTGGACTCGGGATGGTAAATATAAACCCGTCGTATTTGATGATATTGATGTGAGTTTAGAATGGCCGACCGAGCTAGAATCATCGGCTCATGTTTTAGATGATTCG
TGGTTGGATAATCTGCCTCTTGAATGTGGAGGCAGCCCTTAATATCTTCAAGGCAATCAGAAACAGATACATCAATATTCCCTTGATTGCATATCATCTCCAAATATTGC
ACTCAGGTGTTCATACAAATCAAAGAGAAGCT
Protein sequenceShow/hide protein sequence
MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCAEVPIDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNE
SNCPSTFPSSSPSAQPPIDDAQKQKKDETDKKVAVFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYRAPSYEKLKSTLLVKVKGDIQNSYKKYRDEWKETGCTILCNSWS
DGRTKSFLIISITCSKGTLFLKSVNISGREDDATYLSDLLETIVLEVGVENVVQVITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDLSKIEWVGTVLDEAKII
ARYVYSHAWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVFLEDGLKHMFAHSEWQSSIYSRRPDAQAILSFLYLDRFWKDAREAVNISEPLIRILRLVDGDMPAMGY
IYEGIERAKVEVKAYYNGIEDKYMPIWDTIDRRWNLQLHTTLHTAAAFLNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNAQGALGTDFAILGRT
INTPGDWWSGYGYEIPTLQRAAIRILSQPCSSYGCSRWNWSTFETLHSKKRSTTEQEKLNDLVFVQCNLWLQHIRWTRDGKYKPVVFDDIDVSLEWPTELESSAHVLDDS
WLDNLPLECGGSP