; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015515 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015515
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHAT transposon superfamily
Genome locationtig00004455:260048..262498
RNA-Seq ExpressionSgr015515
SyntenySgr015515
Gene Ontology termsGO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR003656 - Zinc finger, BED-type
IPR007021 - Domain of unknown function DUF659
IPR008906 - HAT, C-terminal dimerisation domain
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008448901.1 PREDICTED: uncharacterized protein LOC103490927 [Cucumis melo]0.0e+0089.78Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVPTDVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FPC SPSAQPP+DDAQ QKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDGRTKSFLV S+TCSKGTLFLKSVD SGHEDDATYLSDLLE+IVLEVGVENVVQ+ITD TASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVN+MLEDISK EWV TVLEEAKIITRYIYSH  ILNTMRKFT GKELIRPRITRFVTNFLSLRSIVILE+NLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK
        IISLLYLDRFWKDAHEA+NI EPL+RILRIVDGDMPAMGYI+ GIERAKVEIKTYYN  E+KY+PIW+ IDRRWNLQLHTTLH AAAF NPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN
        IDLRIRNGFQEAMLKMATTDKDKMEITREHP Y NGQGALGTDFAI+GRT+N+P           GDWWSGYGYEIPTLQRAA+RIL+QPCSSYGC RWN
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN

Query:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLD-NLPLECRGSP
        WSTFE+LH KKR++AE EKL DLVFVQCNLWLQHIC TRD KYKPIVFDDIDVSLEWP+E E SA VLDDSWLD NLPLECRGSP
Subjt:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLD-NLPLECRGSP

XP_022152799.1 uncharacterized protein LOC111020429 [Momordica charantia]0.0e+0089.46Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYC REFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQR PKKPK+DMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FPCPSPSAQP +D+ Q QKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPS EKLKSTLL+KVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS
        KKYRD+WKETGCTILCDSWSDGRTKS LVFS+ CSKGTLFLKSVDVSGHEDD TYLS+LLE++VLEVGVENVVQVITDT+ASY+YAGRLLMTKY SLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSY VN MLEDISK EW+GTVL EAKII+RYIYSH WILNT+RKFTSGKELIRPRI+ FVTNFLSLRSIVILEDNLKHMFAHSEWLSS YSRRPDAQ 
Subjt:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK
        I+SL+YLDRFWKDAHEAVNISEPL+R+LRIVDGDMPAMGYIY G+ER+KVEIKTYYN IEEKY+PIWDIID+RWNLQLHTTLHAAAAF NPSIFY+PNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGCRWNW
        ID+RIRNGFQEAMLKMA TDKDK+EITREHP Y NGQGALGTDFAI+GRT+NAP           GDWWSGYGYEIP+LQRAAIRIL+QPCSSYGCRWNW
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGCRWNW

Query:  STFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
         TFESLH KKRN+AE EKLNDLVFVQCNLWLQHICWTRD KYKP+VFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
Subjt:  STFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

XP_022923437.1 uncharacterized protein LOC111431132 [Cucurbita moschata]0.0e+0089.47Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVP DVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FP  SPSAQPP+DDAQ QKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSYEKLKSTLL KVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ SITCSKGTLFLKSV++SG EDDATYLSDLLE+IVLEVGVENVVQVITD TASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVN+MLED+SK EWVGTVL+EAKII RY+YSH WILNTMRKFTSGKELIRPRITRFVTNFLSLRSIV LED LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK
        I+S LYLDRFWKDA EAVNISEPL+RILR+VDGDMPAMGYIY GIERAKVE+K YYN IE+KY+PIWD IDRRWNLQLHTTLH AAAF NPSIFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN
        IDLRIRNGFQEAMLKMATTDKDKMEITREHP Y N QGALGTDFAI+GRT+N P           GDWWSGYGYEIPTLQRAAIRIL+QPCSSYGC RWN
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN

Query:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
        WSTFE+LH KKR+  E EKLNDLVFVQCNLWLQHI WTRDGKYKP+VFDDIDVSLEWPTE ESSA VLDDSWLDNLPLEC GSP
Subjt:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

XP_038876874.1 uncharacterized protein LOC120069237 isoform X1 [Benincasa hispida]0.0e+0091.81Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVPTDVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FPCPSPSAQPP+DD Q QKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDI NSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS
        KKY DEWKETGCTILCDSWSDGRTKSFLV SITCSKG LFLKSVD+SGHEDDATYLSDLLE+IVLEVGVENVVQVITD TASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVN+MLEDISK EWVGTVLEEAKIITRYIYSH  ILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVI EDNLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK
        IISLLYLDRFWKDA EAVNI+EPL+RILRIVDGDMPAMGYI+ GIERAKVEIKTYYN IE+KY+PIW+ IDRRWNLQLHTTLH AAAF NPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN
        IDLRIRNGFQEAMLKMATTDKDKMEITREHP Y NGQGALGTDFAI+GRT+NAP           GDWWSGYGYEIPTLQRAAIRILNQPCSSYGC RWN
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN

Query:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
        WSTFE+LH KKR++AE EKLNDLVFVQCNLWLQHIC TRDGKYKP+VFDDIDVSLEWPTEFE+SA VLDDSWLDNLPLECRGSP
Subjt:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

XP_038876877.1 uncharacterized protein LOC120069237 isoform X2 [Benincasa hispida]0.0e+0091.81Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVPTDVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FPCPSPSAQPP+DD Q QKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDI NSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS
        KKY DEWKETGCTILCDSWSDGRTKSFLV SITCSKG LFLKSVD+SGHEDDATYLSDLLE+IVLEVGVENVVQVITD TASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVN+MLEDISK EWVGTVLEEAKIITRYIYSH  ILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVI EDNLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK
        IISLLYLDRFWKDA EAVNI+EPL+RILRIVDGDMPAMGYI+ GIERAKVEIKTYYN IE+KY+PIW+ IDRRWNLQLHTTLH AAAF NPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN
        IDLRIRNGFQEAMLKMATTDKDKMEITREHP Y NGQGALGTDFAI+GRT+NAP           GDWWSGYGYEIPTLQRAAIRILNQPCSSYGC RWN
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN

Query:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
        WSTFE+LH KKR++AE EKLNDLVFVQCNLWLQHIC TRDGKYKP+VFDDIDVSLEWPTEFE+SA VLDDSWLDNLPLECRGSP
Subjt:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

TrEMBL top hitse value%identityAlignment
A0A1S3BLP8 uncharacterized protein LOC1034909270.0e+0089.78Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVPTDVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FPC SPSAQPP+DDAQ QKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDGRTKSFLV S+TCSKGTLFLKSVD SGHEDDATYLSDLLE+IVLEVGVENVVQ+ITD TASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVN+MLEDISK EWV TVLEEAKIITRYIYSH  ILNTMRKFT GKELIRPRITRFVTNFLSLRSIVILE+NLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK
        IISLLYLDRFWKDAHEA+NI EPL+RILRIVDGDMPAMGYI+ GIERAKVEIKTYYN  E+KY+PIW+ IDRRWNLQLHTTLH AAAF NPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN
        IDLRIRNGFQEAMLKMATTDKDKMEITREHP Y NGQGALGTDFAI+GRT+N+P           GDWWSGYGYEIPTLQRAA+RIL+QPCSSYGC RWN
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN

Query:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLD-NLPLECRGSP
        WSTFE+LH KKR++AE EKL DLVFVQCNLWLQHIC TRD KYKPIVFDDIDVSLEWP+E E SA VLDDSWLD NLPLECRGSP
Subjt:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLD-NLPLECRGSP

A0A5D3D7G5 HAT transposon superfamily0.0e+0089.78Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVPTDVRDHIQGILSTPKKQ+APKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FPC SPSAQPP+DDAQ QKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMV+AIAEYG GYKAPSYEKLKSTLLDKVKGDI +SY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS
        KK RDEWKETGCTILCDSWSDGRTKSFLV S+TCSKGTLFLKSVD SGHEDDATYLSDLLE+IVLEVGVENVVQ+ITD TASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVN+MLEDISK EWV TVLEEAKIITRYIYSH  ILNTMRKFT GKELIRPRITRFVTNFLSLRSIVILE+NLKHMFAHSEWLSSIYSRRPDAQA
Subjt:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK
        IISLLYLDRFWKDAHEA+NI EPL+RILRIVDGDMPAMGYI+ GIERAKVEIKTYYN  E+KY+PIW+ IDRRWNLQLHTTLH AAAF NPS+FYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN
        IDLRIRNGFQEAMLKMATTDKDKMEITREHP Y NGQGALGTDFAI+GRT+N+P           GDWWSGYGYEIPTLQRAA+RIL+QPCSSYGC RWN
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN

Query:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLD-NLPLECRGSP
        WSTFE+LH KKR++AE EKL DLVFVQCNLWLQHIC TRD KYKPIVFDDIDVSLEWP+E E SA VLDDSWLD NLPLECRGSP
Subjt:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLD-NLPLECRGSP

A0A6J1DH28 uncharacterized protein LOC1110204290.0e+0089.46Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYC REFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQR PKKPK+DMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FPCPSPSAQP +D+ Q QKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPS EKLKSTLL+KVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS
        KKYRD+WKETGCTILCDSWSDGRTKS LVFS+ CSKGTLFLKSVDVSGHEDD TYLS+LLE++VLEVGVENVVQVITDT+ASY+YAGRLLMTKY SLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSY VN MLEDISK EW+GTVL EAKII+RYIYSH WILNT+RKFTSGKELIRPRI+ FVTNFLSLRSIVILEDNLKHMFAHSEWLSS YSRRPDAQ 
Subjt:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK
        I+SL+YLDRFWKDAHEAVNISEPL+R+LRIVDGDMPAMGYIY G+ER+KVEIKTYYN IEEKY+PIWDIID+RWNLQLHTTLHAAAAF NPSIFY+PNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGCRWNW
        ID+RIRNGFQEAMLKMA TDKDK+EITREHP Y NGQGALGTDFAI+GRT+NAP           GDWWSGYGYEIP+LQRAAIRIL+QPCSSYGCRWNW
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGCRWNW

Query:  STFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
         TFESLH KKRN+AE EKLNDLVFVQCNLWLQHICWTRD KYKP+VFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
Subjt:  STFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

A0A6J1E9N1 uncharacterized protein LOC1114311320.0e+0089.47Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVP DVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS FP  SPSAQPP+DDAQ QKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSYEKLKSTLL KVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ SITCSKGTLFLKSV++SG EDDATYLSDLLE+IVLEVGVENVVQVITD TASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVN+MLED+SK EWVGTVL+EAKII RY+YSH WILNTMRKFTSGKELIRPRITRFVTNFLSLRSIV LED LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK
        I+S LYLDRFWKDA EAVNISEPL+RILR+VDGDMPAMGYIY GIERAKVE+K YYN IE+KY+PIWD IDRRWNLQLHTTLH AAAF NPSIFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN
        IDLRIRNGFQEAMLKMATTDKDKMEITREHP Y N QGALGTDFAI+GRT+N P           GDWWSGYGYEIPTLQRAAIRIL+QPCSSYGC RWN
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN

Query:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
        WSTFE+LH KKR+  E EKLNDLVFVQCNLWLQHI WTRDGKYKP+VFDDIDVSLEWPTE ESSA VLDDSWLDNLPLEC GSP
Subjt:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

A0A6J1KZI0 uncharacterized protein LOC1115002590.0e+0088.89Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPC+EVP DVRD IQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
        IHHGSSGQNESNCPS  PC SPSAQP +DDAQ QKKDETDKKVA+FFFHNSIPFSAAKSLYYQEMVNAIAEYGVGY+APSY+KLKSTLLDKVKGDIQNSY
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS
        KKYRDEWKETGCTILC+SWSDGRTKSFL+ SITCSKGTLFLKSV++SG EDDATYLSDLLE+IVLEVGVENVVQVITD TASYVYAGRLLMTKYTSLFWS
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PCVSYCVN+MLED+SK EWVGTVL+EAKII RY+YSH WIL+TMRKFTSGKELIRPRITRFVTNFLSLRSIV LED LKHMFAHSEW SSIYSRRPDAQA
Subjt:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK
        I+S LYLDRFWKDA EAVNISEPL+RILR+VDGDMPAMGYIY GIERAKVE+K YYN IE+KY+PIWD IDRRWNLQLHTTLH AAAF NPSIFYNPNFK
Subjt:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN
        IDLRIRNGFQEAMLKMATTDKDKMEITREHP Y N QGALGTDFAI+GRT+N P           GDWWSGYGYEIPTLQR AIRIL+QPCSSYGC RWN
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGC-RWN

Query:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP
        WSTFE+LH KKR++ E EKLNDLVFVQCNLWLQHI WTRDGKYKP+VFDDIDVSLEWPTE ESSA VLDDSWLDNLPLEC GSP
Subjt:  WSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLDNLPLECRGSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G79740.1 hAT transposon superfamily3.0e-12035.07Show/hide
Query:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG
        MVR +D CWE+   +D    KV+C +C R  +GG+ R+K HL+++ +K + PC++V  DV D ++ ILS             D    TN  +     S  
Subjt:  MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGG

Query:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY
            +S         +FP   P+AQ           D  ++ +++FFF N I F+ A+S  Y  M++A+A+ G G+ APS    K+  LD+VK DI    
Subjt:  IHHGSSGQNESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSY

Query:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS
        K    EW  TGCTI+ ++W+D ++++ + FS++      F KSVD S +  ++  L+DL +S++ ++G E++VQ+I D +  Y      L+  Y ++F S
Subjt:  KKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWS

Query:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA
        PC S C+N +LE+ SK +WV   + +A++I++++Y+++ +L+ +RK T G+++IR  +TR V+NFLSL+S++  +  LKHMF   E+ ++  + +P + +
Subjt:  PCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQA

Query:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK
         +++L  + FW+   E+V ISEP++++LR V    PA+G IY  + +AK  I+TYY   E K+    DI+D  W   LH+ LHAAAAF NPSI YNP  K
Subjt:  IISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGYIYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFK

Query:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGCRWNW
            ++  F + + K+  T   + +IT +   +   +G  G + A+  R              S G WW  +G   P LQR AIRIL+Q CS Y     W
Subjt:  IDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGCRWNW

Query:  STFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLD
        STF+ +H ++RNK + E LN L +V  NL L  +      +  PI  +DID+  EW  E E+ +P     WLD
Subjt:  STFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEFESSAPVLDDSWLD

AT3G17450.1 hAT dimerisation domain-containing protein5.1e-9630.34Show/hide
Query:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETAT-----------------
        D  WEH +  D  ++KV+CNYC +  SGG+ R K HLA+I   ++ PC   P +V   I+  +   +  +   +P  +M   T                 
Subjt:  DACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETAT-----------------

Query:  -------------NGQ--QHSSSASGGIHHGSSGQNESNCPSMFPCPSPSAQ------PPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVN
                     NG+  +    +    +  S  + ++    M P  SPS+           +    +KD T   ++ F  H  +P  AA SLY+Q+M+ 
Subjt:  -------------NGQ--QHSSSASGGIHHGSSGQNESNCPSMFPCPSPSAQ------PPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVN

Query:  AIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEV
         I  YG G+  PS +     LL +    I++  ++YR  W  TGC+I+ D+W++   K  + F ++C +G  F  S+D +   +DA  L   L+ +V ++
Subjt:  AIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEV

Query:  GVENVVQVITDTTASYVYAGRLLMTKYTSLFWSPCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMR-KFTSGKELIRPRITRFVTNFL
        G ENVVQVIT  TA +  AG+LL  K  +L+W+PC  +C   +LED SK E+V   LE+A+ ITR+IY+ TW+LN M+ +FT G +L+RP + R  + F 
Subjt:  GVENVVQVITDTTASYVYAGRLLMTKYTSLFWSPCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMR-KFTSGKELIRPRITRFVTNFL

Query:  SLRSIVILEDNLKHMFAHSEW-LSSIYSRRPDAQAIISLLYLDRFWKDAHEAVNISEPLVRILRIVD--GDMPAMGYIYGGIERAKVEIKTYYNSIEEKY
        +L+S++  + +L+ +F    W LS   ++  + + +  ++    FWK     +   +P+++++ +++  GD  +M Y YG +  AK+ IK+ ++    KY
Subjt:  SLRSIVILEDNLKHMFAHSEW-LSSIYSRRPDAQAIISLLYLDRFWKDAHEAVNISEPLVRILRIVD--GDMPAMGYIYGGIERAKVEIKTYYNSIEEKY

Query:  LPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDA
         P W +I+ RWN   H  L+ AA FFNP+  Y P+F     +  G  E ++++   +  ++    + P Y   +   GTD AI  RT   P         
Subjt:  LPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFHDA

Query:  STGDWWSGYGYEIPTLQRAAIRILNQPCSSYGCRWNWSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQ
            WW  +G     LQR A+RIL+  CSS GC   WS ++ ++ + +++   +   DL +V  NL L+
Subjt:  STGDWWSGYGYEIPTLQRAAIRILNQPCSSYGCRWNWSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQ

AT3G22220.1 hAT transposon superfamily1.2e-8931.38Show/hide
Query:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGIL-STPKKQRAPKK-----------PKVDMET----
        +D+ W+HC V     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP +VR  +Q  +  T ++QR  +K           P  ++ET    
Subjt:  RDACWEHC-VLVDATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGIL-STPKKQRAPKK-----------PKVDMET----

Query:  ---ATNG-QQHSSSASGGIHHGSSGQN--------------------ESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYY
             NG +  SS    G   G + Q                     + +  ++ P    S +  V     +++      +  F F     F AA S+  
Subjt:  ---ATNG-QQHSSSASGGIHHGSSGQN--------------------ESNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYY

Query:  QEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLES
        Q  ++AI   G G   P++E L+  +L     +++    + +  WK TGC++L    +       L F + C +  +FLKSVD S   D    L +LL+ 
Subjt:  QEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLES

Query:  IVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWSPCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFV
        +V E+G  NVVQVIT     Y  AG+ LM  Y SL+W PC ++C++KMLE+  K +W+  ++E+A+ +TR IY+H+ +LN MRKFT G ++++P  T   
Subjt:  IVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWSPCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRITRFV

Query:  TNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAVNISEPLVRILRIVDGD-MPAMGYIYGGIERAKVEIKTYYNSIEE
        TNF ++  I  L+  L+ M   SEW    YS+     A+   +  + FWK    A +I+ P++R+LRIV  +  PAMGY+Y  + RAK  IKT   +  E
Subjt:  TNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAVNISEPLVRILRIVDGD-MPAMGYIYGGIERAKVEIKTYYNSIEE

Query:  KYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFH
        +Y+  W IIDR W   L   L+AA  + NP  FY+ + ++   I     + + K+      +  + ++   Y N  G  G + AI  R    P       
Subjt:  KYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVDFIFH

Query:  DASTGDWWSGYGYEIPTLQRAAIRILNQPC-SSYGCRWNWSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHIC--WTRDGKYKPIVFDDIDVSLEW
             +WWS YG     L R AIRIL+Q C SS G   N ++   ++ + +N  E ++LNDLVFVQ N+ L+ I    + D    P+   +++V  +W
Subjt:  DASTGDWWSGYGYEIPTLQRAAIRILNQPC-SSYGCRWNWSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHIC--WTRDGKYKPIVFDDIDVSLEW

AT4G15020.1 hAT transposon superfamily2.6e-9532.67Show/hide
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGIL-STPKKQRAPKKPK--------------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +  T ++QR   K                      
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGIL-STPKKQRAPKKPK--------------------

Query:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAK
                        V  E+  +G   Q+   S      +GS+  N      +  ++ P    S +  V  +   +++     +  F F     F A  
Subjt:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAK

Query:  SLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSD
        S+ +Q M++AIA  G G  AP+++ L+  +L     ++     + +  WK TGC+IL +  +  +    L F + C +  +FLKSVD S     A  L +
Subjt:  SLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSD

Query:  LLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWSPCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRI
        LL  +V EVG  NVVQVIT     YV AG+ LM  Y SL+W PC ++C+++MLE+  K  W+   +E+A+ ITR++Y+H+ +LN M KFTSG +++ P  
Subjt:  LLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWSPCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRI

Query:  TRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAVNISEPLVRILRIVDGD-MPAMGYIYGGIERAKVEIKTYYN
        +   TNF +L  I  L+ NL+ M   +EW    YS  P    +++ L  + FWK      +++ PL+R LRIV  +  PAMGY+Y  + RAK  IKT+  
Subjt:  TRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAVNISEPLVRILRIVDGD-MPAMGYIYGGIERAKVEIKTYYN

Query:  SIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVD
        +  E Y+  W IIDR W  Q H  L AA  F NP +FYN N +I   +     + + ++   DK + +I +E   Y    G  G + AI  R    P   
Subjt:  SIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVD

Query:  FIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPC-SSYGCRWNWSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHI-CWTRDGKYKPIVFDDIDVSLE
                 +WWS YG     L R AIRIL+Q C SS  CR N    E ++ + +N  E ++L+DLVFVQ N+ L+ +   + D    P+  + IDV  E
Subjt:  FIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPC-SSYGCRWNWSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHI-CWTRDGKYKPIVFDDIDVSLE

Query:  W
        W
Subjt:  W

AT4G15020.2 hAT transposon superfamily2.6e-9532.67Show/hide
Query:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGIL-STPKKQRAPKKPK--------------------
        +D  W+HC +     R ++RC YC++ F  GG+ R+K HLA  K +  + C +VP DVR  +Q  +  T ++QR   K                      
Subjt:  RDACWEHCVLVD-ATRQKVRCNYCQREF-SGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGIL-STPKKQRAPKKPK--------------------

Query:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAK
                        V  E+  +G   Q+   S      +GS+  N      +  ++ P    S +  V  +   +++     +  F F     F A  
Subjt:  ----------------VDMETATNG---QQHSSSASGGIHHGSSGQNES----NCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAK

Query:  SLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSD
        S+ +Q M++AIA  G G  AP+++ L+  +L     ++     + +  WK TGC+IL +  +  +    L F + C +  +FLKSVD S     A  L +
Subjt:  SLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWSDGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSD

Query:  LLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWSPCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRI
        LL  +V EVG  NVVQVIT     YV AG+ LM  Y SL+W PC ++C+++MLE+  K  W+   +E+A+ ITR++Y+H+ +LN M KFTSG +++ P  
Subjt:  LLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWSPCVSYCVNKMLEDISKNEWVGTVLEEAKIITRYIYSHTWILNTMRKFTSGKELIRPRI

Query:  TRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAVNISEPLVRILRIVDGD-MPAMGYIYGGIERAKVEIKTYYN
        +   TNF +L  I  L+ NL+ M   +EW    YS  P    +++ L  + FWK      +++ PL+R LRIV  +  PAMGY+Y  + RAK  IKT+  
Subjt:  TRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAVNISEPLVRILRIVDGD-MPAMGYIYGGIERAKVEIKTYYN

Query:  SIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVD
        +  E Y+  W IIDR W  Q H  L AA  F NP +FYN N +I   +     + + ++   DK + +I +E   Y    G  G + AI  R    P   
Subjt:  SIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRTLNAPDVD

Query:  FIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPC-SSYGCRWNWSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHI-CWTRDGKYKPIVFDDIDVSLE
                 +WWS YG     L R AIRIL+Q C SS  CR N    E ++ + +N  E ++L+DLVFVQ N+ L+ +   + D    P+  + IDV  E
Subjt:  FIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPC-SSYGCRWNWSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHI-CWTRDGKYKPIVFDDIDVSLE

Query:  W
        W
Subjt:  W


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCGAGGAAGGGATGCTTGTTGGGAACATTGTGTCCTTGTTGATGCAACGAGACAGAAGGTTCGATGTAATTATTGCCAGCGGGAATTCAGTGGAGGTGTATACAG
AATGAAATTTCATTTGGCTCAAATTAAAAACAAAGATATAGTTCCATGTTCTGAAGTCCCAACCGATGTTCGAGACCATATTCAGGGTATATTAAGCACTCCTAAGAAAC
AGAGGGCACCCAAGAAACCAAAGGTTGATATGGAAACTGCAACAAATGGACAGCAACATAGCTCCTCTGCTAGTGGGGGCATCCATCATGGATCCAGTGGACAGAATGAA
AGCAACTGCCCATCGATGTTTCCGTGCCCTTCACCGAGTGCACAACCACCAGTCGATGATGCGCAGACGCAGAAAAAGGATGAGACTGACAAAAAAGTTGCTATATTTTT
CTTTCATAATTCAATTCCTTTCAGTGCTGCCAAGTCCTTGTATTATCAGGAAATGGTAAATGCTATAGCAGAATATGGAGTAGGATACAAAGCGCCAAGTTATGAGAAAT
TAAAATCTACTCTTTTGGATAAAGTGAAAGGTGACATACAGAATTCTTACAAAAAATATAGGGATGAATGGAAAGAAACAGGCTGCACTATCCTGTGTGATAGCTGGTCA
GATGGAAGGACCAAATCGTTTCTAGTATTTTCCATTACGTGTTCCAAAGGAACGCTGTTTCTGAAGTCAGTCGATGTCTCAGGCCATGAAGATGATGCAACTTACCTGTC
TGACTTGCTCGAGTCTATCGTCCTCGAGGTTGGAGTCGAGAATGTTGTCCAAGTTATTACAGATACTACTGCCAGTTACGTCTATGCAGGAAGGCTTCTCATGACCAAGT
ACACTTCCTTATTTTGGTCACCATGTGTTTCTTATTGTGTTAATAAGATGTTGGAGGACATTAGTAAAAACGAGTGGGTCGGTACAGTACTGGAGGAGGCAAAGATCATC
ACCCGCTACATTTATAGTCACACGTGGATTTTGAATACAATGCGAAAATTCACCAGCGGGAAGGAATTGATCAGGCCAAGAATTACTAGATTCGTTACCAATTTTCTCTC
CTTGAGGTCCATTGTGATTCTCGAGGACAATCTCAAGCACATGTTTGCTCATTCAGAGTGGCTGTCATCGATTTATAGCAGGCGTCCTGACGCGCAGGCAATTATTTCCT
TGCTGTATTTGGATAGATTTTGGAAGGATGCACATGAAGCTGTCAACATTTCGGAACCACTTGTTAGAATACTGAGAATTGTTGATGGTGACATGCCTGCCATGGGCTAT
ATATATGGAGGAATAGAGAGAGCAAAGGTTGAAATCAAAACATATTACAACAGCATTGAAGAGAAATATTTGCCCATTTGGGACATCATCGACCGGAGATGGAATTTGCA
GCTTCACACAACGTTGCACGCAGCAGCAGCTTTCTTTAACCCGTCAATTTTTTACAATCCAAACTTTAAGATTGATCTGAGAATTAGGAACGGATTTCAGGAAGCCATGT
TGAAGATGGCCACGACGGATAAAGATAAAATGGAAATCACCAGAGAACATCCTGTATATGCAAATGGACAAGGTGCTCTTGGTACTGATTTTGCAATCATGGGGAGAACT
TTAAATGCCCCAGATGTTGATTTCATTTTCCATGATGCATCTACAGGTGATTGGTGGTCTGGATACGGTTATGAAATCCCCACGCTCCAGAGAGCGGCAATACGAATATT
GAACCAACCCTGTAGTTCTTATGGGTGTAGATGGAACTGGAGCACGTTTGAAAGTTTACATTTGAAAAAGCGTAACAAAGCCGAACTAGAAAAGTTGAACGATTTGGTGT
TTGTACAGTGCAATCTTTGGTTGCAACACATTTGTTGGACTCGGGACGGTAAATATAAACCCATTGTATTTGATGATATAGATGTAAGTTTAGAATGGCCTACGGAGTTC
GAATCCTCGGCTCCTGTTTTAGATGATTCATGGTTGGATAATCTGCCTCTTGAATGTAGAGGAAGCCCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCGAGGAAGGGATGCTTGTTGGGAACATTGTGTCCTTGTTGATGCAACGAGACAGAAGGTTCGATGTAATTATTGCCAGCGGGAATTCAGTGGAGGTGTATACAG
AATGAAATTTCATTTGGCTCAAATTAAAAACAAAGATATAGTTCCATGTTCTGAAGTCCCAACCGATGTTCGAGACCATATTCAGGGTATATTAAGCACTCCTAAGAAAC
AGAGGGCACCCAAGAAACCAAAGGTTGATATGGAAACTGCAACAAATGGACAGCAACATAGCTCCTCTGCTAGTGGGGGCATCCATCATGGATCCAGTGGACAGAATGAA
AGCAACTGCCCATCGATGTTTCCGTGCCCTTCACCGAGTGCACAACCACCAGTCGATGATGCGCAGACGCAGAAAAAGGATGAGACTGACAAAAAAGTTGCTATATTTTT
CTTTCATAATTCAATTCCTTTCAGTGCTGCCAAGTCCTTGTATTATCAGGAAATGGTAAATGCTATAGCAGAATATGGAGTAGGATACAAAGCGCCAAGTTATGAGAAAT
TAAAATCTACTCTTTTGGATAAAGTGAAAGGTGACATACAGAATTCTTACAAAAAATATAGGGATGAATGGAAAGAAACAGGCTGCACTATCCTGTGTGATAGCTGGTCA
GATGGAAGGACCAAATCGTTTCTAGTATTTTCCATTACGTGTTCCAAAGGAACGCTGTTTCTGAAGTCAGTCGATGTCTCAGGCCATGAAGATGATGCAACTTACCTGTC
TGACTTGCTCGAGTCTATCGTCCTCGAGGTTGGAGTCGAGAATGTTGTCCAAGTTATTACAGATACTACTGCCAGTTACGTCTATGCAGGAAGGCTTCTCATGACCAAGT
ACACTTCCTTATTTTGGTCACCATGTGTTTCTTATTGTGTTAATAAGATGTTGGAGGACATTAGTAAAAACGAGTGGGTCGGTACAGTACTGGAGGAGGCAAAGATCATC
ACCCGCTACATTTATAGTCACACGTGGATTTTGAATACAATGCGAAAATTCACCAGCGGGAAGGAATTGATCAGGCCAAGAATTACTAGATTCGTTACCAATTTTCTCTC
CTTGAGGTCCATTGTGATTCTCGAGGACAATCTCAAGCACATGTTTGCTCATTCAGAGTGGCTGTCATCGATTTATAGCAGGCGTCCTGACGCGCAGGCAATTATTTCCT
TGCTGTATTTGGATAGATTTTGGAAGGATGCACATGAAGCTGTCAACATTTCGGAACCACTTGTTAGAATACTGAGAATTGTTGATGGTGACATGCCTGCCATGGGCTAT
ATATATGGAGGAATAGAGAGAGCAAAGGTTGAAATCAAAACATATTACAACAGCATTGAAGAGAAATATTTGCCCATTTGGGACATCATCGACCGGAGATGGAATTTGCA
GCTTCACACAACGTTGCACGCAGCAGCAGCTTTCTTTAACCCGTCAATTTTTTACAATCCAAACTTTAAGATTGATCTGAGAATTAGGAACGGATTTCAGGAAGCCATGT
TGAAGATGGCCACGACGGATAAAGATAAAATGGAAATCACCAGAGAACATCCTGTATATGCAAATGGACAAGGTGCTCTTGGTACTGATTTTGCAATCATGGGGAGAACT
TTAAATGCCCCAGATGTTGATTTCATTTTCCATGATGCATCTACAGGTGATTGGTGGTCTGGATACGGTTATGAAATCCCCACGCTCCAGAGAGCGGCAATACGAATATT
GAACCAACCCTGTAGTTCTTATGGGTGTAGATGGAACTGGAGCACGTTTGAAAGTTTACATTTGAAAAAGCGTAACAAAGCCGAACTAGAAAAGTTGAACGATTTGGTGT
TTGTACAGTGCAATCTTTGGTTGCAACACATTTGTTGGACTCGGGACGGTAAATATAAACCCATTGTATTTGATGATATAGATGTAAGTTTAGAATGGCCTACGGAGTTC
GAATCCTCGGCTCCTGTTTTAGATGATTCATGGTTGGATAATCTGCCTCTTGAATGTAGAGGAAGCCCTTAA
Protein sequenceShow/hide protein sequence
MVRGRDACWEHCVLVDATRQKVRCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPTDVRDHIQGILSTPKKQRAPKKPKVDMETATNGQQHSSSASGGIHHGSSGQNE
SNCPSMFPCPSPSAQPPVDDAQTQKKDETDKKVAIFFFHNSIPFSAAKSLYYQEMVNAIAEYGVGYKAPSYEKLKSTLLDKVKGDIQNSYKKYRDEWKETGCTILCDSWS
DGRTKSFLVFSITCSKGTLFLKSVDVSGHEDDATYLSDLLESIVLEVGVENVVQVITDTTASYVYAGRLLMTKYTSLFWSPCVSYCVNKMLEDISKNEWVGTVLEEAKII
TRYIYSHTWILNTMRKFTSGKELIRPRITRFVTNFLSLRSIVILEDNLKHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAVNISEPLVRILRIVDGDMPAMGY
IYGGIERAKVEIKTYYNSIEEKYLPIWDIIDRRWNLQLHTTLHAAAAFFNPSIFYNPNFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPVYANGQGALGTDFAIMGRT
LNAPDVDFIFHDASTGDWWSGYGYEIPTLQRAAIRILNQPCSSYGCRWNWSTFESLHLKKRNKAELEKLNDLVFVQCNLWLQHICWTRDGKYKPIVFDDIDVSLEWPTEF
ESSAPVLDDSWLDNLPLECRGSP