; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0004572 (gene) of Snake gourd v1 genome

Gene IDTan0004572
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF3527)
Genome locationLG07:67840451..67844593
RNA-Seq ExpressionTan0004572
SyntenyTan0004572
Gene Ontology termsNA
InterPro domainsIPR021916 - Protein of unknown function DUF3527


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042757.1 DUF3527 domain-containing protein [Cucumis melo var. makuwa]0.0e+0083.4Show/hide
Query:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD
        MGQE ELDFDSLC+VDLSPNTVLPS+PR SSIK RSTRKK KH+DFVL+VKDDFTEI+FGG  RSRKSNSSSLV+LEDNE K  ELNV+DDFT+IR G D
Subjt:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD

Query:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS
        HSSHKSNSPS+VELEDD+ LKRRSKY SSED+  IEGMGIQGERRKIEIS D+YTSW+SGIVDSLCSSDEE PER   +LS   KL+Q SV K CIGPRS
Subjt:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS

Query:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF
        SDSFI+IY G EN+ET S+D SN          L N KK FKRDKVHALQKS SAKVEM  N LPLES+ RFR+SPKVHISPFRKMLDPFMKSKSVRSRF
Subjt:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF

Query:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW
        SH VEAGGDKA+K+I+L R+ET         S+TAKSSD  S+F NND+HHNVVASSPVHLHGSLKLEKKHGMPFFEFSQS PEDVYVAK WKTGNAFKW
Subjt:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW

Query:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND
        VYTFHTQDH KKS+A S GLN SCKNSLMVGQMQVSCYLSSEL+D GF+NSMVTEFVLYD ARARQSTASQ SCDSIQDAVKPPK S++GLVGE F VND
Subjt:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND

Query:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN
        GTPLEK K Q KHASENCD G IDSCPWDSA L PDLESA+IVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEP HHTTQETLKVVIP GN
Subjt:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN

Query:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA
        HGLPTVES GPSTLLDRWRLGGGCDCGGWDMGCPLL+LGT SSHCAENQAHKGKQ F LFHQGVKDTTPALTM++VKDGQYAVDFHARLSTLQAFSICVA
Subjt:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA

Query:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV
        ILHATEACNAVQ++ETKELQ CNSLKVLLEEEVKFLID VTMEEKKRETRMLKETP SYLFNPPFSPIARV
Subjt:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV

XP_004143981.1 uncharacterized protein LOC101216494 [Cucumis sativus]0.0e+0083.01Show/hide
Query:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD
        MGQE ELDFDSLC+VDLSPNTVLPS+PR SSIK RSTRKK KH+DFVL+VKDDFTEI+FGG  RSRKSNSSSL++ EDNE K  ELNV+DDFT+IR G D
Subjt:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD

Query:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS
        HSSHKSNSPSLVELEDD+ LKRRSKYQSSED+  IEGMGIQGERRKIEIS ++YTSW+SGIVDSLCSSDEE PER   +LS   KL+Q SV K C+GPRS
Subjt:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS

Query:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF
        SDSFI+IY G ENSET S+D SN          LHN KK FKRDKVHALQKS SAKVEM  N LP+ES+ RF  SPKVHISPFRK+LDPFMKSKSVRSRF
Subjt:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF

Query:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW
        SHAVEAG DKA+K+I+L R+ET         S+T KSSD DS+F NND+ HNVVASSPVHLHGSLKLEKKHGMPFFEFSQS PEDVYVAK WKTGNAFKW
Subjt:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW

Query:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND
        VYTFH+QDH KKS+A SFGLNHSCKNSLMVGQMQVSCYLSSEL+D GF+NSMVTEFVLYD ARARQSTASQ SCDSI DAVKPPK S++GLVGE FSVND
Subjt:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND

Query:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN
        GTPLEK K Q+KHASENCD GSIDSCPWDSADL PDLESA+IVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRK+EP HHTTQETLKVVIP GN
Subjt:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN

Query:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA
        HGLPTVES GPSTLLDRWRLGGGCDCGGWDMGCPLLILGT SSHCAENQAHKGKQ F LFHQG KDTTPALTM++VKDGQY+VDFHARLSTLQAFSICVA
Subjt:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA

Query:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV
        ILHATEACNAVQ+EETKELQ CNSLKVLLEEEVKFLID V MEEKKRETRMLKETP SYLFNPPFSPIARV
Subjt:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV

XP_008437275.1 PREDICTED: uncharacterized protein LOC103482749 [Cucumis melo]0.0e+0083.14Show/hide
Query:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD
        MGQE ELDFDSLC+VDLSPNTVLPS+PR SSIK RSTRKK KH+DFVL+VKDDFTEI+FGG  RSRKSNSSSLV+LEDNE K  ELNV+DDFT+IR G D
Subjt:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD

Query:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS
        HSSHKSNSPS+VELEDD+ LKRRSKY SSED+  IEGMGIQGERRKIEIS D+YTSW+SGIVDSLCSSDEE PER   +LS   KL+Q SV K CIGPRS
Subjt:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS

Query:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF
        SDSFI+IY G EN+ET S+D SN          L N KK FKRDKVHALQKS SAKVEM  N LPLES+ RFR+SPKVHISPFRKMLDPFMKSKSVRSRF
Subjt:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF

Query:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW
        SH VEAGGDKA+K+I+L R+ET         S+TAKSSD  S+F NND+HHNVVASSPVHLHGSLKLEKKHGMPFFEFSQS PEDVYVAK WKTGNAFKW
Subjt:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW

Query:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND
        VYTFHTQDH KKS+A S GLN SCKNSLMVGQMQVSCYLSSEL+D GF+NSMVTEFVLYD ARARQSTASQ SCDSIQDAVKPPK S++GLVGE F VND
Subjt:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND

Query:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN
        GTPLEK K Q KHASENCD G IDSCPWDSA L PDLESA+IVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEP HHTTQETLKVVIP GN
Subjt:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN

Query:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA
        HGLPTVES GPSTLLDRWRLGGGCDCGGWDMGCPLL+LGT SS  AENQAHKGKQ F LFHQGVKDTTPALTM++VKDGQYAVDFHARLSTLQAFSICVA
Subjt:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA

Query:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV
        ILHATEACNAVQ++ETKELQ CNSLKVLLEEEVKFLID VTMEEKKRETRMLKETP SYLFNPPFSPIARV
Subjt:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV

XP_022996048.1 uncharacterized protein LOC111491371 [Cucurbita maxima]0.0e+0081.66Show/hide
Query:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD
        MGQEPELDFDSLC+VDLSPNTVLPS+PRHSSIK+RS+RKK  H+DF+LNVKD FTEIRF G    RKSNSSSLV+LEDNEH + ELNVK+DFT+IR G D
Subjt:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD

Query:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLSKLDQSSVKKTCIGPRSSDS
        HSSHKSNSP LVELEDDEVL++RSK++ SEDV NIEGM IQ ER KIEISHDNYTSW +GIVDSLCSSDEE+ +      +KL+QSSV K CI PRSSDS
Subjt:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLSKLDQSSVKKTCIGPRSSDS

Query:  FIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRFSHA
        FIDIYLGSE SETA +DSSNH         LHN KK FKRDKVHAL  SLSAK+E  KNQ  L S+ RFR +PK HISPFRKMLDPFMKSKS  S F  A
Subjt:  FIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRFSHA

Query:  VEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKWVYT
        +EAGGDKAVKS+D+ +N+TYRKSLLQDF+N A +SDCDSHF +ND+HHNVVASSPVHLHGSLKLEKKHGMPFFEFSQS PEDVYVAK WKTGNAFKWVYT
Subjt:  VEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKWVYT

Query:  FHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVNDGTP
        FHTQDH+KKS+ASSFGLNHSCK +LMVGQMQVSCYLSSEL+D GF+NSM+TEFVLYDTA ARQS ASQESC+SIQ AVKPP  SNSG+VGEAFSVNDGTP
Subjt:  FHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVNDGTP

Query:  LEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGL
        LEK KL +KHASENCD  SIDSCPWDSADL PDLESASI+MQIPFSKRESLKYKRGDKTS KLNSAIQ+LSKIEQRKDE  HHT QETLKVV+PIGNHGL
Subjt:  LEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGL

Query:  PTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAILH
        PTVESHGPSTLLDRWRLGG CDCGGWDMGCPLL+LGTPS HCAE QA K KQ FELF QGVKD TPALTM+VVKDGQY V FHARLSTLQAFSICVAILH
Subjt:  PTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAILH

Query:  ATEACNAVQM-EETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV
        ATEACNAVQM EETKE+QH NSLKVLLEEEVKFLIDTVTMEEKKRETRM KETPPSYLF+PPFSPIA+V
Subjt:  ATEACNAVQM-EETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV

XP_038875789.1 uncharacterized protein LOC120068157 [Benincasa hispida]0.0e+0086.38Show/hide
Query:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD
        MGQE ELDFDSLCSVDLSPNTVLPS+PRHSSIK RSTRKK KHKDFVLNVK+DFTEIRFGG  RS+KSNSSSLV+LE NE    ELNV+DDFT+IRFG  
Subjt:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD

Query:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS
        HSSHK NSPSLV LEDD+VLK  SKYQSSEDV N+E M IQGERRKIE+S DNYTSW+SGIVDSLCSSDEE PER   VLS   KL+QSSV K CIGPRS
Subjt:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS

Query:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF
        SDSFIDIYLGSENSET S+D SN          LHN KK FKRDKVHALQKSLSAK+EMSK+QLPLES+ RFR +PK HISPFRK+LDPF KSKSVRSRF
Subjt:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF

Query:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW
        SHAVEAGGDKAVK+IDL RNETYRK LLQDFSNTAKSS CDS+F NND+HHN VASSPVHLHGSLKLEKKHGMPFFEFSQS PEDVYVAK WKTGNAFKW
Subjt:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW

Query:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND
        VYTFHTQDHQKKS+ SSFGLNHSCKNSLMVGQMQVSCYLSSEL+D GF+NSMVTEFVLYD ARARQSTASQ SCDSI D VKP K S+SGLVGEA SVND
Subjt:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND

Query:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN
        GTPLEK K  +KHA ENCD GSIDSCPWDSADL PDLESA+IVM IPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEP HHTTQETLKVVIPIGN
Subjt:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN

Query:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA
        HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGT SSHCAENQAHKGKQ FELFHQGVKD+TPALTM+VVKDGQYAVDFHARLSTLQAFSICVA
Subjt:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA

Query:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV
        ILHATEAC AVQ+EETKELQHCNSLKVLLEEEVKFLID VTMEEKKRETRMLKETP SYLFNPPFSPIARV
Subjt:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV

TrEMBL top hitse value%identityAlignment
A0A0A0KNX0 Uncharacterized protein0.0e+0083.01Show/hide
Query:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD
        MGQE ELDFDSLC+VDLSPNTVLPS+PR SSIK RSTRKK KH+DFVL+VKDDFTEI+FGG  RSRKSNSSSL++ EDNE K  ELNV+DDFT+IR G D
Subjt:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD

Query:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS
        HSSHKSNSPSLVELEDD+ LKRRSKYQSSED+  IEGMGIQGERRKIEIS ++YTSW+SGIVDSLCSSDEE PER   +LS   KL+Q SV K C+GPRS
Subjt:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS

Query:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF
        SDSFI+IY G ENSET S+D SN          LHN KK FKRDKVHALQKS SAKVEM  N LP+ES+ RF  SPKVHISPFRK+LDPFMKSKSVRSRF
Subjt:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF

Query:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW
        SHAVEAG DKA+K+I+L R+ET         S+T KSSD DS+F NND+ HNVVASSPVHLHGSLKLEKKHGMPFFEFSQS PEDVYVAK WKTGNAFKW
Subjt:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW

Query:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND
        VYTFH+QDH KKS+A SFGLNHSCKNSLMVGQMQVSCYLSSEL+D GF+NSMVTEFVLYD ARARQSTASQ SCDSI DAVKPPK S++GLVGE FSVND
Subjt:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND

Query:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN
        GTPLEK K Q+KHASENCD GSIDSCPWDSADL PDLESA+IVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRK+EP HHTTQETLKVVIP GN
Subjt:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN

Query:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA
        HGLPTVES GPSTLLDRWRLGGGCDCGGWDMGCPLLILGT SSHCAENQAHKGKQ F LFHQG KDTTPALTM++VKDGQY+VDFHARLSTLQAFSICVA
Subjt:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA

Query:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV
        ILHATEACNAVQ+EETKELQ CNSLKVLLEEEVKFLID V MEEKKRETRMLKETP SYLFNPPFSPIARV
Subjt:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV

A0A1S3AU82 uncharacterized protein LOC1034827490.0e+0083.14Show/hide
Query:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD
        MGQE ELDFDSLC+VDLSPNTVLPS+PR SSIK RSTRKK KH+DFVL+VKDDFTEI+FGG  RSRKSNSSSLV+LEDNE K  ELNV+DDFT+IR G D
Subjt:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD

Query:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS
        HSSHKSNSPS+VELEDD+ LKRRSKY SSED+  IEGMGIQGERRKIEIS D+YTSW+SGIVDSLCSSDEE PER   +LS   KL+Q SV K CIGPRS
Subjt:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS

Query:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF
        SDSFI+IY G EN+ET S+D SN          L N KK FKRDKVHALQKS SAKVEM  N LPLES+ RFR+SPKVHISPFRKMLDPFMKSKSVRSRF
Subjt:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF

Query:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW
        SH VEAGGDKA+K+I+L R+ET         S+TAKSSD  S+F NND+HHNVVASSPVHLHGSLKLEKKHGMPFFEFSQS PEDVYVAK WKTGNAFKW
Subjt:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW

Query:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND
        VYTFHTQDH KKS+A S GLN SCKNSLMVGQMQVSCYLSSEL+D GF+NSMVTEFVLYD ARARQSTASQ SCDSIQDAVKPPK S++GLVGE F VND
Subjt:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND

Query:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN
        GTPLEK K Q KHASENCD G IDSCPWDSA L PDLESA+IVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEP HHTTQETLKVVIP GN
Subjt:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN

Query:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA
        HGLPTVES GPSTLLDRWRLGGGCDCGGWDMGCPLL+LGT SS  AENQAHKGKQ F LFHQGVKDTTPALTM++VKDGQYAVDFHARLSTLQAFSICVA
Subjt:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA

Query:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV
        ILHATEACNAVQ++ETKELQ CNSLKVLLEEEVKFLID VTMEEKKRETRMLKETP SYLFNPPFSPIARV
Subjt:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV

A0A5A7TLY0 DUF3527 domain-containing protein0.0e+0083.4Show/hide
Query:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD
        MGQE ELDFDSLC+VDLSPNTVLPS+PR SSIK RSTRKK KH+DFVL+VKDDFTEI+FGG  RSRKSNSSSLV+LEDNE K  ELNV+DDFT+IR G D
Subjt:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD

Query:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS
        HSSHKSNSPS+VELEDD+ LKRRSKY SSED+  IEGMGIQGERRKIEIS D+YTSW+SGIVDSLCSSDEE PER   +LS   KL+Q SV K CIGPRS
Subjt:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLS---KLDQSSVKKTCIGPRS

Query:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF
        SDSFI+IY G EN+ET S+D SN          L N KK FKRDKVHALQKS SAKVEM  N LPLES+ RFR+SPKVHISPFRKMLDPFMKSKSVRSRF
Subjt:  SDSFIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRF

Query:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW
        SH VEAGGDKA+K+I+L R+ET         S+TAKSSD  S+F NND+HHNVVASSPVHLHGSLKLEKKHGMPFFEFSQS PEDVYVAK WKTGNAFKW
Subjt:  SHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKW

Query:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND
        VYTFHTQDH KKS+A S GLN SCKNSLMVGQMQVSCYLSSEL+D GF+NSMVTEFVLYD ARARQSTASQ SCDSIQDAVKPPK S++GLVGE F VND
Subjt:  VYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVND

Query:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN
        GTPLEK K Q KHASENCD G IDSCPWDSA L PDLESA+IVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEP HHTTQETLKVVIP GN
Subjt:  GTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGN

Query:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA
        HGLPTVES GPSTLLDRWRLGGGCDCGGWDMGCPLL+LGT SSHCAENQAHKGKQ F LFHQGVKDTTPALTM++VKDGQYAVDFHARLSTLQAFSICVA
Subjt:  HGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVA

Query:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV
        ILHATEACNAVQ++ETKELQ CNSLKVLLEEEVKFLID VTMEEKKRETRMLKETP SYLFNPPFSPIARV
Subjt:  ILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV

A0A6J1H5K2 uncharacterized protein LOC1114597780.0e+0080.88Show/hide
Query:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD
        MGQEPELDFDSLC+VDLSPNTVLPS+PRHSSIK+RS+RKK  H+DF+LNVKD FTEIRF G    RKSNSSSLV+LEDNEH + ELNVK+DFT+IR G D
Subjt:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD

Query:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLSKLDQSSVKKTCIGPRSSDS
        HSSHKSNSP LVELEDDEVLK+RSK++ SEDV NIEGMGIQ ER KIEIS DNYTSW + IVDSLCSSDEE+ +      +KL+QSSV K CI PRSSDS
Subjt:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLSKLDQSSVKKTCIGPRSSDS

Query:  FIDIYLGSENSETASEDSSNHL---------HNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRFSHA
        FIDIYLGSE S+T  +DSSNHL          N +K FKRDKVHAL KS SAK+E  KNQ  L S+  FR +PK H SPFRKMLDPFMKSKS  S F  A
Subjt:  FIDIYLGSENSETASEDSSNHL---------HNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRFSHA

Query:  VEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKWVYT
        +EAGGDKAVKS+D+ +N+TYRKSLLQDF+N A +SDCDSHF +ND+HHNVVASSPVHLHGSLKLEKKHGMPFFEFSQS PEDVYVAK WKTGNAFKW YT
Subjt:  VEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKWVYT

Query:  FHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVNDGTP
        FHTQDH+KK +ASSFGLNHSCK++LMVGQMQVSCYLSSEL+D GF+NSM+TEFVLYDTARARQS ASQESC+SIQDAVKPP  SNSG+VGEAFSVNDGTP
Subjt:  FHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVNDGTP

Query:  LEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGL
         EK KL +KHASENCD  SIDSCPWDSADL PDLESASI+MQIPFSKRESLKYKRGDKTS KLNSAIQ+LSKIEQRKDE  HHT QETLKVV+PIGNHGL
Subjt:  LEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGL

Query:  PTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAILH
        PTVESHGPSTLLDRWRLGG CDCGGWDMGCPLL+LGTPSSH AENQA KGKQ FELF QGVKD TPALTM+VVKDGQY V FHARLSTLQAFSICVAILH
Subjt:  PTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAILH

Query:  ATEACNAVQM-EETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV
        ATEACNAVQM EETKE+QH NSLKVLLEEEVKFLIDTVTMEEKK ETRM KETPPSYLF+PPFSPIA+V
Subjt:  ATEACNAVQM-EETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV

A0A6J1K7M5 uncharacterized protein LOC1114913710.0e+0081.66Show/hide
Query:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD
        MGQEPELDFDSLC+VDLSPNTVLPS+PRHSSIK+RS+RKK  H+DF+LNVKD FTEIRF G    RKSNSSSLV+LEDNEH + ELNVK+DFT+IR G D
Subjt:  MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLD

Query:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLSKLDQSSVKKTCIGPRSSDS
        HSSHKSNSP LVELEDDEVL++RSK++ SEDV NIEGM IQ ER KIEISHDNYTSW +GIVDSLCSSDEE+ +      +KL+QSSV K CI PRSSDS
Subjt:  HSSHKSNSPSLVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLSKLDQSSVKKTCIGPRSSDS

Query:  FIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRFSHA
        FIDIYLGSE SETA +DSSNH         LHN KK FKRDKVHAL  SLSAK+E  KNQ  L S+ RFR +PK HISPFRKMLDPFMKSKS  S F  A
Subjt:  FIDIYLGSENSETASEDSSNH---------LHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRFSHA

Query:  VEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKWVYT
        +EAGGDKAVKS+D+ +N+TYRKSLLQDF+N A +SDCDSHF +ND+HHNVVASSPVHLHGSLKLEKKHGMPFFEFSQS PEDVYVAK WKTGNAFKWVYT
Subjt:  VEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKWVYT

Query:  FHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVNDGTP
        FHTQDH+KKS+ASSFGLNHSCK +LMVGQMQVSCYLSSEL+D GF+NSM+TEFVLYDTA ARQS ASQESC+SIQ AVKPP  SNSG+VGEAFSVNDGTP
Subjt:  FHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVNDGTP

Query:  LEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGL
        LEK KL +KHASENCD  SIDSCPWDSADL PDLESASI+MQIPFSKRESLKYKRGDKTS KLNSAIQ+LSKIEQRKDE  HHT QETLKVV+PIGNHGL
Subjt:  LEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGL

Query:  PTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAILH
        PTVESHGPSTLLDRWRLGG CDCGGWDMGCPLL+LGTPS HCAE QA K KQ FELF QGVKD TPALTM+VVKDGQY V FHARLSTLQAFSICVAILH
Subjt:  PTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAILH

Query:  ATEACNAVQM-EETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV
        ATEACNAVQM EETKE+QH NSLKVLLEEEVKFLIDTVTMEEKKRETRM KETPPSYLF+PPFSPIA+V
Subjt:  ATEACNAVQM-EETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G63520.1 Protein of unknown function (DUF3527)2.7e-5336.32Show/hide
Query:  SSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKWVYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCY--LSSELKDDGFNNSMV
        SSPVHLH  L++E + GMP F FS   P+DVY+A         ++VY+F     +   + S        K S ++GQMQVS    L  E  ++    S V
Subjt:  SSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKWVYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCY--LSSELKDDGFNNSMV

Query:  TEFVLYDTARARQSTASQESCDSIQDAVK------PPKGSNSGLVGEAFSVNDGTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIP
        +EFVL+D ARAR+S    E+  S Q++ +        + S S L+ E     +       +   KH SEN   G     PW  +DL P LE A+IV+Q  
Subjt:  TEFVLYDTARARQSTASQESCDSIQDAVK------PPKGSNSGLVGEAFSVNDGTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIP

Query:  FSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAE
                    D +S          SK+  R+           +KV++P GNHGLP  E+  P+ +L RWR GGGCDC GWDMGC L +L +P      
Subjt:  FSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAE

Query:  NQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKR
           H G    ELF +  K+ TPA+TM  +++G Y V FHA+LS LQAFSICVA L  TE       E    L  C+SL+ L+E E               
Subjt:  NQAHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKR

Query:  ETRMLKETPPSYLFNPPFSPIARV
             +E  PS++ N  FSPI+RV
Subjt:  ETRMLKETPPSYLFNPPFSPIARV

AT2G33360.1 Protein of unknown function (DUF3527)2.8e-2628.03Show/hide
Query:  KKHGMPFFEFSQSFPEDVYVAKAWKT--GNAF-KWVYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARAR
        K +  P F FS    +++YVA    T  G+ F +    +    H KK   S        +   +VG+++VS   S    ++    ++  +FVL+ +    
Subjt:  KKHGMPFFEFSQSFPEDVYVAKAWKT--GNAF-KWVYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARAR

Query:  QSTASQESCDS------IQDAVKPPKGSNSGLVGEAFSVNDGTP-LEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKR
        Q    ++   +      + DA+K  K ++       FS     P     +  Q+H S+      +D+      DLPP+LE++++V++  F   E  + + 
Subjt:  QSTASQESCDS------IQDAVKPPKGSNSGLVGEAFSVNDGTP-LEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKR

Query:  GDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFE
         +K  G     ++ +  + + KD         ++ VVIP+G HG P   + GPS+L+ RW+ GG CDC GWD+GCPL +L        ++Q+     +FE
Subjt:  GDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFE

Query:  LFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAILHA
        LF +G+   +P L +  V+DG Y V   A++S LQ+FSI +A +H+
Subjt:  LFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAILHA

AT2G33360.2 Protein of unknown function (DUF3527)3.7e-2627.04Show/hide
Query:  AGGDKAVKSIDLLRNETYRKSL-------LQDFSNTAKSS-DCDSHFLNNDSHHNVVASSPVHLHGSLKL------------EKKHGMPFFEFSQSFPED
        +GG+  +KS +LL  +  + SL       LQ   N  K S    + F +N     +V+    +L   L L             K +  P F FS    ++
Subjt:  AGGDKAVKSIDLLRNETYRKSL-------LQDFSNTAKSS-DCDSHFLNNDSHHNVVASSPVHLHGSLKL------------EKKHGMPFFEFSQSFPED

Query:  VYVAKAWKT--GNAF-KWVYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDS------
        +YVA    T  G+ F +    +    H KK   S        +   +VG+++VS   S    ++    ++  +FVL+ +    Q    ++   +      
Subjt:  VYVAKAWKT--GNAF-KWVYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSMVTEFVLYDTARARQSTASQESCDS------

Query:  IQDAVKPPKGSNSGLVGEAFSVNDGTP-LEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSK
        + DA+K  K ++       FS     P     +  Q+H S+      +D+      DLPP+LE++++V++  F   E  + +  +K  G     ++ +  
Subjt:  IQDAVKPPKGSNSGLVGEAFSVNDGTP-LEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSK

Query:  IEQRKDEPSHHTTQETLKVVIPIGNHGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDV
        + + KD         ++ VVIP+G HG P   + GPS+L+ RW+ GG CDC GWD+GCPL +L        ++Q+     +FELF +G+   +P L +  
Subjt:  IEQRKDEPSHHTTQETLKVVIPIGNHGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALTMDV

Query:  VKDGQYAVDFHARLSTLQAFSICVAILHA
        V+DG Y V   A++S LQ+FSI +A +H+
Subjt:  VKDGQYAVDFHARLSTLQAFSICVAILHA

AT4G11450.1 Protein of unknown function (DUF3527)8.2e-11939.56Show/hide
Query:  LDFDSLCSVDLSPNTVLPSV-PRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLDHSSHK
        ++F     ++ SPN+VLP   P   S K+ STR K   +D +L VK+ FTEI F      R+  S+S                                 
Subjt:  LDFDSLCSVDLSPNTVLPSV-PRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLDHSSHK

Query:  SNSPSL-VELEDDEVLKRRSKYQSSEDVCN--IEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLSKLDQSSVKKTCIGPRSSDSFI
         N PS  + + D   L+R S YQSS ++     E  G +  + K+E+S  +  S++  +VDS      E      P  + LD    +K+ + P +S +FI
Subjt:  SNSPSL-VELEDDEVLKRRSKYQSSEDVCN--IEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLSKLDQSSVKKTCIGPRSSDSFI

Query:  DIYLGSENSETASE-DSSNHLHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRFSHAVEAGGDKAVK
        DI L S   + A   DS ++ H  +          L K  SAKV+  +     +S            S  RKM DPF+KSKS+RS   +  E+G      
Subjt:  DIYLGSENSETASE-DSSNHLHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRFSHAVEAGGDKAVK

Query:  SIDLLRNETYRKSLLQDFSNTAKSSD-CDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKWVYTFHTQDHQKK
           L RN    +S+L D+SN  K S+ C    +N D + +V+ SSPVHLH  LK+E K+G+P F+F    PE+VY AK WK+ N   WVYTF +   +K+
Subjt:  SIDLLRNETYRKSLLQDFSNTAKSSD-CDSHFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKWVYTFHTQDHQKK

Query:  SSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFN--NSMVTEFVLYDTARARQSTASQES----CDSIQDAVKPPKGSNSGLVGEAFSVNDGTPLEK
        SSAS  GL    K SL+V QMQV+C + SE++  G +    MV EFVLYD A+AR+S +++E      D++ +A K    S+S +   + S  D +   K
Subjt:  SSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFN--NSMVTEFVLYDTARARQSTASQES----CDSIQDAVKPPKGSNSGLVGEAFSVNDGTPLEK

Query:  PKLQQKHASENCD-KGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGLPT
         + Q K  S++ D + S  + PW +A+L PDLE A+I++Q    KRESLKY+RGDK   +  + I  LS IE+ K E     + E LKVVIP GNHGLPT
Subjt:  PKLQQKHASENCD-KGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGLPT

Query:  VESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQ-AHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAILHA
         E+  PS L+ RWR GGGCDCGGWDM CPL++LG P   C+ +Q   + +   +LF QG K+  PAL M  V++GQY V FHA+LSTLQAFSICVAILH 
Subjt:  VESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQ-AHKGKQIFELFHQGVKDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAILHA

Query:  TEACNAVQM-EETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETP---PSYLFNPPFSPIARV
        TE  ++ +  E  ++  HCNSLK+L++++V+FL++ VT EE+    + LKE      SY+ NPPFSPI+RV
Subjt:  TEACNAVQM-EETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETP---PSYLFNPPFSPIARV

AT5G01030.1 Protein of unknown function (DUF3527)2.7e-2138.56Show/hide
Query:  ESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAH
        ES+ +   ++  G+   A    + + ++K     +T+ E   V+IP G H  P  E   PS L+ RWR GG CDCGGWD+GC L +L        +   H
Subjt:  ESLKYKRGDKTSGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAH

Query:  KGKQIFELFHQGV--KDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAIL
        K  Q F LF Q V  +D++PAL M  +K G Y V+F + +S LQAF +CV +L
Subjt:  KGKQIFELFHQGV--KDTTPALTMDVVKDGQYAVDFHARLSTLQAFSICVAIL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACAAGAGCCAGAATTGGACTTTGACAGCCTTTGCTCTGTGGATCTGAGTCCTAACACTGTTCTTCCATCGGTTCCACGGCATTCAAGCATTAAAAAGAGAAGCAC
GAGAAAAAAACATAAGCATAAAGATTTCGTATTGAATGTAAAGGATGATTTTACTGAGATTAGATTTGGTGGCAGTTGCAGGTCTCGAAAAAGCAATTCATCTAGCCTAG
TTAAACTGGAAGACAACGAGCATAAAGATTTAGAATTGAATGTGAAGGATGATTTTACAGACATTCGATTTGGCCTCGACCACAGTTCTCATAAAAGCAATTCACCTAGC
CTAGTTGAATTGGAAGATGACGAAGTGCTCAAGCGGCGTTCTAAGTACCAAAGCTCTGAAGATGTATGTAACATAGAAGGGATGGGAATTCAAGGGGAAAGGAGAAAAAT
AGAAATTTCACATGACAATTACACCTCTTGGGCTTCTGGTATTGTTGACTCTTTGTGTAGTTCAGATGAGGAAAACCCAGAGAGGATATCTCCAGTTTTATCTAAACTGG
ATCAATCGTCAGTTAAGAAGACTTGCATAGGCCCACGTTCGTCAGATAGCTTCATTGACATTTACTTAGGTTCTGAGAACAGTGAAACTGCATCGGAAGACTCGAGCAAC
CATCTTCACAATAGCAAGAAATTTTTCAAGAGAGACAAGGTTCACGCATTGCAGAAGTCACTCTCTGCTAAGGTAGAAATGTCTAAAAACCAACTACCATTGGAAAGCAA
TCCACGGTTTAGACGCAGCCCAAAGGTTCATATAAGTCCTTTCAGGAAAATGTTGGATCCATTTATGAAATCTAAATCTGTCAGGAGTCGTTTCAGTCATGCAGTCGAAG
CTGGTGGAGATAAAGCTGTTAAGTCAATCGACTTACTGAGAAACGAGACATACAGAAAATCTTTGTTGCAAGATTTCTCAAATACAGCAAAGAGTTCAGATTGTGATTCT
CATTTTCTGAATAATGATAGTCATCATAATGTTGTTGCATCTTCACCCGTGCATTTACATGGCTCTCTCAAGTTGGAAAAAAAACATGGGATGCCGTTTTTTGAGTTCTC
TCAGAGCTTTCCCGAAGATGTCTACGTGGCCAAGGCGTGGAAGACAGGCAATGCTTTTAAATGGGTATACACATTTCATACCCAAGATCATCAAAAAAAGAGTAGTGCAA
GTAGTTTTGGTTTGAATCACAGCTGCAAAAACTCCTTAATGGTGGGTCAGATGCAAGTTTCCTGTTATTTATCCTCTGAACTTAAAGATGATGGTTTTAACAACTCGATG
GTCACAGAATTTGTTCTGTACGATACTGCTCGTGCTAGACAAAGTACTGCATCCCAAGAAAGTTGTGACTCTATCCAAGATGCTGTCAAACCTCCTAAAGGTTCTAATTC
AGGCTTAGTTGGGGAAGCTTTCAGTGTTAATGATGGGACTCCTCTAGAAAAACCCAAACTTCAGCAAAAACATGCTTCTGAAAACTGTGATAAGGGTTCTATAGATTCTT
GTCCTTGGGATTCGGCAGATTTACCTCCAGACCTGGAATCTGCATCTATTGTTATGCAAATCCCATTTAGCAAAAGGGAGAGCTTGAAATATAAAAGAGGTGATAAAACA
AGCGGTAAACTGAACTCAGCCATACAAAACCTCTCTAAGATTGAACAAAGGAAAGATGAACCTTCTCATCACACAACTCAAGAAACTTTAAAGGTGGTAATTCCCATTGG
AAACCATGGTTTGCCAACTGTTGAAAGCCACGGCCCTTCTACGTTACTCGATCGATGGAGGTTGGGAGGTGGTTGTGACTGTGGTGGTTGGGACATGGGTTGTCCTCTAC
TGATTTTGGGCACCCCCAGTAGCCACTGTGCCGAAAATCAAGCGCACAAGGGGAAACAGATTTTTGAGCTTTTTCATCAGGGGGTAAAGGATACCACTCCGGCGTTGACC
ATGGATGTTGTAAAAGATGGACAATACGCTGTCGATTTCCATGCACGATTATCAACATTGCAAGCATTCTCCATTTGTGTTGCTATTTTACATGCGACTGAAGCCTGCAA
TGCTGTCCAAATGGAGGAAACAAAAGAGTTGCAACATTGTAATTCCTTAAAAGTGCTTCTTGAGGAGGAAGTAAAATTCTTGATCGACACAGTCACGATGGAGGAGAAGA
AAAGAGAAACAAGAATGTTGAAGGAAACACCACCATCATATTTGTTCAATCCCCCATTTTCTCCTATTGCTAGAGTGTAG
mRNA sequenceShow/hide mRNA sequence
TCTCTCTTCCTATCATTTTCACTCACTCTCTCTAACGCATGCTTTCTCTCTCTAAAATTTGCCTTGCATTACACACTCTCTCAAAGGGTTCACTTTTGTGTTCATTATGC
TTTAAGACCATTGCCCAACTTCTTCTGATCATTTTTTGTTTCCACCCTTCAACTTTCCTTGATTCTTTCTGAAATTCTCACTTAGTGACAGCTTCAATGTTTCTTTATTT
GGGGTTTCTTCGCCATGGATTCTATTGAAGCTTTCAGCTGAAGAGATTGGTCGACATGATTGCTTCTTCCTGTTGTTTGTTGGGTTTTTCTTTGCCTTCACTCTTCTGAG
ATCTGGTGGGAGCTGTGGATCACACAATGGGACAAGAGCCAGAATTGGACTTTGACAGCCTTTGCTCTGTGGATCTGAGTCCTAACACTGTTCTTCCATCGGTTCCACGG
CATTCAAGCATTAAAAAGAGAAGCACGAGAAAAAAACATAAGCATAAAGATTTCGTATTGAATGTAAAGGATGATTTTACTGAGATTAGATTTGGTGGCAGTTGCAGGTC
TCGAAAAAGCAATTCATCTAGCCTAGTTAAACTGGAAGACAACGAGCATAAAGATTTAGAATTGAATGTGAAGGATGATTTTACAGACATTCGATTTGGCCTCGACCACA
GTTCTCATAAAAGCAATTCACCTAGCCTAGTTGAATTGGAAGATGACGAAGTGCTCAAGCGGCGTTCTAAGTACCAAAGCTCTGAAGATGTATGTAACATAGAAGGGATG
GGAATTCAAGGGGAAAGGAGAAAAATAGAAATTTCACATGACAATTACACCTCTTGGGCTTCTGGTATTGTTGACTCTTTGTGTAGTTCAGATGAGGAAAACCCAGAGAG
GATATCTCCAGTTTTATCTAAACTGGATCAATCGTCAGTTAAGAAGACTTGCATAGGCCCACGTTCGTCAGATAGCTTCATTGACATTTACTTAGGTTCTGAGAACAGTG
AAACTGCATCGGAAGACTCGAGCAACCATCTTCACAATAGCAAGAAATTTTTCAAGAGAGACAAGGTTCACGCATTGCAGAAGTCACTCTCTGCTAAGGTAGAAATGTCT
AAAAACCAACTACCATTGGAAAGCAATCCACGGTTTAGACGCAGCCCAAAGGTTCATATAAGTCCTTTCAGGAAAATGTTGGATCCATTTATGAAATCTAAATCTGTCAG
GAGTCGTTTCAGTCATGCAGTCGAAGCTGGTGGAGATAAAGCTGTTAAGTCAATCGACTTACTGAGAAACGAGACATACAGAAAATCTTTGTTGCAAGATTTCTCAAATA
CAGCAAAGAGTTCAGATTGTGATTCTCATTTTCTGAATAATGATAGTCATCATAATGTTGTTGCATCTTCACCCGTGCATTTACATGGCTCTCTCAAGTTGGAAAAAAAA
CATGGGATGCCGTTTTTTGAGTTCTCTCAGAGCTTTCCCGAAGATGTCTACGTGGCCAAGGCGTGGAAGACAGGCAATGCTTTTAAATGGGTATACACATTTCATACCCA
AGATCATCAAAAAAAGAGTAGTGCAAGTAGTTTTGGTTTGAATCACAGCTGCAAAAACTCCTTAATGGTGGGTCAGATGCAAGTTTCCTGTTATTTATCCTCTGAACTTA
AAGATGATGGTTTTAACAACTCGATGGTCACAGAATTTGTTCTGTACGATACTGCTCGTGCTAGACAAAGTACTGCATCCCAAGAAAGTTGTGACTCTATCCAAGATGCT
GTCAAACCTCCTAAAGGTTCTAATTCAGGCTTAGTTGGGGAAGCTTTCAGTGTTAATGATGGGACTCCTCTAGAAAAACCCAAACTTCAGCAAAAACATGCTTCTGAAAA
CTGTGATAAGGGTTCTATAGATTCTTGTCCTTGGGATTCGGCAGATTTACCTCCAGACCTGGAATCTGCATCTATTGTTATGCAAATCCCATTTAGCAAAAGGGAGAGCT
TGAAATATAAAAGAGGTGATAAAACAAGCGGTAAACTGAACTCAGCCATACAAAACCTCTCTAAGATTGAACAAAGGAAAGATGAACCTTCTCATCACACAACTCAAGAA
ACTTTAAAGGTGGTAATTCCCATTGGAAACCATGGTTTGCCAACTGTTGAAAGCCACGGCCCTTCTACGTTACTCGATCGATGGAGGTTGGGAGGTGGTTGTGACTGTGG
TGGTTGGGACATGGGTTGTCCTCTACTGATTTTGGGCACCCCCAGTAGCCACTGTGCCGAAAATCAAGCGCACAAGGGGAAACAGATTTTTGAGCTTTTTCATCAGGGGG
TAAAGGATACCACTCCGGCGTTGACCATGGATGTTGTAAAAGATGGACAATACGCTGTCGATTTCCATGCACGATTATCAACATTGCAAGCATTCTCCATTTGTGTTGCT
ATTTTACATGCGACTGAAGCCTGCAATGCTGTCCAAATGGAGGAAACAAAAGAGTTGCAACATTGTAATTCCTTAAAAGTGCTTCTTGAGGAGGAAGTAAAATTCTTGAT
CGACACAGTCACGATGGAGGAGAAGAAAAGAGAAACAAGAATGTTGAAGGAAACACCACCATCATATTTGTTCAATCCCCCATTTTCTCCTATTGCTAGAGTGTAGGATA
AAGGCCAACTTGATCAGTACAGATATTATGGCTTCTTCAGATGGTTCCACGTTGACTTTAGAGAAATTGCTTGACTTGTGTTATGGAAGCTGCTTGATAAAGCTCAAATG
GAAAGAATTGTACAAGGAACCTTCTTAGCAACAGCACGGGCTCTCGGTGCCGGGGCATTTGAATTCTTTTGTATAACAGAGCAAGAGCTTGCTTAGAGAAGCTGAAATGA
CTGTTAGGAGAGACTTGGGAAGATGCTAATTGGGGAGCTCACATCATTGTGATAGAAGGTGTCAGGACCCTTTTATTTATTATTCTTCAAAAACCAAATCCTTTTAAATA
ACTTGATAGTTAACTGTCCATACTGTAATATGGTAGGGAGATGAGTAGCCAATTGATAACTAGAAAGATAGCATTGTCATTCACATTGGACTTTTTGTACATTATATTGT
TAAGAGGG
Protein sequenceShow/hide protein sequence
MGQEPELDFDSLCSVDLSPNTVLPSVPRHSSIKKRSTRKKHKHKDFVLNVKDDFTEIRFGGSCRSRKSNSSSLVKLEDNEHKDLELNVKDDFTDIRFGLDHSSHKSNSPS
LVELEDDEVLKRRSKYQSSEDVCNIEGMGIQGERRKIEISHDNYTSWASGIVDSLCSSDEENPERISPVLSKLDQSSVKKTCIGPRSSDSFIDIYLGSENSETASEDSSN
HLHNSKKFFKRDKVHALQKSLSAKVEMSKNQLPLESNPRFRRSPKVHISPFRKMLDPFMKSKSVRSRFSHAVEAGGDKAVKSIDLLRNETYRKSLLQDFSNTAKSSDCDS
HFLNNDSHHNVVASSPVHLHGSLKLEKKHGMPFFEFSQSFPEDVYVAKAWKTGNAFKWVYTFHTQDHQKKSSASSFGLNHSCKNSLMVGQMQVSCYLSSELKDDGFNNSM
VTEFVLYDTARARQSTASQESCDSIQDAVKPPKGSNSGLVGEAFSVNDGTPLEKPKLQQKHASENCDKGSIDSCPWDSADLPPDLESASIVMQIPFSKRESLKYKRGDKT
SGKLNSAIQNLSKIEQRKDEPSHHTTQETLKVVIPIGNHGLPTVESHGPSTLLDRWRLGGGCDCGGWDMGCPLLILGTPSSHCAENQAHKGKQIFELFHQGVKDTTPALT
MDVVKDGQYAVDFHARLSTLQAFSICVAILHATEACNAVQMEETKELQHCNSLKVLLEEEVKFLIDTVTMEEKKRETRMLKETPPSYLFNPPFSPIARV