; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025420 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025420
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTATA box-binding protein associated factor RNA polymerase I subunit C
Genome locationchr10:12727981..12730722
RNA-Seq ExpressionLag0025420
SyntenyLag0025420
Gene Ontology termsGO:0006360 - transcription by RNA polymerase I (biological process)
GO:0001650 - fibrillar center (cellular component)
GO:0001164 - RNA polymerase I CORE element sequence-specific DNA binding (molecular function)
InterPro domainsIPR038801 - TATA box-binding protein-associated factor RNA polymerase I subunit C


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0043170.1 uncharacterized protein E6C27_scaffold110G00340 [Cucumis melo var. makuwa]0.0e+0081.49Show/hide
Query:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR
        M EE+WKSLFPIG+VF SPLL+S SSVK+SIGPLVFNPVP+SL RLFSS SLLPSLSPPS+LNL RFLLTSS + PSTSSSVA LFG+QQ   D  S LR
Subjt:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR

Query:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN
        YNRLQ L CPNS+ VVVFFPTGPNSDHVGFLVVS +GSGL VQSDC+N+VFSVESELNYQIFGIAVNP   LG   D  V IGFLLAYTMYSVEWF+VKN
Subjt:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN

Query:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL
        +A      P VSLVHMGSKVF+TCSVVHACW+PHL EESVVLLEDGSLFLFDMEPLLKA+N N NA++KGI+L+VSWD LDCSKKVKWLSC+FSWHPRIL
Subjt:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL

Query:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN
        IVARSDAVFLVDLRE ECSISCL+KIET  S SL E+EQFLAFSKAGSDGFYF VAS  LLLLCDIRKPLSPVLQWTHG+D+PSYV+VFSLSELRS   N
Subjt:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN

Query:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
        SMYK+ASE GYCIVLGSFWS EFN FCYGPS PALDQS+SS+SSKYFQSLYAWE PSNLILSGRECPC SCL RQE+LKDAI EWVEWQQKKEIVLGF I
Subjt:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR
        LDNNLSLP    QNEYGSFTLVRLMSSG+LEAQTYQASWN +K+ID  HKESLNLNDYLLYGWL+DDKYRF+RR+ YFNFDYLMGYLN+ LDE++DSFMR
Subjt:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR

Query:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP
        K SKDS+CE+SL+ E+HEVLCEK+KACGF RLRS+P LAVVFNDISLP+SIQEI FRKLW+SLPMELLHF+FSSYSEFLENKN VS+EFLSVPSLHQLPP
Subjt:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP

Query:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA
        F+LRDPS+RSNKWS+KV RTEN+VGPVLPLPILLVLHEFRNGCSK EEE  GKFSLEAE  EQYD+IRSAAGEMA+SPF+PKVDDGPAVSL +DREYVSA
Subjt:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA

Query:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWE
        +S+KPKNFVS+HP AFNS T    NTQ N TN  +VFDSLIFKLE GK  SSEKSENNASRELY+GLCPVEL+F+A  ++FG KELKAY++LKRQLLKWE
Subjt:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWE

Query:  DGFDGYKEFCSKI
        DGFD YKEF SKI
Subjt:  DGFDGYKEFCSKI

XP_008459007.1 PREDICTED: uncharacterized protein LOC103498249 [Cucumis melo]0.0e+0081.27Show/hide
Query:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR
        M EE+WKSLFPIG+V  SPLL+S SSVK+SIGPLVFNPVP+SL RLFSS SLLPSLSPPS+LNL RFLLTSS + PSTSSSVA LFG+QQ   D  S LR
Subjt:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR

Query:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN
        YNRLQ L CPNS+ VVVFFPTGPNSDHVGFLVVS +GSGL VQSDC+N+VFSVESELNYQIFGIAVNP   LG   D  V IGFLLA+TMYSVEWF+VKN
Subjt:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN

Query:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL
        +A      P VSLVHMGSKVF+TCSVVHACW+PHL EESVVLLEDGSLFLFDMEPLLKA+N N NA+LKGI+L+VSWD LDCSKKVKWLSC+FSWHPRIL
Subjt:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL

Query:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN
        IVARSDAVFLVDLRE ECSISCL+KIET  S SL E+EQFLAFSKAGSDGFYF VAS  LLLLCDIRKPLSPVLQWTHG+D+PSYV+VFSLSELRS   N
Subjt:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN

Query:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
        SMYK+ASE GYCIVLGSFWS EFN FCYGPS PALDQS+SS+SSKYFQSLYAWERPSNLILSGRECPC SCL RQE+LKDAI EWVEWQQKKEIVLGF I
Subjt:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR
        LDNNLSLP    QNEYGSFTLVRLMSSG+LEAQTYQASWN +K+ID  HKESLNLNDYLLYGWL+DDKYRF+RR+ YFNFDYLMGYLN+ LDE++DSFMR
Subjt:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR

Query:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP
        K SKD++CE+SL+ E+HEVLCEK+KACGF RLRS+P LAVVFNDISLP+SIQEI FRKLW+SLPMELLHF+FSSYSEFLENKN VS+EFLSVPSLHQLPP
Subjt:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP

Query:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA
        F+LRDPS+RSNKWS+KV RTEN+VGPVLPLPILLVLHEFRNGCSK EEE  GKFSLEAE  EQYD+IRSAAGEMA+SPF+PKVDDGPAVSL +DREYVSA
Subjt:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA

Query:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWE
        +S+KPKNFVS+HP AFNS T    N Q N TN  +VFDSLIFKLE GK  SSEKSENNASRELY+GLCPVEL+F+A  ++FG KELKAY++LKRQLLKWE
Subjt:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWE

Query:  DGFDGYKEFCSKI
        DGFD YKEF SKI
Subjt:  DGFDGYKEFCSKI

XP_023004933.1 uncharacterized protein LOC111498092 [Cucurbita maxima]0.0e+0079.43Show/hide
Query:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR
        M EE+WKSLFPIG+VF SPLLLS SS KDSIGP+VFNP+ +SL RLFSS S LPSLSPPSILNL RFL TSS + PSTSSSV  LFG+Q  + DAASTLR
Subjt:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR

Query:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN
        YNRLQ LRCPNSN +VVFFPTGPNSDHVGFLVVSG+ SGL VQSDC+N+VFSVESEL YQI GI+VNPVSDLGF+GD  + IGFLLAYTMYSVEWF+VK+
Subjt:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN

Query:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL
         ATD  F P+VSLVH+GSKVF++CSVVHACWSPHL EESVVLLEDGSLFLFDMEPLLKA+N +  A+LKGIRLRVSWDS DCSKKVKWLSCEFSWHPRIL
Subjt:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL

Query:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN
        IVARSDAV LVDLRE E SISCLVKI+ FHSYSLA+REQFLAFSKAGSDGF+F VAS SLL+LCDIRKP+SPVLQWTH +DEP Y++VFSLS+LRS ASN
Subjt:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN

Query:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
         +Y+LASE GYCI+LGSFWSCEFNIFCYGPS P L QSVSS+SSKYFQSLYAWERPSNLILSGREC CGSCLVRQET KDAIPEWVEWQQK+EIVLGFGI
Subjt:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR
        LD +LS PL   QNE+G FTL+RL+SSG LE+QTYQASWN +K ID+SHKESLNL DY LYGWLVDDKYRFSR+F YF+F+YLMGYLN++LDE+LDSF R
Subjt:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR

Query:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP
        K SKDS+CER+LTSEIH VLCEKLKACGF RLR+SP LAVVFNDISLP SIQEI F+KLW+SLPM+LLHFAFS+YSEFLE+KN VSLEF +VPSLHQLPP
Subjt:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP

Query:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA
        F+LR+PSSRSNKWS KV RTE+LVGPVLPLPILLVLHEF+NGCSK  EEEAGKFSL+ ELGEQYDQIR AA EMA+SP D KVDDGP VSL++D+EYV +
Subjt:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA

Query:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENAT-NLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKW
        DS+KPKNFVSYHPSAF+SHT    NTQ N+T +  DVFDSLIFKLE       EKS+N    EL+DGLCPVELKFD RP+NF P ELKAY LLK+QLLKW
Subjt:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENAT-NLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKW

Query:  EDGFDGYKEFCSKI
         DGF  YKEF SKI
Subjt:  EDGFDGYKEFCSKI

XP_031738950.1 uncharacterized protein LOC101205590 [Cucumis sativus]0.0e+0080.94Show/hide
Query:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR
        M EE+WKSLFPIG+VF SPLL+S SSVK+SIGPLVFNPVP+SL RLFSS SLLPSLSPPS+LNL RFLLTSS + PSTSSSVA LFG+QQ   D  S LR
Subjt:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR

Query:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN
        YNRLQ L CPNS+ VVVFFPTGPNSDHVGFLVVS +GSGL VQSDC+N+VFSVESELNYQIFGIAVNP S  GF  D    IGFLLAYTMYSVEWF+VKN
Subjt:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN

Query:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL
        +A      PRVSLVHMGSKVF+TCSVVHACW+PHL EESVVLLEDGSLFLFDMEPLLK ++ N N +LKGI+L+VSWD LDCSKKVKWLSCEFSWHPRIL
Subjt:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL

Query:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN
        IVARSDAVFLVDLRE +C+ISCL+KIETF +YSL E+EQFLAFSKAGSDGFYF +AS  LLLLCDIRKPLSPVLQWTHG+D+PSY++VFSLSELRS   N
Subjt:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN

Query:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
         MYK+ASE GYCIVLGSFWS EFNIFCYGPS P LDQS+SS+SSKYFQS YAWERPSNLILSGRECPC SCL +QE+LKDAI EWVEWQQKKEIVLGF I
Subjt:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR
        LDNNLSLP    QNEYGSFTL+RLMSSG+LEAQTYQASWN +K+ID  HKESLNLNDYLLYGWLVDDKYRF+RR+ YFNFDYLMGYLN+ LDE++DSFMR
Subjt:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR

Query:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP
        K  KDS+CE+SL+ E+HEVLCEK+KACGF RLRS+P LAVVFNDISLP+SIQEI FRKLW+SLPMELLHF+FSSYSEFL+NKN VS EFLSVPSLHQLPP
Subjt:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP

Query:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA
        F+LRDPSSRS KWS+KV RTEN+VGPVLPLPILLVLHEFRNGCSK EEEEAGKFS+EAE  EQYD+IRSAAGEMA+SPFDPKVDDGPAVSL +DREYVSA
Subjt:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA

Query:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWE
        +S+KPK+FVSY+P AFNSHT   D+TQ N TN  +VFDSLIFKL GGK  SSEKS+NNASRELY+GLCPVEL+F+A  ++FG KELKAY+LLKRQLLKWE
Subjt:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWE

Query:  DGFDGYKEFCSKI
        DGFD YKEF SKI
Subjt:  DGFDGYKEFCSKI

XP_038894321.1 uncharacterized protein LOC120082953 [Benincasa hispida]0.0e+0084.04Show/hide
Query:  MPEEQWKSLFPIGSVFNSPLLL--SASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAAST
        M EE+WKSLFPIG+VF SPLLL  S+SSVK+SIGPLVFNPVP+SL RLFS+PSLLPSLSPPSILNLRRFLLTSSP+ PSTSSSVA LFG+QQ  GDAAST
Subjt:  MPEEQWKSLFPIGSVFNSPLLL--SASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAAST

Query:  LRYNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVV
        LR+NRLQFL CPNS+ VVVFFPTGPNSDHVGFLVVSG+ SGL VQSD +N+VFSVE+ELNYQIFGIAVNP   LGF+GD SV IGFLLAYTMYSVEWFVV
Subjt:  LRYNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVV

Query:  KNNATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPR
        +N+A D    PRVSLV+MGSKVF+TCSVVHACW+PHL EESVVLLEDGSLFLFDMEPLLKA+N N NA+LKGIRL+VSWD LDCSKKVKWLSCEFSWHPR
Subjt:  KNNATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPR

Query:  ILIVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRA
        ILIVARSDA+FLVDLRE ECSISCL+KIETF SYSLAE+ QFLAFSKAGSDGFYFC+AS  LLLLCDIRKP+SPVLQWTH +D+PSYV+VFSLSELRS  
Subjt:  ILIVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRA

Query:  SNSMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGF
         NSMYKLASE GYCIVLGSFWSCEFNIFCYGPS PALDQSVSS+SSKYFQSLYAWERPSN ILSGRECPC SCL+RQE+LKDAIPEWVEWQQKKEIVLGF
Subjt:  SNSMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGF

Query:  GILDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSF
         ILDNNLSLP    QNEYGSFTL+RLMSSG+LEAQTYQASWN +K+ID+ HK+SL+L+DYLLYG LVDDKYRFSRR+ YFNFDYLMGYLN++LD++LDSF
Subjt:  GILDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSF

Query:  MRKSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQL
        MRK SKDS+CERSLT E+HEVLCEKLKACGF RLRS+P LAVVFNDI+LP+SIQEI F+KLW+SLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSL+QL
Subjt:  MRKSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQL

Query:  PPFILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYV
        PPF+LRD SSRSNKWS+KV+RTEN+VGPVLPLPILL+LHEFRNGCSK EEEEAGKFSLEAE  EQYD+IRSAAGEMA SPFDPKVDDGPAVSLA+D+EYV
Subjt:  PPFILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYV

Query:  SADSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLK
        SA+S+KPKNFVSYHP AFNSHT   DNTQ N+TN  DVFDSLIFKL+GGK  SSEKSENNAS ELY+ LCPVEL+F+A  VNFGPKELKAY LLKRQLLK
Subjt:  SADSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLK

Query:  WEDGFDGYKEFCSKI
        WEDGFD YKEF SKI
Subjt:  WEDGFDGYKEFCSKI

TrEMBL top hitse value%identityAlignment
A0A0A0M158 Uncharacterized protein0.0e+0080.94Show/hide
Query:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR
        M EE+WKSLFPIG+VF SPLL+S SSVK+SIGPLVFNPVP+SL RLFSS SLLPSLSPPS+LNL RFLLTSS + PSTSSSVA LFG+QQ   D  S LR
Subjt:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR

Query:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN
        YNRLQ L CPNS+ VVVFFPTGPNSDHVGFLVVS +GSGL VQSDC+N+VFSVESELNYQIFGIAVNP S  GF  D    IGFLLAYTMYSVEWF+VKN
Subjt:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN

Query:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL
        +A      PRVSLVHMGSKVF+TCSVVHACW+PHL EESVVLLEDGSLFLFDMEPLLK ++ N N +LKGI+L+VSWD LDCSKKVKWLSCEFSWHPRIL
Subjt:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL

Query:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN
        IVARSDAVFLVDLRE +C+ISCL+KIETF +YSL E+EQFLAFSKAGSDGFYF +AS  LLLLCDIRKPLSPVLQWTHG+D+PSY++VFSLSELRS   N
Subjt:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN

Query:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
         MYK+ASE GYCIVLGSFWS EFNIFCYGPS P LDQS+SS+SSKYFQS YAWERPSNLILSGRECPC SCL +QE+LKDAI EWVEWQQKKEIVLGF I
Subjt:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR
        LDNNLSLP    QNEYGSFTL+RLMSSG+LEAQTYQASWN +K+ID  HKESLNLNDYLLYGWLVDDKYRF+RR+ YFNFDYLMGYLN+ LDE++DSFMR
Subjt:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR

Query:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP
        K  KDS+CE+SL+ E+HEVLCEK+KACGF RLRS+P LAVVFNDISLP+SIQEI FRKLW+SLPMELLHF+FSSYSEFL+NKN VS EFLSVPSLHQLPP
Subjt:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP

Query:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA
        F+LRDPSSRS KWS+KV RTEN+VGPVLPLPILLVLHEFRNGCSK EEEEAGKFS+EAE  EQYD+IRSAAGEMA+SPFDPKVDDGPAVSL +DREYVSA
Subjt:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA

Query:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWE
        +S+KPK+FVSY+P AFNSHT   D+TQ N TN  +VFDSLIFKL GGK  SSEKS+NNASRELY+GLCPVEL+F+A  ++FG KELKAY+LLKRQLLKWE
Subjt:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWE

Query:  DGFDGYKEFCSKI
        DGFD YKEF SKI
Subjt:  DGFDGYKEFCSKI

A0A1S3C9R8 uncharacterized protein LOC1034982490.0e+0081.27Show/hide
Query:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR
        M EE+WKSLFPIG+V  SPLL+S SSVK+SIGPLVFNPVP+SL RLFSS SLLPSLSPPS+LNL RFLLTSS + PSTSSSVA LFG+QQ   D  S LR
Subjt:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR

Query:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN
        YNRLQ L CPNS+ VVVFFPTGPNSDHVGFLVVS +GSGL VQSDC+N+VFSVESELNYQIFGIAVNP   LG   D  V IGFLLA+TMYSVEWF+VKN
Subjt:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN

Query:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL
        +A      P VSLVHMGSKVF+TCSVVHACW+PHL EESVVLLEDGSLFLFDMEPLLKA+N N NA+LKGI+L+VSWD LDCSKKVKWLSC+FSWHPRIL
Subjt:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL

Query:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN
        IVARSDAVFLVDLRE ECSISCL+KIET  S SL E+EQFLAFSKAGSDGFYF VAS  LLLLCDIRKPLSPVLQWTHG+D+PSYV+VFSLSELRS   N
Subjt:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN

Query:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
        SMYK+ASE GYCIVLGSFWS EFN FCYGPS PALDQS+SS+SSKYFQSLYAWERPSNLILSGRECPC SCL RQE+LKDAI EWVEWQQKKEIVLGF I
Subjt:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR
        LDNNLSLP    QNEYGSFTLVRLMSSG+LEAQTYQASWN +K+ID  HKESLNLNDYLLYGWL+DDKYRF+RR+ YFNFDYLMGYLN+ LDE++DSFMR
Subjt:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR

Query:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP
        K SKD++CE+SL+ E+HEVLCEK+KACGF RLRS+P LAVVFNDISLP+SIQEI FRKLW+SLPMELLHF+FSSYSEFLENKN VS+EFLSVPSLHQLPP
Subjt:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP

Query:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA
        F+LRDPS+RSNKWS+KV RTEN+VGPVLPLPILLVLHEFRNGCSK EEE  GKFSLEAE  EQYD+IRSAAGEMA+SPF+PKVDDGPAVSL +DREYVSA
Subjt:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA

Query:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWE
        +S+KPKNFVS+HP AFNS T    N Q N TN  +VFDSLIFKLE GK  SSEKSENNASRELY+GLCPVEL+F+A  ++FG KELKAY++LKRQLLKWE
Subjt:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWE

Query:  DGFDGYKEFCSKI
        DGFD YKEF SKI
Subjt:  DGFDGYKEFCSKI

A0A5A7TIM1 Uncharacterized protein0.0e+0081.49Show/hide
Query:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR
        M EE+WKSLFPIG+VF SPLL+S SSVK+SIGPLVFNPVP+SL RLFSS SLLPSLSPPS+LNL RFLLTSS + PSTSSSVA LFG+QQ   D  S LR
Subjt:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR

Query:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN
        YNRLQ L CPNS+ VVVFFPTGPNSDHVGFLVVS +GSGL VQSDC+N+VFSVESELNYQIFGIAVNP   LG   D  V IGFLLAYTMYSVEWF+VKN
Subjt:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN

Query:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL
        +A      P VSLVHMGSKVF+TCSVVHACW+PHL EESVVLLEDGSLFLFDMEPLLKA+N N NA++KGI+L+VSWD LDCSKKVKWLSC+FSWHPRIL
Subjt:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL

Query:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN
        IVARSDAVFLVDLRE ECSISCL+KIET  S SL E+EQFLAFSKAGSDGFYF VAS  LLLLCDIRKPLSPVLQWTHG+D+PSYV+VFSLSELRS   N
Subjt:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN

Query:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
        SMYK+ASE GYCIVLGSFWS EFN FCYGPS PALDQS+SS+SSKYFQSLYAWE PSNLILSGRECPC SCL RQE+LKDAI EWVEWQQKKEIVLGF I
Subjt:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR
        LDNNLSLP    QNEYGSFTLVRLMSSG+LEAQTYQASWN +K+ID  HKESLNLNDYLLYGWL+DDKYRF+RR+ YFNFDYLMGYLN+ LDE++DSFMR
Subjt:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR

Query:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP
        K SKDS+CE+SL+ E+HEVLCEK+KACGF RLRS+P LAVVFNDISLP+SIQEI FRKLW+SLPMELLHF+FSSYSEFLENKN VS+EFLSVPSLHQLPP
Subjt:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP

Query:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA
        F+LRDPS+RSNKWS+KV RTEN+VGPVLPLPILLVLHEFRNGCSK EEE  GKFSLEAE  EQYD+IRSAAGEMA+SPF+PKVDDGPAVSL +DREYVSA
Subjt:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA

Query:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWE
        +S+KPKNFVS+HP AFNS T    NTQ N TN  +VFDSLIFKLE GK  SSEKSENNASRELY+GLCPVEL+F+A  ++FG KELKAY++LKRQLLKWE
Subjt:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWE

Query:  DGFDGYKEFCSKI
        DGFD YKEF SKI
Subjt:  DGFDGYKEFCSKI

A0A6J1H8K3 uncharacterized protein LOC1114606340.0e+0078.67Show/hide
Query:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR
        M EE+WKSLFPIG+VF SPLLLS SS KDSIGP+VFNP+ +SL RLFSS S LPSLSPPSILNL RFL TSS + PSTSS+V  LFG+Q  + DAASTLR
Subjt:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR

Query:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN
        YNRLQ LRCPNSN VVVFFPTGPNSD VGFLVVSG+ SGL VQSDC+N+VFSVESEL YQI GI+VNPVSDL F+GD  + IGFLLAYTMYSVEWF+VK+
Subjt:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN

Query:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL
         ATD  F P+VSLVH+GSKVF++CSVVHACWSPHL EESVVLLEDGSLFLFDMEPLLKA+  +  A+LKGIRLRVSWD+ DCSKKVKWLSCEFSWHPRIL
Subjt:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL

Query:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN
        IVARSDAV LVDLRE E SISCLVKI+ FHSYSLA+REQFLAFSKAGSDGF+F VAS SLL+LCDIRKP+SPVLQWTH +DEPSY++VFSLS+LRS ASN
Subjt:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN

Query:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
         +Y+ ASE GYCI+LGSFWSCEFNIFCYGPS P L QSVSS+SSKYFQ LYAWERPSNLILSGREC CGSCLVRQET KDAIPEWVEWQQKKEIVLGFGI
Subjt:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR
        LD ++S P +  QNE+G FTL+RL+SSG LE+QTYQASWN +KRID+SHKESLNL DY LYGWLVDDKYRFSR+F YF+F+YLMGYLN++LDE+LDSF R
Subjt:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR

Query:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP
        K SKDS+CER+LTSEIH VLCEKLKACGF RLRSSP LAVVFNDISLP SIQEI F+KLW+SLPMELLHFAFS+YSEFLE+KN VSLEF +VPSLHQLPP
Subjt:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP

Query:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA
        F+LR+ SSRSNKWS KV RTE+LVGPVLPLPILLVLHEF+NGCSK  EEEAGKFSL++EL EQYDQIR AA EMA+SP D KVDDGP VSL++D+EYV +
Subjt:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA

Query:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENAT-NLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKW
        DS+KPKNFVSYHPSAF+SHT    NTQ N+T +  DVFD+LIFKLE       EKS+N    EL+DGLCPV LKFD RP+NF P ELKAY LLK+QLLKW
Subjt:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENAT-NLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKW

Query:  EDGFDGYKEFCSKI
         DGF  YKEF SKI
Subjt:  EDGFDGYKEFCSKI

A0A6J1L0V7 uncharacterized protein LOC1114980920.0e+0079.43Show/hide
Query:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR
        M EE+WKSLFPIG+VF SPLLLS SS KDSIGP+VFNP+ +SL RLFSS S LPSLSPPSILNL RFL TSS + PSTSSSV  LFG+Q  + DAASTLR
Subjt:  MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLR

Query:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN
        YNRLQ LRCPNSN +VVFFPTGPNSDHVGFLVVSG+ SGL VQSDC+N+VFSVESEL YQI GI+VNPVSDLGF+GD  + IGFLLAYTMYSVEWF+VK+
Subjt:  YNRLQFLRCPNSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKN

Query:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL
         ATD  F P+VSLVH+GSKVF++CSVVHACWSPHL EESVVLLEDGSLFLFDMEPLLKA+N +  A+LKGIRLRVSWDS DCSKKVKWLSCEFSWHPRIL
Subjt:  NATDLRFPPRVSLVHMGSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRIL

Query:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN
        IVARSDAV LVDLRE E SISCLVKI+ FHSYSLA+REQFLAFSKAGSDGF+F VAS SLL+LCDIRKP+SPVLQWTH +DEP Y++VFSLS+LRS ASN
Subjt:  IVARSDAVFLVDLREVECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASN

Query:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI
         +Y+LASE GYCI+LGSFWSCEFNIFCYGPS P L QSVSS+SSKYFQSLYAWERPSNLILSGREC CGSCLVRQET KDAIPEWVEWQQK+EIVLGFGI
Subjt:  SMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGI

Query:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR
        LD +LS PL   QNE+G FTL+RL+SSG LE+QTYQASWN +K ID+SHKESLNL DY LYGWLVDDKYRFSR+F YF+F+YLMGYLN++LDE+LDSF R
Subjt:  LDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHKESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMR

Query:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP
        K SKDS+CER+LTSEIH VLCEKLKACGF RLR+SP LAVVFNDISLP SIQEI F+KLW+SLPM+LLHFAFS+YSEFLE+KN VSLEF +VPSLHQLPP
Subjt:  KSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPP

Query:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA
        F+LR+PSSRSNKWS KV RTE+LVGPVLPLPILLVLHEF+NGCSK  EEEAGKFSL+ ELGEQYDQIR AA EMA+SP D KVDDGP VSL++D+EYV +
Subjt:  FILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDREYVSA

Query:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENAT-NLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKW
        DS+KPKNFVSYHPSAF+SHT    NTQ N+T +  DVFDSLIFKLE       EKS+N    EL+DGLCPVELKFD RP+NF P ELKAY LLK+QLLKW
Subjt:  DSKKPKNFVSYHPSAFNSHTFDDDNTQENAT-NLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKW

Query:  EDGFDGYKEFCSKI
         DGF  YKEF SKI
Subjt:  EDGFDGYKEFCSKI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G18310.1 unknown protein3.3e-16740.33Show/hide
Query:  LLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPS-LSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLRYNRLQFLRCPNSNHVVVFF
        L   SS ++SIGP   NP  S L  LFSSPSL P  LS    L   RFL  S  + PS SS++   F       D    L YNRLQFL  P+ N V+VFF
Subjt:  LLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPS-LSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLRYNRLQFLRCPNSNHVVVFF

Query:  PTGPNSDHVGFLVVS-GDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFN--GDCSVCIGFLLAYTMYSVEWFVVKNNATDLRFPPRVSLVHM
        PTG N D +GFL++S GD  GL V      +VF     L  +I  I V PVSD G       S  +G++L Y++YS+ W+ VK + +      +  L ++
Subjt:  PTGPNSDHVGFLVVS-GDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFN--GDCSVCIGFLLAYTMYSVEWFVVKNNATDLRFPPRVSLVHM

Query:  GSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRILIVARSDAVFLVDLREV
        G K F+   +V A WSPH+  E ++LL++G +F+FD+        S  +  ++G +L+VSW+S   S    WL CEF W   + IVARSDA+F++     
Subjt:  GSKVFRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRILIVARSDAVFLVDLREV

Query:  ECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASNSMYKLASELGYCIVLG
        +CS+ CL+++E   S + A  E F+ F+KAGSDGF F +AS+S + LCD R  + P+L+W H +++P ++ V+SLSEL  R   S          C+++G
Subjt:  ECSISCLVKIETFHSYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASNSMYKLASELGYCIVLG

Query:  SFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGILDNNLSLPLIEEQNEY
        SFW+ +  +FC+GPS P++ +  S        SLY WE P NL+L   +C CG CL R+  +K+++PEW++WQ+K  +VLGFG+L  N  LPL       
Subjt:  SFWSCEFNIFCYGPSLPALDQSVSSKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGILDNNLSLPLIEEQNEY

Query:  GSFTLVRLMSSGLLEAQTYQASWNLVKRID-KSHKESLNLNDYLLYGWLVDD-KYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMRKSSKDSVCERSLTS
        G FTL+RL SSG LEA  ++AS   +K ++  +HK S   +D +   +L DD +Y+F RRF Y   +YL  +    L   LDS MR  S D     S + 
Subjt:  GSFTLVRLMSSGLLEAQTYQASWNLVKRID-KSHKESLNLNDYLLYGWLVDD-KYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMRKSSKDSVCERSLTS

Query:  EIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEF---LENKNAVSLEFLSVPSLHQLPPFILRDPSSRSN
          HE LC+KLK CGFG+ RS+ ++  VF +I+ PTS+ +I  R+ WSSLP E+L  AFS+YSEF   L +K   SLEFL VP   QLPPF+LR+PSSRS+
Subjt:  EIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLWSSLPMELLHFAFSSYSEF---LENKNAVSLEFLSVPSLHQLPPFILRDPSSRSN

Query:  KWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDRE---YVSADS-KKPKN
        KWS K Q    +VGPV+PLP+L+ LHEF NGC   E+E    FS EAE   + +QI  A  ++A S           +SL  DR    ++++DS ++ K 
Subjt:  KWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSAAGEMAMSPFDPKVDDGPAVSLANDRE---YVSADS-KKPKN

Query:  FVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWEDGFDGYK
        F++Y P    + T + D  Q+  T       + + ++ G K    + +      EL+D L PVE+ F+ R VNF   ++KA    K    +W+D    Y+
Subjt:  FVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVNFGPKELKAYNLLKRQLLKWEDGFDGYK

Query:  EFCSK
        EF S+
Subjt:  EFCSK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCTGAAGAACAATGGAAGTCTCTTTTTCCAATCGGCTCTGTTTTCAACTCTCCCCTCTTACTCTCTGCTTCTTCAGTTAAAGATTCAATTGGTCCACTCGTCTTTAA
CCCTGTTCCCAGCTCTCTCATCCGCCTCTTCTCATCGCCTTCTTTACTGCCTTCCCTTTCTCCTCCTTCAATTCTCAATCTCCGCAGATTTCTTCTCACTTCTTCGCCCA
TCACTCCTTCTACTTCTTCCTCCGTCGCTTTGCTCTTCGGGGATCAGCAATGGGATGGCGATGCGGCTTCCACTCTCCGCTACAATCGCTTGCAATTCCTTCGATGCCCC
AACTCCAACCACGTTGTTGTGTTTTTTCCCACTGGCCCCAATTCCGACCACGTTGGGTTCTTAGTGGTTTCCGGGGATGGTTCAGGTTTGGGCGTTCAGTCTGATTGCAA
TAATGAGGTTTTCAGTGTTGAAAGTGAATTGAATTATCAGATTTTTGGGATTGCTGTTAACCCTGTTTCGGACTTGGGTTTTAATGGTGATTGCTCTGTTTGTATTGGGT
TTTTGTTGGCTTATACCATGTATTCTGTTGAATGGTTTGTTGTGAAAAATAATGCAACTGATTTGAGGTTTCCGCCTAGGGTTAGTTTAGTTCATATGGGTAGCAAGGTT
TTTAGGACTTGTTCTGTTGTTCATGCTTGTTGGAGTCCTCATTTGCTGGAAGAAAGTGTGGTTTTATTGGAAGATGGTAGCTTGTTCTTGTTTGACATGGAGCCTCTCTT
GAAAGCTGAAAATTCTAATATGAATGCAGATTTGAAAGGAATTAGGCTGAGAGTGTCGTGGGACAGTTTAGATTGCTCAAAAAAGGTGAAGTGGCTAAGTTGTGAGTTTA
GTTGGCATCCTAGAATCTTGATTGTTGCGCGATCTGATGCTGTTTTCTTAGTTGATTTAAGGGAGGTTGAGTGTAGCATTTCTTGTTTAGTGAAGATAGAGACGTTCCAC
TCATATTCTTTGGCTGAAAGGGAACAATTCCTTGCATTTTCAAAAGCAGGTTCCGATGGATTCTATTTTTGTGTGGCTTCAAAAAGTCTGTTACTTCTTTGTGACATACG
TAAACCGCTATCACCGGTGTTGCAATGGACTCATGGTATTGATGAACCAAGCTATGTGAGTGTTTTTAGCTTGTCTGAGTTGAGGTCTCGTGCGAGCAATAGTATGTACA
AATTAGCTTCTGAGTTAGGCTATTGCATTGTACTGGGATCCTTTTGGAGTTGTGAGTTCAACATCTTTTGCTATGGACCTTCCCTACCAGCTCTTGACCAATCTGTTTCT
TCAAAAAGTTCAAAATATTTCCAGTCACTTTATGCGTGGGAGCGTCCTTCGAATCTCATATTATCAGGTCGAGAGTGTCCATGCGGTAGTTGCCTTGTAAGACAAGAAAC
TTTGAAGGACGCAATTCCTGAATGGGTAGAGTGGCAGCAGAAGAAGGAAATAGTGTTGGGCTTTGGCATCCTGGATAATAATCTCTCTCTACCACTTATAGAAGAACAAA
ATGAATATGGTAGTTTTACACTTGTAAGGCTTATGTCATCTGGGTTGCTTGAAGCACAGACTTATCAAGCCTCCTGGAACTTGGTGAAGCGGATAGACAAATCTCATAAA
GAATCATTGAATCTTAATGATTATTTGCTATATGGGTGGTTGGTTGATGACAAATATAGATTCAGCAGAAGATTCAGGTACTTTAATTTTGACTACCTCATGGGATATTT
AAATAATAGTTTAGATGAAATTCTTGATTCTTTCATGAGGAAGTCTAGTAAGGATTCTGTATGCGAGCGATCTTTGACCTCGGAAATTCATGAAGTTTTGTGTGAGAAGT
TAAAAGCTTGTGGGTTTGGTCGTTTGAGATCATCTCCAACACTTGCTGTTGTGTTCAATGACATTAGCCTGCCTACAAGTATACAAGAGATTACTTTCAGGAAATTGTGG
TCAAGCTTACCCATGGAACTTTTACATTTTGCCTTCTCTAGTTACTCTGAATTCCTCGAGAACAAGAATGCAGTATCTTTAGAGTTCTTAAGCGTACCTAGTCTACATCA
GTTGCCCCCTTTTATCCTAAGGGACCCATCAAGTCGCAGTAATAAGTGGTCATATAAAGTGCAGCGAACTGAAAATCTTGTGGGTCCAGTACTTCCCCTTCCCATTCTGC
TCGTACTTCATGAGTTTCGAAATGGATGTTCAAAGTTCGAAGAAGAAGAGGCAGGAAAATTTTCATTAGAGGCAGAACTTGGCGAGCAATATGATCAAATCAGGTCTGCA
GCTGGTGAGATGGCTATGTCACCGTTTGACCCAAAGGTTGATGATGGTCCTGCGGTCTCTCTTGCGAATGATCGAGAATACGTTTCTGCCGATTCTAAGAAGCCAAAGAA
TTTTGTTTCATATCACCCATCTGCCTTTAATTCCCATACTTTTGATGATGATAACACACAAGAAAATGCGACCAATCTTACAGATGTATTTGATTCCTTAATATTCAAAT
TAGAAGGGGGAAAGGCTACATCCAGTGAGAAGTCTGAAAACAACGCTAGTAGAGAACTGTACGACGGTCTCTGCCCTGTTGAGTTGAAATTCGATGCTCGTCCCGTGAAC
TTCGGGCCAAAAGAGTTGAAGGCATATAATTTACTAAAAAGGCAACTGTTGAAATGGGAAGATGGGTTTGATGGGTACAAGGAATTCTGCTCCAAGATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCTGAAGAACAATGGAAGTCTCTTTTTCCAATCGGCTCTGTTTTCAACTCTCCCCTCTTACTCTCTGCTTCTTCAGTTAAAGATTCAATTGGTCCACTCGTCTTTAA
CCCTGTTCCCAGCTCTCTCATCCGCCTCTTCTCATCGCCTTCTTTACTGCCTTCCCTTTCTCCTCCTTCAATTCTCAATCTCCGCAGATTTCTTCTCACTTCTTCGCCCA
TCACTCCTTCTACTTCTTCCTCCGTCGCTTTGCTCTTCGGGGATCAGCAATGGGATGGCGATGCGGCTTCCACTCTCCGCTACAATCGCTTGCAATTCCTTCGATGCCCC
AACTCCAACCACGTTGTTGTGTTTTTTCCCACTGGCCCCAATTCCGACCACGTTGGGTTCTTAGTGGTTTCCGGGGATGGTTCAGGTTTGGGCGTTCAGTCTGATTGCAA
TAATGAGGTTTTCAGTGTTGAAAGTGAATTGAATTATCAGATTTTTGGGATTGCTGTTAACCCTGTTTCGGACTTGGGTTTTAATGGTGATTGCTCTGTTTGTATTGGGT
TTTTGTTGGCTTATACCATGTATTCTGTTGAATGGTTTGTTGTGAAAAATAATGCAACTGATTTGAGGTTTCCGCCTAGGGTTAGTTTAGTTCATATGGGTAGCAAGGTT
TTTAGGACTTGTTCTGTTGTTCATGCTTGTTGGAGTCCTCATTTGCTGGAAGAAAGTGTGGTTTTATTGGAAGATGGTAGCTTGTTCTTGTTTGACATGGAGCCTCTCTT
GAAAGCTGAAAATTCTAATATGAATGCAGATTTGAAAGGAATTAGGCTGAGAGTGTCGTGGGACAGTTTAGATTGCTCAAAAAAGGTGAAGTGGCTAAGTTGTGAGTTTA
GTTGGCATCCTAGAATCTTGATTGTTGCGCGATCTGATGCTGTTTTCTTAGTTGATTTAAGGGAGGTTGAGTGTAGCATTTCTTGTTTAGTGAAGATAGAGACGTTCCAC
TCATATTCTTTGGCTGAAAGGGAACAATTCCTTGCATTTTCAAAAGCAGGTTCCGATGGATTCTATTTTTGTGTGGCTTCAAAAAGTCTGTTACTTCTTTGTGACATACG
TAAACCGCTATCACCGGTGTTGCAATGGACTCATGGTATTGATGAACCAAGCTATGTGAGTGTTTTTAGCTTGTCTGAGTTGAGGTCTCGTGCGAGCAATAGTATGTACA
AATTAGCTTCTGAGTTAGGCTATTGCATTGTACTGGGATCCTTTTGGAGTTGTGAGTTCAACATCTTTTGCTATGGACCTTCCCTACCAGCTCTTGACCAATCTGTTTCT
TCAAAAAGTTCAAAATATTTCCAGTCACTTTATGCGTGGGAGCGTCCTTCGAATCTCATATTATCAGGTCGAGAGTGTCCATGCGGTAGTTGCCTTGTAAGACAAGAAAC
TTTGAAGGACGCAATTCCTGAATGGGTAGAGTGGCAGCAGAAGAAGGAAATAGTGTTGGGCTTTGGCATCCTGGATAATAATCTCTCTCTACCACTTATAGAAGAACAAA
ATGAATATGGTAGTTTTACACTTGTAAGGCTTATGTCATCTGGGTTGCTTGAAGCACAGACTTATCAAGCCTCCTGGAACTTGGTGAAGCGGATAGACAAATCTCATAAA
GAATCATTGAATCTTAATGATTATTTGCTATATGGGTGGTTGGTTGATGACAAATATAGATTCAGCAGAAGATTCAGGTACTTTAATTTTGACTACCTCATGGGATATTT
AAATAATAGTTTAGATGAAATTCTTGATTCTTTCATGAGGAAGTCTAGTAAGGATTCTGTATGCGAGCGATCTTTGACCTCGGAAATTCATGAAGTTTTGTGTGAGAAGT
TAAAAGCTTGTGGGTTTGGTCGTTTGAGATCATCTCCAACACTTGCTGTTGTGTTCAATGACATTAGCCTGCCTACAAGTATACAAGAGATTACTTTCAGGAAATTGTGG
TCAAGCTTACCCATGGAACTTTTACATTTTGCCTTCTCTAGTTACTCTGAATTCCTCGAGAACAAGAATGCAGTATCTTTAGAGTTCTTAAGCGTACCTAGTCTACATCA
GTTGCCCCCTTTTATCCTAAGGGACCCATCAAGTCGCAGTAATAAGTGGTCATATAAAGTGCAGCGAACTGAAAATCTTGTGGGTCCAGTACTTCCCCTTCCCATTCTGC
TCGTACTTCATGAGTTTCGAAATGGATGTTCAAAGTTCGAAGAAGAAGAGGCAGGAAAATTTTCATTAGAGGCAGAACTTGGCGAGCAATATGATCAAATCAGGTCTGCA
GCTGGTGAGATGGCTATGTCACCGTTTGACCCAAAGGTTGATGATGGTCCTGCGGTCTCTCTTGCGAATGATCGAGAATACGTTTCTGCCGATTCTAAGAAGCCAAAGAA
TTTTGTTTCATATCACCCATCTGCCTTTAATTCCCATACTTTTGATGATGATAACACACAAGAAAATGCGACCAATCTTACAGATGTATTTGATTCCTTAATATTCAAAT
TAGAAGGGGGAAAGGCTACATCCAGTGAGAAGTCTGAAAACAACGCTAGTAGAGAACTGTACGACGGTCTCTGCCCTGTTGAGTTGAAATTCGATGCTCGTCCCGTGAAC
TTCGGGCCAAAAGAGTTGAAGGCATATAATTTACTAAAAAGGCAACTGTTGAAATGGGAAGATGGGTTTGATGGGTACAAGGAATTCTGCTCCAAGATTTGA
Protein sequenceShow/hide protein sequence
MPEEQWKSLFPIGSVFNSPLLLSASSVKDSIGPLVFNPVPSSLIRLFSSPSLLPSLSPPSILNLRRFLLTSSPITPSTSSSVALLFGDQQWDGDAASTLRYNRLQFLRCP
NSNHVVVFFPTGPNSDHVGFLVVSGDGSGLGVQSDCNNEVFSVESELNYQIFGIAVNPVSDLGFNGDCSVCIGFLLAYTMYSVEWFVVKNNATDLRFPPRVSLVHMGSKV
FRTCSVVHACWSPHLLEESVVLLEDGSLFLFDMEPLLKAENSNMNADLKGIRLRVSWDSLDCSKKVKWLSCEFSWHPRILIVARSDAVFLVDLREVECSISCLVKIETFH
SYSLAEREQFLAFSKAGSDGFYFCVASKSLLLLCDIRKPLSPVLQWTHGIDEPSYVSVFSLSELRSRASNSMYKLASELGYCIVLGSFWSCEFNIFCYGPSLPALDQSVS
SKSSKYFQSLYAWERPSNLILSGRECPCGSCLVRQETLKDAIPEWVEWQQKKEIVLGFGILDNNLSLPLIEEQNEYGSFTLVRLMSSGLLEAQTYQASWNLVKRIDKSHK
ESLNLNDYLLYGWLVDDKYRFSRRFRYFNFDYLMGYLNNSLDEILDSFMRKSSKDSVCERSLTSEIHEVLCEKLKACGFGRLRSSPTLAVVFNDISLPTSIQEITFRKLW
SSLPMELLHFAFSSYSEFLENKNAVSLEFLSVPSLHQLPPFILRDPSSRSNKWSYKVQRTENLVGPVLPLPILLVLHEFRNGCSKFEEEEAGKFSLEAELGEQYDQIRSA
AGEMAMSPFDPKVDDGPAVSLANDREYVSADSKKPKNFVSYHPSAFNSHTFDDDNTQENATNLTDVFDSLIFKLEGGKATSSEKSENNASRELYDGLCPVELKFDARPVN
FGPKELKAYNLLKRQLLKWEDGFDGYKEFCSKI