CuGenDBv2

HG10003081 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10003081
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: SpoU_methylase domain-containing protein
Genome location: Chr11:17160636..17171684
RNA-Seq Expression: HG10003081
Synteny: HG10003081
Gene Ontology terms:
GO:0030488 - tRNA methylation (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0016423 - tRNA (guanine) methyltransferase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits    e value    % identity    Alignment
XP_011650373.1 uncharacterized protein LOC101213211 isoform X1 [Cucumis sativus]    0.0e+00    83.07
Query:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF
        MS N+ FSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLS SELFDSLLETFPK IDDA+TKEGKLDADQCNYITSLVCALCHILKKDGADP ALKSF
Subjt:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF

Query:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL
        IWKSFVPLINK AT NREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSV+Q+VEL+TFE DR S ILGSNVPVHEPRMDNQT K YGFLQL
Subjt:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL

Query:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM
        PLACHVLAIMLDAVLCNRQ PQTSD VVSNGYQKAEEFTVKLIWDICNLSEQMLLQS DHRSCAIC+LLPVIFEAL+S+HSLEISIQGHACNLSRS FLM
Subjt:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM

Query:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS
        KIW CCKKLFSFGTLERRDAYRILS+Y CFFP NEELGGAGMCDD EEFDIKADK FWDEIKRGLVDKESSVRKQSLHILKKALS NGRG+ +TV KTIS
Subjt:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS

Query:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI
         GKD+NV+GITKRERWANKEA SLGVGQIC+Q++I TNSRQQ+WEAFILLYEMLEEYGSHLVEAAWSHQISLLLQ P S + DSFS+GVHQNQIE+SGEI
Subjt:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI

Query:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ
        +SWLSILWV+GFHHDNPLVRCLIMQ FL IEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKG+YSSKTVEGAARF+CQY NILDAR RVVFLHQ
Subjt:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ

Query:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
        LTSLARKKSFGR GLISLSECIASAASIVGFDYN EGECFNGSSLS+Q DLI YSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
Subjt:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS

Query:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------LYFYTR
        NLALEVVLHF+SALPREATDYGGCLRRKMQNWLLGCGKK     CCSTETKFMKSLIEFPKRF+ HN+S+DASVTYDDEEL AWE  A      ++   +
Subjt:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------LYFYTR

Query:  LAFHF---------------------------------------------TGH--------------SSDNWSYAEPTIFSQKIANLLPSLQVELVSLAT
           H                                               GH               SDN SYAEPTIFSQKI NLLPSLQVELVS AT
Subjt:  LAFHF---------------------------------------------TGH--------------SSDNWSYAEPTIFSQKIANLLPSLQVELVSLAT

Query:  VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV
        +SCSIFWSNVKSDET LPGSVKGKLGGPSQRRLPSSV TLVLLAV
Subjt:  VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV

XP_031738459.1 uncharacterized protein LOC101213211 isoform X3 [Cucumis sativus]    0.0e+00    83.07
Query:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF
        MS N+ FSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLS SELFDSLLETFPK IDDA+TKEGKLDADQCNYITSLVCALCHILKKDGADP ALKSF
Subjt:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF

Query:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL
        IWKSFVPLINK AT NREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSV+Q+VEL+TFE DR S ILGSNVPVHEPRMDNQT K YGFLQL
Subjt:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL

Query:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM
        PLACHVLAIMLDAVLCNRQ PQTSD VVSNGYQKAEEFTVKLIWDICNLSEQMLLQS DHRSCAIC+LLPVIFEAL+S+HSLEISIQGHACNLSRS FLM
Subjt:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM

Query:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS
        KIW CCKKLFSFGTLERRDAYRILS+Y CFFP NEELGGAGMCDD EEFDIKADK FWDEIKRGLVDKESSVRKQSLHILKKALS NGRG+ +TV KTIS
Subjt:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS

Query:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI
         GKD+NV+GITKRERWANKEA SLGVGQIC+Q++I TNSRQQ+WEAFILLYEMLEEYGSHLVEAAWSHQISLLLQ P S + DSFS+GVHQNQIE+SGEI
Subjt:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI

Query:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ
        +SWLSILWV+GFHHDNPLVRCLIMQ FL IEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKG+YSSKTVEGAARF+CQY NILDAR RVVFLHQ
Subjt:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ

Query:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
        LTSLARKKSFGR GLISLSECIASAASIVGFDYN EGECFNGSSLS+Q DLI YSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
Subjt:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS

Query:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------LYFYTR
        NLALEVVLHF+SALPREATDYGGCLRRKMQNWLLGCGKK     CCSTETKFMKSLIEFPKRF+ HN+S+DASVTYDDEEL AWE  A      ++   +
Subjt:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------LYFYTR

Query:  LAFHF---------------------------------------------TGH--------------SSDNWSYAEPTIFSQKIANLLPSLQVELVSLAT
           H                                               GH               SDN SYAEPTIFSQKI NLLPSLQVELVS AT
Subjt:  LAFHF---------------------------------------------TGH--------------SSDNWSYAEPTIFSQKIANLLPSLQVELVSLAT

Query:  VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV
        +SCSIFWSNVKSDET LPGSVKGKLGGPSQRRLPSSV TLVLLAV
Subjt:  VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV

XP_038906253.1 uncharacterized protein LOC120092116 isoform X1 [Benincasa hispida]    0.0e+00    85.4
Query:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF
        MSDN++FSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDA+ KEGK DADQCNYITSLVCALCHILKK+GA+PDALKSF
Subjt:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF

Query:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL
        IWKSFVPLINKAAT NREMLNQVSESFIDVV+ETNSWPIVEATL+PFCISSALYSTSV QNVEL+TFEGD CSVILGSN PVHEPRMDNQ TKGY FLQL
Subjt:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL

Query:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM
        PLACHVL+IMLDAV CN QAPQTS VVVSNGYQKAEEFTVKLIWDICNLS QMLLQS DHRSCAICYLLPVIFEALLS+HSLEISI+GHACNLSR+RFLM
Subjt:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM

Query:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS
        KIW CCKKLFSFGTLERRDAYRILS+YLCFFP NEELGGAGMCDD EEFDI+ADKDFWDEIKRGLVDKESSVRKQSL+ILKKALSINGRGNTS VPKTIS
Subjt:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS

Query:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI
         GKDNNVRGITKRERWANKEANSLGVGQIC+QHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAW HQIS LLQDPAS  +DSFS+GVHQNQIE+SGEI
Subjt:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI

Query:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ
        FSWLSILWV+GFHHDNPLVRCLIMQSFLAIEWR+KVPCLKS+PETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILD R R VFLHQ
Subjt:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ

Query:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
        LTSLARKKSFGR GLISLSECIASAASIV FDYNSEGECFNGSSLSAQGD  TYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
Subjt:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS

Query:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------------
        NLALEVVLHFISA+PREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRF+NHN+S+DASVTYDDEELVAWE  A            
Subjt:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------------

Query:  ---------------------------------------LYFYTRLAFHFTGH--------------SSDNWSYAEPTIFSQKIANLLPSLQVELVSLAT
                                               L    +L     GH               SDNWSYA  TIFSQKIANLLP LQVELVS AT
Subjt:  ---------------------------------------LYFYTRLAFHFTGH--------------SSDNWSYAEPTIFSQKIANLLPSLQVELVSLAT

Query:  VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV
        VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSV TLVLLAV
Subjt:  VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV

XP_038906254.1 uncharacterized protein LOC120092116 isoform X2 [Benincasa hispida]    0.0e+00    85.4
Query:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF
        MSDN++FSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDA+ KEGK DADQCNYITSLVCALCHILKK+GA+PDALKSF
Subjt:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF

Query:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL
        IWKSFVPLINKAAT NREMLNQVSESFIDVV+ETNSWPIVEATL+PFCISSALYSTSV QNVEL+TFEGD CSVILGSN PVHEPRMDNQ TKGY FLQL
Subjt:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL

Query:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM
        PLACHVL+IMLDAV CN QAPQTS VVVSNGYQKAEEFTVKLIWDICNLS QMLLQS DHRSCAICYLLPVIFEALLS+HSLEISI+GHACNLSR+RFLM
Subjt:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM

Query:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS
        KIW CCKKLFSFGTLERRDAYRILS+YLCFFP NEELGGAGMCDD EEFDI+ADKDFWDEIKRGLVDKESSVRKQSL+ILKKALSINGRGNTS VPKTIS
Subjt:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS

Query:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI
         GKDNNVRGITKRERWANKEANSLGVGQIC+QHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAW HQIS LLQDPAS  +DSFS+GVHQNQIE+SGEI
Subjt:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI

Query:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ
        FSWLSILWV+GFHHDNPLVRCLIMQSFLAIEWR+KVPCLKS+PETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILD R R VFLHQ
Subjt:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ

Query:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
        LTSLARKKSFGR GLISLSECIASAASIV FDYNSEGECFNGSSLSAQGD  TYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
Subjt:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS

Query:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------------
        NLALEVVLHFISA+PREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRF+NHN+S+DASVTYDDEELVAWE  A            
Subjt:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------------

Query:  ---------------------------------------LYFYTRLAFHFTGH--------------SSDNWSYAEPTIFSQKIANLLPSLQVELVSLAT
                                               L    +L     GH               SDNWSYA  TIFSQKIANLLP LQVELVS AT
Subjt:  ---------------------------------------LYFYTRLAFHFTGH--------------SSDNWSYAEPTIFSQKIANLLPSLQVELVSLAT

Query:  VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV
        VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSV TLVLLAV
Subjt:  VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV

XP_038906255.1 uncharacterized protein LOC120092116 isoform X3 [Benincasa hispida]    0.0e+00    85.4
Query:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF
        MSDN++FSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDA+ KEGK DADQCNYITSLVCALCHILKK+GA+PDALKSF
Subjt:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF

Query:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL
        IWKSFVPLINKAAT NREMLNQVSESFIDVV+ETNSWPIVEATL+PFCISSALYSTSV QNVEL+TFEGD CSVILGSN PVHEPRMDNQ TKGY FLQL
Subjt:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL

Query:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM
        PLACHVL+IMLDAV CN QAPQTS VVVSNGYQKAEEFTVKLIWDICNLS QMLLQS DHRSCAICYLLPVIFEALLS+HSLEISI+GHACNLSR+RFLM
Subjt:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM

Query:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS
        KIW CCKKLFSFGTLERRDAYRILS+YLCFFP NEELGGAGMCDD EEFDI+ADKDFWDEIKRGLVDKESSVRKQSL+ILKKALSINGRGNTS VPKTIS
Subjt:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS

Query:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI
         GKDNNVRGITKRERWANKEANSLGVGQIC+QHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAW HQIS LLQDPAS  +DSFS+GVHQNQIE+SGEI
Subjt:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI

Query:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ
        FSWLSILWV+GFHHDNPLVRCLIMQSFLAIEWR+KVPCLKS+PETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILD R R VFLHQ
Subjt:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ

Query:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
        LTSLARKKSFGR GLISLSECIASAASIV FDYNSEGECFNGSSLSAQGD  TYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
Subjt:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS

Query:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------------
        NLALEVVLHFISA+PREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRF+NHN+S+DASVTYDDEELVAWE  A            
Subjt:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------------

Query:  ---------------------------------------LYFYTRLAFHFTGH--------------SSDNWSYAEPTIFSQKIANLLPSLQVELVSLAT
                                               L    +L     GH               SDNWSYA  TIFSQKIANLLP LQVELVS AT
Subjt:  ---------------------------------------LYFYTRLAFHFTGH--------------SSDNWSYAEPTIFSQKIANLLPSLQVELVSLAT

Query:  VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV
        VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSV TLVLLAV
Subjt:  VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV

TrEMBL top hits    e value    % identity    Alignment
A0A0A0L1E9 SpoU_methylase domain-containing protein    0.0e+00    83.07
Query:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF
        MS N+ FSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLS SELFDSLLETFPK IDDA+TKEGKLDADQCNYITSLVCALCHILKKDGADP ALKSF
Subjt:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF

Query:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL
        IWKSFVPLINK AT NREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSV+Q+VEL+TFE DR S ILGSNVPVHEPRMDNQT K YGFLQL
Subjt:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL

Query:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM
        PLACHVLAIMLDAVLCNRQ PQTSD VVSNGYQKAEEFTVKLIWDICNLSEQMLLQS DHRSCAIC+LLPVIFEAL+S+HSLEISIQGHACNLSRS FLM
Subjt:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM

Query:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS
        KIW CCKKLFSFGTLERRDAYRILS+Y CFFP NEELGGAGMCDD EEFDIKADK FWDEIKRGLVDKESSVRKQSLHILKKALS NGRG+ +TV KTIS
Subjt:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS

Query:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI
         GKD+NV+GITKRERWANKEA SLGVGQIC+Q++I TNSRQQ+WEAFILLYEMLEEYGSHLVEAAWSHQISLLLQ P S + DSFS+GVHQNQIE+SGEI
Subjt:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI

Query:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ
        +SWLSILWV+GFHHDNPLVRCLIMQ FL IEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKG+YSSKTVEGAARF+CQY NILDAR RVVFLHQ
Subjt:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ

Query:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
        LTSLARKKSFGR GLISLSECIASAASIVGFDYN EGECFNGSSLS+Q DLI YSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
Subjt:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS

Query:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------LYFYTR
        NLALEVVLHF+SALPREATDYGGCLRRKMQNWLLGCGKK     CCSTETKFMKSLIEFPKRF+ HN+S+DASVTYDDEEL AWE  A      ++   +
Subjt:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------LYFYTR

Query:  LAFHF---------------------------------------------TGH--------------SSDNWSYAEPTIFSQKIANLLPSLQVELVSLAT
           H                                               GH               SDN SYAEPTIFSQKI NLLPSLQVELVS AT
Subjt:  LAFHF---------------------------------------------TGH--------------SSDNWSYAEPTIFSQKIANLLPSLQVELVSLAT

Query:  VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV
        +SCSIFWSNVKSDET LPGSVKGKLGGPSQRRLPSSV TLVLLAV
Subjt:  VSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV

A0A1S3BKH8 LOW QUALITY PROTEIN: uncharacterized protein LOC103490840    0.0e+00    82.39
Query:  MASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSFIWKSFVPL
        MASVF SLSESFRRVPPMAVPAILDCLFASTGLSPS+LFDSLLETFPKNIDD +T EGKLDADQCNYITSLVCALCHILKKDGA+P ALKSFIWKSFVPL
Subjt:  MASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSFIWKSFVPL

Query:  INKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQLPLACHVLA
        INKAAT NREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSV+QNVEL+TFE DR S ILGSNVPVHEPRMD+QT K YGFLQLPLACHVLA
Subjt:  INKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQLPLACHVLA

Query:  IMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLMKIWTCCKK
        IMLDAVLCNRQ PQTSD VVSNG QKAEEFTVKLIWDICNLSEQMLLQS D RSCAIC+LLPVIFEAL+S+HSLEISIQGHACNLSR+ FLMKIW CCKK
Subjt:  IMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLMKIWTCCKK

Query:  LFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTISHGKDNNVR
        LFSFGTLERRDAYRILS+Y CFF  NEELGGAGMCDD EEFDIKADKDFWDEIK+GLVDKESSVRKQSLHILKKALS   +   +  PKTIS GKD++VR
Subjt:  LFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTISHGKDNNVR

Query:  GITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEIFSWLSILW
        GITKRERWANKEA SLGVGQIC+QHEI TNSRQQ+WEAFILLYEMLEEYGSHLVEAAWS QISLLLQ P S K DSFS+GVHQNQIE+SGEIFSWLSILW
Subjt:  GITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEIFSWLSILW

Query:  VQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQLTSLARKK
        V+GFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGA+RFICQY NIL+AR RVVFLHQLTSLARKK
Subjt:  VQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQLTSLARKK

Query:  SFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTSNLALEVVL
        SFGR GLISLSECIASAA IVGFDYN EGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTSNLALEVVL
Subjt:  SFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTSNLALEVVL

Query:  HFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------LYFYTRLAFHFT--
        HF+SALPREATDYGGCLRRKMQNWLLGCGKK     CCSTETKFMKSLIEFPKRF+ HN+S+DASVTYDDEELVAWE  A      ++   +   H T  
Subjt:  HFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------LYFYTRLAFHFT--

Query:  ---------------------------------------------------------GHSSDNWSYAEPTIFSQKIANLLPSLQVELVSLATVSCSIFWS
                                                                  H SD+ S+AEPTIFSQKI NLLPSLQVE VS AT+SCSIFWS
Subjt:  ---------------------------------------------------------GHSSDNWSYAEPTIFSQKIANLLPSLQVELVSLATVSCSIFWS

Query:  NVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV
        NVKSDET LPGSVKGKLGGPSQRRLPSS+ T VLLAV
Subjt:  NVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV

A0A6J1DMP5 uncharacterized protein LOC111022655 isoform X4    0.0e+00    76.22
Query:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF
        M +N+SFSMASV  SLSESFR+VPPMAVPAILDCL ASTGLSPSELF SLL+TFP NIDD +TKEGKLD DQCNY+TSLVCALCHILKK+GADPDALK F
Subjt:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF

Query:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL
        IWKSFVPLINKAA FNREMLNQVSESFIDVV ETNSWPIVE TL+P CISSALYST+++QN +L TFEGDRCSVILGSN  VHEP+MD Q  KGYGFL L
Subjt:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL

Query:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM
        PLACH+LAIMLDAVLCNRQAPQT++VVVSNG QKAEEFTVKLI DICNLS+QMLLQS DHRSCAI YLLPVIFEALLS H+LEISIQG+AC+LSR+RFLM
Subjt:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM

Query:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS
        KIW CCKKLFSFGTLERRDAY ILS+YL FFP NEEL GAGMCDD EEFDIKADKDFW EIKRGLVDKE  VRKQS+HILKKALSINGRGNTS+VP TIS
Subjt:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS

Query:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI
         GKDNN RGITKRERWANKEA SLGV Q C+QHEIVTNS QQ+WEAFILLYEMLEEYGSHLVEAAW+HQISLLL+DP SIK DSF+ G +QNQIE+SGEI
Subjt:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI

Query:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ
        FSWLSILWV+GFHHDNPLVRCLIMQSFL I+WR+ V CL SLP+TFIIGPFIEALNDPVQHKDFG+KGVYSSKT+EGAA FI QYAN LDAR  VVFL Q
Subjt:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ

Query:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
        LTSL +KKSFGR GLISLSECIASAASIVGF+ + EGECF+      QG+LITYSL  K+ELLDDLRFVV+SSKQHFNPSYR QVCAKALEAAASVLCTS
Subjt:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS

Query:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------LYFYTR
        +L LEV+L FISALPREATDYGGCLR KMQ+WLLGCGKKCCSGSCCSTETKFMKSLIEFPKRF+ HN+S++ SVTYDDEEL AWEF A      ++   +
Subjt:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------LYFYTR

Query:  LAFHF----------------------------------------------TGHS--------------SDNWSYAEPTIFSQKIANLLPSLQVELVSLA
           H                                               T H+              SD+ +YA PT F +K  NL  SL  ELVS A
Subjt:  LAFHF----------------------------------------------TGHS--------------SDNWSYAEPTIFSQKIANLLPSLQVELVSLA

Query:  TVSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV
        T SCSIFWSNVKSDET LP SVKGKLGGPSQRRLPS   TLVLLAV
Subjt:  TVSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV

A0A6J1DPL7 uncharacterized protein LOC111022655 isoform X5    0.0e+00    76.22
Query:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF
        M +N+SFSMASV  SLSESFR+VPPMAVPAILDCL ASTGLSPSELF SLL+TFP NIDD +TKEGKLD DQCNY+TSLVCALCHILKK+GADPDALK F
Subjt:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF

Query:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL
        IWKSFVPLINKAA FNREMLNQVSESFIDVV ETNSWPIVE TL+P CISSALYST+++QN +L TFEGDRCSVILGSN  VHEP+MD Q  KGYGFL L
Subjt:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL

Query:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM
        PLACH+LAIMLDAVLCNRQAPQT++VVVSNG QKAEEFTVKLI DICNLS+QMLLQS DHRSCAI YLLPVIFEALLS H+LEISIQG+AC+LSR+RFLM
Subjt:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM

Query:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS
        KIW CCKKLFSFGTLERRDAY ILS+YL FFP NEEL GAGMCDD EEFDIKADKDFW EIKRGLVDKE  VRKQS+HILKKALSINGRGNTS+VP TIS
Subjt:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS

Query:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI
         GKDNN RGITKRERWANKEA SLGV Q C+QHEIVTNS QQ+WEAFILLYEMLEEYGSHLVEAAW+HQISLLL+DP SIK DSF+ G +QNQIE+SGEI
Subjt:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI

Query:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ
        FSWLSILWV+GFHHDNPLVRCLIMQSFL I+WR+ V CL SLP+TFIIGPFIEALNDPVQHKDFG+KGVYSSKT+EGAA FI QYAN LDAR  VVFL Q
Subjt:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ

Query:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
        LTSL +KKSFGR GLISLSECIASAASIVGF+ + EGECF+      QG+LITYSL  K+ELLDDLRFVV+SSKQHFNPSYR QVCAKALEAAASVLCTS
Subjt:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS

Query:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------LYFYTR
        +L LEV+L FISALPREATDYGGCLR KMQ+WLLGCGKKCCSGSCCSTETKFMKSLIEFPKRF+ HN+S++ SVTYDDEEL AWEF A      ++   +
Subjt:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSA------LYFYTR

Query:  LAFHF----------------------------------------------TGHS--------------SDNWSYAEPTIFSQKIANLLPSLQVELVSLA
           H                                               T H+              SD+ +YA PT F +K  NL  SL  ELVS A
Subjt:  LAFHF----------------------------------------------TGHS--------------SDNWSYAEPTIFSQKIANLLPSLQVELVSLA

Query:  TVSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV
        T SCSIFWSNVKSDET LP SVKGKLGGPSQRRLPS   TLVLLAV
Subjt:  TVSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV

A0A6J1F8L8 uncharacterized protein LOC111441850 isoform X1    0.0e+00    77.61
Query:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF
        MS+N+SFSMASVFSS+SESFRRVPPMAVPAILDC+FASTGLSPSELFDSLLE FPKNIDD +TKEGKLDADQCNYITSLVCA CHILKK GADPDALKSF
Subjt:  MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSF

Query:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL
        IWKSFVPLINKAAT NREMLNQV  SFIDVVTETNSWPIVEATLIPFCISSA+YS +V+QN EL+TFEGDRCSVIL S            T K YGFLQL
Subjt:  IWKSFVPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQL

Query:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM
        PLACHVLA+MLDAVL NRQAPQ SDVVVSNG QKAEEFT+KLIWDICNLSEQMLLQS DHRSCAI YLLPVIFEALLS+HSLE+SIQG ACN+SRSRFLM
Subjt:  PLACHVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLM

Query:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS
        KIW CCKKLFSFGTLERRDAYRILS+YLCFFP NEELG AGMCDD EE DI ADKD W+EIKRGLVDKES VRKQSLHIL KAL I+GR N ST+PKTIS
Subjt:  KIWTCCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTIS

Query:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI
          KD+N RGITK+ERWANKEA SLGVGQIC+Q E VTNSRQQQWEAFILLYEMLEEYGSHLV+AAW+HQISLLLQDP S   DSF++G HQNQI++SGEI
Subjt:  HGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEI

Query:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ
        FSW+SILWV+GFHH+NPLVRCLIMQSFLAI+W+  VPCLKSLPE+FIIGPFIEALNDPVQHKDFG+KGVYSSKT+EGAA FI QYANILDA  RVVFL +
Subjt:  FSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQ

Query:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS
        LTSLARKKS GR GLISLSECIASAASI G D N EGECFNGSSLSAQGDLI  S  CK+ELLDDLRFVVESSKQHFNPSYR+QVCAKALEAAASVLCTS
Subjt:  LTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKLELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTS

Query:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSALYFYTRLAF---
        +LA E VLHFISALPREATDYGGCLR KMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRF +HN+S+DASVTYDDEEL AWE  A   + R+ F   
Subjt:  NLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSADASVTYDDEELVAWEFSALYFYTRLAF---

Query:  ---------------------------------------------------------------HFT-GHSSDNWSYAEPTIFSQKIANLLPSLQVELVSL
                                                                        FT  H SD+ SYAEPTI  QK+ NL  SLQ+ELVS 
Subjt:  ---------------------------------------------------------------HFT-GHSSDNWSYAEPTIFSQKIANLLPSLQVELVSL

Query:  ATVSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV
        A VSCSIFWS VKSDET+LPGSVKGKLGGPSQRRLPSS+ T VLLAV
Subjt:  ATVSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV

SwissProt top hits    e value    % identity    Alignment
No hits found
Arabidopsis top hits    e value    % identity    Alignment
AT4G17610.1 tRNA/rRNA methyltransferase (SpoU) family protein    5.1e-189    41.1
Query:  ASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDG----ADPDALKSFIWKSF
        +SV +SLS SF++VPP A+PA LDC+ +STG+SPS LF+SL+E FP  ++D    + + D+D CN+I SLV  LCH+LK  G     + +AL+ F+W+ F
Subjt:  ASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDG----ADPDALKSFIWKSF

Query:  VPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFE-GDRCSVILGSNVPVHEPRMDNQTTKGYGFLQLPLAC
        +PL+     ++ +MLN++ ESF DVV ETN   ++  +L+PF + S  +S  + Q+ E +  + GD C     + + + E    N   +  G   +PL+C
Subjt:  VPLINKAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFE-GDRCSVILGSNVPVHEPRMDNQTTKGYGFLQLPLAC

Query:  HVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLMKIWT
        H+L ++L+A   + QA             K E F   ++WD+CN +E++L QS++HRSCA+ +LLP IF+A  S  SL+IS QG+   LSR+ F+ +IW 
Subjt:  HVLAIMLDAVLCNRQAPQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLMKIWT

Query:  CCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEE--EFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTISHG
        CCKKLFS G++ERRDAY +LS  LC    +   G      +++  +FD++++++FWDEIK GLV  ES VRKQSLHILK  LSI        V +TIS  
Subjt:  CCKKLFSFGTLERRDAYRILSVYLCFFPDNEELGGAGMCDDEE--EFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTISHG

Query:  K--DNNV-RGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLD----SFSNGVHQNQIE
        K   N+V R +T++E WA KEA SLGVG++    +    S QQ W+AF+LLYEMLEEYG+HLVEAAWS+QI LL++  +S++ D    S  N  H   +E
Subjt:  K--DNNV-RGITKRERWANKEANSLGVGQICNQHEIVTNSRQQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLD----SFSNGVHQNQIE

Query:  LSGE---IFSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDAR
           E   IF+WL +LW +GF HDNPLVRC +M+SF  IEWR    C +S+ +TF++GPFIE LNDP  HKDFGLKG+Y+S+T+EGAA+++  Y + L+ R
Subjt:  LSGE---IFSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGPFIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDAR

Query:  ARVVFLHQLTSLARKKSFGRAGLISLSECIASAASIV-GFDYNSEGECFNGSSLSAQGDLITY-SLECKLELLDDLRFVVESSKQHFNPSYRLQ------
         RV FL  L SLA+K+SF RAG ++L +CI S A +V G+     G   +  S +AQ     + S +    +LD L+FV ESS+QHFN  YR++      
Subjt:  ARVVFLHQLTSLARKKSFGRAGLISLSECIASAASIV-GFDYNSEGECFNGSSLSAQGDLITY-SLECKLELLDDLRFVVESSKQHFNPSYRLQ------

Query:  ---------------VCAKALEAAASVLCTSNLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYS
                       V  K LE AASV+   N+ L  +L F+SA+PRE TD+ G LR+ M  WL GC +K  S S C+  T+ + SL E+ K F + N  
Subjt:  ---------------VCAKALEAAASVLCTSNLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYS

Query:  ADASVTYDDEELVAWEFSALYFYTRLAF-------HFT--------------------------------------------------------GHSSDN
             ++DDE+L AW+ S    + R+ F       H T                                                        G  SD 
Subjt:  ADASVTYDDEELVAWEFSALYFYTRLAF-------HFT--------------------------------------------------------GHSSDN

Query:  WS----YAEPTIFSQKIANLLPSLQVELVSLATVSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV
         +      + +   +K A +L S+  EL+  A  SCSIFWS+   +   LPGSV GKLGGPSQRRL    TT VL AV
Subjt:  WS----YAEPTIFSQKIANLLPSLQVELVSLATVSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV


Sequences
CDS sequence
ATGTCTGACAACGAAAGTTTTTCCATGGCATCTGTTTTCAGTTCGTTGTCAGAAAGCTTCCGGAGAGTGCCTCCAATGGCAGTTCCAGCTATTTTGGATTGCCTTTTTGC
TTCTACTGGGTTATCCCCATCTGAGCTTTTCGATTCGCTTCTTGAGACTTTTCCCAAAAACATTGATGATGCTTCCACGAAGGAGGGAAAGCTTGATGCTGATCAATGTA
ATTACATCACATCTTTGGTCTGCGCGCTGTGCCACATACTAAAAAAAGATGGTGCTGATCCTGATGCTTTGAAGTCATTCATATGGAAAAGCTTTGTTCCTTTGATAAAC
AAGGCAGCTACATTTAATCGGGAAATGCTAAACCAGGTCTCTGAATCATTCATTGATGTCGTAACTGAGACGAACTCATGGCCAATTGTTGAAGCAACCCTAATTCCATT
TTGTATAAGTTCAGCTCTTTATTCCACGAGTGTGGTGCAAAACGTTGAGTTGAACACCTTTGAGGGTGACAGATGTTCTGTCATTTTGGGCTCAAATGTCCCTGTGCATG
AACCTAGAATGGATAATCAGACGACGAAAGGCTATGGGTTCCTTCAATTACCATTAGCATGCCATGTTTTGGCTATAATGTTAGATGCTGTCCTTTGTAATAGACAAGCA
CCACAAACATCAGACGTAGTGGTGTCAAATGGGTACCAAAAAGCTGAAGAGTTTACTGTTAAACTAATTTGGGATATTTGCAATTTATCTGAACAAATGCTCTTACAAAG
CTTGGATCATCGATCTTGTGCCATTTGCTATCTTCTTCCAGTAATCTTTGAAGCACTTCTTTCTTACCATTCTTTAGAGATCTCCATTCAAGGCCATGCATGTAACCTCT
CCAGGAGCCGTTTCCTCATGAAAATATGGACATGTTGCAAAAAACTATTTTCATTTGGAACTTTGGAGAGAAGAGATGCCTATAGGATTTTGTCTGTTTATTTATGTTTT
TTCCCTGACAATGAAGAGCTTGGAGGTGCTGGAATGTGTGATGATGAAGAAGAATTTGACATAAAGGCTGATAAAGATTTTTGGGATGAAATTAAAAGAGGCTTGGTGGA
TAAGGAGAGCTCGGTGCGGAAGCAATCACTACATATATTGAAGAAAGCACTATCTATAAATGGAAGAGGCAACACATCTACGGTTCCAAAGACAATTTCACATGGAAAAG
ATAATAATGTTCGAGGTATTACCAAAAGGGAAAGATGGGCCAATAAGGAAGCAAATTCACTGGGTGTAGGGCAAATCTGCAACCAACATGAAATTGTTACAAATAGCCGG
CAGCAACAGTGGGAAGCATTCATACTTCTTTATGAAATGCTTGAAGAATATGGTTCACACTTGGTTGAAGCTGCTTGGAGTCACCAGATATCCTTGTTACTACAAGATCC
GGCCTCTATTAAACTTGACAGCTTCAGTAATGGCGTTCATCAGAACCAAATTGAACTGTCTGGTGAAATCTTTAGTTGGTTATCGATCTTGTGGGTTCAGGGATTCCACC
ATGATAATCCTTTAGTTAGATGTTTGATCATGCAGTCCTTCTTGGCCATTGAGTGGAGGGATAAAGTTCCTTGTTTAAAGTCATTGCCAGAAACTTTTATTATTGGACCT
TTCATTGAAGCACTCAACGATCCTGTGCAGCACAAAGATTTTGGTTTAAAAGGAGTTTACTCATCCAAGACAGTTGAAGGTGCAGCCCGTTTTATATGTCAATATGCAAA
TATTCTTGATGCAAGGGCAAGAGTGGTGTTTTTGCATCAGCTCACATCTTTGGCTAGGAAGAAATCATTTGGTCGAGCTGGGTTGATCAGCCTATCTGAATGCATTGCTT
CAGCTGCCTCAATAGTTGGATTTGACTACAATAGCGAAGGAGAGTGCTTTAATGGTTCTTCATTGTCAGCCCAGGGGGATTTGATAACTTATTCTCTGGAGTGCAAACTG
GAATTGCTGGACGATCTTAGATTTGTGGTTGAGAGCAGCAAACAACACTTCAATCCTAGTTATCGCCTTCAAGTTTGTGCCAAAGCTCTGGAGGCTGCTGCGTCGGTCTT
GTGTACATCAAACTTAGCTCTTGAGGTTGTTCTGCATTTTATTTCAGCACTACCACGAGAGGCTACTGACTATGGAGGTTGCTTAAGGAGGAAAATGCAAAATTGGCTGT
TAGGGTGTGGTAAGAAGTGCTGCAGTGGCAGTTGCTGCAGTACTGAGACAAAGTTTATGAAGAGCCTTATTGAGTTCCCTAAAAGATTTATGAATCATAATTATTCAGCC
GATGCTTCTGTTACTTACGATGATGAAGAATTGGTAGCATGGGAATTTTCTGCTTTGTACTTTTACACTAGGCTTGCCTTTCACTTTACTGGCCACAGCAGTGACAATTG
GAGCTATGCAGAACCAACAATTTTTAGCCAAAAAATAGCCAACCTTCTTCCATCACTACAGGTAGAATTGGTTTCTCTTGCTACCGTGTCTTGTTCCATATTCTGGTCCA
ATGTCAAGTCAGATGAGACAGTATTACCAGGTTCTGTGAAAGGGAAACTTGGGGGCCCCAGTCAACGCCGGTTGCCATCCTCCGTTACTACTTTGGTTCTGCTAGCTGTA
TGA
mRNA sequence
ATGTCTGACAACGAAAGTTTTTCCATGGCATCTGTTTTCAGTTCGTTGTCAGAAAGCTTCCGGAGAGTGCCTCCAATGGCAGTTCCAGCTATTTTGGATTGCCTTTTTGC
TTCTACTGGGTTATCCCCATCTGAGCTTTTCGATTCGCTTCTTGAGACTTTTCCCAAAAACATTGATGATGCTTCCACGAAGGAGGGAAAGCTTGATGCTGATCAATGTA
ATTACATCACATCTTTGGTCTGCGCGCTGTGCCACATACTAAAAAAAGATGGTGCTGATCCTGATGCTTTGAAGTCATTCATATGGAAAAGCTTTGTTCCTTTGATAAAC
AAGGCAGCTACATTTAATCGGGAAATGCTAAACCAGGTCTCTGAATCATTCATTGATGTCGTAACTGAGACGAACTCATGGCCAATTGTTGAAGCAACCCTAATTCCATT
TTGTATAAGTTCAGCTCTTTATTCCACGAGTGTGGTGCAAAACGTTGAGTTGAACACCTTTGAGGGTGACAGATGTTCTGTCATTTTGGGCTCAAATGTCCCTGTGCATG
AACCTAGAATGGATAATCAGACGACGAAAGGCTATGGGTTCCTTCAATTACCATTAGCATGCCATGTTTTGGCTATAATGTTAGATGCTGTCCTTTGTAATAGACAAGCA
CCACAAACATCAGACGTAGTGGTGTCAAATGGGTACCAAAAAGCTGAAGAGTTTACTGTTAAACTAATTTGGGATATTTGCAATTTATCTGAACAAATGCTCTTACAAAG
CTTGGATCATCGATCTTGTGCCATTTGCTATCTTCTTCCAGTAATCTTTGAAGCACTTCTTTCTTACCATTCTTTAGAGATCTCCATTCAAGGCCATGCATGTAACCTCT
CCAGGAGCCGTTTCCTCATGAAAATATGGACATGTTGCAAAAAACTATTTTCATTTGGAACTTTGGAGAGAAGAGATGCCTATAGGATTTTGTCTGTTTATTTATGTTTT
TTCCCTGACAATGAAGAGCTTGGAGGTGCTGGAATGTGTGATGATGAAGAAGAATTTGACATAAAGGCTGATAAAGATTTTTGGGATGAAATTAAAAGAGGCTTGGTGGA
TAAGGAGAGCTCGGTGCGGAAGCAATCACTACATATATTGAAGAAAGCACTATCTATAAATGGAAGAGGCAACACATCTACGGTTCCAAAGACAATTTCACATGGAAAAG
ATAATAATGTTCGAGGTATTACCAAAAGGGAAAGATGGGCCAATAAGGAAGCAAATTCACTGGGTGTAGGGCAAATCTGCAACCAACATGAAATTGTTACAAATAGCCGG
CAGCAACAGTGGGAAGCATTCATACTTCTTTATGAAATGCTTGAAGAATATGGTTCACACTTGGTTGAAGCTGCTTGGAGTCACCAGATATCCTTGTTACTACAAGATCC
GGCCTCTATTAAACTTGACAGCTTCAGTAATGGCGTTCATCAGAACCAAATTGAACTGTCTGGTGAAATCTTTAGTTGGTTATCGATCTTGTGGGTTCAGGGATTCCACC
ATGATAATCCTTTAGTTAGATGTTTGATCATGCAGTCCTTCTTGGCCATTGAGTGGAGGGATAAAGTTCCTTGTTTAAAGTCATTGCCAGAAACTTTTATTATTGGACCT
TTCATTGAAGCACTCAACGATCCTGTGCAGCACAAAGATTTTGGTTTAAAAGGAGTTTACTCATCCAAGACAGTTGAAGGTGCAGCCCGTTTTATATGTCAATATGCAAA
TATTCTTGATGCAAGGGCAAGAGTGGTGTTTTTGCATCAGCTCACATCTTTGGCTAGGAAGAAATCATTTGGTCGAGCTGGGTTGATCAGCCTATCTGAATGCATTGCTT
CAGCTGCCTCAATAGTTGGATTTGACTACAATAGCGAAGGAGAGTGCTTTAATGGTTCTTCATTGTCAGCCCAGGGGGATTTGATAACTTATTCTCTGGAGTGCAAACTG
GAATTGCTGGACGATCTTAGATTTGTGGTTGAGAGCAGCAAACAACACTTCAATCCTAGTTATCGCCTTCAAGTTTGTGCCAAAGCTCTGGAGGCTGCTGCGTCGGTCTT
GTGTACATCAAACTTAGCTCTTGAGGTTGTTCTGCATTTTATTTCAGCACTACCACGAGAGGCTACTGACTATGGAGGTTGCTTAAGGAGGAAAATGCAAAATTGGCTGT
TAGGGTGTGGTAAGAAGTGCTGCAGTGGCAGTTGCTGCAGTACTGAGACAAAGTTTATGAAGAGCCTTATTGAGTTCCCTAAAAGATTTATGAATCATAATTATTCAGCC
GATGCTTCTGTTACTTACGATGATGAAGAATTGGTAGCATGGGAATTTTCTGCTTTGTACTTTTACACTAGGCTTGCCTTTCACTTTACTGGCCACAGCAGTGACAATTG
GAGCTATGCAGAACCAACAATTTTTAGCCAAAAAATAGCCAACCTTCTTCCATCACTACAGGTAGAATTGGTTTCTCTTGCTACCGTGTCTTGTTCCATATTCTGGTCCA
ATGTCAAGTCAGATGAGACAGTATTACCAGGTTCTGTGAAAGGGAAACTTGGGGGCCCCAGTCAACGCCGGTTGCCATCCTCCGTTACTACTTTGGTTCTGCTAGCTGTA
TGA
Protein sequence
MSDNESFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELFDSLLETFPKNIDDASTKEGKLDADQCNYITSLVCALCHILKKDGADPDALKSFIWKSFVPLIN
KAATFNREMLNQVSESFIDVVTETNSWPIVEATLIPFCISSALYSTSVVQNVELNTFEGDRCSVILGSNVPVHEPRMDNQTTKGYGFLQLPLACHVLAIMLDAVLCNRQA
PQTSDVVVSNGYQKAEEFTVKLIWDICNLSEQMLLQSLDHRSCAICYLLPVIFEALLSYHSLEISIQGHACNLSRSRFLMKIWTCCKKLFSFGTLERRDAYRILSVYLCF
FPDNEELGGAGMCDDEEEFDIKADKDFWDEIKRGLVDKESSVRKQSLHILKKALSINGRGNTSTVPKTISHGKDNNVRGITKRERWANKEANSLGVGQICNQHEIVTNSR
QQQWEAFILLYEMLEEYGSHLVEAAWSHQISLLLQDPASIKLDSFSNGVHQNQIELSGEIFSWLSILWVQGFHHDNPLVRCLIMQSFLAIEWRDKVPCLKSLPETFIIGP
FIEALNDPVQHKDFGLKGVYSSKTVEGAARFICQYANILDARARVVFLHQLTSLARKKSFGRAGLISLSECIASAASIVGFDYNSEGECFNGSSLSAQGDLITYSLECKL
ELLDDLRFVVESSKQHFNPSYRLQVCAKALEAAASVLCTSNLALEVVLHFISALPREATDYGGCLRRKMQNWLLGCGKKCCSGSCCSTETKFMKSLIEFPKRFMNHNYSA
DASVTYDDEELVAWEFSALYFYTRLAFHFTGHSSDNWSYAEPTIFSQKIANLLPSLQVELVSLATVSCSIFWSNVKSDETVLPGSVKGKLGGPSQRRLPSSVTTLVLLAV