; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0017419 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0017419
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionSpoU_methylase domain-containing protein
Genome locationchr5:3471375..3478273
RNA-Seq ExpressionLag0017419
SyntenyLag0017419
Gene Ontology termsGO:0030488 - tRNA methylation (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0016423 - tRNA (guanine) methyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011650373.1 uncharacterized protein LOC101213211 isoform X1 [Cucumis sativus]7.4e-29384.11Show/hide
Query:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA
        MS+NK FSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLS SEL    L  F K +         DATTKEGKLDADQCNYITSLVCALCHILKK+GA
Subjt:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA

Query:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM
        DP ALKSFIWKSFVPLINK  T+NREMLNQVSESFIDVV+ TNSWPIV ATLIPFCISSALYSTSVLQ+ ELDTFE DR S ILGSNVP+HEPRMD QTM
Subjt:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM

Query:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN
        K Y FLQLPLACHVLAIMLDAVLCN Q PQTSD  VSNG QKAEEFT KLIWDICNLSE+MLLQSSDHRSCAI +LLP IFEAL+SH SLEISIQGH CN
Subjt:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN

Query:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT
        LSR+ FLMKIWKCCKKLFSFGTLERRDAYRILSLY CFFPHNEELGGAGMCDDGEEFDIKADK FWDEIKRGLVDKES VRKQSLHILK A S NG G+ 
Subjt:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT

Query:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN
        +TV KTISSGKD+N +GITKRERWANKEAKSLGVGQICSQ++I TNSRQQ WEAFILLYEMLEEYGSHLVEAAW+HQISLLLQ+PTS  FDSF+ GVHQN
Subjt:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN

Query:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR
        QIEMSGEI+SWLSILWVRGFHHDNPLVRCLIMQ FL I+WR+ VPCLKSLPETFIIGPFIE+LNDPVQHKDFG+KG+YSSKT+EGA  F+ QY NILDAR
Subjt:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR

Query:  ENLM
          ++
Subjt:  ENLM

XP_031738459.1 uncharacterized protein LOC101213211 isoform X3 [Cucumis sativus]7.4e-29384.11Show/hide
Query:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA
        MS+NK FSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLS SEL    L  F K +         DATTKEGKLDADQCNYITSLVCALCHILKK+GA
Subjt:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA

Query:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM
        DP ALKSFIWKSFVPLINK  T+NREMLNQVSESFIDVV+ TNSWPIV ATLIPFCISSALYSTSVLQ+ ELDTFE DR S ILGSNVP+HEPRMD QTM
Subjt:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM

Query:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN
        K Y FLQLPLACHVLAIMLDAVLCN Q PQTSD  VSNG QKAEEFT KLIWDICNLSE+MLLQSSDHRSCAI +LLP IFEAL+SH SLEISIQGH CN
Subjt:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN

Query:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT
        LSR+ FLMKIWKCCKKLFSFGTLERRDAYRILSLY CFFPHNEELGGAGMCDDGEEFDIKADK FWDEIKRGLVDKES VRKQSLHILK A S NG G+ 
Subjt:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT

Query:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN
        +TV KTISSGKD+N +GITKRERWANKEAKSLGVGQICSQ++I TNSRQQ WEAFILLYEMLEEYGSHLVEAAW+HQISLLLQ+PTS  FDSF+ GVHQN
Subjt:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN

Query:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR
        QIEMSGEI+SWLSILWVRGFHHDNPLVRCLIMQ FL I+WR+ VPCLKSLPETFIIGPFIE+LNDPVQHKDFG+KG+YSSKT+EGA  F+ QY NILDAR
Subjt:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR

Query:  ENLM
          ++
Subjt:  ENLM

XP_038906253.1 uncharacterized protein LOC120092116 isoform X1 [Benincasa hispida]5.0e-29785.83Show/hide
Query:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA
        MS+NK+FSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSEL    L  F K +         DA  KEGK DADQCNYITSLVCALCHILKKNGA
Subjt:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA

Query:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM
        +PDALKSFIWKSFVPLINKA T+NREMLNQVSESFIDVVS TNSWPIV ATL+PFCISSALYSTSV QN ELDTFEGD  SVILGSN P+HEPRMD Q  
Subjt:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM

Query:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN
        K Y FLQLPLACHVL+IMLDAV CN QAPQTS V VSNG QKAEEFT KLIWDICNLS +MLLQSSDHRSCAI YLLP IFEALLSH SLEISI+GH CN
Subjt:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN

Query:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT
        LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDI+ADKDFWDEIKRGLVDKES VRKQSL+ILK A SING GNT
Subjt:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT

Query:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN
        S V KTIS GKDNN RGITKRERWANKEA SLGVGQICSQHEIVTNSRQQ WEAFILLYEMLEEYGSHLVEAAW HQIS LLQ+P S N DSF+ GVHQN
Subjt:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN

Query:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR
        QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAI+WR  VPCLKS+PETFIIGPFIE+LNDPVQHKDFG+KGVYSSKT+EGA  FI QYANILD R
Subjt:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR

XP_038906254.1 uncharacterized protein LOC120092116 isoform X2 [Benincasa hispida]5.0e-29785.83Show/hide
Query:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA
        MS+NK+FSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSEL    L  F K +         DA  KEGK DADQCNYITSLVCALCHILKKNGA
Subjt:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA

Query:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM
        +PDALKSFIWKSFVPLINKA T+NREMLNQVSESFIDVVS TNSWPIV ATL+PFCISSALYSTSV QN ELDTFEGD  SVILGSN P+HEPRMD Q  
Subjt:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM

Query:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN
        K Y FLQLPLACHVL+IMLDAV CN QAPQTS V VSNG QKAEEFT KLIWDICNLS +MLLQSSDHRSCAI YLLP IFEALLSH SLEISI+GH CN
Subjt:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN

Query:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT
        LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDI+ADKDFWDEIKRGLVDKES VRKQSL+ILK A SING GNT
Subjt:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT

Query:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN
        S V KTIS GKDNN RGITKRERWANKEA SLGVGQICSQHEIVTNSRQQ WEAFILLYEMLEEYGSHLVEAAW HQIS LLQ+P S N DSF+ GVHQN
Subjt:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN

Query:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR
        QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAI+WR  VPCLKS+PETFIIGPFIE+LNDPVQHKDFG+KGVYSSKT+EGA  FI QYANILD R
Subjt:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR

XP_038906255.1 uncharacterized protein LOC120092116 isoform X3 [Benincasa hispida]5.0e-29785.83Show/hide
Query:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA
        MS+NK+FSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSEL    L  F K +         DA  KEGK DADQCNYITSLVCALCHILKKNGA
Subjt:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA

Query:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM
        +PDALKSFIWKSFVPLINKA T+NREMLNQVSESFIDVVS TNSWPIV ATL+PFCISSALYSTSV QN ELDTFEGD  SVILGSN P+HEPRMD Q  
Subjt:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM

Query:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN
        K Y FLQLPLACHVL+IMLDAV CN QAPQTS V VSNG QKAEEFT KLIWDICNLS +MLLQSSDHRSCAI YLLP IFEALLSH SLEISI+GH CN
Subjt:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN

Query:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT
        LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDI+ADKDFWDEIKRGLVDKES VRKQSL+ILK A SING GNT
Subjt:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT

Query:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN
        S V KTIS GKDNN RGITKRERWANKEA SLGVGQICSQHEIVTNSRQQ WEAFILLYEMLEEYGSHLVEAAW HQIS LLQ+P S N DSF+ GVHQN
Subjt:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN

Query:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR
        QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAI+WR  VPCLKS+PETFIIGPFIE+LNDPVQHKDFG+KGVYSSKT+EGA  FI QYANILD R
Subjt:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR

TrEMBL top hitse value%identityAlignment
A0A0A0L1E9 SpoU_methylase domain-containing protein3.6e-29384.11Show/hide
Query:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA
        MS+NK FSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLS SEL    L  F K +         DATTKEGKLDADQCNYITSLVCALCHILKK+GA
Subjt:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA

Query:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM
        DP ALKSFIWKSFVPLINK  T+NREMLNQVSESFIDVV+ TNSWPIV ATLIPFCISSALYSTSVLQ+ ELDTFE DR S ILGSNVP+HEPRMD QTM
Subjt:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM

Query:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN
        K Y FLQLPLACHVLAIMLDAVLCN Q PQTSD  VSNG QKAEEFT KLIWDICNLSE+MLLQSSDHRSCAI +LLP IFEAL+SH SLEISIQGH CN
Subjt:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN

Query:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT
        LSR+ FLMKIWKCCKKLFSFGTLERRDAYRILSLY CFFPHNEELGGAGMCDDGEEFDIKADK FWDEIKRGLVDKES VRKQSLHILK A S NG G+ 
Subjt:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT

Query:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN
        +TV KTISSGKD+N +GITKRERWANKEAKSLGVGQICSQ++I TNSRQQ WEAFILLYEMLEEYGSHLVEAAW+HQISLLLQ+PTS  FDSF+ GVHQN
Subjt:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN

Query:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR
        QIEMSGEI+SWLSILWVRGFHHDNPLVRCLIMQ FL I+WR+ VPCLKSLPETFIIGPFIE+LNDPVQHKDFG+KG+YSSKT+EGA  F+ QY NILDAR
Subjt:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR

Query:  ENLM
          ++
Subjt:  ENLM

A0A6J1DMP5 uncharacterized protein LOC111022655 isoform X47.8e-28882.95Show/hide
Query:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA
        M NNKSFSMASV  SLSESFR+VPPMAVPAILDCL ASTGLSPSEL    L+ F   +         D TTKEGKLD DQCNY+TSLVCALCHILKKNGA
Subjt:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA

Query:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM
        DPDALK FIWKSFVPLINKA   NREMLNQVSESFIDVV  TNSWPIV  TL+P CISSALYST++LQNE+L TFEGDR SVILGSN  +HEP+MDKQ +
Subjt:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM

Query:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN
        K Y FL LPLACH+LAIMLDAVLCN QAPQT++V VSNGCQKAEEFT KLI DICNLS++MLLQSSDHRSCAIRYLLP IFEALLS  +LEISIQG+ C+
Subjt:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN

Query:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT
        LSRNRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FFPHNEEL GAGMCDD EEFDIKADKDFW EIKRGLVDKE LVRKQS+HILK A SING GNT
Subjt:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT

Query:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN
        S+V  TISSGKDNNARGITKRERWANKEAKSLGV Q CSQHEIVTNS QQ WEAFILLYEMLEEYGSHLVEAAWNHQISLLL++PTSI FDSFTGG +QN
Subjt:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN

Query:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR
        QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFL IDWRN V CL SLP+TFIIGPFIE+LNDPVQHKDFGVKGVYSSKTIEGA HFI QYAN LDAR
Subjt:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR

Query:  ENLM
          ++
Subjt:  ENLM

A0A6J1DN74 uncharacterized protein LOC111022655 isoform X27.8e-28882.95Show/hide
Query:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA
        M NNKSFSMASV  SLSESFR+VPPMAVPAILDCL ASTGLSPSEL    L+ F   +         D TTKEGKLD DQCNY+TSLVCALCHILKKNGA
Subjt:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA

Query:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM
        DPDALK FIWKSFVPLINKA   NREMLNQVSESFIDVV  TNSWPIV  TL+P CISSALYST++LQNE+L TFEGDR SVILGSN  +HEP+MDKQ +
Subjt:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM

Query:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN
        K Y FL LPLACH+LAIMLDAVLCN QAPQT++V VSNGCQKAEEFT KLI DICNLS++MLLQSSDHRSCAIRYLLP IFEALLS  +LEISIQG+ C+
Subjt:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN

Query:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT
        LSRNRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FFPHNEEL GAGMCDD EEFDIKADKDFW EIKRGLVDKE LVRKQS+HILK A SING GNT
Subjt:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT

Query:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN
        S+V  TISSGKDNNARGITKRERWANKEAKSLGV Q CSQHEIVTNS QQ WEAFILLYEMLEEYGSHLVEAAWNHQISLLL++PTSI FDSFTGG +QN
Subjt:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN

Query:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR
        QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFL IDWRN V CL SLP+TFIIGPFIE+LNDPVQHKDFGVKGVYSSKTIEGA HFI QYAN LDAR
Subjt:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR

Query:  ENLM
          ++
Subjt:  ENLM

A0A6J1DN81 uncharacterized protein LOC111022655 isoform X77.8e-28882.95Show/hide
Query:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA
        M NNKSFSMASV  SLSESFR+VPPMAVPAILDCL ASTGLSPSEL    L+ F   +         D TTKEGKLD DQCNY+TSLVCALCHILKKNGA
Subjt:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA

Query:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM
        DPDALK FIWKSFVPLINKA   NREMLNQVSESFIDVV  TNSWPIV  TL+P CISSALYST++LQNE+L TFEGDR SVILGSN  +HEP+MDKQ +
Subjt:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM

Query:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN
        K Y FL LPLACH+LAIMLDAVLCN QAPQT++V VSNGCQKAEEFT KLI DICNLS++MLLQSSDHRSCAIRYLLP IFEALLS  +LEISIQG+ C+
Subjt:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN

Query:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT
        LSRNRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FFPHNEEL GAGMCDD EEFDIKADKDFW EIKRGLVDKE LVRKQS+HILK A SING GNT
Subjt:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT

Query:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN
        S+V  TISSGKDNNARGITKRERWANKEAKSLGV Q CSQHEIVTNS QQ WEAFILLYEMLEEYGSHLVEAAWNHQISLLL++PTSI FDSFTGG +QN
Subjt:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN

Query:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR
        QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFL IDWRN V CL SLP+TFIIGPFIE+LNDPVQHKDFGVKGVYSSKTIEGA HFI QYAN LDAR
Subjt:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR

Query:  ENLM
          ++
Subjt:  ENLM

A0A6J1DPL7 uncharacterized protein LOC111022655 isoform X57.8e-28882.95Show/hide
Query:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA
        M NNKSFSMASV  SLSESFR+VPPMAVPAILDCL ASTGLSPSEL    L+ F   +         D TTKEGKLD DQCNY+TSLVCALCHILKKNGA
Subjt:  MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGA

Query:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM
        DPDALK FIWKSFVPLINKA   NREMLNQVSESFIDVV  TNSWPIV  TL+P CISSALYST++LQNE+L TFEGDR SVILGSN  +HEP+MDKQ +
Subjt:  DPDALKSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTM

Query:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN
        K Y FL LPLACH+LAIMLDAVLCN QAPQT++V VSNGCQKAEEFT KLI DICNLS++MLLQSSDHRSCAIRYLLP IFEALLS  +LEISIQG+ C+
Subjt:  KAYEFLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCN

Query:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT
        LSRNRFLMKIW CCKKLFSFGTLERRDAY ILSLYL FFPHNEEL GAGMCDD EEFDIKADKDFW EIKRGLVDKE LVRKQS+HILK A SING GNT
Subjt:  LSRNRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNT

Query:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN
        S+V  TISSGKDNNARGITKRERWANKEAKSLGV Q CSQHEIVTNS QQ WEAFILLYEMLEEYGSHLVEAAWNHQISLLL++PTSI FDSFTGG +QN
Subjt:  STVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQN

Query:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR
        QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFL IDWRN V CL SLP+TFIIGPFIE+LNDPVQHKDFGVKGVYSSKTIEGA HFI QYAN LDAR
Subjt:  QIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDAR

Query:  ENLM
          ++
Subjt:  ENLM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G17610.1 tRNA/rRNA methyltransferase (SpoU) family protein7.2e-13744.26Show/hide
Query:  ASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNG----ADPDAL
        +SV +SLS SF++VPP A+PA LDC+ +STG+SPS L    +  F   L         D    + + D+D CN+I SLV  LCH+LK  G     + +AL
Subjt:  ASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNG----ADPDAL

Query:  KSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFE-GDRFSVILGSNVPLHEPRMDKQTMKAYE
        + F+W+ F+PL+      + +MLN++ ESF DVV  TN   ++G +L+PF + S  +S  + Q+EE D  + GD     L     L+   MD+  +    
Subjt:  KSFIWKSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFE-GDRFSVILGSNVPLHEPRMDKQTMKAYE

Query:  -FLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCNLSR
            +PL+CH+L ++L+A   + QA             K E F   ++WD+CN +E++L QS +HRSCA+ +LLP IF+A  S  SL+IS QG+   LSR
Subjt:  -FLQLPLACHVLAIMLDAVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCNLSR

Query:  NRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNTSTV
        N F+ +IW+CCKKLFS G++ERRDAY +LSL L      +         D  +FD++++++FWDEIK GLV  ESLVRKQSLHILK+  SI        V
Subjt:  NRFLMKIWKCCKKLFSFGTLERRDAYRILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNTSTV

Query:  QKTISSGK-DNNA--RGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFD----SFTGG
         +TIS  K + N+  R +T++E WA KEAKSLGVG++    +    S QQ W+AF+LLYEMLEEYG+HLVEAAW++QI LL++  +S+ +D    S    
Subjt:  QKTISSGK-DNNA--RGITKRERWANKEAKSLGVGQICSQHEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFD----SFTGG

Query:  VHQNQIEMSGE---IFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQY
         H   +E   E   IF+WL +LW RGF HDNPLVRC +M+SF  I+WR    C +S+ +TF++GPFIE LNDP  HKDFG+KG+Y+S+TIEGA  ++  Y
Subjt:  VHQNQIEMSGE---IFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSLPETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQY

Query:  ANILDARENL
         + L+ R  +
Subjt:  ANILDARENL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCAACAACAAGAGTTTCTCCATGGCCTCTGTTTTCAGTTCACTGTCAGAAAGCTTCCGGAGGGTGCCTCCAATGGCAGTTCCAGCTATTTTGGATTGCCTTTTTGC
TTCTACTGGGTTATCCCCATCCGAGCTTTTGATTCGCTTCTTGAATCTTTTCCGAAAAATATTGATGTACTACTCTCAAAAACCCCACTTTGATGCCACCACAAAGGAGG
GAAAGCTCGATGCTGATCAATGTAATTACATCACATCTTTAGTGTGCGCACTGTGTCACATACTTAAAAAAAATGGTGCCGATCCTGATGCTTTAAAGTCATTTATATGG
AAAAGTTTTGTTCCTTTGATAAATAAGGCAGATACAATGAATCGAGAAATGCTTAATCAGGTCTCTGAATCATTCATTGATGTTGTCTCTGGGACGAACTCATGGCCAAT
TGTTGGAGCAACTCTAATTCCATTTTGTATAAGTTCAGCTCTTTATTCCACGAGTGTGCTGCAAAATGAAGAGTTGGACACCTTTGAGGGTGATAGATTTTCTGTCATTT
TGGGCTCAAATGTCCCTCTGCACGAACCTAGAATGGATAAGCAGACGATGAAAGCATATGAGTTCCTTCAATTGCCATTAGCATGCCATGTTTTGGCTATAATGTTAGAT
GCTGTCCTTTGTAATGGTCAGGCACCACAAACATCAGATGTAGCGGTGTCAAATGGATGCCAAAAAGCTGAAGAGTTTACTGATAAACTAATTTGGGATATTTGCAATTT
ATCTGAAAAAATGCTCTTACAAAGCTCGGATCATCGATCTTGTGCCATTCGCTATCTTCTTCCAGGAATCTTTGAAGCACTTCTTTCTCACCGCTCTTTAGAGATCTCCA
TTCAAGGGCATACATGTAATCTCTCCAGGAATCGTTTCCTCATGAAAATATGGAAATGTTGCAAAAAACTATTTTCATTTGGAACTTTGGAGAGAAGAGATGCCTATAGG
ATTTTGTCTCTTTATTTATGTTTTTTCCCTCACAATGAAGAGCTTGGAGGTGCTGGAATGTGTGACGACGGCGAAGAATTTGACATAAAGGCTGATAAAGATTTTTGGGA
TGAAATTAAAAGAGGCTTGGTGGATAAGGAGAGCTTGGTGAGGAAGCAATCACTTCATATATTGAAGAACGCACAATCTATAAATGGAAGTGGCAATACATCTACAGTTC
AAAAGACAATTTCAAGTGGAAAAGATAATAATGCTCGAGGTATTACAAAAAGGGAAAGATGGGCTAATAAGGAAGCAAAATCACTCGGTGTAGGGCAAATTTGCAGTCAA
CATGAAATTGTTACAAATAGCCGGCAGCAACTGTGGGAAGCTTTCATACTTCTATATGAAATGCTTGAAGAATATGGTTCACACTTGGTTGAAGCTGCTTGGAATCACCA
GATATCCTTGTTACTACAAAATCCGACCTCTATTAATTTTGACAGCTTCACTGGTGGCGTTCATCAAAACCAAATTGAAATGTCTGGCGAAATCTTTAGTTGGTTATCAA
TCTTGTGGGTTCGGGGATTCCATCATGATAATCCTTTAGTTAGATGCTTGATCATGCAATCCTTTTTGGCTATTGACTGGAGGAATAATGTACCCTGTTTAAAGTCACTG
CCAGAAACTTTTATTATTGGACCTTTCATTGAATCACTCAACGATCCTGTGCAGCACAAAGATTTTGGTGTAAAAGGAGTTTACTCATCCAAGACAATTGAAGGTGCAAC
CCATTTTATATATCAATATGCAAATATTCTTGATGCAAGAGAGAATCTCATGAACAATTTAGACAAAGAACACCACATTGATGCCTTTAAACGAGAAGCTTCAGAACAAT
CAAACCAAGGAAGATACTTGCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCAACAACAAGAGTTTCTCCATGGCCTCTGTTTTCAGTTCACTGTCAGAAAGCTTCCGGAGGGTGCCTCCAATGGCAGTTCCAGCTATTTTGGATTGCCTTTTTGC
TTCTACTGGGTTATCCCCATCCGAGCTTTTGATTCGCTTCTTGAATCTTTTCCGAAAAATATTGATGTACTACTCTCAAAAACCCCACTTTGATGCCACCACAAAGGAGG
GAAAGCTCGATGCTGATCAATGTAATTACATCACATCTTTAGTGTGCGCACTGTGTCACATACTTAAAAAAAATGGTGCCGATCCTGATGCTTTAAAGTCATTTATATGG
AAAAGTTTTGTTCCTTTGATAAATAAGGCAGATACAATGAATCGAGAAATGCTTAATCAGGTCTCTGAATCATTCATTGATGTTGTCTCTGGGACGAACTCATGGCCAAT
TGTTGGAGCAACTCTAATTCCATTTTGTATAAGTTCAGCTCTTTATTCCACGAGTGTGCTGCAAAATGAAGAGTTGGACACCTTTGAGGGTGATAGATTTTCTGTCATTT
TGGGCTCAAATGTCCCTCTGCACGAACCTAGAATGGATAAGCAGACGATGAAAGCATATGAGTTCCTTCAATTGCCATTAGCATGCCATGTTTTGGCTATAATGTTAGAT
GCTGTCCTTTGTAATGGTCAGGCACCACAAACATCAGATGTAGCGGTGTCAAATGGATGCCAAAAAGCTGAAGAGTTTACTGATAAACTAATTTGGGATATTTGCAATTT
ATCTGAAAAAATGCTCTTACAAAGCTCGGATCATCGATCTTGTGCCATTCGCTATCTTCTTCCAGGAATCTTTGAAGCACTTCTTTCTCACCGCTCTTTAGAGATCTCCA
TTCAAGGGCATACATGTAATCTCTCCAGGAATCGTTTCCTCATGAAAATATGGAAATGTTGCAAAAAACTATTTTCATTTGGAACTTTGGAGAGAAGAGATGCCTATAGG
ATTTTGTCTCTTTATTTATGTTTTTTCCCTCACAATGAAGAGCTTGGAGGTGCTGGAATGTGTGACGACGGCGAAGAATTTGACATAAAGGCTGATAAAGATTTTTGGGA
TGAAATTAAAAGAGGCTTGGTGGATAAGGAGAGCTTGGTGAGGAAGCAATCACTTCATATATTGAAGAACGCACAATCTATAAATGGAAGTGGCAATACATCTACAGTTC
AAAAGACAATTTCAAGTGGAAAAGATAATAATGCTCGAGGTATTACAAAAAGGGAAAGATGGGCTAATAAGGAAGCAAAATCACTCGGTGTAGGGCAAATTTGCAGTCAA
CATGAAATTGTTACAAATAGCCGGCAGCAACTGTGGGAAGCTTTCATACTTCTATATGAAATGCTTGAAGAATATGGTTCACACTTGGTTGAAGCTGCTTGGAATCACCA
GATATCCTTGTTACTACAAAATCCGACCTCTATTAATTTTGACAGCTTCACTGGTGGCGTTCATCAAAACCAAATTGAAATGTCTGGCGAAATCTTTAGTTGGTTATCAA
TCTTGTGGGTTCGGGGATTCCATCATGATAATCCTTTAGTTAGATGCTTGATCATGCAATCCTTTTTGGCTATTGACTGGAGGAATAATGTACCCTGTTTAAAGTCACTG
CCAGAAACTTTTATTATTGGACCTTTCATTGAATCACTCAACGATCCTGTGCAGCACAAAGATTTTGGTGTAAAAGGAGTTTACTCATCCAAGACAATTGAAGGTGCAAC
CCATTTTATATATCAATATGCAAATATTCTTGATGCAAGAGAGAATCTCATGAACAATTTAGACAAAGAACACCACATTGATGCCTTTAAACGAGAAGCTTCAGAACAAT
CAAACCAAGGAAGATACTTGCTCTGA
Protein sequenceShow/hide protein sequence
MSNNKSFSMASVFSSLSESFRRVPPMAVPAILDCLFASTGLSPSELLIRFLNLFRKILMYYSQKPHFDATTKEGKLDADQCNYITSLVCALCHILKKNGADPDALKSFIW
KSFVPLINKADTMNREMLNQVSESFIDVVSGTNSWPIVGATLIPFCISSALYSTSVLQNEELDTFEGDRFSVILGSNVPLHEPRMDKQTMKAYEFLQLPLACHVLAIMLD
AVLCNGQAPQTSDVAVSNGCQKAEEFTDKLIWDICNLSEKMLLQSSDHRSCAIRYLLPGIFEALLSHRSLEISIQGHTCNLSRNRFLMKIWKCCKKLFSFGTLERRDAYR
ILSLYLCFFPHNEELGGAGMCDDGEEFDIKADKDFWDEIKRGLVDKESLVRKQSLHILKNAQSINGSGNTSTVQKTISSGKDNNARGITKRERWANKEAKSLGVGQICSQ
HEIVTNSRQQLWEAFILLYEMLEEYGSHLVEAAWNHQISLLLQNPTSINFDSFTGGVHQNQIEMSGEIFSWLSILWVRGFHHDNPLVRCLIMQSFLAIDWRNNVPCLKSL
PETFIIGPFIESLNDPVQHKDFGVKGVYSSKTIEGATHFIYQYANILDARENLMNNLDKEHHIDAFKREASEQSNQGRYLL