CuGenDBv2

Spg035486 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg035486
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Nudix hydrolase domain-containing protein
Genome location: scaffold3:1004561..1011925
RNA-Seq Expression: Spg035486
Synteny: Spg035486
Gene Ontology terms: NA
InterPro domains: NA
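
The genome location value for this gene follows a `seqid:start..end` pattern. As a minimal sketch, it can be split into its parts as shown here. The function name `parse_location` is hypothetical, and the span calculation assumes both coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str):
    """Split 'seqid:start..end' into (seqid, start, end, span_bp)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: both coordinates are 1-based and inclusive.
    return seqid, start, end, end - start + 1


print(parse_location("scaffold3:1004561..1011925"))
```

Under that coordinate assumption the locus spans 7365 bp on scaffold3.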


Homology
GenBank top hits
XP_022922297.1 uncharacterized protein LOC111430319 [Cucurbita moschata]; e-value 7.9e-118; identity 80.21%
Query:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPNP--NPKTPLQLHPFSSPQSLFEWLTPRLP
        P P P P+PI +L HL +S  LPDFFLAALSLFVF  SSSSS+SFKFPL  I    R  LKTPSMS SHPN    P    +LHPF+SPQSL +WL PRLP
Subjt:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPNP--NPKTPLQLHPFSSPQSLFEWLTPRLP

Query:  SDSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL
        SDSFASWGVKPGTKNVHNLWLELS GETSLADSNPP+RTVQVLSLRIID H R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL
Subjt:  SDSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL

Query:  AHSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWL
          SDCSEIV+IVPDSY+ KIEERNSASYPGLPACYVLHSMDV V+GLP+ DF TVEEEEY NS++T IADEAVSVKKHFWKW+
Subjt:  AHSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWL

XP_038874472.1 uncharacterized protein LOC120067121 isoform X1 [Benincasa hispida]; e-value 1.2e-118; identity 79.3%
Query:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPN-PNPKTPLQLHPFSSPQSLFEWLTPRLPS
        P+P P P+P  NL HL+KSTALPDFFLAALSLFVFFSSSSSSKSFKFP FSI    R  LK PS S+     PN K+ L L  F+SPQSL EWL PRLPS
Subjt:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPN-PNPKTPLQLHPFSSPQSLFEWLTPRLPS

Query:  DSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILA
        DSFASWGV PGTKNVHNLWLE+S GETSLADSNPP+RT+ VLSLRI+D HHR+L+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSI+ 
Subjt:  DSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILA

Query:  HSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWLFFG
         SDCS+IVRIVPDSY+ KIEERNS SYPGLPACYVLHSMDVWV+GLPEG+F TVEEEEYGNS++T IAD+AVSVKKHFWKW+  G
Subjt:  HSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWLFFG

XP_038874473.1 uncharacterized protein LOC120067121 isoform X2 [Benincasa hispida]; e-value 2.1e-118; identity 79.3%
Query:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPN-PNPKTPLQLHPFSSPQSLFEWLTPRLPS
        P+P P P+P  NL HL+KSTALPDFFLAALSLFVFFSSSSSSKSFKFP FSI    R  LK PS S+     PN K+ L L  F+SPQSL EWL PRLPS
Subjt:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPN-PNPKTPLQLHPFSSPQSLFEWLTPRLPS

Query:  DSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILA
        DSFASWGV PGTKNVHNLWLE+S GETSLADSNPP+RT+ VLSLRI+D HHR+L+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSI+ 
Subjt:  DSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILA

Query:  HSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWLFFG
         SDCS+IVRIVPDSY+ KIEERNS SYPGLPACYVLHSMDVWV+GLPEG+F TVEEEEYGNS++T IAD+AVSVKKHFWKW+  G
Subjt:  HSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWLFFG

XP_038874474.1 uncharacterized protein LOC120067121 isoform X3 [Benincasa hispida]; e-value 3.5e-118; identity 80.07%
Query:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPN-PNPKTPLQLHPFSSPQSLFEWLTPRLPS
        P+P P P+P  NL HL+KSTALPDFFLAALSLFVFFSSSSSSKSFKFP FSI    R  LK PS S+     PN K+ L L  F+SPQSL EWL PRLPS
Subjt:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPN-PNPKTPLQLHPFSSPQSLFEWLTPRLPS

Query:  DSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILA
        DSFASWGV PGTKNVHNLWLE+S GETSLADSNPP+RT+ VLSLRI+D HHR+L+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSI+ 
Subjt:  DSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILA

Query:  HSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKW
         SDCS+IVRIVPDSY+ KIEERNS SYPGLPACYVLHSMDVWV+GLPEG+F TVEEEEYGNS++T IAD+AVSVKKHFWKW
Subjt:  HSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKW

XP_038874475.1 uncharacterized protein LOC120067121 isoform X4 [Benincasa hispida]; e-value 3.5e-118; identity 80.07%
Query:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPN-PNPKTPLQLHPFSSPQSLFEWLTPRLPS
        P+P P P+P  NL HL+KSTALPDFFLAALSLFVFFSSSSSSKSFKFP FSI    R  LK PS S+     PN K+ L L  F+SPQSL EWL PRLPS
Subjt:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPN-PNPKTPLQLHPFSSPQSLFEWLTPRLPS

Query:  DSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILA
        DSFASWGV PGTKNVHNLWLE+S GETSLADSNPP+RT+ VLSLRI+D HHR+L+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSI+ 
Subjt:  DSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILA

Query:  HSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKW
         SDCS+IVRIVPDSY+ KIEERNS SYPGLPACYVLHSMDVWV+GLPEG+F TVEEEEYGNS++T IAD+AVSVKKHFWKW
Subjt:  HSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKW

TrEMBL top hits
A0A0A0KJQ6 Uncharacterized protein; e-value 8.2e-113; identity 77.74%
Query:  PSPPPQPRPIFNLTHLDKS-TALPDFFLAALSLFVFF-SSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPNPNPKTPLQLHPFSSPQSLFEWLTPRLP
        P PPPQP+PI NLTHL+KS  ALPDFFLAALSLF F  SSSSSSKSFKFP FSI    R   K PS+S+  PN N K+      F+SPQSL EWL PRLP
Subjt:  PSPPPQPRPIFNLTHLDKS-TALPDFFLAALSLFVFF-SSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPNPNPKTPLQLHPFSSPQSLFEWLTPRLP

Query:  SDSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL
        S SFASWGV PGTKN+HNLWLE+S GETSLADSNPP+RT+ VLSLRIID HHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELGSIL
Subjt:  SDSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL

Query:  AHSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWL
          SD S++VRIVPDSY+ KIEER+S SYPGL A YVLHSMDVWV+GLP+GDF TVEEEEY NS+DT IAD AVSVKKHFWKW+
Subjt:  AHSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWL

A0A1S3AUP2 uncharacterized protein LOC103483001; e-value 2.3e-115; identity 78.37%
Query:  PSPPPQPRPIFNLTHLDKST-ALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPNPNPKTPLQLHPFSSPQSLFEWLTPRLPS
        P PPPQP+PI NLTHL+KST ALPDFFLAALSLF FFSSSS SKSFKFP FSI    R  LK PS S+  PN N K+      F+SPQSL EWL PRLPS
Subjt:  PSPPPQPRPIFNLTHLDKST-ALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPNPNPKTPLQLHPFSSPQSLFEWLTPRLPS

Query:  DSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILA
         SFASWGV PGTKN+HNLWLE+S GETSLADSNPP+R + VLSLRIID HHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELGSILA
Subjt:  DSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILA

Query:  HSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWL
         SDCS +VRIVPDSY+ KIEER+S SYPGLPACYVLHSMD+ V+GLP+GDF TVE+EEY NS++T IAD+AVSVKKHFWKW+
Subjt:  HSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWL

A0A6J1CCN3 uncharacterized protein LOC111009508; e-value 2.3e-115; identity 77.7%
Query:  MPSPP---PQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPL----FSILRPSLKTPSMSLSHPNPNPKTPLQLHPFSSPQSLFEWLTP
        MPSPP   P P PI NLTHL+KST LPDF+LAALSLFVFF  SSSSKSFKFPL    F+  R  LK PSMSLSH  P+PKT L  H F+SPQSL +WL P
Subjt:  MPSPP---PQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPL----FSILRPSLKTPSMSLSHPNPNPKTPLQLHPFSSPQSLFEWLTP

Query:  RLPSDSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
        RLPSDSFASWGVKPGTKNVHNLWLE+S GETSLADSNPP+RTVQV+SLRI+DKH+R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG
Subjt:  RLPSDSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELG

Query:  SILAHSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADE-AVSVKKHFWKWL
        SI+   DC EIVRIVP+SY+ KIEERNS SYPGLPACYVLHSMDVWV+GLP+ +F TVEEEEY  S++T+IA + AVSVKKHFWKW+
Subjt:  SILAHSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADE-AVSVKKHFWKWL

A0A6J1E2U8 uncharacterized protein LOC111430319; e-value 3.8e-118; identity 80.21%
Query:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPNP--NPKTPLQLHPFSSPQSLFEWLTPRLP
        P P P P+PI +L HL +S  LPDFFLAALSLFVF  SSSSS+SFKFPL  I    R  LKTPSMS SHPN    P    +LHPF+SPQSL +WL PRLP
Subjt:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPNP--NPKTPLQLHPFSSPQSLFEWLTPRLP

Query:  SDSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL
        SDSFASWGVKPGTKNVHNLWLELS GETSLADSNPP+RTVQVLSLRIID H R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL
Subjt:  SDSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL

Query:  AHSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWL
          SDCSEIV+IVPDSY+ KIEERNSASYPGLPACYVLHSMDV V+GLP+ DF TVEEEEY NS++T IADEAVSVKKHFWKW+
Subjt:  AHSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWL

A0A6J1IA21 uncharacterized protein LOC111472979; e-value 1.2e-116; identity 79.15%
Query:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPNP--NPKTPLQLHPFSSPQSLFEWLTPRLP
        P P P P+PI +L HL +S  LPDFFLAALSLFVF  SSSSS+SFK PL  I    R  LKTPSMS SHPN    P    +LHPF+SPQSL +WL PRLP
Subjt:  PSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSIL---RPSLKTPSMSLSHPNP--NPKTPLQLHPFSSPQSLFEWLTPRLP

Query:  SDSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL
        SDSFASWGVKPGTKNVHNLWLELS GETSLADS PP+RTVQVLSLRIID H R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL
Subjt:  SDSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSIL

Query:  AHSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWL
          SDCSEIV+IVPDSY+ KIEERNSASYPGLPACYVLHSMDV V+GLP+ DF TVEEEEY NS+++ IADEAVSVKKHFWKW+
Subjt:  AHSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWL

SwissProt top hits
No hits found

Arabidopsis top hits
AT5G24460.1 unknown protein; e-value 1.7e-73; identity 51.71%
Query:  PPPQPRPIFNLTHLDK--------STALPDFFLAALSLFVFFSSSSSSKSFKFPLFSILRPSLKTPSMSLSHPNPNPKTPLQLHPFSSPQSLFEWLTPRL
        P P   P+ N  +++         ++ALPD FLAA+SL   +SS     S     FS      +    ++S  +P P  P Q   F++PQSL +WL  RL
Subjt:  PPPQPRPIFNLTHLDK--------STALPDFFLAALSLFVFFSSSSSSKSFKFPLFSILRPSLKTPSMSLSHPNPNPKTPLQLHPFSSPQSLFEWLTPRL

Query:  PSDSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSI
        PSDSFA+WGVKPGTKNVHNLWLELS GETSLADS PP+RTV V+++R+I K+ R+L+E+HQ+LSDG++R R RPLSEKMKP E+P+ AV+RA+KEELGSI
Subjt:  PSDSFASWGVKPGTKNVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSI

Query:  L--AHSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTK------IADEAVSVKKHFWKWL
              +  + ++I+P +Y R++EERNS SYPGLPA Y LHS++  V+GLPE DF T EE+EY   D TK       A  AV+VK+H+WKW+
Subjt:  L--AHSDCSEIVRIVPDSYQRKIEERNSASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTK------IADEAVSVKKHFWKWL
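
In the alignments above, the unlabeled middle line follows the usual BLAST pairwise convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below tallies those three classes for one short fragment, taken from the end of the first GenBank alignment. The helper name `summarize_midline` is hypothetical, and the per-hit identity percentages listed in this record are computed over whole alignments, not over fragments like this one.

```python
def summarize_midline(query: str, midline: str):
    """Count (identities, conservative substitutions, mismatches or gaps)."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = positives = other = 0
    for q_char, m_char in zip(query, midline):
        if m_char == "+":
            positives += 1      # conservative substitution
        elif m_char != " " and m_char == q_char:
            identities += 1     # identical residue echoed in the midline
        else:
            other += 1          # mismatch or gap column
    return identities, positives, other


# Last 11 columns of the first GenBank alignment in this record.
query_tail = "VSVKKHFWKWL"
midline_tail = "VSVKKHFWKW+"
print(summarize_midline(query_tail, midline_tail))
```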


Sequences
CDS sequence
ATGCCATCACCCCCACCACAACCACGACCCATCTTCAATCTTACTCACCTCGACAAATCCACGGCTCTTCCCGACTTTTTCCTCGCGGCCCTCTCTCTTTTCGTTTTCTT
CTCTTCTTCTTCCTCCTCCAAATCCTTCAAATTTCCTCTTTTCTCCATTCTTCGCCCTTCTCTCAAGACACCCTCCATGTCCCTCTCACATCCCAACCCCAACCCCAAAA
CTCCCCTCCAACTCCACCCCTTCTCCTCTCCTCAATCCCTCTTCGAATGGCTTACACCTCGCTTGCCCTCAGACTCTTTTGCTTCTTGGGGCGTCAAGCCTGGCACCAAG
AACGTCCACAACCTCTGGCTCGAGCTCTCTCATGGAGAAACTTCCCTTGCCGACTCAAACCCTCCCCTCCGCACCGTTCAGGTCCTTTCTCTTCGAATTATTGATAAACA
TCACCGCCTTCTCCTCGAATCCCACCAGCAACTCTCCGATGGCACCCTACGGAATCGAAATCGACCCTTGTCGGAGAAAATGAAGCCCAATGAGACCCCTGAATCCGCCG
TCTACCGGGCTGTCAAAGAGGAGCTCGGTTCGATTCTTGCCCACTCTGATTGTTCTGAAATTGTGAGGATCGTTCCAGATTCCTATCAAAGGAAGATCGAGGAGCGGAAC
TCGGCTTCGTACCCTGGTTTGCCGGCTTGCTACGTTTTGCATTCGATGGATGTTTGGGTGGATGGCTTACCCGAGGGAGACTTCTTCACTGTGGAGGAGGAGGAATACGG
AAATTCTGACGACACAAAAATTGCGGACGAGGCTGTGTCCGTCAAGAAGCATTTTTGGAAATGGCTTTTTTTTGGTTTAGGACATGTTTGGATTGATTTTGTGAGGGTCA
AAATCACTCCTAAACATGCCATTTCTAGAATTCCTCGAATATTTGTGTGTTACGGTGGTGTATGGAAGGAGAATGAAAGAGGTTATGAAGGTGGCTATTTAGCAGGATTG
GATGTGAATATTGACATAAGGTATGAAGAGTTTTTAAGAGAGCTGTATAATCTGAGTGGCATTAATCCCGATCAATTTGACCTTATAATAAGATGTTTATACAATTTTGG
ATCGAGAGTTCCTACTTATTTGATTAGAAATGACAGAGAATTAAGATTCTTTCTAAGCGGAGACGATGCATCTAACTTGCCGTTATTCTTATCAATGATTCCAAAAGCTG
TTCATGGTAGTGGCAGTAACTTAGTGGAAGAGTTTTTAAGAGAGCTGTATAATCTGAGTGGCATTAATCCCGATCAGTTTGACCTTATAATGAGATGTTTATACAATTTT
GGATCAAGAGTTCCTACTTATTTGATTAGAAATGACAGAGAACTAAGATTCTTTCTAAGCGGAGACGATGCATCTAACTTGTCGTTATTCTTATCAATGATTCCAAAAGC
TGTTCATGGTAGTGGCAGTAACTTAGGTAGTCATAGTCATATACCACACTCTCTCTCACAACCTTTTCCACAATCGCCACCATCATTTGCACAAGTTGGACCCTACAATT
TGGTCGAAAACATACCATCATCTATTCCCTCGTATCATCCGACATATGATAACGTATCATCACCAACCCCTATAAATAATACTTTTGTCCCACTAGACTTAGCCGATGAT
AGTACACCAAATTTTGGGATTGAAAAGAATTTGTATACTGATGATGATACACTTAATTACGAGGAAGAAGAGTATGATGAAGAGGAAGATGAAAACTGGGAATACAACGA
GGAGGATGAAAAATTAGATGAATCATACGAAGAATCTGAAGCAGATGAATTTGAGGAGGTAGATGATTATGCAAAATTGCTTTCAAGACCCTCCCTCACGTTGTTCTCCT
TCTCTCCTACTCTCCCGCTGCTCGTGCTCTCATCTCTACTCTCCCGCCGGCACCGTTCTCACGCCCACGTCGCTGATCAGCCGACTGCTGAAGTATCTCTCCTTTCACGC
CAGTGCCATCAGCTTCCAGTTCTCGCCGTCGTTCTAGAGTTCACGCTGTCAGCTTCGTTTTCTCGCCGCCGTTCTCGAGTTCGCGCCGTCAACTACACTAAGCAAGGACA
GGAACAGGATGTATTTATAGGGTAA
mRNA sequence
ATGCCATCACCCCCACCACAACCACGACCCATCTTCAATCTTACTCACCTCGACAAATCCACGGCTCTTCCCGACTTTTTCCTCGCGGCCCTCTCTCTTTTCGTTTTCTT
CTCTTCTTCTTCCTCCTCCAAATCCTTCAAATTTCCTCTTTTCTCCATTCTTCGCCCTTCTCTCAAGACACCCTCCATGTCCCTCTCACATCCCAACCCCAACCCCAAAA
CTCCCCTCCAACTCCACCCCTTCTCCTCTCCTCAATCCCTCTTCGAATGGCTTACACCTCGCTTGCCCTCAGACTCTTTTGCTTCTTGGGGCGTCAAGCCTGGCACCAAG
AACGTCCACAACCTCTGGCTCGAGCTCTCTCATGGAGAAACTTCCCTTGCCGACTCAAACCCTCCCCTCCGCACCGTTCAGGTCCTTTCTCTTCGAATTATTGATAAACA
TCACCGCCTTCTCCTCGAATCCCACCAGCAACTCTCCGATGGCACCCTACGGAATCGAAATCGACCCTTGTCGGAGAAAATGAAGCCCAATGAGACCCCTGAATCCGCCG
TCTACCGGGCTGTCAAAGAGGAGCTCGGTTCGATTCTTGCCCACTCTGATTGTTCTGAAATTGTGAGGATCGTTCCAGATTCCTATCAAAGGAAGATCGAGGAGCGGAAC
TCGGCTTCGTACCCTGGTTTGCCGGCTTGCTACGTTTTGCATTCGATGGATGTTTGGGTGGATGGCTTACCCGAGGGAGACTTCTTCACTGTGGAGGAGGAGGAATACGG
AAATTCTGACGACACAAAAATTGCGGACGAGGCTGTGTCCGTCAAGAAGCATTTTTGGAAATGGCTTTTTTTTGGTTTAGGACATGTTTGGATTGATTTTGTGAGGGTCA
AAATCACTCCTAAACATGCCATTTCTAGAATTCCTCGAATATTTGTGTGTTACGGTGGTGTATGGAAGGAGAATGAAAGAGGTTATGAAGGTGGCTATTTAGCAGGATTG
GATGTGAATATTGACATAAGGTATGAAGAGTTTTTAAGAGAGCTGTATAATCTGAGTGGCATTAATCCCGATCAATTTGACCTTATAATAAGATGTTTATACAATTTTGG
ATCGAGAGTTCCTACTTATTTGATTAGAAATGACAGAGAATTAAGATTCTTTCTAAGCGGAGACGATGCATCTAACTTGCCGTTATTCTTATCAATGATTCCAAAAGCTG
TTCATGGTAGTGGCAGTAACTTAGTGGAAGAGTTTTTAAGAGAGCTGTATAATCTGAGTGGCATTAATCCCGATCAGTTTGACCTTATAATGAGATGTTTATACAATTTT
GGATCAAGAGTTCCTACTTATTTGATTAGAAATGACAGAGAACTAAGATTCTTTCTAAGCGGAGACGATGCATCTAACTTGTCGTTATTCTTATCAATGATTCCAAAAGC
TGTTCATGGTAGTGGCAGTAACTTAGGTAGTCATAGTCATATACCACACTCTCTCTCACAACCTTTTCCACAATCGCCACCATCATTTGCACAAGTTGGACCCTACAATT
TGGTCGAAAACATACCATCATCTATTCCCTCGTATCATCCGACATATGATAACGTATCATCACCAACCCCTATAAATAATACTTTTGTCCCACTAGACTTAGCCGATGAT
AGTACACCAAATTTTGGGATTGAAAAGAATTTGTATACTGATGATGATACACTTAATTACGAGGAAGAAGAGTATGATGAAGAGGAAGATGAAAACTGGGAATACAACGA
GGAGGATGAAAAATTAGATGAATCATACGAAGAATCTGAAGCAGATGAATTTGAGGAGGTAGATGATTATGCAAAATTGCTTTCAAGACCCTCCCTCACGTTGTTCTCCT
TCTCTCCTACTCTCCCGCTGCTCGTGCTCTCATCTCTACTCTCCCGCCGGCACCGTTCTCACGCCCACGTCGCTGATCAGCCGACTGCTGAAGTATCTCTCCTTTCACGC
CAGTGCCATCAGCTTCCAGTTCTCGCCGTCGTTCTAGAGTTCACGCTGTCAGCTTCGTTTTCTCGCCGCCGTTCTCGAGTTCGCGCCGTCAACTACACTAAGCAAGGACA
GGAACAGGATGTATTTATAGGGTAA
Protein sequence
MPSPPPQPRPIFNLTHLDKSTALPDFFLAALSLFVFFSSSSSSKSFKFPLFSILRPSLKTPSMSLSHPNPNPKTPLQLHPFSSPQSLFEWLTPRLPSDSFASWGVKPGTK
NVHNLWLELSHGETSLADSNPPLRTVQVLSLRIIDKHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVKEELGSILAHSDCSEIVRIVPDSYQRKIEERN
SASYPGLPACYVLHSMDVWVDGLPEGDFFTVEEEEYGNSDDTKIADEAVSVKKHFWKWLFFGLGHVWIDFVRVKITPKHAISRIPRIFVCYGGVWKENERGYEGGYLAGL
DVNIDIRYEEFLRELYNLSGINPDQFDLIIRCLYNFGSRVPTYLIRNDRELRFFLSGDDASNLPLFLSMIPKAVHGSGSNLVEEFLRELYNLSGINPDQFDLIMRCLYNF
GSRVPTYLIRNDRELRFFLSGDDASNLSLFLSMIPKAVHGSGSNLGSHSHIPHSLSQPFPQSPPSFAQVGPYNLVENIPSSIPSYHPTYDNVSSPTPINNTFVPLDLADD
STPNFGIEKNLYTDDDTLNYEEEEYDEEEDENWEYNEEDEKLDESYEESEADEFEEVDDYAKLLSRPSLTLFSFSPTLPLLVLSSLLSRRHRSHAHVADQPTAEVSLLSR
QCHQLPVLAVVLEFTLSASFSRRRSRVRAVNYTKQGQEQDVFIG
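
As a quick consistency check, the start of the CDS can be translated and compared with the start of the protein sequence. This is only a sketch: it assumes the standard nuclear genetic code, the names `CODON_TABLE` and `translate` are hypothetical, and only the first 36 nucleotides of the CDS are used.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X for unknown codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 36 nucleotides of the CDS sequence in this record.
cds_start = "ATGCCATCACCCCCACCACAACCACGACCCATCTTC"
print(translate(cds_start))
```

The result can be compared with the first twelve residues of the protein sequence, `MPSPPPQPRPIF`.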