CuGenDBv2

IVF0005939 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0005939
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Nudix hydrolase domain-containing protein
Genome location: chr09:23725822..23726709
RNA-Seq Expression: IVF0005939
Synteny: IVF0005939
Gene Ontology terms: NA
InterPro domains: NA
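
The genome location string can be split into chromosome, start, and end to get the span of the gene model. The sketch here is a minimal illustration, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` and `span_length` are hypothetical helper names, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr09:23725822..23726709")
print(chrom, span_length(start, end))  # prints: chr09 888
```

Under that assumption the span is 888 bp, which would be consistent with a 295-residue protein plus a stop codon (295 × 3 + 3 = 888) and no introns in the gene model.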


Homology
GenBank top hits (e value, % identity, alignment)
XP_008437661.1 PREDICTED: uncharacterized protein LOC103483001 [Cucumis melo]; e value 5.90e-207; identity 98.98%
Query:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWL
        MPPAPPPPPPPIPPP+PQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFP FSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWL
Subjt:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWL

Query:  QPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEE
        QPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEE
Subjt:  QPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEE

Query:  LGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPESVDL
        LGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKH+WKWVSPESVDL
Subjt:  LGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPESVDL

XP_011654586.1 uncharacterized protein LOC101208896 isoform X1 [Cucumis sativus]; e value 1.20e-180; identity 90.75%
Query:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPS-KSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEW
        MPPAPPPP  P+PPP+PQPISNLTHLNKS AALPDFFLAALSLFAF SSSS S KSFKFP FSIQLNPRRF KIPS SMEFPNSNSKSLTFTSPQSLSEW
Subjt:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPS-KSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEW

Query:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
        LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIR LHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
Subjt:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE

Query:  ELGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPE
        ELGSIL DSD S LVRIVPDSY+LKIEERDSVSYPGL A YVLHSMD+ VEGLPDGDFCTVE+EEYVNSE+TNIAD AVSVKKH+WKW   E
Subjt:  ELGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPE

XP_011654587.1 uncharacterized protein LOC101208896 isoform X3 [Cucumis sativus]; e value 1.56e-180; identity 91.67%
Query:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPS-KSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEW
        MPPAPPPP  P+PPP+PQPISNLTHLNKS AALPDFFLAALSLFAF SSSS S KSFKFP FSIQLNPRRF KIPS SMEFPNSNSKSLTFTSPQSLSEW
Subjt:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPS-KSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEW

Query:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
        LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIR LHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
Subjt:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE

Query:  ELGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKW
        ELGSIL DSD S LVRIVPDSY+LKIEERDSVSYPGL A YVLHSMD+ VEGLPDGDFCTVE+EEYVNSE+TNIAD AVSVKKH+WKW
Subjt:  ELGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKW

XP_031741555.1 uncharacterized protein LOC101208896 isoform X2 [Cucumis sativus]; e value 2.42e-180; identity 91.67%
Query:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPS-KSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEW
        MPPAPPPP  P+PPP+PQPISNLTHLNKS AALPDFFLAALSLFAF SSSS S KSFKFP FSIQLNPRRF KIPS SMEFPNSNSKSLTFTSPQSLSEW
Subjt:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPS-KSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEW

Query:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
        LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIR LHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
Subjt:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE

Query:  ELGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKW
        ELGSIL DSD S LVRIVPDSY+LKIEERDSVSYPGL A YVLHSMD+ VEGLPDGDFCTVE+EEYVNSE+TNIAD AVSVKKH+WKW
Subjt:  ELGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKW

XP_038874473.1 uncharacterized protein LOC120067121 isoform X2 [Benincasa hispida]; e value 2.14e-169; identity 86.48%
Query:  PIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSM---EFPNSNSKSLTFTSPQSLSEWLQPRLPSH
        P P P PQP SNL HLNKSTA LPDFFLAALSLF FFSSSS SKSFKFP FSIQLNPRRFLKIPSNSM   +FPNS S SLTFTSPQSLSEWL+PRLPS 
Subjt:  PIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSM---EFPNSNSKSLTFTSPQSLSEWLQPRLPSH

Query:  SFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSILAD
        SFASWGV PGTKN+HNLWLEISQGETSLADSNPPIR LHVLSLRI+DNHHR+L+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELGSI+ D
Subjt:  SFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSILAD

Query:  SDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWV
        SDCS +VRIVPDSYK+KIEER+SVSYPGLPACYVLHSMD+ VEGLP+G+FCTVE+EEY NSEET+IADQAVSVKKH+WKWV
Subjt:  SDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWV

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KJQ6 Uncharacterized protein; e value 7.6e-145; identity 91.86%
Query:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFF-SSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEW
        MPPA  PPPPP+PPP+PQPISNLTHLNKS AALPDFFLAALSLFAF  SSSS SKSFKFP FSIQLNPRRF KIPS SMEFPNSNSKSLTFTSPQSLSEW
Subjt:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFF-SSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEW

Query:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
        LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIR LHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
Subjt:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE

Query:  ELGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPESVD
        ELGSIL DSD S LVRIVPDSY+LKIEERDSVSYPGL A YVLHSMD+ VEGLPDGDFCTVE+EEYVNSE+TNIAD AVSVKKH+WKWVSPESVD
Subjt:  ELGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPESVD

A0A1S3AUP2 uncharacterized protein LOC103483001; e value 5.8e-161; identity 98.98%
Query:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWL
        MPPAPPPPPPPIPPP+PQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFP FSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWL
Subjt:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWL

Query:  QPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEE
        QPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEE
Subjt:  QPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEE

Query:  LGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPESVDL
        LGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKH+WKWVSPESVDL
Subjt:  LGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPESVDL

A0A6J1CCN3 uncharacterized protein LOC111009508; e value 4.5e-113; identity 75.43%
Query:  PPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNP-RRFLKIPSNSMEFPNSNSK--SLTFTSPQSLSEWLQP
        P PPPP+PP  P PISNLTHLNKST  LPDF+LAALSLF FFSSS  SKSFKFP+   Q NP RRFLKIPS S+  P+  ++  +  F SPQSLS+WL P
Subjt:  PPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNP-RRFLKIPSNSMEFPNSNSK--SLTFTSPQSLSEWLQP

Query:  RLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELG
        RLPS SFASWGV PGTKN+HNLWLEIS+GETSLADSNPPIR + V+SLRI+D H+R+L+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EELG
Subjt:  RLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELG

Query:  SILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQ-AVSVKKHYWKWVSPESVD
        SI+ D DC  +VRIVP+SYK+KIEER+SVSYPGLPACYVLHSMD+ VEGLPD +FCTVE+EEY  SEET IA + AVSVKKH+WKWVS +SVD
Subjt:  SILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQ-AVSVKKHYWKWVSPESVD

A0A6J1E2U8 uncharacterized protein LOC111430319; e value 1.4e-117; identity 76.33%
Query:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSN------SKSLTFTSPQ
        MP APPP PP      PQPIS+L HL +S   LPDFFLAALSLF F SSSS S+SFKFP+  IQ NPRRFLK PS S   PNS        +   FTSPQ
Subjt:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSN------SKSLTFTSPQ

Query:  SLSEWLQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVY
        SLS+WL+PRLPS SFASWGV PGTKN+HNLWLE+S+GETSLADSNPPIR + VLSLRIIDNH R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVY
Subjt:  SLSEWLQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVY

Query:  RAVQEELGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPESVD
        RAV+EELGSIL DSDCS +V+IVPDSYK+KIEER+S SYPGLPACYVLHSMD+ VEGLP  DFCTVE+EEYVNSEET+IAD+AVSVKKH+WKWVS +S+D
Subjt:  RAVQEELGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPESVD

A0A6J1IA21 uncharacterized protein LOC111472979; e value 1.1e-114; identity 74.67%
Query:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSN------SKSLTFTSPQ
        MP APPP PP      PQPIS+L HL +S   LPDFFLAALSLF F SSSS S+SFK P+  IQ NPRRFLK PS S   PNS        +   F SPQ
Subjt:  MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSN------SKSLTFTSPQ

Query:  SLSEWLQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVY
        SLS+WL+PRLPS SFASWGV PGTKN+HNLWLE+S+GETSLADS PPIR + VLSLRIIDNH R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVY
Subjt:  SLSEWLQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVY

Query:  RAVQEELGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPESVD
        RAV+EELGSIL DSDCS +V+IVPDSYK+KIEER+S SYPGLPACYVLHSMD+ VEGLP  DFCTVE+EEY NSEE++IAD+AVSVKKH+WKWVS +S+D
Subjt:  RAVQEELGSILADSDCSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPESVD

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT5G24460.1 unknown protein; e value 2.8e-75; identity 53.22%
Query:  PPPPPIPPPRPQPISNLTHLNKS-TAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQPRLP
        P PP  P    + I+N    N + T+ALPD FLAA+SL   +SS  P  S     FS  LNPRR   I + S   P     +  F +PQSLS+WL+ RLP
Subjt:  PPPPPIPPPRPQPISNLTHLNKS-TAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQPRLP

Query:  SHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSIL
        S SFA+WGV PGTKN+HNLWLE+S GETSLADS PP+R ++V+++R+I  + R+L+E+HQ+LSDG++R R RPLSEKMKP E+P+ AV+RA++EELGSI 
Subjt:  SHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSIL

Query:  -ADSD-CSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKE-----EYVNSEETNIADQAVSVKKHYWKWVSPESV
          D D     ++I+P +Y  ++EER+S+SYPGLPA Y LHS++  VEGLP+ DFCT EKE        +S ET  A  AV+VK+HYWKWVSP+S+
Subjt:  -ADSD-CSPLVRIVPDSYKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKE-----EYVNSEETNIADQAVSVKKHYWKWVSPESV
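
The % identity values listed for each hit can be approximated from the middle line of an alignment block, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch here assumes that convention and assumes identity is computed over all alignment columns, gaps included; `percent_identity` is a hypothetical helper and the toy alignment is made up for illustration.

```python
def percent_identity(query_segments, midline_segments):
    """Percent of alignment columns whose midline character is a letter.

    query_segments supplies the alignment length, because trailing spaces
    in a midline are easily lost when the text is copied.
    """
    columns = sum(len(segment) for segment in query_segments)
    if columns == 0:
        raise ValueError("empty alignment")
    identical = sum(ch.isalpha() for line in midline_segments for ch in line)
    return 100.0 * identical / columns


# Toy alignment (hypothetical, not taken from the page):
# 8 identical columns, one '+' and one mismatch out of 10.
query = ["ACDEFGHIKL"]
midline = ["ACD+FG IKL"]
print(round(percent_identity(query, midline), 2))  # prints: 80.0
```

For the first GenBank hit, the midline appears to show three non-identical columns (two `+` and one space) across 295 columns, and 292/295 rounds to the listed 98.98.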


Sequences
CDS sequence
ATGCCACCAGCCCCACCTCCACCTCCACCTCCAATTCCACCCCCACGACCGCAGCCCATCTCCAATCTTACTCACCTCAACAAATCTACGGCAGCTCTTCCTGACTTTTT
CCTCGCGGCTCTATCACTTTTCGCTTTCTTCTCCTCTTCTTCCCCCTCCAAATCCTTCAAATTTCCTGTTTTCTCTATTCAATTAAACCCTCGCCGTTTTCTCAAGATAC
CTTCCAATTCCATGGAATTCCCCAACTCCAACTCCAAATCCTTGACCTTCACCTCTCCTCAATCCCTCTCCGAGTGGCTTCAACCTCGCCTCCCTTCCCATTCTTTTGCT
TCTTGGGGCGTTATCCCTGGCACCAAGAACCTCCATAACCTCTGGCTCGAGATCTCCCAAGGAGAAACTTCCCTTGCCGATTCCAACCCTCCCATCCGCATCCTTCATGT
CCTTTCTCTTCGAATTATTGATAATCATCACCGACTTCTCCTTGAATCCCACCAGCAGCTCTCTGATGGCACCCTGCGGAATCGAAATCGACCCCTGTCCGAGAAAATGA
AGCCCAATGAGACTCCTGAATCTGCCGTCTACCGGGCTGTCCAAGAAGAGCTCGGTTCCATTCTTGCCGATTCCGATTGTTCTCCACTTGTCAGGATTGTTCCAGATTCC
TACAAATTGAAGATTGAGGAGCGCGACTCGGTTTCCTACCCTGGTTTGCCGGCTTGTTACGTTTTGCATTCCATGGATATTCGGGTGGAAGGTTTACCCGATGGAGACTT
CTGCACCGTAGAGAAGGAGGAATACGTAAATTCTGAGGAGACAAACATTGCGGATCAGGCTGTGTCCGTCAAGAAGCATTATTGGAAATGGGTTAGTCCTGAATCTGTGG
ATTTATGA
mRNA sequence
ATGCCACCAGCCCCACCTCCACCTCCACCTCCAATTCCACCCCCACGACCGCAGCCCATCTCCAATCTTACTCACCTCAACAAATCTACGGCAGCTCTTCCTGACTTTTT
CCTCGCGGCTCTATCACTTTTCGCTTTCTTCTCCTCTTCTTCCCCCTCCAAATCCTTCAAATTTCCTGTTTTCTCTATTCAATTAAACCCTCGCCGTTTTCTCAAGATAC
CTTCCAATTCCATGGAATTCCCCAACTCCAACTCCAAATCCTTGACCTTCACCTCTCCTCAATCCCTCTCCGAGTGGCTTCAACCTCGCCTCCCTTCCCATTCTTTTGCT
TCTTGGGGCGTTATCCCTGGCACCAAGAACCTCCATAACCTCTGGCTCGAGATCTCCCAAGGAGAAACTTCCCTTGCCGATTCCAACCCTCCCATCCGCATCCTTCATGT
CCTTTCTCTTCGAATTATTGATAATCATCACCGACTTCTCCTTGAATCCCACCAGCAGCTCTCTGATGGCACCCTGCGGAATCGAAATCGACCCCTGTCCGAGAAAATGA
AGCCCAATGAGACTCCTGAATCTGCCGTCTACCGGGCTGTCCAAGAAGAGCTCGGTTCCATTCTTGCCGATTCCGATTGTTCTCCACTTGTCAGGATTGTTCCAGATTCC
TACAAATTGAAGATTGAGGAGCGCGACTCGGTTTCCTACCCTGGTTTGCCGGCTTGTTACGTTTTGCATTCCATGGATATTCGGGTGGAAGGTTTACCCGATGGAGACTT
CTGCACCGTAGAGAAGGAGGAATACGTAAATTCTGAGGAGACAAACATTGCGGATCAGGCTGTGTCCGTCAAGAAGCATTATTGGAAATGGGTTAGTCCTGAATCTGTGG
ATTTATGA
Protein sequence
MPPAPPPPPPPIPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFSSSSPSKSFKFPVFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQPRLPSHSFA
SWGVIPGTKNLHNLWLEISQGETSLADSNPPIRILHVLSLRIIDNHHRLLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSILADSDCSPLVRIVPDS
YKLKIEERDSVSYPGLPACYVLHSMDIRVEGLPDGDFCTVEKEEYVNSEETNIADQAVSVKKHYWKWVSPESVDL
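
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch here is a minimal, dependency-free translation using the standard genetic code (assumed to apply to this nuclear gene); `translate` is a hypothetical helper, and only the first 33 nucleotides of the CDS are used as input.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (third position varying fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 33 nucleotides of the CDS sequence listed on this page.
print(translate("ATGCCACCAGCCCCACCTCCACCTCCACCTCCA"))  # prints: MPPAPPPPPPP
```

Passing the full CDS (with its line breaks) to `translate` should give a sequence that can be compared directly with the protein sequence listed on the page.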