; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0028830 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0028830
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionNudix hydrolase domain-containing protein
Genome locationchr09:71544..74151
RNA-Seq ExpressionPI0028830
SyntenyPI0028830
Gene Ontology termsNA
InterPro domainsIPR015797 - NUDIX hydrolase-like domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008437661.1 PREDICTED: uncharacterized protein LOC103483001 [Cucumis melo]3.3e-14794.08Show/hide
Query:  MPPA--PPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWL
        MPPA  PPPP +PPP+PQPISNLTHLNKSTAALPDFFLAALSLFAFF SSS SKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWL
Subjt:  MPPA--PPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWL

Query:  QPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEE
        QPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIR LHVLSLRIIDNH R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEE
Subjt:  QPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEE

Query:  LGSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW
        LGSIL DSDCS LVRIVPDSYKLKIEER+SVSYPGLPACYVLHSMD+ VEGLP+GDFCTVE+EEYVNSEETNIADQAVSVKKHFWKW
Subjt:  LGSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW

XP_011654586.1 uncharacterized protein LOC101208896 isoform X1 [Cucumis sativus]2.4e-14593.4Show/hide
Query:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFF-FSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQ
        MPPAPPPP VPPP+PQPISNLTHLNKS AALPDFFLAALSLFAF   SSSSSKSFKFPAFSIQLNPRRF KIPS SMEFPNSNSKSLTFTSPQSLSEWLQ
Subjt:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFF-FSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQ

Query:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
        PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNH R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
Subjt:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL

Query:  GSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKWCK
        GSILGDSD SQLVRIVPDSY+LKIEER+SVSYPGL A YVLHSMDVWVEGLP+GDFCTVEEEEYVNSE+TNIAD AVSVKKHFWKW K
Subjt:  GSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKWCK

XP_011654587.1 uncharacterized protein LOC101208896 isoform X3 [Cucumis sativus]9.1e-14593.71Show/hide
Query:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFF-FSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQ
        MPPAPPPP VPPP+PQPISNLTHLNKS AALPDFFLAALSLFAF   SSSSSKSFKFPAFSIQLNPRRF KIPS SMEFPNSNSKSLTFTSPQSLSEWLQ
Subjt:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFF-FSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQ

Query:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
        PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNH R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
Subjt:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL

Query:  GSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW
        GSILGDSD SQLVRIVPDSY+LKIEER+SVSYPGL A YVLHSMDVWVEGLP+GDFCTVEEEEYVNSE+TNIAD AVSVKKHFWKW
Subjt:  GSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW

XP_031741555.1 uncharacterized protein LOC101208896 isoform X2 [Cucumis sativus]9.1e-14593.71Show/hide
Query:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFF-FSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQ
        MPPAPPPP VPPP+PQPISNLTHLNKS AALPDFFLAALSLFAF   SSSSSKSFKFPAFSIQLNPRRF KIPS SMEFPNSNSKSLTFTSPQSLSEWLQ
Subjt:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFF-FSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQ

Query:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
        PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNH R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
Subjt:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL

Query:  GSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW
        GSILGDSD SQLVRIVPDSY+LKIEER+SVSYPGL A YVLHSMDVWVEGLP+GDFCTVEEEEYVNSE+TNIAD AVSVKKHFWKW
Subjt:  GSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW

XP_038874474.1 uncharacterized protein LOC120067121 isoform X3 [Benincasa hispida]4.5e-13688.66Show/hide
Query:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSM---EFPNSNSKSLTFTSPQSLSEW
        MP AP P  VPP  PQP SNL HLNKST ALPDFFLAALSLF FF SSSSSKSFKFPAFSIQLNPRRFLKIPSNSM   +FPNS S SLTFTSPQSLSEW
Subjt:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSM---EFPNSNSKSLTFTSPQSLSEW

Query:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE
        L+PRLPS SFASWGV PGTKN+HNLWLEISQGETSLADSNPPIRTLHVLSLRI+DNH RVL+ESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAV+E
Subjt:  LQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQE

Query:  ELGSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKWCKK
        ELGSI+GDSDCSQ+VRIVPDSYK+KIEERNSVSYPGLPACYVLHSMDVWVEGLPEG+FCTVEEEEY NSEET+IADQAVSVKKHFWKW  K
Subjt:  ELGSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKWCKK

TrEMBL top hitse value%identityAlignment
A0A0A0KJQ6 Uncharacterized protein4.4e-14593.71Show/hide
Query:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFF-FSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQ
        MPPAPPPP VPPP+PQPISNLTHLNKS AALPDFFLAALSLFAF   SSSSSKSFKFPAFSIQLNPRRF KIPS SMEFPNSNSKSLTFTSPQSLSEWLQ
Subjt:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFF-FSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQ

Query:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
        PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNH R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
Subjt:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL

Query:  GSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW
        GSILGDSD SQLVRIVPDSY+LKIEER+SVSYPGL A YVLHSMDVWVEGLP+GDFCTVEEEEYVNSE+TNIAD AVSVKKHFWKW
Subjt:  GSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW

A0A1S3AUP2 uncharacterized protein LOC1034830011.6e-14794.08Show/hide
Query:  MPPA--PPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWL
        MPPA  PPPP +PPP+PQPISNLTHLNKSTAALPDFFLAALSLFAFF SSS SKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWL
Subjt:  MPPA--PPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWL

Query:  QPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEE
        QPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIR LHVLSLRIIDNH R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEE
Subjt:  QPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEE

Query:  LGSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW
        LGSIL DSDCS LVRIVPDSYKLKIEER+SVSYPGLPACYVLHSMD+ VEGLP+GDFCTVE+EEYVNSEETNIADQAVSVKKHFWKW
Subjt:  LGSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW

A0A6J1CCN3 uncharacterized protein LOC1110095083.1e-11477.7Show/hide
Query:  PAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNP-RRFLKIPSNSMEFPNSNSK--SLTFTSPQSLSEWLQ
        P+PPPP +PP  P PISNLTHLNKST  LPDF+LAALSLF FF  SSSSKSFKFP    Q NP RRFLKIPS S+  P+  ++  +  F SPQSLS+WL 
Subjt:  PAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNP-RRFLKIPSNSMEFPNSNSK--SLTFTSPQSLSEWLQ

Query:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL
        PRLPS SFASWGV PGTKN+HNLWLEIS+GETSLADSNPPIRT+ V+SLRI+D H RVL+ESHQ+LSDGTLRNRNRPLSEKMKPNETPESAVYRAV+EEL
Subjt:  PRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEEL

Query:  GSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQ-AVSVKKHFWKW
        GSI+GD DC ++VRIVP+SYK+KIEERNSVSYPGLPACYVLHSMDVWVEGLP+ +FCTVEEEEY  SEET IA + AVSVKKHFWKW
Subjt:  GSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQ-AVSVKKHFWKW

A0A6J1E2U8 uncharacterized protein LOC1114303191.1e-11979.73Show/hide
Query:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSN------SKSLTFTSPQSL
        MP APPP  VPP  PQPIS+L HL +S   LPDFFLAALSLF  F SSSSS+SFKFP   IQ NPRRFLK PS S   PNS        +   FTSPQSL
Subjt:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSN------SKSLTFTSPQSL

Query:  SEWLQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRA
        S+WL+PRLPS SFASWGV PGTKN+HNLWLE+S+GETSLADSNPPIRT+ VLSLRIIDNH+R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRA
Subjt:  SEWLQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRA

Query:  VQEELGSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW
        V+EELGSILGDSDCS++V+IVPDSYK+KIEERNS SYPGLPACYVLHSMDV VEGLP+ DFCTVEEEEYVNSEET+IAD+AVSVKKHFWKW
Subjt:  VQEELGSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW

A0A6J1IA21 uncharacterized protein LOC1114729798.7e-11778.01Show/hide
Query:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSN------SKSLTFTSPQSL
        MP APPP  VPP  PQPIS+L HL +S   LPDFFLAALSLF  F SSSSS+SFK P   IQ NPRRFLK PS S   PNS        +   F SPQSL
Subjt:  MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSN------SKSLTFTSPQSL

Query:  SEWLQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRA
        S+WL+PRLPS SFASWGV PGTKN+HNLWLE+S+GETSLADS PPIRT+ VLSLRIIDNH+R+LLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRA
Subjt:  SEWLQPRLPSHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRA

Query:  VQEELGSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW
        V+EELGSILGDSDCS++V+IVPDSYK+KIEERNS SYPGLPACYVLHSMDV VEGLP+ DFCTVEEEEY NSEE++IAD+AVSVKKHFWKW
Subjt:  VQEELGSILGDSDCSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G24460.1 unknown protein1.8e-7453.79Show/hide
Query:  PPPPSVPPPRPQPISNLTHLNKS-TAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQPRLP
        P PP  P    + I+N    N + T+ALPD FLAA+SL   + S     S     FS  LNPRR   I + S   P     +  F +PQSLS+WL+ RLP
Subjt:  PPPPSVPPPRPQPISNLTHLNKS-TAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQPRLP

Query:  SHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSIL
        S SFA+WGV PGTKN+HNLWLE+S GETSLADS PP+RT++V+++R+I  + R+L+E+HQ+LSDG++R R RPLSEKMKP E+P+ AV+RA++EELGSI 
Subjt:  SHSFASWGVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSIL

Query:  -GDSD-CSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEY------VNSEETNIADQAVSVKKHFWKW
         GD D   Q ++I+P +Y  ++EERNS+SYPGLPA Y LHS++  VEGLPE DFCT EE+EY       +S ET  A  AV+VK+H+WKW
Subjt:  -GDSD-CSQLVRIVPDSYKLKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEY------VNSEETNIADQAVSVKKHFWKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCACCAGCCCCACCTCCACCATCAGTTCCACCCCCACGACCGCAGCCCATCTCCAATCTTACTCACCTCAACAAATCTACGGCAGCTCTTCCTGACTTTTTCCTCGC
GGCTCTATCACTTTTCGCTTTCTTCTTTTCTTCTTCCTCCTCCAAATCCTTCAAATTTCCTGCTTTCTCTATTCAATTAAACCCCCGCCGTTTTCTCAAGATACCTTCCA
ATTCCATGGAATTCCCCAACTCCAACTCCAAATCCTTGACCTTCACCTCTCCTCAATCCCTCTCCGAATGGCTTCAACCTCGCCTCCCTTCCCATTCTTTTGCCTCTTGG
GGCGTTATCCCTGGCACCAAGAACCTCCATAACCTCTGGCTCGAGATCTCCCAAGGAGAAACTTCCCTTGCCGATTCCAACCCTCCCATCCGCACCCTTCATGTCCTTTC
TCTTCGAATTATTGATAATCATCAGCGAGTTCTCCTTGAATCCCACCAGCAGCTCTCTGATGGCACCCTACGGAATCGAAATCGACCCTTGTCCGAGAAAATGAAGCCCA
ATGAGACTCCTGAATCTGCCGTCTACCGGGCTGTCCAAGAAGAGCTCGGTTCCATTCTTGGTGATTCCGATTGTTCTCAACTTGTCAGGATTGTTCCAGACTCCTATAAA
TTGAAGATTGAGGAGCGCAACTCGGTTTCCTACCCTGGTTTGCCGGCTTGTTACGTTTTGCATTCCATGGATGTTTGGGTGGAAGGTTTACCCGAGGGAGACTTCTGCAC
TGTGGAGGAGGAGGAATACGTAAATTCTGAGGAGACAAACATTGCGGACCAGGCGGTGTCCGTCAAGAAGCATTTTTGGAAATGGTGCAAAAAAGCGGGATATTAA
mRNA sequenceShow/hide mRNA sequence
ATGCCACCAGCCCCACCTCCACCATCAGTTCCACCCCCACGACCGCAGCCCATCTCCAATCTTACTCACCTCAACAAATCTACGGCAGCTCTTCCTGACTTTTTCCTCGC
GGCTCTATCACTTTTCGCTTTCTTCTTTTCTTCTTCCTCCTCCAAATCCTTCAAATTTCCTGCTTTCTCTATTCAATTAAACCCCCGCCGTTTTCTCAAGATACCTTCCA
ATTCCATGGAATTCCCCAACTCCAACTCCAAATCCTTGACCTTCACCTCTCCTCAATCCCTCTCCGAATGGCTTCAACCTCGCCTCCCTTCCCATTCTTTTGCCTCTTGG
GGCGTTATCCCTGGCACCAAGAACCTCCATAACCTCTGGCTCGAGATCTCCCAAGGAGAAACTTCCCTTGCCGATTCCAACCCTCCCATCCGCACCCTTCATGTCCTTTC
TCTTCGAATTATTGATAATCATCAGCGAGTTCTCCTTGAATCCCACCAGCAGCTCTCTGATGGCACCCTACGGAATCGAAATCGACCCTTGTCCGAGAAAATGAAGCCCA
ATGAGACTCCTGAATCTGCCGTCTACCGGGCTGTCCAAGAAGAGCTCGGTTCCATTCTTGGTGATTCCGATTGTTCTCAACTTGTCAGGATTGTTCCAGACTCCTATAAA
TTGAAGATTGAGGAGCGCAACTCGGTTTCCTACCCTGGTTTGCCGGCTTGTTACGTTTTGCATTCCATGGATGTTTGGGTGGAAGGTTTACCCGAGGGAGACTTCTGCAC
TGTGGAGGAGGAGGAATACGTAAATTCTGAGGAGACAAACATTGCGGACCAGGCGGTGTCCGTCAAGAAGCATTTTTGGAAATGGTGCAAAAAAGCGGGATATTAATAAA
GTTATCAACAACAATAATGTCCTCAATAACTTCTAATCAATGCATCAGAATCCACCTCAAGTGGCGGACTTGTCCCAAATATATCACTCATGCTTGACCGCAGAAAGACC
TTCCAAAATTGCTTTGACCTCAAGCATGCTAATGCCCCAATCCTTGTCAACACTCTTGTAGGCACTGGGATAGGCCGACCAAAATGATTAAAAACAATCCACCTGAGCTT
GCTCGATTCTCCTTTGACCAGATTACATCAACATTCAACTTCCAATAATCCACCTTCGGTGGAGACCAACGATGACACTTGATGTTGGGAGGAGAAAGGGTCAACTCTGC
CTAAAACCTGTATCCTCACCCATAAGGTTCTGATAATGGACTTCGATCAAGCTCATTAAGGCGCTAAGATCAGGCACCCCTTTCCCATAGAAAAAGTCATTTTACAGCTT
CCACACGCATCAAAGGATTATTAAAGCGCGATTTAAGTCTTCGCTTATACACATAAACCTTAGGTAACCCCAATAGTTCGAGGGAGACCAAAAGTTATTGACAATACCAA
ACACATTCCAATATAAGCAGGGAAGAAATGATTCCATATTTTCTTACTCACATTCCCACATGAAATGACTAGTAGTCTCCAACTGATCACTGC
Protein sequenceShow/hide protein sequence
MPPAPPPPSVPPPRPQPISNLTHLNKSTAALPDFFLAALSLFAFFFSSSSSKSFKFPAFSIQLNPRRFLKIPSNSMEFPNSNSKSLTFTSPQSLSEWLQPRLPSHSFASW
GVIPGTKNLHNLWLEISQGETSLADSNPPIRTLHVLSLRIIDNHQRVLLESHQQLSDGTLRNRNRPLSEKMKPNETPESAVYRAVQEELGSILGDSDCSQLVRIVPDSYK
LKIEERNSVSYPGLPACYVLHSMDVWVEGLPEGDFCTVEEEEYVNSEETNIADQAVSVKKHFWKWCKKAGY