; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC01G014480 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC01G014480
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionSAM domain-containing protein
Genome locationCiama_Chr01:27495367..27497071
RNA-Seq ExpressionCaUC01G014480
SyntenyCaUC01G014480
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001660 - Sterile alpha motif domain
IPR013761 - Sterile alpha motif/pointed domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008462633.1 PREDICTED: uncharacterized protein LOC103500944 [Cucumis melo]4.0e-11485.77Show/hide
Query:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ
        MVK+KQ+QNSNPNHLSGSSN VIDLDMHGDDSWVVVKKQK+TILVPPIS+VTKSSTP A QSQLQPITQKVSN QT ALE+T +EAPA VLPSSS+N NQ
Subjt:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ

Query:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV
        QS  SAAA C LT K+PLKQA+ PPNPD A +SR  KVLGL NSTKSMK QPRQLH PGGFL GGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV
Subjt:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV

Query:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFER
        GIF+RKSI KFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSE F+R
Subjt:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFER

XP_011657741.1 uncharacterized protein LOC105435895 [Cucumis sativus]5.4e-11184.52Show/hide
Query:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ
        MVK+KQ+QNS+PNHLSGSSNSVIDLDMHGDD WVVVKKQKVTILVPPISIVTKS+ P  EQSQLQPITQKVSN QT AL +TCLEAPA VL S+S+N NQ
Subjt:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ

Query:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV
        QS  SAAA C LT K+PLKQA+ PPNPD   +SR  KVLGL NSTKSMK QPRQLH PGGFLTGGTLLN RLRALNLER LQKAGGL RWLESLGLDQFV
Subjt:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV

Query:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE
        GIFQRKSI+KFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSE FE
Subjt:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE

XP_022925561.1 uncharacterized protein LOC111432962 [Cucurbita moschata]1.5e-10884.52Show/hide
Query:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ
        MVK+KQRQNSNP HLSGSSNS IDLD+HGDDSWVVVKKQKVTILVPP SIVTKSS+P A QSQLQPITQKVSN Q  ALE+TCLEAPA  LPS+SKNV Q
Subjt:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ

Query:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV
            S AA CN T K+PLKQA+  PNPDEAC+SRPCKV GLLNS KSMK QP  LH PGGFLTG TLLNQRLRALNLERKLQKAGGLSRWLESLGL+QFV
Subjt:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV

Query:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE
        GIFQR+SI+KFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE
Subjt:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE

XP_022977231.1 uncharacterized protein LOC111477603 [Cucurbita maxima]3.1e-10683.33Show/hide
Query:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ
        MVK+KQRQNSNP HLSGSSNS IDLD+ GDDSWVVVKKQKVTILVPP SIVTKSS+P A QSQLQPITQ VSN Q  ALE+TC E+PA VLPS+SKNV Q
Subjt:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ

Query:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV
            S AA CN T K+PLKQA+  PNPDEAC+SRPCKV GLLNS KSMK QP  LH PGGFLTG TLLNQRL+ALNLERKLQKAGGLSRWLESLGL+QFV
Subjt:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV

Query:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE
        GIFQRKSI+KFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE
Subjt:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE

XP_038880855.1 uncharacterized protein LOC120072539 [Benincasa hispida]9.2e-11990.16Show/hide
Query:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ
        MVK+KQ QN NPNHLSGSSNSVIDLDMH DDSWVVVKKQKVTILVPPISIVTKSSTP  EQ QLQPITQKVSN QTEAL +TCLE+PATVLPSSSK VNQ
Subjt:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ

Query:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMK-PQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQF
        QS  SA A CNLTTK+  KQAI P NPDEACDSRPCKVLGLLNSTKSMK  QPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGL+QF
Subjt:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMK-PQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQF

Query:  VGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFER
        VGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPY+SETFER
Subjt:  VGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFER

TrEMBL top hitse value%identityAlignment
A0A0A0KF34 SAM domain-containing protein2.6e-11184.52Show/hide
Query:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ
        MVK+KQ+QNS+PNHLSGSSNSVIDLDMHGDD WVVVKKQKVTILVPPISIVTKS+ P  EQSQLQPITQKVSN QT AL +TCLEAPA VL S+S+N NQ
Subjt:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ

Query:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV
        QS  SAAA C LT K+PLKQA+ PPNPD   +SR  KVLGL NSTKSMK QPRQLH PGGFLTGGTLLN RLRALNLER LQKAGGL RWLESLGLDQFV
Subjt:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV

Query:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE
        GIFQRKSI+KFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSE FE
Subjt:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE

A0A1S3CIZ6 uncharacterized protein LOC1035009441.9e-11485.77Show/hide
Query:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ
        MVK+KQ+QNSNPNHLSGSSN VIDLDMHGDDSWVVVKKQK+TILVPPIS+VTKSSTP A QSQLQPITQKVSN QT ALE+T +EAPA VLPSSS+N NQ
Subjt:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ

Query:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV
        QS  SAAA C LT K+PLKQA+ PPNPD A +SR  KVLGL NSTKSMK QPRQLH PGGFL GGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV
Subjt:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV

Query:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFER
        GIF+RKSI KFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSE F+R
Subjt:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFER

A0A6J1BU16 uncharacterized protein LOC1110055502.5e-9374.7Show/hide
Query:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ
        MVK+KQRQN+NP+HLSGSSNSVI LD+HGDDSWVVVKKQKVTILVPP SIV KSSTP A QSQLQPI          +LE+ CLE PA VLP SSKNV  
Subjt:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ

Query:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV
        ++  SA     LT+ +PLKQA  P +PDEAC+S P KVLG++N+TK M  QPR+LH P  FL+GGTLLN RLRALNLERKLQKAGGLSRWL SLGL QFV
Subjt:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV

Query:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFER
        GIFQRKSINKF LVNLTM KLKDMGANAVGPRRKLIHAIECVCQPY+SE FER
Subjt:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFER

A0A6J1ECJ6 uncharacterized protein LOC1114329627.1e-10984.52Show/hide
Query:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ
        MVK+KQRQNSNP HLSGSSNS IDLD+HGDDSWVVVKKQKVTILVPP SIVTKSS+P A QSQLQPITQKVSN Q  ALE+TCLEAPA  LPS+SKNV Q
Subjt:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ

Query:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV
            S AA CN T K+PLKQA+  PNPDEAC+SRPCKV GLLNS KSMK QP  LH PGGFLTG TLLNQRLRALNLERKLQKAGGLSRWLESLGL+QFV
Subjt:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV

Query:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE
        GIFQR+SI+KFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE
Subjt:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE

A0A6J1IPD6 uncharacterized protein LOC1114776031.5e-10683.33Show/hide
Query:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ
        MVK+KQRQNSNP HLSGSSNS IDLD+ GDDSWVVVKKQKVTILVPP SIVTKSS+P A QSQLQPITQ VSN Q  ALE+TC E+PA VLPS+SKNV Q
Subjt:  MVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISIVTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQ

Query:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV
            S AA CN T K+PLKQA+  PNPDEAC+SRPCKV GLLNS KSMK QP  LH PGGFLTG TLLNQRL+ALNLERKLQKAGGLSRWLESLGL+QFV
Subjt:  QSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFV

Query:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE
        GIFQRKSI+KFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE
Subjt:  GIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFE

SwissProt top hitse value%identityAlignment
Q3UES3 Poly [ADP-ribose] polymerase tankyrase-24.6e-0437.14Show/hide
Query:  ALNLERKLQKAG---GLSRWLESLGLDQFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIE
        A  L+RK + +G    +++++ +LGL+  + IF+R+ I    LV +  K+LK++G NA G R KLI  +E
Subjt:  ALNLERKLQKAG---GLSRWLESLGLDQFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIE

Arabidopsis top hitse value%identityAlignment
AT2G45700.1 sterile alpha motif (SAM) domain-containing protein5.0e-0636.67Show/hide
Query:  RWLESLGLDQFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSS
        +WL SLGL ++  +F R+ I+   L +LT + L  +G  ++GPR+K+++A+  V  P++S
Subjt:  RWLESLGLDQFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSS

AT3G11890.1 Sterile alpha motif (SAM) domain-containing protein4.7e-2048.42Show/hide
Query:  RQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLD-QFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPY
        + + +P    +   + N++LR LNLE+K++KAGGL+ W+ S+GL  +F  + + + ++KF + NLTM+KLK MGA AVGPRRKLIHAI CV  P+
Subjt:  RQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLD-QFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPY

AT3G11890.2 Sterile alpha motif (SAM) domain-containing protein4.7e-2048.42Show/hide
Query:  RQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLD-QFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPY
        + + +P    +   + N++LR LNLE+K++KAGGL+ W+ S+GL  +F  + + + ++KF + NLTM+KLK MGA AVGPRRKLIHAI CV  P+
Subjt:  RQLHYPGGFLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLD-QFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPY

AT3G48800.1 Sterile alpha motif (SAM) domain-containing protein1.7e-0640.32Show/hide
Query:  GLSRWLESLGLDQFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYS
        G+  WL+ LGL ++  +F+   +++  L  LT++ LKDMG NAVG RRK+  AI+ + + +S
Subjt:  GLSRWLESLGLDQFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYS

AT5G23680.1 Sterile alpha motif (SAM) domain-containing protein1.3e-0640.32Show/hide
Query:  GLSRWLESLGLDQFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYS
        G+  WL+ LGL ++  +F+   +++  L  LT++ LKDMG NAVG RRK+  AI+ + + +S
Subjt:  GLSRWLESLGLDQFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGTTATTGTTAGCTAGAAAATGCAAATTAGCTTGGGAATTCTCAAATTTATGTGAGCTTGAAGATAACCACTCGTTAACGGGGACAATATTGTGTGTAGATGATAC
GAGAATTATACCATTACTTTGCATCGACAACTTCAGTTGCTTCTTGTCTATAACATTTAGGTTAAATGTGATGGTGAAATCAAAGCAAAGACAGAATAGTAATCCTAATC
ATCTTTCTGGCTCCTCAAACTCTGTTATTGATTTGGATATGCATGGAGATGATAGTTGGGTGGTGGTAAAGAAGCAGAAAGTCACAATCCTAGTGCCTCCTATTTCAATA
GTAACAAAATCCTCAACTCCCATTGCAGAACAAAGTCAGCTGCAACCAATTACCCAAAAAGTATCAAATTCCCAAACAGAAGCTCTTGAAAAGACATGTCTTGAAGCGCC
TGCTACTGTGCTGCCATCATCTTCTAAAAATGTTAATCAACAAAGTCCAATATCTGCTGCTGCTCGTTGTAATTTGACGACGAAACAGCCATTGAAGCAAGCTATCATGC
CCCCAAATCCAGATGAAGCTTGCGATTCAAGGCCTTGTAAGGTTTTAGGATTATTGAACAGCACGAAATCTATGAAGCCGCAGCCTAGACAATTACATTACCCTGGTGGC
TTTCTCACTGGAGGCACATTGTTAAATCAGAGACTCAGGGCACTTAATCTTGAGAGGAAGCTGCAAAAAGCTGGTGGTTTAAGTAGGTGGTTGGAATCACTGGGACTGGA
CCAATTTGTAGGTATTTTCCAGAGAAAAAGTATCAATAAGTTTCACCTGGTAAATCTAACCATGAAAAAGCTGAAAGATATGGGCGCAAATGCGGTGGGGCCGCGTAGAA
AACTGATACACGCAATTGAGTGTGTCTGTCAACCCTATAGTTCCGAAACATTTGAGCGG
mRNA sequenceShow/hide mRNA sequence
ATGAAGTTATTGTTAGCTAGAAAATGCAAATTAGCTTGGGAATTCTCAAATTTATGTGAGCTTGAAGATAACCACTCGTTAACGGGGACAATATTGTGTGTAGATGATAC
GAGAATTATACCATTACTTTGCATCGACAACTTCAGTTGCTTCTTGTCTATAACATTTAGGTTAAATGTGATGGTGAAATCAAAGCAAAGACAGAATAGTAATCCTAATC
ATCTTTCTGGCTCCTCAAACTCTGTTATTGATTTGGATATGCATGGAGATGATAGTTGGGTGGTGGTAAAGAAGCAGAAAGTCACAATCCTAGTGCCTCCTATTTCAATA
GTAACAAAATCCTCAACTCCCATTGCAGAACAAAGTCAGCTGCAACCAATTACCCAAAAAGTATCAAATTCCCAAACAGAAGCTCTTGAAAAGACATGTCTTGAAGCGCC
TGCTACTGTGCTGCCATCATCTTCTAAAAATGTTAATCAACAAAGTCCAATATCTGCTGCTGCTCGTTGTAATTTGACGACGAAACAGCCATTGAAGCAAGCTATCATGC
CCCCAAATCCAGATGAAGCTTGCGATTCAAGGCCTTGTAAGGTTTTAGGATTATTGAACAGCACGAAATCTATGAAGCCGCAGCCTAGACAATTACATTACCCTGGTGGC
TTTCTCACTGGAGGCACATTGTTAAATCAGAGACTCAGGGCACTTAATCTTGAGAGGAAGCTGCAAAAAGCTGGTGGTTTAAGTAGGTGGTTGGAATCACTGGGACTGGA
CCAATTTGTAGGTATTTTCCAGAGAAAAAGTATCAATAAGTTTCACCTGGTAAATCTAACCATGAAAAAGCTGAAAGATATGGGCGCAAATGCGGTGGGGCCGCGTAGAA
AACTGATACACGCAATTGAGTGTGTCTGTCAACCCTATAGTTCCGAAACATTTGAGCGG
Protein sequenceShow/hide protein sequence
MKLLLARKCKLAWEFSNLCELEDNHSLTGTILCVDDTRIIPLLCIDNFSCFLSITFRLNVMVKSKQRQNSNPNHLSGSSNSVIDLDMHGDDSWVVVKKQKVTILVPPISI
VTKSSTPIAEQSQLQPITQKVSNSQTEALEKTCLEAPATVLPSSSKNVNQQSPISAAARCNLTTKQPLKQAIMPPNPDEACDSRPCKVLGLLNSTKSMKPQPRQLHYPGG
FLTGGTLLNQRLRALNLERKLQKAGGLSRWLESLGLDQFVGIFQRKSINKFHLVNLTMKKLKDMGANAVGPRRKLIHAIECVCQPYSSETFER