; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cmc05g0136301 (gene) of Melon (Charmono) v1.1 genome

Gene IDCmc05g0136301
OrganismCucumis melo var. cantalupensis cv. Charmono (Melon (Charmono) v1.1)
DescriptionR3H domain-containing protein 1 isoform X1
Genome locationCMiso1.1chr05:19285347..19292946
RNA-Seq ExpressionCmc05g0136301
SyntenyCmc05g0136301
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR001374 - R3H domain
IPR024771 - SUZ domain
IPR036867 - R3H domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047706.1 R3H domain-containing protein 1 isoform X1 [Cucumis melo var. makuwa]3.4e-14888.96Show/hide
Query:  VEELAFLVKDNLPSKHLILSMEETFINFLHDET-SSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDTQSHRCIP
        VEELAFLVKDNLPSKHLILSMEETFINFLHDET SSDGILELKPMDSYNRLLLHRLADIFGL +     +   +     LV          +      IP
Subjt:  VEELAFLVKDNLPSKHLILSMEETFINFLHDET-SSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDTQSHRCIP

Query:  SILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTN
        SILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTN
Subjt:  SILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTN

Query:  CHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALG
        CHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALG
Subjt:  CHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALG

Query:  KHCRKNESLQTRCGEAD
        KHCRKNESLQTRCGEAD
Subjt:  KHCRKNESLQTRCGEAD

TYK08360.1 R3H domain-containing protein 1 isoform X1 [Cucumis melo var. makuwa]1.1e-16278.75Show/hide
Query:  VEELAFLVKDNLPSKHLILSMEETFINFLHDET------------------------------------------------------SSDGILELKPMDS
        VEELAFLVKDNLPSKHLILSMEETFINFLHDET                                                      SSDGILELKPMDS
Subjt:  VEELAFLVKDNLPSKHLILSMEETFINFLHDET------------------------------------------------------SSDGILELKPMDS

Query:  YNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDTQSHR------------------------CI------PSILVSDILWEYDEPQMS
        YNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDTQSHR                        C+      PSILVSDILWEYDEPQMS
Subjt:  YNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDTQSHR------------------------CI------PSILVSDILWEYDEPQMS

Query:  TIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTNCHRKVQGVANNAYIQAR
        TIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTNCHRKVQGVANNAYIQAR
Subjt:  TIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTNCHRKVQGVANNAYIQAR

Query:  DSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALGKHCRKNESLQTRCGEAD
        DSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALGKHCRKNESLQTRCGEAD
Subjt:  DSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALGKHCRKNESLQTRCGEAD

XP_004152835.1 R3H domain-containing protein 1 isoform X2 [Cucumis sativus]1.2e-13783.38Show/hide
Query:  MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDT
        MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLH+ETSSDGILELKPMDSYNRLLLHRLADIFGL +   + +   D     L           + 
Subjt:  MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDT

Query:  QSHRCIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVN
             IPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASS KSSPQRSLEERE AYLAVRERIFMTH+GEDNEPLKPKPRCDPAVARRMIAHALGQRVN
Subjt:  QSHRCIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVN

Query:  SFPEDTNCHRKVQ-GVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAK
        S  EDTNCH+K Q GV NNAYIQARDSKLP+STVEAINKTIS+SDQC+NLKNE DKNCNP+VSLARGSTAAKMK  KS PKASH VDNEHLKREHLGAAK
Subjt:  SFPEDTNCHRKVQ-GVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAK

Query:  RMFSQALGKHCRKNESLQTRCGEAD
        RMFSQALGKHCRKNESLQTR GEAD
Subjt:  RMFSQALGKHCRKNESLQTRCGEAD

XP_008441898.1 PREDICTED: uncharacterized protein LOC103485903 isoform X1 [Cucumis melo]2.3e-15289.23Show/hide
Query:  MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDET-SSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSD
        MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDET SSDGILELKPMDSYNRLLLHRLADIFGL +     +   +     LV          +
Subjt:  MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDET-SSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSD

Query:  TQSHRCIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRV
              IPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRV
Subjt:  TQSHRCIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRV

Query:  NSFPEDTNCHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAK
        NSFPEDTNCHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAK
Subjt:  NSFPEDTNCHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAK

Query:  RMFSQALGKHCRKNESLQTRCGEAD
        RMFSQALGKHCRKNESLQTRCGEAD
Subjt:  RMFSQALGKHCRKNESLQTRCGEAD

XP_008441899.1 PREDICTED: uncharacterized protein LOC103485903 isoform X2 [Cucumis melo]9.1e-15489.51Show/hide
Query:  MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDT
        MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELKPMDSYNRLLLHRLADIFGL +     +   +     LV          + 
Subjt:  MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDT

Query:  QSHRCIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVN
             IPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVN
Subjt:  QSHRCIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVN

Query:  SFPEDTNCHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKR
        SFPEDTNCHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKR
Subjt:  SFPEDTNCHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKR

Query:  MFSQALGKHCRKNESLQTRCGEAD
        MFSQALGKHCRKNESLQTRCGEAD
Subjt:  MFSQALGKHCRKNESLQTRCGEAD

TrEMBL top hitse value%identityAlignment
A0A0A0LH15 Uncharacterized protein5.8e-13883.38Show/hide
Query:  MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDT
        MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLH+ETSSDGILELKPMDSYNRLLLHRLADIFGL +   + +   D     L           + 
Subjt:  MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDT

Query:  QSHRCIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVN
             IPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASS KSSPQRSLEERE AYLAVRERIFMTH+GEDNEPLKPKPRCDPAVARRMIAHALGQRVN
Subjt:  QSHRCIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVN

Query:  SFPEDTNCHRKVQ-GVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAK
        S  EDTNCH+K Q GV NNAYIQARDSKLP+STVEAINKTIS+SDQC+NLKNE DKNCNP+VSLARGSTAAKMK  KS PKASH VDNEHLKREHLGAAK
Subjt:  SFPEDTNCHRKVQ-GVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAK

Query:  RMFSQALGKHCRKNESLQTRCGEAD
        RMFSQALGKHCRKNESLQTR GEAD
Subjt:  RMFSQALGKHCRKNESLQTRCGEAD

A0A1S3B4G2 uncharacterized protein LOC103485903 isoform X11.1e-15289.23Show/hide
Query:  MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDET-SSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSD
        MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDET SSDGILELKPMDSYNRLLLHRLADIFGL +     +   +     LV          +
Subjt:  MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDET-SSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSD

Query:  TQSHRCIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRV
              IPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRV
Subjt:  TQSHRCIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRV

Query:  NSFPEDTNCHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAK
        NSFPEDTNCHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAK
Subjt:  NSFPEDTNCHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAK

Query:  RMFSQALGKHCRKNESLQTRCGEAD
        RMFSQALGKHCRKNESLQTRCGEAD
Subjt:  RMFSQALGKHCRKNESLQTRCGEAD

A0A1S3B4H8 uncharacterized protein LOC103485903 isoform X24.4e-15489.51Show/hide
Query:  MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDT
        MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELKPMDSYNRLLLHRLADIFGL +     +   +     LV          + 
Subjt:  MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDT

Query:  QSHRCIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVN
             IPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVN
Subjt:  QSHRCIPSILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVN

Query:  SFPEDTNCHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKR
        SFPEDTNCHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKR
Subjt:  SFPEDTNCHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKR

Query:  MFSQALGKHCRKNESLQTRCGEAD
        MFSQALGKHCRKNESLQTRCGEAD
Subjt:  MFSQALGKHCRKNESLQTRCGEAD

A0A5A7U0E9 R3H domain-containing protein 1 isoform X11.6e-14888.96Show/hide
Query:  VEELAFLVKDNLPSKHLILSMEETFINFLHDET-SSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDTQSHRCIP
        VEELAFLVKDNLPSKHLILSMEETFINFLHDET SSDGILELKPMDSYNRLLLHRLADIFGL +     +   +     LV          +      IP
Subjt:  VEELAFLVKDNLPSKHLILSMEETFINFLHDET-SSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDTQSHRCIP

Query:  SILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTN
        SILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTN
Subjt:  SILVSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTN

Query:  CHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALG
        CHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALG
Subjt:  CHRKVQGVANNAYIQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALG

Query:  KHCRKNESLQTRCGEAD
        KHCRKNESLQTRCGEAD
Subjt:  KHCRKNESLQTRCGEAD

A0A5D3CAI2 R3H domain-containing protein 1 isoform X15.2e-16378.75Show/hide
Query:  VEELAFLVKDNLPSKHLILSMEETFINFLHDET------------------------------------------------------SSDGILELKPMDS
        VEELAFLVKDNLPSKHLILSMEETFINFLHDET                                                      SSDGILELKPMDS
Subjt:  VEELAFLVKDNLPSKHLILSMEETFINFLHDET------------------------------------------------------SSDGILELKPMDS

Query:  YNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDTQSHR------------------------CI------PSILVSDILWEYDEPQMS
        YNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDTQSHR                        C+      PSILVSDILWEYDEPQMS
Subjt:  YNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDTQSHR------------------------CI------PSILVSDILWEYDEPQMS

Query:  TIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTNCHRKVQGVANNAYIQAR
        TIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTNCHRKVQGVANNAYIQAR
Subjt:  TIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTNCHRKVQGVANNAYIQAR

Query:  DSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALGKHCRKNESLQTRCGEAD
        DSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALGKHCRKNESLQTRCGEAD
Subjt:  DSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALGKHCRKNESLQTRCGEAD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTGTCGCCCAATTCGCCATGGTGGAGGAGTTGGCCTTTCTTGTTAAGGACAACCTCCCCAGCAAGCATCTTATTCTATCAATGGAAGAAACCTTCATCAACTTTCT
TCACGATGAAACCAGTTCAGATGGAATCTTGGAGTTGAAACCAATGGATTCATACAACCGTCTTCTTTTGCATCGTCTTGCGGATATTTTTGGCCTCTCTAACTTTGTAC
TTCTCCCCATTCCTCCGTCAGACTTGGCCACGTATCAGCTGGTGAAGGTGATAATAGACACTTGGTTTTGGAGCGATACCCAGAGTCATCGATGTATACCGTCCATTCTT
GTGAGTGATATTCTGTGGGAGTATGATGAACCTCAAATGTCAACGATACCACACCAACTATTAAGGAGAAAGGAAAATTCTTCTGCAAGTTCAGCAAAATCATCTCCTCA
ACGTTCGCTTGAAGAGAGAGAAACAGCTTATCTGGCTGTCCGTGAGCGAATATTCATGACGCACATTGGAGAGGACAATGAACCCTTGAAACCAAAGCCACGTTGTGATC
CTGCGGTTGCACGACGCATGATTGCACATGCACTGGGACAGCGAGTTAATTCATTTCCAGAGGATACTAATTGCCATCGCAAAGTGCAAGGTGTAGCAAACAATGCATAC
ATTCAAGCAAGAGATTCAAAGTTGCCTAATTCTACTGTGGAAGCCATTAACAAAACCATTTCACAGTCGGATCAATGTATGAACTTGAAGAATGAGTCGGATAAGAATTG
TAATCCAAACGTGTCATTGGCAAGGGGAAGTACTGCTGCCAAAATGAAACTTGACAAGAGTTCTCCGAAGGCAAGTCATGATGTCGACAATGAGCACTTAAAGAGAGAAC
ATTTAGGAGCTGCAAAGAGGATGTTTTCTCAGGCTTTAGGCAAGCACTGCCGAAAGAATGAATCTCTTCAAACTCGTTGTGGGGAAGCAGATTAA
mRNA sequenceShow/hide mRNA sequence
GGACACTTTGTCGTTTGTTAGATGCAAGACTCGTCAGACAACCATAGAGTTTAGGTTCCATTGAAAATCGAAAGCCCTGCTGAAATTCTTTCCTTCGAGTCTTCAACCTT
CATACTTCACAATGACTGTCGCCCAATTCGCCATGGTGGAGGAGTTGGCCTTTCTTGTTAAGGACAACCTCCCCAGCAAGCATCTTATTCTATCAATGGAAGAAACCTTC
ATCAACTTTCTTCACGATGAAACCAGTTCAGATGGAATCTTGGAGTTGAAACCAATGGATTCATACAACCGTCTTCTTTTGCATCGTCTTGCGGATATTTTTGGCCTCTC
TAACTTTGTACTTCTCCCCATTCCTCCGTCAGACTTGGCCACGTATCAGCTGGTGAAGGTGATAATAGACACTTGGTTTTGGAGCGATACCCAGAGTCATCGATGTATAC
CGTCCATTCTTGTGAGTGATATTCTGTGGGAGTATGATGAACCTCAAATGTCAACGATACCACACCAACTATTAAGGAGAAAGGAAAATTCTTCTGCAAGTTCAGCAAAA
TCATCTCCTCAACGTTCGCTTGAAGAGAGAGAAACAGCTTATCTGGCTGTCCGTGAGCGAATATTCATGACGCACATTGGAGAGGACAATGAACCCTTGAAACCAAAGCC
ACGTTGTGATCCTGCGGTTGCACGACGCATGATTGCACATGCACTGGGACAGCGAGTTAATTCATTTCCAGAGGATACTAATTGCCATCGCAAAGTGCAAGGTGTAGCAA
ACAATGCATACATTCAAGCAAGAGATTCAAAGTTGCCTAATTCTACTGTGGAAGCCATTAACAAAACCATTTCACAGTCGGATCAATGTATGAACTTGAAGAATGAGTCG
GATAAGAATTGTAATCCAAACGTGTCATTGGCAAGGGGAAGTACTGCTGCCAAAATGAAACTTGACAAGAGTTCTCCGAAGGCAAGTCATGATGTCGACAATGAGCACTT
AAAGAGAGAACATTTAGGAGCTGCAAAGAGGATGTTTTCTCAGGCTTTAGGCAAGCACTGCCGAAAGAATGAATCTCTTCAAACTCGTTGTGGGGAAGCAGATTAAAATG
GAAACAAAGATTCAAATACATCTGCAGTAGCTATTGTTTTCTGGCATGATCTCGTCCTACTCACAACTTTGAGTTGTCAGGATCCCTGGTGCTGCTAACATAAAGATGAA
TGGGACTATGTGATGACAAATTTGTCTATGTCGCGTGGCCAATGGTCGTCGACTGTGTCAATGGCGTGATGATCAGGGTTGGATGCAACGCCTATGGCTGCTATCATGTT
GCCAATTCAGTTGGCAAATATGAAATCTGACCTTGTGTTCTCTAGCTTATATACTGGGAATTCAATCTGGTCGTCTAAGCAGCACTGTATGAGGAGTTTGAACTATATGT
TCGGAGCGGAGTTGTGAGGAAAGGTTGTGGAAATCCTGGGATCGTATATGTTCACTGATTGAGAGCACATAGTGGGTTGTGTCAGGTTAACGTAGAGGTTTCTAACTTGA
ATTTGTGTTGGTACTTGATTGTTTATAGCCTCTCTCTCATCTTTGCTTATGGCATCAAATAGCATCTTTTGGTTTATGTACTTGACTCGATGATATCGAGCAATGCACTT
TGTTCTATGTGAGATTGTAGTGGTTGAATTTTAAGCTTTATTGTTCCTTTGAAATCTTGAGGTATTGGCTAATACTTTCTTTTTGAGTGAAAATCAAAATGTACATGTAT
TTCAAGTATTTTTGTGTCTAGTGGAACTGCAGGGCAAGTTTGAGGTTCACTTTG
Protein sequenceShow/hide protein sequence
MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHDETSSDGILELKPMDSYNRLLLHRLADIFGLSNFVLLPIPPSDLATYQLVKVIIDTWFWSDTQSHRCIPSIL
VSDILWEYDEPQMSTIPHQLLRRKENSSASSAKSSPQRSLEERETAYLAVRERIFMTHIGEDNEPLKPKPRCDPAVARRMIAHALGQRVNSFPEDTNCHRKVQGVANNAY
IQARDSKLPNSTVEAINKTISQSDQCMNLKNESDKNCNPNVSLARGSTAAKMKLDKSSPKASHDVDNEHLKREHLGAAKRMFSQALGKHCRKNESLQTRCGEAD