; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0015599 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0015599
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationchr05:5761078..5762489
RNA-Seq ExpressionPay0015599
SyntenyPay0015599
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140154.1 uncharacterized protein LOC101216953 [Cucumis sativus]2.3e-13198.85Show/hide
Query:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG GGGSSVQASPPPSSP PATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVRRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNV+RIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVRRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_008449565.1 PREDICTED: uncharacterized protein LOC103491410 [Cucumis melo]1.4e-133100Show/hide
Query:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVRRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNVRRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVRRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_022138704.1 uncharacterized protein LOC111009802 [Momordica charantia]1.8e-12091.19Show/hide
Query:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG     +   SPPP  PPP TEPNSSPRISFSSEFLDESNFISITPNSQIERDQE+CERQ+K+RSEKLAWSADFEFLSNKVSSHSM TADELFFEG
Subjt:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERL KISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASS LSPSSSSSSSSSSSRSMAD ATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGT GNKEKN++RIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_022930387.1 uncharacterized protein LOC111436849 [Cucurbita moschata]2.4e-11289.66Show/hide
Query:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSI      SVQ SPP SS     EPNSSPRISFSSEFLDESNFISITP+SQIERDQEICERQKK+RSE+LA SADFEFLSN+VSSHSM+TADELFFEG
Subjt:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
           EGTTGNKEKN++RIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_038876013.1 uncharacterized protein LOC120068347 [Benincasa hispida]2.2e-12195.06Show/hide
Query:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQK--KDRSEKLAWSADFEFLSNKVSSHSMITADELFF
        MVSIG GG  SVQAS PPSSP P TEPNSSPRISFSSEFLDESNFISITPNSQIERDQEIC+RQK  KDRSEKLAWSADFEFLSNKVSSHSMITADELFF
Subjt:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQK--KDRSEKLAWSADFEFLSNKVSSHSMITADELFF

Query:  EGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAAT
        EGKLLPFWQMQQAERLNKISLKSPKDVDEED+VEIEVNKEA+NKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAAT
Subjt:  EGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAAT

Query:  TEEGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        TEEGKEGTTGNKEKNV+RIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  TEEGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

TrEMBL top hitse value%identityAlignment
A0A1S3BMY8 uncharacterized protein LOC1034914107.0e-134100Show/hide
Query:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVRRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNVRRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVRRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A5D3DC08 SEY17.0e-134100Show/hide
Query:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVRRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNVRRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVRRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A6J1CAV2 uncharacterized protein LOC1110098028.8e-12191.19Show/hide
Query:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG     +   SPPP  PPP TEPNSSPRISFSSEFLDESNFISITPNSQIERDQE+CERQ+K+RSEKLAWSADFEFLSNKVSSHSM TADELFFEG
Subjt:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERL KISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASS LSPSSSSSSSSSSSRSMAD ATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGT GNKEKN++RIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A6J1EQC3 uncharacterized protein LOC1114368491.2e-11289.66Show/hide
Query:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSI      SVQ SPP SS     EPNSSPRISFSSEFLDESNFISITP+SQIERDQEICERQKK+RSE+LA SADFEFLSN+VSSHSM+TADELFFEG
Subjt:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
           EGTTGNKEKN++RIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A6J1KCD5 uncharacterized protein LOC1114943311.2e-11289.66Show/hide
Query:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSI      SVQ SPP SS     EPNSSPRISFSSEFLDESNFISITP+SQIERDQEICERQKK+RSE+LA SADFEFLSN+VSSHSM+TADELFFEG
Subjt:  MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
           EGTTGNKEKN++RIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G05980.1 unknown protein2.2e-4752.31Show/hide
Query:  SSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNK-VSSHSMITADELFFEGKLLPFWQMQQAERLNKI
        + PPP       PRISFSS+  D  +FI ITP         +C+      S K+   +DFEFLS++ VS   M+TADELF EGKLLPFWQ++ +E+L  I
Subjt:  SSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNK-VSSHSMITADELFFEGKLLPFWQMQQAERLNKI

Query:  SLKSPKDVDEEDLVEIEVNKE------AENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-------SSSSSSSSSSSRSMADAATTEEGKE
        +LK+ ++ +E +  ++EV K+       +N+V WF+D+DPSPRPPKCTVLWKELLRLKKQR  S+ SP       S S SSS+SSS S+ DAA  EE   
Subjt:  SLKSPKDVDEEDLVEIEVNKE------AENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-------SSSSSSSSSSSRSMADAATTEEGKE

Query:  GTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSV-LPPLFP--LKKGRFDR
             KEK  +R KK LERTRSAS+RIRPMI+VPICT  KSS+ LPPLFP  LKK R +R
Subjt:  GTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSV-LPPLFP--LKKGRFDR

AT3G12970.1 unknown protein3.5e-0533.91Show/hide
Query:  DFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDP-SPRPPKCTVLWKELLRLKKQRASS
        DFEFL       +M++ADELF +GKL+P        + + ++    K +       ++  +  E +++  +D    SPR P+CTV W+ELL LK+   + 
Subjt:  DFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDP-SPRPPKCTVLWKELLRLKKQRASS

Query:  ALSPSSSSSSSSSSS
          + +SSSS  SSSS
Subjt:  ALSPSSSSSSSSSSS

AT5G19340.1 unknown protein3.5e-4548.33Show/hide
Query:  EPNSS-PRISFSSEFL---DESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKS
        EP+++ PRISFS++      + +FI I P   +     I  R++KD+S   A   DFEFLS      +M++ADELF EGKLLPFWQ++ +E+L  ++LK 
Subjt:  EPNSS-PRISFSSEFL---DESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKS

Query:  PKDVDEEDLVEIEVNKEA-----------------ENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SALSPSSSSSSSSSSSRSMAD
          +V +E+     VN+E                   N+ +WFLDDDPSPRPPKCTVLWKELLRLKKQR +         S+LSPSSSSSS+SSSS S+ D
Subjt:  PKDVDEEDLVEIEVNKEA-----------------ENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SALSPSSSSSSSSSSSRSMAD

Query:  AATTEEGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSV-LPPLFPLK--KGRFDR
        A   EE        +EK  +R KK LERTRS ++RIRPMI+VP+CT  KSS  LPPLFPL+  K R +R
Subjt:  AATTEEGKEGTTGNKEKNVRRIKK-LERTRSASIRIRPMINVPICTQVKSSV-LPPLFPLK--KGRFDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCAATAGGCATTGGTGGTGGTAGTAGTGTTCAAGCATCGCCGCCACCATCATCACCACCACCAGCCACTGAACCAAATTCCAGTCCTAGGATATCTTTCTCTTC
TGAGTTTCTTGATGAAAGCAACTTCATTTCCATCACTCCAAATTCCCAGATAGAGAGAGATCAAGAGATTTGTGAGAGACAGAAGAAGGATAGATCAGAGAAGCTGGCAT
GGAGTGCTGATTTTGAGTTTCTTTCTAATAAAGTTAGTAGTCACTCCATGATTACAGCTGATGAGCTTTTCTTTGAAGGGAAGCTTCTTCCTTTTTGGCAAATGCAACAA
GCAGAGCGGCTTAACAAAATCAGTCTGAAATCTCCTAAAGATGTAGATGAAGAAGATTTGGTGGAGATAGAGGTAAACAAGGAGGCAGAGAACAAAGTGAATTGGTTTCT
CGACGACGACCCGTCTCCGAGACCACCAAAATGCACTGTTCTCTGGAAAGAACTGTTGAGGTTGAAGAAGCAGCGTGCTTCATCTGCACTATCGCCATCTTCTTCTTCGT
CCTCGTCGTCTTCCTCTTCCAGGTCCATGGCTGATGCAGCCACAACGGAGGAAGGAAAGGAAGGGACAACAGGAAACAAAGAGAAGAACGTAAGGAGGATAAAGAAGTTG
GAAAGGACAAGATCAGCCAGTATAAGAATAAGGCCTATGATTAATGTGCCAATCTGCACACAGGTGAAGAGCAGTGTTTTGCCACCCTTGTTCCCACTTAAGAAAGGAAG
ATTTGATAGATGA
mRNA sequenceShow/hide mRNA sequence
AAAGAATAGTAATTTTATATATATTTGCTAAAGAGGACCTTTTTGTCAGGTAGCTCCCTAGAGCCCTAGGTGTCCATGTGCTGTTTTGCTTTCATATCATAGAACTCCAA
CTCTCTTTTAACCTACACACCTTGCTTGTTTCGGCTTCACCAGACCACACTTGCTTTTTTATTAATCTCCCTCTACTTCCCCTTCAAACATCTACTTTTTATTTTCCCCT
CAAAACATCTATATAAATCCCACATTTCCAAACTCAAGCTCATTTACCCAACTTTCCCACCTCTGACACAATCATTTTCTTTCCCTATTTCCCGGAATCCAAACACCCAC
CACCACCACCAAGAGAGAGAGAGAGATAGAAAGAGAGAGAGAGAGAGAATGGTTTCAATAGGCATTGGTGGTGGTAGTAGTGTTCAAGCATCGCCGCCACCATCATCACC
ACCACCAGCCACTGAACCAAATTCCAGTCCTAGGATATCTTTCTCTTCTGAGTTTCTTGATGAAAGCAACTTCATTTCCATCACTCCAAATTCCCAGATAGAGAGAGATC
AAGAGATTTGTGAGAGACAGAAGAAGGATAGATCAGAGAAGCTGGCATGGAGTGCTGATTTTGAGTTTCTTTCTAATAAAGTTAGTAGTCACTCCATGATTACAGCTGAT
GAGCTTTTCTTTGAAGGGAAGCTTCTTCCTTTTTGGCAAATGCAACAAGCAGAGCGGCTTAACAAAATCAGTCTGAAATCTCCTAAAGATGTAGATGAAGAAGATTTGGT
GGAGATAGAGGTAAACAAGGAGGCAGAGAACAAAGTGAATTGGTTTCTCGACGACGACCCGTCTCCGAGACCACCAAAATGCACTGTTCTCTGGAAAGAACTGTTGAGGT
TGAAGAAGCAGCGTGCTTCATCTGCACTATCGCCATCTTCTTCTTCGTCCTCGTCGTCTTCCTCTTCCAGGTCCATGGCTGATGCAGCCACAACGGAGGAAGGAAAGGAA
GGGACAACAGGAAACAAAGAGAAGAACGTAAGGAGGATAAAGAAGTTGGAAAGGACAAGATCAGCCAGTATAAGAATAAGGCCTATGATTAATGTGCCAATCTGCACACA
GGTGAAGAGCAGTGTTTTGCCACCCTTGTTCCCACTTAAGAAAGGAAGATTTGATAGATGAGAAGATGAAATTCGACGCTAAAGTCAGCAGCTCAAGATCTCATCAACTC
TAGTCCAACGTGTTAATCTCCCACTTCCCCCTTCCCCCCTCTCTATTGTCATTTCTTGACAAATTTTTTCAACTTTCTGTCTGTGTATGGATTAATGTTGATTATGGCGG
TGGATCTAATAATCTACGATTGTAAATGCTGACCTCTAGTTTTTTGTTTGTTGGTATTCCTTCATTAATCAAATTACGCTTTTTTGGTTCCA
Protein sequenceShow/hide protein sequence
MVSIGIGGGSSVQASPPPSSPPPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQ
AERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEGTTGNKEKNVRRIKKL
ERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR