; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI06G23140 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI06G23140
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationChr6:20888753..20890183
RNA-Seq ExpressionCSPI06G23140
SyntenyCSPI06G23140
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140154.1 uncharacterized protein LOC101216953 [Cucumis sativus]1.4e-133100Show/hide
Query:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNVKRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_008449565.1 PREDICTED: uncharacterized protein LOC103491410 [Cucumis melo]1.0e-13198.85Show/hide
Query:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG GGGSSVQASPPPSSP PATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNV+RIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_022138704.1 uncharacterized protein LOC111009802 [Momordica charantia]3.1e-12091.19Show/hide
Query:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG     +   SPPP  P P TEPNSSPRISFSSEFLDESNFISITPNSQIERDQE+CERQ+K+RSEKLAWSADFEFLSNKVSSHSM TADELFFEG
Subjt:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERL KISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASS LSPSSSSSSSSSSSRSMAD ATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGT GNKEKN+KRIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_022930387.1 uncharacterized protein LOC111436849 [Cucurbita moschata]6.3e-11390.04Show/hide
Query:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSI      SVQ SPP SS     EPNSSPRISFSSEFLDESNFISITP+SQIERDQEICERQKK+RSE+LA SADFEFLSN+VSSHSM+TADELFFEG
Subjt:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
           EGTTGNKEKN+KRIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_038876013.1 uncharacterized protein LOC120068347 [Benincasa hispida]1.1e-12295.82Show/hide
Query:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQK--KDRSEKLAWSADFEFLSNKVSSHSMITADELFF
        MVSIGSGG  SVQAS PPSSP P TEPNSSPRISFSSEFLDESNFISITPNSQIERDQEIC+RQK  KDRSEKLAWSADFEFLSNKVSSHSMITADELFF
Subjt:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQK--KDRSEKLAWSADFEFLSNKVSSHSMITADELFF

Query:  EGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAAT
        EGKLLPFWQMQQAERLNKISLKSPKDVDEED+VEIEVNKEA+NKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAAT
Subjt:  EGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAAT

Query:  TEEGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        TEEGKEGTTGNKEKNVKRIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  TEEGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

TrEMBL top hitse value%identityAlignment
A0A1S3BMY8 uncharacterized protein LOC1034914105.0e-13298.85Show/hide
Query:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG GGGSSVQASPPPSSP PATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNV+RIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A5D3DC08 SEY15.0e-13298.85Show/hide
Query:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG GGGSSVQASPPPSSP PATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNV+RIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A6J1CAV2 uncharacterized protein LOC1110098021.5e-12091.19Show/hide
Query:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG     +   SPPP  P P TEPNSSPRISFSSEFLDESNFISITPNSQIERDQE+CERQ+K+RSEKLAWSADFEFLSNKVSSHSM TADELFFEG
Subjt:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERL KISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASS LSPSSSSSSSSSSSRSMAD ATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGT GNKEKN+KRIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A6J1EQC3 uncharacterized protein LOC1114368493.1e-11390.04Show/hide
Query:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSI      SVQ SPP SS     EPNSSPRISFSSEFLDESNFISITP+SQIERDQEICERQKK+RSE+LA SADFEFLSN+VSSHSM+TADELFFEG
Subjt:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
           EGTTGNKEKN+KRIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A6J1KCD5 uncharacterized protein LOC1114943313.1e-11390.04Show/hide
Query:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSI      SVQ SPP SS     EPNSSPRISFSSEFLDESNFISITP+SQIERDQEICERQKK+RSE+LA SADFEFLSN+VSSHSM+TADELFFEG
Subjt:  MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
           EGTTGNKEKN+KRIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G05980.1 unknown protein4.4e-4853.57Show/hide
Query:  PNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNK-VSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKSPKDV
        P   PRISFSS+  D  +FI ITP         +C+      S K+   +DFEFLS++ VS   M+TADELF EGKLLPFWQ++ +E+L  I+LK+ ++ 
Subjt:  PNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNK-VSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKSPKDV

Query:  DEEDLVEIEVNKE------AENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-------SSSSSSSSSSSRSMADAATTEEGKEGTTGNKEK
        +E +  ++EV K+       +N+V WF+D+DPSPRPPKCTVLWKELLRLKKQR  S+ SP       S S SSS+SSS S+ DAA  EE        KEK
Subjt:  DEEDLVEIEVNKE------AENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-------SSSSSSSSSSSRSMADAATTEEGKEGTTGNKEK

Query:  NVKRIKK-LERTRSASIRIRPMINVPICTQVKSSV-LPPLFP--LKKGRFDR
          KR KK LERTRSAS+RIRPMI+VPICT  KSS+ LPPLFP  LKK R +R
Subjt:  NVKRIKK-LERTRSASIRIRPMINVPICTQVKSSV-LPPLFP--LKKGRFDR

AT3G12970.1 unknown protein3.6e-0533.91Show/hide
Query:  DFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDP-SPRPPKCTVLWKELLRLKKQRASS
        DFEFL       +M++ADELF +GKL+P        + + ++    K +       ++  +  E +++  +D    SPR P+CTV W+ELL LK+   + 
Subjt:  DFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDP-SPRPPKCTVLWKELLRLKKQRASS

Query:  ALSPSSSSSSSSSSS
          + +SSSS  SSSS
Subjt:  ALSPSSSSSSSSSSS

AT5G19340.1 unknown protein1.6e-4547.46Show/hide
Query:  SSPLPATEPNSS-PRISFSSEFL---DESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERL
        ++ +   EP+++ PRISFS++      + +FI I P   +     I  R++KD+S   A   DFEFLS      +M++ADELF EGKLLPFWQ++ +E+L
Subjt:  SSPLPATEPNSS-PRISFSSEFL---DESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERL

Query:  NKISLKSPKDVDEEDLVEIEVNKEA-----------------ENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SALSPSSSSSSSSS
          ++LK   +V +E+     VN+E                   N+ +WFLDDDPSPRPPKCTVLWKELLRLKKQR +         S+LSPSSSSSS+SS
Subjt:  NKISLKSPKDVDEEDLVEIEVNKEA-----------------ENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SALSPSSSSSSSSS

Query:  SSRSMADAATTEEGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSV-LPPLFPLK--KGRFDR
        SS S+ DA   EE        +EK  KR KK LERTRS ++RIRPMI+VP+CT  KSS  LPPLFPL+  K R +R
Subjt:  SSRSMADAATTEEGKEGTTGNKEKNVKRIKK-LERTRSASIRIRPMINVPICTQVKSSV-LPPLFPLK--KGRFDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCAATAGGCAGTGGTGGTGGTAGTAGTGTTCAAGCATCGCCGCCACCATCATCACCACTACCAGCCACTGAACCAAATTCAAGTCCTAGGATATCTTTCTCTTC
TGAGTTTCTTGATGAAAGCAACTTCATTTCCATCACTCCAAATTCCCAGATAGAGAGAGATCAAGAGATTTGTGAGAGACAGAAGAAGGATAGATCAGAGAAGCTGGCAT
GGAGTGCTGATTTTGAGTTTCTTTCTAATAAAGTTAGTAGCCACTCCATGATTACAGCTGATGAGCTTTTCTTTGAAGGGAAGCTTCTTCCTTTTTGGCAAATGCAACAA
GCAGAGCGGCTTAACAAAATCAGTCTGAAATCTCCTAAAGATGTGGATGAAGAAGATTTGGTGGAGATAGAGGTAAACAAGGAAGCAGAGAACAAAGTGAATTGGTTTCT
CGACGACGACCCGTCTCCAAGACCACCAAAATGCACTGTTCTCTGGAAAGAACTATTGAGGTTGAAGAAGCAGCGTGCTTCATCTGCACTATCGCCATCTTCTTCTTCGT
CCTCGTCGTCTTCCTCTTCCAGGTCCATGGCTGATGCAGCCACAACAGAGGAAGGAAAGGAAGGGACAACAGGAAACAAAGAGAAGAACGTAAAGAGGATAAAGAAGTTG
GAAAGGACAAGATCAGCCAGTATAAGAATAAGGCCTATGATTAATGTGCCAATCTGCACACAGGTGAAGAGCAGTGTTTTGCCACCCTTGTTCCCACTTAAGAAAGGAAG
ATTTGATAGATGA
mRNA sequenceShow/hide mRNA sequence
AGCCATGCAAAAGAATAGTAATTTTATATATATTTGCTAAAGAGGACCTTTTTGTCAGGTAGCTCCCTAGAGCCCTAGGTGTCCATGTGCTGTTTTGCTTTCATATCATA
GAACTCCAACTCTCTTTTAACCTACACACCTTGCTTGTTTCGGCTTCACCAGACCACACTTGCTTTTTTATTAATCTCCCTCTACTTCCCCTTCAAACATCTACTTTTTA
TTTTCCCTTCAAAACATCTATATAAATCCCACATTTCCAAACTCAAGCTCATTTACCCAACTTTCCCACCCCTGACACAATCATTTTCTTTCCCTATTTCCCGGAATCCA
AACACCCACCACCACCACCACCAAGAGAAGAGAGAGAGAGAGAGATAGAGAGAGAGATAGAGAGAGAATGGTTTCAATAGGCAGTGGTGGTGGTAGTAGTGTTCAAGCAT
CGCCGCCACCATCATCACCACTACCAGCCACTGAACCAAATTCAAGTCCTAGGATATCTTTCTCTTCTGAGTTTCTTGATGAAAGCAACTTCATTTCCATCACTCCAAAT
TCCCAGATAGAGAGAGATCAAGAGATTTGTGAGAGACAGAAGAAGGATAGATCAGAGAAGCTGGCATGGAGTGCTGATTTTGAGTTTCTTTCTAATAAAGTTAGTAGCCA
CTCCATGATTACAGCTGATGAGCTTTTCTTTGAAGGGAAGCTTCTTCCTTTTTGGCAAATGCAACAAGCAGAGCGGCTTAACAAAATCAGTCTGAAATCTCCTAAAGATG
TGGATGAAGAAGATTTGGTGGAGATAGAGGTAAACAAGGAAGCAGAGAACAAAGTGAATTGGTTTCTCGACGACGACCCGTCTCCAAGACCACCAAAATGCACTGTTCTC
TGGAAAGAACTATTGAGGTTGAAGAAGCAGCGTGCTTCATCTGCACTATCGCCATCTTCTTCTTCGTCCTCGTCGTCTTCCTCTTCCAGGTCCATGGCTGATGCAGCCAC
AACAGAGGAAGGAAAGGAAGGGACAACAGGAAACAAAGAGAAGAACGTAAAGAGGATAAAGAAGTTGGAAAGGACAAGATCAGCCAGTATAAGAATAAGGCCTATGATTA
ATGTGCCAATCTGCACACAGGTGAAGAGCAGTGTTTTGCCACCCTTGTTCCCACTTAAGAAAGGAAGATTTGATAGATGAGAAGATGAAATTAGACGCTACAGTCAGCAG
CTCAAGATCTCATCAACTCTAGTCCAACGTGTTAATCTCCCACTTTCCCCTTTCCCCCTCTCTATTGTCATTTCTTGACAAATTTTTTAAACTTTCTGTCTGTGTATGGA
TTAATGTTGATTATGGCGGTGGATCTAATAATCTACGATTGTAAATGCTGACCTCTAGTTTTTTGTTTGTTGGTATTCCTTCATTAATCAAATTACGCTTTTTTGGTTCC
A
Protein sequenceShow/hide protein sequence
MVSIGSGGGSSVQASPPPSSPLPATEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQ
AERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEGTTGNKEKNVKRIKKL
ERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR