; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh17G002390 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh17G002390
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationCmo_Chr17:1414826..1415572
RNA-Seq ExpressionCmoCh17G002390
SyntenyCmoCh17G002390
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574997.1 hypothetical protein SDJN03_25636, partial [Cucurbita argyrosperma subsp. sororia]5.2e-12597.61Show/hide
Query:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSIG+GCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQK+DRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
Subjt:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP---SSSSSSSSSSKSMADAPKTEEGTTRNK
        QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP   SSSSSSSSSSKSMADAPKTEEGTTRNK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP---SSSSSSSSSSKSMADAPKTEEGTTRNK

Query:  EKNIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR
        EKNIKRIKKGLERTRSGSIRI PMINVPICTQLKSSVLPPLFPLKKGRFDR
Subjt:  EKNIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR

KAG7013568.1 hypothetical protein SDJN02_23735, partial [Cucurbita argyrosperma subsp. argyrosperma]8.3e-12396.41Show/hide
Query:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSIG+GCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQK+DRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
Subjt:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP---SSSSSSSSSSKSMADAPKTEEGTTRNK
        QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP   SSSSSSSSSSKSMADAPKTEEGTTRNK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP---SSSSSSSSSSKSMADAPKTEEGTTRNK

Query:  EKNIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR
        EKNIKRIKK LERTRSGSIRI PMINV ICTQLK SVLPPLFPLKKGRFDR
Subjt:  EKNIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR

XP_022958805.1 uncharacterized protein LOC111459965 [Cucurbita moschata]2.3e-128100Show/hide
Query:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
Subjt:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSKSMADAPKTEEGTTRNKEKN
        QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSKSMADAPKTEEGTTRNKEKN
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSKSMADAPKTEEGTTRNKEKN

Query:  IKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR
        IKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR
Subjt:  IKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR

XP_023006453.1 uncharacterized protein LOC111499176 [Cucurbita maxima]3.4e-12497.59Show/hide
Query:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQK+DRSEKLAWSADFEFLSNKVSSHSMITADELF++GKLLPFW MQ
Subjt:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-SSSSSSSSSSKSMADAPKTEEGTTRNKEK
        QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP SSSSSSSSSSKSMADAPKTEEGTTRNKEK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-SSSSSSSSSSKSMADAPKTEEGTTRNKEK

Query:  NIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR
        NIKRIKKGLERTRSGSIRI PMINVPICTQLKSSVLPPLFPLKKGRFDR
Subjt:  NIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR

XP_023547494.1 uncharacterized protein LOC111806416 [Cucurbita pepo subsp. pepo]1.7e-12397.22Show/hide
Query:  MVSIGSGCS-VHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQM
        MVSIGSGCS VHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQK+DRSEKLAWSADFEFLSNKVSSHSMITADELFF+GKLLPFWQM
Subjt:  MVSIGSGCS-VHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQM

Query:  QQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP---SSSSSSSSSSKSMADAPKTEEGTTRN
        QQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP   SSSSSSSSSSKSMADAPKTEEGTTRN
Subjt:  QQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP---SSSSSSSSSSKSMADAPKTEEGTTRN

Query:  KEKNIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR
        KEKNIKRIKKGLERTRSGSIRI PMINVPICTQLKSSVLPPLFPLKKGRFDR
Subjt:  KEKNIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR

TrEMBL top hitse value%identityAlignment
A0A5D3DC08 SEY11.0e-10282.56Show/hide
Query:  VSIGSGCSVHGS-------ASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLL
        + IG G SV  S        ++EPNSSPRISFSSEFLDE++FISITPNS +ER+QEI ERQK+DRSEKLAWSADFEFLSNKVSSHSMITADELFF+GKLL
Subjt:  VSIGSGCSVHGS-------ASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLL

Query:  PFWQMQQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-SSSSSSSSSSKSMADAPKTE---
        PFWQMQQAERLNKISLKS KDVD++ LV+IEVNK+AENKVNW LDDDPSPRPPKCTVLWKELLRLKKQRASSALSP SSSSSSSSSS+SMADA  TE   
Subjt:  PFWQMQQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-SSSSSSSSSSKSMADAPKTE---

Query:  EGTTRNKEKNIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR
        EGTT NKEKN++RIKK LERTRS SIRI PMINVPICTQ+KSSVLPPLFPLKKGRFDR
Subjt:  EGTTRNKEKNIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR

A0A6J1EQC3 uncharacterized protein LOC1114368491.0e-10283.94Show/hide
Query:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSI S  +   S+S EPNSSPRISFSSEFLDE++FISITP+S +ER+QEI ERQK++RSE+LA SADFEFLSN+VSSHSM+TADELFF+GKLLPFWQMQ
Subjt:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-SSSSSSSSSSKSMADAPKTEEGTTRNKEK
        QAERLNKISLKS KDVD++ LV+IEVNK+AENKVNW LDDDPSPRPPKCTVLWKELLRLKKQR SSALSP SSSSSSSSSS+SMADA  +EEGTT NKEK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-SSSSSSSSSSKSMADAPKTEEGTTRNKEK

Query:  NIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR
        NIKRIKKGLERTRS SIRI PMINVPICTQ+KSSVLPPLFPLKKGRFDR
Subjt:  NIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR

A0A6J1H4I1 uncharacterized protein LOC1114599651.1e-128100Show/hide
Query:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
Subjt:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSKSMADAPKTEEGTTRNKEKN
        QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSKSMADAPKTEEGTTRNKEKN
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSKSMADAPKTEEGTTRNKEKN

Query:  IKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR
        IKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR
Subjt:  IKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR

A0A6J1KCD5 uncharacterized protein LOC1114943311.0e-10283.94Show/hide
Query:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSI S  +   S+S EPNSSPRISFSSEFLDE++FISITP+S +ER+QEI ERQK++RSE+LA SADFEFLSN+VSSHSM+TADELFF+GKLLPFWQMQ
Subjt:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-SSSSSSSSSSKSMADAPKTEEGTTRNKEK
        QAERLNKISLKS KDVD++ LV+IEVNK+AENKVNW LDDDPSPRPPKCTVLWKELLRLKKQR SSALSP SSSSSSSSSS+SMADA  +EEGTT NKEK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-SSSSSSSSSSKSMADAPKTEEGTTRNKEK

Query:  NIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR
        NIKRIKKGLERTRS SIRI PMINVPICTQ+KSSVLPPLFPLKKGRFDR
Subjt:  NIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR

A0A6J1L4Y8 uncharacterized protein LOC1114991761.6e-12497.59Show/hide
Query:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQK+DRSEKLAWSADFEFLSNKVSSHSMITADELF++GKLLPFW MQ
Subjt:  MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-SSSSSSSSSSKSMADAPKTEEGTTRNKEK
        QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP SSSSSSSSSSKSMADAPKTEEGTTRNKEK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-SSSSSSSSSSKSMADAPKTEEGTTRNKEK

Query:  NIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR
        NIKRIKKGLERTRSGSIRI PMINVPICTQLKSSVLPPLFPLKKGRFDR
Subjt:  NIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSVLPPLFPLKKGRFDR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G05980.1 unknown protein2.2e-4952.82Show/hide
Query:  PNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNK-VSSHSMITADELFFDGKLLPFWQMQQAERLNKISLKSSKDV
        P   PRISFSS+  D  DFI ITP              KED  +     +DFEFLS++ VS   M+TADELF +GKLLPFWQ++ +E+L  I+LK++++ 
Subjt:  PNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNK-VSSHSMITADELFFDGKLLPFWQMQQAERLNKISLKSSKDV

Query:  DKQGLVDIEVNKK------AENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSA-------LSPSSSSSSSSSSKSMADAPKTEEGTTRNKEKNIKR
        + +    +EV KK       +N+V W +D+DPSPRPPKCTVLWKELLRLKKQR  S+       +S  S SSS+SSS S+ DA K EE     KEK  KR
Subjt:  DKQGLVDIEVNKK------AENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSA-------LSPSSSSSSSSSSKSMADAPKTEEGTTRNKEKNIKR

Query:  IKKGLERTRSGSIRIGPMINVPICTQLKSSV-LPPLFP--LKKGRFDR
         KKGLERTRS S+RI PMI+VPICT  KSS+ LPPLFP  LKK R +R
Subjt:  IKKGLERTRSGSIRIGPMINVPICTQLKSSV-LPPLFP--LKKGRFDR

AT3G12970.1 unknown protein3.1e-0635Show/hide
Query:  DFEFLSNKVSSHSMITADELFFDGKLLPFWQMQQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDP-SPRPPKCTVLWKELLRLKK----Q
        DFEFL       +M++ADELF DGKL+P        + + ++    K +       ++  ++ E +++ ++D    SPR P+CTV W+ELL LK+    Q
Subjt:  DFEFLSNKVSSHSMITADELFFDGKLLPFWQMQQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDP-SPRPPKCTVLWKELLRLKK----Q

Query:  RASSALSPSSSSSSSSSSKS
        + +SA S S  SSSS + K+
Subjt:  RASSALSPSSSSSSSSSSKS

AT5G19340.1 unknown protein1.5e-4549.63Show/hide
Query:  ASSEPNSS-PRISFSSEFL---DENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQQAERLNKIS
        A +EP+++ PRISFS++      + DFI I P  +L     I  R+++D+S   A   DFEFLS      +M++ADELF +GKLLPFWQ++ +E+L  ++
Subjt:  ASSEPNSS-PRISFSSEFL---DENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQQAERLNKIS

Query:  LK----------SSKDVDKQGLVDIEVNKKAENKVN----------WLLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SALSPSSSSSS-SSS
        LK            K V+++G V    NK+ EN  N          W LDDDPSPRPPKCTVLWKELLRLKKQR +         S+LSPSSSSSS SSS
Subjt:  LK----------SSKDVDKQGLVDIEVNKKAENKVN----------WLLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SALSPSSSSSS-SSS

Query:  SKSMADAPKTEEGTTRNKEKNIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSV-LPPLFPLK--KGRFDR
        S S+ DA K EE     +EK  KR KKGLERTRS ++RI PMI+VP+CT  KSS  LPPLFPL+  K R +R
Subjt:  SKSMADAPKTEEGTTRNKEKNIKRIKKGLERTRSGSIRIGPMINVPICTQLKSSV-LPPLFPLK--KGRFDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCAATAGGCAGTGGTTGTAGTGTTCATGGATCAGCCTCATCAGAGCCAAATTCCAGTCCTCGGATATCTTTCTCTTCTGAGTTTCTTGATGAAAACGACTTCAT
TTCCATCACTCCAAATTCACATCTAGAGAGAGAACAAGAGATTTCTGAGAGACAGAAGGAGGACAGATCAGAGAAGCTAGCATGGAGTGCTGATTTTGAGTTTCTTTCTA
ATAAAGTTAGTAGCCACTCCATGATTACAGCTGATGAGCTCTTCTTTGACGGGAAGCTTCTTCCCTTTTGGCAAATGCAGCAAGCAGAGAGGCTTAACAAAATCAGTCTG
AAATCTTCAAAAGATGTAGATAAACAAGGCTTGGTGGACATAGAGGTAAACAAGAAGGCAGAGAACAAAGTGAATTGGTTACTCGACGACGACCCGTCACCGAGACCACC
AAAATGCACTGTTCTGTGGAAAGAATTGTTGAGGTTGAAGAAACAACGCGCGTCGTCTGCGCTATCACCATCTTCTTCTTCGTCTTCGTCGTCTTCTTCCAAGTCGATGG
CTGATGCACCCAAAACAGAGGAAGGGACAACAAGAAACAAAGAGAAGAACATTAAGAGGATAAAGAAGGGTTTGGAAAGGACAAGATCAGGCAGTATAAGAATAGGGCCT
ATGATTAATGTGCCAATCTGCACACAGCTGAAGAGCAGTGTTTTGCCACCCTTATTCCCACTTAAGAAAGGAAGATTTGATAGATAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTTCAATAGGCAGTGGTTGTAGTGTTCATGGATCAGCCTCATCAGAGCCAAATTCCAGTCCTCGGATATCTTTCTCTTCTGAGTTTCTTGATGAAAACGACTTCAT
TTCCATCACTCCAAATTCACATCTAGAGAGAGAACAAGAGATTTCTGAGAGACAGAAGGAGGACAGATCAGAGAAGCTAGCATGGAGTGCTGATTTTGAGTTTCTTTCTA
ATAAAGTTAGTAGCCACTCCATGATTACAGCTGATGAGCTCTTCTTTGACGGGAAGCTTCTTCCCTTTTGGCAAATGCAGCAAGCAGAGAGGCTTAACAAAATCAGTCTG
AAATCTTCAAAAGATGTAGATAAACAAGGCTTGGTGGACATAGAGGTAAACAAGAAGGCAGAGAACAAAGTGAATTGGTTACTCGACGACGACCCGTCACCGAGACCACC
AAAATGCACTGTTCTGTGGAAAGAATTGTTGAGGTTGAAGAAACAACGCGCGTCGTCTGCGCTATCACCATCTTCTTCTTCGTCTTCGTCGTCTTCTTCCAAGTCGATGG
CTGATGCACCCAAAACAGAGGAAGGGACAACAAGAAACAAAGAGAAGAACATTAAGAGGATAAAGAAGGGTTTGGAAAGGACAAGATCAGGCAGTATAAGAATAGGGCCT
ATGATTAATGTGCCAATCTGCACACAGCTGAAGAGCAGTGTTTTGCCACCCTTATTCCCACTTAAGAAAGGAAGATTTGATAGATAA
Protein sequenceShow/hide protein sequence
MVSIGSGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKEDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQQAERLNKISL
KSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSKSMADAPKTEEGTTRNKEKNIKRIKKGLERTRSGSIRIGP
MINVPICTQLKSSVLPPLFPLKKGRFDR