; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0012211 (gene) of Chayote v1 genome

Gene IDSed0012211
OrganismSechium edule (Chayote v1)
DescriptionSEY1
Genome locationLG09:15399141..15400567
RNA-Seq ExpressionSed0012211
SyntenySed0012211
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140154.1 uncharacterized protein LOC101216953 [Cucumis sativus]7.7e-10588.16Show/hide
Query:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS
        S P P+ EP SSPRISFSSEFLDESNFISITPNSQI RDQE+CERQ K+RSEKL WSADFEFLSNKVSSHSMI ADELFFEGKLLPFWQMQQAERLNKIS
Subjt:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS

Query:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR
        LKSP    EE+L +IEVNKEAEN+VNWFLDDDPSPRPPKCTVLWKELLRLKKQR+SSALSPSSSSSSSS  SSSRSMADAATTEEGKEGTTGNKEK +KR
Subjt:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR

Query:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR
        IKK LERTRSA++RIRPMINV ICTQVKSSVLPPLFPLKKGR DR
Subjt:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR

XP_008449565.1 PREDICTED: uncharacterized protein LOC103491410 [Cucumis melo]1.3e-10487.76Show/hide
Query:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS
        S P P+ EP SSPRISFSSEFLDESNFISITPNSQI RDQE+CERQ K+RSEKL WSADFEFLSNKVSSHSMI ADELFFEGKLLPFWQMQQAERLNKIS
Subjt:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS

Query:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR
        LKSP    EE+L +IEVNKEAEN+VNWFLDDDPSPRPPKCTVLWKELLRLKKQR+SSALSPSSSSSSSS  SSSRSMADAATTEEGKEGTTGNKEK ++R
Subjt:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR

Query:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR
        IKK LERTRSA++RIRPMINV ICTQVKSSVLPPLFPLKKGR DR
Subjt:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR

XP_022138704.1 uncharacterized protein LOC111009802 [Momordica charantia]4.5e-10588.11Show/hide
Query:  PPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKISL
        PP P  EP SSPRISFSSEFLDESNFISITPNSQI RDQEVCERQ KERSEKL WSADFEFLSNKVSSHSM  ADELFFEGKLLPFWQMQQAERL KISL
Subjt:  PPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKISL

Query:  KSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKRI
        KSP    EE+L +IEVNKEAEN+VNWFLDDDPSPRPPKCTVLWKELLRLKKQR+SS LSPSSSSSSSS  SSSRSMAD ATTEEGKEGT GNKEK IKRI
Subjt:  KSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKRI

Query:  KKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR
        KKGLERTRSA++RIRPMINV ICTQVKSSVLPPLFPLKKGR DR
Subjt:  KKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR

XP_022930387.1 uncharacterized protein LOC111436849 [Cucurbita moschata]4.7e-10286.94Show/hide
Query:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS
        SPPS SIEP SSPRISFSSEFLDESNFISITP+SQI RDQE+CERQ KERSE+L  SADFEFLSN+VSSHSM+ ADELFFEGKLLPFWQMQQAERLNKIS
Subjt:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS

Query:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR
        LKSP    EE+L +IEVNKEAEN+VNWFLDDDPSPRPPKCTVLWKELLRLKKQR+SSALSPSSSSSSSS  SSSRSMADAAT+E   EGTTGNKEK IKR
Subjt:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR

Query:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR
        IKKGLERTRSA++RIRPMINV ICTQVKSSVLPPLFPLKKGR DR
Subjt:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR

XP_038876013.1 uncharacterized protein LOC120068347 [Benincasa hispida]6.5e-10487.04Show/hide
Query:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQ--SKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNK
        S PSP+ EP SSPRISFSSEFLDESNFISITPNSQI RDQE+C+RQ   K+RSEKL WSADFEFLSNKVSSHSMI ADELFFEGKLLPFWQMQQAERLNK
Subjt:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQ--SKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNK

Query:  ISLKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTI
        ISLKSP    EE++ +IEVNKEA+N+VNWFLDDDPSPRPPKCTVLWKELLRLKKQR+SSALSPSSSSSSSS  SSSRSMADAATTEEGKEGTTGNKEK +
Subjt:  ISLKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTI

Query:  KRIKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR
        KRIKKGLERTRSA++RIRPMINV ICTQVKSSVLPPLFPLKKGR DR
Subjt:  KRIKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR

TrEMBL top hitse value%identityAlignment
A0A1S3BMY8 uncharacterized protein LOC1034914106.3e-10587.76Show/hide
Query:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS
        S P P+ EP SSPRISFSSEFLDESNFISITPNSQI RDQE+CERQ K+RSEKL WSADFEFLSNKVSSHSMI ADELFFEGKLLPFWQMQQAERLNKIS
Subjt:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS

Query:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR
        LKSP    EE+L +IEVNKEAEN+VNWFLDDDPSPRPPKCTVLWKELLRLKKQR+SSALSPSSSSSSSS  SSSRSMADAATTEEGKEGTTGNKEK ++R
Subjt:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR

Query:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR
        IKK LERTRSA++RIRPMINV ICTQVKSSVLPPLFPLKKGR DR
Subjt:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR

A0A5D3DC08 SEY16.3e-10587.76Show/hide
Query:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS
        S P P+ EP SSPRISFSSEFLDESNFISITPNSQI RDQE+CERQ K+RSEKL WSADFEFLSNKVSSHSMI ADELFFEGKLLPFWQMQQAERLNKIS
Subjt:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS

Query:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR
        LKSP    EE+L +IEVNKEAEN+VNWFLDDDPSPRPPKCTVLWKELLRLKKQR+SSALSPSSSSSSSS  SSSRSMADAATTEEGKEGTTGNKEK ++R
Subjt:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR

Query:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR
        IKK LERTRSA++RIRPMINV ICTQVKSSVLPPLFPLKKGR DR
Subjt:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR

A0A6J1CAV2 uncharacterized protein LOC1110098022.2e-10588.11Show/hide
Query:  PPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKISL
        PP P  EP SSPRISFSSEFLDESNFISITPNSQI RDQEVCERQ KERSEKL WSADFEFLSNKVSSHSM  ADELFFEGKLLPFWQMQQAERL KISL
Subjt:  PPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKISL

Query:  KSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKRI
        KSP    EE+L +IEVNKEAEN+VNWFLDDDPSPRPPKCTVLWKELLRLKKQR+SS LSPSSSSSSSS  SSSRSMAD ATTEEGKEGT GNKEK IKRI
Subjt:  KSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKRI

Query:  KKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR
        KKGLERTRSA++RIRPMINV ICTQVKSSVLPPLFPLKKGR DR
Subjt:  KKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR

A0A6J1EQC3 uncharacterized protein LOC1114368492.3e-10286.94Show/hide
Query:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS
        SPPS SIEP SSPRISFSSEFLDESNFISITP+SQI RDQE+CERQ KERSE+L  SADFEFLSN+VSSHSM+ ADELFFEGKLLPFWQMQQAERLNKIS
Subjt:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS

Query:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR
        LKSP    EE+L +IEVNKEAEN+VNWFLDDDPSPRPPKCTVLWKELLRLKKQR+SSALSPSSSSSSSS  SSSRSMADAAT+E   EGTTGNKEK IKR
Subjt:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR

Query:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR
        IKKGLERTRSA++RIRPMINV ICTQVKSSVLPPLFPLKKGR DR
Subjt:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR

A0A6J1KCD5 uncharacterized protein LOC1114943312.3e-10286.94Show/hide
Query:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS
        SPPS SIEP SSPRISFSSEFLDESNFISITP+SQI RDQE+CERQ KERSE+L  SADFEFLSN+VSSHSM+ ADELFFEGKLLPFWQMQQAERLNKIS
Subjt:  SPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKIS

Query:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR
        LKSP    EE+L +IEVNKEAEN+VNWFLDDDPSPRPPKCTVLWKELLRLKKQR+SSALSPSSSSSSSS  SSSRSMADAAT+E   EGTTGNKEK IKR
Subjt:  LKSP----EENLGKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKR

Query:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR
        IKKGLERTRSA++RIRPMINV ICTQVKSSVLPPLFPLKKGR DR
Subjt:  IKKGLERTRSANLRIRPMINVAICTQVKSSVLPPLFPLKKGRLDR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G05980.1 unknown protein3.4e-5054.18Show/hide
Query:  PISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNK-VSSHSMIAADELFFEGKLLPFWQMQQAERLNKISLKSPEEN
        P+  PRISFSS+  D  +FI ITP         +C+    + S K+   +DFEFLS++ VS   M+ ADELF EGKLLPFWQ++ +E+L  I+LK+ EE 
Subjt:  PISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNK-VSSHSMIAADELFFEGKLLPFWQMQQAERLNKISLKSPEEN

Query:  LG---KIEVNKE------AENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSP-----SSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKT
             K+EV K+       +N+V WF+D+DPSPRPPKCTVLWKELLRLKKQR+ S+ SP      SS S SSS SSS S+ DAA  EE        KEK 
Subjt:  LG---KIEVNKE------AENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSP-----SSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKT

Query:  IKRIKKGLERTRSANLRIRPMINVAICTQVKSSV-LPPLFP--LKKGRLDR
         KR KKGLERTRSA++RIRPMI+V ICT  KSS+ LPPLFP  LKK R++R
Subjt:  IKRIKKGLERTRSANLRIRPMINVAICTQVKSSV-LPPLFP--LKKGRLDR

AT3G12970.1 unknown protein4.8e-0436.13Show/hide
Query:  DFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKISLKSPEEN------LGKIEVNKEAENQVNWFLDDDP-SPRPPKCTVLWKELLRLKKQRS
        DFEFL       +M++ADELF +GKL+P         L    +  PEE          ++  +  E +++  +D    SPR P+CTV W+ELL LK+   
Subjt:  DFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKISLKSPEEN------LGKIEVNKEAENQVNWFLDDDP-SPRPPKCTVLWKELLRLKKQRS

Query:  SSALSPSSSSSSSSSLSSS
        +      +S+SSSS LSSS
Subjt:  SSALSPSSSSSSSSSLSSS

AT5G19340.1 unknown protein4.0e-4347.57Show/hide
Query:  SSPRISFSSEFL---DESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKISLKSPEE-
        + PRISFS++      + +FI I P   ++  +E  ++ S +       + DFEFLS      +M++ADELF EGKLLPFWQ++ +E+L  ++LK   E 
Subjt:  SSPRISFSSEFL---DESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKISLKSPEE-

Query:  ----------NLGKIEVNKEAENQVN----------WFLDDDPSPRPPKCTVLWKELLRLKKQRSS---------SALSPSSSSSSSSSLSSSRSMADAA
                  N      NKE EN  N          WFLDDDPSPRPPKCTVLWKELLRLKKQR++         S+LSPSSSSSS+S  SSS S+ DA 
Subjt:  ----------NLGKIEVNKEAENQVN----------WFLDDDPSPRPPKCTVLWKELLRLKKQRSS---------SALSPSSSSSSSSSLSSSRSMADAA

Query:  TTEEGKEGTTGNKEKTIKRIKKGLERTRSANLRIRPMINVAICTQVKSSV-LPPLFPLK--KGRLDR
          EE        +EK  KR KKGLERTRS  +RIRPMI+V +CT  KSS  LPPLFPL+  K R++R
Subjt:  TTEEGKEGTTGNKEKTIKRIKKGLERTRSANLRIRPMINVAICTQVKSSV-LPPLFPLK--KGRLDR

AT5G66800.1 unknown protein8.7e-0632.87Show/hide
Query:  SPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAER-LNKISLKSPEENLGK
        SPRISFS++F++     + T  S  L         SK+      +S +FEF    VS+++M+ ADELF +GKLLPF +  Q +R L +  L   +E  G 
Subjt:  SPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAER-LNKISLKSPEENLGK

Query:  IEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSS
         +       +   F     S    K    WK LL LK+    S
Subjt:  IEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCACCACCATCACCCTCCATAGAACCAATTTCTAGTCCTCGGATATCCTTCTCTTCTGAGTTTCTTGATGAAAGCAACTTCATTTCCATCACTCCTAATTCTCA
GATATTGAGAGATCAAGAAGTTTGTGAGAGACAAAGCAAGGAGAGATCAGAGAAGCTGAAATGGAGTGCTGATTTTGAGTTTCTTTCTAATAAAGTTAGCAGTCACTCCA
TGATTGCGGCCGACGAGCTTTTCTTTGAAGGGAAGCTTCTTCCCTTTTGGCAAATGCAGCAAGCGGAGAGGCTTAACAAAATCAGCCTCAAATCTCCTGAAGAGAACTTG
GGGAAGATAGAGGTAAACAAGGAAGCAGAGAACCAAGTGAATTGGTTCCTCGATGACGATCCGTCTCCCAGACCGCCAAAATGCACGGTTCTTTGGAAAGAATTGTTGAG
GTTGAAGAAGCAACGCTCCTCGTCTGCGCTATCACCATCTTCTTCTTCATCCTCGTCGTCATCGTTGTCTTCTTCCAGGTCCATGGCTGATGCAGCCACGACAGAGGAAG
GCAAGGAAGGGACAACGGGAAACAAAGAGAAAACCATAAAGAGGATAAAGAAGGGTTTGGAAAGGACAAGATCAGCCAATCTAAGAATAAGGCCTATGATTAATGTGGCA
ATCTGCACACAGGTGAAGAGCAGTGTTTTGCCACCCTTGTTCCCACTTAAGAAAGGAAGACTTGATAGATGA
mRNA sequenceShow/hide mRNA sequence
TGCCATGCAAAAAAACTTGTAGAAAGAGAGGCTTTTTGTCAGGTAGCTCCCTAGAGCCCTAGGTGTCCATGTGCTGTTTTCTTACATATCATAATCTCAACTCTTTAAAC
CTACACCCTTGCTTGTTTTGGCTTCACCAGACCACTTATGCTTTTTTTTCTTTTTCTTTTTTAAATTAGTCTCTCTACTTTTCCTTCATAATTTTGCTACTTGTTAAACA
ACTTCCATCTATAAATCCCCTACCCATTTGCAAACTCAGCTCATTTACCCAACTTTCCCACCTCTGACAATCATTTTCTTTCCCCATTTTCCTGGAATCCAAACACCCCA
AGAGAGAAAGAAAACAATGGTTTCACCACCATCACCCTCCATAGAACCAATTTCTAGTCCTCGGATATCCTTCTCTTCTGAGTTTCTTGATGAAAGCAACTTCATTTCCA
TCACTCCTAATTCTCAGATATTGAGAGATCAAGAAGTTTGTGAGAGACAAAGCAAGGAGAGATCAGAGAAGCTGAAATGGAGTGCTGATTTTGAGTTTCTTTCTAATAAA
GTTAGCAGTCACTCCATGATTGCGGCCGACGAGCTTTTCTTTGAAGGGAAGCTTCTTCCCTTTTGGCAAATGCAGCAAGCGGAGAGGCTTAACAAAATCAGCCTCAAATC
TCCTGAAGAGAACTTGGGGAAGATAGAGGTAAACAAGGAAGCAGAGAACCAAGTGAATTGGTTCCTCGATGACGATCCGTCTCCCAGACCGCCAAAATGCACGGTTCTTT
GGAAAGAATTGTTGAGGTTGAAGAAGCAACGCTCCTCGTCTGCGCTATCACCATCTTCTTCTTCATCCTCGTCGTCATCGTTGTCTTCTTCCAGGTCCATGGCTGATGCA
GCCACGACAGAGGAAGGCAAGGAAGGGACAACGGGAAACAAAGAGAAAACCATAAAGAGGATAAAGAAGGGTTTGGAAAGGACAAGATCAGCCAATCTAAGAATAAGGCC
TATGATTAATGTGGCAATCTGCACACAGGTGAAGAGCAGTGTTTTGCCACCCTTGTTCCCACTTAAGAAAGGAAGACTTGATAGATGAGAAGATGAATACAGTACTAAAG
TCAGCAGCGCTCAAGATCTCATCATGTCTACCCCAACGTGTTAATCTCTCTCTATTGCCTCATTTATTGACAAAATTTTCAACTTTCTTTCTTCCTGAGTTGTATGGATT
AATGTGTTGATTATGGCAGTGGATCTAATAATCTACAATTGTAAATGCTGACCTCTATTTTTTGTCTGTTGCTATTCCTTGATTAATCGCATTATGCTTCTTTGCTTCGT
CATAACTTTTGGTGTACACTTTATCTTTTGTCCTCCTAAATAGCAAAATTTGGCAGTAAACAGTTTAATGTCACATGCTGGGTGCAGCAGCCTCGGATTGGTGGGTC
Protein sequenceShow/hide protein sequence
MVSPPSPSIEPISSPRISFSSEFLDESNFISITPNSQILRDQEVCERQSKERSEKLKWSADFEFLSNKVSSHSMIAADELFFEGKLLPFWQMQQAERLNKISLKSPEENL
GKIEVNKEAENQVNWFLDDDPSPRPPKCTVLWKELLRLKKQRSSSALSPSSSSSSSSSLSSSRSMADAATTEEGKEGTTGNKEKTIKRIKKGLERTRSANLRIRPMINVA
ICTQVKSSVLPPLFPLKKGRLDR