; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020100 (gene) of Snake gourd v1 genome

Gene IDTan0020100
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat (PPR) superfamily protein
Genome locationLG06:78115196..78115966
RNA-Seq ExpressionTan0020100
SyntenyTan0020100
Gene Ontology termsGO:0032259 - methylation (biological process)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140154.1 uncharacterized protein LOC101216953 [Cucumis sativus]2.1e-12195.4Show/hide
Query:  MVSIDS---SSVQAS-PPSSP-PSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSI S   SSVQAS PPSSP P+ EPNSSPRISFSSEFLDESNFISITPNSQIERDQE+CERQKK+RSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIDS---SSVQAS-PPSSP-PSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDE+DLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNVKRIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_008449565.1 PREDICTED: uncharacterized protein LOC103491410 [Cucumis melo]2.8e-12194.25Show/hide
Query:  MVSI---DSSSVQASPP--SSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSI     SSVQASPP  S PP+ EPNSSPRISFSSEFLDESNFISITPNSQIERDQE+CERQKK+RSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSI---DSSSVQASPP--SSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDE+DLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNV+RIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_022138704.1 uncharacterized protein LOC111009802 [Momordica charantia]1.6e-12192.72Show/hide
Query:  MVSIDSSSVQASPPS-----SPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSI  SSVQA+PPS      PP  EPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQ+KERSEKLAWSADFEFLSNKVSSHSM TADELFFEG
Subjt:  MVSIDSSSVQASPPS-----SPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERL KISLKSPKDVDE+DLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASS LSPSSSSSSSSSSSRSMAD ATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGT GNKEKN+KRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_022930387.1 uncharacterized protein LOC111436849 [Cucurbita moschata]7.1e-11792.97Show/hide
Query:  MVSIDSSSVQASPPSSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPF
        MVSID  SVQ SPPSS  SIEPNSSPRISFSSEFLDESNFISITP+SQIERDQE+CERQKKERSE+LA SADFEFLSN+VSSHSM+TADELFFEGKLLPF
Subjt:  MVSIDSSSVQASPPSSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPF

Query:  WQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEG
        WQMQQAERLNKISLKSPKDVDE+DLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E   EG
Subjt:  WQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEG

Query:  TTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        TTGNKEKN+KRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  TTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_038876013.1 uncharacterized protein LOC120068347 [Benincasa hispida]1.9e-12294.59Show/hide
Query:  MVSIDS-SSVQASPPSSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQK--KERSEKLAWSADFEFLSNKVSSHSMITADELFFEGKL
        MVSI S  SVQASPPSSP   EPNSSPRISFSSEFLDESNFISITPNSQIERDQE+C+RQK  K+RSEKLAWSADFEFLSNKVSSHSMITADELFFEGKL
Subjt:  MVSIDS-SSVQASPPSSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQK--KERSEKLAWSADFEFLSNKVSSHSMITADELFFEGKL

Query:  LPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEG
        LPFWQMQQAERLNKISLKSPKDVDE+D+VEIEVNKEA+NKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEG
Subjt:  LPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEG

Query:  KEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        KEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  KEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

TrEMBL top hitse value%identityAlignment
A0A1S3BMY8 uncharacterized protein LOC1034914101.3e-12194.25Show/hide
Query:  MVSI---DSSSVQASPP--SSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSI     SSVQASPP  S PP+ EPNSSPRISFSSEFLDESNFISITPNSQIERDQE+CERQKK+RSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSI---DSSSVQASPP--SSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDE+DLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNV+RIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A5D3DC08 SEY11.3e-12194.25Show/hide
Query:  MVSI---DSSSVQASPP--SSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSI     SSVQASPP  S PP+ EPNSSPRISFSSEFLDESNFISITPNSQIERDQE+CERQKK+RSEKLAWSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSI---DSSSVQASPP--SSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDE+DLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNV+RIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A6J1CAV2 uncharacterized protein LOC1110098027.9e-12292.72Show/hide
Query:  MVSIDSSSVQASPPS-----SPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEG
        MVSI  SSVQA+PPS      PP  EPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQ+KERSEKLAWSADFEFLSNKVSSHSM TADELFFEG
Subjt:  MVSIDSSSVQASPPS-----SPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERL KISLKSPKDVDE+DLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASS LSPSSSSSSSSSSSRSMAD ATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGT GNKEKN+KRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A6J1EQC3 uncharacterized protein LOC1114368493.4e-11792.97Show/hide
Query:  MVSIDSSSVQASPPSSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPF
        MVSID  SVQ SPPSS  SIEPNSSPRISFSSEFLDESNFISITP+SQIERDQE+CERQKKERSE+LA SADFEFLSN+VSSHSM+TADELFFEGKLLPF
Subjt:  MVSIDSSSVQASPPSSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPF

Query:  WQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEG
        WQMQQAERLNKISLKSPKDVDE+DLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E   EG
Subjt:  WQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEG

Query:  TTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        TTGNKEKN+KRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  TTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A6J1KCD5 uncharacterized protein LOC1114943313.4e-11792.97Show/hide
Query:  MVSIDSSSVQASPPSSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPF
        MVSID  SVQ SPPSS  SIEPNSSPRISFSSEFLDESNFISITP+SQIERDQE+CERQKKERSE+LA SADFEFLSN+VSSHSM+TADELFFEGKLLPF
Subjt:  MVSIDSSSVQASPPSSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPF

Query:  WQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEG
        WQMQQAERLNKISLKSPKDVDE+DLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E   EG
Subjt:  WQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEG

Query:  TTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        TTGNKEKN+KRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  TTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G05980.1 unknown protein1.0e-4953.28Show/hide
Query:  SSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNK-VSSHSMITADELFFEGKLLPFWQMQQAERLNKIS
        + PP   P   PRISFSS+  D  +FI ITP         +C+    + S K+   +DFEFLS++ VS   M+TADELF EGKLLPFWQ++ +E+L  I+
Subjt:  SSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNK-VSSHSMITADELFFEGKLLPFWQMQQAERLNKIS

Query:  LKSPKDVDEDDLVEIEVNKE------AENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-------SSSSSSSSSSSRSMADAATTEEGKEG
        LK+ ++ +E +  ++EV K+       +N+V WF+D+DPSPRPPKCTVLWKELLRLKKQR  S+ SP       S S SSS+SSS S+ DAA  EE    
Subjt:  LKSPKDVDEDDLVEIEVNKE------AENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-------SSSSSSSSSSSRSMADAATTEEGKEG

Query:  TTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSV-LPPLFP--LKKGRFDR
            KEK  KR KKGLERTRSAS+RIRPMI+VPICT  KSS+ LPPLFP  LKK R +R
Subjt:  TTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSV-LPPLFP--LKKGRFDR

AT3G12970.1 unknown protein3.5e-0533.91Show/hide
Query:  DFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDP-SPRPPKCTVLWKELLRLKKQRASS
        DFEFL       +M++ADELF +GKL+P        + + ++    K +       ++  +  E +++  +D    SPR P+CTV W+ELL LK+   + 
Subjt:  DFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDP-SPRPPKCTVLWKELLRLKKQRASS

Query:  ALSPSSSSSSSSSSS
          + +SSSS  SSSS
Subjt:  ALSPSSSSSSSSSSS

AT5G19340.1 unknown protein6.3e-4747.92Show/hide
Query:  MVSIDSSSVQASPPSSPPSIEPNSSPRISFSSEFL---DESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEGKL
        MVS +++++  + PS+       + PRISFS++      + +FI I P   +     +  R++K++S   A   DFEFLS      +M++ADELF EGKL
Subjt:  MVSIDSSSVQASPPSSPPSIEPNSSPRISFSSEFL---DESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEGKL

Query:  LPFWQMQQAERLNKISLKSPKDV---DEDDLVEIEV----NKEAENKVN----------WFLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SA
        LPFWQ++ +E+L  ++LK   +V   +ED  V  E     NKE EN  N          WFLDDDPSPRPPKCTVLWKELLRLKKQR +         S+
Subjt:  LPFWQMQQAERLNKISLKSPKDV---DEDDLVEIEV----NKEAENKVN----------WFLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SA

Query:  LSPSSSSSSSSSSSRSMADAATTEEGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSV-LPPLFPLK--KGRFDR
        LSPSSSSSS+SSSS S+ DA   EE        +EK  KR KKGLERTRS ++RIRPMI+VP+CT  KSS  LPPLFPL+  K R +R
Subjt:  LSPSSSSSSSSSSSRSMADAATTEEGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSV-LPPLFPLK--KGRFDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTCAATAGACAGTAGTAGTGTTCAAGCATCGCCACCATCATCACCACCATCCATAGAGCCGAATTCCAGTCCTCGGATATCTTTCTCTTCTGAGTTTCTTGATGA
AAGCAACTTCATTTCTATCACCCCTAATTCCCAGATAGAGAGAGATCAAGAAGTTTGTGAAAGACAGAAGAAGGAGAGATCAGAGAAGCTGGCATGGAGTGCTGATTTTG
AGTTTCTTTCTAATAAAGTTAGCAGCCACTCCATGATTACGGCCGACGAGCTTTTCTTTGAAGGGAAGCTTCTTCCCTTTTGGCAAATGCAGCAAGCAGAGAGGCTTAAC
AAAATCAGCCTGAAATCTCCAAAAGATGTAGATGAAGACGACTTGGTGGAGATAGAGGTAAACAAGGAGGCAGAGAACAAAGTGAATTGGTTCCTTGACGACGACCCATC
TCCGAGGCCGCCAAAATGCACTGTTCTATGGAAAGAATTGTTGAGGTTGAAGAAGCAACGCGCTTCATCTGCGCTATCGCCATCTTCTTCTTCATCCTCGTCGTCGTCTT
CTTCCAGGTCCATGGCTGATGCAGCCACAACAGAGGAAGGCAAGGAAGGGACAACAGGAAACAAAGAGAAGAACGTAAAGAGGATAAAGAAGGGTTTGGAAAGGACAAGA
TCAGCCAGTATAAGAATAAGGCCTATGATTAATGTGCCAATCTGCACACAGGTGAAGAGCAGTGTTTTGCCACCCTTGTTCCCACTTAAGAAAGGAAGATTTGATAGATG
A
mRNA sequenceShow/hide mRNA sequence
ATGGTTTCAATAGACAGTAGTAGTGTTCAAGCATCGCCACCATCATCACCACCATCCATAGAGCCGAATTCCAGTCCTCGGATATCTTTCTCTTCTGAGTTTCTTGATGA
AAGCAACTTCATTTCTATCACCCCTAATTCCCAGATAGAGAGAGATCAAGAAGTTTGTGAAAGACAGAAGAAGGAGAGATCAGAGAAGCTGGCATGGAGTGCTGATTTTG
AGTTTCTTTCTAATAAAGTTAGCAGCCACTCCATGATTACGGCCGACGAGCTTTTCTTTGAAGGGAAGCTTCTTCCCTTTTGGCAAATGCAGCAAGCAGAGAGGCTTAAC
AAAATCAGCCTGAAATCTCCAAAAGATGTAGATGAAGACGACTTGGTGGAGATAGAGGTAAACAAGGAGGCAGAGAACAAAGTGAATTGGTTCCTTGACGACGACCCATC
TCCGAGGCCGCCAAAATGCACTGTTCTATGGAAAGAATTGTTGAGGTTGAAGAAGCAACGCGCTTCATCTGCGCTATCGCCATCTTCTTCTTCATCCTCGTCGTCGTCTT
CTTCCAGGTCCATGGCTGATGCAGCCACAACAGAGGAAGGCAAGGAAGGGACAACAGGAAACAAAGAGAAGAACGTAAAGAGGATAAAGAAGGGTTTGGAAAGGACAAGA
TCAGCCAGTATAAGAATAAGGCCTATGATTAATGTGCCAATCTGCACACAGGTGAAGAGCAGTGTTTTGCCACCCTTGTTCCCACTTAAGAAAGGAAGATTTGATAGATG
A
Protein sequenceShow/hide protein sequence
MVSIDSSSVQASPPSSPPSIEPNSSPRISFSSEFLDESNFISITPNSQIERDQEVCERQKKERSEKLAWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLN
KISLKSPKDVDEDDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEGTTGNKEKNVKRIKKGLERTR
SASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR