CuGenDBv2

Lsi09G013370 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi09G013370
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Pentatricopeptide repeat (PPR) superfamily protein
Genome location: chr09:21318060..21318842
RNA-Seq Expression: Lsi09G013370
Synteny: Lsi09G013370
Gene Ontology terms:
  GO:0032259 - methylation (biological process)
  GO:0008168 - methyltransferase activity (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
XP_004140154.1 uncharacterized protein LOC101216953 [Cucumis sativus] | e value: 6.5e-126 | % identity: 96.93
Query:  MVSIGSGGRGSVQAS-TPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEG
        MVSIGSGG  SVQAS  PSSP P TEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKL WSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGSGGRGSVQAS-TPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNVKRIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_008449565.1 PREDICTED: uncharacterized protein LOC103491410 [Cucumis melo] | e value: 5.0e-126 | % identity: 96.55
Query:  MVSIGSGGRGSVQAS-TPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG GG  SVQAS  PSSPPP TEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKL WSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGSGGRGSVQAS-TPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNV+RIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_022138704.1 uncharacterized protein LOC111009802 [Momordica charantia] | e value: 1.1e-122 | % identity: 92.05
Query:  MVSIGSGGRGSVQASTPS----SPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELF
        MVSI   GR SVQA+ PS     PPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQE+CERQ+K+RSEKL WSADFEFLSNKVSSHSM TADELF
Subjt:  MVSIGSGGRGSVQASTPS----SPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELF

Query:  FEGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAA
        FEGKLLPFWQMQQAERL KISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASS LSPSSSSSSSSSSSRSMAD A
Subjt:  FEGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAA

Query:  TTEEGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        TTEEGKEGT GNKEKN+KRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  TTEEGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_022930387.1 uncharacterized protein LOC111436849 [Cucurbita moschata] | e value: 9.7e-114 | % identity: 92
Query:  SVQASTPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQA
        SVQ S PSS     EPNSSPRISFSSEFLDESNFISITP+SQIERDQEICERQKK+RSE+L  SADFEFLSN+VSSHSM+TADELFFEGKLLPFWQMQQA
Subjt:  SVQASTPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQA

Query:  ERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEGTTGNKE
        ERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E   EGTTGNKE
Subjt:  ERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEGTTGNKE

Query:  KNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        KN+KRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  KNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

XP_038876013.1 uncharacterized protein LOC120068347 [Benincasa hispida] | e value: 7.2e-125 | % identity: 95.8
Query:  MVSIGSGGRGSVQASTPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQK--KDRSEKLGWSADFEFLSNKVSSHSMITADELFFE
        MVSIGSG  GSVQAS PSS P PTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEIC+RQK  KDRSEKL WSADFEFLSNKVSSHSMITADELFFE
Subjt:  MVSIGSGGRGSVQASTPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQK--KDRSEKLGWSADFEFLSNKVSSHSMITADELFFE

Query:  GKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATT
        GKLLPFWQMQQAERLNKISLKSPKDVDEED+VEIEVNKEA+NKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATT
Subjt:  GKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATT

Query:  EEGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EEGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EEGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BMY8 uncharacterized protein LOC103491410 | e value: 2.4e-126 | % identity: 96.55
Query:  MVSIGSGGRGSVQAS-TPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG GG  SVQAS  PSSPPP TEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKL WSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGSGGRGSVQAS-TPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNV+RIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A5D3DC08 SEY1 | e value: 2.4e-126 | % identity: 96.55
Query:  MVSIGSGGRGSVQAS-TPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEG
        MVSIG GG  SVQAS  PSSPPP TEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKL WSADFEFLSNKVSSHSMITADELFFEG
Subjt:  MVSIGSGGRGSVQAS-TPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEG

Query:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
        KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE
Subjt:  KLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTE

Query:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        EGKEGTTGNKEKNV+RIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A6J1CAV2 uncharacterized protein LOC111009802 | e value: 5.5e-123 | % identity: 92.05
Query:  MVSIGSGGRGSVQASTPS----SPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELF
        MVSI   GR SVQA+ PS     PPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQE+CERQ+K+RSEKL WSADFEFLSNKVSSHSM TADELF
Subjt:  MVSIGSGGRGSVQASTPS----SPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELF

Query:  FEGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAA
        FEGKLLPFWQMQQAERL KISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASS LSPSSSSSSSSSSSRSMAD A
Subjt:  FEGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAA

Query:  TTEEGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        TTEEGKEGT GNKEKN+KRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  TTEEGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A6J1EQC3 uncharacterized protein LOC111436849 | e value: 4.7e-114 | % identity: 92
Query:  SVQASTPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQA
        SVQ S PSS     EPNSSPRISFSSEFLDESNFISITP+SQIERDQEICERQKK+RSE+L  SADFEFLSN+VSSHSM+TADELFFEGKLLPFWQMQQA
Subjt:  SVQASTPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQA

Query:  ERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEGTTGNKE
        ERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E   EGTTGNKE
Subjt:  ERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEGTTGNKE

Query:  KNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        KN+KRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  KNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

A0A6J1KCD5 uncharacterized protein LOC111494331 | e value: 4.7e-114 | % identity: 92
Query:  SVQASTPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQA
        SVQ S PSS     EPNSSPRISFSSEFLDESNFISITP+SQIERDQEICERQKK+RSE+L  SADFEFLSN+VSSHSM+TADELFFEGKLLPFWQMQQA
Subjt:  SVQASTPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQA

Query:  ERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEGTTGNKE
        ERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQR SSALSPSSSSSSSSSSSRSMADAAT+E   EGTTGNKE
Subjt:  ERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEGTTGNKE

Query:  KNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
        KN+KRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
Subjt:  KNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G05980.1 unknown protein | e value: 6.2e-50 | % identity: 53.44
Query:  TPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNK-VSSHSMITADELFFEGKLLPFWQMQQAERLN
        T S   PP  P   PRISFSS+  D  +FI ITP         +C+      S K+   +DFEFLS++ VS   M+TADELF EGKLLPFWQ++ +E+L 
Subjt:  TPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNK-VSSHSMITADELFFEGKLLPFWQMQQAERLN

Query:  KISLKSPKDVDEEDLVEIEVNKE------AENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-------SSSSSSSSSSSRSMADAATTEEG
         I+LK+ ++ +E +  ++EV K+       +N+V WF+D+DPSPRPPKCTVLWKELLRLKKQR  S+ SP       S S SSS+SSS S+ DAA  EE 
Subjt:  KISLKSPKDVDEEDLVEIEVNKE------AENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-------SSSSSSSSSSSRSMADAATTEEG

Query:  KEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSV-LPPLFP--LKKGRFDR
               KEK  KR KKGLERTRSAS+RIRPMI+VPICT  KSS+ LPPLFP  LKK R +R
Subjt:  KEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSV-LPPLFP--LKKGRFDR

AT3G12970.1 unknown protein | e value: 3.5e-05 | % identity: 33.91
Query:  DFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDP-SPRPPKCTVLWKELLRLKKQRASS
        DFEFL       +M++ADELF +GKL+P        + + ++    K +       ++  +  E +++  +D    SPR P+CTV W+ELL LK+   + 
Subjt:  DFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQAERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDP-SPRPPKCTVLWKELLRLKKQRASS

Query:  ALSPSSSSSSSSSSS
          + +SSSS  SSSS
Subjt:  ALSPSSSSSSSSSSS

AT5G19340.1 unknown protein | e value: 4.9e-47 | % identity: 47.16
Query:  SVQASTPSSPPPPTEPNSSPRISFSSEFL---DESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQM
        S + +T +   P T   + PRISFS++      + +FI I P   +     I  R++KD+S     + DFEFLS      +M++ADELF EGKLLPFWQ+
Subjt:  SVQASTPSSPPPPTEPNSSPRISFSSEFL---DESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQM

Query:  QQAERLNKISLKSPKDVDEEDLVEIEVNKEA-----------------ENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SALSPSSS
        + +E+L  ++LK   +V +E+     VN+E                   N+ +WFLDDDPSPRPPKCTVLWKELLRLKKQR +         S+LSPSSS
Subjt:  QQAERLNKISLKSPKDVDEEDLVEIEVNKEA-----------------ENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SALSPSSS

Query:  SSSSSSSSRSMADAATTEEGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSV-LPPLFPLK--KGRFDR
        SSS+SSSS S+ DA   EE        +EK  KR KKGLERTRS ++RIRPMI+VP+CT  KSS  LPPLFPL+  K R +R
Subjt:  SSSSSSSSRSMADAATTEEGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSV-LPPLFPLK--KGRFDR
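
The % identity values listed with each hit can be checked against the alignment blocks. The following is a minimal sketch, assuming the usual BLAST convention that the middle line carries a letter only where the two residues are identical (a space or `+` otherwise) and that % identity is identical columns divided by aligned columns. The function name `identity_from_midline` is hypothetical; the two strings are the last alignment block of the XP_004140154.1 hit, copied from this page.

```python
def identity_from_midline(query: str, midline: str) -> tuple[int, int]:
    """Return (identical columns, aligned columns) for one alignment block.

    Assumes the midline shows a letter only at identical positions;
    spaces and '+' (similar but not identical) are not counted.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, len(query)


# Last block of the XP_004140154.1 alignment, as shown on this page.
query = "EGKEGTTGNKEKNVKRIKKGLERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR"
midline = "EGKEGTTGNKEKNVKRIKK LERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR"

identical, length = identity_from_midline(query, midline)
print(f"{identical}/{length} identical columns in this block")
```

Summing the two counts over every block of a hit and dividing gives a figure that can be compared with the % identity column, under the assumptions stated above.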


Sequences
CDS sequence
ATGGTTTCAATAGGCAGTGGTGGTCGTGGTAGTGTTCAAGCATCAACGCCATCATCACCACCACCACCCACAGAACCAAATTCCAGTCCTAGGATATCTTTCTCTTCTGA
ATTTCTTGATGAAAGCAACTTCATTTCCATCACTCCAAATTCCCAGATAGAGAGAGATCAAGAAATTTGTGAGAGACAGAAGAAGGATAGATCAGAGAAGCTGGGATGGA
GTGCTGATTTTGAGTTTCTTTCTAATAAAGTTAGTAGTCACTCCATGATTACAGCTGATGAGCTTTTCTTTGAAGGGAAGCTTCTTCCATTTTGGCAAATGCAGCAAGCT
GAGCGGCTTAACAAAATCAGTCTGAAATCTCCAAAAGATGTAGATGAAGAAGACTTGGTGGAAATAGAGGTAAACAAGGAGGCAGAGAACAAAGTGAATTGGTTTCTCGA
CGACGACCCGTCTCCCAGACCACCAAAATGCACTGTTCTCTGGAAAGAACTGCTGAGGTTGAAGAAGCAACGCGCTTCATCTGCACTGTCGCCATCTTCTTCTTCATCCT
CGTCGTCTTCCTCTTCCAGGTCCATGGCTGATGCAGCCACAACAGAGGAAGGCAAGGAAGGGACAACAGGAAACAAAGAGAAGAACGTAAAGAGGATAAAGAAGGGTTTG
GAAAGGACAAGATCAGCCAGTATAAGAATAAGGCCTATGATTAATGTGCCAATCTGCACACAGGTGAAGAGCAGTGTTTTGCCACCCTTGTTTCCACTTAAGAAAGGAAG
ATTTGATAGATGA
mRNA sequence
ATGGTTTCAATAGGCAGTGGTGGTCGTGGTAGTGTTCAAGCATCAACGCCATCATCACCACCACCACCCACAGAACCAAATTCCAGTCCTAGGATATCTTTCTCTTCTGA
ATTTCTTGATGAAAGCAACTTCATTTCCATCACTCCAAATTCCCAGATAGAGAGAGATCAAGAAATTTGTGAGAGACAGAAGAAGGATAGATCAGAGAAGCTGGGATGGA
GTGCTGATTTTGAGTTTCTTTCTAATAAAGTTAGTAGTCACTCCATGATTACAGCTGATGAGCTTTTCTTTGAAGGGAAGCTTCTTCCATTTTGGCAAATGCAGCAAGCT
GAGCGGCTTAACAAAATCAGTCTGAAATCTCCAAAAGATGTAGATGAAGAAGACTTGGTGGAAATAGAGGTAAACAAGGAGGCAGAGAACAAAGTGAATTGGTTTCTCGA
CGACGACCCGTCTCCCAGACCACCAAAATGCACTGTTCTCTGGAAAGAACTGCTGAGGTTGAAGAAGCAACGCGCTTCATCTGCACTGTCGCCATCTTCTTCTTCATCCT
CGTCGTCTTCCTCTTCCAGGTCCATGGCTGATGCAGCCACAACAGAGGAAGGCAAGGAAGGGACAACAGGAAACAAAGAGAAGAACGTAAAGAGGATAAAGAAGGGTTTG
GAAAGGACAAGATCAGCCAGTATAAGAATAAGGCCTATGATTAATGTGCCAATCTGCACACAGGTGAAGAGCAGTGTTTTGCCACCCTTGTTTCCACTTAAGAAAGGAAG
ATTTGATAGATGA
Protein sequence
MVSIGSGGRGSVQASTPSSPPPPTEPNSSPRISFSSEFLDESNFISITPNSQIERDQEICERQKKDRSEKLGWSADFEFLSNKVSSHSMITADELFFEGKLLPFWQMQQA
ERLNKISLKSPKDVDEEDLVEIEVNKEAENKVNWFLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSRSMADAATTEEGKEGTTGNKEKNVKRIKKGL
ERTRSASIRIRPMINVPICTQVKSSVLPPLFPLKKGRFDR
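
The genome location, CDS, and protein sequence above can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the standard nuclear genetic code applies; the helper names `span_length` and `translate` are hypothetical, and only the first 30 bases of the CDS are translated to keep the example short.

```python
import re

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def span_length(location: str) -> int:
    """Length in bp of a 'chrom:start..end' span (1-based, inclusive: assumed)."""
    match = re.fullmatch(r"\w+:(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    start, end = int(match.group(1)), int(match.group(2))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate complete codons of an uppercase DNA string; '*' marks a stop."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


length = span_length("chr09:21318060..21318842")
cds_start = "ATGGTTTCAATAGGCAGTGGTGGTCGTGGT"  # first 30 bases of the CDS above

print(length, "bp;", length // 3, "codons including the stop")
print(translate(cds_start))
```

A span length that is a multiple of three and equal to the CDS length is what one would expect if the gene model has a single exon with no annotated UTRs, which would also explain why the CDS and mRNA sequences on this page are identical.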