CuGenDBv2

Carg05432 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg05432
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Pentatricopeptide repeat (PPR) superfamily protein
Genome location: Carg_Chr17:1402626..1403381
RNA-Seq Expression: Carg05432
Synteny: Carg05432
Gene Ontology terms: NA
InterPro domains: NA
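
The genome location is written as `seqid:start..end`. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this notation, but not stated on this page), the length of the gene span can be computed as below. The helper names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("Carg_Chr17:1402626..1403381")
print(seqid, span_length(start, end))  # Carg_Chr17 756
```

Under that assumption the span is 756 bp, which appears consistent with the 251-residue protein listed in this record plus one stop codon (3 x 252 = 756).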


Homology
GenBank top hits (e value, % identity, alignment)
KAG6574997.1 hypothetical protein SDJN03_25636, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.3e-126; % identity: 98.8)
Query:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
Subjt:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK
        QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK

Query:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR
        EKNIKRIKK LERTRSGSIRIRPMINV ICTQLK SVLPPLFPLKKGRFDR
Subjt:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR
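
The % identity reported for a hit can be checked against the alignment itself. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that % identity is identical columns divided by alignment length; neither is stated on this page, and the function names are hypothetical.

```python
def count_identical(query: str, midline: str) -> int:
    """Count alignment columns where the midline repeats the query residue."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    return sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)


def percent_identity(identical: int, aligned: int) -> float:
    """Identical columns as a percentage of the alignment length."""
    return round(100.0 * identical / aligned, 2)


# Last block of the alignment above (query line and midline).
query = "EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR"
midline = "EKNIKRIKK LERTRSGSIRIRPMINV ICTQLK SVLPPLFPLKKGRFDR"

last_block = count_identical(query, midline)
# The two preceding 100-column blocks show no mismatches in the midline.
identical = 100 + 100 + last_block
aligned = 100 + 100 + len(query)
print(last_block, percent_identity(identical, aligned))  # 48 98.8
```

With 248 identical columns out of 251, the result agrees with the 98.8 listed for this hit.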

KAG7013568.1 hypothetical protein SDJN02_23735, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.0e-128; % identity: 100)
Query:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
Subjt:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK
        QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK

Query:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR
        EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR
Subjt:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR

XP_022958805.1 uncharacterized protein LOC111459965 [Cucurbita moschata] (e value: 3.2e-122; % identity: 96.41)
Query:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSIG+GCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQK+DRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
Subjt:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK
        QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP   SSSSSSSSSSKSMADAPKTEEGTTRNK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK

Query:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR
        EKNIKRIKK LERTRSGSIRI PMINV ICTQLK SVLPPLFPLKKGRFDR
Subjt:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR

XP_023006453.1 uncharacterized protein LOC111499176 [Cucurbita maxima] (e value: 1.4e-122; % identity: 96.41)
Query:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSIG+GCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELF++GKLLPFW MQ
Subjt:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK
        QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP  SSSSSSSSSSSKSMADAPKTEEGTTRNK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK

Query:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR
        EKNIKRIKK LERTRSGSIRIRPMINV ICTQLK SVLPPLFPLKKGRFDR
Subjt:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR

XP_023547494.1 uncharacterized protein LOC111806416 [Cucurbita pepo subsp. pepo] (e value: 3.4e-124; % identity: 97.62)
Query:  MVSIGNGCS-VHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQM
        MVSIG+GCS VHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFF+GKLLPFWQM
Subjt:  MVSIGNGCS-VHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQM

Query:  QQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRN
        QQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRN
Subjt:  QQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRN

Query:  KEKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR
        KEKNIKRIKK LERTRSGSIRIRPMINV ICTQLK SVLPPLFPLKKGRFDR
Subjt:  KEKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BMY8 uncharacterized protein LOC103491410 (e value: 3.0e-102; % identity: 82.31)
Query:  VSIGNGCSVHGS-------ASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLL
        + IG G SV  S        ++EPNSSPRISFSSEFLDE++FISITPNS +ER+QEI ERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFF+GKLL
Subjt:  VSIGNGCSVHGS-------ASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLL

Query:  PFWQMQQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTE-
        PFWQMQQAERLNKISLKS KDVD++ LV+IEVNK+AENKVNW LDDDPSPRPPKCTVLWKELLRLKKQRASSALSP  SSSSSSSSSSS+SMADA  TE 
Subjt:  PFWQMQQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTE-

Query:  --EGTTRNKEKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR
          EGTT NKEKN++RIKK LERTRS SIRIRPMINV ICTQ+K SVLPPLFPLKKGRFDR
Subjt:  --EGTTRNKEKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR

A0A5D3DC08 SEY1 (e value: 3.0e-102; % identity: 82.31)
Query:  VSIGNGCSVHGS-------ASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLL
        + IG G SV  S        ++EPNSSPRISFSSEFLDE++FISITPNS +ER+QEI ERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFF+GKLL
Subjt:  VSIGNGCSVHGS-------ASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLL

Query:  PFWQMQQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTE-
        PFWQMQQAERLNKISLKS KDVD++ LV+IEVNK+AENKVNW LDDDPSPRPPKCTVLWKELLRLKKQRASSALSP  SSSSSSSSSSS+SMADA  TE 
Subjt:  PFWQMQQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTE-

Query:  --EGTTRNKEKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR
          EGTT NKEKN++RIKK LERTRS SIRIRPMINV ICTQ+K SVLPPLFPLKKGRFDR
Subjt:  --EGTTRNKEKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR

A0A6J1H4I1 uncharacterized protein LOC111459965 (e value: 1.6e-122; % identity: 96.41)
Query:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSIG+GCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQK+DRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
Subjt:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK
        QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP   SSSSSSSSSSKSMADAPKTEEGTTRNK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK

Query:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR
        EKNIKRIKK LERTRSGSIRI PMINV ICTQLK SVLPPLFPLKKGRFDR
Subjt:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR

A0A6J1KCD5 uncharacterized protein LOC111494331 (e value: 4.4e-101; % identity: 82.87)
Query:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSI +  +   S+S EPNSSPRISFSSEFLDE++FISITP+S +ER+QEI ERQKK+RSE+LA SADFEFLSN+VSSHSM+TADELFF+GKLLPFWQMQ
Subjt:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK
        QAERLNKISLKS KDVD++ LV+IEVNK+AENKVNW LDDDPSPRPPKCTVLWKELLRLKKQR SSALSP  SSSSSSSSSSS+SMADA  +EEGTT NK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK

Query:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR
        EKNIKRIKK LERTRS SIRIRPMINV ICTQ+K SVLPPLFPLKKGRFDR
Subjt:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR

A0A6J1L4Y8 uncharacterized protein LOC111499176 (e value: 7.0e-123; % identity: 96.41)
Query:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ
        MVSIG+GCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELF++GKLLPFW MQ
Subjt:  MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQ

Query:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK
        QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP  SSSSSSSSSSSKSMADAPKTEEGTTRNK
Subjt:  QAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNK

Query:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR
        EKNIKRIKK LERTRSGSIRIRPMINV ICTQLK SVLPPLFPLKKGRFDR
Subjt:  EKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSVLPPLFPLKKGRFDR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G05980.1 unknown protein (e value: 1.4e-46; % identity: 52.61)
Query:  PNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNK-VSSHSMITADELFFDGKLLPFWQMQQAERLNKISLKSSKDV
        P   PRISFSS+  D  DFI ITP              K+D  +     +DFEFLS++ VS   M+TADELF +GKLLPFWQ++ +E+L  I+LK++++ 
Subjt:  PNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNK-VSSHSMITADELFFDGKLLPFWQMQQAERLNKISLKSSKDV

Query:  DKQGLVDIEVNKK------AENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-----SSSSSSSSSSSSSKSMADAPKTEEGTTRNKEKNIK
        + +    +EV KK       +N+V W +D+DPSPRPPKCTVLWKELLRLKKQR  S+ SP      SS S SSS+SSS S+ DA K EE     KEK  K
Subjt:  DKQGLVDIEVNKK------AENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSP-----SSSSSSSSSSSSSKSMADAPKTEEGTTRNKEKNIK

Query:  RIKKDLERTRSGSIRIRPMINVSICTQLKGSV-LPPLFP--LKKGRFDR
        R KK LERTRS S+RIRPMI+V ICT  K S+ LPPLFP  LKK R +R
Subjt:  RIKKDLERTRSGSIRIRPMINVSICTQLKGSV-LPPLFP--LKKGRFDR

AT3G12970.1 unknown protein (e value: 1.2e-05; % identity: 33.33)
Query:  DFEFLSNKVSSHSMITADELFFDGKLLPFWQMQQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDP-SPRPPKCTVLWKELLRLKKQRASS
        DFEFL       +M++ADELF DGKL+P        + + ++    K +       ++  ++ E +++ ++D    SPR P+CTV W+ELL LK+     
Subjt:  DFEFLSNKVSSHSMITADELFFDGKLLPFWQMQQAERLNKISLKSSKDVDKQGLVDIEVNKKAENKVNWLLDDDP-SPRPPKCTVLWKELLRLKKQRASS

Query:  ALSPSSSSSSSSSSSSSKSMADAPKT
         L+ +   +S+SSSS   S +  PKT
Subjt:  ALSPSSSSSSSSSSSSSKSMADAPKT

AT5G19340.1 unknown protein (e value: 3.7e-44; % identity: 49.27)
Query:  ASSEPNSS-PRISFSSEFL---DENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQQAERLNKIS
        A +EP+++ PRISFS++      + DFI I P  +L     I  R++KD+S   A   DFEFLS      +M++ADELF +GKLLPFWQ++ +E+L  ++
Subjt:  ASSEPNSS-PRISFSSEFL---DENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQQAERLNKIS

Query:  LK----------SSKDVDKQGLVDIEVNKKAENKVN----------WLLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SALSPSSSSSSSSSS
        LK            K V+++G V    NK+ EN  N          W LDDDPSPRPPKCTVLWKELLRLKKQR +         S+LSPSSSSSS+SSS
Subjt:  LK----------SSKDVDKQGLVDIEVNKKAENKVN----------WLLDDDPSPRPPKCTVLWKELLRLKKQRAS---------SALSPSSSSSSSSSS

Query:  SSSKSMADAPKTEEGTTRNKEKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSV-LPPLFPLK--KGRFDR
        SS  S+ DA K EE     +EK  KR KK LERTRS ++RIRPMI+V +CT  K S  LPPLFPL+  K R +R
Subjt:  SSSKSMADAPKTEEGTTRNKEKNIKRIKKDLERTRSGSIRIRPMINVSICTQLKGSV-LPPLFPLK--KGRFDR


Sequences
CDS sequence
ATGGTTTCAATAGGCAATGGTTGTAGTGTTCATGGATCAGCCTCATCAGAGCCAAATTCCAGTCCTCGGATATCTTTCTCTTCTGAGTTTCTTGATGAAAACGACTTCAT
TTCCATCACTCCAAATTCACATCTAGAGAGAGAACAAGAGATTTCTGAGAGACAGAAGAAGGACAGATCAGAGAAGCTAGCATGGAGTGCTGATTTTGAGTTTCTTTCTA
ATAAAGTTAGTAGCCACTCCATGATTACAGCTGATGAGCTCTTCTTTGACGGGAAGCTTCTTCCCTTTTGGCAAATGCAGCAAGCAGAGAGGCTTAACAAAATCAGTCTG
AAATCTTCAAAAGATGTAGATAAACAAGGCTTGGTGGACATAGAGGTAAACAAGAAGGCAGAGAACAAAGTGAATTGGTTACTCGACGACGACCCGTCGCCGAGACCACC
AAAATGCACTGTTCTGTGGAAAGAATTGTTGAGGTTGAAGAAACAACGCGCGTCGTCTGCGCTATCACCATCTTCTTCCTCGTCTTCGTCGTCGTCGTCGTCTTCTTCCA
AGTCGATGGCTGATGCACCCAAAACAGAGGAAGGGACAACAAGAAACAAAGAGAAGAACATTAAGAGGATAAAGAAGGATTTGGAAAGGACAAGATCAGGCAGTATAAGA
ATAAGGCCTATGATTAATGTGTCAATTTGCACACAGCTGAAGGGCAGTGTTTTGCCACCCTTATTCCCACTTAAGAAAGGAAGATTTGATAGATAA
mRNA sequence
ATGGTTTCAATAGGCAATGGTTGTAGTGTTCATGGATCAGCCTCATCAGAGCCAAATTCCAGTCCTCGGATATCTTTCTCTTCTGAGTTTCTTGATGAAAACGACTTCAT
TTCCATCACTCCAAATTCACATCTAGAGAGAGAACAAGAGATTTCTGAGAGACAGAAGAAGGACAGATCAGAGAAGCTAGCATGGAGTGCTGATTTTGAGTTTCTTTCTA
ATAAAGTTAGTAGCCACTCCATGATTACAGCTGATGAGCTCTTCTTTGACGGGAAGCTTCTTCCCTTTTGGCAAATGCAGCAAGCAGAGAGGCTTAACAAAATCAGTCTG
AAATCTTCAAAAGATGTAGATAAACAAGGCTTGGTGGACATAGAGGTAAACAAGAAGGCAGAGAACAAAGTGAATTGGTTACTCGACGACGACCCGTCGCCGAGACCACC
AAAATGCACTGTTCTGTGGAAAGAATTGTTGAGGTTGAAGAAACAACGCGCGTCGTCTGCGCTATCACCATCTTCTTCCTCGTCTTCGTCGTCGTCGTCGTCTTCTTCCA
AGTCGATGGCTGATGCACCCAAAACAGAGGAAGGGACAACAAGAAACAAAGAGAAGAACATTAAGAGGATAAAGAAGGATTTGGAAAGGACAAGATCAGGCAGTATAAGA
ATAAGGCCTATGATTAATGTGTCAATTTGCACACAGCTGAAGGGCAGTGTTTTGCCACCCTTATTCCCACTTAAGAAAGGAAGATTTGATAGATAA
Protein sequence
MVSIGNGCSVHGSASSEPNSSPRISFSSEFLDENDFISITPNSHLEREQEISERQKKDRSEKLAWSADFEFLSNKVSSHSMITADELFFDGKLLPFWQMQQAERLNKISL
KSSKDVDKQGLVDIEVNKKAENKVNWLLDDDPSPRPPKCTVLWKELLRLKKQRASSALSPSSSSSSSSSSSSSKSMADAPKTEEGTTRNKEKNIKRIKKDLERTRSGSIR
IRPMINVSICTQLKGSVLPPLFPLKKGRFDR
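
The protein sequence appears to be the conceptual translation of the CDS listed in this record. The sketch below shows how that relationship could be checked, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used, and the `translate` helper is hypothetical. Only the first 36 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1).
# Codons are enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 36 nucleotides of the CDS sequence from this record.
cds_prefix = "ATGGTTTCAATAGGCAATGGTTGTAGTGTTCATGGA"
print(translate(cds_prefix))  # MVSIGNGCSVHG
```

The output matches the first twelve residues of the protein sequence above. Passing the full CDS, with its line breaks, to the same function would allow the whole protein to be compared.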