; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg00552 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg00552
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionDUF1990 domain-containing protein
Genome locationCarg_Chr03:5575883..5577153
RNA-Seq ExpressionCarg00552
SyntenyCarg00552
Gene Ontology termsNA
InterPro domainsIPR018960 - Domain of unknown function DUF1990


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034054.1 UPF0548 protein, partial [Cucurbita argyrosperma subsp. argyrosperma]7.7e-108100Show/hide
Query:  MMFIIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMP
        MMFIIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMP
Subjt:  MMFIIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMP

Query:  LQIVYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS
        LQIVYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS
Subjt:  LQIVYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS

XP_008440884.1 PREDICTED: UPF0548 protein At2g17695 [Cucumis melo]2.9e-9993.68Show/hide
Query:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
        I R+GSFNYSSKFRGATANPSSCLQEDKG+SQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVV+PLQI
Subjt:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI

Query:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS
        VYVNENR TN  RTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILS SKPAHILSFLSYPYVILRQKYFA+QSTNA KKYLT  HS
Subjt:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS

XP_022950650.1 UPF0548 protein At2g17695 [Cucurbita moschata]3.6e-10599.47Show/hide
Query:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
        I RSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
Subjt:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI

Query:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS
        VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS
Subjt:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS

XP_022979037.1 UPF0548 protein At2g17695 [Cucurbita maxima]4.7e-10598.95Show/hide
Query:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
        I RSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
Subjt:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI

Query:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS
        VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPY+ILRQKYFAYQSTNAVKKYLTAKHS
Subjt:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS

XP_031742966.1 UPF0548 protein At2g17695 isoform X2 [Cucumis sativus]8.5e-9992.27Show/hide
Query:  MMFIIRSGSFNYSSKFRGATANPSSCLQEDK-GLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVM
        MM I R+GSFNY+SKFRGATANPSSCLQEDK G+SQEGFLLNHARILVGSGV TYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVV+
Subjt:  MMFIIRSGSFNYSSKFRGATANPSSCLQEDK-GLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVM

Query:  PLQIVYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS
        PLQIVYVNENR TN  RTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPY+ILRQKYFA+QSTNAVKKYLT  HS
Subjt:  PLQIVYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS

TrEMBL top hitse value%identityAlignment
A0A0A0KKV8 DUF1990 domain-containing protein4.6e-9892.67Show/hide
Query:  IIRSGSFNYSSKFRGATANPSSCLQEDK-GLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQ
        I R+GSFNY+SKFRGATANPSSCLQEDK G+SQEGFLLNHARILVGSGV TYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVV+PLQ
Subjt:  IIRSGSFNYSSKFRGATANPSSCLQEDK-GLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQ

Query:  IVYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS
        IVYVNENR TN  RTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPY+ILRQKYFA+QSTNAVKKYLT  HS
Subjt:  IVYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS

A0A1S3B2T0 UPF0548 protein At2g176951.4e-9993.68Show/hide
Query:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
        I R+GSFNYSSKFRGATANPSSCLQEDKG+SQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVV+PLQI
Subjt:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI

Query:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS
        VYVNENR TN  RTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILS SKPAHILSFLSYPYVILRQKYFA+QSTNA KKYLT  HS
Subjt:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS

A0A6J1CKD3 UPF0548 protein At2g176951.2e-9591.98Show/hide
Query:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
        I RSGSFNYSSKFRGATANPSS L+EDKGL QEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPV+PGVKFCVC+KEFLPWVVMPLQI
Subjt:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI

Query:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTA
        VYVNENR   MARTCFSFGSGTLQGHLLAGEERFSIEMD N+QVWYEILSFSKPAHILSF+SYPYVILRQKYFA+QS+NAVKKY TA
Subjt:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTA

A0A6J1GGC4 UPF0548 protein At2g176951.7e-10599.47Show/hide
Query:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
        I RSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
Subjt:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI

Query:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS
        VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS
Subjt:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS

A0A6J1IMP9 UPF0548 protein At2g176952.3e-10598.95Show/hide
Query:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
        I RSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
Subjt:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI

Query:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS
        VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPY+ILRQKYFAYQSTNAVKKYLTAKHS
Subjt:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS

SwissProt top hitse value%identityAlignment
Q86JL6 UPF0548 protein2.5e-1630.16Show/hide
Query:  RSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWA-FVDSSTPVHPGVKFCVCAKEFLPWVVMPLQIV
        R   F YS+ +   T + ++  + +       F ++  +I +G+GVE ++K   AL+ W+HF L+W  F   +TP+  G    + +K+   W++   +I 
Subjt:  RSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWA-FVDSSTPVHPGVKFCVCAKEFLPWVVMPLQIV

Query:  YVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIE-------MDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKK
        Y+ +    +     F +  GTL+ H+  GEERF IE             V+YE+LSFS+P++ LS L YP     Q  F   S N + K
Subjt:  YVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIE-------MDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKK

Q8GXB1 UPF0548 protein At2g176957.3e-6962.57Show/hide
Query:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
        I ++G+FNY +K+RG ++   + L+ED  + ++GFL+NHAR+LVGSG E+YEKGKKALQNW+HFG++WAFVD +TPV  G KFC+C KE LPWV++PLQ+
Subjt:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI

Query:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTA
        VYV+E+R +      F +GSGTLQGHLLAGEE+FSIE+D N +VWYEI SFSKPAH LSFL YPYV LRQK+FA  S+ AV K++ A
Subjt:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTA

Q9RST8 UPF0548 protein DR_20353.5e-1031.39Show/hide
Query:  RILVGSGVETYEKGKKALQNWRHFGLNWAF---VDSSTPV-HPGVKFCVCAKEFLPWVVMPLQIVYVNENRGTNMA--RTCFSFGSGTLQGHLLAGEERF
        R+ VG G   +E+ K AL+  + F  +W      ++STP+   G    +  + F PW      +  +  NR   +      + FG GTL GHL+ GEERF
Subjt:  RILVGSGVETYEKGKKALQNWRHFGLNWAF---VDSSTPV-HPGVKFCVCAKEFLPWVVMPLQIVYVNENRGTNMA--RTCFSFGSGTLQGHLLAGEERF

Query:  SIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQK
         +E D+   VW+++ +FS+ A   S  + P V   Q+
Subjt:  SIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQK

Arabidopsis top hitse value%identityAlignment
AT2G17695.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; CONTAINS InterPro DOMAIN/s: Domain of unknown function DUF1990 (InterPro:IPR018960); Has 259 Blast hits to 259 proteins in 120 species: Archae - 0; Bacteria - 197; Metazoa - 0; Fungi - 0; Plants - 57; Viruses - 0; Other Eukaryotes - 5 (source: NCBI BLink).5.2e-7062.57Show/hide
Query:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
        I ++G+FNY +K+RG ++   + L+ED  + ++GFL+NHAR+LVGSG E+YEKGKKALQNW+HFG++WAFVD +TPV  G KFC+C KE LPWV++PLQ+
Subjt:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI

Query:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTA
        VYV+E+R +      F +GSGTLQGHLLAGEE+FSIE+D N +VWYEI SFSKPAH LSFL YPYV LRQK+FA  S+ AV K++ A
Subjt:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTA

AT2G17695.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; CONTAINS InterPro DOMAIN/s: Domain of unknown function DUF1990 (InterPro:IPR018960); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).5.2e-7062.57Show/hide
Query:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
        I ++G+FNY +K+RG ++   + L+ED  + ++GFL+NHAR+LVGSG E+YEKGKKALQNW+HFG++WAFVD +TPV  G KFC+C KE LPWV++PLQ+
Subjt:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI

Query:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTA
        VYV+E+R +      F +GSGTLQGHLLAGEE+FSIE+D N +VWYEI SFSKPAH LSFL YPYV LRQK+FA  S+ AV K++ A
Subjt:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTA

AT2G17695.3 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; CONTAINS InterPro DOMAIN/s: Domain of unknown function DUF1990 (InterPro:IPR018960).5.2e-7062.57Show/hide
Query:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI
        I ++G+FNY +K+RG ++   + L+ED  + ++GFL+NHAR+LVGSG E+YEKGKKALQNW+HFG++WAFVD +TPV  G KFC+C KE LPWV++PLQ+
Subjt:  IIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQI

Query:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTA
        VYV+E+R +      F +GSGTLQGHLLAGEE+FSIE+D N +VWYEI SFSKPAH LSFL YPYV LRQK+FA  S+ AV K++ A
Subjt:  VYVNENRGTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGTTCATCATTAGGTCTGGTTCCTTCAACTATAGCAGCAAGTTTAGAGGAGCTACTGCTAATCCCAGCTCTTGCCTGCAAGAAGATAAGGGGCTCTCACAAGAAGG
GTTTCTTCTCAACCATGCTCGTATTTTGGTGGGTTCTGGTGTTGAGACTTATGAAAAGGGGAAGAAAGCTCTTCAGAACTGGAGGCATTTTGGGTTGAATTGGGCATTTG
TTGATTCCTCAACGCCAGTTCATCCGGGAGTGAAGTTTTGTGTCTGTGCCAAGGAGTTCCTTCCATGGGTGGTGATGCCTCTTCAAATTGTATATGTAAATGAGAACAGA
GGTACCAATATGGCTAGGACATGTTTCAGTTTTGGCAGTGGTACCCTTCAAGGCCATCTTCTGGCGGGTGAAGAACGGTTTTCGATCGAGATGGACTCGAACAGCCAGGT
GTGGTATGAAATTCTTTCCTTCTCAAAGCCTGCCCACATATTATCATTTCTTAGTTATCCTTATGTAATTCTTCGGCAAAAGTATTTTGCTTATCAATCTACAAATGCAG
TTAAGAAATATTTGACTGCCAAGCATTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGTTCATCATTAGGTCTGGTTCCTTCAACTATAGCAGCAAGTTTAGAGGAGCTACTGCTAATCCCAGCTCTTGCCTGCAAGAAGATAAGGGGCTCTCACAAGAAGG
GTTTCTTCTCAACCATGCTCGTATTTTGGTGGGTTCTGGTGTTGAGACTTATGAAAAGGGGAAGAAAGCTCTTCAGAACTGGAGGCATTTTGGGTTGAATTGGGCATTTG
TTGATTCCTCAACGCCAGTTCATCCGGGAGTGAAGTTTTGTGTCTGTGCCAAGGAGTTCCTTCCATGGGTGGTGATGCCTCTTCAAATTGTATATGTAAATGAGAACAGA
GGTACCAATATGGCTAGGACATGTTTCAGTTTTGGCAGTGGTACCCTTCAAGGCCATCTTCTGGCGGGTGAAGAACGGTTTTCGATCGAGATGGACTCGAACAGCCAGGT
GTGGTATGAAATTCTTTCCTTCTCAAAGCCTGCCCACATATTATCATTTCTTAGTTATCCTTATGTAATTCTTCGGCAAAAGTATTTTGCTTATCAATCTACAAATGCAG
TTAAGAAATATTTGACTGCCAAGCATTCCTAACTCCTGGTGTGCTTGTGTGTATGTTCTTAGCTATCCTTGTGTGTATGTTCTTAGCTATCCTTGTGTGTATGTTCTTAG
CTATCCTTGTGTGTGTGTGTATGTTCTTGTGTGTATGCAGTTAAGAATGCTTCTTCCTGCTCTTCAATCCTCCTATAGTGTAGCATTCAGTCCTTAGATCGTAGCCTTTC
TGTTAAGAAAAATGGCATCATCAGGACATTGGAATAGCTCAACTATACAAATGAACTCTGTCATGATCATTGATGTTTCTTAAATTTGAGAACATCTTATCACTGGCTGC
TTCTAACCAGTTGCCAACTTCCATTAAAAGTGCAATCCCTAGAGCTTTTTTTTTTTTTTTTCACAGCCCCATCAAAATTCAGGTCTAGGATGATCACTTTTGGCTGCCGA
TGGGTGAAATTCCTATGCAGCACAAGCTAGAAAGGAGACTTTCTGAAACTAGGGCTGCATGGT
Protein sequenceShow/hide protein sequence
MMFIIRSGSFNYSSKFRGATANPSSCLQEDKGLSQEGFLLNHARILVGSGVETYEKGKKALQNWRHFGLNWAFVDSSTPVHPGVKFCVCAKEFLPWVVMPLQIVYVNENR
GTNMARTCFSFGSGTLQGHLLAGEERFSIEMDSNSQVWYEILSFSKPAHILSFLSYPYVILRQKYFAYQSTNAVKKYLTAKHS