CuGenDBv2

ClCG05G024630 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG05G024630
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages;
Genome location: CG_Chr05:36186279..36188602
RNA-Seq Expression: ClCG05G024630
Synteny: ClCG05G024630
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR009577 - Putative small multi-drug export
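
The genome location field in this record follows a chromosome:start..end layout. The following is a minimal sketch of how such a string could be split into its parts. It assumes the coordinates are 1-based and inclusive, which the record itself does not state, and `Location` and `parse_location` are hypothetical names chosen for illustration, not part of any CuGenDB tooling.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval as written in the record (assumed 1-based, inclusive)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


# Matches strings such as "CG_Chr05:36186279..36188602".
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location (hypothetical helper)."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("CG_Chr05:36186279..36188602")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the locus spans 2324 bases, which is longer than the CDS listed further down in the record.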


Homology
GenBank top hits
KAG7013172.1 hypothetical protein SDJN02_25928 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.3e-132 | % identity: 88.61
Query:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS
        M T+L  TSPL+SAFS RKTL SL LNRPSISQ KQSL  SS C+N+RHFNCFNPVFSTSR+  TV RCSS+ FLE D+I+PSFEEKPVKVLLLVLFWAS
Subjt:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS

Query:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR
        LSLAWFAASGDAKAAVDSIRASNFGLKIA AL SSGWPAEA+VFALATLPV+ELRGAIPVGYWMQLKP+ALTVLSVLGNMVPVPFIILYLKKFATFLAGR
Subjt:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR

Query:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
        NA+ASQFLDMLFKRAK KAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSG+SANFFGVV+AGLLVNLLVNL
Subjt:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL

XP_004135199.1 uncharacterized protein LOC101204187 [Cucumis sativus] | e value: 8.9e-139 | % identity: 93.24
Query:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS
        MTT+LP TSPL SAFSPRKTLFSLKLNRPSI++S QSLH SSP VNV HFNCF+PV  TSRI RTVPR SS+GFLEDDEIIPSFEEKPVKVLLLVLFWAS
Subjt:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS

Query:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR
        LSLAWFAASGDAKAAVDSIRASNFGLKIA AL +SGWPAEAVVFALATLPVIELRGAIPVGYWMQLKP+ALTVLSVLGNMVPVPFIILYLKKFATFLAGR
Subjt:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR

Query:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
        NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
Subjt:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL

XP_008446308.1 PREDICTED: uncharacterized protein LOC103489082 [Cucumis melo] | e value: 6.4e-137 | % identity: 92.17
Query:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS
        MTT+LP TSPL SAFSPRKTLFSLKLNRPSI+QS  SLH SSP VNV H NC +PV STSRI RTVPR SS+GFLEDDEIIPSFEEKP+KVL+LVLFWAS
Subjt:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS

Query:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR
        LSLAWFAASGDAKAAVDSIRASNFGLKIA AL SSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKP+ LTVLSVLGNMVPVPFIILYLKKFATFLAGR
Subjt:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR

Query:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
        NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
Subjt:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL

XP_023542277.1 uncharacterized protein LOC111802220 [Cucurbita pepo subsp. pepo] | e value: 6.2e-132 | % identity: 88.26
Query:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS
        M T+L  TSPL+SAFS RKTL SL LNRPSISQ KQ L  SS C+N+RHFNCFNPVFSTSR+  TV RCSS+ FLE D+I+PSFEEKPVKVLLLVLFWAS
Subjt:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS

Query:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR
        LSLAWFAASGDAKAAVDSIRASNFGLKIA AL SSGWPAEA+VFALATLPV+ELRGAIPVGYWMQLKP+ALTVLSVLGNMVPVPFIILYLKKFATFLAGR
Subjt:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR

Query:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
        NA+ASQFLDMLFKRAK KAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSG+SANFFGVV+AGLLVNLLVNL
Subjt:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL

XP_038893409.1 uncharacterized protein LOC120082205 [Benincasa hispida] | e value: 3.8e-137 | % identity: 92.17
Query:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS
        MTT+LPFTSPLIS FSPRKTLFSLKLNRPSI+QSKQSLH SS CVNVRHFN F  VFSTSRIFRTV R SS+GFLEDDEI+PSFEEKPVKV+LLVLFWAS
Subjt:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS

Query:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR
        LSLAWFAASGDAKAA DSIRASNFGLKIA AL SSGW  EAVVFALATLPVIELRGAIPVGYW+ LKP+ALTVLSVLGNMVPVPFIILYL+KFATFLAGR
Subjt:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR

Query:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
        NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
Subjt:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
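
The % identity values listed for each hit can be related to the alignment blocks: in the middle line of each block, a letter marks an identical residue, `+` marks a similar one, and a space marks a mismatch or gap. The sketch below counts letters in the middle line and divides by the number of aligned columns. It assumes every line starts with an 8-character label column (the width of `Query:  ` in this record), and `percent_identity` is a hypothetical helper, not the database's own calculation.

```python
PREFIX_WIDTH = 8  # assumed width of the "Query:  " label column in this record


def percent_identity(blocks):
    """Estimate % identity from (query_line, middle_line) pairs.

    Each pair is one alignment block as printed in the record. Identical
    positions are the letters on the middle line; '+' and spaces are not
    counted as identities.
    """
    identical = 0
    columns = 0
    for query_line, mid_line in blocks:
        query = query_line[PREFIX_WIDTH:].rstrip("\n")
        # Trailing spaces on the middle line may have been stripped; pad them back.
        mid = mid_line[PREFIX_WIDTH:].rstrip("\n").ljust(len(query))
        columns += len(query)
        identical += sum(ch.isalpha() for ch in mid[:len(query)])
    if columns == 0:
        raise ValueError("no aligned columns found")
    return 100.0 * identical / columns


# First ten columns of the first GenBank alignment in this record.
example = [("Query:  MTTTLPFTSP", "        M T+L  TSP")]
print(round(percent_identity(example), 2))
```

Slicing by a fixed offset rather than stripping whitespace matters here, because a mismatch in the first aligned column is itself printed as a space.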

TrEMBL top hits
A0A0A0KQH3 Uncharacterized protein | e value: 4.3e-139 | % identity: 93.24
Query:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS
        MTT+LP TSPL SAFSPRKTLFSLKLNRPSI++S QSLH SSP VNV HFNCF+PV  TSRI RTVPR SS+GFLEDDEIIPSFEEKPVKVLLLVLFWAS
Subjt:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS

Query:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR
        LSLAWFAASGDAKAAVDSIRASNFGLKIA AL +SGWPAEAVVFALATLPVIELRGAIPVGYWMQLKP+ALTVLSVLGNMVPVPFIILYLKKFATFLAGR
Subjt:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR

Query:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
        NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
Subjt:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL

A0A1S3BER4 uncharacterized protein LOC103489082 | e value: 3.1e-137 | % identity: 92.17
Query:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS
        MTT+LP TSPL SAFSPRKTLFSLKLNRPSI+QS  SLH SSP VNV H NC +PV STSRI RTVPR SS+GFLEDDEIIPSFEEKP+KVL+LVLFWAS
Subjt:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS

Query:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR
        LSLAWFAASGDAKAAVDSIRASNFGLKIA AL SSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKP+ LTVLSVLGNMVPVPFIILYLKKFATFLAGR
Subjt:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR

Query:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
        NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
Subjt:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL

A0A5A7SV67 Sm_multidrug_ex domain-containing protein | e value: 3.1e-137 | % identity: 92.17
Query:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS
        MTT+LP TSPL SAFSPRKTLFSLKLNRPSI+QS  SLH SSP VNV H NC +PV STSRI RTVPR SS+GFLEDDEIIPSFEEKP+KVL+LVLFWAS
Subjt:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS

Query:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR
        LSLAWFAASGDAKAAVDSIRASNFGLKIA AL SSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKP+ LTVLSVLGNMVPVPFIILYLKKFATFLAGR
Subjt:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR

Query:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
        NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
Subjt:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL

A0A6J1DB35 uncharacterized protein LOC111019066 | e value: 6.7e-132 | % identity: 87.19
Query:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS
        M T++  T P++SAFSPRKT   LKLNRPS+SQSKQSLHSSSPC+NVRHFN F+P+F+TSRIFRTV R  S+GF+E+D+I+PSFEEKPVK+LLLVLFWAS
Subjt:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS

Query:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR
        LSL+WFAASGDAKAA DSIRASNFGLKIA  L SSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKP+ALTVLSVLGNMVPVP IILYLKKFATFLAGR
Subjt:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR

Query:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
        NASAS+FLDMLFKR KEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVV+AGLLVNLLVNL
Subjt:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL

A0A6J1FZI4 uncharacterized protein LOC111449349 | e value: 8.7e-132 | % identity: 87.9
Query:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS
        M T+L  TSPL+SAFS RKTL SL LNRPSISQ  QSL  SS C+N+RHFNCFNPVFSTSR+  TV RCSS+ FLE D+I+PSFEEKPVKVLLLVLFWAS
Subjt:  MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWAS

Query:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR
        LSLAWFAASGDAKAAVDSIRASNFGLKIA AL SSGWPAEA+VFALATLPV+ELRGAIPVGYWMQLKP+ALTVLSVLGNMVPVPFIILYLKK ATFLAGR
Subjt:  LSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGR

Query:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL
        NA+ASQFLDMLFKRAK KAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSG+SANFFGVV+AGLLVNLLVNL
Subjt:  NASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNL

SwissProt top hits
No hits found
Arabidopsis top hits
AT2G02590.1 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; LOCATED IN: chloroplast; EXPRESSED IN: 21 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Putative small multi-drug export (InterPro:IPR009577); Has 405 Blast hits to 405 proteins in 185 species: Archae - 65; Bacteria - 295; Metazoa - 0; Fungi - 0; Plants - 23; Viruses - 0; Other Eukaryotes - 22 (source: NCBI BLink). | e value: 6.1e-85 | % identity: 73.99
Query:  CSS-SGFL-------EDDEII--PSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAI
        CSS  GFL       E +EII  PS    PVK  + V+ WAS SL WFA SGDAKAA DSI++S+FGL+IA  L   GWP EAVVFALATLPVIELRGAI
Subjt:  CSS-SGFL-------EDDEII--PSFEEKPVKVLLLVLFWASLSLAWFAASGDAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAI

Query:  PVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFW
        PVGYWMQLKP+ LT  SVLGNMVPVPFI+LYLK FA+F+AG++ +AS+ LD+LFKRAKEKA PVEEF+WLGLMLFVAVPFPGTGAWTGAIIASILDMPFW
Subjt:  PVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAAPVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFW

Query:  SGVSANFFGVVVAGLLVNLLVNL
        S VS+NF GVV+AGLLVNLLVNL
Subjt:  SGVSANFFGVVVAGLLVNLLVNL


Sequences
CDS sequence
ATGACTACTACTTTACCATTCACTTCACCATTAATCTCCGCATTTTCGCCGAGAAAGACCCTCTTCTCGCTCAAGCTTAATCGGCCCTCCATTAGTCAGAGTAAACAATC
TCTCCACTCGTCGAGTCCATGTGTAAATGTTCGCCATTTCAACTGTTTTAATCCTGTTTTCTCGACTTCTCGGATTTTTCGTACTGTTCCTCGGTGTTCGTCAAGTGGGT
TTCTCGAAGATGACGAGATTATCCCCTCTTTTGAGGAGAAGCCGGTTAAAGTTCTGCTGTTGGTTCTGTTTTGGGCATCTCTATCCCTTGCTTGGTTTGCTGCTTCTGGG
GATGCCAAGGCTGCTGTTGATTCTATCAGAGCTTCGAATTTTGGCCTAAAGATCGCCAGAGCATTGCATAGCTCAGGCTGGCCTGCTGAGGCTGTTGTATTTGCCCTCGC
TACGCTTCCTGTAATTGAGCTCCGTGGGGCCATTCCTGTTGGTTACTGGATGCAGCTTAAGCCTATAGCTCTAACCGTTCTATCCGTACTTGGGAACATGGTTCCTGTAC
CCTTCATCATACTCTATTTGAAGAAATTTGCTACTTTCCTAGCGGGAAGGAATGCTTCTGCCTCTCAATTCCTTGATATGTTATTCAAGAGGGCCAAAGAGAAAGCTGCA
CCTGTTGAAGAGTTTCAGTGGCTTGGTCTAATGCTATTTGTGGCCGTGCCTTTTCCTGGAACCGGAGCTTGGACTGGTGCCATAATAGCTTCCATCCTAGATATGCCATT
CTGGTCAGGTGTCTCTGCAAATTTCTTTGGTGTTGTAGTAGCAGGGCTTCTGGTCAACTTGTTGGTGAATCTTGAAGAGAAACCATCTGCAAACAACTTTCATTTATCAT
ACCGAGCTGAGAAATGTTGTGTAAATGCTGGTATTGTGATTGTTATTTGCACGTGTAGAATTGCAAATCTTGTTCTTAAGATCCCAGTAGGAGCAAATATAGTTGTGCTT
CGAAGATTGCAAAATCATGCTCGAATATTGTGGTCCGCTTTGATTTGCCGGCAAGCATAG
mRNA sequence
ATGACTACTACTTTACCATTCACTTCACCATTAATCTCCGCATTTTCGCCGAGAAAGACCCTCTTCTCGCTCAAGCTTAATCGGCCCTCCATTAGTCAGAGTAAACAATC
TCTCCACTCGTCGAGTCCATGTGTAAATGTTCGCCATTTCAACTGTTTTAATCCTGTTTTCTCGACTTCTCGGATTTTTCGTACTGTTCCTCGGTGTTCGTCAAGTGGGT
TTCTCGAAGATGACGAGATTATCCCCTCTTTTGAGGAGAAGCCGGTTAAAGTTCTGCTGTTGGTTCTGTTTTGGGCATCTCTATCCCTTGCTTGGTTTGCTGCTTCTGGG
GATGCCAAGGCTGCTGTTGATTCTATCAGAGCTTCGAATTTTGGCCTAAAGATCGCCAGAGCATTGCATAGCTCAGGCTGGCCTGCTGAGGCTGTTGTATTTGCCCTCGC
TACGCTTCCTGTAATTGAGCTCCGTGGGGCCATTCCTGTTGGTTACTGGATGCAGCTTAAGCCTATAGCTCTAACCGTTCTATCCGTACTTGGGAACATGGTTCCTGTAC
CCTTCATCATACTCTATTTGAAGAAATTTGCTACTTTCCTAGCGGGAAGGAATGCTTCTGCCTCTCAATTCCTTGATATGTTATTCAAGAGGGCCAAAGAGAAAGCTGCA
CCTGTTGAAGAGTTTCAGTGGCTTGGTCTAATGCTATTTGTGGCCGTGCCTTTTCCTGGAACCGGAGCTTGGACTGGTGCCATAATAGCTTCCATCCTAGATATGCCATT
CTGGTCAGGTGTCTCTGCAAATTTCTTTGGTGTTGTAGTAGCAGGGCTTCTGGTCAACTTGTTGGTGAATCTTGAAGAGAAACCATCTGCAAACAACTTTCATTTATCAT
ACCGAGCTGAGAAATGTTGTGTAAATGCTGGTATTGTGATTGTTATTTGCACGTGTAGAATTGCAAATCTTGTTCTTAAGATCCCAGTAGGAGCAAATATAGTTGTGCTT
CGAAGATTGCAAAATCATGCTCGAATATTGTGGTCCGCTTTGATTTGCCGGCAAGCATAG
Protein sequence
MTTTLPFTSPLISAFSPRKTLFSLKLNRPSISQSKQSLHSSSPCVNVRHFNCFNPVFSTSRIFRTVPRCSSSGFLEDDEIIPSFEEKPVKVLLLVLFWASLSLAWFAASG
DAKAAVDSIRASNFGLKIARALHSSGWPAEAVVFALATLPVIELRGAIPVGYWMQLKPIALTVLSVLGNMVPVPFIILYLKKFATFLAGRNASASQFLDMLFKRAKEKAA
PVEEFQWLGLMLFVAVPFPGTGAWTGAIIASILDMPFWSGVSANFFGVVVAGLLVNLLVNLEEKPSANNFHLSYRAEKCCVNAGIVIVICTCRIANLVLKIPVGANIVVL
RRLQNHARILWSALICRQA
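
The protein sequence above corresponds to the CDS sequence read codon by codon. As a minimal sketch of that relationship, the code below translates a nucleotide string with the standard genetic code. Applying the standard code to this gene is an assumption, and `translate` is a hypothetical helper written for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed in this record.
print(translate("ATGACTACTACTTTACCATTC"))
print(translate("CGGCAAGCATAG"))
```

The two fragments reproduce the start (MTTTLPF) and end (RQA) of the protein sequence in the record. Pasting the full CDS lines into `translate` would be the natural way to check the rest.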