; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg01628 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg01628
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionUnknown protein
Genome locationCarg_Chr15:2861116..2871885
RNA-Seq ExpressionCarg01628
SyntenyCarg01628
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009943 - Protein of unknown function DUF1475


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6578860.1 hypothetical protein SDJN03_23308, partial [Cucurbita argyrosperma subsp. sororia]1.9e-10679.7Show/hide
Query:  LVQTLGSSRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAA
        L     SSRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAA
Subjt:  LVQTLGSSRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAA

Query:  VWIVFLVCLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLID
        VWIVFLVCLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPN                                                WMVATLID
Subjt:  VWIVFLVCLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLID

Query:  FYINGTALSVWMFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE
        FYINGTALSVWMFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNR E
Subjt:  FYINGTALSVWMFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE

KAG7016393.1 hypothetical protein SDJN02_21502 [Cucurbita argyrosperma subsp. argyrosperma]1.9e-247100Show/hide
Query:  MAISAVIGWRIVFVVLACTMATSIAYTIAIDGSPFRTENFSRLMVATIIDIYISTVAIATWISYKEPNWIASTIWIVFLVCLGRKGMEQHRKHSSVMIAK
        MAISAVIGWRIVFVVLACTMATSIAYTIAIDGSPFRTENFSRLMVATIIDIYISTVAIATWISYKEPNWIASTIWIVFLVCLGRKGMEQHRKHSSVMIAK
Subjt:  MAISAVIGWRIVFVVLACTMATSIAYTIAIDGSPFRTENFSRLMVATIIDIYISTVAIATWISYKEPNWIASTIWIVFLVCLGRKGMEQHRKHSSVMIAK

Query:  KIYSVLGCLMAVNLVYLFSDGWPFRKEIFTPWMVTTLIDYFILVTVLSIWMFYKEESWLTAIFWIVLVQTLGSSRGCSRPSTTPRALCITGSLMASSAVI
        KIYSVLGCLMAVNLVYLFSDGWPFRKEIFTPWMVTTLIDYFILVTVLSIWMFYKEESWLTAIFWIVLVQTLGSSRGCSRPSTTPRALCITGSLMASSAVI
Subjt:  KIYSVLGCLMAVNLVYLFSDGWPFRKEIFTPWMVTTLIDYFILVTVLSIWMFYKEESWLTAIFWIVLVQTLGSSRGCSRPSTTPRALCITGSLMASSAVI

Query:  GWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSSQESFEDI
        GWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSSQESFEDI
Subjt:  GWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSSQESFEDI

Query:  VYNVLIKNPNNWMVATLIDFYINGTALSVWMFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQENMLGVVRGHSLTMVH
        VYNVLIKNPNNWMVATLIDFYINGTALSVWMFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQENMLGVVRGHSLTMVH
Subjt:  VYNVLIKNPNNWMVATLIDFYINGTALSVWMFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQENMLGVVRGHSLTMVH

Query:  SPRQKEDMRERRAHKQKLDGLSSIWYVSKMLHEEQNP
        SPRQKEDMRERRAHKQKLDGLSSIWYVSKMLHEEQNP
Subjt:  SPRQKEDMRERRAHKQKLDGLSSIWYVSKMLHEEQNP

XP_023551243.1 uncharacterized protein LOC111809123 isoform X1 [Cucurbita pepo subsp. pepo]1.6e-10580.69Show/hide
Query:  SRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLV
        SRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLV
Subjt:  SRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLV

Query:  CLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTA
        CLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPN                                                WMVATLIDFYINGTA
Subjt:  CLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTA

Query:  LSVWMFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE
        LSVWMFYKEESWLTALLWIALF+IFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNR E
Subjt:  LSVWMFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE

XP_023551244.1 uncharacterized protein LOC111809123 isoform X2 [Cucurbita pepo subsp. pepo]3.6e-10580.31Show/hide
Query:  SRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLV
        SRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLV
Subjt:  SRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLV

Query:  CLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTA
        CLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPN                                                WMVATLIDFYINGTA
Subjt:  CLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTA

Query:  LSVWMFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE
        LSVWMFYKEESWLTALLWIALF+IFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNR++
Subjt:  LSVWMFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE

XP_023551245.1 uncharacterized protein LOC111809123 isoform X3 [Cucurbita pepo subsp. pepo]4.7e-10580.62Show/hide
Query:  SRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLV
        SRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLV
Subjt:  SRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLV

Query:  CLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTA
        CLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPN                                                WMVATLIDFYINGTA
Subjt:  CLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTA

Query:  LSVWMFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQ
        LSVWMFYKEESWLTALLWIALF+IFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNR+
Subjt:  LSVWMFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQ

TrEMBL top hitse value%identityAlignment
A0A6J1FFW0 uncharacterized protein LOC111445346 isoform X13.8e-9277.82Show/hide
Query:  MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS
        MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVV LIDFYFNVIVIAAWVCYKESNWIAAA+WIVFLVCLGSIATCAYILW LWQLSS
Subjt:  MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS

Query:  QESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA
        QESFEDIVYNVL KNPN                                                WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA
Subjt:  QESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA

Query:  LFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE
        LFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNR E
Subjt:  LFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE

A0A6J1FGT1 uncharacterized protein LOC111445346 isoform X28.5e-9277.41Show/hide
Query:  MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS
        MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVV LIDFYFNVIVIAAWVCYKESNWIAAA+WIVFLVCLGSIATCAYILW LWQLSS
Subjt:  MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS

Query:  QESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA
        QESFEDIVYNVL KNPN                                                WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA
Subjt:  QESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA

Query:  LFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE
        LFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNR++
Subjt:  LFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE

A0A6J1JQK7 uncharacterized protein LOC111488951 isoform X12.0e-9379.08Show/hide
Query:  MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS
        MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS
Subjt:  MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS

Query:  QESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA
        QESFEDIVYNVLIKNPN                                                WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA
Subjt:  QESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA

Query:  LFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE
        LFIIFGSAS CPFIVKELFKLNSEDPAYLVLFKNSNR E
Subjt:  LFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE

A0A6J1JY71 uncharacterized protein LOC111488951 isoform X37.7e-9378.99Show/hide
Query:  MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS
        MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS
Subjt:  MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS

Query:  QESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA
        QESFEDIVYNVLIKNPN                                                WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA
Subjt:  QESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA

Query:  LFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQ
        LFIIFGSAS CPFIVKELFKLNSEDPAYLVLFKNSNR+
Subjt:  LFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQ

A0A6J1JZY5 uncharacterized protein LOC111488951 isoform X24.5e-9378.66Show/hide
Query:  MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS
        MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS
Subjt:  MASSAVIGWRILFILLGCTMVATLGYTLATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSS

Query:  QESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA
        QESFEDIVYNVLIKNPN                                                WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA
Subjt:  QESFEDIVYNVLIKNPNN-----------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTALLWIA

Query:  LFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE
        LFIIFGSAS CPFIVKELFKLNSEDPAYLVLFKNSNR++
Subjt:  LFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22750.1 unknown protein6.1e-3433.2Show/hide
Query:  LMASSAVIGWRILFILLGCTMVATLGYTLATDGSPF--RRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQ
        +  +S V G +++  ++ C M+ATL YT+ TDG P   R+++ +   V  ++DFY N++ IA W+ YKES W  + +W + L+  GS+ TC Y+  QL +
Subjt:  LMASSAVIGWRILFILLGCTMVATLGYTLATDGSPF--RRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQ

Query:  LSSQESFEDIVYNVLIKNPNN------------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTAL
        L++QE+ ED +Y +L+++                                                   WMV  L++FYI+   LSVW+ YKE S +  +
Subjt:  LSSQESFEDIVYNVLIKNPNN------------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTAL

Query:  LWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQEN
        LW+AL I  GS  +   IV +LF+L+  DP YLVL  NSNR+++
Subjt:  LWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQEN

AT1G22750.2 unknown protein1.0e-3333.2Show/hide
Query:  LMASSAVIGWRILFILLGCTMVATLGYTLATDGSPF--RRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQ
        +  +S V G +++  ++ C M+ATL YT+ TDG P   R+++ +   V  ++DFY N++ IA W+ YKES W  + +W + L+  GS+ TC Y+  QL +
Subjt:  LMASSAVIGWRILFILLGCTMVATLGYTLATDGSPF--RRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQ

Query:  LSSQESFEDIVYNVLIKNPNN------------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTAL
        L++QE+ ED +Y +L+++                                                   WMV  L++FYI+   LSVW+ YKE S +  +
Subjt:  LSSQESFEDIVYNVLIKNPNN------------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTAL

Query:  LWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQENMLG
        LW+AL I  GS  +   IV +LF+L+  DP YLVL  NSNR+    G
Subjt:  LWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQENMLG

AT1G22750.3 unknown protein1.8e-3333.61Show/hide
Query:  LMASSAVIGWRILFILLGCTMVATLGYTLATDGSPF--RRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQ
        +  +S V G +++  ++ C M+ATL YT+ TDG P   R+++ +   V  ++DFY N++ IA W+ YKES W  + +W + L+  GS+ TC Y+  QL +
Subjt:  LMASSAVIGWRILFILLGCTMVATLGYTLATDGSPF--RRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQ

Query:  LSSQESFEDIVYNVLIKNPNN------------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTAL
        L++QE+ ED +Y +L+++                                                   WMV  L++FYI+   LSVW+ YKE S +  +
Subjt:  LSSQESFEDIVYNVLIKNPNN------------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTAL

Query:  LWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNR
        LW+AL I  GS  +   IV +LF+L+  DP YLVL  NSNR
Subjt:  LWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNR

AT1G22750.4 unknown protein8.0e-3433.47Show/hide
Query:  LMASSAVIGWRILFILLGCTMVATLGYTLATDGSPF--RRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQ
        +  +S V G +++  ++ C M+ATL YT+ TDG P   R+++ +   V  ++DFY N++ IA W+ YKES W  + +W + L+  GS+ TC Y+  QL +
Subjt:  LMASSAVIGWRILFILLGCTMVATLGYTLATDGSPF--RRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQ

Query:  LSSQESFEDIVYNVLIKNPNN------------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTAL
        L++QE+ ED +Y +L+++                                                   WMV  L++FYI+   LSVW+ YKE S +  +
Subjt:  LSSQESFEDIVYNVLIKNPNN------------------------------------------------WMVATLIDFYINGTALSVWMFYKEESWLTAL

Query:  LWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQENM
        LW+AL I  GS  +   IV +LF+L+  DP YLVL  NSNR  +M
Subjt:  LWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQENM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATCTCGGCGGTAATTGGATGGCGGATTGTGTTCGTTGTCTTGGCCTGTACGATGGCTACATCTATCGCTTACACCATCGCCATCGATGGCTCTCCTTTCCGCAC
GGAAAATTTCTCGCGGTTAATGGTGGCAACCATAATTGACATTTATATCAGCACCGTAGCCATTGCGACATGGATTTCCTATAAGGAACCCAACTGGATTGCTTCGACAA
TTTGGATTGTTTTTCTTGTATGTCTTGGCAGGAAAGGCATGGAACAGCACAGGAAGCACTCCAGTGTTATGATTGCAAAAAAAATTTACAGTGTTTTGGGTTGTTTGATG
GCGGTAAATTTAGTATATCTTTTCAGTGATGGTTGGCCTTTCCGCAAGGAGATTTTTACGCCTTGGATGGTAACAACACTCATCGATTACTTCATACTTGTCACAGTTTT
GTCGATCTGGATGTTCTATAAAGAAGAATCATGGCTTACTGCAATTTTTTGGATAGTTCTAGTACAGACTCTTGGGAGCTCCCGCGGTTGCTCTCGTCCTTCTACTACTC
CCCGCGCTCTCTGTATCACTGGGTCTCTAATGGCGAGCTCGGCGGTAATCGGATGGAGGATTCTGTTCATTCTACTGGGCTGCACGATGGTTGCAACTCTCGGATACACA
CTCGCCACTGATGGCTCTCCTTTCCGCAGAGAACTTCTCTCACGGTTAATGGTGGTGGTATTGATTGATTTCTATTTCAACGTCATAGTTATTGCGGCATGGGTTTGCTA
CAAGGAATCAAACTGGATTGCTGCAGCGGTTTGGATAGTTTTTCTTGTGTGTCTAGGAAGCATCGCTACCTGTGCCTACATTCTTTGGCAGTTATGGCAATTATCATCTC
AGGAATCATTTGAAGATATTGTGTACAATGTTCTGATCAAGAATCCAAACAACTGGATGGTGGCCACGCTGATCGATTTCTATATAAATGGCACAGCTTTATCGGTCTGG
ATGTTCTATAAAGAAGAATCTTGGCTTACTGCACTCTTGTGGATAGCTCTATTCATAATCTTTGGGAGCGCCTCTTCATGTCCTTTCATTGTTAAGGAGCTATTCAAGCT
CAACTCCGAAGATCCAGCATACCTTGTTTTATTCAAAAATTCCAACAGGCAAGAAAATATGTTGGGAGTGGTGCGAGGGCACTCTCTTACCATGGTTCACTCTCCACGTC
AGAAAGAAGATATGAGAGAACGTCGTGCTCATAAGCAGAAGTTAGATGGCTTAAGCTCGATTTGGTACGTATCGAAAATGTTACATGAAGAGCAGAATCCATAA
mRNA sequenceShow/hide mRNA sequence
CATCTATAAAAGCGACAGTGCTCAAGTCCAAGGCGCTGCAATGGCGATCTCGGCGGTAATTGGATGGCGGATTGTGTTCGTTGTCTTGGCCTGTACGATGGCTACATCTA
TCGCTTACACCATCGCCATCGATGGCTCTCCTTTCCGCACGGAAAATTTCTCGCGGTTAATGGTGGCAACCATAATTGACATTTATATCAGCACCGTAGCCATTGCGACA
TGGATTTCCTATAAGGAACCCAACTGGATTGCTTCGACAATTTGGATTGTTTTTCTTGTATGTCTTGGCAGGAAAGGCATGGAACAGCACAGGAAGCACTCCAGTGTTAT
GATTGCAAAAAAAATTTACAGTGTTTTGGGTTGTTTGATGGCGGTAAATTTAGTATATCTTTTCAGTGATGGTTGGCCTTTCCGCAAGGAGATTTTTACGCCTTGGATGG
TAACAACACTCATCGATTACTTCATACTTGTCACAGTTTTGTCGATCTGGATGTTCTATAAAGAAGAATCATGGCTTACTGCAATTTTTTGGATAGTTCTAGTACAGACT
CTTGGGAGCTCCCGCGGTTGCTCTCGTCCTTCTACTACTCCCCGCGCTCTCTGTATCACTGGGTCTCTAATGGCGAGCTCGGCGGTAATCGGATGGAGGATTCTGTTCAT
TCTACTGGGCTGCACGATGGTTGCAACTCTCGGATACACACTCGCCACTGATGGCTCTCCTTTCCGCAGAGAACTTCTCTCACGGTTAATGGTGGTGGTATTGATTGATT
TCTATTTCAACGTCATAGTTATTGCGGCATGGGTTTGCTACAAGGAATCAAACTGGATTGCTGCAGCGGTTTGGATAGTTTTTCTTGTGTGTCTAGGAAGCATCGCTACC
TGTGCCTACATTCTTTGGCAGTTATGGCAATTATCATCTCAGGAATCATTTGAAGATATTGTGTACAATGTTCTGATCAAGAATCCAAACAACTGGATGGTGGCCACGCT
GATCGATTTCTATATAAATGGCACAGCTTTATCGGTCTGGATGTTCTATAAAGAAGAATCTTGGCTTACTGCACTCTTGTGGATAGCTCTATTCATAATCTTTGGGAGCG
CCTCTTCATGTCCTTTCATTGTTAAGGAGCTATTCAAGCTCAACTCCGAAGATCCAGCATACCTTGTTTTATTCAAAAATTCCAACAGGCAAGAAAATATGTTGGGAGTG
GTGCGAGGGCACTCTCTTACCATGGTTCACTCTCCACGTCAGAAAGAAGATATGAGAGAACGTCGTGCTCATAAGCAGAAGTTAGATGGCTTAAGCTCGATTTGGTACGT
ATCGAAAATGTTACATGAAGAGCAGAATCCATAAGATGTCATGATATTTGAAGGGTAATTTCGTCCAAATGCTACCATGTTTATATTTAGTCTCTAGGATTAATACTATT
ATTGAAATGCTACTATAACAACTTTGTTCTCGTTAAATTGACATAAATAACAAATTATTGGCTTTAATTTTCGAGTTTAAATAATTTGCTCATA
Protein sequenceShow/hide protein sequence
MAISAVIGWRIVFVVLACTMATSIAYTIAIDGSPFRTENFSRLMVATIIDIYISTVAIATWISYKEPNWIASTIWIVFLVCLGRKGMEQHRKHSSVMIAKKIYSVLGCLM
AVNLVYLFSDGWPFRKEIFTPWMVTTLIDYFILVTVLSIWMFYKEESWLTAIFWIVLVQTLGSSRGCSRPSTTPRALCITGSLMASSAVIGWRILFILLGCTMVATLGYT
LATDGSPFRRELLSRLMVVVLIDFYFNVIVIAAWVCYKESNWIAAAVWIVFLVCLGSIATCAYILWQLWQLSSQESFEDIVYNVLIKNPNNWMVATLIDFYINGTALSVW
MFYKEESWLTALLWIALFIIFGSASSCPFIVKELFKLNSEDPAYLVLFKNSNRQENMLGVVRGHSLTMVHSPRQKEDMRERRAHKQKLDGLSSIWYVSKMLHEEQNP