; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg00112 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg00112
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionEncodes a protein whose expression is responsive to nematode infection, putative
Genome locationCarg_Chr04:5383531..5384491
RNA-Seq ExpressionCarg00112
SyntenyCarg00112
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600820.1 hypothetical protein SDJN03_06053, partial [Cucurbita argyrosperma subsp. sororia]5.2e-10599.53Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
        MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFM
        LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFM
Subjt:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFM

Query:  TTVEAMPQIAVAT
        TTVEAM QIAVAT
Subjt:  TTVEAMPQIAVAT

KAG7031458.1 hypothetical protein SDJN02_05498, partial [Cucurbita argyrosperma subsp. argyrosperma]6.1e-106100Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
        MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFM
        LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFM
Subjt:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFM

Query:  TTVEAMPQIAVAT
        TTVEAMPQIAVAT
Subjt:  TTVEAMPQIAVAT

XP_022942788.1 uncharacterized protein At1g66480 isoform X1 [Cucurbita moschata]8.5e-10094.44Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
        MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+ KEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPRE---KRRV
        LMLSRRSASDLTIMK KSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKA E++CQNE KMEKEKDV+IKPRE   KRRV
Subjt:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPRE---KRRV

Query:  SFMTTVEAMPQIAVAT
        SFMTT+EAMPQIAVAT
Subjt:  SFMTTVEAMPQIAVAT

XP_022942789.1 uncharacterized protein At1g66480 isoform X2 [Cucurbita moschata]2.0e-10195.77Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
        MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+ KEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFM
        LMLSRRSASDLTIMK KSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKA E++CQNE KMEKEKDV+IKPREKRRVSFM
Subjt:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFM

Query:  TTVEAMPQIAVAT
        TT+EAMPQIAVAT
Subjt:  TTVEAMPQIAVAT

XP_023515798.1 uncharacterized protein At1g66480 [Cucurbita pepo subsp. pepo]5.9e-10195.77Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
        MGN FGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKV KEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFM
        LMLSRRSASDLTIMK KSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKA E++CQNE  MEKEKDVMIKPREKRRVSFM
Subjt:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFM

Query:  TTVEAMPQIAVAT
        TT+EAMPQIAVAT
Subjt:  TTVEAMPQIAVAT

TrEMBL top hitse value%identityAlignment
A0A1S3C0K8 uncharacterized protein At1g66480 isoform X22.1e-8080.93Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  PVQ  DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LPK+ KEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKSKSVLAEEGGEQRE--GSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVS
        LML+RRSASDLTIMK KSVL EEGG + E  GS ATRVKVRLPKAEVER+LKE KDEAEAAERIMGLY  K  E++C+N+ K EKEK  +IKPREKRRVS
Subjt:  LMLSRRSASDLTIMKSKSVLAEEGGEQRE--GSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVS

Query:  FMTTVEAMPQIAVAT
        FMTT+EA  QIAVA+
Subjt:  FMTTVEAMPQIAVAT

A0A6J1FSC9 uncharacterized protein At1g66480 isoform X29.8e-10295.77Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
        MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+ KEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFM
        LMLSRRSASDLTIMK KSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKA E++CQNE KMEKEKDV+IKPREKRRVSFM
Subjt:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFM

Query:  TTVEAMPQIAVAT
        TT+EAMPQIAVAT
Subjt:  TTVEAMPQIAVAT

A0A6J1FVP5 uncharacterized protein At1g66480 isoform X14.1e-10094.44Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
        MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+ KEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPRE---KRRV
        LMLSRRSASDLTIMK KSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKA E++CQNE KMEKEKDV+IKPRE   KRRV
Subjt:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPRE---KRRV

Query:  SFMTTVEAMPQIAVAT
        SFMTT+EAMPQIAVAT
Subjt:  SFMTTVEAMPQIAVAT

A0A6J1JB09 uncharacterized protein At1g66480 isoform X22.3e-9591.12Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
        MGN FGLKKT KVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKV KEQAPRRVRS INMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREK-RRVSF
        LMLSRRSASDLTIMK KSVLAEEG  ++EGSEATRVKVRLPKAEVERVLKES+DEAEAAERIMGLYMAKATEN+CQN  KMEKEKDV+IKPREK RRVSF
Subjt:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREK-RRVSF

Query:  MTTVEAMPQIAVAT
        M T+EAMPQI V T
Subjt:  MTTVEAMPQIAVAT

A0A6J1JDY8 uncharacterized protein At1g66480 isoform X15.2e-9589.86Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
        MGN FGLKKT KVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKV KEQAPRRVRS INMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREK----RR
        LMLSRRSASDLTIMK KSVLAEEG  ++EGSEATRVKVRLPKAEVERVLKES+DEAEAAERIMGLYMAKATEN+CQN  KMEKEKDV+IKPREK    RR
Subjt:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREK----RR

Query:  VSFMTTVEAMPQIAVAT
        VSFM T+EAMPQI V T
Subjt:  VSFMTTVEAMPQIAVAT

SwissProt top hitse value%identityAlignment
Q6NLC8 Uncharacterized protein At1g664801.6e-3245.36Show/hide
Query:  MGNAFGLK-KTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAP---------RRVRSAI
        MGN+  +K K  KVM + G+T ++  PV AR+V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLVELPK+  E            RRV S I
Subjt:  MGNAFGLK-KTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAP---------RRVRSAI

Query:  NMSAKDRLESLMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEA-AERIMGLYMAKATE
        ++ AK+RL+ LMLSRR+ SD+TI +S      +G     G   T V++RLP++++ ++++E+ ++A A AE+I+G+YM ++ E
Subjt:  NMSAKDRLESLMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEA-AERIMGLYMAKATE

Arabidopsis top hitse value%identityAlignment
AT1G66480.1 plastid movement impaired 21.1e-3345.36Show/hide
Query:  MGNAFGLK-KTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAP---------RRVRSAI
        MGN+  +K K  KVM + G+T ++  PV AR+V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLVELPK+  E            RRV S I
Subjt:  MGNAFGLK-KTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAP---------RRVRSAI

Query:  NMSAKDRLESLMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEA-AERIMGLYMAKATE
        ++ AK+RL+ LMLSRR+ SD+TI +S      +G     G   T V++RLP++++ ++++E+ ++A A AE+I+G+YM ++ E
Subjt:  NMSAKDRLESLMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEA-AERIMGLYMAKATE

AT1G71015.1 unknown protein9.6e-4155.15Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
        MGN+ G KKT  +M ++G++ KL  PV+A  VVKD+PG VLLESEAVK  G+RAKPLE HQ L +KR+YF+VELP+  KE+ PRRVRS I MSAK+RLE+
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGL
        L LSRRS+SDL++MK K+   E   E+RE S    VK++LPK ++E++ KES+  ++ + +I  L
Subjt:  LMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGL

AT2G01340.1 Encodes a protein whose expression is responsive to nematode infection.2.7e-4351.33Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES
        MGN+ G KKT KVM + G+T KL  PV A +V+KD+PG VLL+SE+VKHYG RAKPLE  Q+L  KRLYF+VE     KE  PRRVRS I++SAK+RLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKSKSVLAEEGG---EQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAK-----ATENLCQNEPKMEKEKDVM----
        LML+RRS+SDL+I+K        GG   E+ EG+   RVKVR+PKAE+E+++KE   EAEA ++I  L+MAK     A +N  Q+EP             
Subjt:  LMLSRRSASDLTIMKSKSVLAEEGG---EQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAK-----ATENLCQNEPKMEKEKDVM----

Query:  --IKPREKRRVSFMTTVEAMPQIAVA
          +K R K RVSFM       +I VA
Subjt:  --IKPREKRRVSFMTTVEAMPQIAVA

AT5G37840.1 BEST Arabidopsis thaliana protein match is: plastid movement impaired 2 (TAIR:AT1G66480.1)1.9e-2846.78Show/hide
Query:  MGNAFGLKKT-VKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQ--APRRVRSA-INMSAKD
        MGN   +++  VKVM + GD  +L  PV A D  K+YPGFVLL+SE VK  GVRAKPLE +Q L     YFLV+LP V K      RRV S  I++ AK+
Subjt:  MGNAFGLKKT-VKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQ--APRRVRSA-INMSAKD

Query:  RLESLMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYM
        RLE LMLSRR+ SD+   +S  V     G+  E    TRV++RLP++++ ++++ES D +E A +I+  YM
Subjt:  RLESLMLSRRSASDLTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAACGCCTTTGGGCTCAAGAAGACGGTTAAGGTGATGACGGTCTCCGGTGACACGATCAAACTAACCCCCCCGGTTCAAGCTCGGGATGTCGTCAAGGACTATCC
CGGCTTTGTTTTACTCGAATCCGAGGCTGTGAAACACTACGGAGTTCGAGCAAAGCCATTGGAACTCCACCAGAAGCTTAGCACGAAAAGACTCTATTTTCTTGTGGAGC
TGCCTAAAGTTTCAAAAGAACAGGCTCCACGGCGAGTACGGTCGGCGATCAACATGAGTGCGAAGGATAGGCTAGAGAGCTTGATGTTGTCACGGCGGTCAGCATCGGAC
CTAACTATCATGAAATCGAAGAGCGTGTTGGCGGAGGAGGGCGGTGAACAGAGAGAGGGATCGGAAGCGACACGGGTGAAGGTACGGCTGCCGAAGGCGGAGGTGGAAAG
GGTGTTGAAGGAGAGCAAAGATGAGGCAGAGGCAGCGGAGAGGATTATGGGGTTGTACATGGCCAAAGCCACAGAAAATCTCTGTCAAAATGAACCGAAGATGGAGAAGG
AGAAGGATGTCATGATCAAGCCACGTGAGAAGCGACGAGTAAGTTTCATGACGACTGTAGAAGCAATGCCTCAAATTGCGGTAGCAACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAACGCCTTTGGGCTCAAGAAGACGGTTAAGGTGATGACGGTCTCCGGTGACACGATCAAACTAACCCCCCCGGTTCAAGCTCGGGATGTCGTCAAGGACTATCC
CGGCTTTGTTTTACTCGAATCCGAGGCTGTGAAACACTACGGAGTTCGAGCAAAGCCATTGGAACTCCACCAGAAGCTTAGCACGAAAAGACTCTATTTTCTTGTGGAGC
TGCCTAAAGTTTCAAAAGAACAGGCTCCACGGCGAGTACGGTCGGCGATCAACATGAGTGCGAAGGATAGGCTAGAGAGCTTGATGTTGTCACGGCGGTCAGCATCGGAC
CTAACTATCATGAAATCGAAGAGCGTGTTGGCGGAGGAGGGCGGTGAACAGAGAGAGGGATCGGAAGCGACACGGGTGAAGGTACGGCTGCCGAAGGCGGAGGTGGAAAG
GGTGTTGAAGGAGAGCAAAGATGAGGCAGAGGCAGCGGAGAGGATTATGGGGTTGTACATGGCCAAAGCCACAGAAAATCTCTGTCAAAATGAACCGAAGATGGAGAAGG
AGAAGGATGTCATGATCAAGCCACGTGAGAAGCGACGAGTAAGTTTCATGACGACTGTAGAAGCAATGCCTCAAATTGCGGTAGCAACTTAA
Protein sequenceShow/hide protein sequence
MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKVSKEQAPRRVRSAINMSAKDRLESLMLSRRSASD
LTIMKSKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKATENLCQNEPKMEKEKDVMIKPREKRRVSFMTTVEAMPQIAVAT