; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh04G010760 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh04G010760
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionEncodes a protein whose expression is responsive to nematode infection, putative
Genome locationCmo_Chr04:5412632..5413852
RNA-Seq ExpressionCmoCh04G010760
SyntenyCmoCh04G010760
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600820.1 hypothetical protein SDJN03_06053, partial [Cucurbita argyrosperma subsp. sororia]1.0e-10095.31Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
        MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+ KEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFM
        LMLSRRSASDLTIMK KSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKA E++CQNE KMEKEKDV+IKPREKRRVSFM
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFM

Query:  TTIEAMPQIAVAT
        TT+EAM QIAVAT
Subjt:  TTIEAMPQIAVAT

KAG7031458.1 hypothetical protein SDJN02_05498, partial [Cucurbita argyrosperma subsp. argyrosperma]1.2e-10195.77Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
        MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+ KEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFM
        LMLSRRSASDLTIMK KSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKA E++CQNE KMEKEKDV+IKPREKRRVSFM
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFM

Query:  TTIEAMPQIAVAT
        TT+EAMPQIAVAT
Subjt:  TTIEAMPQIAVAT

XP_022942788.1 uncharacterized protein At1g66480 isoform X1 [Cucurbita moschata]1.1e-10498.61Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
        MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPRE---KRRV
        LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPRE   KRRV
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPRE---KRRV

Query:  SFMTTIEAMPQIAVAT
        SFMTTIEAMPQIAVAT
Subjt:  SFMTTIEAMPQIAVAT

XP_022942789.1 uncharacterized protein At1g66480 isoform X2 [Cucurbita moschata]2.7e-106100Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
        MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFM
        LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFM
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFM

Query:  TTIEAMPQIAVAT
        TTIEAMPQIAVAT
Subjt:  TTIEAMPQIAVAT

XP_023515798.1 uncharacterized protein At1g66480 [Cucurbita pepo subsp. pepo]2.6e-10497.65Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
        MGN FGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFM
        LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNE+ MEKEKDV+IKPREKRRVSFM
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFM

Query:  TTIEAMPQIAVAT
        TTIEAMPQIAVAT
Subjt:  TTIEAMPQIAVAT

TrEMBL top hitse value%identityAlignment
A0A1S3C0K8 uncharacterized protein At1g66480 isoform X21.9e-8483.72Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  PVQ  DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LPK+PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGGEQRE--GSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVS
        LML+RRSASDLTIMKPKSVL EEGG + E  GS ATRVKVRLPKAEVER+LKE KDEAEAAERIMGLY  K RESVC+N+ K EKEK  IIKPREKRRVS
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGGEQRE--GSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVS

Query:  FMTTIEAMPQIAVAT
        FMTT+EA  QIAVA+
Subjt:  FMTTIEAMPQIAVAT

A0A6J1FSC9 uncharacterized protein At1g66480 isoform X21.3e-106100Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
        MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFM
        LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFM
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFM

Query:  TTIEAMPQIAVAT
        TTIEAMPQIAVAT
Subjt:  TTIEAMPQIAVAT

A0A6J1FVP5 uncharacterized protein At1g66480 isoform X15.6e-10598.61Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
        MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPRE---KRRV
        LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPRE   KRRV
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPRE---KRRV

Query:  SFMTTIEAMPQIAVAT
        SFMTTIEAMPQIAVAT
Subjt:  SFMTTIEAMPQIAVAT

A0A6J1JB09 uncharacterized protein At1g66480 isoform X24.3e-9792.06Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
        MGN FGLKKT KVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+PKEQAPRRVRS INMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREK-RRVSF
        LMLSRRSASDLTIMKPKSVLAEEG  ++EGSEATRVKVRLPKAEVERVLKES+DEAEAAERIMGLYMAKA E+VCQN +KMEKEKDVIIKPREK RRVSF
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREK-RRVSF

Query:  MTTIEAMPQIAVAT
        M TIEAMPQI V T
Subjt:  MTTIEAMPQIAVAT

A0A6J1JDY8 uncharacterized protein At1g66480 isoform X19.5e-9790.78Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
        MGN FGLKKT KVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPK+PKEQAPRRVRS INMSAKDRLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREK----RR
        LMLSRRSASDLTIMKPKSVLAEEG  ++EGSEATRVKVRLPKAEVERVLKES+DEAEAAERIMGLYMAKA E+VCQN +KMEKEKDVIIKPREK    RR
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREK----RR

Query:  VSFMTTIEAMPQIAVAT
        VSFM TIEAMPQI V T
Subjt:  VSFMTTIEAMPQIAVAT

SwissProt top hitse value%identityAlignment
Q6NLC8 Uncharacterized protein At1g664804.2e-3345.36Show/hide
Query:  MGNAFGLK-KTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAP---------RRVRSAI
        MGN+  +K K  KVM + G+T ++  PV AR+V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLVELPK+P E            RRV S I
Subjt:  MGNAFGLK-KTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAP---------RRVRSAI

Query:  NMSAKDRLESLMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEA-AERIMGLYMAKARE
        ++ AK+RL+ LMLSRR+ SD+TI +       +G     G   T V++RLP++++ ++++E+ ++A A AE+I+G+YM ++ E
Subjt:  NMSAKDRLESLMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEA-AERIMGLYMAKARE

Arabidopsis top hitse value%identityAlignment
AT1G66480.1 plastid movement impaired 23.0e-3445.36Show/hide
Query:  MGNAFGLK-KTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAP---------RRVRSAI
        MGN+  +K K  KVM + G+T ++  PV AR+V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLVELPK+P E            RRV S I
Subjt:  MGNAFGLK-KTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAP---------RRVRSAI

Query:  NMSAKDRLESLMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEA-AERIMGLYMAKARE
        ++ AK+RL+ LMLSRR+ SD+TI +       +G     G   T V++RLP++++ ++++E+ ++A A AE+I+G+YM ++ E
Subjt:  NMSAKDRLESLMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEA-AERIMGLYMAKARE

AT1G71015.1 unknown protein9.6e-4155.15Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
        MGN+ G KKT  +M ++G++ KL  PV+A  VVKD+PG VLLESEAVK  G+RAKPLE HQ L +KR+YF+VELP+  KE+ PRRVRS I MSAK+RLE+
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGL
        L LSRRS+SDL++MK K+   E   E+RE S    VK++LPK ++E++ KES+  ++ + +I  L
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGL

AT2G01340.1 Encodes a protein whose expression is responsive to nematode infection.8.3e-4552.21Show/hide
Query:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES
        MGN+ G KKT KVM + G+T KL  PV A +V+KD+PG VLL+SE+VKHYG RAKPLE  Q+L  KRLYF+VE     KE  PRRVRS I++SAK+RLES
Subjt:  MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLES

Query:  LMLSRRSASDLTIMKPKSVLAEEGG---EQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKAR-ESVCQNERKMEKEKDVI--------
        LML+RRS+SDL+I+KP       GG   E+ EG+   RVKVR+PKAE+E+++KE   EAEA ++I  L+MAK R E   QN R+ E              
Subjt:  LMLSRRSASDLTIMKPKSVLAEEGG---EQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKAR-ESVCQNERKMEKEKDVI--------

Query:  --IKPREKRRVSFMTTIEAMPQIAVA
          +K R K RVSFM       +I VA
Subjt:  --IKPREKRRVSFMTTIEAMPQIAVA

AT5G37840.1 BEST Arabidopsis thaliana protein match is: plastid movement impaired 2 (TAIR:AT1G66480.1)4.2e-2845.61Show/hide
Query:  MGNAFGLKKT-VKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQ--APRRVRSA-INMSAKD
        MGN   +++  VKVM + GD  +L  PV A D  K+YPGFVLL+SE VK  GVRAKPLE +Q L     YFLV+LP + K      RRV S  I++ AK+
Subjt:  MGNAFGLKKT-VKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQ--APRRVRSA-INMSAKD

Query:  RLESLMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYM
        RLE LMLSRR+ SD  +   +S +  +G E       TRV++RLP++++ ++++ES D +E A +I+  YM
Subjt:  RLESLMLSRRSASDLTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAACGCCTTTGGGCTCAAGAAGACGGTTAAGGTGATGACAGTCTCCGGTGACACGATCAAACTAACCCCCCCGGTTCAAGCTCGGGATGTCGTCAAGGACTATCC
CGGCTTTGTTTTACTCGAATCCGAGGCTGTGAAACACTACGGAGTTCGAGCAAAGCCATTGGAACTCCACCAGAAGCTTAGCACGAAAAGACTCTATTTTCTTGTGGAGC
TGCCTAAAATTCCAAAAGAACAGGCTCCACGGCGAGTAAGGTCGGCGATCAACATGAGTGCGAAGGATAGGCTAGAGAGCTTGATGTTGTCACGGCGGTCAGCATCAGAC
CTAACCATCATGAAACCGAAGAGCGTGTTGGCGGAGGAGGGCGGTGAACAGAGAGAGGGATCGGAAGCGACACGGGTGAAGGTACGGCTGCCGAAGGCGGAGGTGGAAAG
GGTGTTGAAGGAGAGCAAAGATGAGGCAGAGGCAGCGGAGAGGATTATGGGGTTGTACATGGCCAAAGCTAGAGAAAGTGTTTGTCAAAATGAACGGAAGATGGAGAAGG
AGAAGGATGTCATCATCAAGCCACGTGAGAAGCGACGAGTAAGTTTCATGACGACAATAGAAGCAATGCCTCAAATTGCGGTAGCAACTTAA
mRNA sequenceShow/hide mRNA sequence
CTTCCTCTCACCTCTTTCCAACTCTTTCCCTTTTGGCTGCCATGGGAAACGCCTTTGGGCTCAAGAAGACGGTTAAGGTGATGACAGTCTCCGGTGACACGATCAAACTA
ACCCCCCCGGTTCAAGCTCGGGATGTCGTCAAGGACTATCCCGGCTTTGTTTTACTCGAATCCGAGGCTGTGAAACACTACGGAGTTCGAGCAAAGCCATTGGAACTCCA
CCAGAAGCTTAGCACGAAAAGACTCTATTTTCTTGTGGAGCTGCCTAAAATTCCAAAAGAACAGGCTCCACGGCGAGTAAGGTCGGCGATCAACATGAGTGCGAAGGATA
GGCTAGAGAGCTTGATGTTGTCACGGCGGTCAGCATCAGACCTAACCATCATGAAACCGAAGAGCGTGTTGGCGGAGGAGGGCGGTGAACAGAGAGAGGGATCGGAAGCG
ACACGGGTGAAGGTACGGCTGCCGAAGGCGGAGGTGGAAAGGGTGTTGAAGGAGAGCAAAGATGAGGCAGAGGCAGCGGAGAGGATTATGGGGTTGTACATGGCCAAAGC
TAGAGAAAGTGTTTGTCAAAATGAACGGAAGATGGAGAAGGAGAAGGATGTCATCATCAAGCCACGTGAGAAGCGACGAGTAAGTTTCATGACGACAATAGAAGCAATGC
CTCAAATTGCGGTAGCAACTTAATTAACAAGGGCAACAAAAATCATAGACAAATTAATTGAAGTGACACAGTGACGTGACGATCTACTTACTGAGAGGATTACACTACGT
CCTTCACTTCAACTTAAAACATTTAAATTTTAAACCGCTTTTGTAGAATATAGCTAGATATAGTTTTGTTTTCTTTTTACGCCCATTATGTACTCATAAATAAATTAATG
TTATATAAGTAGCAAGCTCCCGAGTCATGGAG
Protein sequenceShow/hide protein sequence
MGNAFGLKKTVKVMTVSGDTIKLTPPVQARDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVELPKIPKEQAPRRVRSAINMSAKDRLESLMLSRRSASD
LTIMKPKSVLAEEGGEQREGSEATRVKVRLPKAEVERVLKESKDEAEAAERIMGLYMAKARESVCQNERKMEKEKDVIIKPREKRRVSFMTTIEAMPQIAVAT