; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg028739 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg028739
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionEncodes a protein whose expression is responsive to nematode infection, putative
Genome locationscaffold7:14671397..14676081
RNA-Seq ExpressionSpg028739
SyntenySpg028739
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6599897.1 hypothetical protein SDJN03_05130, partial [Cucurbita argyrosperma subsp. sororia]5.3e-8682.79Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN FG KKTVKVMK++GETMKLN PVQAGDVVKDYPGFVLL+SEAVKHYGVRAKPLEP Q LS KRLYFLV+LPK+PNQ  PRR+RSAI+MSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEEKKDMIKPREKRRVS
        LMLARRSASDLTIMKPKSVL EED GEGS  AG TRLKVRLP+AEVE+LLKESKD+ EAAEKIV   GLYM+KTTQS S++A K+EK+D IKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEEKKDMIKPREKRRVS

Query:  FMTTMEAGTQIAVTS
        FMTTMEAGT+IAV S
Subjt:  FMTTMEAGTQIAVTS

XP_008454771.1 PREDICTED: uncharacterized protein At1g66480 isoform X1 [Cucumis melo]1.8e-8686.05Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGNTFG+KKTVKVMKISGET+KLN PVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLV+LPK+P +QAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-EKKDMIKPRE-KRRVS
        LMLARRSASDLTIMKPKSVL EE GG  EGS +GATR+KVRLPKAEVERLLKE KDEAEAAE+I+GLY  KT +SV EN  KE EKKD+IKPRE KRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-EKKDMIKPRE-KRRVS

Query:  FMTTMEAGTQIAVTS
        FMTTMEAGTQIAV S
Subjt:  FMTTMEAGTQIAVTS

XP_008454773.1 PREDICTED: uncharacterized protein At1g66480 isoform X2 [Cucumis melo]7.4e-8886.45Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGNTFG+KKTVKVMKISGET+KLN PVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLV+LPK+P +QAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-EKKDMIKPREKRRVSF
        LMLARRSASDLTIMKPKSVL EE GG  EGS +GATR+KVRLPKAEVERLLKE KDEAEAAE+I+GLY  KT +SV EN  KE EKKD+IKPREKRRVSF
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-EKKDMIKPREKRRVSF

Query:  MTTMEAGTQIAVTS
        MTTMEAGTQIAV S
Subjt:  MTTMEAGTQIAVTS

XP_022941735.1 uncharacterized protein At1g66480-like [Cucurbita moschata]6.3e-8783.26Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN FG KKTVKVMK++GETMKLN PVQAGDVVKDYPGFVLL+SEAVKHYGVRAKPLEPHQ LS KRLYFLV+LPK+PNQ  PRR+RSAI+MSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEEKKDMIKPREKRRVS
        LMLARRSASDLTIMKPKSVL EED GEGS  AG TRLKVRLP+AEVE+LLKESKD+ EAAEKIV   GLYM+KTTQS S++A K+EK+D IKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEEKKDMIKPREKRRVS

Query:  FMTTMEAGTQIAVTS
        FMTTMEAGT+IAV S
Subjt:  FMTTMEAGTQIAVTS

XP_023511768.1 uncharacterized protein At1g66480-like [Cucurbita pepo subsp. pepo]1.7e-8783.72Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN FG KKTVKVMK++GETMKLN PVQAGDVVKDYPGFV+L+SEAVKHYGVRAKPLEPHQ LS KRLYFLV+LPK+PNQ  PRR+RSAI+MSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEEKKDMIKPREKRRVS
        LMLARRSASDLTIMKPKSVL EED GEGS  AG TRLKVRLP+AEVE+LLKESKD+ EAAEKIV   GLYM+KTTQS S+NA KEEK+D IKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEEKKDMIKPREKRRVS

Query:  FMTTMEAGTQIAVTS
        FMTTMEAGT+IAV S
Subjt:  FMTTMEAGTQIAVTS

TrEMBL top hitse value%identityAlignment
A0A1S3BZC7 uncharacterized protein At1g66480 isoform X18.8e-8786.05Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGNTFG+KKTVKVMKISGET+KLN PVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLV+LPK+P +QAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-EKKDMIKPRE-KRRVS
        LMLARRSASDLTIMKPKSVL EE GG  EGS +GATR+KVRLPKAEVERLLKE KDEAEAAE+I+GLY  KT +SV EN  KE EKKD+IKPRE KRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-EKKDMIKPRE-KRRVS

Query:  FMTTMEAGTQIAVTS
        FMTTMEAGTQIAV S
Subjt:  FMTTMEAGTQIAVTS

A0A1S3C0K8 uncharacterized protein At1g66480 isoform X23.6e-8886.45Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGNTFG+KKTVKVMKISGET+KLN PVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLV+LPK+P +QAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-EKKDMIKPREKRRVSF
        LMLARRSASDLTIMKPKSVL EE GG  EGS +GATR+KVRLPKAEVERLLKE KDEAEAAE+I+GLY  KT +SV EN  KE EKKD+IKPREKRRVSF
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGG--EGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKE-EKKDMIKPREKRRVSF

Query:  MTTMEAGTQIAVTS
        MTTMEAGTQIAV S
Subjt:  MTTMEAGTQIAVTS

A0A6J1FSC9 uncharacterized protein At1g66480 isoform X23.3e-8180.28Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN FGLKKTVKVM +SG+T+KL PPVQA DVVKDYPGFVLLESEAVKHYGVRAKPLE HQKLSTKRLYFLVELPK+P +QAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKEEK-KD-MIKPREKRRVSFM
        LML+RRSASDLTIMKPKSVL EE G +   + ATR+KVRLPKAEVER+LKESKDEAEAAE+I+GLYM K  +SV +N +K EK KD +IKPREKRRVSFM
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKEEK-KD-MIKPREKRRVSFM

Query:  TTMEAGTQIAVTS
        TT+EA  QIAV +
Subjt:  TTMEAGTQIAVTS

A0A6J1FUK8 uncharacterized protein At1g66480-like3.0e-8783.26Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN FG KKTVKVMK++GETMKLN PVQAGDVVKDYPGFVLL+SEAVKHYGVRAKPLEPHQ LS KRLYFLV+LPK+PNQ  PRR+RSAI+MSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEEKKDMIKPREKRRVS
        LMLARRSASDLTIMKPKSVL EED GEGS  AG TRLKVRLP+AEVE+LLKESKD+ EAAEKIV   GLYM+KTTQS S++A K+EK+D IKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEEKKDMIKPREKRRVS

Query:  FMTTMEAGTQIAVTS
        FMTTMEAGT+IAV S
Subjt:  FMTTMEAGTQIAVTS

A0A6J1JGY1 uncharacterized protein At1g66480-like1.6e-8381.86Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN FG KKTVKVMK+SGETMKL+ PVQAGDVVKDYPGFVLL+SEAVKHYGVRAKPLEPHQ LS KRLYFLV+LPK+PNQ  PRR+RSAI+MSAKDRLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEEKKDMIKPREKRRVS
        LMLARRSASDLTIMKPKSVL EED GEGS  AG TRLKVRLP+AEVE+LLKESKD+ EAAEKIV   GLYM+KTTQS    A  +E++D IKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSR-AGATRLKVRLPKAEVERLLKESKDEAEAAEKIV---GLYMNKTTQSVSENAQKEEKKDMIKPREKRRVS

Query:  FMTTMEAGTQIAVTS
        FMTTMEAGT+IAV S
Subjt:  FMTTMEAGTQIAVTS

SwissProt top hitse value%identityAlignment
Q6NLC8 Uncharacterized protein At1g664802.4e-3643.42Show/hide
Query:  MGNTFGLK-KTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVP---------NQQAPRRVRSAI
        MGN+  +K K  KVMKI GET ++  PV A +V  DYPG+VLL+S+AVKH+GVR+KPLEP+Q L  K+ YFLVELPK+P         N+   RRV S I
Subjt:  MGNTFGLK-KTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVP---------NQQAPRRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSVLEEEDGGEG----SRAGATRLKVRLPKAEVERLLKESKDEAEA-AEKIVGLYMNKTTQSVSENAQKEEKKD
        ++ AK+RL+ LML+RR+ SD+TI +        DGG+G       G T +++RLP++++ +L++E+ ++A A AEKI+G+YM ++ +        + +++
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSVLEEEDGGEG----SRAGATRLKVRLPKAEVERLLKESKDEAEA-AEKIVGLYMNKTTQSVSENAQKEEKKD

Query:  M----IKPREKRRVSFMTTMEAGTQIAV
        +    IK REK +VSF    E G ++ V
Subjt:  M----IKPREKRRVSFMTTMEAGTQIAV

Arabidopsis top hitse value%identityAlignment
AT1G66480.1 plastid movement impaired 21.7e-3743.42Show/hide
Query:  MGNTFGLK-KTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVP---------NQQAPRRVRSAI
        MGN+  +K K  KVMKI GET ++  PV A +V  DYPG+VLL+S+AVKH+GVR+KPLEP+Q L  K+ YFLVELPK+P         N+   RRV S I
Subjt:  MGNTFGLK-KTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVP---------NQQAPRRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSVLEEEDGGEG----SRAGATRLKVRLPKAEVERLLKESKDEAEA-AEKIVGLYMNKTTQSVSENAQKEEKKD
        ++ AK+RL+ LML+RR+ SD+TI +        DGG+G       G T +++RLP++++ +L++E+ ++A A AEKI+G+YM ++ +        + +++
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSVLEEEDGGEG----SRAGATRLKVRLPKAEVERLLKESKDEAEA-AEKIVGLYMNKTTQSVSENAQKEEKKD

Query:  M----IKPREKRRVSFMTTMEAGTQIAV
        +    IK REK +VSF    E G ++ V
Subjt:  M----IKPREKRRVSFMTTMEAGTQIAV

AT1G71015.1 unknown protein2.7e-4354.55Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN+ G KKT  +M I+GE+ KL  PV+AG VVKD+PG VLLESEAVK  G+RAKPLEPHQ L +KR+YF+VELP+   ++ PRRVRS I MSAK+RLE+
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGL
        L L+RRS+SDL++MK K+ + +E+    S      +K++LPK ++E+L KES+  ++ + KI  L
Subjt:  LMLARRSASDLTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGL

AT2G01340.1 Encodes a protein whose expression is responsive to nematode infection.6.8e-4752.23Show/hide
Query:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES
        MGN+ G KKT KVMKI GET KL  PV A +V+KD+PG VLL+SE+VKHYG RAKPLE  Q+L  KRLYF+VE  K   +  PRRVRS I++SAK+RLES
Subjt:  MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKS--VLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVS-ENAQKEEKKDM-----------
        LMLARRS+SDL+I+KP      EEE+G         R+KVR+PKAE+E+L+KE   EAEA +KI  L+M K  Q  + +N +++E               
Subjt:  LMLARRSASDLTIMKPKS--VLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVS-ENAQKEEKKDM-----------

Query:  -IKPREKRRVSFMTTMEAGTQIAV
         +K R K RVSFM     G++I V
Subjt:  -IKPREKRRVSFMTTMEAGTQIAV

AT5G37840.1 BEST Arabidopsis thaliana protein match is: plastid movement impaired 2 (TAIR:AT1G66480.1)5.0e-3447.7Show/hide
Query:  MGNTFGLKKT-VKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVP--NQQAPRRVRSA-INMSAKD
        MGNT  +++  VKVMKI G+  +L  PV A D  K+YPGFVLL+SE VK  GVRAKPLEP+Q L     YFLV+LP V   N+   RRV S  I++ AK+
Subjt:  MGNTFGLKKT-VKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVP--NQQAPRRVRSA-INMSAKD

Query:  RLESLMLARRSASDLTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKT
        RLE LML+RR+ SD+   +   V      G+G   G TR+++RLP++++ +L++ES D +E A KI+  YM  +
Subjt:  RLESLMLARRSASDLTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAACACCTTTGGACTTAAGAAGACGGTGAAGGTGATGAAGATCTCCGGCGAGACGATGAAGCTCAACCCCCCGGTTCAAGCTGGGGACGTCGTCAAGGATTACCC
TGGCTTTGTTCTGCTCGAATCTGAGGCCGTGAAGCACTATGGAGTTCGAGCAAAGCCATTGGAGCCCCACCAGAAGCTCAGCACGAAGAGGCTCTATTTCCTCGTCGAGC
TGCCTAAGGTTCCAAATCAACAGGCTCCACGACGGGTACGGTCGGCGATCAATATGAGTGCCAAGGATAGGCTAGAGAGCTTGATGTTGGCGCGAAGGTCGGCGTCGGAC
CTAACTATCATGAAACCGAAGAGCGTGTTGGAGGAGGAGGACGGAGGAGAGGGATCGAGAGCGGGAGCGACACGGTTGAAGGTGCGGCTGCCGAAGGCGGAGGTGGAGAG
GCTGTTGAAGGAGAGCAAAGATGAGGCAGAGGCAGCGGAGAAGATTGTGGGACTGTACATGAATAAAACCACACAAAGTGTTTCTGAAAATGCTCAAAAGGAAGAGAAGA
AGGATATGATCAAGCCACGTGAGAAGCGACGTGTAAGCTTCATGACGACAATGGAAGCAGGGACACAAATTGCAGTGACATCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAACACCTTTGGACTTAAGAAGACGGTGAAGGTGATGAAGATCTCCGGCGAGACGATGAAGCTCAACCCCCCGGTTCAAGCTGGGGACGTCGTCAAGGATTACCC
TGGCTTTGTTCTGCTCGAATCTGAGGCCGTGAAGCACTATGGAGTTCGAGCAAAGCCATTGGAGCCCCACCAGAAGCTCAGCACGAAGAGGCTCTATTTCCTCGTCGAGC
TGCCTAAGGTTCCAAATCAACAGGCTCCACGACGGGTACGGTCGGCGATCAATATGAGTGCCAAGGATAGGCTAGAGAGCTTGATGTTGGCGCGAAGGTCGGCGTCGGAC
CTAACTATCATGAAACCGAAGAGCGTGTTGGAGGAGGAGGACGGAGGAGAGGGATCGAGAGCGGGAGCGACACGGTTGAAGGTGCGGCTGCCGAAGGCGGAGGTGGAGAG
GCTGTTGAAGGAGAGCAAAGATGAGGCAGAGGCAGCGGAGAAGATTGTGGGACTGTACATGAATAAAACCACACAAAGTGTTTCTGAAAATGCTCAAAAGGAAGAGAAGA
AGGATATGATCAAGCCACGTGAGAAGCGACGTGTAAGCTTCATGACGACAATGGAAGCAGGGACACAAATTGCAGTGACATCTTAA
Protein sequenceShow/hide protein sequence
MGNTFGLKKTVKVMKISGETMKLNPPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLEPHQKLSTKRLYFLVELPKVPNQQAPRRVRSAINMSAKDRLESLMLARRSASD
LTIMKPKSVLEEEDGGEGSRAGATRLKVRLPKAEVERLLKESKDEAEAAEKIVGLYMNKTTQSVSENAQKEEKKDMIKPREKRRVSFMTTMEAGTQIAVTS