CuGenDBv2

CSPI04G27800 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI04G27800
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Encodes a protein whose expression is responsive to nematode infection.
Genome location: Chr4:24497331..24501510
RNA-Seq Expression: CSPI04G27800
Synteny: CSPI04G27800
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
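As a minimal sketch of how the genome location string can be read programmatically, the Python snippet below splits `Chr4:24497331..24501510` into chromosome, start, and end, and derives the locus span. The function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("Chr4:24497331..24501510")

# Assumption: 1-based, inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)  # Chr4 24497331 24501510 4180
```

Under that assumption the locus covers 4180 bp, which is longer than the mRNA listed under Sequences, as expected if the gene contains introns.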


Homology
GenBank top hits (e value, % identity, alignment)
XP_004140437.1 uncharacterized protein At1g66480 [Cucumis sativus]; e value: 2.4e-110; % identity: 100
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS
        LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS

Query:  FMTTMEGGTQIAVAS
        FMTTMEGGTQIAVAS
Subjt:  FMTTMEGGTQIAVAS

XP_008454771.1 PREDICTED: uncharacterized protein At1g66480 isoform X1 [Cucumis melo]; e value: 1.2e-96; % identity: 93.06
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGET+KL TPVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLP+LPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPRE-KRRV
        LMLARRSASDLTIMKPKS+LTEEGGGES EGS SGA TRVK+RLPKAEVERLLKECKDEAEAAERIMGLYKTRE+  ENDHKEKE KKDIIKPRE KRRV
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPRE-KRRV

Query:  SFMTTMEGGTQIAVAS
        SFMTTME GTQIAVAS
Subjt:  SFMTTMEGGTQIAVAS

XP_008454773.1 PREDICTED: uncharacterized protein At1g66480 isoform X2 [Cucumis melo]; e value: 4.7e-98; % identity: 93.49
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGET+KL TPVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLP+LPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS
        LMLARRSASDLTIMKPKS+LTEEGGGES EGS SGA TRVK+RLPKAEVERLLKECKDEAEAAERIMGLYKTRE+  ENDHKEKE KKDIIKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS

Query:  FMTTMEGGTQIAVAS
        FMTTME GTQIAVAS
Subjt:  FMTTMEGGTQIAVAS

XP_022942789.1 uncharacterized protein At1g66480 isoform X2 [Cucurbita moschata]; e value: 7.6e-80; % identity: 78.8
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  PVQA DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LP++PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAYENDHKEKEIKKDIIKPREKRR
        LML+RRSASDLTIMKPKS+L EE GGE  EGS +   TRVK+RLPKAEVER+LKE KDEAEAAERIMGLY  K RE+  +N+ K ++ K  IIKPREKRR
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAYENDHKEKEIKKDIIKPREKRR

Query:  VSFMTTMEGGTQIAVAS
        VSFMTT+E   QIAVA+
Subjt:  VSFMTTMEGGTQIAVAS

XP_038891934.1 uncharacterized protein At1g66480 [Benincasa hispida]; e value: 3.8e-95; % identity: 90.32
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVM ISGETMKL TPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LP++ KEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESE-EGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRV
        LMLARRSASDLTIMKPKS+L EEGGGES+  GS SGA TRVK+RLPKAEVERLLKECKDEAEAAERIMGLYKTREN  ENDHK++E KKDIIKPREKRRV
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESE-EGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRV

Query:  SFMTTMEG-GTQIAVAS
        SFMTTME  GTQIAVAS
Subjt:  SFMTTMEG-GTQIAVAS
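The % identity column can be related to the alignment midline (the unlabeled row between `Query:` and `Subjt:`). The Python sketch below assumes the usual BLAST midline convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `percent_identity` is hypothetical, and the input is only the final block of the XP_008454771.1 alignment, so the result differs from the 93.06 reported for the full alignment.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns whose midline character is a letter.

    Assumes a BLAST-style midline: letter = identity, '+' = positive,
    space = mismatch or gap. Gap columns count toward alignment length.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)

# Final block of the XP_008454771.1 alignment, copied from the record.
query_block = "SFMTTMEGGTQIAVAS"
midline_block = "SFMTTME GTQIAVAS"

print(percent_identity(query_block, midline_block))  # 93.75
```

To reproduce a whole-alignment figure, the blocks of one hit would need to be concatenated before calling the function.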

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BZC7 uncharacterized protein At1g66480 isoform X1; e value: 5.7e-97; % identity: 93.06
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGET+KL TPVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLP+LPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPRE-KRRV
        LMLARRSASDLTIMKPKS+LTEEGGGES EGS SGA TRVK+RLPKAEVERLLKECKDEAEAAERIMGLYKTRE+  ENDHKEKE KKDIIKPRE KRRV
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPRE-KRRV

Query:  SFMTTMEGGTQIAVAS
        SFMTTME GTQIAVAS
Subjt:  SFMTTMEGGTQIAVAS

A0A1S3C0K8 uncharacterized protein At1g66480 isoform X2; e value: 2.3e-98; % identity: 93.49
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGET+KL TPVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLP+LPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS
        LMLARRSASDLTIMKPKS+LTEEGGGES EGS SGA TRVK+RLPKAEVERLLKECKDEAEAAERIMGLYKTRE+  ENDHKEKE KKDIIKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS

Query:  FMTTMEGGTQIAVAS
        FMTTME GTQIAVAS
Subjt:  FMTTMEGGTQIAVAS

A0A6J1DRH0 uncharacterized protein At1g66480; e value: 1.2e-78; % identity: 76.36
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFG K+TVKVMKI+GETMKLK+PVQAGDVVKDYPGFVLLESE VKHYGVRAKPLE HQKLS KRLYFLV+LP++PKEQ PRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGA-PTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY----KTRENAYENDHKEKEIKKDIIKPRE
        LMLARRSASDL IMKPKS+L EE GG   EG+VSG+  T+VK+RLP+AEVERLLKE +DEAEAAE+I+G Y    + + +  +N H  KE K + IKPRE
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGA-PTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY----KTRENAYENDHKEKEIKKDIIKPRE

Query:  KRRVSFMTTMEGGTQIAVAS
        KRRVSFM T E GTQIAVAS
Subjt:  KRRVSFMTTMEGGTQIAVAS

A0A6J1FSC9 uncharacterized protein At1g66480 isoform X2; e value: 3.7e-80; % identity: 78.8
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  PVQA DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LP++PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAYENDHKEKEIKKDIIKPREKRR
        LML+RRSASDLTIMKPKS+L EE GGE  EGS +   TRVK+RLPKAEVER+LKE KDEAEAAERIMGLY  K RE+  +N+ K ++ K  IIKPREKRR
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAYENDHKEKEIKKDIIKPREKRR

Query:  VSFMTTMEGGTQIAVAS
        VSFMTT+E   QIAVA+
Subjt:  VSFMTTMEGGTQIAVAS

A0A6J1FVP5 uncharacterized protein At1g66480 isoform X1; e value: 1.5e-78; % identity: 77.73
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  PVQA DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LP++PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAYENDHKEKEIKKDIIKPRE---
        LML+RRSASDLTIMKPKS+L EE GGE  EGS +   TRVK+RLPKAEVER+LKE KDEAEAAERIMGLY  K RE+  +N+ K ++ K  IIKPRE   
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAYENDHKEKEIKKDIIKPRE---

Query:  KRRVSFMTTMEGGTQIAVAS
        KRRVSFMTT+E   QIAVA+
Subjt:  KRRVSFMTTMEGGTQIAVAS

SwissProt top hits (e value, % identity, alignment)
Q6NLC8 Uncharacterized protein At1g66480; e value: 2.0e-35; % identity: 43.42
Query:  MGNTFGVK-KTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAP---------RRVRSAI
        MGN+  VK K  KVMKI GET ++KTPV A +V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLV+LP+LP E            RRV S I
Subjt:  MGNTFGVK-KTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAP---------RRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEA-AERIMGLYKTRENAYENDH----KEKE
        ++ AK+RL+ LML+RR+ SD+TI +         GG+     +    T V++RLP++++ +L++E  ++A A AE+I+G+Y  R              +E
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEA-AERIMGLYKTRENAYENDH----KEKE

Query:  IKKDIIKPREKRRVSFMTTMEGGTQIAV
        +    IK REK +VSF    EGG ++ V
Subjt:  IKKDIIKPREKRRVSFMTTMEGGTQIAV

Arabidopsis top hits (e value, % identity, alignment)
AT1G66480.1 plastid movement impaired 2; e value: 1.4e-36; % identity: 43.42
Query:  MGNTFGVK-KTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAP---------RRVRSAI
        MGN+  VK K  KVMKI GET ++KTPV A +V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLV+LP+LP E            RRV S I
Subjt:  MGNTFGVK-KTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAP---------RRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEA-AERIMGLYKTRENAYENDH----KEKE
        ++ AK+RL+ LML+RR+ SD+TI +         GG+     +    T V++RLP++++ +L++E  ++A A AE+I+G+Y  R              +E
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEA-AERIMGLYKTRENAYENDH----KEKE

Query:  IKKDIIKPREKRRVSFMTTMEGGTQIAV
        +    IK REK +VSF    EGG ++ V
Subjt:  IKKDIIKPREKRRVSFMTTMEGGTQIAV

AT1G71015.1 unknown protein; e value: 6.1e-43; % identity: 55.62
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGN+ G KKT  +M I+GE+ KLKTPV+AG VVKD+PG VLLESEAVK  G+RAKPLE HQ L +KR+YF+V+LPR  KE+ PRRVRS I MSAK+RLE+
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGL
        L L+RRS+SDL++MK K+ + +      EE  VS     VK++LPK ++E+L KE +  ++ + +I  L
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGL

AT2G01340.1 Encodes a protein whose expression is responsive to nematode infection; e value: 5.7e-49; % identity: 53.3
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGN+ G KKT KVMKI GET KLKTPV A +V+KD+PG VLL+SE+VKHYG RAKPLE  Q+L  KRLYF+V+     KE  PRRVRS I++SAK+RLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY---KTRENAYENDHKEKEIKKDI-------
        LMLARRS+SDL+I+KP       GG  +EE    GA  RVK+R+PKAE+E+L+KE   EAEA ++I  L+   + +E AY+N  +++             
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY---KTRENAYENDHKEKEIKKDI-------

Query:  ---IKPREKRRVSFMTTMEGGTQIAVA
           +K R K RVSFM    GG++I VA
Subjt:  ---IKPREKRRVSFMTTMEGGTQIAVA

AT5G37840.1 BEST Arabidopsis thaliana protein match is: plastid movement impaired 2 (TAIR:AT1G66480.1); e value: 5.9e-30; % identity: 46.07
Query:  MGNTFGVKKT-VKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQ--APRRVRSA-INMSAKD
        MGNT  V++  VKVMKI G+  +LKTPV A D  K+YPGFVLL+SE VK  GVRAKPLE +Q L     YFLVDLP + K      RRV S  I++ AK+
Subjt:  MGNTFGVKKT-VKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQ--APRRVRSA-INMSAKD

Query:  RLESLMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAP----TRVKMRLPKAEVERLLKECKDEAEAAERIMGLY
        RLE LML+RR+ SD+              G +    V   P    TRV++RLP++++ +L++E  D +E A +I+  Y
Subjt:  RLESLMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAP----TRVKMRLPKAEVERLLKECKDEAEAAERIMGLY


Sequences
CDS sequence
ATGGGAAACACCTTTGGTGTCAAGAAGACGGTTAAGGTTATGAAGATCTCGGGGGAGACCATGAAACTAAAAACCCCGGTTCAAGCTGGTGATGTCGTCAAGGATTATCC
TGGCTTCGTTCTACTCGAATCTGAGGCTGTTAAACACTATGGGGTTCGAGCAAAGCCTTTGGAGCTCCACCAGAAGCTCAGCACTAAGAGACTTTATTTCCTCGTCGATC
TTCCTAGACTTCCAAAAGAACAGGCTCCACGACGAGTACGGTCAGCGATCAACATGAGTGCAAAGGATAGGTTAGAGAGCTTGATGTTGGCACGACGATCAGCATCGGAC
CTAACTATCATGAAACCAAAGAGCATGTTGACGGAGGAGGGCGGTGGAGAGAGTGAGGAGGGATCGGTATCGGGAGCGCCAACACGGGTGAAGATGCGGCTGCCGAAGGC
CGAAGTGGAAAGACTGTTGAAGGAGTGCAAAGATGAGGCAGAGGCAGCAGAAAGGATTATGGGATTGTACAAAACAAGAGAAAATGCTTATGAAAATGATCATAAGGAGA
AGGAGATCAAGAAGGATATCATCAAGCCACGTGAGAAACGACGTGTAAGTTTCATGACGACAATGGAAGGTGGAACTCAAATTGCAGTAGCATCTTAA
mRNA sequence
CACCAAACAAAATCTTTTCCCTATAATAAGACCCTTTAGACTAAAATTGATTTTCTCTTTTATACGCCCACCCACCACCCTCTTCTTTCTCTCTCTCATTTTCAAATCTC
TTTTCCCTTTTGGCTGCCATGGGAAACACCTTTGGTGTCAAGAAGACGGTTAAGGTTATGAAGATCTCGGGGGAGACCATGAAACTAAAAACCCCGGTTCAAGCTGGTGA
TGTCGTCAAGGATTATCCTGGCTTCGTTCTACTCGAATCTGAGGCTGTTAAACACTATGGGGTTCGAGCAAAGCCTTTGGAGCTCCACCAGAAGCTCAGCACTAAGAGAC
TTTATTTCCTCGTCGATCTTCCTAGACTTCCAAAAGAACAGGCTCCACGACGAGTACGGTCAGCGATCAACATGAGTGCAAAGGATAGGTTAGAGAGCTTGATGTTGGCA
CGACGATCAGCATCGGACCTAACTATCATGAAACCAAAGAGCATGTTGACGGAGGAGGGCGGTGGAGAGAGTGAGGAGGGATCGGTATCGGGAGCGCCAACACGGGTGAA
GATGCGGCTGCCGAAGGCCGAAGTGGAAAGACTGTTGAAGGAGTGCAAAGATGAGGCAGAGGCAGCAGAAAGGATTATGGGATTGTACAAAACAAGAGAAAATGCTTATG
AAAATGATCATAAGGAGAAGGAGATCAAGAAGGATATCATCAAGCCACGTGAGAAACGACGTGTAAGTTTCATGACGACAATGGAAGGTGGAACTCAAATTGCAGTAGCA
TCTTAATATATATATTAACCAAGAAAACAAAATGACAAGATTCATAAATAAATTAAGTAAAGTGATACAGTGACGATCTTCCCATGGCTACCAGATTACGTCCTTCACTT
CAACTTAAAATATTTCAATATCTTGGACATCCTTTTGTAGAATATTTACTTACATTTCATTTACTGCCTTCCCCTATATGTACACCCAAATGCACTTAATTACGATATCG
ATCAACTTCTTCTTCATCATCATCACACACACACATATATATAGCTTTCTTCCTTTAAATATTCTGAAGAACGACTGCTCGATCTTTTTTGGCTCAATATATAGTTTTAA
CTTTTTATGCCCC
Protein sequence
MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLESLMLARRSASD
LTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVSFMTTMEGGTQIAVAS
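
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The Python sketch below is illustrative only: the `translate` helper is hypothetical, it assumes the standard nuclear code applies to this gene, and it uses only the first 30 and last 15 nucleotides of the CDS to stay short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nt and last 15 nt of the CDS listed in this record.
cds_start = "ATGGGAAACACCTTTGGTGTCAAGAAGACG"
cds_end = "GCAGTAGCATCTTAA"

print(translate(cds_start))  # MGNTFGVKKT
print(translate(cds_end))    # AVAS
```

Both fragments agree with the start (`MGNTFGVKKT`) and end (`AVAS`) of the listed protein sequence. Passing the full CDS, with line breaks removed, would check the whole translation the same way.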