CuGenDBv2

CsGy5G010230 (gene) of Cucumber (Gy14) v2.1 genome

Gene ID: CsGy5G010230
Organism: Cucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
Description: Encodes a protein whose expression is responsive to nematode infection.
Genome location: Gy14Chr5:9954095..9958811
RNA-Seq Expression: CsGy5G010230
Synteny: CsGy5G010230
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant


Homology
GenBank top hits | e value | % identity | Alignment
XP_004140437.1 uncharacterized protein At1g66480 [Cucumis sativus] | 5.38e-143 | 100
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS
        LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS

Query:  FMTTMEGGTQIAVAS
        FMTTMEGGTQIAVAS
Subjt:  FMTTMEGGTQIAVAS

XP_008454771.1 PREDICTED: uncharacterized protein At1g66480 isoform X1 [Cucumis melo] | 3.41e-125 | 93.06
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGET+KL TPVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLP+LPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREK-RRV
        LMLARRSASDLTIMKPKS+LTEEGGGESE GS SGA TRVK+RLPKAEVERLLKECKDEAEAAERIMGLYKTRE+  ENDHKEKE KKDIIKPREK RRV
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREK-RRV

Query:  SFMTTMEGGTQIAVAS
        SFMTTME GTQIAVAS
Subjt:  SFMTTMEGGTQIAVAS

XP_008454773.1 PREDICTED: uncharacterized protein At1g66480 isoform X2 [Cucumis melo] | 4.88e-127 | 93.49
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGET+KL TPVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLP+LPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS
        LMLARRSASDLTIMKPKS+LTEEGGGESE GS SGA TRVK+RLPKAEVERLLKECKDEAEAAERIMGLYKTRE+  ENDHKEKE KKDIIKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS

Query:  FMTTMEGGTQIAVAS
        FMTTME GTQIAVAS
Subjt:  FMTTMEGGTQIAVAS

XP_022942789.1 uncharacterized protein At1g66480 isoform X2 [Cucurbita moschata] | 5.87e-103 | 78.8
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  PVQA DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LP++PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAYENDHKEKEIKKDIIKPREKRR
        LML+RRSASDLTIMKPKS+L EEGG E  EGS +   TRVK+RLPKAEVER+LKE KDEAEAAERIMGLY  K RE+  +N+ K ++ K  IIKPREKRR
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAYENDHKEKEIKKDIIKPREKRR

Query:  VSFMTTMEGGTQIAVAS
        VSFMTT+E   QIAVA+
Subjt:  VSFMTTMEGGTQIAVAS

XP_038891934.1 uncharacterized protein At1g66480 [Benincasa hispida] | 3.63e-123 | 90.32
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVM ISGETMKL TPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LP++ KEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESE-EGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRV
        LMLARRSASDLTIMKPKS+L EEGGGES+  GS SGA TRVK+RLPKAEVERLLKECKDEAEAAERIMGLYKTREN  ENDHK++E KKDIIKPREKRRV
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESE-EGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRV

Query:  SFMTTMEG-GTQIAVAS
        SFMTTME  GTQIAVAS
Subjt:  SFMTTMEG-GTQIAVAS

TrEMBL top hits | e value | % identity | Alignment
A0A1S3BZC7 uncharacterized protein At1g66480 isoform X1 | 1.65e-125 | 93.06
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGET+KL TPVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLP+LPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREK-RRV
        LMLARRSASDLTIMKPKS+LTEEGGGESE GS SGA TRVK+RLPKAEVERLLKECKDEAEAAERIMGLYKTRE+  ENDHKEKE KKDIIKPREK RRV
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREK-RRV

Query:  SFMTTMEGGTQIAVAS
        SFMTTME GTQIAVAS
Subjt:  SFMTTMEGGTQIAVAS

A0A1S3C0K8 uncharacterized protein At1g66480 isoform X2 | 2.36e-127 | 93.49
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFGVKKTVKVMKISGET+KL TPVQ GDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLP+LPKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS
        LMLARRSASDLTIMKPKS+LTEEGGGESE GS SGA TRVK+RLPKAEVERLLKECKDEAEAAERIMGLYKTRE+  ENDHKEKE KKDIIKPREKRRVS
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVS

Query:  FMTTMEGGTQIAVAS
        FMTTME GTQIAVAS
Subjt:  FMTTMEGGTQIAVAS

A0A6J1DRH0 uncharacterized protein At1g66480 | 2.34e-101 | 77.27
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGNTFG K+TVKVMKI+GETMKLK+PVQAGDVVKDYPGFVLLESE VKHYGVRAKPLE HQKLS KRLYFLV+LP++PKEQ PRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGA-PTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAY--ENDHKEKEIKKDIIKPRE
        LMLARRSASDL IMKPKS+L EE GG   EG+VSG+  T+VK+RLP+AEVERLLKE +DEAEAAE+I+G Y  K R+     +N H  KE K + IKPRE
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGA-PTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAY--ENDHKEKEIKKDIIKPRE

Query:  KRRVSFMTTMEGGTQIAVAS
        KRRVSFM T E GTQIAVAS
Subjt:  KRRVSFMTTMEGGTQIAVAS

A0A6J1FSC9 uncharacterized protein At1g66480 isoform X2 | 2.84e-103 | 78.8
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  PVQA DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LP++PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAYENDHKEKEIKKDIIKPREKRR
        LML+RRSASDLTIMKPKS+L EEGG E  EGS +   TRVK+RLPKAEVER+LKE KDEAEAAERIMGLY  K RE+  +N+ K ++ K  IIKPREKRR
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAYENDHKEKEIKKDIIKPREKRR

Query:  VSFMTTMEGGTQIAVAS
        VSFMTT+E   QIAVA+
Subjt:  VSFMTTMEGGTQIAVAS

A0A6J1FVP5 uncharacterized protein At1g66480 isoform X1 | 4.26e-101 | 77.73
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGN FG+KKTVKVM +SG+T+KL  PVQA DVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLV+LP++PKEQAPRRVRSAINMSAKDRLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAYENDHKEKEIKKDIIKPREK--
        LML+RRSASDLTIMKPKS+L EEGG E  EGS +   TRVK+RLPKAEVER+LKE KDEAEAAERIMGLY  K RE+  +N+ K ++ K  IIKPREK  
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY--KTRENAYENDHKEKEIKKDIIKPREK--

Query:  -RRVSFMTTMEGGTQIAVAS
         RRVSFMTT+E   QIAVA+
Subjt:  -RRVSFMTTMEGGTQIAVAS

SwissProt top hits | e value | % identity | Alignment
Q6NLC8 Uncharacterized protein At1g66480 | 2.0e-35 | 43.42
Query:  MGNTFGVK-KTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAP---------RRVRSAI
        MGN+  VK K  KVMKI GET ++KTPV A +V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLV+LP+LP E            RRV S I
Subjt:  MGNTFGVK-KTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAP---------RRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEA-AERIMGLYKTRENAYENDH----KEKE
        ++ AK+RL+ LML+RR+ SD+TI +         GG+     +    T V++RLP++++ +L++E  ++A A AE+I+G+Y  R              +E
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEA-AERIMGLYKTRENAYENDH----KEKE

Query:  IKKDIIKPREKRRVSFMTTMEGGTQIAV
        +    IK REK +VSF    EGG ++ V
Subjt:  IKKDIIKPREKRRVSFMTTMEGGTQIAV

Arabidopsis top hits | e value | % identity | Alignment
AT1G66480.1 plastid movement impaired 2 | 1.4e-36 | 43.42
Query:  MGNTFGVK-KTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAP---------RRVRSAI
        MGN+  VK K  KVMKI GET ++KTPV A +V  DYPG+VLL+S+AVKH+GVR+KPLE +Q L  K+ YFLV+LP+LP E            RRV S I
Subjt:  MGNTFGVK-KTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAP---------RRVRSAI

Query:  NMSAKDRLESLMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEA-AERIMGLYKTRENAYENDH----KEKE
        ++ AK+RL+ LML+RR+ SD+TI +         GG+     +    T V++RLP++++ +L++E  ++A A AE+I+G+Y  R              +E
Subjt:  NMSAKDRLESLMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEA-AERIMGLYKTRENAYENDH----KEKE

Query:  IKKDIIKPREKRRVSFMTTMEGGTQIAV
        +    IK REK +VSF    EGG ++ V
Subjt:  IKKDIIKPREKRRVSFMTTMEGGTQIAV

AT1G71015.1 unknown protein | 6.1e-43 | 55.62
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGN+ G KKT  +M I+GE+ KLKTPV+AG VVKD+PG VLLESEAVK  G+RAKPLE HQ L +KR+YF+V+LPR  KE+ PRRVRS I MSAK+RLE+
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGL
        L L+RRS+SDL++MK K+ + +      EE  VS     VK++LPK ++E+L KE +  ++ + +I  L
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGL

AT2G01340.1 Encodes a protein whose expression is responsive to nematode infection. | 5.7e-49 | 53.3
Query:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES
        MGN+ G KKT KVMKI GET KLKTPV A +V+KD+PG VLL+SE+VKHYG RAKPLE  Q+L  KRLYF+V+     KE  PRRVRS I++SAK+RLES
Subjt:  MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLES

Query:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY---KTRENAYENDHKEKEIKKDI-------
        LMLARRS+SDL+I+KP       GG  +EE    GA  RVK+R+PKAE+E+L+KE   EAEA ++I  L+   + +E AY+N  +++             
Subjt:  LMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLY---KTRENAYENDHKEKEIKKDI-------

Query:  ---IKPREKRRVSFMTTMEGGTQIAVA
           +K R K RVSFM    GG++I VA
Subjt:  ---IKPREKRRVSFMTTMEGGTQIAVA

AT5G37840.1 BEST Arabidopsis thaliana protein match is: plastid movement impaired 2 (TAIR:AT1G66480.1) | 5.9e-30 | 46.07
Query:  MGNTFGVKKT-VKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQ--APRRVRSA-INMSAKD
        MGNT  V++  VKVMKI G+  +LKTPV A D  K+YPGFVLL+SE VK  GVRAKPLE +Q L     YFLVDLP + K      RRV S  I++ AK+
Subjt:  MGNTFGVKKT-VKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQ--APRRVRSA-INMSAKD

Query:  RLESLMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAP----TRVKMRLPKAEVERLLKECKDEAEAAERIMGLY
        RLE LML+RR+ SD+              G +    V   P    TRV++RLP++++ +L++E  D +E A +I+  Y
Subjt:  RLESLMLARRSASDLTIMKPKSMLTEEGGGESEEGSVSGAP----TRVKMRLPKAEVERLLKECKDEAEAAERIMGLY


Sequences
CDS sequence
ATGGGAAACACCTTTGGTGTCAAGAAGACGGTTAAGGTTATGAAGATCTCGGGGGAGACCATGAAACTAAAAACCCCGGTTCAAGCTGGTGATGTCGTCAAGGATTATCC
TGGCTTCGTTCTACTCGAATCTGAGGCTGTTAAACACTACGGGGTTCGAGCAAAGCCTTTGGAGCTCCACCAGAAGCTCAGCACTAAGAGACTTTATTTCCTCGTCGATC
TTCCTAGACTTCCAAAAGAACAGGCTCCACGACGAGTACGGTCAGCGATCAACATGAGTGCAAAGGATAGGTTAGAGAGCTTGATGTTGGCACGACGATCAGCATCGGAC
CTAACTATCATGAAACCAAAGAGCATGTTGACGGAGGAGGGCGGTGGAGAGAGTGAGGAGGGATCGGTATCGGGAGCGCCAACACGGGTGAAGATGCGGTTGCCGAAGGC
CGAAGTGGAAAGACTGTTGAAGGAGTGCAAAGATGAGGCAGAGGCAGCAGAAAGGATTATGGGATTGTACAAAACAAGAGAAAATGCTTATGAAAATGATCATAAGGAGA
AGGAGATCAAGAAGGATATCATCAAGCCACGTGAGAAACGACGTGTAAGTTTCATGACGACAATGGAAGGTGGAACTCAAATTGCAGTAGCATCTTAA
mRNA sequence
GCCTTCTTTGTTTGACCCAAAAAAGAATTCATTAAATTAAATTAACAATCACTTTTTTTCTTCTTCTTTTTAACCCACCAAACAAAATCTTTTCCCTATAATAAGACCCT
TTAGACTAAAATTGATTTTCTCTTTTATACGCCCACCCACCACCCTCTTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCATTTTCAAATCTCTTTTCCCTTTTGG
CTGCCATGGGAAACACCTTTGGTGTCAAGAAGACGGTTAAGGTTATGAAGATCTCGGGGGAGACCATGAAACTAAAAACCCCGGTTCAAGCTGGTGATGTCGTCAAGGAT
TATCCTGGCTTCGTTCTACTCGAATCTGAGGCTGTTAAACACTACGGGGTTCGAGCAAAGCCTTTGGAGCTCCACCAGAAGCTCAGCACTAAGAGACTTTATTTCCTCGT
CGATCTTCCTAGACTTCCAAAAGAACAGGCTCCACGACGAGTACGGTCAGCGATCAACATGAGTGCAAAGGATAGGTTAGAGAGCTTGATGTTGGCACGACGATCAGCAT
CGGACCTAACTATCATGAAACCAAAGAGCATGTTGACGGAGGAGGGCGGTGGAGAGAGTGAGGAGGGATCGGTATCGGGAGCGCCAACACGGGTGAAGATGCGGTTGCCG
AAGGCCGAAGTGGAAAGACTGTTGAAGGAGTGCAAAGATGAGGCAGAGGCAGCAGAAAGGATTATGGGATTGTACAAAACAAGAGAAAATGCTTATGAAAATGATCATAA
GGAGAAGGAGATCAAGAAGGATATCATCAAGCCACGTGAGAAACGACGTGTAAGTTTCATGACGACAATGGAAGGTGGAACTCAAATTGCAGTAGCATCTTAATATATAT
ATTAACCAAGAAAACAAAATGACAAGATTCATAAATAAATTAAGTAAAGTGATACAGTGACGATCTTCCCATGGCTACCAGATTACGTCCTTCACTTCAACTTAATATAT
TTCAATATCTTGGACATCCTTTTGTAGAATATTTACTTACATTTCATTTACTGCCTTCCCCTATATGTACACCCAAATGTACTTAATTACGATATCCATCAACTTCTTCT
TCATCATCATCACACACACACACACATATATATAGCTTTCTTCCTTTAAATATTCTGAAGAACGACTGCTCGATCTTTTATGGCTCAATATATAGTTTTAACTTTTTATG
CCCCTACCATTTGTTTCCATTTCATTGATATTCTATTAAGTTGTAACTTCATCAACCTATTACTGATATGTTTTTTAAGTCATGCTCTCCATTCTTTGTAGGAATTGTTT
TTGACATCTTCAAATATAGCAAAATGAATCAAAATATTTAGAAAATATATCAAAAATAAAATTCTATTAACGATAGAAACGGATACACTTGTTTATCAGGGACTTTGATA
GAAACTAATAGAAGTCTATCAATGTCTATCTTCACTAATAGAATCTAAAATTATTGAGTATTTGTCCTAGTTTATTGTATTCGTATGTTTATTGCACTGATACGTATTAT
TCTCTTTCGACTAAATTCAATGTTGTTTTATCCCACCAGAATTTTTAGTCCAAATGAGAGTTTGTTAGAATTTGTCATAAAACTTGTAGTTTATAACTTACCGTTGTTGG
ATTTTGTGTCCTAAAACTTATATACTACTACAAATTTGATCTTTTTTGATGTGCAAAGCATGTCAAAAAATGTCTATCTCGATAGTCTTTGTCCATCAGTAGGCAGATCC
ATGTCATCAAAAACTTTTTCTTGATGGTATTGGGCCATCAATAAAGGGCACTTTAGCTCTCGACATCTTGATTTCCTCCTCAAGGAGGATTCTTGTACTAACAATTGTAA
GGTTGAAATCCA
Protein sequence
MGNTFGVKKTVKVMKISGETMKLKTPVQAGDVVKDYPGFVLLESEAVKHYGVRAKPLELHQKLSTKRLYFLVDLPRLPKEQAPRRVRSAINMSAKDRLESLMLARRSASD
LTIMKPKSMLTEEGGGESEEGSVSGAPTRVKMRLPKAEVERLLKECKDEAEAAERIMGLYKTRENAYENDHKEKEIKKDIIKPREKRRVSFMTTMEGGTQIAVAS