; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CsGy2G020630 (gene) of Cucumber (Gy14) v2.1 genome

Gene IDCsGy2G020630
OrganismCucumis sativus L. var. sativus cv. Gy14 (Cucumber (Gy14) v2.1)
DescriptionProtein of unknown function (DUF668)
Genome locationGy14Chr2:29530401..29534518
RNA-Seq ExpressionCsGy2G020630
SyntenyCsGy2G020630
Gene Ontology termsGO:0045927 - positive regulation of growth (biological process)
InterPro domainsIPR007700 - Domain of unknown function DUF668
IPR021864 - Domain of unknown function DUF3475
IPR045021 - Protein PSK SIMULATOR


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065159.1 uncharacterized protein E6C27_scaffold82G004950 [Cucumis melo var. makuwa]8.56e-24993.4Show/hide
Query:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
        MGNTMGGVCSNGIVKDDFVSEK+ QASEDRKGNS+LN EA DPNEMPEKS S VILLPSPPSK GSNKVAPMNAQAGARGRAVDLWKTIGISVSN HIN+
Subjt:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS

Query:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
        G  T M PSGREISILAFEVANTISKVANLSKSLSEEN+QLLK ELLQSE IKQLIS SLEELLSIAAADKRQEFGVILRE+IRFGN+CKDS+WHNLDQY
Subjt:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY

Query:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        FSRLDSNDSSQKQAREARAA+QELTVLAQNTSELYHELQALERLEQDYRRRVEEVE LNQAGIGETLSIFQGELNVQR+LVRSFQSKCLWSRNLDEIVEK
Subjt:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        LVIVVTWINQTI+KEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIA RPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
Subjt:  LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

XP_004152648.2 uncharacterized protein LOC101204577 isoform X3 [Cucumis sativus]3.45e-269100Show/hide
Query:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
        MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
Subjt:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS

Query:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
        GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
Subjt:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY

Query:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
Subjt:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
Subjt:  LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

XP_011649665.1 uncharacterized protein LOC101204577 isoform X1 [Cucumis sativus]2.36e-267100Show/hide
Query:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
        MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
Subjt:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS

Query:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
        GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
Subjt:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY

Query:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
Subjt:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
Subjt:  LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

XP_031736339.1 uncharacterized protein LOC101204577 isoform X2 [Cucumis sativus]5.05e-281100Show/hide
Query:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
        MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
Subjt:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS

Query:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
        GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
Subjt:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY

Query:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
Subjt:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREESASILW
        LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREESASILW
Subjt:  LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREESASILW

Query:  QNWRMGNPK
        QNWRMGNPK
Subjt:  QNWRMGNPK

XP_031736340.1 uncharacterized protein LOC101204577 isoform X4 [Cucumis sativus]2.93e-283100Show/hide
Query:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
        MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
Subjt:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS

Query:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
        GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
Subjt:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY

Query:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
Subjt:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREESASILW
        LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREESASILW
Subjt:  LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREESASILW

Query:  QNWRMGNPK
        QNWRMGNPK
Subjt:  QNWRMGNPK

TrEMBL top hitse value%identityAlignment
A0A0A0LLM8 Uncharacterized protein1.55e-264100Show/hide
Query:  MGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
        MGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST
Subjt:  MGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVST

Query:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
        GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL
Subjt:  GMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
Subjt:  DSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        VTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
Subjt:  VTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

A0A5A7VI62 Uncharacterized protein4.14e-24993.4Show/hide
Query:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
        MGNTMGGVCSNGIVKDDFVSEK+ QASEDRKGNS+LN EA DPNEMPEKS S VILLPSPPSK GSNKVAPMNAQAGARGRAVDLWKTIGISVSN HIN+
Subjt:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS

Query:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
        G  T M PSGREISILAFEVANTISKVANLSKSLSEEN+QLLK ELLQSE IKQLIS SLEELLSIAAADKRQEFGVILRE+IRFGN+CKDS+WHNLDQY
Subjt:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY

Query:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        FSRLDSNDSSQKQAREARAA+QELTVLAQNTSELYHELQALERLEQDYRRRVEEVE LNQAGIGETLSIFQGELNVQR+LVRSFQSKCLWSRNLDEIVEK
Subjt:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        LVIVVTWINQTI+KEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIA RPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
Subjt:  LVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

A0A6J1HB43 uncharacterized protein LOC111462458 isoform X31.12e-21380.59Show/hide
Query:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
        MGNTMGGVCSNGI KD F SEKI Q SEDRKGNS L+SEA DPNEMP++SRS V LL SPPSKTGSNKVAP+N+QAG+RGRA+DL KTIG SVSN H+NS
Subjt:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS

Query:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
        G  TGMA +GREISILAFEVANTISKVANLS+SLSEENIQLLK ELLQSE IKQL+S S EELLSIAAADKRQEF V+L E+IRFG +CKD QWHNLDQY
Subjt:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY

Query:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        FSRLD NDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALER EQDYRR+V+EVE +NQAG GE+LSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIIKEFGVDNT---------DKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDA
        LVIVVTWINQTI K F   NT         DKTL I+DRSNGQKLG+VGLALHYA IISQINLIACRPTSIPSNMRDALYRALPTS+KI LRSRLR V+ 
Subjt:  LVIVVTWINQTIIKEFGVDNT---------DKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDA

Query:  REESASI
         EE   I
Subjt:  REESASI

A0A6J1HF33 uncharacterized protein LOC111462458 isoform X12.95e-21280.59Show/hide
Query:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
        MGNTMGGVCSNGI KD F SEKI Q SEDRKGNS L+SEA DPNEMP++SRS V LL SPPSKTGSNKVAP+N+QAG+RGRA+DL KTIG SVSN H+NS
Subjt:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS

Query:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
        G  TGMA +GREISILAFEVANTISKVANLS+SLSEENIQLLK ELLQSE IKQL+S S EELLSIAAADKRQEF V+L E+IRFG +CKD QWHNLDQY
Subjt:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY

Query:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        FSRLD NDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALER EQDYRR+V+EVE +NQAG GE+LSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIIKEFGVDNT---------DKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDA
        LVIVVTWINQTI K F   NT         DKTL I+DRSNGQKLG+VGLALHYA IISQINLIACRPTSIPSNMRDALYRALPTS+KI LRSRLR V+ 
Subjt:  LVIVVTWINQTIIKEFGVDNT---------DKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDA

Query:  REESASI
         EE   I
Subjt:  REESASI

A0A6J1KCI8 uncharacterized protein LOC111492046 isoform X25.18e-21280.34Show/hide
Query:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS
        MGNTMGGVCSNGI KD F SEKI Q SEDR GNS LNSEA D NEMP++SRS V LLPSPPSK GSNKVAP+N+QAG+RGRA+DL KTIG SVSN H+N 
Subjt:  MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINS

Query:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY
        G  TGMA +G EISILAFEVANTISKV NLS+SLSEENIQLLK ELLQSE IKQL+S S EELLSIAAADKRQEF V+LRE+IRFG +CKD QWHNLDQY
Subjt:  GVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQY

Query:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        FSRLD NDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALER EQDYRR+V+EVE LNQ GIGE+LSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FSRLDSNDSSQKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIIKEFGVDNT---------DKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDA
        LVIVVTWINQTI K FG  NT         DKTL I+DRS GQKLG+VGLALHYANIISQINLIACRP SIPSNMRDALYRALPTS+KI LRSRLR VD 
Subjt:  LVIVVTWINQTIIKEFGVDNT---------DKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDA

Query:  REESASI
         EE   I
Subjt:  REESASI

SwissProt top hitse value%identityAlignment
P0DO24 Protein PSK SIMULATOR 36.8e-6444.05Show/hide
Query:  SKTGSNKVAPMNAQAG--ARGRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISAS
        ++T  +KV   +   G    GRA D+  T+G S+++   + G ++G+A  G E+ ILAFEVANTI K +NL +SLS+ NI+ LK  +L SE ++ L+S  
Subjt:  SKTGSNKVAPMNAQAG--ARGRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISAS

Query:  LEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQARE-ARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFL
         +ELL + AADKRQE  V   E++RFGNR KD QWHNL +YF R+    + Q+Q +E A   + +L VL Q T+ELY ELQ L RLE+DY ++  E E  
Subjt:  LEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQARE-ARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFL

Query:  NQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACR
          +  G+ L+I + EL  QRK+V+S + K LWSR  +E++EKLV +V ++   I   FG    D+          ++LG  GLALHYANII QI+ +  R
Subjt:  NQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACR

Query:  PTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
         +SI SN RD+LY++LP  IK+ALRS++++ +  +E
Subjt:  PTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

Q9SA91 Protein PSK SIMULATOR 28.0e-8144.6Show/hide
Query:  MGGVCSNGIVKDDFVSEKIIQASEDR------KGNSYLNSEARDP-------NEMPEKSRSDVILL-------PSPPSKTGSNKVAPMNAQAGARG----
        MGGVCS  + KDD   +K+    +D+      K  S   S+  D            + S+ D ++        P PP +  S K    N+  G  G    
Subjt:  MGGVCSNGIVKDDFVSEKIIQASEDR------KGNSYLNSEARDP-------NEMPEKSRSDVILL-------PSPPSKTGSNKVAPMNAQAGARG----

Query:  -RAVDLWKTIGISVSNFHINSGVSTGMAPS-GREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVI
         +AV++  T+G S++  + ++   +G+  S G +++ILAFEVANTI+K A L +SLSEEN++ +K ++L SE +K+L+S    EL  +AA+DKR+E  + 
Subjt:  -RAVDLWKTIGISVSNFHINSGVSTGMAPS-GREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVI

Query:  LREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQ
          E+IRFGN CKD QWHNLD+YF +LD+ +S  K  + +A A +QEL  LA+ TSELYHELQAL+R EQDYRR++ EVE LN    GE + I Q EL  Q
Subjt:  LREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQ

Query:  RKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTS
        +KLV+S Q K LWS+NL EI+EKLV VV++I QTI++ FG +        +     ++LG  GL+LHYAN+I QI+ IA RP+S+PSN+RD LY ALP +
Subjt:  RKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTS

Query:  IKIALRSRLRAVDAREE
        +K ALR RL+ +D  EE
Subjt:  IKIALRSRLRAVDAREE

Q9XID5 Protein PSK SIMULATOR 12.6e-7146.55Show/hide
Query:  NKVAPMNAQAGAR--GRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELL
        ++V+ +  +AG    G+AVD+  T+G S++N +++ G S+     G +ISIL+FEVANTI K ANL  SLS+++I  LK  +L SE ++ LIS  ++ELL
Subjt:  NKVAPMNAQAGAR--GRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELL

Query:  SIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVE--FLNQA
         IAAADKR+E  +   E++RFGNRCKD Q+HNLD++F RL S  + QK  + EA   + ++      T++LYHEL AL+R EQDY+R+++E E     Q 
Subjt:  SIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVE--FLNQA

Query:  GIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTS
        G+G+TL+I + EL  Q+K VR+ + K LWSR L+E++EKLV VV +++  I + FG  + DK        N +KLG+ GLALHYANII+QI+ +  R ++
Subjt:  GIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTS

Query:  IPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        +P++ RDALY+ LP SIK ALRSR+++   +EE
Subjt:  IPSNMRDALYRALPTSIKIALRSRLRAVDAREE

Arabidopsis top hitse value%identityAlignment
AT1G30755.1 Protein of unknown function (DUF668)5.6e-8244.6Show/hide
Query:  MGGVCSNGIVKDDFVSEKIIQASEDR------KGNSYLNSEARDP-------NEMPEKSRSDVILL-------PSPPSKTGSNKVAPMNAQAGARG----
        MGGVCS  + KDD   +K+    +D+      K  S   S+  D            + S+ D ++        P PP +  S K    N+  G  G    
Subjt:  MGGVCSNGIVKDDFVSEKIIQASEDR------KGNSYLNSEARDP-------NEMPEKSRSDVILL-------PSPPSKTGSNKVAPMNAQAGARG----

Query:  -RAVDLWKTIGISVSNFHINSGVSTGMAPS-GREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVI
         +AV++  T+G S++  + ++   +G+  S G +++ILAFEVANTI+K A L +SLSEEN++ +K ++L SE +K+L+S    EL  +AA+DKR+E  + 
Subjt:  -RAVDLWKTIGISVSNFHINSGVSTGMAPS-GREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVI

Query:  LREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQ
          E+IRFGN CKD QWHNLD+YF +LD+ +S  K  + +A A +QEL  LA+ TSELYHELQAL+R EQDYRR++ EVE LN    GE + I Q EL  Q
Subjt:  LREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQ

Query:  RKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTS
        +KLV+S Q K LWS+NL EI+EKLV VV++I QTI++ FG +        +     ++LG  GL+LHYAN+I QI+ IA RP+S+PSN+RD LY ALP +
Subjt:  RKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTS

Query:  IKIALRSRLRAVDAREE
        +K ALR RL+ +D  EE
Subjt:  IKIALRSRLRAVDAREE

AT1G34320.1 Protein of unknown function (DUF668)1.8e-7246.55Show/hide
Query:  NKVAPMNAQAGAR--GRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELL
        ++V+ +  +AG    G+AVD+  T+G S++N +++ G S+     G +ISIL+FEVANTI K ANL  SLS+++I  LK  +L SE ++ LIS  ++ELL
Subjt:  NKVAPMNAQAGAR--GRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELL

Query:  SIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVE--FLNQA
         IAAADKR+E  +   E++RFGNRCKD Q+HNLD++F RL S  + QK  + EA   + ++      T++LYHEL AL+R EQDY+R+++E E     Q 
Subjt:  SIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVE--FLNQA

Query:  GIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTS
        G+G+TL+I + EL  Q+K VR+ + K LWSR L+E++EKLV VV +++  I + FG  + DK        N +KLG+ GLALHYANII+QI+ +  R ++
Subjt:  GIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACRPTS

Query:  IPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        +P++ RDALY+ LP SIK ALRSR+++   +EE
Subjt:  IPSNMRDALYRALPTSIKIALRSRLRAVDAREE

AT3G23160.1 Protein of unknown function (DUF668)1.0e-1425.21Show/hide
Query:  ISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQK
        I IL+FEVAN +SK  +L +SLS+  I  LK E+  SE +++L+S+    LL ++ ++K  +   +   + R G +C +      +  +  + +     +
Subjt:  ISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQK

Query:  Q----AREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWI
        +     ++  + ++++      T  LY E++ +  LEQ        V+        E++  F+ +L  QR+ V+S +   LW++  D++VE L   V  I
Subjt:  Q----AREARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWI

Query:  N---QTIIKEFGVDNTDKTLLIKDRSNGQKLGAV
            +T+    G+       L +DRS  +   AV
Subjt:  N---QTIIKEFGVDNTDKTLLIKDRSNGQKLGAV

AT5G08660.1 Protein of unknown function (DUF668)4.8e-6544.05Show/hide
Query:  SKTGSNKVAPMNAQAG--ARGRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISAS
        ++T  +KV   +   G    GRA D+  T+G S+++   + G ++G+A  G E+ ILAFEVANTI K +NL +SLS+ NI+ LK  +L SE ++ L+S  
Subjt:  SKTGSNKVAPMNAQAG--ARGRAVDLWKTIGISVSNFHINSGVSTGMAPSGREISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISAS

Query:  LEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQARE-ARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFL
         +ELL + AADKRQE  V   E++RFGNR KD QWHNL +YF R+    + Q+Q +E A   + +L VL Q T+ELY ELQ L RLE+DY ++  E E  
Subjt:  LEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQARE-ARAALQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFL

Query:  NQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACR
          +  G+ L+I + EL  QRK+V+S + K LWSR  +E++EKLV +V ++   I   FG    D+          ++LG  GLALHYANII QI+ +  R
Subjt:  NQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIACR

Query:  PTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
         +SI SN RD+LY++LP  IK+ALRS++++ +  +E
Subjt:  PTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

AT5G51670.1 Protein of unknown function (DUF668)2.8e-1221.32Show/hide
Query:  ISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDS---QWHNLDQYFSRLDSNDS
        + +L+FEVA  ++K+ +L+ SL++ N+   ++  L  E + ++++      LS+  A+           + R  NRC  +    +H L   F+ +  +  
Subjt:  ISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDS---QWHNLDQYFSRLDSNDS

Query:  S-QKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRR--------VEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
              ++  A  +++      T+ LY E++ +  LE   R++         EE ++ N+  + + + + Q ++  Q++ V+  + + LW+++ D +V  
Subjt:  S-QKQAREARAALQELTVLAQNTSELYHELQALERLEQDYRRR--------VEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LV-------------------------IVVTWINQTIIKEFGVDN--------------TDKTLLIKDRSNGQK-----LGAVGLALHYANIISQINLIA
        L                           VV+ + +++       N              T  +  +++ S   K     LG  G+ALHYAN+I  +  + 
Subjt:  LV-------------------------IVVTWINQTIIKEFGVDN--------------TDKTLLIKDRSNGQK-----LGAVGLALHYANIISQINLIA

Query:  CRPTSIPSNMRDALYRALPTSIKIALRSRLRAV
         +P  +  + RD LY  LP S++ +LRSRL+ V
Subjt:  CRPTSIPSNMRDALYRALPTSIKIALRSRLRAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAACACAATGGGAGGGGTTTGTTCTAATGGAATAGTTAAGGATGATTTTGTATCTGAGAAGATAATTCAAGCATCTGAGGATCGGAAAGGGAATTCGTATTTGAA
TTCTGAGGCTAGGGATCCGAATGAAATGCCAGAGAAGTCTCGTTCTGATGTGATACTATTGCCTTCACCTCCGTCTAAGACTGGAAGCAATAAGGTTGCACCAATGAACG
CACAAGCAGGGGCTCGCGGGAGGGCAGTTGATTTATGGAAAACAATTGGGATCAGTGTGTCAAATTTCCATATCAACAGTGGGGTTTCTACAGGCATGGCGCCTAGTGGT
AGGGAGATTTCTATATTGGCTTTTGAAGTAGCAAACACAATAAGCAAAGTAGCAAATTTGTCAAAATCTCTCTCAGAAGAAAATATTCAGCTCCTTAAAAACGAACTATT
ACAATCAGAAGCGATAAAACAATTGATCTCAGCAAGTTTAGAGGAATTGCTTAGCATTGCAGCTGCTGACAAAAGGCAGGAATTTGGCGTCATCTTACGGGAGATAATAC
GATTTGGAAATCGGTGCAAGGATTCACAGTGGCATAATCTGGATCAATACTTTTCAAGACTAGATTCGAATGATTCAAGTCAAAAACAAGCTCGAGAGGCCAGAGCAGCC
TTACAGGAACTAACCGTTTTAGCTCAGAATACTTCTGAATTATACCACGAATTACAAGCATTGGAAAGACTTGAGCAAGATTATAGACGGAGAGTTGAGGAAGTGGAGTT
CTTGAACCAAGCAGGAATAGGAGAAACTCTCTCAATCTTCCAAGGAGAACTAAACGTACAAAGAAAACTTGTAAGGAGTTTCCAAAGCAAGTGTCTTTGGTCCAGAAATC
TAGATGAGATTGTGGAAAAGCTCGTTATCGTTGTAACATGGATAAATCAAACAATAATCAAAGAATTTGGAGTTGACAATACAGATAAAACATTGCTTATCAAGGACAGA
AGTAATGGCCAGAAACTGGGTGCCGTTGGTCTTGCTTTGCATTATGCGAACATAATCAGCCAGATAAATCTCATTGCGTGCCGTCCAACTTCCATTCCTTCAAACATGAG
GGATGCATTATACCGGGCATTGCCTACAAGCATTAAAATAGCTCTGCGTTCTCGATTGCGGGCTGTGGATGCAAGAGAGGAGAGCGCATCAATCTTGTGGCAGAATTGGA
GAATGGGCAACCCAAAGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAACACAATGGGAGGGGTTTGTTCTAATGGAATAGTTAAGGATGATTTTGTATCTGAGAAGATAATTCAAGCATCTGAGGATCGGAAAGGGAATTCGTATTTGAA
TTCTGAGGCTAGGGATCCGAATGAAATGCCAGAGAAGTCTCGTTCTGATGTGATACTATTGCCTTCACCTCCGTCTAAGACTGGAAGCAATAAGGTTGCACCAATGAACG
CACAAGCAGGGGCTCGCGGGAGGGCAGTTGATTTATGGAAAACAATTGGGATCAGTGTGTCAAATTTCCATATCAACAGTGGGGTTTCTACAGGCATGGCGCCTAGTGGT
AGGGAGATTTCTATATTGGCTTTTGAAGTAGCAAACACAATAAGCAAAGTAGCAAATTTGTCAAAATCTCTCTCAGAAGAAAATATTCAGCTCCTTAAAAACGAACTATT
ACAATCAGAAGCGATAAAACAATTGATCTCAGCAAGTTTAGAGGAATTGCTTAGCATTGCAGCTGCTGACAAAAGGCAGGAATTTGGCGTCATCTTACGGGAGATAATAC
GATTTGGAAATCGGTGCAAGGATTCACAGTGGCATAATCTGGATCAATACTTTTCAAGACTAGATTCGAATGATTCAAGTCAAAAACAAGCTCGAGAGGCCAGAGCAGCC
TTACAGGAACTAACCGTTTTAGCTCAGAATACTTCTGAATTATACCACGAATTACAAGCATTGGAAAGACTTGAGCAAGATTATAGACGGAGAGTTGAGGAAGTGGAGTT
CTTGAACCAAGCAGGAATAGGAGAAACTCTCTCAATCTTCCAAGGAGAACTAAACGTACAAAGAAAACTTGTAAGGAGTTTCCAAAGCAAGTGTCTTTGGTCCAGAAATC
TAGATGAGATTGTGGAAAAGCTCGTTATCGTTGTAACATGGATAAATCAAACAATAATCAAAGAATTTGGAGTTGACAATACAGATAAAACATTGCTTATCAAGGACAGA
AGTAATGGCCAGAAACTGGGTGCCGTTGGTCTTGCTTTGCATTATGCGAACATAATCAGCCAGATAAATCTCATTGCGTGCCGTCCAACTTCCATTCCTTCAAACATGAG
GGATGCATTATACCGGGCATTGCCTACAAGCATTAAAATAGCTCTGCGTTCTCGATTGCGGGCTGTGGATGCAAGAGAGGAGAGCGCATCAATCTTGTGGCAGAATTGGA
GAATGGGCAACCCAAAGTAAGGAACACAGCAAAGGCAGAGCTACACAAAACAACAATGCAAACCGCCTTCAAACGCTCTACTATGCAGACAAAGTAAAAACAGAGTTACA
AATTCTTGAATTAGTCACATTGCTTCACCATCTCATCCATTTAGCAAAACACCAACAACGACGCTCCTCATCTCTCCGTTGCCGATCACCAACTCCCAAGGACATGGCAA
ATACTTCTCGTCGTATCCAATTCAAGAGCCAAATTATCAGAACCACCAAGGATGGATTTCCGACCGACAATATACCATCGCCCGGCCAAACTCCGATCAGAAAAAAGGTG
CTCGGTAACAAAAAAGGAATGGAATCTTACAAAAATGAGAATAAAGGAATTTGGACGTTAAGTAAAGCAGTTTCGGTTTCAACCTTGAGGTCTCTTGGTAGAGTCTAGAT
TTTGAGTTGAAGAAGGATGAAGCTTTGGAATTGGTTTAATGTGGATAATAACAATTTTGAGTACCTCACACGTTTATACCAAAAATAAGTACAGGGAATTTTTTAAAGAT
TTTTTGACCGTGTCTTCTAGTTGAATTCCTTTGTTTATATTACTCTTTTGGGTGAGGTTTTGTTCCGAATTCCCCACTTGTTTTGTATATTAATTGCACCCCTCCAGAAT
AATGGTCAATGTTCTTAGTTACATGTGCTAGACGATTTGCATAAAATTGAAGTTTTTTTTAAAATGAATAAAAATATTTTTCTCTATAGAAAGAGAGAATAAATGATCGC
ACAAGATTTTGAAGTAATAACTTGAAAACGAATAATTGGCTTTGGAGGTTATCTGTTTTCAAAAATGGTTAAGATGAACAAGCTTGGCTTTACTTAAAATCCAAAAGACA
AAAGGATATTCAAAACAGAAAAGAAAAAAAACAGTATTATACATATTTTTTTAAAAAATCATTAAATGTTAGAAGTAAAAAAAAAAAATTAGTATATTAAAAGAAAAGGA
AACAAAATAAGTTGTGCTGGTCCTTCAGTAGGGATCAGATTGGAAGAGATGGGAGTGCCCAATTGTCGTAACCAGGCGTTGGGGCGGACACCCAACTCGCATTATATTAG
CGAGAGCCAACCACTCATCAT
Protein sequenceShow/hide protein sequence
MGNTMGGVCSNGIVKDDFVSEKIIQASEDRKGNSYLNSEARDPNEMPEKSRSDVILLPSPPSKTGSNKVAPMNAQAGARGRAVDLWKTIGISVSNFHINSGVSTGMAPSG
REISILAFEVANTISKVANLSKSLSEENIQLLKNELLQSEAIKQLISASLEELLSIAAADKRQEFGVILREIIRFGNRCKDSQWHNLDQYFSRLDSNDSSQKQAREARAA
LQELTVLAQNTSELYHELQALERLEQDYRRRVEEVEFLNQAGIGETLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIIKEFGVDNTDKTLLIKDR
SNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREESASILWQNWRMGNPK