; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0006659 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0006659
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionProtein of unknown function (DUF668)
Genome locationchr03:25089993..25094264
RNA-Seq ExpressionIVF0006659
SyntenyIVF0006659
Gene Ontology termsGO:0045927 - positive regulation of growth (biological process)
InterPro domainsIPR007700 - Domain of unknown function DUF668
IPR021864 - Domain of unknown function DUF3475
IPR045021 - Protein PSK SIMULATOR


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0065159.1 uncharacterized protein E6C27_scaffold82G004950 [Cucumis melo var. makuwa]4.01e-25998.21Show/hide
Query:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT
        MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT
Subjt:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT

Query:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL
        AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDS+WHNLDQYFSRL
Subjt:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI
        DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIG   S+  QGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI
Subjt:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI

Query:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
Subjt:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

XP_004152648.2 uncharacterized protein LOC101204577 isoform X3 [Cucumis sativus]5.53e-24392.07Show/hide
Query:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT
        MGGVCSNGIVKDDFVSEK+ QASEDRKGNS+LN EA DPNEMPEKS S VILLPSPPSK GSNKVAPMNAQAGARGRAVDLWKTIGISVSN HIN+G  T
Subjt:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT

Query:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL
         M PSGREISILAFEVANTISKVANLSKSLSEEN+QLLK ELLQSE IKQLIS SLEELLSIAAADKRQEFGVILRE+IRFGN+CKDSQWHNLDQYFSRL
Subjt:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI
        DSNDSSQKQAREARAA+QELTVLAQNTSELYHELQALERLEQDYRRRVEEVE LNQAGIG   S+  QGELNVQR+LVRSFQSKCLWSRNLDEIVEKLVI
Subjt:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI

Query:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        VVTWINQTI+KEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIA RPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
Subjt:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

XP_011649665.1 uncharacterized protein LOC101204577 isoform X1 [Cucumis sativus]3.55e-24192.07Show/hide
Query:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT
        MGGVCSNGIVKDDFVSEK+ QASEDRKGNS+LN EA DPNEMPEKS S VILLPSPPSK GSNKVAPMNAQAGARGRAVDLWKTIGISVSN HIN+G  T
Subjt:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT

Query:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL
         M PSGREISILAFEVANTISKVANLSKSLSEEN+QLLK ELLQSE IKQLIS SLEELLSIAAADKRQEFGVILRE+IRFGN+CKDSQWHNLDQYFSRL
Subjt:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI
        DSNDSSQKQAREARAA+QELTVLAQNTSELYHELQALERLEQDYRRRVEEVE LNQAGIG   S+  QGELNVQR+LVRSFQSKCLWSRNLDEIVEKLVI
Subjt:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI

Query:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        VVTWINQTI+KEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIA RPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
Subjt:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

XP_031736339.1 uncharacterized protein LOC101204577 isoform X2 [Cucumis sativus]1.35e-24192.07Show/hide
Query:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT
        MGGVCSNGIVKDDFVSEK+ QASEDRKGNS+LN EA DPNEMPEKS S VILLPSPPSK GSNKVAPMNAQAGARGRAVDLWKTIGISVSN HIN+G  T
Subjt:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT

Query:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL
         M PSGREISILAFEVANTISKVANLSKSLSEEN+QLLK ELLQSE IKQLIS SLEELLSIAAADKRQEFGVILRE+IRFGN+CKDSQWHNLDQYFSRL
Subjt:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI
        DSNDSSQKQAREARAA+QELTVLAQNTSELYHELQALERLEQDYRRRVEEVE LNQAGIG   S+  QGELNVQR+LVRSFQSKCLWSRNLDEIVEKLVI
Subjt:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI

Query:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        VVTWINQTI+KEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIA RPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
Subjt:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

XP_031736340.1 uncharacterized protein LOC101204577 isoform X4 [Cucumis sativus]8.72e-24492.07Show/hide
Query:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT
        MGGVCSNGIVKDDFVSEK+ QASEDRKGNS+LN EA DPNEMPEKS S VILLPSPPSK GSNKVAPMNAQAGARGRAVDLWKTIGISVSN HIN+G  T
Subjt:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT

Query:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL
         M PSGREISILAFEVANTISKVANLSKSLSEEN+QLLK ELLQSE IKQLIS SLEELLSIAAADKRQEFGVILRE+IRFGN+CKDSQWHNLDQYFSRL
Subjt:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI
        DSNDSSQKQAREARAA+QELTVLAQNTSELYHELQALERLEQDYRRRVEEVE LNQAGIG   S+  QGELNVQR+LVRSFQSKCLWSRNLDEIVEKLVI
Subjt:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI

Query:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        VVTWINQTI+KEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIA RPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
Subjt:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

TrEMBL top hitse value%identityAlignment
A0A0A0LLM8 Uncharacterized protein3.6e-19092.07Show/hide
Query:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT
        MGGVCSNGIVKDDFVSEK+ QASEDRKGNS+LN EA DPNEMPEKS S VILLPSPPSK GSNKVAPMNAQAGARGRAVDLWKTIGISVSN HIN+G  T
Subjt:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT

Query:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL
         M PSGREISILAFEVANTISKVANLSKSLSEEN+QLLK ELLQSE IKQLIS SLEELLSIAAADKRQEFGVILRE+IRFGN+CKDSQWHNLDQYFSRL
Subjt:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI
        DSNDSSQKQAREARAA+QELTVLAQNTSELYHELQALERLEQDYRRRVEEVE LNQAGIG   S+  QGELNVQR+LVRSFQSKCLWSRNLDEIVEKLVI
Subjt:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI

Query:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        VVTWINQTI+KEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIA RPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
Subjt:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

A0A5A7VI62 Uncharacterized protein7.6e-20498.21Show/hide
Query:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT
        MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT
Subjt:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT

Query:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL
        AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDS+WHNLDQYFSRL
Subjt:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI
        DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIG   S+  QGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI
Subjt:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI

Query:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
Subjt:  VVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

A0A6J1HF33 uncharacterized protein LOC111462458 isoform X17.2e-16279.25Show/hide
Query:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT
        MGGVCSNGI KD F SEK+TQ SEDRKGNS L+ EA DPNEMP++S SGV LL SPPSK GSNKVAP+N+QAG+RGRA+DL KTIG SVSNLH+N+GFFT
Subjt:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT

Query:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL
         M  +GREISILAFEVANTISKVANLS+SLSEEN+QLLK+ELLQSEGIKQL+S S EELLSIAAADKRQEF V+L EVIRFG QCKD QWHNLDQYFSRL
Subjt:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI
        D NDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALER EQDYRR+V+EVE +NQAG G   S+  QGELNVQR+LVRSFQSKCLWSR+LDEIVEKLVI
Subjt:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI

Query:  VVTWINQTIVKEFGVDN---------TDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        VVTWINQTI K F   N         TDKTL I+DRSNGQKLG+VGLALHYA IISQINLIA RPTSIPSNMRDALYRALPTS+KI LRSRLR V+  EE
Subjt:  VVTWINQTIVKEFGVDN---------TDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

A0A6J1K3J2 uncharacterized protein LOC111492046 isoform X18.4e-16379.75Show/hide
Query:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT
        MGGVCSNGI KD F SEK+TQ SEDR GNS LN EA D NEMP++S SGV LLPSPPSKAGSNKVAP+N+QAG+RGRA+DL KTIG SVSNLH+N GFFT
Subjt:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT

Query:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL
         M  +G EISILAFEVANTISKV NLS+SLSEEN+QLLK+ELLQSEGIKQL+S S EELLSIAAADKRQEF V+LREVIRFG QCKD QWHNLDQYFSRL
Subjt:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI
        D NDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALER EQDYRR+V+EVE LNQ GIG   S+  QGELNVQR+LVRSFQSKCLWSR+LDEIVEKLVI
Subjt:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI

Query:  VVTWINQTIVKEFGVDNT---------DKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        VVTWINQTI K FG  NT         DKTL I+DRS GQKLG+VGLALHYANIISQINLIA RP SIPSNMRDALYRALPTS+KI LRSRLR VD  EE
Subjt:  VVTWINQTIVKEFGVDNT---------DKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

A0A6J1KCI8 uncharacterized protein LOC111492046 isoform X28.4e-16379.75Show/hide
Query:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT
        MGGVCSNGI KD F SEK+TQ SEDR GNS LN EA D NEMP++S SGV LLPSPPSKAGSNKVAP+N+QAG+RGRA+DL KTIG SVSNLH+N GFFT
Subjt:  MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFT

Query:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL
         M  +G EISILAFEVANTISKV NLS+SLSEEN+QLLK+ELLQSEGIKQL+S S EELLSIAAADKRQEF V+LREVIRFG QCKD QWHNLDQYFSRL
Subjt:  AMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRL

Query:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI
        D NDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALER EQDYRR+V+EVE LNQ GIG   S+  QGELNVQR+LVRSFQSKCLWSR+LDEIVEKLVI
Subjt:  DSNDSSQKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVI

Query:  VVTWINQTIVKEFGVDNT---------DKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE
        VVTWINQTI K FG  NT         DKTL I+DRS GQKLG+VGLALHYANIISQINLIA RP SIPSNMRDALYRALPTS+KI LRSRLR VD  EE
Subjt:  VVTWINQTIVKEFGVDNT---------DKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREE

SwissProt top hitse value%identityAlignment
P0DO24 Protein PSK SIMULATOR 32.1e-6245.77Show/hide
Query:  GRAVDLWKTIGISVSNLHINNGFFTAMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVIL
        GRA D+  T+G S+++L  + GF + +   G E+ ILAFEVANTI K +NL +SLS+ N++ LK  +L SEG++ L+S   +ELL + AADKRQE  V  
Subjt:  GRAVDLWKTIGISVSNLHINNGFFTAMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVIL

Query:  REVIRFGNQCKDSQWHNLDQYFSRLDSNDSSQKQARE-ARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLN-LQGELNV
         EV+RFGN+ KD QWHNL +YF R+    + Q+Q +E A   V +L VL Q T+ELY ELQ L RLE+DY ++  E E  N A    G+ L  L+ EL  
Subjt:  REVIRFGNQCKDSQWHNLDQYFSRLDSNDSSQKQARE-ARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLN-LQGELNV

Query:  QRRLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPT
        QR++V+S + K LWSR  +E++EKLV +V ++   I   FG    D+          ++LG  GLALHYANII QI+ +  R +SI SN RD+LY++LP 
Subjt:  QRRLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPT

Query:  SIKIALRSRLRAVDAREEV
         IK+ALRS++++ +  +E+
Subjt:  SIKIALRSRLRAVDAREEV

Q9SA91 Protein PSK SIMULATOR 23.9e-8045.35Show/hide
Query:  MGGVCSNGIVKDDFVSEKVTQASEDR------KGNSHLNFEASDPNEMPEKSGS--------------GVILLPSPPSKAGSNKVAPMNAQAGARG----
        MGGVCS  + KDD   +K+    +D+      K  S    + SD        GS                 L P PP +  S K    N+  G  G    
Subjt:  MGGVCSNGIVKDDFVSEKVTQASEDR------KGNSHLNFEASDPNEMPEKSGS--------------GVILLPSPPSKAGSNKVAPMNAQAGARG----

Query:  -RAVDLWKTIGISVSNLHINNGFFTAMTPS-GREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVI
         +AV++  T+G S++ ++ +N + + +T S G +++ILAFEVANTI+K A L +SLSEEN++ +KK++L SE +K+L+S    EL  +AA+DKR+E  + 
Subjt:  -RAVDLWKTIGISVSNLHINNGFFTAMTPS-GREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVI

Query:  LREVIRFGNQCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNV
          EVIRFGN CKD QWHNLD+YF +LD+ +S  K  + +A A +QEL  LA+ TSELYHELQAL+R EQDYRR++ EVE LN    G G  + LQ EL  
Subjt:  LREVIRFGNQCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNV

Query:  QRRLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPT
        Q++LV+S Q K LWS+NL EI+EKLV VV++I QTIV+ FG +        +     ++LG  GL+LHYAN+I QI+ IA RP+S+PSN+RD LY ALP 
Subjt:  QRRLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPT

Query:  SIKIALRSRLRAVDAREEV
        ++K ALR RL+ +D  EE+
Subjt:  SIKIALRSRLRAVDAREEV

Q9XID5 Protein PSK SIMULATOR 11.8e-6942.5Show/hide
Query:  VKDDFVSEKVTQASEDRKGNSH-LNFEASDPNEMPEKSGSGVILLPSPPSK------------AGSNKVAPMNAQAGAR--GRAVDLWKTIGISVSNLHI
        +KDD     VT+  +D K  S   +F        P+    G+  L    S+            A  ++V+ +  +AG    G+AVD+  T+G S++NL++
Subjt:  VKDDFVSEKVTQASEDRKGNSH-LNFEASDPNEMPEKSGSGVILLPSPPSK------------AGSNKVAPMNAQAGAR--GRAVDLWKTIGISVSNLHI

Query:  NNGFFTAMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLD
        + GF +A T  G +ISIL+FEVANTI K ANL  SLS++++  LK+ +L SEG++ LIS  ++ELL IAAADKR+E  +   EV+RFGN+CKD Q+HNLD
Subjt:  NNGFFTAMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLD

Query:  QYFSRLDSNDSSQKQAR-EARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLN-LQGELNVQRRLVRSFQSKCLWSRNLD
        ++F RL S  + QK  + EA   + ++      T++LYHEL AL+R EQDY+R+++E E  + A  G G++L  L+ EL  Q++ VR+ + K LWSR L+
Subjt:  QYFSRLDSNDSSQKQAR-EARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLN-LQGELNVQRRLVRSFQSKCLWSRNLD

Query:  EIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEV
        E++EKLV VV +++  I + FG  + DK        N +KLG+ GLALHYANII+QI+ +  R +++P++ RDALY+ LP SIK ALRSR+++   +EE+
Subjt:  EIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEV

Arabidopsis top hitse value%identityAlignment
AT1G30755.1 Protein of unknown function (DUF668)2.7e-8145.35Show/hide
Query:  MGGVCSNGIVKDDFVSEKVTQASEDR------KGNSHLNFEASDPNEMPEKSGS--------------GVILLPSPPSKAGSNKVAPMNAQAGARG----
        MGGVCS  + KDD   +K+    +D+      K  S    + SD        GS                 L P PP +  S K    N+  G  G    
Subjt:  MGGVCSNGIVKDDFVSEKVTQASEDR------KGNSHLNFEASDPNEMPEKSGS--------------GVILLPSPPSKAGSNKVAPMNAQAGARG----

Query:  -RAVDLWKTIGISVSNLHINNGFFTAMTPS-GREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVI
         +AV++  T+G S++ ++ +N + + +T S G +++ILAFEVANTI+K A L +SLSEEN++ +KK++L SE +K+L+S    EL  +AA+DKR+E  + 
Subjt:  -RAVDLWKTIGISVSNLHINNGFFTAMTPS-GREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVI

Query:  LREVIRFGNQCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNV
          EVIRFGN CKD QWHNLD+YF +LD+ +S  K  + +A A +QEL  LA+ TSELYHELQAL+R EQDYRR++ EVE LN    G G  + LQ EL  
Subjt:  LREVIRFGNQCKDSQWHNLDQYFSRLDSNDSSQKQAR-EARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNV

Query:  QRRLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPT
        Q++LV+S Q K LWS+NL EI+EKLV VV++I QTIV+ FG +        +     ++LG  GL+LHYAN+I QI+ IA RP+S+PSN+RD LY ALP 
Subjt:  QRRLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPT

Query:  SIKIALRSRLRAVDAREEV
        ++K ALR RL+ +D  EE+
Subjt:  SIKIALRSRLRAVDAREEV

AT1G34320.1 Protein of unknown function (DUF668)1.3e-7042.5Show/hide
Query:  VKDDFVSEKVTQASEDRKGNSH-LNFEASDPNEMPEKSGSGVILLPSPPSK------------AGSNKVAPMNAQAGAR--GRAVDLWKTIGISVSNLHI
        +KDD     VT+  +D K  S   +F        P+    G+  L    S+            A  ++V+ +  +AG    G+AVD+  T+G S++NL++
Subjt:  VKDDFVSEKVTQASEDRKGNSH-LNFEASDPNEMPEKSGSGVILLPSPPSK------------AGSNKVAPMNAQAGAR--GRAVDLWKTIGISVSNLHI

Query:  NNGFFTAMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLD
        + GF +A T  G +ISIL+FEVANTI K ANL  SLS++++  LK+ +L SEG++ LIS  ++ELL IAAADKR+E  +   EV+RFGN+CKD Q+HNLD
Subjt:  NNGFFTAMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLD

Query:  QYFSRLDSNDSSQKQAR-EARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLN-LQGELNVQRRLVRSFQSKCLWSRNLD
        ++F RL S  + QK  + EA   + ++      T++LYHEL AL+R EQDY+R+++E E  + A  G G++L  L+ EL  Q++ VR+ + K LWSR L+
Subjt:  QYFSRLDSNDSSQKQAR-EARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLN-LQGELNVQRRLVRSFQSKCLWSRNLD

Query:  EIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEV
        E++EKLV VV +++  I + FG  + DK        N +KLG+ GLALHYANII+QI+ +  R +++P++ RDALY+ LP SIK ALRSR+++   +EE+
Subjt:  EIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEV

AT3G23160.1 Protein of unknown function (DUF668)2.5e-1325.11Show/hide
Query:  ISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRLDSNDSSQK
        I IL+FEVAN +SK  +L +SLS+  +  LK E+  SEG+++L+S     LL ++ ++K  +   +   V R G +C +      +  +  + +     +
Subjt:  ISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRLDSNDSSQK

Query:  Q----AREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVIVVTW
        +     ++  + V+++      T  LY E++ +  LEQ        +  L ++     +    + +L  QR+ V+S +   LW++  D++VE L   V  
Subjt:  Q----AREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVIVVTW

Query:  IN---QTIVKEFGVDNTDKTLLIKDRSNGQKLGAV
        I    +T+    G+       L +DRS  +   AV
Subjt:  IN---QTIVKEFGVDNTDKTLLIKDRSNGQKLGAV

AT5G08660.1 Protein of unknown function (DUF668)1.5e-6345.77Show/hide
Query:  GRAVDLWKTIGISVSNLHINNGFFTAMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVIL
        GRA D+  T+G S+++L  + GF + +   G E+ ILAFEVANTI K +NL +SLS+ N++ LK  +L SEG++ L+S   +ELL + AADKRQE  V  
Subjt:  GRAVDLWKTIGISVSNLHINNGFFTAMTPSGREISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVIL

Query:  REVIRFGNQCKDSQWHNLDQYFSRLDSNDSSQKQARE-ARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLN-LQGELNV
         EV+RFGN+ KD QWHNL +YF R+    + Q+Q +E A   V +L VL Q T+ELY ELQ L RLE+DY ++  E E  N A    G+ L  L+ EL  
Subjt:  REVIRFGNQCKDSQWHNLDQYFSRLDSNDSSQKQARE-ARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLN-LQGELNV

Query:  QRRLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPT
        QR++V+S + K LWSR  +E++EKLV +V ++   I   FG    D+          ++LG  GLALHYANII QI+ +  R +SI SN RD+LY++LP 
Subjt:  QRRLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNGQKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPT

Query:  SIKIALRSRLRAVDAREEV
         IK+ALRS++++ +  +E+
Subjt:  SIKIALRSRLRAVDAREEV

AT5G51670.1 Protein of unknown function (DUF668)3.5e-1221.39Show/hide
Query:  ISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDS---QWHNLDQYFSRLDSNDS
        + +L+FEVA  ++K+ +L+ SL++ N+   +   L  EG+ ++++      LS+  A+           V R  N+C  +    +H L   F+ +  +  
Subjt:  ISILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDS---QWHNLDQYFSRLDSNDS

Query:  S-QKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNS------LNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKL
              ++  A  +++      T+ LY E++ +  LE   R++  ++ +  +    + N       ++LQ ++  Q++ V+  + + LW+++ D +V  L
Subjt:  S-QKQAREARAAVQELTVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNS------LNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKL

Query:  V-------------------------IVVTWINQTIVKEFGVDN--------------TDKTLLIKDRSNGQK-----LGAVGLALHYANIISQINLIAL
                                   VV+ + +++       N              T  +  +++ S   K     LG  G+ALHYAN+I  +  +  
Subjt:  V-------------------------IVVTWINQTIVKEFGVDN--------------TDKTLLIKDRSNGQK-----LGAVGLALHYANIISQINLIAL

Query:  RPTSIPSNMRDALYRALPTSIKIALRSRLRAV
        +P  +  + RD LY  LP S++ +LRSRL+ V
Subjt:  RPTSIPSNMRDALYRALPTSIKIALRSRLRAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGGGTTTGTTCTAATGGAATAGTTAAGGATGATTTTGTATCTGAGAAGGTAACTCAAGCATCTGAGGATCGGAAAGGGAATTCGCATTTGAATTTTGAGGCAAG
TGATCCGAATGAAATGCCGGAGAAGTCTGGTTCTGGTGTGATACTATTGCCCTCACCTCCGTCTAAGGCTGGAAGCAATAAGGTTGCACCAATGAACGCACAAGCAGGGG
CTCGCGGGAGGGCAGTTGATTTATGGAAAACGATTGGGATCAGTGTTTCAAATTTGCATATCAACAATGGGTTTTTTACAGCCATGACGCCTAGTGGTAGGGAGATTTCT
ATATTGGCTTTTGAAGTAGCAAACACAATAAGCAAAGTAGCAAATTTGTCAAAATCTCTCTCAGAAGAAAATATGCAGCTCCTTAAAAAGGAACTATTACAATCAGAAGG
GATAAAACAATTGATCTCACCAAGTTTAGAGGAATTGCTTAGCATTGCAGCTGCTGACAAAAGGCAGGAATTTGGGGTCATCTTACGAGAAGTAATACGATTTGGCAATC
AGTGCAAGGATTCACAGTGGCATAATCTGGATCAGTACTTTTCAAGACTAGATTCGAATGATTCAAGTCAAAAACAAGCTCGAGAGGCCAGAGCAGCCGTGCAGGAACTA
ACCGTCTTAGCTCAGAATACTTCTGAATTATACCACGAATTACAAGCATTGGAAAGACTTGAGCAAGATTATAGACGGAGAGTTGAGGAAGTGGAGTTGTTGAACCAAGC
AGGAATAGGTTGGGGAAACTCTCTCAATCTTCAAGGAGAACTGAACGTACAAAGAAGACTTGTAAGGAGTTTCCAAAGCAAGTGCCTCTGGTCCAGAAATCTGGATGAGA
TTGTGGAAAAGCTCGTTATTGTTGTAACATGGATAAACCAAACAATAGTCAAAGAATTCGGAGTTGACAACACAGATAAAACATTGCTTATCAAGGACAGAAGTAATGGC
CAGAAACTGGGTGCCGTTGGTCTTGCCTTGCATTATGCAAACATAATTAGCCAGATAAATCTCATTGCGTTACGTCCAACCTCCATTCCTTCAAACATGAGGGATGCATT
ATACCGGGCATTGCCTACAAGCATTAAAATAGCTCTGCGTTCACGATTGCGGGCTGTGGATGCAAGAGAGGAGGTATTACCTTGTTTCAATCTTGATGCAGCCTTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGGGTTTGTTCTAATGGAATAGTTAAGGATGATTTTGTATCTGAGAAGGTAACTCAAGCATCTGAGGATCGGAAAGGGAATTCGCATTTGAATTTTGAGGCAAG
TGATCCGAATGAAATGCCGGAGAAGTCTGGTTCTGGTGTGATACTATTGCCCTCACCTCCGTCTAAGGCTGGAAGCAATAAGGTTGCACCAATGAACGCACAAGCAGGGG
CTCGCGGGAGGGCAGTTGATTTATGGAAAACGATTGGGATCAGTGTTTCAAATTTGCATATCAACAATGGGTTTTTTACAGCCATGACGCCTAGTGGTAGGGAGATTTCT
ATATTGGCTTTTGAAGTAGCAAACACAATAAGCAAAGTAGCAAATTTGTCAAAATCTCTCTCAGAAGAAAATATGCAGCTCCTTAAAAAGGAACTATTACAATCAGAAGG
GATAAAACAATTGATCTCACCAAGTTTAGAGGAATTGCTTAGCATTGCAGCTGCTGACAAAAGGCAGGAATTTGGGGTCATCTTACGAGAAGTAATACGATTTGGCAATC
AGTGCAAGGATTCACAGTGGCATAATCTGGATCAGTACTTTTCAAGACTAGATTCGAATGATTCAAGTCAAAAACAAGCTCGAGAGGCCAGAGCAGCCGTGCAGGAACTA
ACCGTCTTAGCTCAGAATACTTCTGAATTATACCACGAATTACAAGCATTGGAAAGACTTGAGCAAGATTATAGACGGAGAGTTGAGGAAGTGGAGTTGTTGAACCAAGC
AGGAATAGGTTGGGGAAACTCTCTCAATCTTCAAGGAGAACTGAACGTACAAAGAAGACTTGTAAGGAGTTTCCAAAGCAAGTGCCTCTGGTCCAGAAATCTGGATGAGA
TTGTGGAAAAGCTCGTTATTGTTGTAACATGGATAAACCAAACAATAGTCAAAGAATTCGGAGTTGACAACACAGATAAAACATTGCTTATCAAGGACAGAAGTAATGGC
CAGAAACTGGGTGCCGTTGGTCTTGCCTTGCATTATGCAAACATAATTAGCCAGATAAATCTCATTGCGTTACGTCCAACCTCCATTCCTTCAAACATGAGGGATGCATT
ATACCGGGCATTGCCTACAAGCATTAAAATAGCTCTGCGTTCACGATTGCGGGCTGTGGATGCAAGAGAGGAGGTATTACCTTGTTTCAATCTTGATGCAGCCTTGTAAC
TCCTACTCACTCATTACTTAGTCATGAACAAAATGGTTTTTCTGCAGCCAACTTATTATGACGTCAAAGCTGAAATGGATAAGATCCTTGAATGGCTTGTTCCGATTGCT
GCAAACACGAGCAAGTAGGAGAACTTTTCCCTTATCCACTTAAAATAACTTTTTATAACGATAAATAAGTACTCTTCACCAATTATTTTTCCAAATTTCTATGCGTTTTA
TGAGCCTTTTCCCGAAAAGTGCCCAATTTTGCTCTGGAGTTGTTTTAAAATTCTAAAATCTCAATCACCGTCAACTATAAAAACAAACACCATCAAATTTGAAAATGGCA
TTATTACAAAAAGTCGTCTTCTTTCTCCCAATCCATTTCTTTGCTTTCCATTGCACGTTCCTAATTTCACCGCGATCAATGACAGAGCGCATCAATCTTGTGGCAGAATT
GGAGAATGGGCAACCCAAAGGTTCGTTTCATGATTCTTCTTTCACAGAAATAGAACAAAGTTCCAACATGAAGCAACGAATTAAAATGATTTGTCCATTGTTTTTTGTGT
AGTAAGGAACTCAGCAAAGGCAGAGCTACACAAAGCAACAATCCAAACCGCCTCCAAACGCTCTACCATGCAGACAAAGTAAAAACAGAGAAACAAATCCTTGAATTGGT
CATATTGCTTCACCATCTCATCCATCTAGCCAAACACCAACAACGGCGCTCCTCATCTCTCCGTTGCCGATCACCAACTCCCAAGGACATGGCAAATAGTTCTCGTCGTA
TCCAATTCAAGAGCCAAATTATCAGAACCACCAGGATGGATTTCCGACAGACAACATACCACCGGCCAGCCAATCTCCGATCAGAAAAAGGGTGCTGGGTAACAAAAAAG
GTATGGAATCTTACAAAAATGAGAATAAAGGAATTTGGACGTTAAGTAAAGCGGTTTCGTTTCAACCTTGAGGTCTATTGGTAGAGTCTAGATTTTGAGTTCAAGAAGGA
TGAAGCTTGGAATTGGTTTAATGTCGATAAAAACAATTTTGAGTTCCTCACACGTTTATACCACAATGAAAATAAGTACAGGGATTTTTAAAAAAGTTTTTGACCGTGTC
TTCTAGTTGAATTCCTTTATGTAGATACTTTTTTGGTGAGTTTTGTTCCAAATCCCCCTTGTTTTGTATATCAATCACACCGCTACAGAATAATGGTCATTGTTCCTAGT
TAAATGTCATACGTTCGCATTTATGCGATAAATTTGCATGAATTGATAATTTTTTTAAAAACGAATCAAAATATTATTCTGCACGTATAGAAAGAGAGCACGGAGATTTC
TAAGTATAGCTCGAATTAAATGGGGCAGGACGAAGGAAAATTCTAAACAAGAAACTTAATTTAAATTTTACCTTTCAGTAATCATCTAATTAATCTATTAACGGTTTTCA
TACCCAAAAACAGAAAAGTAGTCTAGATGATCCGACCGATTAAAACAACTTCCATTTTAACATCAAGCCGAGCACCAACATTTCAAGTAACAATAGTAATAAACCCCAAA
AGGAGCGTACTGGAGAATAGACGCACAACATAAAAGGCTTTCTGTTTAAAATCCAAAAAACGAAGGACATTCGAAGCTGAAAGAGATTTTAATGTGGAGAGGAAAAAATA
CTTGCACTAACACAAGCACATAAATTTTTAAAATAAAACATTATAAATTATTAGAATTAAAAAAATATTAGTAGATTGAAGAAAAATAAACAAAATAGTTTATTGGTCCT
TCAGTAGGGATCAGATTGGAAGAGATGTGAGTGCCCAATTGTCGTAACCAGGCGTTGGGGCAGAACCAACTCGCATTATAATAGCGAGAGCCAACCACTCATCATATCCC
G
Protein sequenceShow/hide protein sequence
MGGVCSNGIVKDDFVSEKVTQASEDRKGNSHLNFEASDPNEMPEKSGSGVILLPSPPSKAGSNKVAPMNAQAGARGRAVDLWKTIGISVSNLHINNGFFTAMTPSGREIS
ILAFEVANTISKVANLSKSLSEENMQLLKKELLQSEGIKQLISPSLEELLSIAAADKRQEFGVILREVIRFGNQCKDSQWHNLDQYFSRLDSNDSSQKQAREARAAVQEL
TVLAQNTSELYHELQALERLEQDYRRRVEEVELLNQAGIGWGNSLNLQGELNVQRRLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIVKEFGVDNTDKTLLIKDRSNG
QKLGAVGLALHYANIISQINLIALRPTSIPSNMRDALYRALPTSIKIALRSRLRAVDAREEVLPCFNLDAAL