CuGenDBv2

CmoCh05G003130 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh05G003130
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Protein of unknown function (DUF668)
Genome location: Cmo_Chr05:1386253..1389695
RNA-Seq Expression: CmoCh05G003130
Synteny: CmoCh05G003130
Gene Ontology terms: GO:0045927 - positive regulation of growth (biological process)
InterPro domains: IPR007700 - Domain of unknown function DUF668
                  IPR021864 - Domain of unknown function DUF3475
                  IPR045021 - Protein PSK SIMULATOR
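
The genome location string above can be split into chromosome, start and end programmatically. The following is a minimal sketch, assuming the `chrom:start..end` layout seen in this record and assuming 1-based, inclusive coordinates (the page does not state the coordinate convention); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'chrom:start..end' into (chrom, start, end).

    Coordinates are assumed to be 1-based and inclusive.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


chrom, start, end = parse_location("Cmo_Chr05:1386253..1389695")
span = end - start + 1  # length of the gene region under the inclusive assumption
print(chrom, start, end, span)
```

Under these assumptions the gene region spans 3443 bp, which is longer than the coding sequence listed further down, as expected if the region contains introns or untranslated sequence.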


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6598505.1 Protein PSK SIMULATOR 2, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.3e-178, % identity: 82.35)
Query:  MGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNSGFFT
        MGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMN GFFT
Subjt:  MGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNSGFFT

Query:  GMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRL
         MASNGREISILAFEVANTISK+ANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLL EVIRFGKQCKDPQWHNLDQYFSRL
Subjt:  GMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRL

Query:  DLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEKLVIV
        DLNDSSKKQAREARAAIQELAVLAQSTS                                                                IVEKLVIV
Subjt:  DLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEKLVIV

Query:  VTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEEP
        VTWINQTIAK FDDHNTGACNG VITDKTLPIEDRSNGQKLGSVGLALHYA IISQINLIACRPTSIPSNMRDALYRALP SVKIGLRSRLR V++SEEP
Subjt:  VTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEEP

Query:  TYIGVKAEMDKILEWLVPIAANTSK
        TYIGVKAEMDKILEWLVPIAANTSK
Subjt:  TYIGVKAEMDKILEWLVPIAANTSK

KAG7029441.1 hypothetical protein SDJN02_07780, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 3.8e-181, % identity: 82.52)
Query:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
        MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMN 
Subjt:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS

Query:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
        GFFT MASNGREISILAFEVANTISK+ANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLL EVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY

Query:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK
        FSRLDLNDSSKKQAREARAAIQELAVLAQSTS                                                                IVEK
Subjt:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK

Query:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
        LVIVVTWINQTIAK FDDHNTGACNG VITDKTLPIEDRSNGQKLGSVGLALHYA IISQINLIACRPTSIPSNMRDALYRALP SVKIGLRSRLR V++
Subjt:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL

Query:  SEEPTYIGVKAEMDKILEWLVPIAANTSK
        SEEPTYIGVKAEMDKILEWLVPIAANTSK
Subjt:  SEEPTYIGVKAEMDKILEWLVPIAANTSK

XP_022961804.1 uncharacterized protein LOC111462458 isoform X1 [Cucurbita moschata] (e-value: 3.2e-188, % identity: 85.08)
Query:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
        MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
Subjt:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS

Query:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
        GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY

Query:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK
        FSRLDLNDSSKKQAREARAAIQELAVLAQSTS                                                                IVEK
Subjt:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK

Query:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
        LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
Subjt:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL

Query:  SEEPTYIGVKAEMDKILEWLVPIAANTSK
        SEEPTYIGVKAEMDKILEWLVPIAANTSK
Subjt:  SEEPTYIGVKAEMDKILEWLVPIAANTSK

XP_022961807.1 uncharacterized protein LOC111462458 isoform X3 [Cucurbita moschata] (e-value: 3.2e-188, % identity: 85.08)
Query:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
        MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
Subjt:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS

Query:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
        GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY

Query:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK
        FSRLDLNDSSKKQAREARAAIQELAVLAQSTS                                                                IVEK
Subjt:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK

Query:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
        LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
Subjt:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL

Query:  SEEPTYIGVKAEMDKILEWLVPIAANTSK
        SEEPTYIGVKAEMDKILEWLVPIAANTSK
Subjt:  SEEPTYIGVKAEMDKILEWLVPIAANTSK

XP_023546943.1 uncharacterized protein LOC111805888 [Cucurbita pepo subsp. pepo] (e-value: 1.3e-181, % identity: 82.75)
Query:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
        MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCL+SEAIDPNEM QRSRSG TLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMN 
Subjt:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS

Query:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
        GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLL EVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY

Query:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK
        FSRLDLNDSSKKQAREARAAIQELAVLAQSTS                                                                IVEK
Subjt:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK

Query:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
        LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYA IISQINLIACRPTSIPSNMRDALYRALP SVKIGLRSRLR V++
Subjt:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL

Query:  SEEPTYIGVKAEMDKILEWLVPIAANTSK
        SEEPTYIGVKAEMDKILEWLVPIAANTSK
Subjt:  SEEPTYIGVKAEMDKILEWLVPIAANTSK

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1HB43 uncharacterized protein LOC111462458 isoform X3 (e-value: 1.5e-188, % identity: 85.08)
Query:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
        MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
Subjt:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS

Query:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
        GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY

Query:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK
        FSRLDLNDSSKKQAREARAAIQELAVLAQSTS                                                                IVEK
Subjt:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK

Query:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
        LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
Subjt:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL

Query:  SEEPTYIGVKAEMDKILEWLVPIAANTSK
        SEEPTYIGVKAEMDKILEWLVPIAANTSK
Subjt:  SEEPTYIGVKAEMDKILEWLVPIAANTSK

A0A6J1HBD1 uncharacterized protein LOC111462458 isoform X2 (e-value: 1.7e-163, % identity: 76.46)
Query:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
        MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMN 
Subjt:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS

Query:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
                                           +ENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY

Query:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK
        FSRLDLNDSSKKQAREARAAIQELAVLAQSTS                                                                IVEK
Subjt:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK

Query:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
        LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
Subjt:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL

Query:  SEEPTYIGVKAEMDKILEWLVPIAANTSK
        SEEPTYIGVKAEMDKILEWLVPIAANTSK
Subjt:  SEEPTYIGVKAEMDKILEWLVPIAANTSK

A0A6J1HF33 uncharacterized protein LOC111462458 isoform X1 (e-value: 1.5e-188, % identity: 85.08)
Query:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
        MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
Subjt:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS

Query:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
        GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY

Query:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK
        FSRLDLNDSSKKQAREARAAIQELAVLAQSTS                                                                IVEK
Subjt:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK

Query:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
        LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
Subjt:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL

Query:  SEEPTYIGVKAEMDKILEWLVPIAANTSK
        SEEPTYIGVKAEMDKILEWLVPIAANTSK
Subjt:  SEEPTYIGVKAEMDKILEWLVPIAANTSK

A0A6J1K3J2 uncharacterized protein LOC111492046 isoform X1 (e-value: 2.5e-175, % identity: 80.42)
Query:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
        MGNTMGGVCSNGIDKDYFESEKITQTSEDR GNSCL+SEAID NEMPQRSRSGV+LL SPPSK GSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMN 
Subjt:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS

Query:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
        GFFTGMASNG EISILAFEVANTISKV NLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLL EVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY

Query:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK
        FSRLDLNDSSKKQAREARAAIQELAVLAQSTS                                                                IVEK
Subjt:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK

Query:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
        LVIVVTWINQTIAK F DHNTGACNGP+I+DKTLPIEDRS GQKLGSVGLALHYA IISQINLIACRP SIPSNMRDALYRALPTSVKIGLRSRLR V++
Subjt:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL

Query:  SEEPTYIGVKAEMDKILEWLVPIAANTSK
        SEEPTYIGVKAEMDKILEWLVPIAANTSK
Subjt:  SEEPTYIGVKAEMDKILEWLVPIAANTSK

A0A6J1KCI8 uncharacterized protein LOC111492046 isoform X2 (e-value: 2.5e-175, % identity: 80.42)
Query:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS
        MGNTMGGVCSNGIDKDYFESEKITQTSEDR GNSCL+SEAID NEMPQRSRSGV+LL SPPSK GSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMN 
Subjt:  MGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAPVNSQAGSRGRAIDLLKTIGNSVSNLHMNS

Query:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY
        GFFTGMASNG EISILAFEVANTISKV NLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLL EVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQY

Query:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK
        FSRLDLNDSSKKQAREARAAIQELAVLAQSTS                                                                IVEK
Subjt:  FSRLDLNDSSKKQAREARAAIQELAVLAQSTS----------------------------------------------------------------IVEK

Query:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL
        LVIVVTWINQTIAK F DHNTGACNGP+I+DKTLPIEDRS GQKLGSVGLALHYA IISQINLIACRP SIPSNMRDALYRALPTSVKIGLRSRLR V++
Subjt:  LVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNL

Query:  SEEPTYIGVKAEMDKILEWLVPIAANTSK
        SEEPTYIGVKAEMDKILEWLVPIAANTSK
Subjt:  SEEPTYIGVKAEMDKILEWLVPIAANTSK

SwissProt top hits (e-value, % identity, alignment)
P0DO24 Protein PSK SIMULATOR 3 (e-value: 7.9e-49, % identity: 36.01)
Query:  SKTGSNKVAPVNSQAGSR--GRAIDLLKTIGNSVSNLHMNSGFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTS
        ++T  +KV   +   G    GRA D+L T+G+S+++L  + GF +G+A+ G E+ ILAFEVANTI K +NL +SLS+ NI+ LK  +L SEG++ LVS  
Subjt:  SKTGSNKVAPVNSQAGSR--GRAIDLLKTIGNSVSNLHMNSGFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTS

Query:  SEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLNDSSKKQARE-ARAAIQELAVLAQSTSIV------------------------
         +ELL + AADKRQE  V  GEV+RFG + KD QWHNL +YF R+    + ++Q +E A   + +L VL Q T+ +                        
Subjt:  SEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLNDSSKKQARE-ARAAIQELAVLAQSTSIV------------------------

Query:  ------EKLVIVVTWI--NQTIAKTFDDHNTGACNGPVITDKTLPI---------------EDRSN-------GQKLGSVGLALHYAIIISQINLIACRP
              + L I+ T +   + + K+    +  +     + +K + I               +D+ +        ++LG  GLALHYA II QI+ +  R 
Subjt:  ------EKLVIVVTWI--NQTIAKTFDDHNTGACNGPVITDKTLPI---------------EDRSN-------GQKLGSVGLALHYAIIISQINLIACRP

Query:  TSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEEPTYIGVKAEMDKILEWLVPIAANTSK
        +SI SN RD+LY++LP  +K+ LRS++++ N+ +E +   +K EM++ L WLVP+A NT+K
Subjt:  TSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEEPTYIGVKAEMDKILEWLVPIAANTSK

Q9SA91 Protein PSK SIMULATOR 2 (e-value: 1.1e-58, % identity: 39.52)
Query:  PPSKTGSNKVAPVNSQAGSRG-----RAIDLLKTIGNSVSNLHMNSGFFTGM-ASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIK
        PP +  S K    NS  G  G     +A+++L T+G+S++ ++ ++ + +G+ +S G +++ILAFEVANTI+K A L QSLSEEN++ +K+++L SE +K
Subjt:  PPSKTGSNKVAPVNSQAGSRG-----RAIDLLKTIGNSVSNLHMNSGFFTGM-ASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIK

Query:  QLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLNDSSKKQAR-EARAAIQELAVLAQSTS--------------------
        +LVST + EL  +AA+DKR+E D+  GEVIRFG  CKD QWHNLD+YF +LD  +S  K  + +A A +QEL  LA+ TS                    
Subjt:  QLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLNDSSKKQAR-EARAAIQELAVLAQSTS--------------------

Query:  --------------------------------------------IVEKLVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLAL
                                                    I+EKLV VV++I QTI + F +      NG  + D     E     ++LG  GL+L
Subjt:  --------------------------------------------IVEKLVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLAL

Query:  HYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEEPTYIGVKAEMDKILEWLVPIAANTSK
        HYA +I QI+ IA RP+S+PSN+RD LY ALP +VK  LR RL+ ++  EE +   +KAEM+K L+WLVP A NT+K
Subjt:  HYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEEPTYIGVKAEMDKILEWLVPIAANTSK

Q9XID5 Protein PSK SIMULATOR 1 (e-value: 2.4e-53, % identity: 35.68)
Query:  ITQTSEDRKGNS-CLSSEAIDPNEMPQRSRSGVTLL---LSPPSKTGSN------KVAPVNSQAG-----SRGRAIDLLKTIGNSVSNLHMNSGFFTGMA
        +T+  +D K  S   S   +     PQ    G+  L   LS  S++  +      KV+ V+S  G       G+A+D+L T+G+S++NL+++ GF +   
Subjt:  ITQTSEDRKGNS-CLSSEAIDPNEMPQRSRSGVTLL---LSPPSKTGSN------KVAPVNSQAG-----SRGRAIDLLKTIGNSVSNLHMNSGFFTGMA

Query:  SNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLN
          G +ISIL+FEVANTI K ANL  SLS+++I  LKE +L SEG++ L+S   +ELL IAAADKR+E  +  GEV+RFG +CKDPQ+HNLD++F RL   
Subjt:  SNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLN

Query:  DSSKKQAREA--------------------------------RAAIQE--------------LAVLAQS---------------------TSIVEKLVIV
         + +K  ++                                 +  IQE              LA+L                          ++EKLV V
Subjt:  DSSKKQAREA--------------------------------RAAIQE--------------LAVLAQS---------------------TSIVEKLVIV

Query:  VTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRS-NGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEE
        V +++  I + F              D   P  D   N +KLGS GLALHYA II+QI+ +  R +++P++ RDALY+ LP S+K  LRSR+++  + EE
Subjt:  VTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRS-NGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEE

Query:  PTYIGVKAEMDKILEWLVPIAANTSK
         T   +KAEM+K L+WLVP+A NT+K
Subjt:  PTYIGVKAEMDKILEWLVPIAANTSK

Arabidopsis top hits (e-value, % identity, alignment)
AT1G30755.1 Protein of unknown function (DUF668) (e-value: 7.8e-60, % identity: 39.52)
Query:  PPSKTGSNKVAPVNSQAGSRG-----RAIDLLKTIGNSVSNLHMNSGFFTGM-ASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIK
        PP +  S K    NS  G  G     +A+++L T+G+S++ ++ ++ + +G+ +S G +++ILAFEVANTI+K A L QSLSEEN++ +K+++L SE +K
Subjt:  PPSKTGSNKVAPVNSQAGSRG-----RAIDLLKTIGNSVSNLHMNSGFFTGM-ASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIK

Query:  QLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLNDSSKKQAR-EARAAIQELAVLAQSTS--------------------
        +LVST + EL  +AA+DKR+E D+  GEVIRFG  CKD QWHNLD+YF +LD  +S  K  + +A A +QEL  LA+ TS                    
Subjt:  QLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLNDSSKKQAR-EARAAIQELAVLAQSTS--------------------

Query:  --------------------------------------------IVEKLVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLAL
                                                    I+EKLV VV++I QTI + F +      NG  + D     E     ++LG  GL+L
Subjt:  --------------------------------------------IVEKLVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLAL

Query:  HYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEEPTYIGVKAEMDKILEWLVPIAANTSK
        HYA +I QI+ IA RP+S+PSN+RD LY ALP +VK  LR RL+ ++  EE +   +KAEM+K L+WLVP A NT+K
Subjt:  HYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEEPTYIGVKAEMDKILEWLVPIAANTSK

AT1G34320.1 Protein of unknown function (DUF668) (e-value: 1.7e-54, % identity: 35.68)
Query:  ITQTSEDRKGNS-CLSSEAIDPNEMPQRSRSGVTLL---LSPPSKTGSN------KVAPVNSQAG-----SRGRAIDLLKTIGNSVSNLHMNSGFFTGMA
        +T+  +D K  S   S   +     PQ    G+  L   LS  S++  +      KV+ V+S  G       G+A+D+L T+G+S++NL+++ GF +   
Subjt:  ITQTSEDRKGNS-CLSSEAIDPNEMPQRSRSGVTLL---LSPPSKTGSN------KVAPVNSQAG-----SRGRAIDLLKTIGNSVSNLHMNSGFFTGMA

Query:  SNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLN
          G +ISIL+FEVANTI K ANL  SLS+++I  LKE +L SEG++ L+S   +ELL IAAADKR+E  +  GEV+RFG +CKDPQ+HNLD++F RL   
Subjt:  SNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLN

Query:  DSSKKQAREA--------------------------------RAAIQE--------------LAVLAQS---------------------TSIVEKLVIV
         + +K  ++                                 +  IQE              LA+L                          ++EKLV V
Subjt:  DSSKKQAREA--------------------------------RAAIQE--------------LAVLAQS---------------------TSIVEKLVIV

Query:  VTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRS-NGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEE
        V +++  I + F              D   P  D   N +KLGS GLALHYA II+QI+ +  R +++P++ RDALY+ LP S+K  LRSR+++  + EE
Subjt:  VTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRS-NGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEE

Query:  PTYIGVKAEMDKILEWLVPIAANTSK
         T   +KAEM+K L+WLVP+A NT+K
Subjt:  PTYIGVKAEMDKILEWLVPIAANTSK

AT3G23160.1 Protein of unknown function (DUF668) (e-value: 1.5e-10, % identity: 21.04)
Query:  ISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRL--------
        I IL+FEVAN +SK  +L +SLS+  I  LK E+  SEG+++LVS+    LL ++ ++K  +   +   V R GK+C +P     +  +  +        
Subjt:  ISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRL--------

Query:  -------DLNDSSKKQAREARAAI-------------QELAVLAQS----------------------------------TSIVEKLVIVVTWINQTIAK
               D+    KK  R   A               Q +  L +S                                    +VE L   V  I   I  
Subjt:  -------DLNDSSKKQAREARAAI-------------QELAVLAQS----------------------------------TSIVEKLVIVVTWINQTIAK

Query:  TF-------------------------------------------------------------------------------DDHNTGACNGPVITDKTL-
         F                                                                               DD + G    P+ T + + 
Subjt:  TF-------------------------------------------------------------------------------DDHNTGACNGPVITDKTL-

Query:  --------PIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTS----VKIGLRSRLRNVNLSEEPTYIGVKAEMDKILEWLVP
                 +   ++   +G   L+LHYA ++  +  +   P  I    RD LY+ LPTS    +K  LRS L+N+++ + P     K  +D IL WL P
Subjt:  --------PIEDRSNGQKLGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTS----VKIGLRSRLRNVNLSEEPTYIGVKAEMDKILEWLVP

Query:  IAAN
        +A N
Subjt:  IAAN

AT5G04550.1 Protein of unknown function (DUF668) (e-value: 4.1e-08, % identity: 43.48)
Query:  LGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLR--NVNLSEEPTY-IGVKAE----MDKILEWLVPIAANTSK
        LG+  LALHYA +I  I      P  I  + RD LY  LP SV+  LR RL+  + NLS    Y  G+  E    M  ILEWL P+A N  K
Subjt:  LGSVGLALHYAIIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLR--NVNLSEEPTY-IGVKAE----MDKILEWLVPIAANTSK

AT5G04550.1 Protein of unknown function (DUF668) (e-value: 5.3e-08, % identity: 28.68)
Query:  ISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLNDSS--
        + +LAFEVA+ +SK+ +L QSLS++N+  L++E+  S GIK+LVS   + ++ +   +  +  + +   V R  ++C DP+    +  FS +    +   
Subjt:  ISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLNDSS--

Query:  ---------KKQAREARAAIQELAVLAQSTSIVEKL
                  K+A++    I   A L Q T I+  L
Subjt:  ---------KKQAREARAAIQELAVLAQSTSIVEKL

AT5G08660.1 Protein of unknown function (DUF668) (e-value: 5.6e-50, % identity: 36.01)
Query:  SKTGSNKVAPVNSQAGSR--GRAIDLLKTIGNSVSNLHMNSGFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTS
        ++T  +KV   +   G    GRA D+L T+G+S+++L  + GF +G+A+ G E+ ILAFEVANTI K +NL +SLS+ NI+ LK  +L SEG++ LVS  
Subjt:  SKTGSNKVAPVNSQAGSR--GRAIDLLKTIGNSVSNLHMNSGFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTS

Query:  SEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLNDSSKKQARE-ARAAIQELAVLAQSTSIV------------------------
         +ELL + AADKRQE  V  GEV+RFG + KD QWHNL +YF R+    + ++Q +E A   + +L VL Q T+ +                        
Subjt:  SEELLSIAAADKRQEFDVLLGEVIRFGKQCKDPQWHNLDQYFSRLDLNDSSKKQARE-ARAAIQELAVLAQSTSIV------------------------

Query:  ------EKLVIVVTWI--NQTIAKTFDDHNTGACNGPVITDKTLPI---------------EDRSN-------GQKLGSVGLALHYAIIISQINLIACRP
              + L I+ T +   + + K+    +  +     + +K + I               +D+ +        ++LG  GLALHYA II QI+ +  R 
Subjt:  ------EKLVIVVTWI--NQTIAKTFDDHNTGACNGPVITDKTLPI---------------EDRSN-------GQKLGSVGLALHYAIIISQINLIACRP

Query:  TSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEEPTYIGVKAEMDKILEWLVPIAANTSK
        +SI SN RD+LY++LP  +K+ LRS++++ N+ +E +   +K EM++ L WLVP+A NT+K
Subjt:  TSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEEPTYIGVKAEMDKILEWLVPIAANTSK
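
The page does not define how the % identity values in the hit lists are computed. A common convention, assumed here, is the number of alignment columns with identical residues divided by the total number of alignment columns, gap columns included. The sketch below implements that convention for two aligned strings; `percent_identity` is a hypothetical helper and the input strings are toy data, not sequences taken from this record.

```python
def percent_identity(query: str, subject: str) -> float:
    """Percent of alignment columns holding the same residue in both rows.

    Both rows must already be aligned (same length, '-' for gaps).
    Gap columns count towards the denominator but never as matches.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have the same length")
    if not query:
        raise ValueError("alignment is empty")
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return round(100.0 * identical / len(query), 2)


# Toy alignment: 5 identical columns out of 7 (two gap columns in the query).
print(percent_identity("MKT--LV", "MKTAALV"))
```

Under this convention, a hit with, say, 365 identical residues across 429 alignment columns would be reported as 85.08, so a long gap lowers the score even when every aligned residue matches.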


Sequences
CDS sequence
ATGGAAAACAGTAATGGATTTTCCATTTGCGTATCGGTTGATAATTTTCGAATCTTAGTTCAATCCGCCGGTTATAACTTATTGTACAGGGACTTTGGAGATTTGAAGAC
GTTTGAGATGGGGAACACAATGGGAGGAGTATGTTCCAATGGAATAGACAAGGATTACTTCGAATCAGAGAAGATAACGCAAACATCTGAGGATCGGAAAGGGAATTCGT
GTTTGAGTTCTGAGGCAATTGACCCGAATGAAATGCCGCAAAGGTCTCGTTCTGGTGTGACACTATTGCTCTCACCTCCGTCTAAGACAGGAAGCAATAAGGTTGCACCA
GTGAACTCACAAGCAGGGTCACGCGGGAGGGCAATTGATTTATTGAAAACAATTGGGAATAGTGTGTCAAATTTGCACATGAACAGTGGGTTTTTTACAGGCATGGCTTC
AAATGGTCGGGAGATCTCTATATTAGCTTTTGAAGTAGCTAATACAATAAGCAAAGTAGCGAATTTGTCACAATCTCTCTCAGAAGAAAACATCCAGCTTCTCAAAGAGG
AACTTTTACAATCAGAAGGGATAAAACAATTAGTCTCAACAAGTTCAGAAGAATTGCTAAGCATTGCAGCTGCTGACAAAAGGCAGGAATTTGACGTTCTCTTAGGGGAG
GTTATACGATTTGGAAAGCAGTGCAAGGATCCACAGTGGCATAATCTGGATCAGTACTTTTCAAGACTAGATTTGAATGATTCAAGTAAAAAACAAGCTCGAGAGGCCAG
AGCAGCGATCCAGGAACTAGCTGTTTTAGCTCAGTCTACATCTATTGTAGAAAAGCTCGTTATTGTTGTAACATGGATAAACCAAACAATAGCCAAAACATTTGATGATC
ACAACACAGGTGCCTGCAATGGACCAGTCATTACAGATAAAACATTGCCTATTGAGGACAGAAGTAATGGCCAGAAACTGGGCTCTGTTGGTCTTGCCTTGCATTATGCA
ATCATAATCAGCCAGATAAATCTCATTGCGTGTCGCCCGACCTCCATTCCTTCAAATATGAGGGATGCATTATACCGGGCATTGCCTACAAGCGTTAAAATTGGTCTGCG
CTCTCGATTGCGGAATGTGAATCTCAGTGAGGAGCCAACTTATATTGGTGTCAAAGCGGAAATGGATAAGATCCTTGAATGGCTTGTTCCAATAGCTGCAAACACGAGCA
AGTAG
mRNA sequence
ATGGAAAACAGTAATGGATTTTCCATTTGCGTATCGGTTGATAATTTTCGAATCTTAGTTCAATCCGCCGGTTATAACTTATTGTACAGGGACTTTGGAGATTTGAAGAC
GTTTGAGATGGGGAACACAATGGGAGGAGTATGTTCCAATGGAATAGACAAGGATTACTTCGAATCAGAGAAGATAACGCAAACATCTGAGGATCGGAAAGGGAATTCGT
GTTTGAGTTCTGAGGCAATTGACCCGAATGAAATGCCGCAAAGGTCTCGTTCTGGTGTGACACTATTGCTCTCACCTCCGTCTAAGACAGGAAGCAATAAGGTTGCACCA
GTGAACTCACAAGCAGGGTCACGCGGGAGGGCAATTGATTTATTGAAAACAATTGGGAATAGTGTGTCAAATTTGCACATGAACAGTGGGTTTTTTACAGGCATGGCTTC
AAATGGTCGGGAGATCTCTATATTAGCTTTTGAAGTAGCTAATACAATAAGCAAAGTAGCGAATTTGTCACAATCTCTCTCAGAAGAAAACATCCAGCTTCTCAAAGAGG
AACTTTTACAATCAGAAGGGATAAAACAATTAGTCTCAACAAGTTCAGAAGAATTGCTAAGCATTGCAGCTGCTGACAAAAGGCAGGAATTTGACGTTCTCTTAGGGGAG
GTTATACGATTTGGAAAGCAGTGCAAGGATCCACAGTGGCATAATCTGGATCAGTACTTTTCAAGACTAGATTTGAATGATTCAAGTAAAAAACAAGCTCGAGAGGCCAG
AGCAGCGATCCAGGAACTAGCTGTTTTAGCTCAGTCTACATCTATTGTAGAAAAGCTCGTTATTGTTGTAACATGGATAAACCAAACAATAGCCAAAACATTTGATGATC
ACAACACAGGTGCCTGCAATGGACCAGTCATTACAGATAAAACATTGCCTATTGAGGACAGAAGTAATGGCCAGAAACTGGGCTCTGTTGGTCTTGCCTTGCATTATGCA
ATCATAATCAGCCAGATAAATCTCATTGCGTGTCGCCCGACCTCCATTCCTTCAAATATGAGGGATGCATTATACCGGGCATTGCCTACAAGCGTTAAAATTGGTCTGCG
CTCTCGATTGCGGAATGTGAATCTCAGTGAGGAGCCAACTTATATTGGTGTCAAAGCGGAAATGGATAAGATCCTTGAATGGCTTGTTCCAATAGCTGCAAACACGAGCA
AGTAG
Protein sequence
MENSNGFSICVSVDNFRILVQSAGYNLLYRDFGDLKTFEMGNTMGGVCSNGIDKDYFESEKITQTSEDRKGNSCLSSEAIDPNEMPQRSRSGVTLLLSPPSKTGSNKVAP
VNSQAGSRGRAIDLLKTIGNSVSNLHMNSGFFTGMASNGREISILAFEVANTISKVANLSQSLSEENIQLLKEELLQSEGIKQLVSTSSEELLSIAAADKRQEFDVLLGE
VIRFGKQCKDPQWHNLDQYFSRLDLNDSSKKQAREARAAIQELAVLAQSTSIVEKLVIVVTWINQTIAKTFDDHNTGACNGPVITDKTLPIEDRSNGQKLGSVGLALHYA
IIISQINLIACRPTSIPSNMRDALYRALPTSVKIGLRSRLRNVNLSEEPTYIGVKAEMDKILEWLVPIAANTSK
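
The protein sequence can be checked against the CDS by translating it with the standard genetic code. The following is a minimal sketch under that assumption; `translate` and `CODON_TABLE` are illustrative names, and the two inputs in the usage lines are the first 30 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; ambiguous bases such as N
    raise KeyError because they are not in the codon table.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGGAAAACAGTAATGGATTTTCCATTTGC"))  # start of the CDS
print(translate("AACACGAGCAAGTAG"))                 # end of the CDS, including the stop codon
```

The two fragments translate to `MENSNGFSIC` and `NTSK`, matching the start and end of the protein sequence shown above. Pasting the full CDS into `translate` should reproduce the whole protein if the standard code applies.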