; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg033617 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg033617
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF668)
Genome locationscaffold13:38032780..38037743
RNA-Seq ExpressionSpg033617
SyntenySpg033617
Gene Ontology termsGO:0045927 - positive regulation of growth (biological process)
InterPro domainsIPR007700 - Domain of unknown function DUF668
IPR021864 - Domain of unknown function DUF3475
IPR045021 - Protein PSK SIMULATOR


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6598505.1 Protein PSK SIMULATOR 2, partial [Cucurbita argyrosperma subsp. sororia]1.3e-25983.59Show/hide
Query:  MGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNSGFFT
        MGGVCSNGI K+ F SEK TQTSEDRKGNSCL+SEA DP ++PQ+S SGV LL SPPSKTGSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMN GFFT
Subjt:  MGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNSGFFT

Query:  GPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARL
          ASNGREISILAFEVANTISK+ANLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYF+RL
Subjt:  GPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARL

Query:  DLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV
        DLNDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALERFEQDYRRKVDEVES+NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEKLVIV
Subjt:  DLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIV

Query:  VTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEP
        VTWINQTIAKAF D+NTG         KTL IEDRSNGQKLG+VGLALHYANIISQINLIACRPTSIPSNMRDALYRALP SVK  LRSRL+ VD SEEP
Subjt:  VTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEP

Query:  TYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSP
        TY+ VKAEMDKIL WLVPIAANTSKAHQACGRIGEWA+QSKE SKGRATQSNPIRLQTLYHAD+ KTEQ ILELVTLLH  IHL KQQ QRFTSLRC+SP
Subjt:  TYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSP

Query:  TAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        T K+MA+PQP NARRIQFKSQIIKAN DG            TP IRKRDP    GTE+ +N+NKDKGIWTLSKG SVSTLSSL+R
Subjt:  TAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

KAG7029441.1 hypothetical protein SDJN02_07780, partial [Cucurbita argyrosperma subsp. argyrosperma]3.6e-26283.7Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGI K+ F SEK TQTSEDRKGNSCL+SEA DP ++PQ+S SGV LL SPPSKTGSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMN 
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFT  ASNGREISILAFEVANTISK+ANLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALERFEQDYRRKVDEVES+NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA
        LVIVVTWINQTIAKAF D+NTG         KTL IEDRSNGQKLG+VGLALHYANIISQINLIACRPTSIPSNMRDALYRALP SVK  LRSRL+ VD 
Subjt:  LVIVVTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA

Query:  SEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLR
        SEEPTY+ VKAEMDKIL WLVPIAANTSKAHQACGRIGEWA+QSKE SKGRATQSNPIRLQTLYHAD+ KTEQ ILELVTLLH  IHL KQQ QRFTSLR
Subjt:  SEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLR

Query:  CRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        C+SPT K+MA+PQP NARRIQFKSQIIKAN DG            TP IRKRDP    GTE+ +N+NKDKGIWTLSKG SVSTLSSL+R
Subjt:  CRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

XP_022961804.1 uncharacterized protein LOC111462458 isoform X1 [Cucurbita moschata]3.1e-26183.7Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGI K+ F SEK TQTSEDRKGNSCL+SEA DP ++PQ+S SGV LL SPPSKTGSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMNS
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG ASNGREISILAFEVANTISKVANLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLL EVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALERFEQDYRRKVDEVES+NQAG GESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA
        LVIVVTWINQTIAK F D+NTG         KTL IEDRSNGQKLG+VGLALHYA IISQINLIACRPTSIPSNMRDALYRALPTSVK  LRSRL+ V+ 
Subjt:  LVIVVTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA

Query:  SEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLR
        SEEPTY+ VKAEMDKIL WLVPIAANTSKAHQACGRIGEWA+QSKEHSKGRATQSNPIRLQTLYHAD+ KTEQ ILELVTLLH  IHL KQQ QRFTSLR
Subjt:  SEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLR

Query:  CRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        C+SPT+K+MA+PQP NARRIQFKSQIIKAN DG            TP IRKRDP    GTE+ +N NKDKGIWTLSKG SVSTLSSL+R
Subjt:  CRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

XP_022996976.1 uncharacterized protein LOC111492046 isoform X1 [Cucurbita maxima]2.3e-26183.87Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGI K+ F SEK TQTSEDR GNSCLNSEA D  ++PQ+S SGV LLPSPPSK GSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMN 
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG ASNG EISILAFEVANTISKV NLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALERFEQDYRRKVDEVESLNQ GIGESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA
        LVIVVTWINQTIAKAFGD+NTG         KTL IEDRS GQKLG+VGLALHYANIISQINLIACRP SIPSNMRDALYRALPTSVK  LRSRL+ VD 
Subjt:  LVIVVTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA

Query:  SEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLR
        SEEPTY+ VKAEMDKIL WLVPIAANTSKAHQACGRIGEWA+QSKEHSKGRATQSNPIRLQTLYHAD+ KTEQ ILELVTLLH LIHL+KQQ QRFTSLR
Subjt:  SEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLR

Query:  CRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        C+SPT K+MA+PQP NARRIQFKSQIIKAN DG            TP  RKRDP    GTE+ KNEN DKGIWTLSKG SVSTLSSL+R
Subjt:  CRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

XP_023546943.1 uncharacterized protein LOC111805888 [Cucurbita pepo subsp. pepo]3.1e-26183.7Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGI K+ F SEK TQTSEDRKGNSCLNSEA DP ++ Q+S SG  LL SPPSKTGSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMN 
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG ASNGREISILAFEVANTISKVANLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALERFEQDYRRKVDEVES+NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA
        LVIVVTWINQTIAK F D+NTG         KTL IEDRSNGQKLG+VGLALHYANIISQINLIACRPTSIPSNMRDALYRALP SVK  LRSRL+ V  
Subjt:  LVIVVTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA

Query:  SEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLR
        SEEPTY+ VKAEMDKIL WLVPIAANTSKAHQACGRIGEWA+QSKEHSKGRATQSNPIRLQTL+HAD+ KTEQ ILELVTLLH LIHLAKQQ QRFTSLR
Subjt:  SEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLR

Query:  CRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        C+SPT K+MA+PQP NARRIQFKSQIIKAN DG            TP IRKRDP    GTE+ +N+ KDKGIWTLSKG SVSTLSSL+R
Subjt:  CRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

TrEMBL top hitse value%identityAlignment
A0A6J1CSY9 uncharacterized protein LOC111014009 isoform X11.0e-24982.67Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTS-EDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMN
        MGNTMGGVCSNGI K+DF+ EKKT+ S +DRKGNSCL SEATDP+ LPQKS SGVILLPSPPSKTGSNKVAPMN Q GA GRAV+L KTIGNSVS LH N
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTS-EDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMN

Query:  SGFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQ
        +GFFTG ASNGREISILAFEVANTISK+ANLSQSLSEE+IQFLKKEL QSEGIKQLVS ++EELLSIAAADKRQEFD LLREVIRFGKQCKDPQWHNLDQ
Subjt:  SGFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQ

Query:  YFARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVE
        YF+RLDLNDSSQKQAREARAA+QELTVLAQ TSELYHELQALERFEQDYRRK+DEVE LNQAGIGESL+IFQGELNVQRKLVRS QSKCLWSRNLDEIV 
Subjt:  YFARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVE

Query:  KLVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLS
        KLV +VTWI QTIAKAFGDN++G+   I DRS+GQKLG+VGLALHYANII+QINLIACRPTSIPS+MRDALYRALPTSVK  LRSRLQAVDASEEPTYLS
Subjt:  KLVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLS

Query:  VKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSPTAKD
        VKAEMD+ L+WLVPIAANTSKAHQACGRIGEWASQSKE S+GRA Q+N IRLQTLYHADKEKTEQ ILELVT LH +IHLAKQQQQRFTSLRCRSPT KD
Subjt:  VKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSPTAKD

Query:  MAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTPIRKRDPGT----ETSKNENKD-KGIWTLSKGVSVSTLS
        +A PQ    RRIQF+S II      A A+N      QTPIRKRDPG     ETSKNEN+D KGIWTL  G   S+LS
Subjt:  MAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTPIRKRDPGT----ETSKNENKD-KGIWTLSKGVSVSTLS

A0A6J1GJ92 uncharacterized protein LOC111454363 isoform X12.9e-24983.82Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNT+GGVCSNGIVK+DF++EKK +  EDRKGNSCLN +A+DP++LP K  SGVILLPSPPSKTGSNKVAP N+  GA  +AVD LKT GNSVS +H NS
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG AS+GREISILAFEVANTISKVANLS+SLSE+NIQ LKKELSQSEG++QLVS ++EELLSIAAADKRQEFDVLLREVIRFG +CKDPQWHNLDQ+
Subjt:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDS+QK+AREARAAMQELT+LAQ TSELYHE+Q LERFEQDYRR+VDEVE +NQAGIGESLSIFQGELNVQRKLVRSFQ K LWSRNLDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDAS-EEPTYLS
        LV VVTWINQ IAKAFGD NT KTL IE+RS GQKLG+VGLALHY+NIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQ VDAS EEPTYL 
Subjt:  LVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDAS-EEPTYLS

Query:  VKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSPTAKD
        VK EMDKILRWLVPIA NT+KAHQACGRIGEWASQSKE SKGRATQSNPIRLQTLYHADK KTEQ I+ELVTLLH LIHLAK QQQR TSLRCRSPT K+
Subjt:  VKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSPTAKD

Query:  MAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTPIRKRD
        MAI QPN ARRIQ+ SQ +K  KDGAPAD+NRPSAGQTPI KR+
Subjt:  MAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTPIRKRD

A0A6J1HF33 uncharacterized protein LOC111462458 isoform X11.5e-26183.7Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGI K+ F SEK TQTSEDRKGNSCL+SEA DP ++PQ+S SGV LL SPPSKTGSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMNS
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG ASNGREISILAFEVANTISKVANLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLL EVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALERFEQDYRRKVDEVES+NQAG GESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA
        LVIVVTWINQTIAK F D+NTG         KTL IEDRSNGQKLG+VGLALHYA IISQINLIACRPTSIPSNMRDALYRALPTSVK  LRSRL+ V+ 
Subjt:  LVIVVTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA

Query:  SEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLR
        SEEPTY+ VKAEMDKIL WLVPIAANTSKAHQACGRIGEWA+QSKEHSKGRATQSNPIRLQTLYHAD+ KTEQ ILELVTLLH  IHL KQQ QRFTSLR
Subjt:  SEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLR

Query:  CRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        C+SPT+K+MA+PQP NARRIQFKSQIIKAN DG            TP IRKRDP    GTE+ +N NKDKGIWTLSKG SVSTLSSL+R
Subjt:  CRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

A0A6J1K3J2 uncharacterized protein LOC111492046 isoform X11.1e-26183.87Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNTMGGVCSNGI K+ F SEK TQTSEDR GNSCLNSEA D  ++PQ+S SGV LLPSPPSK GSNKVAP+NSQAG+ GRA+DLLKTIGNSVS LHMN 
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG ASNG EISILAFEVANTISKV NLSQSLSEENIQ LK+EL QSEGIKQLVS S EELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
Subjt:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDSS+KQAREARAA+QEL VLAQ+TSELYHEL ALERFEQDYRRKVDEVESLNQ GIGESLSIFQGELNVQRKLVRSFQSKCLWSR+LDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA
        LVIVVTWINQTIAKAFGD+NTG         KTL IEDRS GQKLG+VGLALHYANIISQINLIACRP SIPSNMRDALYRALPTSVK  LRSRL+ VD 
Subjt:  LVIVVTWINQTIAKAFGDNNTG---------KTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDA

Query:  SEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLR
        SEEPTY+ VKAEMDKIL WLVPIAANTSKAHQACGRIGEWA+QSKEHSKGRATQSNPIRLQTLYHAD+ KTEQ ILELVTLLH LIHL+KQQ QRFTSLR
Subjt:  SEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLR

Query:  CRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR
        C+SPT K+MA+PQP NARRIQFKSQIIKAN DG            TP  RKRDP    GTE+ KNEN DKGIWTLSKG SVSTLSSL+R
Subjt:  CRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTP-IRKRDP----GTETSKNENKDKGIWTLSKGVSVSTLSSLSR

A0A6J1KT53 uncharacterized protein LOC111496177 isoform X11.1e-24884.19Show/hide
Query:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS
        MGNT+GGVCSNGIVK+DF++EKK + SEDRKGNSCLN +A+DP++LP K  SGVILLPSPPSKTGSNKVAP N+  GA  +AVD LKT GNSVS +H N+
Subjt:  MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNS

Query:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY
        GFFTG AS+GREISILAFEVANTISKVANLS+SLSEENIQ LKKELSQSEG+KQLVS ++EELLSIAAADKRQEFDVLLREV RFG +CKDPQWHNLDQ+
Subjt:  GFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQY

Query:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
        F+RLDLNDS+QKQAREARAAMQELTVLAQ TSELYHE+Q LERFEQDYRRKVDEVE +NQAGIGESLSIFQGELNVQRKLVRSFQ+K LWSRNLDEIVEK
Subjt:  FARLDLNDSSQKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDAS-EEPTYLS
        LV+VVTWINQ IAKAFGD NT K L IE+RS GQKLG+VGLALHY+NIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVD S EEPTYL 
Subjt:  LVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDAS-EEPTYLS

Query:  VKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSPTAKD
        VK EMDKILRWLVPIA NT+KAHQACGRIGEWASQSKE SKGRATQSNPIRLQTLYHADK KTEQ I+ELVTLLH LIHLAK QQQR TSLRCRSPT ++
Subjt:  VKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSPTAKD

Query:  MAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTPIRKRD
        MAI QPN ARRIQ+ SQ +K  KDGAPAD NRPSAGQTPI KR+
Subjt:  MAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTPIRKRD

SwissProt top hitse value%identityAlignment
P0DO24 Protein PSK SIMULATOR 36.8e-8642.7Show/hide
Query:  SKTGSNKVAPMNSQAG--AYGRAVDLLKTIGNSVSTLHMNSGFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSIS
        ++T  +KV   +   G    GRA D+L T+G+S++ L  + GF +G A+ G E+ ILAFEVANTI K +NL +SLS+ NI+ LK  +  SEG++ LVS  
Subjt:  SKTGSNKVAPMNSQAG--AYGRAVDLLKTIGNSVSTLHMNSGFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSIS

Query:  LEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQARE-ARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESL
         +ELL + AADKRQE  V   EV+RFG + KD QWHNL +YF R+    + Q+Q +E A   + +L VL Q T+ELY ELQ L R E+DY +K  E E+ 
Subjt:  LEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQARE-ARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESL

Query:  NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFG--DNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIA
          +  G+ L+I + EL  QRK+V+S + K LWSR  +E++EKLV +V ++   I   FG  D+   K    E     ++LG  GLALHYANII QI+ + 
Subjt:  NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFG--DNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIA

Query:  CRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYH
         R +SI SN RD+LY++LP  +K ALRS++++ +  +E +   +K EM++ L WLVP+A NT+KAH   G +GEWA+   + +  + +  + +R++TLYH
Subjt:  CRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYH

Query:  ADKEKTEQNILELVTLLHTLIHLAKQQQQ---RFTSLRCRSPTAKDMAIPQP
        A KEKTE  IL  +  L  L+  AK   +   R +S++    T     I +P
Subjt:  ADKEKTEQNILELVTLLHTLIHLAKQQQQ---RFTSLRCRSPTAKDMAIPQP

Q9SA91 Protein PSK SIMULATOR 23.2e-11248.43Show/hide
Query:  MGGVCSNGIVKEDFLSEKKTQTSEDRKG-------NSCLNSEATD---------PEKLPQKSCSGVI-----LLPSPPSKTGSNKVAPMNS---QAGAYG
        MGGVCS   V +D   +KK ++++D K         S   S+ +D           +   K    V      L P PP +  S K    NS   +AG  G
Subjt:  MGGVCSNGIVKEDFLSEKKTQTSEDRKG-------NSCLNSEATD---------PEKLPQKSCSGVI-----LLPSPPSKTGSNKVAPMNS---QAGAYG

Query:  --RAVDLLKTIGNSVSTLHMNSGFFTG-PASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDV
          +AV++L T+G+S++ ++ ++ + +G  +S G +++ILAFEVANTI+K A L QSLSEEN++F+KK++  SE +K+LVS    EL  +AA+DKR+E D+
Subjt:  --RAVDLLKTIGNSVSTLHMNSGFFTG-PASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDV

Query:  LLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNV
           EVIRFG  CKD QWHNLD+YF +LD  +S  K  + +A A MQEL  LA+ TSELYHELQAL+RFEQDYRRK+ EVESLN    GE + I Q EL  
Subjt:  LLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNV

Query:  QRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPT
        Q+KLV+S Q K LWS+NL EI+EKLV VV++I QTI + FG+N        E     ++LG  GL+LHYAN+I QI+ IA RP+S+PSN+RD LY ALP 
Subjt:  QRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPT

Query:  SVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRA---TQSNPIRLQTLYHADKEKTEQNILELVTLL
        +VKTALR RLQ +D  EE +   +KAEM+K L+WLVP A NT+KAHQ  G +GEWA+   E  KG+       NP RLQTL+HADK   +  +LELV  L
Subjt:  SVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRA---TQSNPIRLQTLYHADKEKTEQNILELVTLL

Query:  HTLIHLAKQQ
        H L+  +K++
Subjt:  HTLIHLAKQQ

Q9XID5 Protein PSK SIMULATOR 11.1e-9946.48Show/hide
Query:  NKVAPMNSQAG--AYGRAVDLLKTIGNSVSTLHMNSGFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELL
        ++V+ +  +AG    G+AVD+L T+G+S++ L+++ GF +     G +ISIL+FEVANTI K ANL  SLS+++I  LK+ +  SEG++ L+S  ++ELL
Subjt:  NKVAPMNSQAG--AYGRAVDLLKTIGNSVSTLHMNSGFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELL

Query:  SIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVE--SLNQA
         IAAADKR+E  +   EV+RFG +CKDPQ+HNLD++F RL    + QK  + EA   M ++      T++LYHEL AL+RFEQDY+RK+ E E  S  Q 
Subjt:  SIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVE--SLNQA

Query:  GIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTS
        G+G++L+I + EL  Q+K VR+ + K LWSR L+E++EKLV VV +++  I +AFG  +  K  + +   N +KLG+ GLALHYANII+QI+ +  R ++
Subjt:  GIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTS

Query:  IPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEK
        +P++ RDALY+ LP S+K+ALRSR+Q+    EE T   +KAEM+K L+WLVP+A NT+KAH   G +GEWAS   E ++  A Q+  +R+ TL+HADKEK
Subjt:  IPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEK

Query:  TEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSPTAKDMAIPQPNNARRIQFKS
        TE  IL+LV  LH L+     Q +  T    RSP    +  P   N + IQ  S
Subjt:  TEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSPTAKDMAIPQPNNARRIQFKS

Arabidopsis top hitse value%identityAlignment
AT1G30755.1 Protein of unknown function (DUF668)2.3e-11348.43Show/hide
Query:  MGGVCSNGIVKEDFLSEKKTQTSEDRKG-------NSCLNSEATD---------PEKLPQKSCSGVI-----LLPSPPSKTGSNKVAPMNS---QAGAYG
        MGGVCS   V +D   +KK ++++D K         S   S+ +D           +   K    V      L P PP +  S K    NS   +AG  G
Subjt:  MGGVCSNGIVKEDFLSEKKTQTSEDRKG-------NSCLNSEATD---------PEKLPQKSCSGVI-----LLPSPPSKTGSNKVAPMNS---QAGAYG

Query:  --RAVDLLKTIGNSVSTLHMNSGFFTG-PASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDV
          +AV++L T+G+S++ ++ ++ + +G  +S G +++ILAFEVANTI+K A L QSLSEEN++F+KK++  SE +K+LVS    EL  +AA+DKR+E D+
Subjt:  --RAVDLLKTIGNSVSTLHMNSGFFTG-PASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDV

Query:  LLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNV
           EVIRFG  CKD QWHNLD+YF +LD  +S  K  + +A A MQEL  LA+ TSELYHELQAL+RFEQDYRRK+ EVESLN    GE + I Q EL  
Subjt:  LLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNV

Query:  QRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPT
        Q+KLV+S Q K LWS+NL EI+EKLV VV++I QTI + FG+N        E     ++LG  GL+LHYAN+I QI+ IA RP+S+PSN+RD LY ALP 
Subjt:  QRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPT

Query:  SVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRA---TQSNPIRLQTLYHADKEKTEQNILELVTLL
        +VKTALR RLQ +D  EE +   +KAEM+K L+WLVP A NT+KAHQ  G +GEWA+   E  KG+       NP RLQTL+HADK   +  +LELV  L
Subjt:  SVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRA---TQSNPIRLQTLYHADKEKTEQNILELVTLL

Query:  HTLIHLAKQQ
        H L+  +K++
Subjt:  HTLIHLAKQQ

AT1G34320.1 Protein of unknown function (DUF668)7.6e-10146.48Show/hide
Query:  NKVAPMNSQAG--AYGRAVDLLKTIGNSVSTLHMNSGFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELL
        ++V+ +  +AG    G+AVD+L T+G+S++ L+++ GF +     G +ISIL+FEVANTI K ANL  SLS+++I  LK+ +  SEG++ L+S  ++ELL
Subjt:  NKVAPMNSQAG--AYGRAVDLLKTIGNSVSTLHMNSGFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELL

Query:  SIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVE--SLNQA
         IAAADKR+E  +   EV+RFG +CKDPQ+HNLD++F RL    + QK  + EA   M ++      T++LYHEL AL+RFEQDY+RK+ E E  S  Q 
Subjt:  SIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAR-EARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVE--SLNQA

Query:  GIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTS
        G+G++L+I + EL  Q+K VR+ + K LWSR L+E++EKLV VV +++  I +AFG  +  K  + +   N +KLG+ GLALHYANII+QI+ +  R ++
Subjt:  GIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTS

Query:  IPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEK
        +P++ RDALY+ LP S+K+ALRSR+Q+    EE T   +KAEM+K L+WLVP+A NT+KAH   G +GEWAS   E ++  A Q+  +R+ TL+HADKEK
Subjt:  IPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYHADKEK

Query:  TEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSPTAKDMAIPQPNNARRIQFKS
        TE  IL+LV  LH L+     Q +  T    RSP    +  P   N + IQ  S
Subjt:  TEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSPTAKDMAIPQPNNARRIQFKS

AT3G23160.1 Protein of unknown function (DUF668)4.7e-2624.11Show/hide
Query:  ISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQK
        I IL+FEVAN +SK  +L +SLS+  I  LK E+  SEG+++LVS     LL ++ ++K  +   +   V R GK+C +P     +  +  +       +
Subjt:  ISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQK

Query:  Q----AREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWI
        +     ++  + ++++      T  LY E++ +   EQ        V+        ES+  F+ +L  QR+ V+S +   LW++  D++VE L   V  I
Subjt:  Q----AREARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWI

Query:  NQTIAKAFG-------------------------------------------------------------------------------------------
           I   FG                                                                                           
Subjt:  NQTIAKAFG-------------------------------------------------------------------------------------------

Query:  ------DNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKT----ALRSRLQAVDASEEPTYLSVKAEMDKI
               N  G    +   ++   +G   L+LHYAN++  +  +   P  I    RD LY+ LPTS+KT    +LRS L+ +   + P     K  +D I
Subjt:  ------DNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKT----ALRSRLQAVDASEEPTYLSVKAEMDKI

Query:  LRWLVPIAANTSKAHQACGRIGEWASQSK-EHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQ
        L WL P+A N  +          W S+   E       ++N + LQTLY AD+EKTE  I +L+  L+ + H  +QQ
Subjt:  LRWLVPIAANTSKAHQACGRIGEWASQSK-EHSKGRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQ

AT5G08660.1 Protein of unknown function (DUF668)4.8e-8742.7Show/hide
Query:  SKTGSNKVAPMNSQAG--AYGRAVDLLKTIGNSVSTLHMNSGFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSIS
        ++T  +KV   +   G    GRA D+L T+G+S++ L  + GF +G A+ G E+ ILAFEVANTI K +NL +SLS+ NI+ LK  +  SEG++ LVS  
Subjt:  SKTGSNKVAPMNSQAG--AYGRAVDLLKTIGNSVSTLHMNSGFFTGPASNGREISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSIS

Query:  LEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQARE-ARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESL
         +ELL + AADKRQE  V   EV+RFG + KD QWHNL +YF R+    + Q+Q +E A   + +L VL Q T+ELY ELQ L R E+DY +K  E E+ 
Subjt:  LEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQARE-ARAAMQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESL

Query:  NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFG--DNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIA
          +  G+ L+I + EL  QRK+V+S + K LWSR  +E++EKLV +V ++   I   FG  D+   K    E     ++LG  GLALHYANII QI+ + 
Subjt:  NQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFG--DNNTGKTLHIEDRSNGQKLGAVGLALHYANIISQINLIA

Query:  CRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYH
         R +SI SN RD+LY++LP  +K ALRS++++ +  +E +   +K EM++ L WLVP+A NT+KAH   G +GEWA+   + +  + +  + +R++TLYH
Subjt:  CRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNPIRLQTLYH

Query:  ADKEKTEQNILELVTLLHTLIHLAKQQQQ---RFTSLRCRSPTAKDMAIPQP
        A KEKTE  IL  +  L  L+  AK   +   R +S++    T     I +P
Subjt:  ADKEKTEQNILELVTLLHTLIHLAKQQQQ---RFTSLRCRSPTAKDMAIPQP

AT5G51670.1 Protein of unknown function (DUF668)8.6e-2023.25Show/hide
Query:  ISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDP---QWHNLDQYFARLDLNDS
        + +L+FEVA  ++K+ +L+ SL++ N+   +      EG+ ++V+      LS+  A+           V R   +C       +H L   FA +  +  
Subjt:  ISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDP---QWHNLDQYFARLDLNDS

Query:  S-QKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRK--------VDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK
              ++  A  +++      T+ LY E++ +   E   R++         +E +  N+  + + + + Q ++  Q++ V+  + + LW+++ D +V  
Subjt:  S-QKQAREARAAMQELTVLAQTTSELYHELQALERFEQDYRRK--------VDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEK

Query:  LV-------------------------IVVTWINQTIAKAFG--------------DNNTGKTLHIEDRSNGQK-----LGAVGLALHYANIISQINLIA
        L                           VV+ + ++++ +                D  T  +  +E+ S   K     LG  G+ALHYAN+I  +  + 
Subjt:  LV-------------------------IVVTWINQTIAKAFG--------------DNNTGKTLHIEDRSNGQK-----LGAVGLALHYANIISQINLIA

Query:  CRPTSIPSNMRDALYRALPTSVKTALRSRLQAV--DASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNP----IR
         +P  +  + RD LY  LP SV+++LRSRL+ V   A++       KA + +ILRWL+P+A N  +          W S+     +  AT +N     + 
Subjt:  CRPTSIPSNMRDALYRALPTSVKTALRSRLQAV--DASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSKGRATQSNP----IR

Query:  LQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQR-FTSLRC
        +QTL  ADK KTE  I EL+  L+ +    ++   +   +L+C
Subjt:  LQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQR-FTSLRC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAACACAATGGGAGGTGTATGTTCCAATGGAATAGTTAAGGAGGATTTCTTGTCAGAGAAGAAAACGCAAACGTCTGAGGATCGGAAAGGGAATTCGTGTTTGAA
TTCTGAGGCAACTGATCCGGAGAAATTGCCGCAGAAGTCTTGTTCTGGTGTGATACTATTGCCCTCACCTCCGTCTAAGACAGGAAGCAATAAGGTTGCACCAATGAACT
CACAAGCAGGGGCTTACGGGAGGGCGGTTGATTTACTGAAAACAATTGGGAATAGTGTGTCAACTTTGCACATGAACAGTGGGTTTTTTACAGGCCCGGCTTCAAATGGT
AGGGAGATTTCTATATTGGCTTTTGAAGTAGCAAACACAATAAGCAAAGTAGCAAATTTATCCCAATCTCTCTCAGAAGAAAACATCCAGTTCCTCAAAAAGGAACTTTC
ACAATCAGAAGGGATAAAACAATTAGTCTCAATAAGTTTAGAGGAATTGCTAAGCATTGCAGCTGCAGACAAAAGGCAGGAATTTGACGTTCTCTTACGGGAGGTAATAC
GATTTGGAAAGCAGTGCAAGGATCCACAATGGCACAATCTGGACCAGTACTTTGCAAGACTAGATTTGAATGATTCAAGTCAAAAACAAGCTCGAGAGGCCAGAGCAGCC
ATGCAGGAACTAACTGTTTTAGCTCAGACTACTTCTGAATTATACCATGAATTACAAGCATTGGAAAGATTTGAGCAAGATTACAGAAGGAAAGTTGACGAAGTGGAGTC
CTTGAACCAAGCAGGAATAGGAGAAAGTCTCTCAATATTCCAAGGAGAATTAAACGTACAAAGAAAGCTCGTAAGGAGCTTTCAAAGCAAGTGCCTTTGGTCCAGAAATT
TAGATGAGATTGTGGAAAAGCTCGTTATTGTTGTAACATGGATAAATCAAACAATAGCCAAAGCATTTGGTGACAACAACACAGGTAAAACATTGCATATCGAGGACAGA
AGTAATGGCCAGAAACTGGGCGCTGTTGGTCTTGCCTTACATTATGCAAACATAATCAGCCAGATAAATCTCATTGCGTGTCGTCCAACCTCTATTCCTTCAAATATGAG
GGATGCATTATACCGGGCATTGCCAACAAGTGTTAAAACAGCTCTGCGGTCTCGGTTGCAGGCTGTGGATGCCAGTGAGGAGCCAACTTATCTTAGTGTCAAAGCTGAAA
TGGATAAGATCCTTCGATGGCTTGTTCCAATTGCTGCAAACACGAGCAAAGCACATCAAGCTTGTGGCCGGATTGGAGAATGGGCATCTCAAAGTAAGGAACACAGCAAA
GGAAGAGCCACACAGAGCAACCCAATCCGCCTTCAAACACTGTACCACGCAGACAAAGAAAAAACAGAGCAAAACATTCTCGAGCTGGTCACATTGCTCCACACTCTCAT
CCATTTAGCAAAACAGCAACAACAACGCTTCACATCGCTCCGTTGCCGATCTCCAACTGCCAAAGACATGGCAATTCCTCAGCCAAACAATGCTCGTCGGATCCAATTCA
AGAGCCAAATCATCAAAGCCAACAAAGACGGAGCTCCAGCCGACAACAACAGACCATCGGCCGGCCAAACTCCGATCAGAAAGAGGGATCCAGGTACGGAAACCTCTAAA
AATGAGAATAAAGATAAAGGAATTTGGACTTTAAGTAAAGGGGTTTCAGTTTCAACCTTGAGTTCTCTTAGTAGAGTATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGAACACAATGGGAGGTGTATGTTCCAATGGAATAGTTAAGGAGGATTTCTTGTCAGAGAAGAAAACGCAAACGTCTGAGGATCGGAAAGGGAATTCGTGTTTGAA
TTCTGAGGCAACTGATCCGGAGAAATTGCCGCAGAAGTCTTGTTCTGGTGTGATACTATTGCCCTCACCTCCGTCTAAGACAGGAAGCAATAAGGTTGCACCAATGAACT
CACAAGCAGGGGCTTACGGGAGGGCGGTTGATTTACTGAAAACAATTGGGAATAGTGTGTCAACTTTGCACATGAACAGTGGGTTTTTTACAGGCCCGGCTTCAAATGGT
AGGGAGATTTCTATATTGGCTTTTGAAGTAGCAAACACAATAAGCAAAGTAGCAAATTTATCCCAATCTCTCTCAGAAGAAAACATCCAGTTCCTCAAAAAGGAACTTTC
ACAATCAGAAGGGATAAAACAATTAGTCTCAATAAGTTTAGAGGAATTGCTAAGCATTGCAGCTGCAGACAAAAGGCAGGAATTTGACGTTCTCTTACGGGAGGTAATAC
GATTTGGAAAGCAGTGCAAGGATCCACAATGGCACAATCTGGACCAGTACTTTGCAAGACTAGATTTGAATGATTCAAGTCAAAAACAAGCTCGAGAGGCCAGAGCAGCC
ATGCAGGAACTAACTGTTTTAGCTCAGACTACTTCTGAATTATACCATGAATTACAAGCATTGGAAAGATTTGAGCAAGATTACAGAAGGAAAGTTGACGAAGTGGAGTC
CTTGAACCAAGCAGGAATAGGAGAAAGTCTCTCAATATTCCAAGGAGAATTAAACGTACAAAGAAAGCTCGTAAGGAGCTTTCAAAGCAAGTGCCTTTGGTCCAGAAATT
TAGATGAGATTGTGGAAAAGCTCGTTATTGTTGTAACATGGATAAATCAAACAATAGCCAAAGCATTTGGTGACAACAACACAGGTAAAACATTGCATATCGAGGACAGA
AGTAATGGCCAGAAACTGGGCGCTGTTGGTCTTGCCTTACATTATGCAAACATAATCAGCCAGATAAATCTCATTGCGTGTCGTCCAACCTCTATTCCTTCAAATATGAG
GGATGCATTATACCGGGCATTGCCAACAAGTGTTAAAACAGCTCTGCGGTCTCGGTTGCAGGCTGTGGATGCCAGTGAGGAGCCAACTTATCTTAGTGTCAAAGCTGAAA
TGGATAAGATCCTTCGATGGCTTGTTCCAATTGCTGCAAACACGAGCAAAGCACATCAAGCTTGTGGCCGGATTGGAGAATGGGCATCTCAAAGTAAGGAACACAGCAAA
GGAAGAGCCACACAGAGCAACCCAATCCGCCTTCAAACACTGTACCACGCAGACAAAGAAAAAACAGAGCAAAACATTCTCGAGCTGGTCACATTGCTCCACACTCTCAT
CCATTTAGCAAAACAGCAACAACAACGCTTCACATCGCTCCGTTGCCGATCTCCAACTGCCAAAGACATGGCAATTCCTCAGCCAAACAATGCTCGTCGGATCCAATTCA
AGAGCCAAATCATCAAAGCCAACAAAGACGGAGCTCCAGCCGACAACAACAGACCATCGGCCGGCCAAACTCCGATCAGAAAGAGGGATCCAGGTACGGAAACCTCTAAA
AATGAGAATAAAGATAAAGGAATTTGGACTTTAAGTAAAGGGGTTTCAGTTTCAACCTTGAGTTCTCTTAGTAGAGTATAG
Protein sequenceShow/hide protein sequence
MGNTMGGVCSNGIVKEDFLSEKKTQTSEDRKGNSCLNSEATDPEKLPQKSCSGVILLPSPPSKTGSNKVAPMNSQAGAYGRAVDLLKTIGNSVSTLHMNSGFFTGPASNG
REISILAFEVANTISKVANLSQSLSEENIQFLKKELSQSEGIKQLVSISLEELLSIAAADKRQEFDVLLREVIRFGKQCKDPQWHNLDQYFARLDLNDSSQKQAREARAA
MQELTVLAQTTSELYHELQALERFEQDYRRKVDEVESLNQAGIGESLSIFQGELNVQRKLVRSFQSKCLWSRNLDEIVEKLVIVVTWINQTIAKAFGDNNTGKTLHIEDR
SNGQKLGAVGLALHYANIISQINLIACRPTSIPSNMRDALYRALPTSVKTALRSRLQAVDASEEPTYLSVKAEMDKILRWLVPIAANTSKAHQACGRIGEWASQSKEHSK
GRATQSNPIRLQTLYHADKEKTEQNILELVTLLHTLIHLAKQQQQRFTSLRCRSPTAKDMAIPQPNNARRIQFKSQIIKANKDGAPADNNRPSAGQTPIRKRDPGTETSK
NENKDKGIWTLSKGVSVSTLSSLSRV