; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027431 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027431
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionUDP-N-acetylglucosamine--N-acetylmuramyl- (Pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase
Genome locationtig00153054:1390542..1407300
RNA-Seq ExpressionSgr027431
SyntenySgr027431
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0007049 - cell cycle (biological process)
GO:0008360 - regulation of cell shape (biological process)
GO:0030259 - lipid glycosylation (biological process)
GO:0051301 - cell division (biological process)
GO:0071555 - cell wall organization (biological process)
GO:0005886 - plasma membrane (cellular component)
GO:0004252 - serine-type endopeptidase activity (molecular function)
GO:0050511 - undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase activity (molecular function)
InterPro domainsIPR002471 - Peptidase S9, serine active site
IPR004276 - Glycosyltransferase family 28, N-terminal domain
IPR006009 - N-acetylglucosaminyltransferase, MurG
IPR006747 - Protein of unknown function DUF599
IPR007235 - Glycosyl transferase, family 28, C-terminal
IPR029058 - Alpha/Beta hydrolase fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG2321866.1 hypothetical protein Bca52824_015079 [Brassica carinata]6.2e-22646.18Show/hide
Query:  PPFLSDQPKFQFPI-------VSYVNIIRCLSIHRSNNQDSTISNSTIDS--LRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIP
        P FLS    F  PI           + + CLS+ R N+  STI N    S  LRVV +AGGT GH+  A+AI DELK   P AQILF+G PNS ES  +P
Subjt:  PPFLSDQPKFQFPI-------VSYVNIIRCLSIHRSNNQDSTISNSTIDS--LRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIP

Query:  SAGYEFDTVPAT--QLDRPLLSPRNVL-LPFHLIKSVVAGYKKLRDFKPHIVIGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFV
        SAG++F  +        RP L   + L  P  LIKS    Y+ LR+F+P IV+GTGG+ SFP+C AA ++  +KL IQEQ+S+PG  NW+LS FAD +F 
Subjt:  SAGYEFDTVPAT--QLDRPLLSPRNVL-LPFHLIKSVVAGYKKLRDFKPHIVIGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFV

Query:  VLNSTVECFPSK--KKCLVCGNPVRLALRQYVSKAVARLHFFPR-LGKTEDLEAKVLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDE
          N TV   P +   KC+V GNP+R ALR+YVSK  AR+ FF +  G   D  AKV+LVL GSLGANAINIA+LN Y ++L E++N + +WQTGVE FDE
Subjt:  VLNSTVECFPSK--KKCLVCGNPVRLALRQYVSKAVARLHFFPR-LGKTEDLEAKVLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDE

Query:  MDSLVKNHPHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPYEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDAS
        MDSLV++HP L L PF+ ++ +AYAAADL++SRAGAMTCSEI+A GKPSILIPSP+ DEG Q R+AS+MAD+ GS+VI E+ELD+ TL  AI+EILG+  
Subjt:  MDSLVKNHPHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPYEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDAS

Query:  KMADMSERALRVSKPNAAAEIVLHIGKLINLSTGRESSNDTTDGVKTSQRVVGSTVGRKKSKLGQAGPTANSTSTHPHDCIANSQLRDVTSSAFANRSER
         M +MSERALR  KP AA ++  HI     +S  +   N +     +  R   S                               LR   SS        
Subjt:  KMADMSERALRVSKPNAAAEIVLHIGKLINLSTGRESSNDTTDGVKTSQRVVGSTVGRKKSKLGQAGPTANSTSTHPHDCIANSQLRDVTSSAFANRSER

Query:  PRLSHLVLFFLTNAPQAITSLQASALYLLTPTPIVSSAEPMATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGLH
                                    LT + + SS      LP S L L  +G S                          A G  SV A +      
Subjt:  PRLSHLVLFFLTNAPQAITSLQASALYLLTPTPIVSSAEPMATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGLH

Query:  DKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISS
               CTADELHYV VPNSDWR ALWRYLPS +AP R HPLLLLSG+ SNA  YDLS E SFAR MS  G+DTWILE+RG+GLS              
Subjt:  DKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISS

Query:  ETLVRKIGHYPEENRSLVTSASSEGSSISSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALEN----FQEQLDVYEKYDWDFDHYLE
                           S S++    S KD Q   I + L +   N+ + +E     G    F + G+   L      F+++L++  +Y+WDFD+YLE
Subjt:  ETLVRKIGHYPEENRSLVTSASSEGSSISSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALEN----FQEQLDVYEKYDWDFDHYLE

Query:  EDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLAS
        ED+P+AMEY+R Q+KP DGKLLAIGHSMGG+LLYAM+SRC F  VD  LASV T+ S+ DY  S + L+ LLP+++PAQA NV  +PI  +L +AHPL  
Subjt:  EDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLAS

Query:  RPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFV
        RPPY LSWL   ISA  M+ P ++EKLV+N    VPAK+L QL+T  ++GGLRDR  +F+YKDH+ ++NVPILA+AGD D++CPP AVYETVKLIP    
Subjt:  RPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFV

Query:  SYKVVGKPGGPHYAHYDIVGSR
        ++KV G PGGPHY H D++  R
Subjt:  SYKVVGKPGGPHYAHYDIVGSR

KFK41817.1 hypothetical protein AALP_AA2G175000 [Arabis alpina]3.1e-22545.27Show/hide
Query:  CLSIHRSNNQDSTISNSTIDSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVP--ATQLDRPLLSPRNVL-LPF
        CLS++R +  ++T        LRVV  AGGT GH+ PA+AI DELK   P+ QILF+G PNS ES  IPSAG+ F ++    +   RP L   ++   P 
Subjt:  CLSIHRSNNQDSTISNSTIDSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVP--ATQLDRPLLSPRNVL-LPF

Query:  HLIKSVVAGYKKLRDFKPHIVIGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSK--KKCLVCGNPVRLALRQY
         LI+S    YK L+DFKP IV+GTGG+ SFP+C AA ++  + L IQEQ+S+PG  NW+LS+FAD +F   N TV   P +   KC+V GNP+R  LR++
Subjt:  HLIKSVVAGYKKLRDFKPHIVIGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSK--KKCLVCGNPVRLALRQY

Query:  VSKAVARLHFFPRLGKTEDLEAKVLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAYAAADLIVS
        VSK  AR+ FF +      ++AKV+LVLGGSLGANAINIA+LN Y ++L E++N + +WQTGVE FDEMDSLVK+HP L L PF+ S+ +AYAAADL++S
Subjt:  VSKAVARLHFFPRLGKTEDLEAKVLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAYAAADLIVS

Query:  RAGAMTCSEILATGKPSILIPSPYEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEIVLHIGKLINLS
        RAGAMTCSEILA GKPSILIPSP+ DEG Q RNAS+MAD+ GS++I E+ELD+ TL  A++EILG+   M +MS RA   +KP+AA ++  HI  ++   
Subjt:  RAGAMTCSEILATGKPSILIPSPYEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEIVLHIGKLINLS

Query:  TGRESSNDTTDGVKTSQRVVGSTVGRKKSKLGQAGPTANSTSTHPHDCIANSQLRDVTSSAFANRSERPRLSHLVLFFLTNAPQAITSLQASALYLLTPT
             S D +                 K +  Q  P+ +        C  N     +  S          + +L     ++     + +  +  + L+  
Subjt:  TGRESSNDTTDGVKTSQRVVGSTVGRKKSKLGQAGPTANSTSTHPHDCIANSQLRDVTSSAFANRSERPRLSHLVLFFLTNAPQAITSLQASALYLLTPT

Query:  PIVSSAEPMATL-PLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGLHDKEKGLICTADELHYVSVPNSDWRLALWRYL
         + S+A   ++L   SR    S+     R   +N            +S    AV   +V                ICTADELHYV VPNSDWR ALWRYL
Subjt:  PIVSSAEPMATL-PLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGLHDKEKGLICTADELHYVSVPNSDWRLALWRYL

Query:  PSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEM---KENEQISSETLVRKIGHYPEENRSLVTSASSEGSSI
        PS + P R HPLLLLSG+ +NA  YDLSPEFSFAR MS  G+DTWILE+RG+GLS+  ++     + +QI S+ L   I                     
Subjt:  PSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEM---KENEQISSETLVRKIGHYPEENRSLVTSASSEGSSI

Query:  SSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALE----NFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSM
                           N+ + IE A   G    F + G+   L     +F+E+L++   Y+WDFD+YLEED+P+AMEY+R+Q+KP DGKLLAIGHSM
Subjt:  SSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALE----NFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSM

Query:  GGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLV
        GGILLYA++SRC     D  LASV TL ++LDY  S + L+ LLP++DPAQA N+  +PI  +L + HPL  RPPY LSWL   ISA  M+ P ++EKLV
Subjt:  GGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLV

Query:  MNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSR
        +    +VP K+L+QL+T  +  GLRDR  TF+YKDH+ ++NVPILA+AGD D++CPP AVY+TVK+IP    ++KVVG PGGPHY H D++  R
Subjt:  MNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSR

RXH96587.1 hypothetical protein DVH24_009091 [Malus domestica]9.2e-27053.08Show/hide
Query:  PPFLSDQPKFQFPIVSYVNIIRCLSIHRSNNQDSTISNSTIDSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTV
        P FL        P    + I  CLS+ +SN+Q     N+ +D+LRVVFAAGGTGGH+YPAVAIADEL+LT P+++ILFLG+PNS ES AIPSAGY+FDTV
Subjt:  PPFLSDQPKFQFPIVSYVNIIRCLSIHRSNNQDSTISNSTIDSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTV

Query:  PATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIVIGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPS-
        P T+L RP+ SPRN+LLP++LI+SVV  Y KLRDF PH+VIGTGGYVSF + LAA+LI G KL IQEQN VPG ANWVLS+FADVVFV  N T++CFPS 
Subjt:  PATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIVIGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPS-

Query:  KKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLE-AKVLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHL
        + KC+VCGNPVRL+L++ V KA A   FFP+ G   +LE AKVLLVLGGSLGANAINIA+LNLYYQMLLEN NL+IIWQTGVE  +EM+SLVKNHPHL L
Subjt:  KKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLE-AKVLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHL

Query:  MPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPYEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVS
         PF+HS+ LAYAAADLIVSRAGAMTC EILATGKPSILIPSP   EGHQFRNAS+MAD+AG+RVI EDELDSTTL  AI+EILGD SKMA++SERAL+ +
Subjt:  MPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPYEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVS

Query:  KPNAAAEIVLHIGKLINLSTGRESSNDTTDGVKTSQRVVGSTVGRKKSKLGQAGPTANSTSTHPHDCIANSQLRDVTSSAFANRSERPRLSHLVLFFLTN
          NA+ EIV                  +T   K   R+    +  +  K G + P       +      +S LR +T++   +R           F    
Subjt:  KPNAAAEIVLHIGKLINLSTGRESSNDTTDGVKTSQRVVGSTVGRKKSKLGQAGPTANSTSTHPHDCIANSQLRDVTSSAFANRSERPRLSHLVLFFLTN

Query:  APQAITSLQASALYLLTPTPIVSSAEPMATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGLHDKEKGLICTADEL
         P                             P  +L   S  +S    G    L++R  P   W+                            +CTADEL
Subjt:  APQAITSLQASALYLLTPTPIVSSAEPMATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGLHDKEKGLICTADEL

Query:  HYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRKIGHYPEE
        HYVSVP SDW LALWRYLPS +A  RNHPLLLLSG+ +NA+GYDLSPE SFAR MS +GYDTWILE+RG+GLS   +++   ++             P  
Subjt:  HYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRKIGHYPEE

Query:  NRSLVTSASSEGSSISSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPN
          S  T +S+   ++ S+  +   +    R + +  M +   +++L R   F  +  ++ALE+FQ+QLD+  K DWDFDHYLEEDVP AMEYIR Q KP 
Subjt:  NRSLVTSASSEGSSISSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPN

Query:  DGKLLAIGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAED
        DGKLLA+GHSMG           ++   D  LASV TL SSLDY  S SS +L LPL +PAQ  NVP +P+G      H L +R P + SWLK Q+SA+ 
Subjt:  DGKLLAIGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAED

Query:  MIHPTLLEKLVMNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYD
        M+ P L EKLV+N F +VPAKVL+QLST+F +GGL DR+ TF YKDHL +SNVPILA+ GDQDLICP +AVYETVKLIP + V++KV+G+  GPH+AHYD
Subjt:  MIHPTLLEKLVMNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYD

Query:  IVGSRL
        +VG RL
Subjt:  IVGSRL

XP_022137193.1 uncharacterized protein LOC111008720 isoform X1 [Momordica charantia]7.6e-22482.34Show/hide
Query:  MATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFD-GAFGLH-DKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPS
        MATLPLSR DLCSI RSD  C +LNP+RQR KP+GCW  RRRN +   +VRAFD GAFGL+ +KEK  ICTADELHYVSVPNSDWRLALWRY PS RAPS
Subjt:  MATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFD-GAFGLH-DKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRKIGHYPEENRSLVTS--ASSEGSSISSKDGQTS
        RNHPLLLLSGVGSNALGYDLSP  SFAR+MSNQGYDTWILEVRGSGLS +RVEMKENEQI S TL        E+  S  T    SS GSSISSKDG+TS
Subjt:  RNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRKIGHYPEENRSLVTS--ASSEGSSISSKDGQTS

Query:  TIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRC
        TIATQLRQWNQNLMDIIEGAQQLG L+PFNLQGVTSALE FQEQLD+YEKYDWDFDHYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRC
Subjt:  TIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRC

Query:  SFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVL
        SFNKVDPQLASVVTLGSSLD+RPSNSSLRLLLPLRDP QAFNVP  PIGPLL IAHPLASRPPY+LSWLKGQISA DM+HPTLLEKLV++G+ESVP+KVL
Subjt:  SFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVL

Query:  VQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRLQARQ
        VQLSTVFEEGGLRDRN  F YKDHL QSN+PILA+AGDQD+ICPP+AVYETVK IP ++VSYKV+GKPGGPHYAHYDIVGSRL + +
Subjt:  VQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRLQARQ

XP_038894452.1 uncharacterized protein LOC120083031 [Benincasa hispida]5.0e-21580.41Show/hide
Query:  MATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAF-DGAFGLH-DKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPS
        MATL LSR DL SIG+S +R  HL+ LRQ GK +  WA RRRN + +KSVRAF  GA GL+ +KEKGLICTADELHYVSVPNSDW+LALWRYLPS+RAPS
Subjt:  MATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAF-DGAFGLH-DKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRKIGHYPEENRSLVTSASSEGSSISSKDGQTSTI
        RNHPLLLLSGVGSNALGYDLSPE SFAR+MSNQGYDTWILEVRG GLS +R +MK+ EQI SETL ++    P    S  T  SSEGS ISS+DGQTS I
Subjt:  RNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRKIGHYPEENRSLVTSASSEGSSISSKDGQTSTI

Query:  ATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRCSF
        ATQLRQWN+NL+++I+GAQQLG  QPFNLQGVTSALE FQEQL VYEKYDWDFD+YLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCSF
Subjt:  ATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRCSF

Query:  NKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVLVQ
         KVDPQLASVVTL SSLDYRPSNSSLRLLLPLRDPAQ  NVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDM+HPTLLEKLVMNGF SVPAKVL+Q
Subjt:  NKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVLVQ

Query:  LSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRLQARQ
        LS+VFEEGGL DR+ TF YKD+L Q NVP+LA+AGDQDLICPP+AVYETVK IP Q VSYKV+GK GGPHYAHYDIVGS L + +
Subjt:  LSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRLQARQ

TrEMBL top hitse value%identityAlignment
A0A498JNY4 Uncharacterized protein4.5e-27053.08Show/hide
Query:  PPFLSDQPKFQFPIVSYVNIIRCLSIHRSNNQDSTISNSTIDSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTV
        P FL        P    + I  CLS+ +SN+Q     N+ +D+LRVVFAAGGTGGH+YPAVAIADEL+LT P+++ILFLG+PNS ES AIPSAGY+FDTV
Subjt:  PPFLSDQPKFQFPIVSYVNIIRCLSIHRSNNQDSTISNSTIDSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTV

Query:  PATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIVIGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPS-
        P T+L RP+ SPRN+LLP++LI+SVV  Y KLRDF PH+VIGTGGYVSF + LAA+LI G KL IQEQN VPG ANWVLS+FADVVFV  N T++CFPS 
Subjt:  PATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIVIGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPS-

Query:  KKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLE-AKVLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHL
        + KC+VCGNPVRL+L++ V KA A   FFP+ G   +LE AKVLLVLGGSLGANAINIA+LNLYYQMLLEN NL+IIWQTGVE  +EM+SLVKNHPHL L
Subjt:  KKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLE-AKVLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHL

Query:  MPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPYEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVS
         PF+HS+ LAYAAADLIVSRAGAMTC EILATGKPSILIPSP   EGHQFRNAS+MAD+AG+RVI EDELDSTTL  AI+EILGD SKMA++SERAL+ +
Subjt:  MPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPYEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVS

Query:  KPNAAAEIVLHIGKLINLSTGRESSNDTTDGVKTSQRVVGSTVGRKKSKLGQAGPTANSTSTHPHDCIANSQLRDVTSSAFANRSERPRLSHLVLFFLTN
          NA+ EIV                  +T   K   R+    +  +  K G + P       +      +S LR +T++   +R           F    
Subjt:  KPNAAAEIVLHIGKLINLSTGRESSNDTTDGVKTSQRVVGSTVGRKKSKLGQAGPTANSTSTHPHDCIANSQLRDVTSSAFANRSERPRLSHLVLFFLTN

Query:  APQAITSLQASALYLLTPTPIVSSAEPMATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGLHDKEKGLICTADEL
         P                             P  +L   S  +S    G    L++R  P   W+                            +CTADEL
Subjt:  APQAITSLQASALYLLTPTPIVSSAEPMATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGLHDKEKGLICTADEL

Query:  HYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRKIGHYPEE
        HYVSVP SDW LALWRYLPS +A  RNHPLLLLSG+ +NA+GYDLSPE SFAR MS +GYDTWILE+RG+GLS   +++   ++             P  
Subjt:  HYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRKIGHYPEE

Query:  NRSLVTSASSEGSSISSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPN
          S  T +S+   ++ S+  +   +    R + +  M +   +++L R   F  +  ++ALE+FQ+QLD+  K DWDFDHYLEEDVP AMEYIR Q KP 
Subjt:  NRSLVTSASSEGSSISSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPN

Query:  DGKLLAIGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAED
        DGKLLA+GHSMG           ++   D  LASV TL SSLDY  S SS +L LPL +PAQ  NVP +P+G      H L +R P + SWLK Q+SA+ 
Subjt:  DGKLLAIGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAED

Query:  MIHPTLLEKLVMNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYD
        M+ P L EKLV+N F +VPAKVL+QLST+F +GGL DR+ TF YKDHL +SNVPILA+ GDQDLICP +AVYETVKLIP + V++KV+G+  GPH+AHYD
Subjt:  MIHPTLLEKLVMNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYD

Query:  IVGSRL
        +VG RL
Subjt:  IVGSRL

A0A6J1C9M1 uncharacterized protein LOC111008720 isoform X13.7e-22482.34Show/hide
Query:  MATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFD-GAFGLH-DKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPS
        MATLPLSR DLCSI RSD  C +LNP+RQR KP+GCW  RRRN +   +VRAFD GAFGL+ +KEK  ICTADELHYVSVPNSDWRLALWRY PS RAPS
Subjt:  MATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFD-GAFGLH-DKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPS

Query:  RNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRKIGHYPEENRSLVTS--ASSEGSSISSKDGQTS
        RNHPLLLLSGVGSNALGYDLSP  SFAR+MSNQGYDTWILEVRGSGLS +RVEMKENEQI S TL        E+  S  T    SS GSSISSKDG+TS
Subjt:  RNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRKIGHYPEENRSLVTS--ASSEGSSISSKDGQTS

Query:  TIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRC
        TIATQLRQWNQNLMDIIEGAQQLG L+PFNLQGVTSALE FQEQLD+YEKYDWDFDHYLEED+PAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRC
Subjt:  TIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRC

Query:  SFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVL
        SFNKVDPQLASVVTLGSSLD+RPSNSSLRLLLPLRDP QAFNVP  PIGPLL IAHPLASRPPY+LSWLKGQISA DM+HPTLLEKLV++G+ESVP+KVL
Subjt:  SFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVL

Query:  VQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRLQARQ
        VQLSTVFEEGGLRDRN  F YKDHL QSN+PILA+AGDQD+ICPP+AVYETVK IP ++VSYKV+GKPGGPHYAHYDIVGSRL + +
Subjt:  VQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRLQARQ

A0A6J1GTQ4 uncharacterized protein LOC111457026 isoform X11.2e-21474.39Show/hide
Query:  SAFANRSERPRLSHLVLFFLTNAPQAITSLQASALYLLTPTPIVSSAEPMATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVR
        S+F  +  +P     V+ FL N PQAITSL     +  +P+  +   E MATL LS   L S+GRSDNR  HL+  R++GK E  WA  RRN V +KSV 
Subjt:  SAFANRSERPRLSHLVLFFLTNAPQAITSLQASALYLLTPTPIVSSAEPMATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVR

Query:  AFDGAFGLHDKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVE
        AF G     DKEKG ICTADELHYVSVPNSDW+LALWRY P L+A SRNHPLLLLSGVGSNALGYDLSP+ SFAR+MSNQGYDTWILEVRGSGLS +RVE
Subjt:  AFDGAFGLHDKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVE

Query:  MKENE-QISSETLVRKIGHYPEEN--RSLVTSASSEGSSISSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYD
         K+   QI SET        PE+     + T +SSEGSSISSK GQ STIATQL  WN+NL++IIEGAQQLG L+PFNLQGVTSALE+FQEQLDVYEKYD
Subjt:  MKENE-QISSETLVRKIGHYPEEN--RSLVTSASSEGSSISSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYD

Query:  WDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLL
        WDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA IS CSFNKVDPQLASVVTL SSLDYRPSNSSLRLLLPLRDPAQAFNVPV PIGPLL
Subjt:  WDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLL

Query:  VIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETV
         IAHPLASRPPY+L WLK QIS EDM+ PTLLEKLV+NGFESVPAKVL+QLS+VFEEGGLRDRN TF Y DHL Q+NVPILAIAGDQD ICPP+AVYETV
Subjt:  VIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETV

Query:  KLIPRQFVSYKVVGKPGGPHYAHYDIVGSRLQARQ
        K IPR+FVSY+V+GKPGGPHY+HYD+VGSRL + +
Subjt:  KLIPRQFVSYKVVGKPGGPHYAHYDIVGSRLQARQ

A0A6J1JVB9 uncharacterized protein LOC111490055 isoform X21.1e-21280.08Show/hide
Query:  MATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGLHDKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPSRN
        MATL LSR DL SIGRSDNR  HL+  R++GK E  WA  RRN V +KSV AF G     +KEKG ICTADELHYVSVPNSDW+LALWRY PSL+A SRN
Subjt:  MATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGLHDKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPSRN

Query:  HPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKEN-EQISSETLVRKIGHYPEEN--RSLVTSASSEGSSISSKDGQTST
        HPLLLLSGVGSNALGYDLSPE SFAR+MSNQGYDTWILEVRGSGLS +RVE K+   QI SET        PE+     + T +SSEGSS+SSK GQ ST
Subjt:  HPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKEN-EQISSETLVRKIGHYPEEN--RSLVTSASSEGSSISSKDGQTST

Query:  IATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRCS
        IATQL  WN+NL++IIEGAQQLG L+PFNLQGVTSALE+FQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCS
Subjt:  IATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRCS

Query:  FNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVLV
        FNKVDPQLASVVTL SSLDYRPSNSSLRLLLPLRDPAQAFNVPV PIGPLL IAHPLASRPPY+L WLK QIS EDM+ PTLLEKLV+NGFESVPAKVL+
Subjt:  FNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVLV

Query:  QLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRL
        QLS+VFEEGGLRDRN TF Y DHL Q+NVPILAIAGDQD ICPP+AVYETVK IPR+FVSY+V+GKPGGPHY+HYD+VGSRL
Subjt:  QLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRL

A0A6J1K0R1 uncharacterized protein LOC111490055 isoform X11.1e-21280.08Show/hide
Query:  MATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGLHDKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPSRN
        MATL LSR DL SIGRSDNR  HL+  R++GK E  WA  RRN V +KSV AF G     +KEKG ICTADELHYVSVPNSDW+LALWRY PSL+A SRN
Subjt:  MATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGLHDKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPSRN

Query:  HPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKEN-EQISSETLVRKIGHYPEEN--RSLVTSASSEGSSISSKDGQTST
        HPLLLLSGVGSNALGYDLSPE SFAR+MSNQGYDTWILEVRGSGLS +RVE K+   QI SET        PE+     + T +SSEGSS+SSK GQ ST
Subjt:  HPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKEN-EQISSETLVRKIGHYPEEN--RSLVTSASSEGSSISSKDGQTST

Query:  IATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRCS
        IATQL  WN+NL++IIEGAQQLG L+PFNLQGVTSALE+FQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYA ISRCS
Subjt:  IATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLAIGHSMGGILLYAMISRCS

Query:  FNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVLV
        FNKVDPQLASVVTL SSLDYRPSNSSLRLLLPLRDPAQAFNVPV PIGPLL IAHPLASRPPY+L WLK QIS EDM+ PTLLEKLV+NGFESVPAKVL+
Subjt:  FNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFESVPAKVLV

Query:  QLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRL
        QLS+VFEEGGLRDRN TF Y DHL Q+NVPILAIAGDQD ICPP+AVYETVK IPR+FVSY+V+GKPGGPHY+HYD+VGSRL
Subjt:  QLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRL

SwissProt top hitse value%identityAlignment
A0M527 UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase1.0e-5838.24Show/hide
Query:  DSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVPATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIVI
        + LRV+ + GGTGGH+YPA+AIADE+K  YP A+ILF+G  +  E   +P AGYE   +  + +DR   S +N + PF L+ S+    K ++ FKP IVI
Subjt:  DSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVPATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIVI

Query:  GTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSKKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLEAKV
        GTGG+ S P+ L   +  G+   IQEQNS+PG  N +LS  A ++        + FP+ +K ++ GNPVR  L +        L +F +L K    + K 
Subjt:  GTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSKKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLEAKV

Query:  LLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPY
        +LVLGGSLGA  IN  + N   +   +++ + ++WQ G   FDE      +   +    F++ + LAYAAAD+I+SRAGA + SE+   GKP + IPSP 
Subjt:  LLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPY

Query:  EDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEIVLHIGKLIN
          E HQ +NA  + +   + +I EDEL +         +L D  +M   +    +++ PNA ++IV  + KLIN
Subjt:  EDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEIVLHIGKLIN

A5FIY3 UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase4.4e-5737.57Show/hide
Query:  RVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVPATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIVIGTG
        + + + GGTGGH+YPA+AIA+ELKL +P A+ LF+G  +  E   +P AGYE   +    L R  L+ +N++ P  L  S++   + ++ FKP++VIGTG
Subjt:  RVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVPATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIVIGTG

Query:  GYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSKKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLEAKVLLV
        G+ S P+  AA    G+   +QEQNS PG  N +LS  A+ + V   +    FP K+K ++ GNPVR  L    +K    + F+       D   K LLV
Subjt:  GYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSKKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLEAKVLLV

Query:  LGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPYEDE
        LGGSLGA  IN  ++    Q  L ++++ IIWQ G   F++      N  ++ ++ F+  +   YAAAD+I+SRAGA + SE+   GKP I IPSP   E
Subjt:  LGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPYEDE

Query:  GHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEIVLHIGKLI
         HQ +NA  + +  G+ ++ E ELD+   +I  + +L D  K   +S    ++++P+A   IV  I KL+
Subjt:  GHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEIVLHIGKLI

A6H195 UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase3.1e-5835.83Show/hide
Query:  IDSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVPATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIV
        ++  + + + GGTGGH+YPA+AIA+ELK  +P  +ILF+G  +  E   +P AGY+   +    L R  ++ +N + PF L+ S+V  +  ++ FKP +V
Subjt:  IDSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVPATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIV

Query:  IGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSKKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLEAK
        IGTGG+ S  +   A ++ G+   IQEQNS PG  N +LS  A+ + V   +  + FP K K ++ GNPVR  L     K    + +F       D   K
Subjt:  IGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSKKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLEAK

Query:  VLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSP
         +L+LGGSLGA  IN  +      +L  ++N+ IIWQ G   F++         ++ ++ F+  + L YAAAD+++SR+GA + SE+   GKP I IPSP
Subjt:  VLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSP

Query:  YEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEIVLHIGKLI
           E HQ +NA  + +  G+ ++ E +LDS       + +L D +K  D+S+   +++ PNA  +IV  I KL+
Subjt:  YEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEIVLHIGKLI

Q2S528 UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase1.4e-5837.12Show/hide
Query:  VVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVPATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIVIGTGG
        ++   GGTGGHVYPA+AIAD ++   P AQI+F GT +  E+ A+P AGY    + A  L R  ++  N+LLPF + + +V  ++ +   +P + +GTGG
Subjt:  VVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVPATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIVIGTGG

Query:  YVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSKKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLEAKVLLVL
        YV+ P+ +AA L  G  L IQEQN+  G  N VL+  A  + +      +  P+ +  +V GNP R +LR     A       P        + +VLLV+
Subjt:  YVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSKKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLEAKVLLVL

Query:  GGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPYEDEG
        GGSLG+ AIN A+  +   +L E  +++++WQTG   +D++   +  HP L ++ ++  +  AYAAADL V RAGA+TCSE+  TG P++L+PSP     
Subjt:  GGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPYEDEG

Query:  HQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEI
        HQ +NA  +     +  +DE +LD+  L   + ++LG++ + A M+E A   ++P+AA  I
Subjt:  HQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEI

Q8R9G6 UDP-N-acetylglucosamine--N-acetylmuramyl-(pentapeptide) pyrophosphoryl-undecaprenol N-acetylglucosamine transferase1.9e-5537.33Show/hide
Query:  LRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVPATQLDRPLLSPRNVLLPFHLIKSVVAGY-------KKLRDFK
        +R +FA GGTGGH+YPAVAIA E+      AQILF+GT    E   +P  G+E  T+      R         L F  +K+V   +       K L+DFK
Subjt:  LRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVPATQLDRPLLSPRNVLLPFHLIKSVVAGY-------KKLRDFK

Query:  PHIVIGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSKKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTED
        PH+VIGTGGYV  P+ +AA +I  +   I EQN+ PG  N +LS F D+V V    +V+ F   KK +V GNP+R  L + V K   R     +LG +  
Subjt:  PHIVIGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSKKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTED

Query:  LEAKVLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNH-----PHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATG
        +   +++ +GGS GA  IN  M+ L   + ++++   ++  TG   +D++   VK         + ++P+ H +   YAAAD+++ RAGA+T SEI A G
Subjt:  LEAKVLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNH-----PHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATG

Query:  KPSILIPSPYEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEI
         PSILIPSPY    HQ  NA ++       VI E +LD+  L   I+ +L + S + +M E+A  +S+ +A+ +I
Subjt:  KPSILIPSPYEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEI

Arabidopsis top hitse value%identityAlignment
AT1G15060.1 Uncharacterised conserved protein UCP031088, alpha/beta hydrolase3.2e-13550.4Show/hide
Query:  KGLICTADELHYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSE--
        K  +CTADELHYVSVPN+DWRLALWRYLP  +AP+RNHPLLLLSGVG+NA+GYDLSP  SFARHMS QG++TWILEVRG+GLS    ++K+ E+ + E  
Subjt:  KGLICTADELHYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSE--

Query:  -----------------------TLVRKIGHYPEENRSLVTSAS--------------------------SEGSSI--SSK-------------------
                                ++      P  + S+V  AS                          SEG S+  S+K                   
Subjt:  -----------------------TLVRKIGHYPEENRSLVTSAS--------------------------SEGSSI--SSK-------------------

Query:  -----------DGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQ-GVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLA
                     Q S +  Q+R   Q L+++ +  Q+       +LQ  +T+ +E+FQ+QLD+  KYDWDFDHYLEEDVPAA+EY+R QSKP DGKL A
Subjt:  -----------DGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQ-GVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLA

Query:  IGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTL
        IGHSMGGILLYAM+SRC+F   +P +A+V TL SS+DY  SNS+L+LL+PL +PA+A +VPV+P+G LL  A PL++RPPYVLSWL   IS+ DM+HP +
Subjt:  IGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTL

Query:  LEKLVMNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRL
        LEKLV+N F ++PAK+L+QL+T F EGGLRDR+  F+YKDHL +++VP+LA+AGD+DLICPP AV +TVKL P   V+YK++G+P GPHYAHYD+VG RL
Subjt:  LEKLVMNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRL

Query:  QARQ
           Q
Subjt:  QARQ

AT1G73740.1 UDP-Glycosyltransferase superfamily protein1.8e-11453.83Show/hide
Query:  VSYVNIIRCLSIHRSNNQDSTISNSTIDSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVP--ATQLDRPLLSP
        +S  + + CLS+ R  N  S++SN T   LRVV +AGGT GH+  A+AI DELK   P A+ILF+G PNS ES  +PSAG++F T+    +   RP L  
Subjt:  VSYVNIIRCLSIHRSNNQDSTISNSTIDSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVP--ATQLDRPLLSP

Query:  RNVL-LPFHLIKSVVAGYKKLRDFKPHIVIGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSK--KKCLVCGNP
         + L  P  LI+S    YK LR+ KP IVIGTGG+ SFP+C AA +I+  K  IQEQ+S+PG  NW+LS+FAD +F   N TV   P +   KC+V GNP
Subjt:  RNVL-LPFHLIKSVVAGYKKLRDFKPHIVIGTGGYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSK--KKCLVCGNP

Query:  VRLALRQYVSKAVARLHFFPRLGKTEDLEAKVLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAY
        +R  LR+Y SK  AR+ FF +       E KV+L+LGGSLGANAINIA+LN Y Q+L E++N + +WQTGVE FDEMDSLV++HP L L PF+ S+ +AY
Subjt:  VRLALRQYVSKAVARLHFFPRLGKTEDLEAKVLLVLGGSLGANAINIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAY

Query:  AAADLIVSRAGAMTCSEILATGKPSILIPSPYEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEIVLH
        AAADL++SRAGAMTCSEI+A GKPSILIPSP+ DEG Q RNAS+MAD+ GS++I E+ELD+ TL  A+++ILG+   M +MSERA + +K +AA+++  H
Subjt:  AAADLIVSRAGAMTCSEILATGKPSILIPSPYEDEGHQFRNASIMADMAGSRVIDEDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEIVLH

Query:  IGKLI
        I  +I
Subjt:  IGKLI

AT1G73750.1 Uncharacterised conserved protein UCP031088, alpha/beta hydrolase4.0e-12251.21Show/hide
Query:  ICTADELHYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRK
        ICTADELHYV VPNSDWR+ALWRYLPS +AP RNHPLLLLSG+G+NA+ YDLSPE SFAR MS  G+DTWILE+RG+GLS+  V+    +  + + +V  
Subjt:  ICTADELHYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRK

Query:  IGHYPEENRSLVTSASSEGSSISSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYI
                                     S +       ++ L ++++G  ++  +Q      ++    +F+++ ++   Y+WDFD+YLEEDVP+AM+Y+
Subjt:  IGHYPEENRSLVTSASSEGSSISSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYI

Query:  RNQSKPNDGKLLAIGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLK
        R Q+K  DGKLLA+GHSMGGILLYA++SRC F  +D  LA V TL S+ DY  S + L+ LLP+++PAQA N+P++PI  +L +AHPL  RPPY LSWL 
Subjt:  RNQSKPNDGKLLAIGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLK

Query:  GQISAEDMIHPTLLEKLVMNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGG
          ISA  M+ P ++EKLV+N   +VP K+L+QL+T  + GGLRDR  TF YKDH+ ++NVPILA+AGD D+ICPP AVY+TVKLIP    +YKVVG PGG
Subjt:  GQISAEDMIHPTLLEKLVMNGFESVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGG

Query:  PHYAHYDIVGSR
        PHY H D++  R
Subjt:  PHYAHYDIVGSR

AT3G18215.1 Protein of unknown function, DUF5993.7e-5952.61Show/hide
Query:  LECVMVPLGFAVLGSYHLWLIITIYRRPRSTVIGINADSRRQWVSTMISDPLKNGVLAVQTIRNNIMASTLMATTTITIGSLISVFVSSTSSTG------
        L+ V+VP G  V+ +YH+WL+  I  RP+ TVI +NA+SRRQWV +M+++PLKNG LAVQTIRNNIMASTL+ATT IT+ S+I VFVS++SS+       
Subjt:  LECVMVPLGFAVLGSYHLWLIITIYRRPRSTVIGINADSRRQWVSTMISDPLKNGVLAVQTIRNNIMASTLMATTTITIGSLISVFVSSTSSTG------

Query:  ------------KYHYIVLCFLVAFLCNVQSIRYYAHASFLATLPACEGRK---DYLAGTLNRGSLFWSLGLRAFYFAIPLFLWIFGPLSMFMSCCLITF
                    K   I++CFL+AFLCN+QSIRYYAH SFL T+P   G++   +Y++  LNR S FWSLGLRAFYF+ PLFLW FGP+ MF+ CC+++ 
Subjt:  ------------KYHYIVLCFLVAFLCNVQSIRYYAHASFLATLPACEGRK---DYLAGTLNRGSLFWSLGLRAFYFAIPLFLWIFGPLSMFMSCCLITF

Query:  VLYFLDFTRSSSYDPNEYAQKDEANSNDIE
        +LYFLD T S +   +  + ++ A+S D E
Subjt:  VLYFLDFTRSSSYDPNEYAQKDEANSNDIE

AT5G24600.2 Protein of unknown function, DUF5993.0e-5352.42Show/hide
Query:  MEARHLECVMVPLGFAVLGSYHLWLIITIYRRPRSTVIGINADSRRQWVSTMISDPLKNGVLAVQTIRNNIMASTLMATTTITIGSLISVFVSSTSSTG-
        M+  +L+  +VPLG A++  YHLWL+  I  RP STV+G+NA +RR WV  M+ D  KNGVLAVQT+RNNIMASTL+A+T I + SLI+V +  TS+TG 
Subjt:  MEARHLECVMVPLGFAVLGSYHLWLIITIYRRPRSTVIGINADSRRQWVSTMISDPLKNGVLAVQTIRNNIMASTLMATTTITIGSLISVFVSSTSSTG-

Query:  -----------------KYHYIVLCFLVAFLCNVQSIRYYAHASFLATLP-------ACEGR--------KDYLAGTLNRGSLFWSLGLRAFYFAIPLFL
                         K+  I++CFLVAFL NVQSIRYY+HAS L  +P       +  GR        +DY+A T+NRGS FWSLGLRAFYF+ PLFL
Subjt:  -----------------KYHYIVLCFLVAFLCNVQSIRYYAHASFLATLP-------ACEGR--------KDYLAGTLNRGSLFWSLGLRAFYFAIPLFL

Query:  WIFGPLSMFMSCCLITFVLYFLDFTRS
        WIFGP+ MF++CC++   LYFLD T S
Subjt:  WIFGPLSMFMSCCLITFVLYFLDFTRS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGCGAGACATCTTGAGTGTGTGATGGTTCCGCTTGGTTTTGCGGTGCTGGGGAGCTACCATCTTTGGCTGATAATCACCATTTACCGGAGACCCAGAAGCACGGT
CATCGGAATCAATGCCGATTCCCGCCGGCAATGGGTCTCCACCATGATTTCAGATCCTTTAAAGAATGGTGTTCTTGCTGTCCAAACTATACGAAACAACATCATGGCTT
CTACGCTTATGGCCACGACAACAATCACCATTGGCTCATTGATCAGTGTTTTTGTGAGCAGCACATCAAGTACTGGAAAGTACCATTACATTGTATTGTGTTTTCTCGTT
GCATTTCTCTGCAATGTGCAGTCTATTAGATACTATGCTCATGCCAGCTTCTTAGCTACTTTGCCTGCTTGTGAAGGCAGAAAGGATTATCTTGCTGGAACCTTGAACCG
AGGAAGCTTATTTTGGTCGCTCGGATTGCGAGCTTTCTATTTCGCAATTCCTCTCTTCCTATGGATCTTTGGACCTCTGTCGATGTTTATGAGTTGCTGCCTGATAACAT
TTGTTCTTTACTTCTTGGACTTTACTAGAAGTTCCAGCTATGATCCTAACGAATATGCGCAAAAGGATGAAGCGAACAGCAATGACATTGAATCTGTTGGTCAACCCGGG
GGAAACAACTGCGCCGCGAACTCCTGCCTTCACTCTTCTCTACTTGCAGATATTGTTTTATCATTTTCTGCTCTAAGATGTAATGCGAACCATGGCATATATCGTTATGA
AACTACTATTGGCCTCATCTTCCAGCCTTTGAGGCTGCATCGAGATGCCTTCTGGTTTCATGTGTTTCAAGCTCTGTCCTTTTTCACAAATTGCAATGTGGAGTGGAAAG
CCCACCCACCTCCTTTTCTCTCTGACCAACCCAAATTCCAATTCCCAATTGTATCGTATGTCAACATCATCCGCTGTCTATCTATACACCGATCAAACAACCAAGACTCC
ACTATTTCAAACAGCACGATCGACAGCCTGAGAGTCGTCTTCGCCGCTGGGGGCACTGGCGGGCACGTATATCCTGCCGTTGCCATTGCTGATGAGCTCAAACTCACTTA
CCCCACAGCGCAAATCCTCTTTCTAGGAACACCCAACAGCACGGAAAGCGCCGCCATCCCCTCCGCCGGGTACGAGTTCGACACAGTTCCGGCTACCCAGTTAGATCGCC
CCCTCCTCTCTCCCCGAAACGTCCTTCTCCCTTTCCATCTAATCAAATCCGTGGTTGCGGGCTACAAAAAATTACGCGATTTCAAACCCCACATTGTCATAGGCACAGGC
GGGTACGTCTCCTTCCCAATCTGCCTCGCCGCAAGGCTGATTAATGGCGTCAAGCTAGCAATCCAAGAGCAGAACTCGGTTCCAGGCTTTGCGAATTGGGTTCTTTCCTA
TTTTGCAGATGTGGTGTTTGTTGTATTGAACTCCACGGTTGAGTGCTTTCCAAGTAAGAAGAAGTGTTTGGTGTGCGGGAATCCGGTGAGGTTGGCATTGAGGCAGTACG
TATCGAAGGCTGTAGCACGTTTGCATTTCTTTCCAAGATTAGGGAAGACCGAGGATTTGGAGGCTAAGGTGCTGCTAGTTCTTGGAGGGTCTTTGGGTGCCAATGCCATA
AACATTGCCATGTTGAATCTGTATTATCAGATGTTGTTGGAGAACAAGAACTTGTATATTATATGGCAGACTGGCGTGGAGACATTTGATGAGATGGACAGCCTTGTGAA
GAACCATCCCCATCTGCATTTAATGCCGTTCATGCATTCTCTGCATTTGGCATATGCAGCTGCAGATCTCATTGTATCTAGAGCAGGGGCCATGACTTGCTCTGAGATCC
TAGCAACTGGGAAGCCTTCTATTCTGATACCATCACCATATGAAGATGAAGGACATCAGTTCAGAAATGCTTCAATAATGGCTGACATGGCTGGTTCGAGGGTCATAGAT
GAAGATGAACTAGACTCAACTACACTTGCCATTGCAATTCAAGAGATATTAGGTGATGCGAGTAAAATGGCAGATATGTCTGAGAGAGCACTGAGAGTATCAAAACCAAA
TGCCGCTGCTGAAATTGTTCTACACATTGGAAAGTTAATAAACTTGTCAACTGGAAGGGAGAGCAGCAATGATACCACTGATGGAGTTAAGACAAGTCAGCGTGTAGTAG
GGTCCACCGTAGGAAGGAAAAAGTCAAAACTTGGTCAAGCGGGACCCACGGCGAATAGTACCAGTACTCACCCACACGATTGCATCGCCAATTCCCAGCTGCGTGACGTA
ACCTCCTCCGCCTTCGCCAATCGATCCGAGCGGCCACGATTGTCACACCTTGTCCTTTTCTTTCTAACCAATGCCCCACAAGCAATAACCTCTCTACAGGCCTCTGCATT
ATATCTTCTTACACCAACTCCGATCGTCTCCAGCGCAGAGCCCATGGCGACTCTTCCATTGTCTCGTCTGGATCTCTGTTCGATCGGCCGGAGCGACAACCGCTGCGGTC
ACCTCAACCCACTCAGGCAACGAGGGAAACCAGAAGGCTGCTGGGCCTCACGCCGACGGAATGCAGTCGGCATGAAGTCCGTCAGGGCGTTTGACGGAGCATTTGGTTTG
CATGACAAGGAGAAGGGTTTGATCTGTACCGCTGACGAGCTTCATTACGTCTCCGTTCCCAACTCTGATTGGAGGCTCGCTCTCTGGCGTTACCTCCCTTCTCTTCGGGC
GCCATCAAGGAATCATCCGCTTTTGCTGTTATCAGGAGTTGGGAGCAATGCTCTTGGATATGACCTTTCTCCAGAGTTCTCATTTGCTCGCCACATGTCCAACCAAGGAT
ATGACACATGGATTCTTGAAGTTCGAGGATCTGGGCTTAGTGCTAACAGAGTAGAAATGAAGGAAAATGAGCAGATAAGCTCTGAAACTCTGGTTCGAAAGATTGGTCAC
TATCCTGAGGAGAATCGGTCATTAGTCACATCTGCTAGTTCTGAGGGTTCCAGCATTTCTTCAAAAGATGGACAGACTTCTACTATTGCTACGCAACTTAGGCAATGGAA
TCAAAATCTTATGGATATAATTGAAGGAGCTCAACAACTGGGTCGATTGCAGCCTTTTAACTTACAAGGTGTTACCTCTGCATTAGAAAACTTCCAGGAACAACTTGATG
TGTATGAGAAGTATGATTGGGACTTCGATCACTACTTGGAAGAAGATGTGCCTGCTGCGATGGAGTACATAAGGAACCAATCCAAACCAAATGATGGCAAGTTACTAGCA
ATCGGCCACTCAATGGGGGGTATCTTGCTATATGCAATGATCTCGCGGTGTAGCTTTAACAAAGTTGACCCGCAGTTGGCATCAGTTGTTACTTTGGGTTCATCACTTGA
CTACAGACCTTCAAATTCATCACTCAGATTACTGTTACCTTTGAGAGATCCTGCACAAGCTTTTAACGTTCCTGTGATTCCCATCGGGCCTTTGCTTGTTATTGCTCATC
CTCTTGCATCACGTCCTCCTTATGTCTTGTCTTGGTTAAAGGGTCAAATTTCTGCAGAAGACATGATACATCCAACCTTGCTTGAGAAGCTTGTGATGAATGGCTTTGAA
TCTGTGCCTGCAAAGGTTCTTGTGCAGCTATCAACTGTTTTTGAAGAGGGGGGCTTGCGTGACAGGAACAGGACGTTCCATTACAAGGACCATTTACACCAAAGCAATGT
TCCAATCCTTGCTATTGCTGGAGACCAAGACCTCATTTGTCCACCTAAAGCTGTATATGAAACTGTGAAACTAATTCCCAGGCAGTTTGTTTCCTACAAAGTTGTTGGGA
AGCCCGGTGGTCCTCACTATGCTCACTATGATATTGTGGGAAGTCGTTTGCAAGCCCGTCAGGGGATGGATGCCGAACATGACGATGATACCGACTTCGAGACACTTCCG
ATGATTGCGCTGTTGGAAGATCGGAATCAGAATCCAAAAATAAGAGCGTCTCAGGGTTCACTGTTCTCCTTCCTAAACAAGAACGCTGCCATGGCCGAAGCAGAGGAAAC
CCGTCAGCCATTCCTTGCTGGGTTTTGTTCTTGGAGTTCTTCTGCTTTTTTTTCCTCGTCTTCTTTTTCTGCCAATCCGCAGAAGTTCGAATTACTGCCGGGACTGATAC
CTGCTGAGAAGCGGAGGCAGTGCATCCGAAGGGTAAAGATTCATTGGCGCCGCTCGTTGTACTGTTGGAAAAGGTCGAGAGTAGAAGGGAAGAAATGGTGGATTTGCAGC
GAGTGGATTGAATTATGGAAGGGATTTGAGGAGGAGGGATTGATGGGTTTGGATCTGATTCTGAAATGGGTTGGCTCAAATGGCCTCTGGGTCTTCTCAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGCGAGACATCTTGAGTGTGTGATGGTTCCGCTTGGTTTTGCGGTGCTGGGGAGCTACCATCTTTGGCTGATAATCACCATTTACCGGAGACCCAGAAGCACGGT
CATCGGAATCAATGCCGATTCCCGCCGGCAATGGGTCTCCACCATGATTTCAGATCCTTTAAAGAATGGTGTTCTTGCTGTCCAAACTATACGAAACAACATCATGGCTT
CTACGCTTATGGCCACGACAACAATCACCATTGGCTCATTGATCAGTGTTTTTGTGAGCAGCACATCAAGTACTGGAAAGTACCATTACATTGTATTGTGTTTTCTCGTT
GCATTTCTCTGCAATGTGCAGTCTATTAGATACTATGCTCATGCCAGCTTCTTAGCTACTTTGCCTGCTTGTGAAGGCAGAAAGGATTATCTTGCTGGAACCTTGAACCG
AGGAAGCTTATTTTGGTCGCTCGGATTGCGAGCTTTCTATTTCGCAATTCCTCTCTTCCTATGGATCTTTGGACCTCTGTCGATGTTTATGAGTTGCTGCCTGATAACAT
TTGTTCTTTACTTCTTGGACTTTACTAGAAGTTCCAGCTATGATCCTAACGAATATGCGCAAAAGGATGAAGCGAACAGCAATGACATTGAATCTGTTGGTCAACCCGGG
GGAAACAACTGCGCCGCGAACTCCTGCCTTCACTCTTCTCTACTTGCAGATATTGTTTTATCATTTTCTGCTCTAAGATGTAATGCGAACCATGGCATATATCGTTATGA
AACTACTATTGGCCTCATCTTCCAGCCTTTGAGGCTGCATCGAGATGCCTTCTGGTTTCATGTGTTTCAAGCTCTGTCCTTTTTCACAAATTGCAATGTGGAGTGGAAAG
CCCACCCACCTCCTTTTCTCTCTGACCAACCCAAATTCCAATTCCCAATTGTATCGTATGTCAACATCATCCGCTGTCTATCTATACACCGATCAAACAACCAAGACTCC
ACTATTTCAAACAGCACGATCGACAGCCTGAGAGTCGTCTTCGCCGCTGGGGGCACTGGCGGGCACGTATATCCTGCCGTTGCCATTGCTGATGAGCTCAAACTCACTTA
CCCCACAGCGCAAATCCTCTTTCTAGGAACACCCAACAGCACGGAAAGCGCCGCCATCCCCTCCGCCGGGTACGAGTTCGACACAGTTCCGGCTACCCAGTTAGATCGCC
CCCTCCTCTCTCCCCGAAACGTCCTTCTCCCTTTCCATCTAATCAAATCCGTGGTTGCGGGCTACAAAAAATTACGCGATTTCAAACCCCACATTGTCATAGGCACAGGC
GGGTACGTCTCCTTCCCAATCTGCCTCGCCGCAAGGCTGATTAATGGCGTCAAGCTAGCAATCCAAGAGCAGAACTCGGTTCCAGGCTTTGCGAATTGGGTTCTTTCCTA
TTTTGCAGATGTGGTGTTTGTTGTATTGAACTCCACGGTTGAGTGCTTTCCAAGTAAGAAGAAGTGTTTGGTGTGCGGGAATCCGGTGAGGTTGGCATTGAGGCAGTACG
TATCGAAGGCTGTAGCACGTTTGCATTTCTTTCCAAGATTAGGGAAGACCGAGGATTTGGAGGCTAAGGTGCTGCTAGTTCTTGGAGGGTCTTTGGGTGCCAATGCCATA
AACATTGCCATGTTGAATCTGTATTATCAGATGTTGTTGGAGAACAAGAACTTGTATATTATATGGCAGACTGGCGTGGAGACATTTGATGAGATGGACAGCCTTGTGAA
GAACCATCCCCATCTGCATTTAATGCCGTTCATGCATTCTCTGCATTTGGCATATGCAGCTGCAGATCTCATTGTATCTAGAGCAGGGGCCATGACTTGCTCTGAGATCC
TAGCAACTGGGAAGCCTTCTATTCTGATACCATCACCATATGAAGATGAAGGACATCAGTTCAGAAATGCTTCAATAATGGCTGACATGGCTGGTTCGAGGGTCATAGAT
GAAGATGAACTAGACTCAACTACACTTGCCATTGCAATTCAAGAGATATTAGGTGATGCGAGTAAAATGGCAGATATGTCTGAGAGAGCACTGAGAGTATCAAAACCAAA
TGCCGCTGCTGAAATTGTTCTACACATTGGAAAGTTAATAAACTTGTCAACTGGAAGGGAGAGCAGCAATGATACCACTGATGGAGTTAAGACAAGTCAGCGTGTAGTAG
GGTCCACCGTAGGAAGGAAAAAGTCAAAACTTGGTCAAGCGGGACCCACGGCGAATAGTACCAGTACTCACCCACACGATTGCATCGCCAATTCCCAGCTGCGTGACGTA
ACCTCCTCCGCCTTCGCCAATCGATCCGAGCGGCCACGATTGTCACACCTTGTCCTTTTCTTTCTAACCAATGCCCCACAAGCAATAACCTCTCTACAGGCCTCTGCATT
ATATCTTCTTACACCAACTCCGATCGTCTCCAGCGCAGAGCCCATGGCGACTCTTCCATTGTCTCGTCTGGATCTCTGTTCGATCGGCCGGAGCGACAACCGCTGCGGTC
ACCTCAACCCACTCAGGCAACGAGGGAAACCAGAAGGCTGCTGGGCCTCACGCCGACGGAATGCAGTCGGCATGAAGTCCGTCAGGGCGTTTGACGGAGCATTTGGTTTG
CATGACAAGGAGAAGGGTTTGATCTGTACCGCTGACGAGCTTCATTACGTCTCCGTTCCCAACTCTGATTGGAGGCTCGCTCTCTGGCGTTACCTCCCTTCTCTTCGGGC
GCCATCAAGGAATCATCCGCTTTTGCTGTTATCAGGAGTTGGGAGCAATGCTCTTGGATATGACCTTTCTCCAGAGTTCTCATTTGCTCGCCACATGTCCAACCAAGGAT
ATGACACATGGATTCTTGAAGTTCGAGGATCTGGGCTTAGTGCTAACAGAGTAGAAATGAAGGAAAATGAGCAGATAAGCTCTGAAACTCTGGTTCGAAAGATTGGTCAC
TATCCTGAGGAGAATCGGTCATTAGTCACATCTGCTAGTTCTGAGGGTTCCAGCATTTCTTCAAAAGATGGACAGACTTCTACTATTGCTACGCAACTTAGGCAATGGAA
TCAAAATCTTATGGATATAATTGAAGGAGCTCAACAACTGGGTCGATTGCAGCCTTTTAACTTACAAGGTGTTACCTCTGCATTAGAAAACTTCCAGGAACAACTTGATG
TGTATGAGAAGTATGATTGGGACTTCGATCACTACTTGGAAGAAGATGTGCCTGCTGCGATGGAGTACATAAGGAACCAATCCAAACCAAATGATGGCAAGTTACTAGCA
ATCGGCCACTCAATGGGGGGTATCTTGCTATATGCAATGATCTCGCGGTGTAGCTTTAACAAAGTTGACCCGCAGTTGGCATCAGTTGTTACTTTGGGTTCATCACTTGA
CTACAGACCTTCAAATTCATCACTCAGATTACTGTTACCTTTGAGAGATCCTGCACAAGCTTTTAACGTTCCTGTGATTCCCATCGGGCCTTTGCTTGTTATTGCTCATC
CTCTTGCATCACGTCCTCCTTATGTCTTGTCTTGGTTAAAGGGTCAAATTTCTGCAGAAGACATGATACATCCAACCTTGCTTGAGAAGCTTGTGATGAATGGCTTTGAA
TCTGTGCCTGCAAAGGTTCTTGTGCAGCTATCAACTGTTTTTGAAGAGGGGGGCTTGCGTGACAGGAACAGGACGTTCCATTACAAGGACCATTTACACCAAAGCAATGT
TCCAATCCTTGCTATTGCTGGAGACCAAGACCTCATTTGTCCACCTAAAGCTGTATATGAAACTGTGAAACTAATTCCCAGGCAGTTTGTTTCCTACAAAGTTGTTGGGA
AGCCCGGTGGTCCTCACTATGCTCACTATGATATTGTGGGAAGTCGTTTGCAAGCCCGTCAGGGGATGGATGCCGAACATGACGATGATACCGACTTCGAGACACTTCCG
ATGATTGCGCTGTTGGAAGATCGGAATCAGAATCCAAAAATAAGAGCGTCTCAGGGTTCACTGTTCTCCTTCCTAAACAAGAACGCTGCCATGGCCGAAGCAGAGGAAAC
CCGTCAGCCATTCCTTGCTGGGTTTTGTTCTTGGAGTTCTTCTGCTTTTTTTTCCTCGTCTTCTTTTTCTGCCAATCCGCAGAAGTTCGAATTACTGCCGGGACTGATAC
CTGCTGAGAAGCGGAGGCAGTGCATCCGAAGGGTAAAGATTCATTGGCGCCGCTCGTTGTACTGTTGGAAAAGGTCGAGAGTAGAAGGGAAGAAATGGTGGATTTGCAGC
GAGTGGATTGAATTATGGAAGGGATTTGAGGAGGAGGGATTGATGGGTTTGGATCTGATTCTGAAATGGGTTGGCTCAAATGGCCTCTGGGTCTTCTCAATTTGA
Protein sequenceShow/hide protein sequence
MEARHLECVMVPLGFAVLGSYHLWLIITIYRRPRSTVIGINADSRRQWVSTMISDPLKNGVLAVQTIRNNIMASTLMATTTITIGSLISVFVSSTSSTGKYHYIVLCFLV
AFLCNVQSIRYYAHASFLATLPACEGRKDYLAGTLNRGSLFWSLGLRAFYFAIPLFLWIFGPLSMFMSCCLITFVLYFLDFTRSSSYDPNEYAQKDEANSNDIESVGQPG
GNNCAANSCLHSSLLADIVLSFSALRCNANHGIYRYETTIGLIFQPLRLHRDAFWFHVFQALSFFTNCNVEWKAHPPPFLSDQPKFQFPIVSYVNIIRCLSIHRSNNQDS
TISNSTIDSLRVVFAAGGTGGHVYPAVAIADELKLTYPTAQILFLGTPNSTESAAIPSAGYEFDTVPATQLDRPLLSPRNVLLPFHLIKSVVAGYKKLRDFKPHIVIGTG
GYVSFPICLAARLINGVKLAIQEQNSVPGFANWVLSYFADVVFVVLNSTVECFPSKKKCLVCGNPVRLALRQYVSKAVARLHFFPRLGKTEDLEAKVLLVLGGSLGANAI
NIAMLNLYYQMLLENKNLYIIWQTGVETFDEMDSLVKNHPHLHLMPFMHSLHLAYAAADLIVSRAGAMTCSEILATGKPSILIPSPYEDEGHQFRNASIMADMAGSRVID
EDELDSTTLAIAIQEILGDASKMADMSERALRVSKPNAAAEIVLHIGKLINLSTGRESSNDTTDGVKTSQRVVGSTVGRKKSKLGQAGPTANSTSTHPHDCIANSQLRDV
TSSAFANRSERPRLSHLVLFFLTNAPQAITSLQASALYLLTPTPIVSSAEPMATLPLSRLDLCSIGRSDNRCGHLNPLRQRGKPEGCWASRRRNAVGMKSVRAFDGAFGL
HDKEKGLICTADELHYVSVPNSDWRLALWRYLPSLRAPSRNHPLLLLSGVGSNALGYDLSPEFSFARHMSNQGYDTWILEVRGSGLSANRVEMKENEQISSETLVRKIGH
YPEENRSLVTSASSEGSSISSKDGQTSTIATQLRQWNQNLMDIIEGAQQLGRLQPFNLQGVTSALENFQEQLDVYEKYDWDFDHYLEEDVPAAMEYIRNQSKPNDGKLLA
IGHSMGGILLYAMISRCSFNKVDPQLASVVTLGSSLDYRPSNSSLRLLLPLRDPAQAFNVPVIPIGPLLVIAHPLASRPPYVLSWLKGQISAEDMIHPTLLEKLVMNGFE
SVPAKVLVQLSTVFEEGGLRDRNRTFHYKDHLHQSNVPILAIAGDQDLICPPKAVYETVKLIPRQFVSYKVVGKPGGPHYAHYDIVGSRLQARQGMDAEHDDDTDFETLP
MIALLEDRNQNPKIRASQGSLFSFLNKNAAMAEAEETRQPFLAGFCSWSSSAFFSSSSFSANPQKFELLPGLIPAEKRRQCIRRVKIHWRRSLYCWKRSRVEGKKWWICS
EWIELWKGFEEEGLMGLDLILKWVGSNGLWVFSI