; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc01G19990 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc01G19990
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionUPF0261 protein
Genome locationClcChr01:32072577..32077164
RNA-Seq ExpressionClc01G19990
SyntenyClc01G19990
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR009215 - TIM-barrel domain, IGPS-like
IPR013785 - Aldolase-type TIM barrel
IPR015813 - Pyruvate/Phosphoenolpyruvate kinase-like domain superfamily
IPR044122 - UPF0261 domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0025722.1 UPF0261 protein [Cucumis melo var. makuwa]0.0e+0086.49Show/hide
Query:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS-RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK
        MAT  ++KTPRVFCIATADTKL+ELRFISD VR NLN FS   SFKVEVT+VDVSTS Q GI+SLDDF FVSRE+VLSCY+LTGN LPDDRGKAISIMSK
Subjt:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS-RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK

Query:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN
        ALESYLSKAKEDGIIAG IGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSD+ILFPSIVDVCGINSVSRVIL+NAG+AFAGMVVGRLE 
Subjt:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN

Query:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL
        LK+S DSN KPTVGLTMFGVTTPCVNAVKE+L+KEGYE+LVFHATGVGGKAMESLV+EGFIQGVLDITTTEVADYL+GGVMACDS+RFD IIEKRIPLVL
Subjt:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL

Query:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN
        SVGA+DMVNFGSKDTI SN+ GRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNS+A+V VCLP+NGVSALDAPGK FYDPEATATLIEELQ+ IQLNN
Subjt:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN

Query:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI
        DRQVKVYPYHINDPEFAE LVNSFLEIT KDT DSCGPK VLAETSQD QK  ISK NLSA+GNITYS SDFPEARPETL+RT+ IL NLKAQ  KGVPI
Subjt:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI

Query:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV
        +GAGAGTGISAKFEE GGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                       V
Subjt:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV

Query:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM
        VKTVPVLAGVCASDPFR+MDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEAL+MA AGADIIVAHM
Subjt:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM

Query:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM
        GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGV GFYGASSMERLPVEQAITSTVQEYKSIS+
Subjt:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM

KAE8647594.1 hypothetical protein Csa_003735 [Cucumis sativus]0.0e+0086.13Show/hide
Query:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS--RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMS
        MAT  ++KTPRVFCIATADTKL+ELRFIS  VR NLN FS    SFKVEVT+VDVSTS Q GIESLDDF FVSRE+VLSC +LTGNHLPDDRGKAISIMS
Subjt:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS--RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMS

Query:  KALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLE
        KALESYLSKAKEDGIIAG IGLGGSGGTSLISSAL+SLQIGIPKLIVSTVASGQTESYIGTSD+ILFPSIVDVCGIN VSRVIL+NAGAAFAGMVVGRL 
Subjt:  KALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLE

Query:  NLKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLV
         LK+S DSNEKPTVGLTMFGVTTPCVNAVKE+L+KEGYE+LVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYL+GGVMACDS+RFD IIEK+IPLV
Subjt:  NLKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLV

Query:  LSVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLN
        LSVGA+DMVNFGSKDTI SN+ GRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNS+A+V VCLPQNGVSALDA GK FYDPEATATLIEELQR IQLN
Subjt:  LSVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLN

Query:  NDRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVP
        NDRQVKVYPYHINDPEFAE+LVNSFLEITSKDT DSCGPK VLAETS+D +K  IS+SNLSA+ NITYS SDFPEARPETLRRT+MIL NLKAQ LKGVP
Subjt:  NDRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVP

Query:  ILGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPL
        I+GAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                       
Subjt:  ILGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPL

Query:  VVKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAH
        VVKTVPVLAGVCASDPFR+MDY LKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEAL+MAKAGADIIVAH
Subjt:  VVKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAH

Query:  MGLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM
        MGLTTSGSIGAKTALSMEESVVRVQAIADAA RIN NVLVLCHGGPISGP EAAFILKRTKGV GFYGASS+ERLPVEQAITSTVQ+YKSISM
Subjt:  MGLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM

TYK12597.1 UPF0261 protein [Cucumis melo var. makuwa]0.0e+0086.49Show/hide
Query:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS-RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK
        MAT  ++KTPRVFCIATADTKL+ELRFISD VR NLN FS   SFKVEVT+VDVSTS Q GI+SLDDF FVSRE VLSCY+LTGN LPDDRGKAISIMSK
Subjt:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS-RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK

Query:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN
        ALESYLSKAKEDGIIAG IGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSD+ILFPSIVDVCGINSVSRVIL+NAG+AFAGMVVGRLE 
Subjt:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN

Query:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL
        LK+S DSN KPTVGLTMFGVTTPCVNAVKE+L+KEGYE+LVFHATGVGGKAMESLV+EGFIQGVLDITTTEVADYL+GGVMACDS+RFD IIEKRIPLVL
Subjt:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL

Query:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN
        SVGA+DMVNFGSKDTI SN+ GRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNS+A+V VCLP+NGVSALDAPGK FYDPEATATLIEELQ+ IQLNN
Subjt:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN

Query:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI
        DRQVKVYPYHINDPEFAE LVNSFLEIT KDT DSCGPK VLAETSQD QK  ISK NLSA+GNITYS SDFPEARPETL+RT+ IL NLKAQ  KGVPI
Subjt:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI

Query:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV
        +GAGAGTGISAKFEE GGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                       V
Subjt:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV

Query:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM
        VKTVPVLAGVCASDPFR+MDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEAL+MA AGADIIVAHM
Subjt:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM

Query:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM
        GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGV GFYGASSMERLPVEQAITSTVQEYKSIS+
Subjt:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM

XP_038882760.1 toMV susceptible protein tm-1(GCR26) isoform X1 [Benincasa hispida]0.0e+0088.9Show/hide
Query:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSR-GSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK
        MAT GE+KTPRVFCIATADTKLEELRFISD VR+NLNCFSR  SFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSC DLTGNHLPDDRGKAISIMSK
Subjt:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSR-GSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK

Query:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN
        ALESYLSKAKEDGIIAG IGLGGSGGTSLISSAL+SLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVIL+NAGAAFAGMVVGRLE 
Subjt:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN

Query:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL
        LK+S DSNEKPTVGLTMFGVTTPCVNAVKE+L+KEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFD IIEKRIPLVL
Subjt:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL

Query:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN
        S+GALDMVNFGSKDTI SNY  RNIYEHNKQVSLMRTT EE+RKIAHFIADKINNS+A+V VCLPQNGVSALDAPGKPFYDPEATATLI+ELQRVIQLNN
Subjt:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN

Query:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI
        DRQVKVYPYHINDPEFAE+LVNSFLEITSKDT DSCGPK VLAETSQD  K SIS SNLSA+ NI+YSPSDFPEA+PETL+RT+MILENLKAQ LKGVPI
Subjt:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI

Query:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV
        +GAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                       V
Subjt:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV

Query:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM
        VKTVPVLAGVCASDPFR+MD+FLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM
Subjt:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM

Query:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISMR
        GLTTSGSIGAKTALSMEES + VQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGV GFYGASSMERLPVEQAIT+T+QEYKSISMR
Subjt:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISMR

XP_038882761.1 toMV susceptible protein tm-1(GCR26) isoform X2 [Benincasa hispida]0.0e+0086.38Show/hide
Query:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSR-GSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK
        MAT GE+KTPRVFCIATADTKLEELRFISD VR+NLNCFSR  SFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSC DLTGNHLPDDRGKAISIMSK
Subjt:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSR-GSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK

Query:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN
        ALESYLSKAKEDGIIAG IGLGGSGGTSLISSAL+SLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVIL+NAGAAFAGMVVGRLE 
Subjt:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN

Query:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL
        LK+S DSNEKPTVGLTMFGVTTPCVNAVKE+L+KEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFD IIEKRIPLVL
Subjt:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL

Query:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN
        S+GALDMVNFGSKDTI SNY  RNIYEHNKQVSLMRTT EE+RKIAHFIADKINNS+A+V VCLPQNGVSALDAPGKPFYDPEATATLI+ELQRVIQLNN
Subjt:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN

Query:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI
        DRQVKVYPYHINDPEFAE+LVNSFLEITSKDT DSCGPK VLAETSQD  K SIS SNLSA+ NI+YSPSDFPEA+P                       
Subjt:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI

Query:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV
         GAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                       V
Subjt:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV

Query:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM
        VKTVPVLAGVCASDPFR+MD+FLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM
Subjt:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM

Query:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISMR
        GLTTSGSIGAKTALSMEES + VQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGV GFYGASSMERLPVEQAIT+T+QEYKSISMR
Subjt:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISMR

TrEMBL top hitse value%identityAlignment
A0A5A7SKR1 UPF0261 protein0.0e+0086.49Show/hide
Query:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS-RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK
        MAT  ++KTPRVFCIATADTKL+ELRFISD VR NLN FS   SFKVEVT+VDVSTS Q GI+SLDDF FVSRE+VLSCY+LTGN LPDDRGKAISIMSK
Subjt:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS-RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK

Query:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN
        ALESYLSKAKEDGIIAG IGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSD+ILFPSIVDVCGINSVSRVIL+NAG+AFAGMVVGRLE 
Subjt:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN

Query:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL
        LK+S DSN KPTVGLTMFGVTTPCVNAVKE+L+KEGYE+LVFHATGVGGKAMESLV+EGFIQGVLDITTTEVADYL+GGVMACDS+RFD IIEKRIPLVL
Subjt:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL

Query:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN
        SVGA+DMVNFGSKDTI SN+ GRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNS+A+V VCLP+NGVSALDAPGK FYDPEATATLIEELQ+ IQLNN
Subjt:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN

Query:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI
        DRQVKVYPYHINDPEFAE LVNSFLEIT KDT DSCGPK VLAETSQD QK  ISK NLSA+GNITYS SDFPEARPETL+RT+ IL NLKAQ  KGVPI
Subjt:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI

Query:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV
        +GAGAGTGISAKFEE GGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                       V
Subjt:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV

Query:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM
        VKTVPVLAGVCASDPFR+MDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEAL+MA AGADIIVAHM
Subjt:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM

Query:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM
        GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGV GFYGASSMERLPVEQAITSTVQEYKSIS+
Subjt:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM

A0A5D3CKY2 UPF0261 protein0.0e+0086.49Show/hide
Query:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS-RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK
        MAT  ++KTPRVFCIATADTKL+ELRFISD VR NLN FS   SFKVEVT+VDVSTS Q GI+SLDDF FVSRE VLSCY+LTGN LPDDRGKAISIMSK
Subjt:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS-RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK

Query:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN
        ALESYLSKAKEDGIIAG IGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSD+ILFPSIVDVCGINSVSRVIL+NAG+AFAGMVVGRLE 
Subjt:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN

Query:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL
        LK+S DSN KPTVGLTMFGVTTPCVNAVKE+L+KEGYE+LVFHATGVGGKAMESLV+EGFIQGVLDITTTEVADYL+GGVMACDS+RFD IIEKRIPLVL
Subjt:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL

Query:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN
        SVGA+DMVNFGSKDTI SN+ GRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNS+A+V VCLP+NGVSALDAPGK FYDPEATATLIEELQ+ IQLNN
Subjt:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN

Query:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI
        DRQVKVYPYHINDPEFAE LVNSFLEIT KDT DSCGPK VLAETSQD QK  ISK NLSA+GNITYS SDFPEARPETL+RT+ IL NLKAQ  KGVPI
Subjt:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI

Query:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV
        +GAGAGTGISAKFEE GGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                       V
Subjt:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV

Query:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM
        VKTVPVLAGVCASDPFR+MDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEAL+MA AGADIIVAHM
Subjt:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM

Query:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM
        GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGV GFYGASSMERLPVEQAITSTVQEYKSIS+
Subjt:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM

A0A6J1BSZ4 uncharacterized protein LOC111005487 isoform X10.0e+0084.62Show/hide
Query:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSR-GSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK
        MAT G+SKT RVFCIATAD+KLEELRFISD VRSNLN FSR  SFKVEVTVVDVST R +GIESLDDFVFVSREDVLSCYD TGN LPDDRGKA+SIMSK
Subjt:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSR-GSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK

Query:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN
        ALE +LSKA+ DG+IAGAIGLGGSGGTSLISSALKSL IGIPKLIVSTVASGQTESYI TSDLIL PSIVDVCGINSVSRV+L+NAGAAFAGMVVGRLE 
Subjt:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN

Query:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL
         K+SR+ NEKPTVGLTMFGVTTPCVN+VKE+L KEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYL+GGVMAC SSRFD IIEKRIPLVL
Subjt:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL

Query:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN
        SVGALDMVNFGS+D I SN+ GRNIYEHNKQVSLMRTTV+EN+KIAHFIADKINNS  +V VCLPQNGVSALDAPGKPFYDPEAT+TL++ELQR IQ N+
Subjt:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN

Query:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI
        DRQVKVYPYHIND EFAE+LVNSFLEITSK+TKD C PK V  E SQD Q++SIS+SNLS HG+I YSPSDFP+ARP TLRRT+MILENLKAQ ++GVPI
Subjt:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI

Query:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV
        +GAGAGTGISAKFEE GGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                       V
Subjt:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV

Query:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM
        V TVPVLAGVCASDPFR+MDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFN+DEALEMAKAGADIIVAHM
Subjt:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM

Query:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISMR
        GLTTSGSIGAKTAL+MEESV+RVQAIADAAHRINPNVLVLCHGGPISGP+EAAFILKRTKGV GFYGASSMERLPVEQAITSTVQEYKSISMR
Subjt:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISMR

A0A6J1HG11 uncharacterized protein LOC1114631630.0e+0083.86Show/hide
Query:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSRGS-FKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK
        MAT GESKT RVFCIATADTKLEELRF+SD VRSNLN F+RGS +KVEVTVVDVSTS+QN IESLDDFVFVSR+ VLSCYD T NHLPDDRGKAISIMSK
Subjt:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSRGS-FKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK

Query:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN
        ALE +LSK +EDG+IAGAIGLGGSGGTSLISSALKSLQIGIPKLIVST+ASGQTESYIGTSDLILFPSIVDVCGINSVSRV+ +NA AAFAGMV+GRLE 
Subjt:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN

Query:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL
          +S + NEKPTVGLTMFGVTTPCVNAVKE+L+KEGYESLVFHATG+GGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFD IIEK+IPLVL
Subjt:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL

Query:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN
        SVGALDMVNFGS DTI SN+ GRNIYEHNKQV+LMRTTV+EN+KIA FIADK+NNS+A+V VCLPQ G+SALDAPGKP YDP+ATATLI+ELQ++IQ NN
Subjt:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN

Query:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI
        DR+V VYPYHINDPEF+E+LVNSFLEITSK+T DSCGPK VLAETSQD +K S S+SNLSA  NI YSPSD PE RPETLRRT+ ILENLKAQ +KGVPI
Subjt:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI

Query:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV
        +GAGAGTGISAKFEE GGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                       V
Subjt:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV

Query:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM
        VKTVPVLAGVCASDPFR+MDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIE AHKMGLLTTPYAFN+DEA EMAKAGADIIVAHM
Subjt:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM

Query:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISMR
        GLTTSGSIGAKTALS+EESV+RVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRT GV GFYGASSMERLPVEQAITST+QEYKSISMR
Subjt:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISMR

A0A6J1KM41 uncharacterized protein LOC1114969600.0e+0083.73Show/hide
Query:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSRGS-FKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK
        MAT GESKT RVFCIATADTKLEELRF+SD VRSNLN F+RGS  KVEVTVVDVSTS+QN IESLDDFVFVSREDVLSCYD T NHLPDDRGKAISIMSK
Subjt:  MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSRGS-FKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSK

Query:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN
        ALES+LSKA+EDG+IAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVA+GQTESYIGTSDLILFPSIVDVCGINSVSRV+ +NA AAFAGMV+GRLE 
Subjt:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN

Query:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL
          +S + NEKPTVGLTMFGVTTPCVNAVKE+L+K GYE LVFHATG+GGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFD IIEKRIPLVL
Subjt:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL

Query:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN
        SVGALDMVNFGS DTI SN+ GRNIYEHNKQV+LMRTT++EN+KIA FIADK+NNS+A+V VCLP+ G+SALDAPGKPFYDP+ATATLI+ELQ++IQ NN
Subjt:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN

Query:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI
         R+V VYPYHINDPEFA  LVNSFLEITSK+T  SCGPK VLAETSQD +K+S S+SNLSA   I YSPSDFPE RPET RRT+ ILENLKAQ LKGVPI
Subjt:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPI

Query:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV
        +GAGAGTGISAKFEE GGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                       V
Subjt:  LGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLV

Query:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM
        VKTVPVLAGVCASDPFR+MDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIE AHKMGLLTTPYAFN+DEA EMAKAGADIIVAHM
Subjt:  VKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHM

Query:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISMR
        GLTTSGSIGAKTALS+EESV+RVQ IADAAHRINPNVLVLCHGGPISGPTEAAFILKRT GV GFYGASSMERLPVEQAITST+QEYKSISMR
Subjt:  GLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISMR

SwissProt top hitse value%identityAlignment
A4FLF6 UPF0261 protein SACE_56967.6e-8843.93Show/hide
Query:  FCIATADTKLEELRFISDYVRSNLNCFSRGSFKVEVTVVDVSTS-RQNGIESLDDFVFVSREDVLSCYDLTGNHLPD--------DRGKAISIMSKALES
        + + T DTK  EL +++  V       +RG   V V  VDVSTS  + G          +R   +   ++ G+H PD        DRG A++ M+ ALE 
Subjt:  FCIATADTKLEELRFISDYVRSNLNCFSRGSFKVEVTVVDVSTS-RQNGIESLDDFVFVSREDVLSCYDLTGNHLPD--------DRGKAISIMSKALES

Query:  YLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLENLKNS
        +L+     G+IA    LGGSGGT+L + A+++L +G+PK++VSTVASG   SY+  +D+ +FPS+ DV G+N +SR +L NA  A AG + G + +    
Subjt:  YLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLENLKNS

Query:  RDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVLSVGA
          + +KP V LTMFGVTTPCV  V  +L +  Y+ LVFHATG GG+AME LV +G I  VLD+TTTEV D + GGVM+    R D I    +P V S GA
Subjt:  RDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVLSVGA

Query:  LDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNNDRQV
        LDMVNFG+ +T+   Y+ RN+Y HN QV+LMRTT +E R+I  FIA K+N     V   LP+ GVS LDAPG+PF+DP+A   L E L+  ++ + DR++
Subjt:  LDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNNDRQV

Query:  KVYPYHINDPEFAEILVNSFLEITSKDT
           P++INDP FA+ ++ +F E+    T
Subjt:  KVYPYHINDPEFAEILVNSFLEITSKDT

A7M6E7 ToMV resistance protein Tm-1(GCR237)2.7e-29568.22Show/hide
Query:  TGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS-RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDL---TGNHLPDDRGKAISIMSK
        T +S +PRVFCI TADTK +ELRF+S++VRS+LN FS + SFKV VTVVDVSTS +    S  DF FV  +DVLSC+ L   T     D RG AI+IMSK
Subjt:  TGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS-RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDL---TGNHLPDDRGKAISIMSK

Query:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN
        ALE++LS A ++  +AG IGLGGSGGTSL+SSA +SL IGIPK+I+STVASGQTESYIGTSDL+LFPS+VD+CGIN+VS+V+L+NAGAAFAGMV+GRLE+
Subjt:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN

Query:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL
         K    +N K TVG+TMFGVTTPCVNAVKE+L+KEGYE+LVFHATGVGG+AME LVR GFIQGVLDITTTEVADY++GGVMACDSSRFD I+EK+IPLVL
Subjt:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL

Query:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN
        SVGALDMVNFG K TI   +Q R I+EHN+QVSLMRTTV EN+K A FIA+K+N +++ VCVCLP+ GVSALDAPGK FYDPEAT+ L  ELQ +++ N 
Subjt:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN

Query:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQK-HSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVP
          QVKV PYHIND EFA  LV+SFLEI+ K     C P    +++ QD Q  +++ +   S +G      +DFP A+PETL++  +IL+ LK Q  KG P
Subjt:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQK-HSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVP

Query:  ILGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPL
        I+GAGAGTGISAKFEE GGVDLIV+YNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                       
Subjt:  ILGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPL

Query:  VVKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAH
        VVK V VLAGVCA+DPFR+MD FLKQ+ES+GF GVQNFPTVGLFDGNFRQNLEETGMGYGLEV+MI  AH+MGLLTTPYAF  DEA+ MA+AGADIIVAH
Subjt:  VVKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAH

Query:  MGLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM
        MGLTTSGSIGAKTA+S+EESV  VQAIADA HRI P+ +VLCHGGPIS P EAA++LKRT GV GFYGASSMERLPVEQAIT+TVQ+YKSISM
Subjt:  MGLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM

A7M6E8 ToMV susceptible protein tm-1(GCR26)2.5e-29668.1Show/hide
Query:  TGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS-RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCY---DLTGNHLPDDRGKAISIMSK
        T +S +PRVFCI TADTK +ELRF+S++VRS+LN FS + SFKV VTVVDVSTSR+    S  DF FV  +DVLSCY   + T    PD RG+AI+IM+K
Subjt:  TGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFS-RGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCY---DLTGNHLPDDRGKAISIMSK

Query:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN
        ALE++LSKA  +  +AG IGLGGSGGTSL+SSA +SL IGIPK+I+STVASGQTESYIGTSDL+LFPS+VD+CGIN+VS+V+L+NAGAAFAGMV+GRLE+
Subjt:  ALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLEN

Query:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL
         K    +N K TVG+TMFGVTTPCVNAVKE+L+KEGYE+LVFHATGVGG+AME LVR GFIQGVLDITTTEVADY++GGVMACDSSRFD I+EK+IPLVL
Subjt:  LKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVL

Query:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN
        SVGALDMVNFG K TI   +Q R I++HN+QVSLM TTV EN+K A FIA+K+N +++ VCVCLP+ GVSALDAPGK FYDPEAT+ L  ELQ +++ N 
Subjt:  SVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNN

Query:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQK-HSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVP
          QVKVYPYHIND EFA  LV+SFLE++ K     C  +   +++ Q  Q  +++ +   S +G      +DFP A+PETL++  +IL+ LK Q  KG P
Subjt:  DRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQK-HSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVP

Query:  ILGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPL
        I+GAGAGTGISAKFEE GGVDLIV+YNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLP                                       
Subjt:  ILGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPL

Query:  VVKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAH
        VVK V VLAGVCA+DPFR+MD FLKQ+ES+GF GVQNFPTVGLFDGNFRQNLEETGMGYGLEV+MI  AH+MGLLTTPYAF  DEA+ MA+AGADIIVAH
Subjt:  VVKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAH

Query:  MGLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM
        MGLTTSGSIGAKTA+S+EESV  VQAIADA HRINP+ +VLCHGGPIS P EAA++LKRT GV GFYGASSMERLPVEQAIT+TVQ+YKSISM
Subjt:  MGLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSMERLPVEQAITSTVQEYKSISM

P55606 UPF0261 protein y4oU2.2e-8743.65Show/hide
Query:  VFCIATADTKLEELRFISDYVRSNLNCFSRGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPD---DRGKAISIMSKALESYLSKA
        V+ + T DTK  ELR++ D +R    C        +  +VDVS S      S    V V   +V   +       P+   DRGKA++ M++AL  ++ ++
Subjt:  VFCIATADTKLEELRFISDYVRSNLNCFSRGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPD---DRGKAISIMSKALESYLSKA

Query:  KEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLENLKNSRDSNE
        + D  + G IG GG+GGT+LI+ AL++L IG PK++VSTVASG    Y+G +D+ +  S+ DV G+N +SRV+LANA  + AGMV+ ++   K     +E
Subjt:  KEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLENLKNSRDSNE

Query:  KPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVLSVGALDMVN
        +P +GLTMFGVTTPCV AV  + ++  ++ LVFHATG GG++ E L     + G +D++TTEV DYL+GGV  C + RF      R+P V S GALDMVN
Subjt:  KPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVLSVGALDMVN

Query:  FGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNNDRQVKVYPY
        FG+ DT+ S ++ R ++ HN QV+LMRTT EE  +I  +IA+++N     V   +P+ GVSA+DAPG+PF+DPEA + L   L+R ++    RQ+   P 
Subjt:  FGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNNDRQVKVYPY

Query:  HINDPEFAEILVNSFLE
        HINDP+FAE+LV +F E
Subjt:  HINDPEFAEILVNSFLE

Q981G1 UPF0261 protein mll93883.1e-8943.99Show/hide
Query:  VFCIATADTKLEELRFISDYVRSNLNCFSRGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCY--DLTGNHLPDDRGKAISIMSKALESYLSKAK
        V+ + T DTK  ELR++ D +R +  C        +V +VDVS S  +  E+ D  V V   +V  C+   L    L  DRGKA++ MS+AL  Y+ +++
Subjt:  VFCIATADTKLEELRFISDYVRSNLNCFSRGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCY--DLTGNHLPDDRGKAISIMSKALESYLSKAK

Query:  EDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLENLKNSRDSNEK
         D  + G IG GGSGGT+LI+ A+++L IG+PK++VSTVASG    Y+G +D+ +  S+ DV G+N +SRV+LANA  + AGMV+ ++        ++E+
Subjt:  EDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLENLKNSRDSNEK

Query:  PTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVLSVGALDMVNF
        P +GLTMFGVTTPCV AV  + ++  ++ LVFHATG GG++ E L   G + G +D++TTEV DYL+GGV  C   RF      ++P V S GALDMVNF
Subjt:  PTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVLSVGALDMVNF

Query:  GSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNNDRQVKVYPYH
        G+ +T+ S ++ R ++ HN QV+LMRT  EE  +I  +I +++N     V   +P+ GVSA+DAPG+PF+DPEA A L   L+R ++  + RQ+   P H
Subjt:  GSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNNDRQVKVYPYH

Query:  INDPEFAEILVNSFLE
        INDP+FA++LV +  E
Subjt:  INDPEFAEILVNSFLE

Arabidopsis top hitse value%identityAlignment
AT5G66420.1 CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP033271 (InterPro:IPR008322), TIM-barrel signal transduction protein, predicted (InterPro:IPR009215); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).2.7e-27470.88Show/hide
Query:  MSKALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGR
        MSK LE++L +A ED  +AGAIGLGGSGGTSLISSA +SL IGIPK+IVSTVASGQTE Y+GTSDL+L PS+VDVCGINSVSRV+ +NAGA+FAGMVVGR
Subjt:  MSKALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGR

Query:  LENLKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIP
        LE  ++S   N K TVG+TMFGVTTPCVNAV+E L +EGYE+LVFHATGVGG+AMESLV+EGFIQGV+DITTTEVAD+L+GGVMACDSSRFD+ IEK IP
Subjt:  LENLKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIP

Query:  LVLSVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQ
        LVLSVGALDMVNFG KDTI S++Q R I+ HN+QVSL+RTT EEN+K A FIADK+N ST++V VC+P+ G+SALDAPGKPF DPEAT  LI ELQ +IQ
Subjt:  LVLSVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQ

Query:  LNNDRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKG
         N+DRQV +Y +HINDPEFAE LV SFLEI  K        +    + S         +S L     I YSP +FP A+PETL RTQ IL  L+ Q  KG
Subjt:  LNNDRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKG

Query:  VPILGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYT
        +PI+G GAGTGISAKFEE GG+DLIV+YNSGRFRMAGRGSLAGLLPFADANA+VLEMANEVLP                                     
Subjt:  VPILGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYT

Query:  PLVVKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIV
          VVK VPVLAGVCA+DPFR+MDYFLKQ+ESIGF GVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMI  AHKMGLLTTPYAFN  E  EMAKAGADIIV
Subjt:  PLVVKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIV

Query:  AHMGLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKG-VDGFYGASSMERLPVEQAITSTVQEYKSISMR
        AHMGLTTSG+IGAKTA+S+EESVVRVQAIADAA R NP+++VLCHGGPISGP EA F+LKRT+G V GFYGASSMERLPVEQAIT+TVQ+YKSIS++
Subjt:  AHMGLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKG-VDGFYGASSMERLPVEQAITSTVQEYKSISMR

AT5G66420.2 LOCATED IN: cellular_component unknown; EXPRESSED IN: 23 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Uncharacterised conserved protein UCP033271 (InterPro:IPR008322), TIM-barrel signal transduction protein, predicted (InterPro:IPR009215); Has 1232 Blast hits to 1230 proteins in 311 species: Archae - 18; Bacteria - 959; Metazoa - 1; Fungi - 84; Plants - 42; Viruses - 0; Other Eukaryotes - 128 (source: NCBI BLink).7.6e-30168.43Show/hide
Query:  ESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSR-GSFKVEVTVVDVST-SRQNGIESLDDFVFVSREDVLSCYDLTGNH-----LPDDRGKAISIMS
        + +T RVFC+ TADTKL+ELRF++  VRSN+  FS+  S KVEV +VDVS  + Q  I+++ DF FV+RE+VLSCY  +        LPDDRG+A+ +MS
Subjt:  ESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSR-GSFKVEVTVVDVST-SRQNGIESLDDFVFVSREDVLSCYDLTGNH-----LPDDRGKAISIMS

Query:  KALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLE
        K LE++L +A ED  +AGAIGLGGSGGTSLISSA +SL IGIPK+IVSTVASGQTE Y+GTSDL+L PS+VDVCGINSVSRV+ +NAGA+FAGMVVGRLE
Subjt:  KALESYLSKAKEDGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLE

Query:  NLKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLV
          ++S   N K TVG+TMFGVTTPCVNAV+E L +EGYE+LVFHATGVGG+AMESLV+EGFIQGV+DITTTEVAD+L+GGVMACDSSRFD+ IEK IPLV
Subjt:  NLKNSRDSNEKPTVGLTMFGVTTPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLV

Query:  LSVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLN
        LSVGALDMVNFG KDTI S++Q R I+ HN+QVSL+RTT EEN+K A FIADK+N ST++V VC+P+ G+SALDAPGKPF DPEAT  LI ELQ +IQ N
Subjt:  LSVGALDMVNFGSKDTISSNYQGRNIYEHNKQVSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLN

Query:  NDRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVP
        +DRQV +Y +HINDPEFAE LV SFLEI  K        +    + S         +S L     I YSP +FP A+PETL RTQ IL  L+ Q  KG+P
Subjt:  NDRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFVLAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVP

Query:  ILGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPL
        I+G GAGTGISAKFEE GG+DLIV+YNSGRFRMAGRGSLAGLLPFADANA+VLEMANEVLP                                       
Subjt:  ILGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIVLEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPL

Query:  VVKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAH
        VVK VPVLAGVCA+DPFR+MDYFLKQ+ESIGF GVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMI  AHKMGLLTTPYAFN  E  EMAKAGADIIVAH
Subjt:  VVKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLEVKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAH

Query:  MGLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKG-VDGFYGASSMERLPVEQAITSTVQEYKSISMR
        MGLTTSG+IGAKTA+S+EESVVRVQAIADAA R NP+++VLCHGGPISGP EA F+LKRT+G V GFYGASSMERLPVEQAIT+TVQ+YKSIS++
Subjt:  MGLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKG-VDGFYGASSMERLPVEQAITSTVQEYKSISMR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACAACAGGTGAAAGCAAAACTCCCCGAGTTTTCTGCATCGCGACGGCTGATACAAAGCTCGAGGAGCTCCGTTTCATTTCCGATTACGTCCGATCCAACCTCAA
TTGCTTCTCTAGAGGTTCTTTCAAGGTTGAAGTGACGGTTGTTGATGTTTCTACCAGCCGTCAGAATGGGATAGAGAGTTTGGATGATTTCGTCTTTGTGTCGAGGGAGG
ATGTTCTCTCTTGCTATGATCTCACTGGAAATCATCTTCCCGATGACCGGGGAAAAGCAATCTCGATAATGAGTAAAGCACTTGAATCTTATCTTAGTAAAGCCAAGGAG
GATGGGATAATTGCTGGAGCTATAGGACTTGGAGGCAGTGGAGGGACGTCTCTCATATCCTCTGCGTTAAAATCTCTTCAAATTGGAATACCTAAGCTTATCGTCTCAAC
GGTTGCGAGTGGTCAGACAGAATCTTATATCGGGACATCCGATCTGATACTTTTCCCGTCTATAGTGGATGTGTGTGGGATTAATAGTGTCAGTAGGGTCATTTTAGCTA
ATGCTGGTGCTGCATTTGCTGGTATGGTGGTCGGAAGACTTGAGAATTTGAAAAATTCGCGCGATTCCAACGAAAAACCAACAGTAGGTTTAACAATGTTTGGCGTTACG
ACTCCTTGTGTCAATGCTGTCAAAGAAAAATTGATTAAAGAAGGCTACGAGAGCCTTGTTTTCCACGCTACTGGGGTAGGAGGGAAGGCAATGGAATCTCTGGTCAGAGA
GGGATTTATCCAGGGAGTCTTGGACATCACAACAACAGAGGTAGCAGACTATTTAATGGGAGGTGTCATGGCTTGTGACAGTTCTCGCTTTGATGTCATCATAGAGAAGA
GAATCCCATTAGTTTTAAGTGTAGGAGCACTGGATATGGTGAACTTTGGATCCAAAGATACAATATCCTCCAATTATCAGGGAAGGAATATATATGAACATAATAAACAG
GTTTCGCTTATGCGAACTACGGTGGAGGAGAACAGAAAAATTGCTCATTTTATAGCTGATAAGATTAACAATTCAACAGCACAGGTTTGTGTATGCCTGCCACAGAATGG
CGTATCCGCTCTGGATGCACCAGGGAAGCCATTTTATGATCCTGAGGCTACTGCTACTCTTATAGAGGAACTACAGAGGGTTATTCAGTTAAATAATGATAGGCAGGTGA
AGGTATATCCTTATCATATTAATGATCCTGAGTTTGCTGAGATATTGGTTAACTCATTCTTGGAAATTACTTCAAAAGATACGAAGGACTCATGTGGTCCAAAATTTGTT
TTAGCTGAAACTAGTCAAGATTTTCAGAAGCACTCCATTTCTAAGTCCAATTTGTCTGCCCATGGAAACATTACATACAGTCCTAGCGACTTCCCAGAGGCAAGACCAGA
AACTTTGCGAAGGACCCAAATGATATTGGAGAACTTGAAAGCTCAAAAACTTAAAGGAGTGCCCATACTAGGGGCTGGTGCAGGCACTGGGATATCTGCTAAGTTTGAAG
AAGTTGGTGGTGTTGATCTGATAGTAGTGTACAATTCAGGGCGCTTTCGCATGGCTGGGAGGGGTTCTTTAGCAGGCTTGCTGCCTTTTGCCGATGCTAATGCCATAGTG
CTTGAAATGGCCAATGAAGTATTGCCTTCCAGCATTGTGATATATCCAGTTGTTCCGGAGATCTTGAATGTGGAGCATGGGTATATCAGTAGATTATTTGGGAAGTCTCC
CGATTACCTGACTAAGGGATGTTATACCCCTTTGGTGGTGAAGACAGTTCCCGTGCTTGCTGGAGTATGTGCATCTGATCCATTTCGTCAAATGGATTACTTTCTAAAGC
AGGTGGAGTCAATTGGATTCTCTGGAGTGCAGAATTTTCCTACTGTTGGATTGTTTGATGGTAACTTCAGACAAAACCTTGAAGAAACAGGAATGGGATATGGATTGGAG
GTCAAGATGATTGAGAGGGCTCACAAAATGGGTCTCTTGACAACCCCATATGCTTTTAACGAAGATGAAGCCTTGGAAATGGCAAAAGCCGGTGCAGACATTATAGTTGC
CCACATGGGACTCACTACCTCTGGATCTATTGGAGCCAAAACTGCCCTATCAATGGAGGAAAGTGTAGTCCGTGTACAGGCTATAGCAGATGCTGCTCATAGGATCAATC
CTAATGTTTTAGTGCTCTGTCATGGAGGTCCTATATCGGGCCCTACTGAAGCAGCATTCATTCTGAAGAGAACGAAGGGAGTTGATGGATTTTATGGTGCATCGAGCATG
GAGAGGCTACCAGTGGAACAAGCAATAACTAGCACTGTCCAAGAGTACAAATCAATTTCAATGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACAACAGGTGAAAGCAAAACTCCCCGAGTTTTCTGCATCGCGACGGCTGATACAAAGCTCGAGGAGCTCCGTTTCATTTCCGATTACGTCCGATCCAACCTCAA
TTGCTTCTCTAGAGGTTCTTTCAAGGTTGAAGTGACGGTTGTTGATGTTTCTACCAGCCGTCAGAATGGGATAGAGAGTTTGGATGATTTCGTCTTTGTGTCGAGGGAGG
ATGTTCTCTCTTGCTATGATCTCACTGGAAATCATCTTCCCGATGACCGGGGAAAAGCAATCTCGATAATGAGTAAAGCACTTGAATCTTATCTTAGTAAAGCCAAGGAG
GATGGGATAATTGCTGGAGCTATAGGACTTGGAGGCAGTGGAGGGACGTCTCTCATATCCTCTGCGTTAAAATCTCTTCAAATTGGAATACCTAAGCTTATCGTCTCAAC
GGTTGCGAGTGGTCAGACAGAATCTTATATCGGGACATCCGATCTGATACTTTTCCCGTCTATAGTGGATGTGTGTGGGATTAATAGTGTCAGTAGGGTCATTTTAGCTA
ATGCTGGTGCTGCATTTGCTGGTATGGTGGTCGGAAGACTTGAGAATTTGAAAAATTCGCGCGATTCCAACGAAAAACCAACAGTAGGTTTAACAATGTTTGGCGTTACG
ACTCCTTGTGTCAATGCTGTCAAAGAAAAATTGATTAAAGAAGGCTACGAGAGCCTTGTTTTCCACGCTACTGGGGTAGGAGGGAAGGCAATGGAATCTCTGGTCAGAGA
GGGATTTATCCAGGGAGTCTTGGACATCACAACAACAGAGGTAGCAGACTATTTAATGGGAGGTGTCATGGCTTGTGACAGTTCTCGCTTTGATGTCATCATAGAGAAGA
GAATCCCATTAGTTTTAAGTGTAGGAGCACTGGATATGGTGAACTTTGGATCCAAAGATACAATATCCTCCAATTATCAGGGAAGGAATATATATGAACATAATAAACAG
GTTTCGCTTATGCGAACTACGGTGGAGGAGAACAGAAAAATTGCTCATTTTATAGCTGATAAGATTAACAATTCAACAGCACAGGTTTGTGTATGCCTGCCACAGAATGG
CGTATCCGCTCTGGATGCACCAGGGAAGCCATTTTATGATCCTGAGGCTACTGCTACTCTTATAGAGGAACTACAGAGGGTTATTCAGTTAAATAATGATAGGCAGGTGA
AGGTATATCCTTATCATATTAATGATCCTGAGTTTGCTGAGATATTGGTTAACTCATTCTTGGAAATTACTTCAAAAGATACGAAGGACTCATGTGGTCCAAAATTTGTT
TTAGCTGAAACTAGTCAAGATTTTCAGAAGCACTCCATTTCTAAGTCCAATTTGTCTGCCCATGGAAACATTACATACAGTCCTAGCGACTTCCCAGAGGCAAGACCAGA
AACTTTGCGAAGGACCCAAATGATATTGGAGAACTTGAAAGCTCAAAAACTTAAAGGAGTGCCCATACTAGGGGCTGGTGCAGGCACTGGGATATCTGCTAAGTTTGAAG
AAGTTGGTGGTGTTGATCTGATAGTAGTGTACAATTCAGGGCGCTTTCGCATGGCTGGGAGGGGTTCTTTAGCAGGCTTGCTGCCTTTTGCCGATGCTAATGCCATAGTG
CTTGAAATGGCCAATGAAGTATTGCCTTCCAGCATTGTGATATATCCAGTTGTTCCGGAGATCTTGAATGTGGAGCATGGGTATATCAGTAGATTATTTGGGAAGTCTCC
CGATTACCTGACTAAGGGATGTTATACCCCTTTGGTGGTGAAGACAGTTCCCGTGCTTGCTGGAGTATGTGCATCTGATCCATTTCGTCAAATGGATTACTTTCTAAAGC
AGGTGGAGTCAATTGGATTCTCTGGAGTGCAGAATTTTCCTACTGTTGGATTGTTTGATGGTAACTTCAGACAAAACCTTGAAGAAACAGGAATGGGATATGGATTGGAG
GTCAAGATGATTGAGAGGGCTCACAAAATGGGTCTCTTGACAACCCCATATGCTTTTAACGAAGATGAAGCCTTGGAAATGGCAAAAGCCGGTGCAGACATTATAGTTGC
CCACATGGGACTCACTACCTCTGGATCTATTGGAGCCAAAACTGCCCTATCAATGGAGGAAAGTGTAGTCCGTGTACAGGCTATAGCAGATGCTGCTCATAGGATCAATC
CTAATGTTTTAGTGCTCTGTCATGGAGGTCCTATATCGGGCCCTACTGAAGCAGCATTCATTCTGAAGAGAACGAAGGGAGTTGATGGATTTTATGGTGCATCGAGCATG
GAGAGGCTACCAGTGGAACAAGCAATAACTAGCACTGTCCAAGAGTACAAATCAATTTCAATGAGATGATATTTTGAAGTCTGGCCCTTGACGAGCTTCATACTTTGGCA
TGTATTTTGTATCTATTACTACTCACACGCTTCTGAATTGTGATGGGCGGAAACGTACCTTTTACCTTGG
Protein sequenceShow/hide protein sequence
MATTGESKTPRVFCIATADTKLEELRFISDYVRSNLNCFSRGSFKVEVTVVDVSTSRQNGIESLDDFVFVSREDVLSCYDLTGNHLPDDRGKAISIMSKALESYLSKAKE
DGIIAGAIGLGGSGGTSLISSALKSLQIGIPKLIVSTVASGQTESYIGTSDLILFPSIVDVCGINSVSRVILANAGAAFAGMVVGRLENLKNSRDSNEKPTVGLTMFGVT
TPCVNAVKEKLIKEGYESLVFHATGVGGKAMESLVREGFIQGVLDITTTEVADYLMGGVMACDSSRFDVIIEKRIPLVLSVGALDMVNFGSKDTISSNYQGRNIYEHNKQ
VSLMRTTVEENRKIAHFIADKINNSTAQVCVCLPQNGVSALDAPGKPFYDPEATATLIEELQRVIQLNNDRQVKVYPYHINDPEFAEILVNSFLEITSKDTKDSCGPKFV
LAETSQDFQKHSISKSNLSAHGNITYSPSDFPEARPETLRRTQMILENLKAQKLKGVPILGAGAGTGISAKFEEVGGVDLIVVYNSGRFRMAGRGSLAGLLPFADANAIV
LEMANEVLPSSIVIYPVVPEILNVEHGYISRLFGKSPDYLTKGCYTPLVVKTVPVLAGVCASDPFRQMDYFLKQVESIGFSGVQNFPTVGLFDGNFRQNLEETGMGYGLE
VKMIERAHKMGLLTTPYAFNEDEALEMAKAGADIIVAHMGLTTSGSIGAKTALSMEESVVRVQAIADAAHRINPNVLVLCHGGPISGPTEAAFILKRTKGVDGFYGASSM
ERLPVEQAITSTVQEYKSISMR