; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr011591 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr011591
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPyrimidine-specific ribonucleoside hydrolase RihA
Genome locationtig00153002:382328..405841
RNA-Seq ExpressionSgr011591
SyntenySgr011591
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR001910 - Inosine/uridine-preferring nucleoside hydrolase domain
IPR036452 - Ribonucleoside hydrolase-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588452.1 hypothetical protein SDJN03_17017, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0076.24Show/hide
Query:  GFDLVCVWRDQ-----MMMKMMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITIN
        GF++V  WRD      MM  +  GG +WKS  LICGVV V  M GS++WL  S  S       P RI+VDTDVDTDDV A+LYLLKQP+SLF LQAITIN
Subjt:  GFDLVCVWRDQ-----MMMKMMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITIN

Query:  GNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQNPTP------------------TFGKRKYTPL
        GNGWS+ GHAV+HLYD+LFMMGRDDIP+GVGGEGGISPNA+IS+HVGGYLPLIDQ  +T          P                      GKR+Y P+
Subjt:  GNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQNPTP------------------TFGKRKYTPL

Query:  KQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLH
        KQPTAQQV+KDAIS GPTTVF+MGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIR  CS +AD SHGKTCNNIGNLWPPN NPYAEFNIFGDPFAAYTVLH
Subjt:  KQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLH

Query:  SGIPVTLVPLDATSTIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFER----------------------GGNNAYSKMEYLNITIVTSNE
        SGIP+TLVPLDATSTIPVD+ V+LAFEQRQNTYEA+YCFQSLKMA DTWSG GFFE                       GGNNAYSKMEYLNITIVTSNE
Subjt:  SGIPVTLVPLDATSTIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFER----------------------GGNNAYSKMEYLNITIVTSNE

Query:  PYGILDGSNPLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQT
        PYGI +GSNPLVDG L+PKFG+QKNGVHSGHVQTGMLDPFCL +SE+GKC+DGYTKEA+GSESV+VLVAV AKS+L+TN  I+K FY+SF D +NSPRQ+
Subjt:  PYGILDGSNPLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQT

Query:  GRFDFRAQFPNYKEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGE
        GRFDFRAQFP YKEV YRPKF K L GKP++FDMDMSTGDFLTLLYLLK PVE+IDLKGIIISPNGWATAATIDVVYD+LHMMGRDDI VGLGDVFAIGE
Subjt:  GRFDFRAQFPNYKEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGE

Query:  AHPSFPAIGDCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQII
        AHPSFP+IGDCKYIKA+PHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVK GA RDTDHP LRQMSA+DVW  VV+SLDP VK+TVLTNGPLTNLAQI+
Subjt:  AHPSFPAIGDCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQII

Query:  RSKAISPRIQEVYITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYH
        RSKAIS RIQEVYITGGHI+   DKGNVFTIPSNEYAEFNFFLDP AAE V+GSGLNITLIPLN QR VSSF KIL++LK  N T EA F+RRLL RLY 
Subjt:  RSKAISPRIQEVYITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYH

Query:  LKQHHHQYHHVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQK
        L+Q + QYHHVDMFLGEVLGAVSL G+H NLK+AFS+K +K++T GGESKVGQTIIDEKEGKWVRVLESV E LAFYE LANAL D+ Q+AVIGSFEEQK
Subjt:  LKQHHHQYHHVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQK

Query:  RLWSA
         LWSA
Subjt:  RLWSA

XP_011659920.1 uncharacterized protein LOC101212769 isoform X1 [Cucumis sativus]0.0e+0079.09Show/hide
Query:  MMKMMR-GGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYD
        MM+MMR   G    SWLICGVVG+VVMLGSVVWLANSSS +S     P RI+VDTDVDTDD+FALLYLLKQP+SLFHLQ ITINGNGWSDAGHAV+HLYD
Subjt:  MMKMMR-GGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYD

Query:  VLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQ--------NPTPTF---------GKRKYTPLKQPTAQQVLKDAISEGP
        +LFMMGRDDIPVGVGG+GGISPNA+IS+++GGYLPLIDQ  +T      Y Q        N    F         GKR+Y P+KQPTAQQV+KDAIS GP
Subjt:  VLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQ--------NPTPTF---------GKRKYTPLKQPTAQQVLKDAISEGP

Query:  TTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIP
        T VFLMGAHTNLAIFL++NPHLKKNIKH+YAMGGAIR ICSESAD SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIP
Subjt:  TTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIP

Query:  VDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFE----------------------RGGNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDGHLV
        V++ VFLAFEQRQNTYEA+YCFQSLKMA DTW  +GFFE                       GGNNA+SKMEYLNITIVTSN+PYGI DGSNPLVDGHL+
Subjt:  VDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFE----------------------RGGNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDGHLV

Query:  PKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYKEVQY
        P  G Q NGVHSGHVQTGMLDPFCL ++ +GKCQDGYTKE+DGSESVQVLVAVEAKST+DTN+SI+K FYISF DVLNSPRQTGRFDFRAQFPNY+EV Y
Subjt:  PKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYKEVQY

Query:  RPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKYIKAL
        RPKF K+L GKP++FDMDMSTGDFLTLLYLLK P+E+I+LKGIIISPNGWATAATIDVVYD+LHMMGRDDI VGLGDVFAIGEAHP +P IGDCKY KA+
Subjt:  RPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKYIKAL

Query:  PHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRIQEVYITGG
        P GSGG LDSDTLYGFARDLPRSPRRYTAENSVKFGA RDTDHPELRQMS LDVWK VV+SL+   K+TVLTNGPLTNLA+II+ KAIS RI+EVYITGG
Subjt:  PHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRIQEVYITGG

Query:  HINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHVDMFLGE
        H+N+G DKGN+FTIPSNEY+EFNFFLDP AA+ V  SGLNITLIPLNVQRRVSSF KIL++LKLRNRTPEA  SRRLL RLY LKQ HHQYHHVDMFLGE
Subjt:  HINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHVDMFLGE

Query:  VLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFE
        VLGAVSLAGKHLNLKQ FS KPLK+++NGGESKVGQTIIDEK+GKWVRVLES+ EPLAFYED+ANAL DEKQ+AVI SFE
Subjt:  VLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFE

XP_022154597.1 uncharacterized protein LOC111021821 [Momordica charantia]0.0e+0082.03Show/hide
Query:  MMMKMM---RGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSH
        MMM+M+   RGG YWK SW ICGVVGVVVMLGSVVWL NSS  S       +RIVVDTDVDTDDVFA+ YLLKQPTSLFHLQAITINGNGWS+AGHAV+H
Subjt:  MMMKMM---RGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSH

Query:  LYDVLFMMGRDDIPVGVGGEGGISPN--ASISSH--VGGYLPLIDQVTNT---------LPSAYTYSQNPTPTFGKRK---------YTPLKQPTAQQVL
        +YD+LFMMGRDDIPVGVGGEGGISPN   SIS H  VGG+LPLIDQ  +T         +P            FG RK         YTP+KQPTAQQVL
Subjt:  LYDVLFMMGRDDIPVGVGGEGGISPN--ASISSH--VGGYLPLIDQVTNT---------LPSAYTYSQNPTPTFGKRK---------YTPLKQPTAQQVL

Query:  KDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVP
        KDAIS GPTTVFLMG HTNLAIFLMTNPHLKKNIKHIYAMGGAIR ICS   D SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVP
Subjt:  KDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVP

Query:  LDATSTIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFE----------------------RGGNNAYSKMEYLNITIVTSNEPYGILDGSN
        LDATSTIPVD+NVFLAFEQR NTYEAQYCFQSLKMARDTW+ NGFFE                       GG+NAYSKMEY+NITIVTSNEPYGI DGSN
Subjt:  LDATSTIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFE----------------------RGGNNAYSKMEYLNITIVTSNEPYGILDGSN

Query:  PLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQF
        PLVDGHLVPKFG+QKNGVHSGHVQTGMLDPFCL A+ +GKCQDGYTKEA+GSESVQVLVAVEAKST DTN+SI+K FYISF DVLNSP+QTGRFDFRAQF
Subjt:  PLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQF

Query:  PNYKEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIG
        PNYKEV YRPKF KKL GKP+VFDMDMSTGDF+TLLYLLK PVE+IDLKGIIISPNGWATAATIDVVYD+LHMMGRDDIPVGLGD+FAIGEAHPSFP IG
Subjt:  PNYKEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIG

Query:  DCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRI
        DCKYIKA+PHGSGGFLDSDTLYG ARDLPRSPRRYTAENSVKFGA+RDTDHPELRQMSAL+VWKA+VRSLD   K+TVLTNGPLTNLAQI+R+KAI  RI
Subjt:  DCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRI

Query:  QEVYITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYH
        QEVYITGGHI FGGDKGNVFTIPSN YAEFNFFLDP AAE V+GSGLNITLIPLNVQRRVSSF KILKRLKLRN+TPEA FS+RL SRLYHLKQHHHQYH
Subjt:  QEVYITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYH

Query:  HVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQKRLWSA
        HVDMFLGEVLGAVSLAGKH+NLK+ FS KPLK+VTNGGESKVGQTIIDEK+GKWVRVLESV EPLAFYE LA+ALGDEKQSAV+GSFEEQKR+WSA
Subjt:  HVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQKRLWSA

XP_023530226.1 uncharacterized protein LOC111792856 [Cucurbita pepo subsp. pepo]0.0e+0077.98Show/hide
Query:  MMMK-MMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLY
        MMMK + RGG +WKS + ICGVV V  M GS +WL       +SS   P RI+VDTDVDTDDV ALLYLLKQP+SLFHLQAITINGNGWS+ GHAV+HLY
Subjt:  MMMK-MMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLY

Query:  DVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQNPTP------------------TFGKRKYTPLKQPTAQQVLKDAISE
        D+LFMMGRDDIP+GVGGEGGISPNA+IS+HVGGYLPLIDQ  +T          P                      GKR+Y P+KQPTAQQV+KDAIS 
Subjt:  DVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQNPTP------------------TFGKRKYTPLKQPTAQQVLKDAISE

Query:  GPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATST
        GPTTVF+MGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIR  CS +AD SHGKTCNNIGNLWPPN NPYAEFNIFGDPFAAYTVLHSGIPVT+VPLDATST
Subjt:  GPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATST

Query:  IPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFER----------------------GGNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDGH
        IPVD+ V+LAFEQRQNTYEA+YCFQSLKMA DTWSG GFFE                       GGNNAYSKMEYLNITIVTSNEPYGI DGSNPLVDG 
Subjt:  IPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFER----------------------GGNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDGH

Query:  LVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYKEV
        L+PKFG+QKNGVHSGHVQTGMLDPFCL +SE+GKC+DGYTKEA+GSESV+VLVAV AKS+L+TN  I+K FY+SF D +NSPRQ+GRFDFRAQFP YKEV
Subjt:  LVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYKEV

Query:  QYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKYIK
         YRPKF K L GKP++FDMDMSTGDFLTLLYLLK PVE+IDLKGIIISPNGWATAATIDVVYD+LHMMGRDDI VGLGDVFAIGEAHPSFP+IGDCKYIK
Subjt:  QYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKYIK

Query:  ALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRIQEVYIT
        A+PHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVK GA RDTDHP LRQMSALDVWK VVRSLDP VK+TVLTNGPLTNLAQIIRSKAIS RIQEVYIT
Subjt:  ALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRIQEVYIT

Query:  GGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHVDMFL
        GGHI+   DKGNVFTIPSNEYAEFNFFLDP AAE V+GSGLNITLIPLN QR VSSF KIL++LK  NRT EA F+RRLL RLY L+Q +HQYHHVDMFL
Subjt:  GGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHVDMFL

Query:  GEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQKRLWSA
        GEV+GAVSL G+H NLK+AFSMK +K++  GGESKVGQTIIDEKEGKWVRVLESV E LAFYE LANAL D+ Q+AVIGSFEEQK LWSA
Subjt:  GEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQKRLWSA

XP_038879905.1 uncharacterized protein LOC120071620 isoform X1 [Benincasa hispida]0.0e+0081.03Show/hide
Query:  QMMMKMMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLY
        +MMM  +   G+WK SWLICGVVG+VVM GS+VWLANSSS S      P RI+VDTDVDTDD+FAL YLLKQP+SLFHLQAITINGNGWSDAGHAV+HLY
Subjt:  QMMMKMMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLY

Query:  DVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQ-------------------NPTPTFGKRKYTPLKQPTAQQVLKDAIS
        D+LFMMGRDDIPVGVGGEGGISPNA+IS HVGGYLPLIDQ  +T      Y Q                    P    GKR+Y P+KQPTAQ V+KDA+S
Subjt:  DVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQ-------------------NPTPTFGKRKYTPLKQPTAQQVLKDAIS

Query:  EGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATS
         GPTTVFLMGAHTNLAIFL++NPHLKKNIKHIYAMGGAIR ICSE+AD SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATS
Subjt:  EGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATS

Query:  TIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFE---------------------RG-GNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDG
        TIPV+E VFL FEQRQNTYEA+YCFQSLKMARDTW+GNGFFE                     RG GNNA+SKMEYLNITIVTSN+PYGI DGSNPLVDG
Subjt:  TIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFE---------------------RG-GNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDG

Query:  HLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYKE
        HL+PKFG+QKNGVHSGHVQTGMLDPFCL ++E GKCQDGYTKEADG ESVQVLVAVEAKST+DTN+SI+K FY SF DVLNSPRQTGRFDFRAQFPNY+E
Subjt:  HLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYKE

Query:  VQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKYI
        V YRPKF KKL GKP++FDMDMSTGDFLTLLYLLK P+E+I+LKGIIISPNGWAT ATIDVVYD+LHMMGRDDI VGLGDVFAIGEAHPSFP IGDCKYI
Subjt:  VQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKYI

Query:  KALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRIQEVYI
        KA+PHGSGGFLDSDTLYGFARDLPRSPRRYTAE+SVKFG  RDTDHPELRQMSALDVWK VV+SLD   K+TVLTNGPLTNLAQI+  KAIS RIQEVYI
Subjt:  KALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRIQEVYI

Query:  TGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHVDMF
        +GG+IN+G DKGNVFTIPSNE++EFNFFLDP AA+ V+GSGLNITLIPLNVQR+VSSF KILK+LKL NRTPEA FSRRLLSRLYHLKQ HHQYHHVDMF
Subjt:  TGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHVDMF

Query:  LGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQKRLWSA
        LGEVLG VSLAGKH+NLKQ FSMKPLK+VTNGGESKVGQTIIDEK+GKWVRVLESV EPLAFYEDLANAL DEKQSAVIGSFE QKRLWSA
Subjt:  LGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQKRLWSA

TrEMBL top hitse value%identityAlignment
A0A0A0LWJ1 Uncharacterized protein0.0e+0079.09Show/hide
Query:  MMKMMR-GGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYD
        MM+MMR   G    SWLICGVVG+VVMLGSVVWLANSSS +S     P RI+VDTDVDTDD+FALLYLLKQP+SLFHLQ ITINGNGWSDAGHAV+HLYD
Subjt:  MMKMMR-GGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYD

Query:  VLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQ--------NPTPTF---------GKRKYTPLKQPTAQQVLKDAISEGP
        +LFMMGRDDIPVGVGG+GGISPNA+IS+++GGYLPLIDQ  +T      Y Q        N    F         GKR+Y P+KQPTAQQV+KDAIS GP
Subjt:  VLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQ--------NPTPTF---------GKRKYTPLKQPTAQQVLKDAISEGP

Query:  TTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIP
        T VFLMGAHTNLAIFL++NPHLKKNIKH+YAMGGAIR ICSESAD SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIP
Subjt:  TTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIP

Query:  VDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFE----------------------RGGNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDGHLV
        V++ VFLAFEQRQNTYEA+YCFQSLKMA DTW  +GFFE                       GGNNA+SKMEYLNITIVTSN+PYGI DGSNPLVDGHL+
Subjt:  VDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFE----------------------RGGNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDGHLV

Query:  PKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYKEVQY
        P  G Q NGVHSGHVQTGMLDPFCL ++ +GKCQDGYTKE+DGSESVQVLVAVEAKST+DTN+SI+K FYISF DVLNSPRQTGRFDFRAQFPNY+EV Y
Subjt:  PKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYKEVQY

Query:  RPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKYIKAL
        RPKF K+L GKP++FDMDMSTGDFLTLLYLLK P+E+I+LKGIIISPNGWATAATIDVVYD+LHMMGRDDI VGLGDVFAIGEAHP +P IGDCKY KA+
Subjt:  RPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKYIKAL

Query:  PHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRIQEVYITGG
        P GSGG LDSDTLYGFARDLPRSPRRYTAENSVKFGA RDTDHPELRQMS LDVWK VV+SL+   K+TVLTNGPLTNLA+II+ KAIS RI+EVYITGG
Subjt:  PHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRIQEVYITGG

Query:  HINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHVDMFLGE
        H+N+G DKGN+FTIPSNEY+EFNFFLDP AA+ V  SGLNITLIPLNVQRRVSSF KIL++LKLRNRTPEA  SRRLL RLY LKQ HHQYHHVDMFLGE
Subjt:  HINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHVDMFLGE

Query:  VLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFE
        VLGAVSLAGKHLNLKQ FS KPLK+++NGGESKVGQTIIDEK+GKWVRVLES+ EPLAFYED+ANAL DEKQ+AVI SFE
Subjt:  VLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFE

A0A1S3BP73 uncharacterized protein LOC103492210 isoform X10.0e+0077.29Show/hide
Query:  RDQMMMKMMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSH
        + +MMM      G    SWLICGVVG+VVMLGS+VWLANSSS +S     P RI+VDTD D DD+FAL YLLKQP+SLFHLQ ITINGNGWSDAGHAV+H
Subjt:  RDQMMMKMMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSH

Query:  LYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQ--------NPTPTF---------GKRKYTPLKQPTAQQVLKDAIS
        LYD+LFMMGRDDIPVGVGG+GGISP+A+IS ++GGYLPLIDQ  +T      Y Q        N    F         GKR+Y P+KQPTAQQV+KDAIS
Subjt:  LYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQ--------NPTPTF---------GKRKYTPLKQPTAQQVLKDAIS

Query:  EGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATS
         GPT VFLMGAHTNLAIFL++NPHLKKNIKH+YAMGGAIR ICSES   SHGKTC+NIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATS
Subjt:  EGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATS

Query:  TIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFE---------------------RGGN---NAYSKMEYLNITIVTSNEPYGILDGSNPLV
        TIPV++ VFLAFEQRQNTYEA+YCFQSLKMARDTW  +GFFE                     RGG    NA+SKMEYLN+TIVTSNEPYGI DGSNP V
Subjt:  TIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFE---------------------RGGN---NAYSKMEYLNITIVTSNEPYGILDGSNPLV

Query:  DGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNY
        +G L+  FG QKNGVHSGHVQTGMLDPFCL ++ +GKCQDGYTKEADGSESVQVLVAVEAKST+DTN+SI+K FYISF DVLNSPRQTGRFDFRAQFP Y
Subjt:  DGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNY

Query:  KEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCK
        +EV YRP F K+L GKP++FDMDMSTGDFLTLLYLLK P+E+I+LKGIIISPNGWATAATIDVVYD+LHMMGRDDI VGLGD+FAIGE HP FP IGDCK
Subjt:  KEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCK

Query:  YIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRIQEV
        Y KA+P GSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGA RDTDHPELRQMSALDVWK VVR+LD   K+TVLT+GPLTNLA+II  KA+S RI+EV
Subjt:  YIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRIQEV

Query:  YITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHVD
        YITGGHI++G DKGN+FTIPSNEY+EFNFFLDP AA+ V GSGLNITLIPLNVQRRVSSF KILK+LK RNRTPEA FSRRLL RLY LKQ HHQYHHVD
Subjt:  YITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHVD

Query:  MFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFE
        MFLGEV+GAVSLAGKHLNLKQ FS KPLK+++NGGESKVGQTIID K+GKWVRVLES+ EPLA YEDLANAL DEKQ+AVI SFE
Subjt:  MFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFE

A0A6J1DM35 uncharacterized protein LOC1110218210.0e+0082.03Show/hide
Query:  MMMKMM---RGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSH
        MMM+M+   RGG YWK SW ICGVVGVVVMLGSVVWL NSS  S       +RIVVDTDVDTDDVFA+ YLLKQPTSLFHLQAITINGNGWS+AGHAV+H
Subjt:  MMMKMM---RGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSH

Query:  LYDVLFMMGRDDIPVGVGGEGGISPN--ASISSH--VGGYLPLIDQVTNT---------LPSAYTYSQNPTPTFGKRK---------YTPLKQPTAQQVL
        +YD+LFMMGRDDIPVGVGGEGGISPN   SIS H  VGG+LPLIDQ  +T         +P            FG RK         YTP+KQPTAQQVL
Subjt:  LYDVLFMMGRDDIPVGVGGEGGISPN--ASISSH--VGGYLPLIDQVTNT---------LPSAYTYSQNPTPTFGKRK---------YTPLKQPTAQQVL

Query:  KDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVP
        KDAIS GPTTVFLMG HTNLAIFLMTNPHLKKNIKHIYAMGGAIR ICS   D SHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVP
Subjt:  KDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVP

Query:  LDATSTIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFE----------------------RGGNNAYSKMEYLNITIVTSNEPYGILDGSN
        LDATSTIPVD+NVFLAFEQR NTYEAQYCFQSLKMARDTW+ NGFFE                       GG+NAYSKMEY+NITIVTSNEPYGI DGSN
Subjt:  LDATSTIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFE----------------------RGGNNAYSKMEYLNITIVTSNEPYGILDGSN

Query:  PLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQF
        PLVDGHLVPKFG+QKNGVHSGHVQTGMLDPFCL A+ +GKCQDGYTKEA+GSESVQVLVAVEAKST DTN+SI+K FYISF DVLNSP+QTGRFDFRAQF
Subjt:  PLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQF

Query:  PNYKEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIG
        PNYKEV YRPKF KKL GKP+VFDMDMSTGDF+TLLYLLK PVE+IDLKGIIISPNGWATAATIDVVYD+LHMMGRDDIPVGLGD+FAIGEAHPSFP IG
Subjt:  PNYKEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIG

Query:  DCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRI
        DCKYIKA+PHGSGGFLDSDTLYG ARDLPRSPRRYTAENSVKFGA+RDTDHPELRQMSAL+VWKA+VRSLD   K+TVLTNGPLTNLAQI+R+KAI  RI
Subjt:  DCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRI

Query:  QEVYITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYH
        QEVYITGGHI FGGDKGNVFTIPSN YAEFNFFLDP AAE V+GSGLNITLIPLNVQRRVSSF KILKRLKLRN+TPEA FS+RL SRLYHLKQHHHQYH
Subjt:  QEVYITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYH

Query:  HVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQKRLWSA
        HVDMFLGEVLGAVSLAGKH+NLK+ FS KPLK+VTNGGESKVGQTIIDEK+GKWVRVLESV EPLAFYE LA+ALGDEKQSAV+GSFEEQKR+WSA
Subjt:  HVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQKRLWSA

A0A6J1EZU7 uncharacterized protein LOC111441054 isoform X10.0e+0075.91Show/hide
Query:  GFDLVCVWRDQ-----MMMKMMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITIN
        GF++V  WRD      MM  +  GG +WKS  LICGVV V  M GS++WL       ++S+  P RI+VDTDVDTDDV ALLYLLKQP+SLF LQAITIN
Subjt:  GFDLVCVWRDQ-----MMMKMMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITIN

Query:  GNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQNPTPTFGK------------------RKYTPL
        GNGWS+ GHAV+HLYD+LFMMGRDDIP+GVGGEGGISPNA+IS+HVGGYLPL+DQ  +T          P    G+                  R+Y P+
Subjt:  GNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQNPTPTFGK------------------RKYTPL

Query:  KQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLH
        KQPTAQQV+KDAIS GPTTVF+MGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIR  CS +AD SHGKTCNNIGNLWPPN NPYAEFNIFGDPFAAYTVLH
Subjt:  KQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLH

Query:  SGIPVTLVPLDATSTIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFER----------------------GGNNAYSKMEYLNITIVTSNE
        SGIP+TLVPLDATSTIPVD+ V+LAFEQRQNTYEA+YCF+SLKMA DTWSG GFFE                       GGNNAYSKMEYLNI+IVTSNE
Subjt:  SGIPVTLVPLDATSTIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFER----------------------GGNNAYSKMEYLNITIVTSNE

Query:  PYGILDGSNPLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQT
        PYGI DGSNPLVDG L+PKFG+QKNGVHSGHVQTGMLDPFCL +SE+GKC+DGYTKEA+GSESV+VLVAV AKS+L+TN  I+K FY+SF D +NSPRQ+
Subjt:  PYGILDGSNPLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQT

Query:  GRFDFRAQFPNYKEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGE
        GRFDFRAQFP YKEV YRPKF K L GKP++FDMDMSTGDFLTLLYLLK PVE+IDLKGIIISPNGWATAATIDVVYD+LHMMGRDDI VGLGDVFAIGE
Subjt:  GRFDFRAQFPNYKEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGE

Query:  AHPSFPAIGDCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQII
        AHPSFP+IGDCKYIKA+PHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVK GA RDTDHP LRQMSA+DVW  +V+SLDP VK+TVLTNGPLTNLAQII
Subjt:  AHPSFPAIGDCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQII

Query:  RSKAISPRIQEVYITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYH
        RSKAIS RIQEVYITGGHI+   DKGNVFTIPSNEYAEFNFFLDP AAE V+GSGLNITLIPLN QR VSSF KIL++LK  N T EA F+RRLL RLY 
Subjt:  RSKAISPRIQEVYITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYH

Query:  LKQHHHQYHHVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQK
        L+Q + QYHHVDMFLGEVLGAVSL G+H NLK+AFS+K +K++T GGESKVGQTIIDEKEGKWVRVLESV E LAFYE LANAL D+ Q+AVIGSFEEQK
Subjt:  LKQHHHQYHHVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQK

Query:  RLWSA
         LWSA
Subjt:  RLWSA

A0A6J1I5L4 uncharacterized protein LOC1114694420.0e+0076.22Show/hide
Query:  FDLVCVWRDQ-----MMMKMMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITING
        F+LV  WRD      MM  + RGG +WKS  LICGVV +  M GS++WL +S  S       P RI+VDTDVDTDDV ALLYLLKQP+SLFHLQAITING
Subjt:  FDLVCVWRDQ-----MMMKMMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITING

Query:  NGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQNPTP------------------TFGKRKYTPLK
        NGWS+ GHAV+HLYD+LFMMGRDDIP+GVGGEGGISPN +IS+HVGGYLPLIDQ  +T          P                      GKR+Y P+K
Subjt:  NGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQNPTP------------------TFGKRKYTPLK

Query:  QPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHS
        QPTAQQV+KDAIS GPTTVF+MGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIR  CS +AD SHGKTCNNIGNLWPPN NPYAEFNIFGDPFAAYTVLHS
Subjt:  QPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHS

Query:  GIPVTLVPLDATSTIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFER----------------------GGNNAYSKMEYLNITIVTSNEP
        GIPVTLVPLDATSTIPVD+ V+LAFEQRQNTYEA+YCFQSLKMA DTWSG GFFE                       GGNNAYSKMEYLNITIVTSNEP
Subjt:  GIPVTLVPLDATSTIPVDENVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFER----------------------GGNNAYSKMEYLNITIVTSNEP

Query:  YGILDGSNPLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTG
        YGI DGSNPLVDG L+PKFG+QKNGVHSGHVQTGMLDPFCL + E+GKC+DGYTKEA+GSESV+VLVAV AKS+L+TNT I+K FY+SF D LNSPRQ+G
Subjt:  YGILDGSNPLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTG

Query:  RFDFRAQFPNYKEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEA
        RFDFRAQFP YKEV YRPK  K L GKP++FDMDMSTGDFLTLLYLLK PVE+IDLKGIIISPNGWATAATID+VYD+LHMMGRDDI VGLGDVFAIGEA
Subjt:  RFDFRAQFPNYKEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEA

Query:  HPSFPAIGDCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIR
        HPSFP+IGDCKYI A+PHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVK GA RDTDHP LRQMSA+DVW  VV+SLDP VK+TVLTNGPLTNLAQIIR
Subjt:  HPSFPAIGDCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIR

Query:  SKAISPRIQEVYITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHL
        S+AIS RIQEVYITGGHI+   D+GNVFTIPSNEYAEFNFFLDP AAE V+GSGLNITLIPLN QR VSSF KIL++LK  NRT EA F+RRLL RLY L
Subjt:  SKAISPRIQEVYITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHL

Query:  KQHHHQYHHVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQKR
        +Q ++QYHHVDMFLGEV+GAVSL G+H NLK+AF +K +K++T GGESKVGQTIIDEKEGKWVRVLESV E LAFYE LANAL D+ Q+AVIGSFEEQK 
Subjt:  KQHHHQYHHVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQKR

Query:  LWSA
        LWSA
Subjt:  LWSA

SwissProt top hitse value%identityAlignment
A1A8R9 Pyrimidine-specific ribonucleoside hydrolase RihA2.6e-1230.13Show/hide
Query:  IVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTY
        I++D D   DD  A++  L  P     ++AIT +  G       + ++  +L ++ R DIPV  G    +  N  I+ +V G             S    
Subjt:  IVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTY

Query:  SQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYA
           P PTF  +  T ++     + L++  SE P T+   G  TN+A+ L ++P L   I  I  MGGA+                  +GN W     P A
Subjt:  SQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYA

Query:  EFNIFGDPFAAYTVLHSGIPVTLVPLDAT
        EFNI+ DP AA  V  SGIPV +  LD T
Subjt:  EFNIFGDPFAAYTVLHSGIPVTLVPLDAT

B1KHA5 Pyrimidine-specific ribonucleoside hydrolase RihA1.7e-1430.96Show/hide
Query:  IVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTY
        I++D D   DD  A++  L    +   L   T  GN   D    +++   +L ++GR DIPV  G    ++    I+ +V G   L   V          
Subjt:  IVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTY

Query:  SQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYA
           P P+F  +  T ++       LK   S+ P T+   G  TN+A+ L T+P L  NI+ I  MGGA                   +GN W     P A
Subjt:  SQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYA

Query:  EFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPV-DENV
        EFNIF DP AA  V  +GIP+T+  LD T    V DE++
Subjt:  EFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPV-DENV

B7MFR7 Pyrimidine-specific ribonucleoside hydrolase RihA2.6e-1230.13Show/hide
Query:  IVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTY
        I++D D   DD  A++  L  P     ++AIT +  G       + ++  +L ++ R DIPV  G    +  N  I+ +V G             S    
Subjt:  IVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTY

Query:  SQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYA
           P PTF  +  T ++     + L++  SE P T+   G  TN+A+ L ++P L   I  I  MGGA+                  +GN W     P A
Subjt:  SQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYA

Query:  EFNIFGDPFAAYTVLHSGIPVTLVPLDAT
        EFNI+ DP AA  V  SGIPV +  LD T
Subjt:  EFNIFGDPFAAYTVLHSGIPVTLVPLDAT

Q8EIM7 Pyrimidine-specific ribonucleoside hydrolase RihA1.4e-1330.96Show/hide
Query:  IVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTY
        I++D D   DD  AL+  L  P  L  L   T  GN   D    +++   +L ++ R DIPV  G    ++ +  I+ +V G   L              
Subjt:  IVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTY

Query:  SQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYA
           P P+F  +  T + +  AQQ+ K   S  P T+   G  TN+A+ L ++  L   I+ I  MGGA                   +GN W     P A
Subjt:  SQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYA

Query:  EFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPV-DENV
        EFNIF DP AA  V  SGIP+T+  LD T    + DE++
Subjt:  EFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPV-DENV

Q8FJY8 Pyrimidine-specific ribonucleoside hydrolase RihA2.6e-1230.13Show/hide
Query:  IVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTY
        I++D D   DD  A++  L  P     ++AIT +  G       + ++  +L ++ R DIPV  G    +  N  I+ +V G             S    
Subjt:  IVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTY

Query:  SQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYA
           P PTF  +  T ++     + L++  SE P T+   G  TN+A+ L ++P L   I  I  MGGA+                  +GN W     P A
Subjt:  SQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYA

Query:  EFNIFGDPFAAYTVLHSGIPVTLVPLDAT
        EFNI+ DP AA  V  SGIPV +  LD T
Subjt:  EFNIFGDPFAAYTVLHSGIPVTLVPLDAT

Arabidopsis top hitse value%identityAlignment
AT1G05620.1 uridine-ribohydrolase 22.5e-1026.38Show/hide
Query:  QRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAY
        ++I++DTD   DD  A+   L  P  +  +   TI GN ++    A  +   +L + GR DIPV  G       +  +   +  ++   D + N      
Subjt:  QRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAY

Query:  TYSQNPTPTFGKRKYTPLKQPTAQQVLKDA-ISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTN
           QN  P  GK    P+++   + +++ A +  G  TV  +G  TNLA+ +  +P   KN+  I  +GGA                          N N
Subjt:  TYSQNPTPTFGKRKYTPLKQPTAQQVLKDA-ISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTN

Query:  PYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTI
        P +E NIFGDP AA  V   G  +  V ++ T  +
Subjt:  PYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTI

AT1G05620.2 uridine-ribohydrolase 22.2e-0626.24Show/hide
Query:  TINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQNPTPTFGKRKYTPLKQPTAQQVLKDA-IS
        TI GN ++    A  +   +L + GR DIPV  G       +  +   +  ++   D + N         QN  P  GK    P+++   + +++ A + 
Subjt:  TINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQNPTPTFGKRKYTPLKQPTAQQVLKDA-IS

Query:  EGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATS
         G  TV  +G  TNLA+ +  +P   KN+  I  +GGA                          N NP +E NIFGDP AA  V   G  +  V ++ T 
Subjt:  EGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATS

Query:  TI
         +
Subjt:  TI

AT5G18860.1 inosine-uridine preferring nucleoside hydrolase family protein9.2e-27155.1Show/hide
Query:  KSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPV
        KS W +  ++G   +LG  +    SSS          RI+VDTDVDTDD+FA+LYLLK   S F L  IT++ N W++AGHAV+ +YD+L MM RDDIPV
Subjt:  KSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPV

Query:  GVGGEGGISPNASISSHVGGYLPLIDQVTNT-------------------LPSAYTYSQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHT
        GVGGEGGIS + +I S VGGY P+I+Q   T                   + S Y + +   P  G R+YTPL+QPTAQ+V+ D ISEGPTTV L+G+HT
Subjt:  GVGGEGGISPNASISSHVGGYLPLIDQVTNT-------------------LPSAYTYSQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHT

Query:  NLAIFLMTNPHLKKNIKHIYAMGGAIRA-----ICSESADTS--HGKTCNNIGNLWPPNT-NPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVD
        N A+FLM+NPHLK NI+HIY MGG +R+      C  ++  +    + C N GNL+   T NPY+EFNIF DPFAAY V HSG+PVTLVPLDAT+TIP++
Subjt:  NLAIFLMTNPHLKKNIKHIYAMGGAIRA-----ICSESADTS--HGKTCNNIGNLWPPNT-NPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVD

Query:  ENVFLAFEQR-QNTYEAQYCFQSLKMARDTWSGNGFFE-------------------------RGGNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDGH
        +  F  FE   Q TYEAQY F SLK+ARDTW  + F++                         + G N +++MEY+NIT+VTSN+PYG  DGSNP  D  
Subjt:  ENVFLAFEQR-QNTYEAQYCFQSLKMARDTWSGNGFFE-------------------------RGGNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDGH

Query:  LVPKFGIQKNGVHSGHVQTGMLDPFCLGAS--ERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYK
          PKF +   GVHSGHVQTG+ DP CL  S   RGKC+DGYT+E  GS+SV+VLVA  AK  ++  + +++EFY+ F +VLN P +TGRF+F +QFP YK
Subjt:  LVPKFGIQKNGVHSGHVQTGMLDPFCLGAS--ERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYK

Query:  EVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKY
        E  +RP   K   GKP+VFDMDMS GDFL+L YLLKVPV+ IDLK II+SP GWA AATIDVVYD+LHMMGRDDIPVGLGD+ A+ ++ P FP +G CKY
Subjt:  EVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKY

Query:  IKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRS-KAISPRIQEV
        +KA+P G GGFLDSDTLYG ARDLPRSPRRYTAENSV  GA RDTD PELRQ  A++VW+ + +S +   K+TVLTNGPLTNLA+II S K  S  I+EV
Subjt:  IKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSLDPRVKVTVLTNGPLTNLAQIIRS-KAISPRIQEV

Query:  YITGGHIN-FGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHV
        YI GGHIN    DKGN+FTIPSN YAEFN FLDP AA++V+ S LNITL+PL  Q ++SSF  +L RL    +TPEA F +RLL RL  L Q H +Y H+
Subjt:  YITGGHIN-FGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHV

Query:  DMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQKRLWS
        DMFLGEVLGAV L G   +LK     + +K++  G ES+ G+ +ID+  GK +++LE V + ++  E  A+ L D+KQSAVIGSFEEQK++WS
Subjt:  DMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAVIGSFEEQKRLWS

AT5G18870.1 Inosine-uridine preferring nucleoside hydrolase family protein2.7e-6050.83Show/hide
Query:  LHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNT-
        L+ P RI++DTDVDTDD  ALLYLLK   + F L  IT++ N W++AGH V+H+YD+L+MMGRDDI VGVGGEGGI  + +I   VG YLP+I+Q   T 
Subjt:  LHYPQRIVVDTDVDTDDVFALLYLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNT-

Query:  ----------------LPSAYTYSQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESA
                        + S Y + ++  P  G R+YTPL+QPTAQ+V+ D +SEGP ++F++G+HTNLA+F+M+NPHLK NI+HIY MGG++R       
Subjt:  ----------------LPSAYTYSQNPTPTFGKRKYTPLKQPTAQQVLKDAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESA

Query:  DTSHGKTCNN----IGNLWPPNT-NPYAEFNIFGDPFAAYTV
               C N     GNL+   T NPYAEFNIF DPFAAY V
Subjt:  DTSHGKTCNN----IGNLWPPNT-NPYAEFNIFGDPFAAYTV

AT5G18890.1 Inosine-uridine preferring nucleoside hydrolase family protein6.8e-17358.63Show/hide
Query:  GNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNT
        G N +++MEY+NIT+VTSNEPYG+ D SNP       PKF +   GVHSGHVQ G+ DP C+  S +G C+DGYTKE  G +SV+VLVA  AK + + N+
Subjt:  GNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQDGYTKEADGSESVQVLVAVEAKSTLDTNT

Query:  SIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYKEVQYRPKF-RKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDI
         +++EFY  F +VLN P +TGRF F  QF  Y+E  +  +    +L GKP+VFDMDMS GDFL+L YLLKVPVE+IDLK +I+SP GWA  ATIDVVYD+
Subjt:  SIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYKEVQYRPKF-RKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIIISPNGWATAATIDVVYDI

Query:  LHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSL
        LHMMGRDDIPVGLGD+FAI ++ P FP+ GDCKY KA+P G GGFLDSDTLYG ARDLPRSPRRY  ENSV  GA  DTD PELRQ  AL+VW+ + +S+
Subjt:  LHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVWKAVVRSL

Query:  DPRVKVTVLTNGPLTNLAQIIRS-KAISPRIQEVYITGGHINFG-GDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILK
        D   K+TVLTNGPLT+LA+II S K  S  I+EVYI GGHI+ G  DKGN+FT+PSN YAEFN FLDP AA++V+ SGLNITLIPL  QR   SF  +L 
Subjt:  DPRVKVTVLTNGPLTNLAQIIRS-KAISPRIQEVYITGGHINFG-GDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILK

Query:  RLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFY
        RL    +TPEA F +RLL+RL  L Q   +Y H+DMFLGE+LGA+ L G H  LK     + +K++  G ESK G  +ID+  GK +++LE V +    Y
Subjt:  RLKLRNRTPEALFSRRLLSRLYHLKQHHHQYHHVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFY

Query:  EDLANALGDEKQSAVIGSFEEQKRLWS
        E  A+ L D+KQSAVIGSFEEQ+  W+
Subjt:  EDLANALGDEKQSAVIGSFEEQKRLWS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGTCTTTTCAAGGAATCCATCATCACGATTCCAAGTGATCAGTAGCAGCAACAACTTAAGTGATGGAAACAATGGAGTGGAATTAATTAAGAAGGGGTTTGATTT
GGTTTGTGTTTGGAGAGATCAGATGATGATGAAGATGATGAGAGGAGGAGGATATTGGAAGAGTAGTTGGTTGATATGTGGAGTGGTGGGTGTGGTTGTGATGTTGGGAT
CAGTGGTGTGGTTGGCAAATTCTTCTTCTTCTTCTTCTTCTTCTCTTCATTATCCTCAGAGGATTGTTGTGGACACAGATGTTGATACTGATGATGTCTTTGCTCTTCTC
TATCTTCTCAAGCAACCTACTTCTCTCTTTCATCTTCAGGCAATAACAATAAATGGGAATGGATGGAGCGACGCAGGCCATGCGGTGAGCCATCTGTACGACGTGTTGTT
CATGATGGGGCGAGACGACATCCCAGTTGGTGTCGGCGGTGAAGGCGGCATTTCCCCCAACGCTTCTATTTCCTCTCATGTCGGTGGCTACCTACCACTTATTGATCAGG
TAACCAACACCCTACCCTCTGCCTACACCTATTCCCAAAACCCTACACCTACTTTTGGTAAGAGGAAATATACTCCTCTGAAGCAACCAACTGCACAGCAAGTGCTGAAA
GATGCAATATCTGAAGGGCCTACTACAGTTTTTCTAATGGGAGCTCATACAAACTTGGCTATATTTCTTATGACAAATCCTCATTTGAAGAAAAATATCAAGCATATATA
TGCCATGGGCGGTGCTATTAGAGCAATTTGCTCAGAGAGTGCTGATACTTCTCATGGGAAGACATGCAACAATATAGGAAACTTGTGGCCTCCAAATACAAACCCATATG
CTGAGTTCAATATATTTGGAGACCCTTTTGCTGCCTATACAGTACTACATTCTGGAATTCCTGTTACACTCGTTCCTCTGGATGCAACAAGTACGATTCCTGTGGATGAG
AATGTGTTTTTGGCATTTGAACAGAGACAAAACACTTACGAGGCTCAATATTGTTTCCAGTCCTTGAAAATGGCTCGCGATACTTGGTCTGGCAATGGATTTTTTGAAAG
GGGTGGAAATAATGCATATTCGAAGATGGAATATTTAAATATCACTATTGTTACATCGAACGAACCTTACGGCATCTTAGATGGATCAAATCCACTTGTCGATGGACATT
TAGTCCCAAAATTTGGTATCCAAAAGAATGGGGTGCACAGTGGTCATGTTCAGACAGGAATGCTAGATCCATTCTGCCTTGGTGCAAGTGAAAGAGGAAAGTGTCAGGAT
GGTTACACAAAGGAGGCAGACGGTTCTGAATCAGTTCAAGTTCTTGTTGCTGTGGAAGCTAAATCAACTCTTGACACCAACACCTCAATTGAAAAAGAATTTTATATAAG
CTTCTTTGATGTTCTTAATAGTCCACGACAGACAGGGAGGTTTGATTTTAGAGCTCAGTTTCCTAACTATAAGGAAGTACAATATAGACCAAAATTTCGGAAGAAATTGC
GAGGAAAACCAATTGTTTTTGACATGGATATGAGCACAGGAGATTTTCTCACCCTTCTCTATCTCCTGAAGGTACCAGTTGAACTCATCGATCTAAAGGGAATAATAATT
AGCCCAAATGGATGGGCAACGGCTGCAACAATAGACGTCGTATATGATATATTACATATGATGGGTCGTGACGATATTCCAGTTGGTCTTGGAGATGTCTTCGCCATTGG
AGAAGCACATCCATCATTTCCTGCTATTGGTGACTGCAAGTATATCAAGGCCCTTCCGCATGGTAGTGGTGGCTTCTTGGATTCGGATACACTTTATGGATTTGCTCGAG
ACTTACCTCGTAGTCCTAGAAGGTATACAGCTGAAAATTCAGTGAAGTTTGGAGCACTTAGAGATACTGACCACCCTGAACTCAGACAAATGTCTGCACTTGATGTTTGG
AAAGCTGTTGTAAGATCTCTAGATCCGAGGGTGAAGGTTACGGTTTTAACTAACGGACCTTTGACAAATCTAGCTCAAATTATTCGTTCTAAAGCAATAAGCCCAAGGAT
TCAGGAGGTATATATTACTGGAGGACATATAAACTTTGGTGGTGACAAGGGAAATGTGTTCACTATTCCTTCAAATGAATATGCAGAATTCAACTTCTTTCTCGACCCTG
CAGCTGCAGAATCGGTTGTAGGCTCGGGATTGAACATTACACTCATCCCATTGAACGTGCAACGGAGAGTAAGTTCATTTCCCAAGATCCTCAAAAGGTTGAAGCTCAGA
AACAGAACTCCCGAAGCGCTGTTCTCTCGACGTCTACTCTCTAGGCTATATCACCTTAAGCAGCACCATCATCAGTACCATCATGTGGACATGTTCTTGGGAGAAGTCCT
TGGAGCAGTGAGCTTAGCTGGGAAACATCTGAATCTGAAGCAAGCATTCAGCATGAAGCCTCTCAAGATAGTCACAAATGGAGGTGAATCAAAAGTAGGGCAAACAATTA
TAGATGAGAAGGAAGGGAAATGGGTGAGGGTTTTGGAGAGTGTCGAAGAACCACTTGCATTTTATGAAGATCTTGCAAATGCATTGGGTGATGAAAAGCAATCTGCTGTT
ATAGGAAGCTTTGAGGAACAGAAGAGGCTGTGGAGTGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGTCTTTTCAAGGAATCCATCATCACGATTCCAAGTGATCAGTAGCAGCAACAACTTAAGTGATGGAAACAATGGAGTGGAATTAATTAAGAAGGGGTTTGATTT
GGTTTGTGTTTGGAGAGATCAGATGATGATGAAGATGATGAGAGGAGGAGGATATTGGAAGAGTAGTTGGTTGATATGTGGAGTGGTGGGTGTGGTTGTGATGTTGGGAT
CAGTGGTGTGGTTGGCAAATTCTTCTTCTTCTTCTTCTTCTTCTCTTCATTATCCTCAGAGGATTGTTGTGGACACAGATGTTGATACTGATGATGTCTTTGCTCTTCTC
TATCTTCTCAAGCAACCTACTTCTCTCTTTCATCTTCAGGCAATAACAATAAATGGGAATGGATGGAGCGACGCAGGCCATGCGGTGAGCCATCTGTACGACGTGTTGTT
CATGATGGGGCGAGACGACATCCCAGTTGGTGTCGGCGGTGAAGGCGGCATTTCCCCCAACGCTTCTATTTCCTCTCATGTCGGTGGCTACCTACCACTTATTGATCAGG
TAACCAACACCCTACCCTCTGCCTACACCTATTCCCAAAACCCTACACCTACTTTTGGTAAGAGGAAATATACTCCTCTGAAGCAACCAACTGCACAGCAAGTGCTGAAA
GATGCAATATCTGAAGGGCCTACTACAGTTTTTCTAATGGGAGCTCATACAAACTTGGCTATATTTCTTATGACAAATCCTCATTTGAAGAAAAATATCAAGCATATATA
TGCCATGGGCGGTGCTATTAGAGCAATTTGCTCAGAGAGTGCTGATACTTCTCATGGGAAGACATGCAACAATATAGGAAACTTGTGGCCTCCAAATACAAACCCATATG
CTGAGTTCAATATATTTGGAGACCCTTTTGCTGCCTATACAGTACTACATTCTGGAATTCCTGTTACACTCGTTCCTCTGGATGCAACAAGTACGATTCCTGTGGATGAG
AATGTGTTTTTGGCATTTGAACAGAGACAAAACACTTACGAGGCTCAATATTGTTTCCAGTCCTTGAAAATGGCTCGCGATACTTGGTCTGGCAATGGATTTTTTGAAAG
GGGTGGAAATAATGCATATTCGAAGATGGAATATTTAAATATCACTATTGTTACATCGAACGAACCTTACGGCATCTTAGATGGATCAAATCCACTTGTCGATGGACATT
TAGTCCCAAAATTTGGTATCCAAAAGAATGGGGTGCACAGTGGTCATGTTCAGACAGGAATGCTAGATCCATTCTGCCTTGGTGCAAGTGAAAGAGGAAAGTGTCAGGAT
GGTTACACAAAGGAGGCAGACGGTTCTGAATCAGTTCAAGTTCTTGTTGCTGTGGAAGCTAAATCAACTCTTGACACCAACACCTCAATTGAAAAAGAATTTTATATAAG
CTTCTTTGATGTTCTTAATAGTCCACGACAGACAGGGAGGTTTGATTTTAGAGCTCAGTTTCCTAACTATAAGGAAGTACAATATAGACCAAAATTTCGGAAGAAATTGC
GAGGAAAACCAATTGTTTTTGACATGGATATGAGCACAGGAGATTTTCTCACCCTTCTCTATCTCCTGAAGGTACCAGTTGAACTCATCGATCTAAAGGGAATAATAATT
AGCCCAAATGGATGGGCAACGGCTGCAACAATAGACGTCGTATATGATATATTACATATGATGGGTCGTGACGATATTCCAGTTGGTCTTGGAGATGTCTTCGCCATTGG
AGAAGCACATCCATCATTTCCTGCTATTGGTGACTGCAAGTATATCAAGGCCCTTCCGCATGGTAGTGGTGGCTTCTTGGATTCGGATACACTTTATGGATTTGCTCGAG
ACTTACCTCGTAGTCCTAGAAGGTATACAGCTGAAAATTCAGTGAAGTTTGGAGCACTTAGAGATACTGACCACCCTGAACTCAGACAAATGTCTGCACTTGATGTTTGG
AAAGCTGTTGTAAGATCTCTAGATCCGAGGGTGAAGGTTACGGTTTTAACTAACGGACCTTTGACAAATCTAGCTCAAATTATTCGTTCTAAAGCAATAAGCCCAAGGAT
TCAGGAGGTATATATTACTGGAGGACATATAAACTTTGGTGGTGACAAGGGAAATGTGTTCACTATTCCTTCAAATGAATATGCAGAATTCAACTTCTTTCTCGACCCTG
CAGCTGCAGAATCGGTTGTAGGCTCGGGATTGAACATTACACTCATCCCATTGAACGTGCAACGGAGAGTAAGTTCATTTCCCAAGATCCTCAAAAGGTTGAAGCTCAGA
AACAGAACTCCCGAAGCGCTGTTCTCTCGACGTCTACTCTCTAGGCTATATCACCTTAAGCAGCACCATCATCAGTACCATCATGTGGACATGTTCTTGGGAGAAGTCCT
TGGAGCAGTGAGCTTAGCTGGGAAACATCTGAATCTGAAGCAAGCATTCAGCATGAAGCCTCTCAAGATAGTCACAAATGGAGGTGAATCAAAAGTAGGGCAAACAATTA
TAGATGAGAAGGAAGGGAAATGGGTGAGGGTTTTGGAGAGTGTCGAAGAACCACTTGCATTTTATGAAGATCTTGCAAATGCATTGGGTGATGAAAAGCAATCTGCTGTT
ATAGGAAGCTTTGAGGAACAGAAGAGGCTGTGGAGTGCCTGA
Protein sequenceShow/hide protein sequence
MKVFSRNPSSRFQVISSSNNLSDGNNGVELIKKGFDLVCVWRDQMMMKMMRGGGYWKSSWLICGVVGVVVMLGSVVWLANSSSSSSSSLHYPQRIVVDTDVDTDDVFALL
YLLKQPTSLFHLQAITINGNGWSDAGHAVSHLYDVLFMMGRDDIPVGVGGEGGISPNASISSHVGGYLPLIDQVTNTLPSAYTYSQNPTPTFGKRKYTPLKQPTAQQVLK
DAISEGPTTVFLMGAHTNLAIFLMTNPHLKKNIKHIYAMGGAIRAICSESADTSHGKTCNNIGNLWPPNTNPYAEFNIFGDPFAAYTVLHSGIPVTLVPLDATSTIPVDE
NVFLAFEQRQNTYEAQYCFQSLKMARDTWSGNGFFERGGNNAYSKMEYLNITIVTSNEPYGILDGSNPLVDGHLVPKFGIQKNGVHSGHVQTGMLDPFCLGASERGKCQD
GYTKEADGSESVQVLVAVEAKSTLDTNTSIEKEFYISFFDVLNSPRQTGRFDFRAQFPNYKEVQYRPKFRKKLRGKPIVFDMDMSTGDFLTLLYLLKVPVELIDLKGIII
SPNGWATAATIDVVYDILHMMGRDDIPVGLGDVFAIGEAHPSFPAIGDCKYIKALPHGSGGFLDSDTLYGFARDLPRSPRRYTAENSVKFGALRDTDHPELRQMSALDVW
KAVVRSLDPRVKVTVLTNGPLTNLAQIIRSKAISPRIQEVYITGGHINFGGDKGNVFTIPSNEYAEFNFFLDPAAAESVVGSGLNITLIPLNVQRRVSSFPKILKRLKLR
NRTPEALFSRRLLSRLYHLKQHHHQYHHVDMFLGEVLGAVSLAGKHLNLKQAFSMKPLKIVTNGGESKVGQTIIDEKEGKWVRVLESVEEPLAFYEDLANALGDEKQSAV
IGSFEEQKRLWSA