CuGenDBv2

Sgr016974 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016974
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Transglut_core2 domain-containing protein
Genome location: tig00153017:129289..137274
RNA-Seq Expression: Sgr016974
Synteny: Sgr016974
Gene Ontology terms: NA
InterPro domains: IPR032698 - Protein SirB1, N-terminal
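
The genome location above is written as contig:start..end. A minimal sketch of how such a string could be split into its parts is shown below. The `Location` type and `parse_location` helper are hypothetical names introduced here for illustration, and the length calculation assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location."""
    contig: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


# contig name, a colon, then two integers separated by '..'
_LOCATION_RE = re.compile(r"^(?P<contig>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> Location:
    """Parse a location string such as 'tig00153017:129289..137274'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match["contig"], int(match["start"]), int(match["end"]))


loc = parse_location("tig00153017:129289..137274")
print(loc.contig, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 7986 positions on contig tig00153017.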


Homology
GenBank top hits | e value | % identity | Alignment
KAG6579123.1 hypothetical protein SDJN03_23571, partial [Cucurbita argyrosperma subsp. sororia] | 7.3e-196 | 83.64
Query:  MSSFTAASASLWTPRLTTSSKFSKFN--------SSPP---CFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREIS
        M+SFT  SA L  PRL +SSK SKFN        SSP     FRVVCSGGF+Q  A KD RFLLHDA+DS GIDST+AKEARKGFLTQI YLSNIERE S
Subjt:  MSSFTAASASLWTPRLTTSSKFSKFN--------SSPP---CFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREIS

Query:  ISINRRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRT
        ISINR VDLAKAALYIA EDDSLVSHSSVPLP+DA+V R+NDLSMGYCTHYKSSFNLSPE+ LESIERY+YVMKGFRRTS K ++EPRALYLHTVLTHRT
Subjt:  ISINRRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRT

Query:  GSAALLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNV
        GSAALLSLIYSEILKMLR WSLLDFDVEIYHPHDD+SLPT YHK K +ESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAAD ANCSDRSN 
Subjt:  GSAALLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNV

Query:  IEESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMK
         EESGFQLASA+AAQHRLERGVWTSVR+GDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLY+E K+SSS  D LSCQEEEAV +L+K
Subjt:  IEESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMK

Query:  RLTLIMMEDGWSRPSYARNFIGKNSEPW
        RL LIMMEDGWSRP++AR FIGKNSEPW
Subjt:  RLTLIMMEDGWSRPSYARNFIGKNSEPW

XP_022146442.1 uncharacterized protein LOC111015657 isoform X1 [Momordica charantia] | 2.2e-208 | 88.73
Query:  MSSFTAASASLWTPRLTTSSKFSKFNSSPPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISINRRVDLAK
        M+S T ASASLW P LT SSKFSKFNSSPPCFRVVCSG FQ  YAAKDL F LHDAMDS GIDST+AKEARKGFLTQI+Y SNIE+E SISINRRVDLAK
Subjt:  MSSFTAASASLWTPRLTTSSKFSKFNSSPPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISINRRVDLAK

Query:  AALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAALLSLIYS
        AALYIA EDDSLVSHSSVPLPIDA++ R++DLSMGYCTHYKSSFNLSPE+FLESIERYMYV KGFRRTSS T+SE RALYLHTVLTHRTGSAALLSL+YS
Subjt:  AALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAALLSLIYS

Query:  EILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIEESGFQLASA
        EILKMLR WSLLDFDVEIYHPHD YSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQN SRSLFLRAAD ANCSDRSN IEESGFQLASA
Subjt:  EILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIEESGFQLASA

Query:  RAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRLTLIMMEDGW
        +AAQHRLERGVWTSVRFGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQE KS  S  D +SCQEEEAV NLMKRL LIMMEDGW
Subjt:  RAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRLTLIMMEDGW

Query:  SRPSYARNFIGKNSEPW
        S PSY RNFIGKNSEPW
Subjt:  SRPSYARNFIGKNSEPW

XP_022938946.1 uncharacterized protein LOC111445003 isoform X2 [Cucurbita moschata] | 3.4e-193 | 84.52
Query:  MSSFTAASASLWTPRLTTSSKFSKFN------SSPP---CFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISIS
        M+SFT  SASL  PRL +SSK SKFN      SSP     FRVVCSGGF+Q  A KD RFLLHDA+DS GIDST+AKEARKGFLTQI YLSNIERE SIS
Subjt:  MSSFTAASASLWTPRLTTSSKFSKFN------SSPP---CFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISIS

Query:  INRRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGS
        INR VDLAKAALYIA EDDSLVSHSSVPLP+DA+V R+NDLSMGYCTHYKSSFNLSPE+ LESIERY+YVMKGFRRTS K ++EPRALYLHTVLTHRTGS
Subjt:  INRRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGS

Query:  AALLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIE
        AALLSLIYSEILKMLR WSLLDFDVEIYHPHDD+SLPT YHK K +ESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAAD ANCSDRSN  E
Subjt:  AALLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIE

Query:  ESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRL
        ESGFQLASA+AAQHRLERGVWTSVR+GDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQE K+SSS  D LSCQEEEAV +LMKRL
Subjt:  ESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRL

Query:  TLIMMEDGWSRPSYARNFIG
         LIMMEDGWSRP++AR FIG
Subjt:  TLIMMEDGWSRPSYARNFIG

XP_023530796.1 uncharacterized protein LOC111793214 [Cucurbita pepo subsp. pepo] | 4.0e-194 | 83.45
Query:  MSSFTAAS--ASLWTPRLTTSSKFSKFNSS-----PPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISIN
        M SFT AS  ASLW PRL+ SSKFSKFNSS      P FRVVCSGG +   A +D  F+LHDAMDS GID++++KEARKGFLTQIQYLSNIERE SISIN
Subjt:  MSSFTAAS--ASLWTPRLTTSSKFSKFNSS-----PPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISIN

Query:  RRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAA
        RRVDLAKAALYIA EDDSLVSHSSVPLPIDA++  + DLSMGYCTHYKSSFNLSPE+FLESIERYMYV KGFRRTSSK + EP+ALYLHTVLTH TGS+ 
Subjt:  RRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAA

Query:  LLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIEES
        LLSLIYSEILKMLR WSLLDFDVEIYHPHD+YSLPTGYHK KSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRA D ANCSDRSN IEES
Subjt:  LLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIEES

Query:  GFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRLTL
        GFQLASA+AAQHRLERGVWTS R+GDMRRAL+ACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQE KSSSS  D  S +EEEAV NLMKRL L
Subjt:  GFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRLTL

Query:  IMMEDGWSRPSYARNFIGKNSEP
        IM+EDGWS PSYAR FIGKN+EP
Subjt:  IMMEDGWSRPSYARNFIGKNSEP

XP_038875536.1 uncharacterized protein LOC120067957 [Benincasa hispida] | 8.6e-197 | 84.98
Query:  MSSFTAASASLWTPRLTTSS--KFSKFNSS-----PPCFRVVCSGGF--QQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISIS
        MSSFT ASASL  PRL +SS  KFSKFNSS     PPCFRVVCS GF  QQ  + KD +FLLHDAMDS GIDSTHAKEARKGFL+QI YLS +ER+ SIS
Subjt:  MSSFTAASASLWTPRLTTSS--KFSKFNSS-----PPCFRVVCSGGF--QQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISIS

Query:  INRRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGS
        INRRVDLAKAALYIA EDDSLVSHSSVPLP+DA++ R++DLSMGYCTHYKSSFN SPE FLESIE YMYVMKGFRR SSK +SEPRALYLHTVLTHRTGS
Subjt:  INRRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGS

Query:  AALLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIE
        AALLSLIYSEILKMLR WSLLDFDVE+YHPHDDYSLPTGYHK KSKESDQPHI+TTQ+LLVEILSNLKESFWPFQQNQSRSLFLRAAD ANCSD  N  E
Subjt:  AALLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIE

Query:  ESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRL
        ESGFQLASA+AAQHRLERGVWTSVR+GDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQE KSSSS    LSCQEEEAV NLM RL
Subjt:  ESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRL

Query:  TLIMMEDGWSRPSYARNFIGKNSEPW
         LIMMEDGWSRPS  R FIGKNSEPW
Subjt:  TLIMMEDGWSRPSYARNFIGKNSEPW

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CZL0 uncharacterized protein LOC111015657 isoform X1 | 1.1e-208 | 88.73
Query:  MSSFTAASASLWTPRLTTSSKFSKFNSSPPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISINRRVDLAK
        M+S T ASASLW P LT SSKFSKFNSSPPCFRVVCSG FQ  YAAKDL F LHDAMDS GIDST+AKEARKGFLTQI+Y SNIE+E SISINRRVDLAK
Subjt:  MSSFTAASASLWTPRLTTSSKFSKFNSSPPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISINRRVDLAK

Query:  AALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAALLSLIYS
        AALYIA EDDSLVSHSSVPLPIDA++ R++DLSMGYCTHYKSSFNLSPE+FLESIERYMYV KGFRRTSS T+SE RALYLHTVLTHRTGSAALLSL+YS
Subjt:  AALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAALLSLIYS

Query:  EILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIEESGFQLASA
        EILKMLR WSLLDFDVEIYHPHD YSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQN SRSLFLRAAD ANCSDRSN IEESGFQLASA
Subjt:  EILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIEESGFQLASA

Query:  RAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRLTLIMMEDGW
        +AAQHRLERGVWTSVRFGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQE KS  S  D +SCQEEEAV NLMKRL LIMMEDGW
Subjt:  RAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRLTLIMMEDGW

Query:  SRPSYARNFIGKNSEPW
        S PSY RNFIGKNSEPW
Subjt:  SRPSYARNFIGKNSEPW

A0A6J1FEJ1 uncharacterized protein LOC111445003 isoform X2 | 1.6e-193 | 84.52
Query:  MSSFTAASASLWTPRLTTSSKFSKFN------SSPP---CFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISIS
        M+SFT  SASL  PRL +SSK SKFN      SSP     FRVVCSGGF+Q  A KD RFLLHDA+DS GIDST+AKEARKGFLTQI YLSNIERE SIS
Subjt:  MSSFTAASASLWTPRLTTSSKFSKFN------SSPP---CFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISIS

Query:  INRRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGS
        INR VDLAKAALYIA EDDSLVSHSSVPLP+DA+V R+NDLSMGYCTHYKSSFNLSPE+ LESIERY+YVMKGFRRTS K ++EPRALYLHTVLTHRTGS
Subjt:  INRRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGS

Query:  AALLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIE
        AALLSLIYSEILKMLR WSLLDFDVEIYHPHDD+SLPT YHK K +ESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAAD ANCSDRSN  E
Subjt:  AALLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIE

Query:  ESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRL
        ESGFQLASA+AAQHRLERGVWTSVR+GDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQE K+SSS  D LSCQEEEAV +LMKRL
Subjt:  ESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRL

Query:  TLIMMEDGWSRPSYARNFIG
         LIMMEDGWSRP++AR FIG
Subjt:  TLIMMEDGWSRPSYARNFIG

A0A6J1FL90 uncharacterized protein LOC111445003 isoform X1 | 8.1e-193 | 84.49
Query:  MSSFTAASASLWTPRLTTSSKFSKFN------SSPP---CFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISIS
        M+SFT  SASL  PRL +SSK SKFN      SSP     FRVVCSGGF+Q  A KD RFLLHDA+DS GIDST+AKEARKGFLTQI YLSNIERE SIS
Subjt:  MSSFTAASASLWTPRLTTSSKFSKFN------SSPP---CFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISIS

Query:  INRRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGS
        INR VDLAKAALYIA EDDSLVSHSSVPLP+DA+V R+NDLSMGYCTHYKSSFNLSPE+ LESIERY+YVMKGFRRTS K ++EPRALYLHTVLTHRTGS
Subjt:  INRRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGS

Query:  AALLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIE
        AALLSLIYSEILKMLR WSLLDFDVEIYHPHDD+SLPT YHK K +ESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAAD ANCSDRSN  E
Subjt:  AALLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIE

Query:  ESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRL
        ESGFQLASA+AAQHRLERGVWTSVR+GDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQE K+SSS  D LSCQEEEAV +LMKRL
Subjt:  ESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRL

Query:  TLIMMEDGWSRPSYARNFI
         LIMMEDGWSRP++AR FI
Subjt:  TLIMMEDGWSRPSYARNFI

A0A6J1GQR6 uncharacterized protein LOC111456240 | 4.8e-193 | 83.22
Query:  MSSFTAAS--ASLWTPRLTTSSKFSKFNSS-----PPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISIN
        M SFT AS  ASLW PRL+ SSKFSKF+SS      P FRVVCSGG +   A +D  F+LHDAMDS GID+++AKEARKGFLTQIQYLSNIERE SISIN
Subjt:  MSSFTAAS--ASLWTPRLTTSSKFSKFNSS-----PPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISIN

Query:  RRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAA
        RRVDLAKAALYIA EDDSLVSHSSVPLPIDA++  + DLSMGYCTHYKSSFNLSPE+FLESIERYMYV KGFRRTSSK + EP+ALYLHTVLTH TGS+A
Subjt:  RRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAA

Query:  LLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIEES
         LSLIYSEILKMLR WSLLDFDVEIYHPHDDYSLPTGYHK KSKESDQPHIITTQSLLVEILSNLK SFWPFQQNQSRSLFLRA D ANC DRSN IEES
Subjt:  LLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIEES

Query:  GFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRLTL
        GFQLASA+AAQHRLERG+WTS R+GDMRRAL+ACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQE KSSSS  D LS +EEEAV NLMKRL L
Subjt:  GFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRLTL

Query:  IMMEDGWSRPSYARNFIGKNSEP
        IM+EDGWS PSYAR FIGKN+EP
Subjt:  IMMEDGWSRPSYARNFIGKNSEP

A0A6J1JZH6 uncharacterized protein LOC111489694 isoform X1 | 2.8e-193 | 83.65
Query:  MSSFTAASASLWTPRLTTSSKFSKFNSSPP-----------CFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREIS
        M+SFT  SASL  PRL +SSK SKFNSS              FRVVCSGGF+Q    KD RFLLHDA+DS GIDST+AKEARKGFLTQI YLSNIERE S
Subjt:  MSSFTAASASLWTPRLTTSSKFSKFNSSPP-----------CFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREIS

Query:  ISINRRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRT
        ISINR VDLAKAALYIA EDDSLVSHSSVPLP+DA+V R+NDLSMGYCTHYKSSFNLSPE+ LESIERY+YVMKGFRRTS K ++EPRALYLHTVLTHRT
Subjt:  ISINRRVDLAKAALYIATEDDSLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRT

Query:  GSAALLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNV
        GSAALLSLIYSEILKMLR WSLLDFDVEIYHPHDD+SLPT YHK K KESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAAD ANCSDRSN 
Subjt:  GSAALLSLIYSEILKMLRFWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNV

Query:  IEESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMK
         EESGFQLASA+AAQHRLERGVWTSVR+GDMRRALSACERLILLDVD KELRDYSILLYHCG+YEQSLEYLKLYQE K+SSS  D LSCQEEEAV +LMK
Subjt:  IEESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMK

Query:  RLTLIMMEDGWSRPSYARNFIG
        RL LIMMEDGWSRP++AR FIG
Subjt:  RLTLIMMEDGWSRPSYARNFIG

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT4G19160.1 unknown protein | 5.1e-06 | 23.86
Query:  LESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------FWSLLDFDVEIYHPHDDYSLPTGYHKQKSKE
        LE++   ++ ++GF+RTS     +P   YLH+VL  R  +A L+S+IY E+ K L               W   ++  E++      SL   +     + 
Subjt:  LESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------FWSLLDFDVEIYHPHDDYSLPTGYHKQKSKE

Query:  SDQP----HIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIEESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLD
         D P      +T +SL    ++  ++       N  R  + RA+ +++           G  L S  +  + +    +  +R  D+R A++A ERL++L 
Subjt:  SDQP----HIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIEESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLD

Query:  VDSKEL-RDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVN-LMKRLTLIMM
          +  L RD  ++LY+   Y ++++ L +            A +  EEEAV+   ++RL L+ +
Subjt:  VDSKEL-RDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVN-LMKRLTLIMM

AT4G19160.2 unknown protein | 1.8e-06 | 22.09
Query:  SFTAASASLWTPRLTTSSKFSKFNSSPPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISINRRVD---LA
        S+T     +  PR  T+S      S+ P FR   S   +     K+ + +   A   F  + +   +  +  + ++ +    E E  +++NR  D   L 
Subjt:  SFTAASASLWTPRLTTSSKFSKFNSSPPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISINRRVD---LA

Query:  KAALYIATEDDSLVSHSSVPLPIDA-----YVRRVNDLSMGYCTHYKS-SFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAA
        K    +  + D   + S   L +D      +V  ++ +S        S          LE++   ++ ++GF+RTS     +P   YLH+VL  R  +A 
Subjt:  KAALYIATEDDSLVSHSSVPLPIDA-----YVRRVNDLSMGYCTHYKS-SFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAA

Query:  LLSLIYSEILKMLR-------------FWSLLDFDVEIYHPHDDYSL-----------PTGYHKQKSKESDQP-HIITTQSLLVEILSNLKESFWPFQQN
        L+S+IY E+ K L               W   ++  E++      SL           P       + +S Q   + T + ++   L+NL    W     
Subjt:  LLSLIYSEILKMLR-------------FWSLLDFDVEIYHPHDDYSL-----------PTGYHKQKSKESDQP-HIITTQSLLVEILSNLKESFWPFQQN

Query:  QSRSLFLRAADAANCSDRSNVIEESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQEIK
         S  L L +        + N I  S F L                 +R  D+R A++A ERL++L   +  L RD  ++LY+   Y ++++ L +     
Subjt:  QSRSLFLRAADAANCSDRSNVIEESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQEIK

Query:  SSSSSPDALSCQEEEAVVN-LMKRLTLIMM
               A +  EEEAV+   ++RL L+ +
Subjt:  SSSSSPDALSCQEEEAVVN-LMKRLTLIMM

AT4G19160.3 unknown protein | 1.1e-05 | 22.27
Query:  SFTAASASLWTPRLTTSSKFSKFNSSPPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISINRRVD---LA
        S+T     +  PR  T+S      S+ P FR   S   +     K+ + +   A   F  + +   +  +  + ++ +    E E  +++NR  D   L 
Subjt:  SFTAASASLWTPRLTTSSKFSKFNSSPPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISINRRVD---LA

Query:  KAALYIATEDDSLVSHSSVPLPIDA-----YVRRVNDLSMGYCTHYKS-SFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAA
        K    +  + D   + S   L +D      +V  ++ +S        S          LE++   ++ ++GF+RTS     +P   YLH+VL  R  +A 
Subjt:  KAALYIATEDDSLVSHSSVPLPIDA-----YVRRVNDLSMGYCTHYKS-SFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAA

Query:  LLSLIYSEILKMLR-------------FWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQP----HIITTQSLLVEILSNLKESFWPFQQNQSRSLFLR
        L+S+IY E+ K L               W   ++  E++      SL   +     +  D P      +T +SL    ++  ++       N  R  + R
Subjt:  LLSLIYSEILKMLR-------------FWSLLDFDVEIYHPHDDYSLPTGYHKQKSKESDQP----HIITTQSLLVEILSNLKESFWPFQQNQSRSLFLR

Query:  AADAANCSDRSNVIEESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDA
        A+ +++           G  L S  +  + +    +  +R  D+R A++A ERL++L   +  L RD  ++LY+      S +Y +  QE+    S   A
Subjt:  AADAANCSDRSNVIEESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDA

Query:  LSCQEEEAVVN-LMKRLTLIMM
         +  EEEAV+   ++RL L+ +
Subjt:  LSCQEEEAVVN-LMKRLTLIMM


Sequences
CDS sequence
ATGAGTTCATTTACTGCTGCTTCTGCTTCCTTGTGGACTCCAAGGTTGACAACCTCTTCCAAGTTCTCTAAATTCAATTCATCTCCTCCCTGTTTTCGAGTGGTTTGTTC
TGGTGGGTTTCAACAGCAGTATGCTGCAAAGGATTTGCGGTTCCTTCTCCACGATGCTATGGATTCTTTTGGAATCGACTCCACCCATGCAAAGGAGGCTAGGAAGGGTT
TCTTGACTCAGATTCAATATTTATCTAATATAGAGAGGGAAATAAGTATTAGCATTAACAGACGTGTTGATTTGGCAAAAGCTGCTCTTTATATTGCAACAGAGGATGAT
TCCTTGGTATCTCATTCATCTGTTCCTCTTCCCATTGATGCATATGTTCGTAGAGTAAATGACCTTTCCATGGGCTATTGTACTCACTACAAATCTTCATTCAATTTATC
ACCAGAAAATTTTCTGGAGAGTATAGAGAGGTATATGTACGTCATGAAGGGTTTCAGAAGAACCAGTTCTAAAACTCGATCAGAACCACGAGCTCTATATCTTCACACAG
TCTTGACCCACCGTACAGGGTCAGCTGCACTACTTTCACTCATATACTCTGAGATCCTGAAAATGCTTCGTTTCTGGAGCCTTCTGGATTTTGATGTAGAGATATATCAT
CCTCATGATGATTATAGCCTTCCCACGGGCTATCATAAACAGAAAAGCAAGGAATCTGATCAACCACACATAATAACGACACAAAGTCTCTTGGTGGAGATTTTAAGCAA
TTTAAAGGAATCCTTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTATTCTTAAGGGCCGCAGATGCTGCTAACTGTAGTGACAGATCGAATGTAATTGAAGAAAGCG
GCTTTCAGCTTGCATCTGCAAGGGCTGCTCAACACAGGCTAGAACGTGGAGTTTGGACCAGCGTACGTTTTGGAGATATGAGGCGTGCATTATCTGCATGTGAACGGCTT
ATCCTCCTTGATGTTGATTCGAAGGAATTGAGAGATTATAGTATCCTTCTCTACCACTGTGGCTTTTATGAGCAATCTCTGGAGTATTTGAAGTTGTATCAGGAAATAAA
GAGTTCCTCAAGTTCACCCGATGCATTAAGTTGCCAGGAGGAAGAAGCTGTGGTTAACTTGATGAAACGCCTTACCCTTATTATGATGGAAGATGGTTGGAGCAGACCCT
CTTATGCTCGAAACTTCATCGGCAAGAACTCCGAACCTTGGTAA
mRNA sequence
ATGAGTTCATTTACTGCTGCTTCTGCTTCCTTGTGGACTCCAAGGTTGACAACCTCTTCCAAGTTCTCTAAATTCAATTCATCTCCTCCCTGTTTTCGAGTGGTTTGTTC
TGGTGGGTTTCAACAGCAGTATGCTGCAAAGGATTTGCGGTTCCTTCTCCACGATGCTATGGATTCTTTTGGAATCGACTCCACCCATGCAAAGGAGGCTAGGAAGGGTT
TCTTGACTCAGATTCAATATTTATCTAATATAGAGAGGGAAATAAGTATTAGCATTAACAGACGTGTTGATTTGGCAAAAGCTGCTCTTTATATTGCAACAGAGGATGAT
TCCTTGGTATCTCATTCATCTGTTCCTCTTCCCATTGATGCATATGTTCGTAGAGTAAATGACCTTTCCATGGGCTATTGTACTCACTACAAATCTTCATTCAATTTATC
ACCAGAAAATTTTCTGGAGAGTATAGAGAGGTATATGTACGTCATGAAGGGTTTCAGAAGAACCAGTTCTAAAACTCGATCAGAACCACGAGCTCTATATCTTCACACAG
TCTTGACCCACCGTACAGGGTCAGCTGCACTACTTTCACTCATATACTCTGAGATCCTGAAAATGCTTCGTTTCTGGAGCCTTCTGGATTTTGATGTAGAGATATATCAT
CCTCATGATGATTATAGCCTTCCCACGGGCTATCATAAACAGAAAAGCAAGGAATCTGATCAACCACACATAATAACGACACAAAGTCTCTTGGTGGAGATTTTAAGCAA
TTTAAAGGAATCCTTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTATTCTTAAGGGCCGCAGATGCTGCTAACTGTAGTGACAGATCGAATGTAATTGAAGAAAGCG
GCTTTCAGCTTGCATCTGCAAGGGCTGCTCAACACAGGCTAGAACGTGGAGTTTGGACCAGCGTACGTTTTGGAGATATGAGGCGTGCATTATCTGCATGTGAACGGCTT
ATCCTCCTTGATGTTGATTCGAAGGAATTGAGAGATTATAGTATCCTTCTCTACCACTGTGGCTTTTATGAGCAATCTCTGGAGTATTTGAAGTTGTATCAGGAAATAAA
GAGTTCCTCAAGTTCACCCGATGCATTAAGTTGCCAGGAGGAAGAAGCTGTGGTTAACTTGATGAAACGCCTTACCCTTATTATGATGGAAGATGGTTGGAGCAGACCCT
CTTATGCTCGAAACTTCATCGGCAAGAACTCCGAACCTTGGTAA
Protein sequence
MSSFTAASASLWTPRLTTSSKFSKFNSSPPCFRVVCSGGFQQQYAAKDLRFLLHDAMDSFGIDSTHAKEARKGFLTQIQYLSNIEREISISINRRVDLAKAALYIATEDD
SLVSHSSVPLPIDAYVRRVNDLSMGYCTHYKSSFNLSPENFLESIERYMYVMKGFRRTSSKTRSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRFWSLLDFDVEIYH
PHDDYSLPTGYHKQKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADAANCSDRSNVIEESGFQLASARAAQHRLERGVWTSVRFGDMRRALSACERL
ILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQEIKSSSSSPDALSCQEEEAVVNLMKRLTLIMMEDGWSRPSYARNFIGKNSEPW
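
The protein sequence above should correspond to the CDS read codon by codon. The sketch below shows one way to check this, assuming the standard nuclear genetic code applies to this gene. `translate` and `CODON_TABLE` are hypothetical names introduced for illustration and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGAGTTCATTTACTGCTGCTTCTGCTTCC"))  # MSSFTAASAS
print(translate("GAACCTTGGTAA"))                    # EPW
```

Both fragments agree with the start (MSSFTAASAS) and end (EPW) of the listed protein sequence. Passing the full CDS, with its line breaks, to `translate` would extend the same check to the whole record.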