; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0029484 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0029484
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTransglut_core2 domain-containing protein
Genome locationchr8:39418722..39423030
RNA-Seq ExpressionLag0029484
SyntenyLag0029484
Gene Ontology termsNA
InterPro domainsIPR032698 - Protein SirB1, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579123.1 hypothetical protein SDJN03_23571, partial [Cucurbita argyrosperma subsp. sororia]1.1e-20486.31Show/hide
Query:  MASFTAASAAFASSWIPRLTTPSKISKFN------SSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIER
        MASFT+A        IPRL + SK+SKFN      SSSS S   SFRVVCS G +Q +APKDF FLLHDA+DSSGIDS+YAKEARKGFLTQI YLSNIER
Subjt:  MASFTAASAAFASSWIPRLTTPSKISKFN------SSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIER

Query:  ETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLT
        ETSISINR VDLAKAALYIAAEDDSLVSHSSVPLP DAF+HR++DLSMGYCTHYKSSFN+SPES LESIERY+YVMKGFRRT+ KAQ+EPRALYLHTVLT
Subjt:  ETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLT

Query:  HRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDR
        HRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDD+SLP  YHKLK +ESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDR
Subjt:  HRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDR

Query:  SNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVEN
        SNA EESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLY+ETK SSSPTDTLSCQEEEAV++
Subjt:  SNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVEN

Query:  LMKRLALIMMEDGWSRPSYARNFIGRNSEPW
        L+KRLALIMMEDGWSRP++AR FIG+NSEPW
Subjt:  LMKRLALIMMEDGWSRPSYARNFIGRNSEPW

KAG6601941.1 hypothetical protein SDJN03_07174, partial [Cucurbita argyrosperma subsp. sororia]8.5e-20888.68Show/hide
Query:  MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERETSISI
        M SFT AS +FAS WIPRL+  SK SKF+SSSSHSI P FRVVCS GS+ + AP+DFHF+LHDAMDSSGID+SYAKEARKGFLTQIQYLSNIERETSISI
Subjt:  MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERETSISI

Query:  NRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSA
        NRRVDLAKAALYIAAEDDSLVSHSSVPLP DAFIH L+DLSMGYCTHYKSSFN+SPESFLESIERYMYV KGFRRT+SKAQ EP+ LYLHTVLTH TGS+
Subjt:  NRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSA

Query:  ALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEE
        ALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQ HIITTQSLLVEILSNLK SFWPFQQNQSRSLFLRA DVANCSDRSNAIEE
Subjt:  ALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEE

Query:  SGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLA
        SGFQLASAKAAQHRLERG+WTS RYGDMRRAL+ACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK+SS PTDTLS +EEEAVENLMKRLA
Subjt:  SGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLA

Query:  LIMMEDGWSRPSYARNFIGRNSEP
        LIM+EDGWS PSYAR FIG+N+EP
Subjt:  LIMMEDGWSRPSYARNFIGRNSEP

XP_011655252.2 uncharacterized protein LOC101204123 isoform X1 [Cucumis sativus]2.2e-20385.98Show/hide
Query:  MASFTAASAAFASSWIPRLTTPSKIS---------KFNSSSSHSIPPSFRVVCSSGSQQH-NAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLS
        MASFT+ASA+      PRLT+ S  S         KFNS SSHS PPSFRV CS G  QH N+   F+FLLH A+DSSGIDS++AKEARKGFL+QI YLS
Subjt:  MASFTAASAAFASSWIPRLTTPSKIS---------KFNSSSSHSIPPSFRVVCSSGSQQH-NAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLS

Query:  NIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLH
         IER+TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLP DAFIHRLSDLSMGYCTHYKSSFN SPE FLESIERYMYVMKGFRRT SKAQSEPRALYLH
Subjt:  NIERETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLH

Query:  TVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVAN
        TVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHI+TTQ+LLVEIL+NLKESFWPFQQNQSRSLFLRAAD AN
Subjt:  TVLTHRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVAN

Query:  CSDRSNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEE
        CSD S+A EESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSSPT  LS QEEE
Subjt:  CSDRSNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEE

Query:  AVENLMKRLALIMMEDGWSRPSYARNFIGRNSEPW
        AV+NLMKRLALIMMEDGWSRPS++R FIG+NSEPW
Subjt:  AVENLMKRLALIMMEDGWSRPSYARNFIGRNSEPW

XP_022953825.1 uncharacterized protein LOC111456240 [Cucurbita moschata]5.0e-20888.68Show/hide
Query:  MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERETSISI
        M SFT AS +FAS WIPRL+  SK SKF+SSSSHSI P FRVVCS GS+ + AP+DFHF+LHDAMDSSGID+SYAKEARKGFLTQIQYLSNIERETSISI
Subjt:  MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERETSISI

Query:  NRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSA
        NRRVDLAKAALYIAAEDDSLVSHSSVPLP DAFIH L+DLSMGYCTHYKSSFN+SPESFLESIERYMYV KGFRRT+SKAQ EP+ALYLHTVLTH TGS+
Subjt:  NRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSA

Query:  ALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEE
        A LSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQPHIITTQSLLVEILSNLK SFWPFQQNQSRSLFLRA DVANC DRSNAIEE
Subjt:  ALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEE

Query:  SGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLA
        SGFQLASAKAAQHRLERG+WTS RYGDMRRAL+ACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK+SSSPTD LS +EEEAVENLMKRLA
Subjt:  SGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLA

Query:  LIMMEDGWSRPSYARNFIGRNSEP
        LIM+EDGWS PSYAR FIG+N+EP
Subjt:  LIMMEDGWSRPSYARNFIGRNSEP

XP_023530796.1 uncharacterized protein LOC111793214 [Cucurbita pepo subsp. pepo]2.4e-21089.39Show/hide
Query:  MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERETSISI
        M SFT AS +FAS WIPRL+  SK SKFNSSSSHSI PSFRVVCS GS+ + AP+DFHF+LHDAMDSSGID+SY+KEARKGFLTQIQYLSNIERETSISI
Subjt:  MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERETSISI

Query:  NRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSA
        NRRVDLAKAALYIAAEDDSLVSHSSVPLP DAFIH L+DLSMGYCTHYKSSFN+SPESFLESIERYMYV KGFRRT+SKAQ EP+ALYLHTVLTH TGS+
Subjt:  NRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSA

Query:  ALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEE
         LLSLIYSEILKMLRLWSLLDFDVEIYHPHD+YSLP GYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRA DVANCSDRSNAIEE
Subjt:  ALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEE

Query:  SGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLA
        SGFQLASAKAAQHRLERGVWTS RYGDMRRAL+ACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK+SSSPTDT S +EEEAVENLMKRLA
Subjt:  SGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLA

Query:  LIMMEDGWSRPSYARNFIGRNSEP
        LIM+EDGWS PSYAR FIG+N+EP
Subjt:  LIMMEDGWSRPSYARNFIGRNSEP

TrEMBL top hitse value%identityAlignment
A0A0A0KTQ8 Transglut_core2 domain-containing protein1.4e-20386.34Show/hide
Query:  MASFTAASAAFASSWIPRLTTPSKIS------KFNSSSSHSIPPSFRVVCSSGSQQH-NAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIE
        MASFT+ASA+      PRL++ S  S      KFNS SSHS PPSFRV CS G  QH N+   F+FLLH A+DSSGIDS++AKEARKGFL+QI YLS IE
Subjt:  MASFTAASAAFASSWIPRLTTPSKIS------KFNSSSSHSIPPSFRVVCSSGSQQH-NAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIE

Query:  RETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVL
        R+TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLP DAFIHRLSDLSMGYCTHYKSSFN SPE FLESIERYMYVMKGFRRT SKAQSEPRALYLHTVL
Subjt:  RETSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVL

Query:  THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSD
        THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHI+TTQ+LLVEIL+NLKESFWPFQQNQSRSLFLRAAD ANCSD
Subjt:  THRTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSD

Query:  RSNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVE
         S+A EESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSSPT  LS QEEEAV+
Subjt:  RSNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVE

Query:  NLMKRLALIMMEDGWSRPSYARNFIGRNSEPW
        NLMKRLALIMMEDGWSRPS++R FIG+NSEPW
Subjt:  NLMKRLALIMMEDGWSRPSYARNFIGRNSEPW

A0A1S3CPY9 uncharacterized protein LOC103503468 isoform X21.5e-20287.21Show/hide
Query:  MASFTAASAAFASSWIPRLT----TPSKISKFNSSSSHSIPPSFRVVCSSG-SQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERE
        M+S T+ASA+ AS  IPR T    + SK  KFNS SSHS PP FRVVCS G  QQ N+ KDF+FLLHDAMDSSGIDS++AKEARKGFL+QI YLS IER+
Subjt:  MASFTAASAAFASSWIPRLT----TPSKISKFNSSSSHSIPPSFRVVCSSG-SQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERE

Query:  TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTH
        TSISI+RRVDLAKAALYIAAEDDSLVSHSSVPLP DAFIHRLSDLSMGYCTHYKSSFN SPE FLESIERYMYVMKGFRRT SKAQSEPRALYLHTVLTH
Subjt:  TSISINRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTH

Query:  RTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRS
        RTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQ HI+TTQ+LLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSD S
Subjt:  RTGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRS

Query:  NAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENL
        +A EESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETK SSS T  LS QEEEAV+NL
Subjt:  NAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENL

Query:  MKRLALIMMEDGWSRPSYARNFIGRNSEPW
        MKRLALIMMEDGWSRPS++R FI ++SEPW
Subjt:  MKRLALIMMEDGWSRPSYARNFIGRNSEPW

A0A6J1CZL0 uncharacterized protein LOC111015657 isoform X14.0e-20387.29Show/hide
Query:  MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERETSISI
        M S T AS   AS W P LT  SK SKFNSS     PP FRVVCS   Q H A KD HF LHDAMDSSGIDS+YAKEARKGFLTQI+Y SNIE+ETSISI
Subjt:  MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERETSISI

Query:  NRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSA
        NRRVDLAKAALYIAAEDDSLVSHSSVPLP DAFI+RLSDLSMGYCTHYKSSFN+SPESFLESIERYMYV KGFRRT+S  QSE RALYLHTVLTHRTGSA
Subjt:  NRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSA

Query:  ALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEE
        ALLSL+YSEILKMLRLWSLLDFDVEIYHPHD YSLP GYHK KSKESDQPHIITTQSLLVEILSNLKESFWPFQQN SRSLFLRAADVANCSDRSNAIEE
Subjt:  ALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEE

Query:  SGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLA
        SGFQLASAKAAQHRLERGVWTSVR+GDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK+  SP+DT+SCQEEEAV NLMKRLA
Subjt:  SGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLA

Query:  LIMMEDGWSRPSYARNFIGRNSEPW
        LIMMEDGWS PSY RNFIG+NSEPW
Subjt:  LIMMEDGWSRPSYARNFIGRNSEPW

A0A6J1FEJ1 uncharacterized protein LOC111445003 isoform X24.0e-20387.47Show/hide
Query:  MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPP----SFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERET
        MASFT+AS       IPRL + SK+SKFN SSS S  P    SFRVVCS G +Q +APKDF FLLHDA+DSSGIDS+YAKEARKGFLTQI YLSNIERET
Subjt:  MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPP----SFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERET

Query:  SISINRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHR
        SISINR VDLAKAALYIAAEDDSLVSHSSVPLP DAF+HR++DLSMGYCTHYKSSFN+SPES LESIERY+YVMKGFRRT+ KAQ+EPRALYLHTVLTHR
Subjt:  SISINRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHR

Query:  TGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSN
        TGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDD+SLP  YHKLK +ESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSN
Subjt:  TGSAALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSN

Query:  AIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLM
        A EESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK SSSPTDTLSCQEEEAV++LM
Subjt:  AIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLM

Query:  KRLALIMMEDGWSRPSYARNFIG
        KRLALIMMEDGWSRP++AR FIG
Subjt:  KRLALIMMEDGWSRPSYARNFIG

A0A6J1GQR6 uncharacterized protein LOC1114562402.4e-20888.68Show/hide
Query:  MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERETSISI
        M SFT AS +FAS WIPRL+  SK SKF+SSSSHSI P FRVVCS GS+ + AP+DFHF+LHDAMDSSGID+SYAKEARKGFLTQIQYLSNIERETSISI
Subjt:  MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERETSISI

Query:  NRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSA
        NRRVDLAKAALYIAAEDDSLVSHSSVPLP DAFIH L+DLSMGYCTHYKSSFN+SPESFLESIERYMYV KGFRRT+SKAQ EP+ALYLHTVLTH TGS+
Subjt:  NRRVDLAKAALYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSA

Query:  ALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEE
        A LSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLP GYHKLKSKESDQPHIITTQSLLVEILSNLK SFWPFQQNQSRSLFLRA DVANC DRSNAIEE
Subjt:  ALLSLIYSEILKMLRLWSLLDFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEE

Query:  SGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLA
        SGFQLASAKAAQHRLERG+WTS RYGDMRRAL+ACERLILLDVD KELRDYSILLYHCGFYEQSLEYLKLYQETK+SSSPTD LS +EEEAVENLMKRLA
Subjt:  SGFQLASAKAAQHRLERGVWTSVRYGDMRRALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLA

Query:  LIMMEDGWSRPSYARNFIGRNSEP
        LIM+EDGWS PSYAR FIG+N+EP
Subjt:  LIMMEDGWSRPSYARNFIGRNSEP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G19160.1 unknown protein1.1e-0623.25Show/hide
Query:  LESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSL-----------
        LE++   ++ ++GF+RT+     +P   YLH+VL  R  +A L+S+IY E+ K L              +W   ++  E++      SL           
Subjt:  LESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSL-----------

Query:  -PMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSA
               L +K      + T + ++   L+NL    W      S  L L +        + N I  S F L                 +R  D+R A++A
Subjt:  -PMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALSA

Query:  CERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLALIMM
         ERL++L   +  L RD  ++LY+   Y ++++ L +      + +P      +EE  +E  ++RL L+ +
Subjt:  CERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLALIMM

AT4G19160.2 unknown protein2.8e-0723.12Show/hide
Query:  RETSI-SINRRVDLAKAALYIAAEDDSL------------------VSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFN-VSPE---------------S
        RE SI S +  + +AK   YIAAED++                   V   S P  TD+    L  L     + + S  + +S E                
Subjt:  RETSI-SINRRVDLAKAALYIAAEDDSL------------------VSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFN-VSPE---------------S

Query:  FLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSL----------
         LE++   ++ ++GF+RT+     +P   YLH+VL  R  +A L+S+IY E+ K L              +W   ++  E++      SL          
Subjt:  FLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSL----------

Query:  --PMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALS
                L +K      + T + ++   L+NL    W      S  L L +        + N I  S F L                 +R  D+R A++
Subjt:  --PMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALS

Query:  ACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLALIMM
        A ERL++L   +  L RD  ++LY+   Y ++++ L +      + +P      +EE  +E  ++RL L+ +
Subjt:  ACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLALIMM

AT4G19160.3 unknown protein6.2e-0724.13Show/hide
Query:  RETSI-SINRRVDLAKAALYIAAEDDSL------------------VSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFN-VSPE---------------S
        RE SI S +  + +AK   YIAAED++                   V   S P  TD+    L  L     + + S  + +S E                
Subjt:  RETSI-SINRRVDLAKAALYIAAEDDSL------------------VSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFN-VSPE---------------S

Query:  FLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSL----------
         LE++   ++ ++GF+RT+     +P   YLH+VL  R  +A L+S+IY E+ K L              +W   ++  E++      SL          
Subjt:  FLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLR-------------LWSLLDFDVEIYHPHDDYSL----------

Query:  --PMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALS
                L +K      + T + ++   L+NL    W      S  L L +        + N I  S F L                 +R  D+R A++
Subjt:  --PMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRRALS

Query:  ACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAV-ENLMKRLALIMM
        A ERL++L   +  L RD  ++LY+      S +Y +  QE     +     +  EEEAV E  ++RL L+ +
Subjt:  ACERLILLDVDSKEL-RDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAV-ENLMKRLALIMM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTCCTTCACTGCTGCTTCGGCAGCTTTTGCTTCCTCATGGATTCCAAGGCTGACAACTCCTTCCAAAATCTCCAAATTCAATTCTTCTTCATCCCATTCCATTCC
ACCGAGTTTTCGAGTTGTTTGTTCTAGTGGGTCTCAGCAGCACAACGCTCCCAAGGATTTCCACTTCCTTCTCCATGATGCCATGGATTCTTCTGGAATTGACTCCTCCT
ATGCTAAGGAGGCTAGGAAGGGGTTCTTGACTCAGATTCAATATTTATCTAATATAGAGAGGGAAACAAGTATTAGCATTAATAGGCGTGTTGATTTGGCGAAAGCTGCT
CTTTATATTGCAGCAGAGGATGATTCCTTAGTATCTCATTCATCTGTTCCTCTTCCCACTGATGCATTTATTCATAGGTTAAGTGACCTATCCATGGGCTATTGTACTCA
CTACAAATCTTCATTCAATGTATCACCAGAAAGTTTTTTGGAGAGTATAGAGAGGTACATGTACGTCATGAAGGGTTTCAGAAGAACCAATTCTAAAGCTCAATCAGAAC
CACGAGCTCTATATCTTCACACAGTCTTGACCCATCGTACAGGCTCAGCTGCACTACTTTCACTCATATACTCCGAGATCCTGAAAATGCTTCGTTTATGGAGCCTTCTG
GATTTTGATGTAGAAATATATCATCCTCATGACGATTATAGCCTTCCCATGGGCTATCATAAGCTGAAAAGCAAGGAATCTGATCAACCACACATAATAACAACACAAAG
TCTCCTGGTGGAGATTTTAAGCAATTTAAAGGAATCTTTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTATTCTTAAGGGCCGCTGATGTTGCTAACTGTAGTGATA
GATCAAATGCAATTGAAGAAAGTGGCTTTCAGCTTGCGTCTGCAAAGGCTGCTCAACACAGGCTAGAACGTGGAGTTTGGACCAGTGTGCGTTATGGAGATATGAGGCGT
GCATTATCTGCATGTGAACGGCTTATCCTCCTCGATGTTGATTCGAAGGAATTGAGAGATTATAGCATCCTTCTCTACCATTGTGGCTTTTATGAGCAATCTCTGGAGTA
TTTGAAGTTGTATCAGGAAACAAAGACTTCATCAAGTCCAACCGACACGTTAAGTTGCCAGGAGGAAGAAGCTGTGGAAAATTTGATGAAACGCCTTGCCCTTATTATGA
TGGAAGATGGCTGGAGCAGACCCTCTTATGCTCGAAATTTCATCGGTAGGAACTCCGAACCATGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTTCCTTCACTGCTGCTTCGGCAGCTTTTGCTTCCTCATGGATTCCAAGGCTGACAACTCCTTCCAAAATCTCCAAATTCAATTCTTCTTCATCCCATTCCATTCC
ACCGAGTTTTCGAGTTGTTTGTTCTAGTGGGTCTCAGCAGCACAACGCTCCCAAGGATTTCCACTTCCTTCTCCATGATGCCATGGATTCTTCTGGAATTGACTCCTCCT
ATGCTAAGGAGGCTAGGAAGGGGTTCTTGACTCAGATTCAATATTTATCTAATATAGAGAGGGAAACAAGTATTAGCATTAATAGGCGTGTTGATTTGGCGAAAGCTGCT
CTTTATATTGCAGCAGAGGATGATTCCTTAGTATCTCATTCATCTGTTCCTCTTCCCACTGATGCATTTATTCATAGGTTAAGTGACCTATCCATGGGCTATTGTACTCA
CTACAAATCTTCATTCAATGTATCACCAGAAAGTTTTTTGGAGAGTATAGAGAGGTACATGTACGTCATGAAGGGTTTCAGAAGAACCAATTCTAAAGCTCAATCAGAAC
CACGAGCTCTATATCTTCACACAGTCTTGACCCATCGTACAGGCTCAGCTGCACTACTTTCACTCATATACTCCGAGATCCTGAAAATGCTTCGTTTATGGAGCCTTCTG
GATTTTGATGTAGAAATATATCATCCTCATGACGATTATAGCCTTCCCATGGGCTATCATAAGCTGAAAAGCAAGGAATCTGATCAACCACACATAATAACAACACAAAG
TCTCCTGGTGGAGATTTTAAGCAATTTAAAGGAATCTTTTTGGCCATTTCAACAAAATCAATCCAGAAGTTTATTCTTAAGGGCCGCTGATGTTGCTAACTGTAGTGATA
GATCAAATGCAATTGAAGAAAGTGGCTTTCAGCTTGCGTCTGCAAAGGCTGCTCAACACAGGCTAGAACGTGGAGTTTGGACCAGTGTGCGTTATGGAGATATGAGGCGT
GCATTATCTGCATGTGAACGGCTTATCCTCCTCGATGTTGATTCGAAGGAATTGAGAGATTATAGCATCCTTCTCTACCATTGTGGCTTTTATGAGCAATCTCTGGAGTA
TTTGAAGTTGTATCAGGAAACAAAGACTTCATCAAGTCCAACCGACACGTTAAGTTGCCAGGAGGAAGAAGCTGTGGAAAATTTGATGAAACGCCTTGCCCTTATTATGA
TGGAAGATGGCTGGAGCAGACCCTCTTATGCTCGAAATTTCATCGGTAGGAACTCCGAACCATGGTAA
Protein sequenceShow/hide protein sequence
MASFTAASAAFASSWIPRLTTPSKISKFNSSSSHSIPPSFRVVCSSGSQQHNAPKDFHFLLHDAMDSSGIDSSYAKEARKGFLTQIQYLSNIERETSISINRRVDLAKAA
LYIAAEDDSLVSHSSVPLPTDAFIHRLSDLSMGYCTHYKSSFNVSPESFLESIERYMYVMKGFRRTNSKAQSEPRALYLHTVLTHRTGSAALLSLIYSEILKMLRLWSLL
DFDVEIYHPHDDYSLPMGYHKLKSKESDQPHIITTQSLLVEILSNLKESFWPFQQNQSRSLFLRAADVANCSDRSNAIEESGFQLASAKAAQHRLERGVWTSVRYGDMRR
ALSACERLILLDVDSKELRDYSILLYHCGFYEQSLEYLKLYQETKTSSSPTDTLSCQEEEAVENLMKRLALIMMEDGWSRPSYARNFIGRNSEPW