; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025242 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025242
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionATP-dependent zinc metalloprotease
Genome locationtig00003412:2515890..2520574
RNA-Seq ExpressionSgr025242
SyntenySgr025242
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0048366 - leaf development (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0042651 - thylakoid membrane (cellular component)
GO:0004176 - ATP-dependent peptidase activity (molecular function)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR037219 - Peptidase M41-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8646108.1 hypothetical protein Csa_016892 [Cucumis sativus]5.7e-20974.06Show/hide
Query:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG
        RDSAIEP+ D     SAPSA+ N RLSGWERDWEVLDTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEGRRDVTPSVLE +TGLEVSKLSPKKWG
Subjt:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG

Query:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
        LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGT LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
Subjt:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA

Query:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQ-------
        GTQFWDEKMA+NLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSV QMSNQARWAVLQSYNLLKWHKHAHQ       
Subjt:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQ-------

Query:  -------VAVKALESNVSSLQLPANC--PVLSRICVFDALVLNLRKKVTLEGWKNYILRCSLLGLPGCYSLGSAFTGTPQKAVEARQKTPWADLSLYIQQ
               + ++  +S   SL LPA C        CV+     +L K ++    K                          K  +  +K PW DLSLYIQ+
Subjt:  -------VAVKALESNVSSLQLPANC--PVLSRICVFDALVLNLRKKVTLEGWKNYILRCSLLGLPGCYSLGSAFTGTPQKAVEARQKTPWADLSLYIQQ

Query:  PHSTANARSNKSMQPTTRSDSGVFVFRRTLTEGPENTSRIVGNAQGFIIPNEQFAHSS---------QPER------HAKHIGHENREELAVVGGTGSFA
        PHS ANAR N +MQP T  DSGVFVFRR LT+GPENTS+IVGNAQGFIIP+EQFA SS          PE       HAKHIGHENREE+ VVGGTGSFA
Subjt:  PHSTANARSNKSMQPTTRSDSGVFVFRRTLTEGPENTSRIVGNAQGFIIPNEQFAHSS---------QPER------HAKHIGHENREELAVVGGTGSFA

Query:  FAQGVAIFLQTDRQPSDTDTTYHLKLQLQFPK
        FAQGVAIFLQT+RQ  ++DT+YHLKLQLQFPK
Subjt:  FAQGVAIFLQTDRQPSDTDTTYHLKLQLQFPK

XP_004139896.1 uncharacterized protein LOC101213430 [Cucumis sativus]4.1e-15993.42Show/hide
Query:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG
        RDSAIEP+ D     SAPSA+ N RLSGWERDWEVLDTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEGRRDVTPSVLE +TGLEVSKLSPKKWG
Subjt:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG

Query:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
        LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGT LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
Subjt:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA

Query:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
        GTQFWDEKMA+NLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSV QMSNQARWAVLQSYNLLKWHKHAHQVAVKA+E
Subjt:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE

Query:  SNVS
        S  S
Subjt:  SNVS

XP_008447096.1 PREDICTED: uncharacterized protein LOC103489633 isoform X1 [Cucumis melo]1.1e-15993.42Show/hide
Query:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG
        RDSAIEP+ND     SAPSA+ N RLSGWERDWEVLDTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEG+RDVTPSVLES+TGLEVSKLSPKKWG
Subjt:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG

Query:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
        LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGT LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
Subjt:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA

Query:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
        GTQFWDEKMA+NLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSV QMSNQARWAVLQSYNLLKWHKHAHQVAVKA+E
Subjt:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE

Query:  SNVS
        S  S
Subjt:  SNVS

XP_022147989.1 uncharacterized protein LOC111016783 [Momordica charantia]1.5e-16194.08Show/hide
Query:  RDSAIEPLN-----DSAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG
        RDSAIEP+N     DSAPSAL NPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGL+V+KLSPKKWG
Subjt:  RDSAIEPLN-----DSAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG

Query:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
        LSGSS YALIAFLGGTSFLLS+DIDIRPNLLALLGLAFLDSILLGGT LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
Subjt:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA

Query:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
        GTQFWDEKMA+NLAEGRLDGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSV QMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
Subjt:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE

Query:  SNVS
        S  S
Subjt:  SNVS

XP_038888049.1 uncharacterized protein LOC120077976 isoform X1 [Benincasa hispida]6.3e-16093.75Show/hide
Query:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG
        RDSAIEP+ND     SAPSAL NPRLSGWERDWEVLDTCLNADDMKLVA+AYGFLRDRGFLPNFGK RNIVLEGRRDVTPSVLES+TGLEVSKLSPKKWG
Subjt:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG

Query:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
        +SGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGT LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
Subjt:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA

Query:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
        GTQFWDEKMA++LAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSV QMSNQARWAVLQSYNLLKWHKHAHQ AVKALE
Subjt:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE

Query:  SNVS
        S  S
Subjt:  SNVS

TrEMBL top hitse value%identityAlignment
A0A0A0K7I5 Uncharacterized protein2.0e-15993.42Show/hide
Query:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG
        RDSAIEP+ D     SAPSA+ N RLSGWERDWEVLDTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEGRRDVTPSVLE +TGLEVSKLSPKKWG
Subjt:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG

Query:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
        LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGT LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
Subjt:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA

Query:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
        GTQFWDEKMA+NLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSV QMSNQARWAVLQSYNLLKWHKHAHQVAVKA+E
Subjt:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE

Query:  SNVS
        S  S
Subjt:  SNVS

A0A1S3BH83 uncharacterized protein LOC103489633 isoform X15.2e-16093.42Show/hide
Query:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG
        RDSAIEP+ND     SAPSA+ N RLSGWERDWEVLDTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEG+RDVTPSVLES+TGLEVSKLSPKKWG
Subjt:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG

Query:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
        LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGT LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
Subjt:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA

Query:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
        GTQFWDEKMA+NLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSV QMSNQARWAVLQSYNLLKWHKHAHQVAVKA+E
Subjt:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE

Query:  SNVS
        S  S
Subjt:  SNVS

A0A5A7U732 Uncharacterized protein5.2e-16093.42Show/hide
Query:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG
        RDSAIEP+ND     SAPSA+ N RLSGWERDWEVLDTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEG+RDVTPSVLES+TGLEVSKLSPKKWG
Subjt:  RDSAIEPLND-----SAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG

Query:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
        LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGT LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
Subjt:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA

Query:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
        GTQFWDEKMA+NLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSV QMSNQARWAVLQSYNLLKWHKHAHQVAVKA+E
Subjt:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE

Query:  SNVS
        S  S
Subjt:  SNVS

A0A6J1D1P2 uncharacterized protein LOC1110167837.3e-16294.08Show/hide
Query:  RDSAIEPLN-----DSAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG
        RDSAIEP+N     DSAPSAL NPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGL+V+KLSPKKWG
Subjt:  RDSAIEPLN-----DSAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG

Query:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
        LSGSS YALIAFLGGTSFLLS+DIDIRPNLLALLGLAFLDSILLGGT LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
Subjt:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA

Query:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
        GTQFWDEKMA+NLAEGRLDGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSV QMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
Subjt:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE

Query:  SNVS
        S  S
Subjt:  SNVS

A0A6J1HZW5 uncharacterized protein LOC111468437 isoform X13.4e-15994.08Show/hide
Query:  RDSAIEPLN-----DSAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG
        R+SAIEP N     DSAPSAL NPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEG RDVTPSVLES+TGLEVSKLSPKKWG
Subjt:  RDSAIEPLN-----DSAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWG

Query:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
        LSGSSRYALIA LGGTSFLLSQDIDIRPNL ALLGLAFLDSILLGGT LAQISS WPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA
Subjt:  LSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQA

Query:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
        GTQFWDEKMA+NLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSV QMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
Subjt:  GTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE

Query:  SNVS
        S  S
Subjt:  SNVS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G56180.1 unknown protein9.2e-12577.49Show/hide
Query:  ERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPN
        ERDW+VLD CLNADDM+LV +A+ FL++RG L NFGK  +IVLEG R+VTP+VL+S+TGLEV+KLSPKKWGLSG S  AL A LGG S+LLSQ+ID+RPN
Subjt:  ERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPN

Query:  LLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMANNLAEGRLDGTSFDRYCMV
        L  +LGLA+LDS+ LGGT LAQ+S YWPP++RRI+VHEAGHLL AYLMGCPIRGVILDP+VAMQMG+QGQAGTQFWD+KM + +AEGRL G+SFDRY MV
Subjt:  LLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMANNLAEGRLDGTSFDRYCMV

Query:  LFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE
        LFAGIAAEALVYGEAEGGENDENLFRSI VLL+PPLSV QMSNQARW+VLQSYNLLKWHK AH+ AV+AL+
Subjt:  LFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALE

AT2G21960.1 unknown protein3.3e-2136.25Show/hide
Query:  SLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGG
        +++  S+++P Y+ RI  HEA H L AYL+G PI G  LD          G+      DE++A  +  G+LD    DR   V  AG+AAE L Y +  G 
Subjt:  SLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGG

Query:  ENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESNVSSLQ
          D    +      QP +S  Q  N  RWAVL S +LLK +K  H+  + A+  N S L+
Subjt:  ENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESNVSSLQ

AT5G27290.1 unknown protein1.9e-1629.91Show/hide
Query:  SKLSPKKWGLSGSSRYALIAFL-GGTSFLLSQDIDIRPNLLALLGLAFLDSILL----GGTSLAQIS----SYWPPYRRRILVHEAGHLLTAYLMGCPIR
        S LSP    L    R   IA + GG     + D+  +      LG  FL ++ L    GG     +     ++   Y  R++ HEAGH L AYL+G   R
Subjt:  SKLSPKKWGLSGSSRYALIAFL-GGTSFLLSQDIDIRPNLLALLGLAFLDSILL----GGTSLAQIS----SYWPPYRRRILVHEAGHLLTAYLMGCPIR

Query:  GVILDPIVAMQM--GIQGQAGTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQ
        G  L  + A+Q    +  QAG+ F D +    +  G++  T  +R+  +  AG+A E L+YG AEGG +D +    +   L    +  +  +Q RW+VL 
Subjt:  GVILDPIVAMQM--GIQGQAGTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQ

Query:  SYNLLKWHKHAHQVAVKALESNVS
        +  LL+ H+ A     +A+    S
Subjt:  SYNLLKWHKHAHQVAVKALESNVS

AT5G27290.2 unknown protein4.9e-0930Show/hide
Query:  SKLSPKKWGLSGSSRYALIAFL-GGTSFLLSQDIDIRPNLLALLGLAFLDSILL----GGTSLAQIS----SYWPPYRRRILVHEAGHLLTAYLMGCPIR
        S LSP    L    R   IA + GG     + D+  +      LG  FL ++ L    GG     +     ++   Y  R++ HEAGH L AYL+G   R
Subjt:  SKLSPKKWGLSGSSRYALIAFL-GGTSFLLSQDIDIRPNLLALLGLAFLDSILL----GGTSLAQIS----SYWPPYRRRILVHEAGHLLTAYLMGCPIR

Query:  GVILDPIVAMQM--GIQGQAGTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALV
        G  L  + A+Q    +  QAG+ F D +    +  G++  T  +R+  +  AG+A E L+
Subjt:  GVILDPIVAMQM--GIQGQAGTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALV

AT5G42655.1 Disease resistance-responsive (dirigent-like protein) family protein9.5e-2146.67Show/hide
Query:  MQPTTR-SDSGVFVFRRTLTEGPENTSRIVGNAQGFIIPNEQFAHS---------SQPER------HAKHIGHENREELAVVGGTGSFAFAQGVAIFLQT
        +QP  R    G  +FRRTLTEGPEN SRIVG A+GFIIP+E FA+S           PE        ++ + H+ +E + VVGGTG+FAFA+G+A+F + 
Subjt:  MQPTTR-SDSGVFVFRRTLTEGPENTSRIVGNAQGFIIPNEQFAHS---------SQPER------HAKHIGHENREELAVVGGTGSFAFAQGVAIFLQT

Query:  DRQPSDTDTTYHLKLQLQFP
        D    +  TTY +KL L+FP
Subjt:  DRQPSDTDTTYHLKLQLQFP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTCACTATTTTACCTCGAGAGGAGTTGGATTTGGATCCACGTAAATTGCCGTTTCCCAGTTTCCCCGCACTGCGTAAAGCTGCAATTTTGATTTTTGCACGTTT
CTCGAGTCCTTTGAGAGATAGCGCAATCGAACCCCTTAATGATTCAGCTCCGTCCGCTCTTGAGAATCCACGGTTGTCTGGTTGGGAGAGGGACTGGGAGGTACTAGACA
CTTGTTTGAATGCGGATGATATGAAGCTTGTTGCCAATGCTTATGGGTTTCTCAGGGATAGAGGATTTTTGCCCAATTTTGGAAAATGCAGGAACATTGTTTTGGAGGGT
CGAAGAGATGTCACGCCATCTGTGTTGGAATCTTCAACTGGATTAGAAGTGTCCAAGTTGTCTCCAAAGAAGTGGGGTCTTTCGGGCAGCTCTCGTTACGCTTTGATTGC
TTTTCTTGGTGGAACATCATTTCTTCTCTCACAGGACATAGATATCAGGCCGAACCTCTTGGCACTGCTGGGGCTAGCATTTTTGGATTCTATCCTCCTTGGTGGTACTA
GTCTAGCGCAAATCTCAAGCTATTGGCCGCCATATAGGCGTCGAATCCTTGTACATGAAGCTGGACATCTACTGACTGCTTACCTCATGGGTTGCCCAATTCGTGGAGTG
ATTTTGGATCCAATTGTTGCTATGCAAATGGGGATACAAGGGCAGGCAGGTACGCAGTTTTGGGATGAAAAAATGGCAAACAACCTTGCTGAAGGACGTTTAGATGGTAC
TTCCTTTGACAGGTACTGCATGGTCCTTTTTGCAGGCATTGCCGCTGAAGCGCTTGTTTATGGTGAAGCAGAGGGTGGAGAGAATGATGAAAATTTGTTTAGAAGTATCT
GCGTTCTTTTGCAACCCCCATTGTCTGTTTTGCAGATGTCAAATCAAGCAAGGTGGGCTGTTCTACAATCTTACAATCTGCTGAAGTGGCACAAACATGCACACCAAGTT
GCTGTTAAAGCTTTGGAAAGTAATGTCTCTTCACTTCAGCTACCTGCTAATTGTCCGGTTCTTTCAAGGATATGTGTGTTCGATGCCCTTGTTTTGAATCTTAGGAAAAA
GGTGACATTGGAGGGCTGGAAGAATTATATTCTGCGCTGCAGTCTGCTTGGCCTTCCTGGCTGTTATTCTCTTGGCTCTGCTTTCACCGGTACCCCACAGAAAGCAGTTG
AAGCACGGCAGAAAACCCCATGGGCGGACCTGTCCCTCTACATTCAACAGCCTCATTCTACAGCAAATGCTAGATCTAATAAGAGTATGCAGCCCACAACAAGATCTGAT
TCCGGGGTTTTTGTCTTCAGACGAACACTCACAGAGGGACCTGAGAACACTTCCCGGATCGTCGGAAATGCTCAAGGTTTCATTATTCCTAACGAACAGTTTGCTCATTC
ATCGCAGCCTGAGCGTCATGCCAAACACATTGGCCACGAGAATAGAGAAGAACTGGCGGTGGTAGGGGGGACAGGTTCTTTTGCTTTTGCACAAGGGGTAGCTATTTTTC
TACAGACAGATAGGCAGCCATCTGATACAGATACAACTTATCATTTAAAGCTTCAACTTCAATTCCCCAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGTCACTATTTTACCTCGAGAGGAGTTGGATTTGGATCCACGTAAATTGCCGTTTCCCAGTTTCCCCGCACTGCGTAAAGCTGCAATTTTGATTTTTGCACGTTT
CTCGAGTCCTTTGAGAGATAGCGCAATCGAACCCCTTAATGATTCAGCTCCGTCCGCTCTTGAGAATCCACGGTTGTCTGGTTGGGAGAGGGACTGGGAGGTACTAGACA
CTTGTTTGAATGCGGATGATATGAAGCTTGTTGCCAATGCTTATGGGTTTCTCAGGGATAGAGGATTTTTGCCCAATTTTGGAAAATGCAGGAACATTGTTTTGGAGGGT
CGAAGAGATGTCACGCCATCTGTGTTGGAATCTTCAACTGGATTAGAAGTGTCCAAGTTGTCTCCAAAGAAGTGGGGTCTTTCGGGCAGCTCTCGTTACGCTTTGATTGC
TTTTCTTGGTGGAACATCATTTCTTCTCTCACAGGACATAGATATCAGGCCGAACCTCTTGGCACTGCTGGGGCTAGCATTTTTGGATTCTATCCTCCTTGGTGGTACTA
GTCTAGCGCAAATCTCAAGCTATTGGCCGCCATATAGGCGTCGAATCCTTGTACATGAAGCTGGACATCTACTGACTGCTTACCTCATGGGTTGCCCAATTCGTGGAGTG
ATTTTGGATCCAATTGTTGCTATGCAAATGGGGATACAAGGGCAGGCAGGTACGCAGTTTTGGGATGAAAAAATGGCAAACAACCTTGCTGAAGGACGTTTAGATGGTAC
TTCCTTTGACAGGTACTGCATGGTCCTTTTTGCAGGCATTGCCGCTGAAGCGCTTGTTTATGGTGAAGCAGAGGGTGGAGAGAATGATGAAAATTTGTTTAGAAGTATCT
GCGTTCTTTTGCAACCCCCATTGTCTGTTTTGCAGATGTCAAATCAAGCAAGGTGGGCTGTTCTACAATCTTACAATCTGCTGAAGTGGCACAAACATGCACACCAAGTT
GCTGTTAAAGCTTTGGAAAGTAATGTCTCTTCACTTCAGCTACCTGCTAATTGTCCGGTTCTTTCAAGGATATGTGTGTTCGATGCCCTTGTTTTGAATCTTAGGAAAAA
GGTGACATTGGAGGGCTGGAAGAATTATATTCTGCGCTGCAGTCTGCTTGGCCTTCCTGGCTGTTATTCTCTTGGCTCTGCTTTCACCGGTACCCCACAGAAAGCAGTTG
AAGCACGGCAGAAAACCCCATGGGCGGACCTGTCCCTCTACATTCAACAGCCTCATTCTACAGCAAATGCTAGATCTAATAAGAGTATGCAGCCCACAACAAGATCTGAT
TCCGGGGTTTTTGTCTTCAGACGAACACTCACAGAGGGACCTGAGAACACTTCCCGGATCGTCGGAAATGCTCAAGGTTTCATTATTCCTAACGAACAGTTTGCTCATTC
ATCGCAGCCTGAGCGTCATGCCAAACACATTGGCCACGAGAATAGAGAAGAACTGGCGGTGGTAGGGGGGACAGGTTCTTTTGCTTTTGCACAAGGGGTAGCTATTTTTC
TACAGACAGATAGGCAGCCATCTGATACAGATACAACTTATCATTTAAAGCTTCAACTTCAATTCCCCAAATGA
Protein sequenceShow/hide protein sequence
MAVTILPREELDLDPRKLPFPSFPALRKAAILIFARFSSPLRDSAIEPLNDSAPSALENPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEG
RRDVTPSVLESSTGLEVSKLSPKKWGLSGSSRYALIAFLGGTSFLLSQDIDIRPNLLALLGLAFLDSILLGGTSLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGV
ILDPIVAMQMGIQGQAGTQFWDEKMANNLAEGRLDGTSFDRYCMVLFAGIAAEALVYGEAEGGENDENLFRSICVLLQPPLSVLQMSNQARWAVLQSYNLLKWHKHAHQV
AVKALESNVSSLQLPANCPVLSRICVFDALVLNLRKKVTLEGWKNYILRCSLLGLPGCYSLGSAFTGTPQKAVEARQKTPWADLSLYIQQPHSTANARSNKSMQPTTRSD
SGVFVFRRTLTEGPENTSRIVGNAQGFIIPNEQFAHSSQPERHAKHIGHENREELAVVGGTGSFAFAQGVAIFLQTDRQPSDTDTTYHLKLQLQFPK