; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g0024 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g0024
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionATP-dependent zinc metalloprotease
Genome locationMC04:179394..184564
RNA-Seq ExpressionMC04g0024
SyntenyMC04g0024
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0016020 - membrane (cellular component)
GO:0004176 - ATP-dependent peptidase activity (molecular function)
GO:0004222 - metalloendopeptidase activity (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domainsIPR037219 - Peptidase M41-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139896.1 uncharacterized protein LOC101213430 [Cucumis sativus]4.08e-25990.32Show/hide
Query:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERL--QRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADS
        MAI SPPKL ISSS    Q F + I FHF QK P GI ++FHLER   QRLL L RALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEP+ DSA A S
Subjt:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERL--QRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADS

Query:  APSALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGT
        APSA+RN RLSGWERDWEVLDTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEGRRDVTPSVLE +TGL+V+KLSPKKWGLSGSS YALIAFLGGT
Subjt:  APSALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGT

Query:  SFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
        SFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Subjt:  SFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG

Query:  RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL
        RLDGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVIRKIEDAL
Subjt:  RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL

Query:  STN
        STN
Subjt:  STN

XP_008447096.1 PREDICTED: uncharacterized protein LOC103489633 isoform X1 [Cucumis melo]8.61e-26190.57Show/hide
Query:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLER--LQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADS
        MAI SPPKL ISSS L  Q F + I FHF QK P GI +HFHL+R   QRLL L RALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEP+NDSA A S
Subjt:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLER--LQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADS

Query:  APSALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGT
        APSA+ N RLSGWERDWEVLDTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEG+RDVTPSVLES+TGL+V+KLSPKKWGLSGSS YALIAFLGGT
Subjt:  APSALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGT

Query:  SFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
        SFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Subjt:  SFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG

Query:  RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL
        RLDGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVIR+IEDAL
Subjt:  RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL

Query:  STN
        STN
Subjt:  STN

XP_022147989.1 uncharacterized protein LOC111016783 [Momordica charantia]4.07e-293100Show/hide
Query:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAP
        MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAP
Subjt:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAP

Query:  SALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSF
        SALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSF
Subjt:  SALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSF

Query:  LLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
        LLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
Subjt:  LLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL

Query:  DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALST
        DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALST
Subjt:  DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALST

Query:  NG
        NG
Subjt:  NG

XP_022969425.1 uncharacterized protein LOC111468437 isoform X1 [Cucurbita maxima]4.06e-25689.53Show/hide
Query:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAP
        M+I SPPKL IS S L FQ F   + FHF QK   GI  HFHL+R QRLL LPRA+REWQ+YEEAVKRKDLAEALRFLESF R+SAIEP NDSA ADSAP
Subjt:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAP

Query:  SALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSF
        SAL NPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEG RDVTPSVLES+TGL+V+KLSPKKWGLSGSS YALIA LGGTSF
Subjt:  SALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSF

Query:  LLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
        LLS+DIDIRPNL ALLGLAFLDSILLGGTCLAQISS WPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
Subjt:  LLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL

Query:  DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALST
        DGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIR++E+ALST
Subjt:  DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALST

Query:  N
        N
Subjt:  N

XP_038888049.1 uncharacterized protein LOC120077976 isoform X1 [Benincasa hispida]8.29e-26190.52Show/hide
Query:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAP
        MA+ SPPKL ISSS L FQ   + I F+F QK P GI +HF+LER QRLL L RAL EWQDYEEAVKRKDLAEALRFLESFDRDSAIEP+NDSA A SAP
Subjt:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAP

Query:  SALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSF
        SAL NPRLSGWERDWEVLDTCLNADDMKLVA+AYGFLRDRGFLPNFGK RNIVLEGRRDVTPSVLES+TGL+V+KLSPKKWG+SGSS YALIAFLGGTSF
Subjt:  SALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSF

Query:  LLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
        LLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMAS+LAEGRL
Subjt:  LLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL

Query:  DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALST
        DGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQ AVKALESGSSLSVVIR+IEDALST
Subjt:  DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALST

Query:  N
        N
Subjt:  N

TrEMBL top hitse value%identityAlignment
A0A0A0K7I5 Uncharacterized protein1.98e-25990.32Show/hide
Query:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERL--QRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADS
        MAI SPPKL ISSS    Q F + I FHF QK P GI ++FHLER   QRLL L RALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEP+ DSA A S
Subjt:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERL--QRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADS

Query:  APSALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGT
        APSA+RN RLSGWERDWEVLDTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEGRRDVTPSVLE +TGL+V+KLSPKKWGLSGSS YALIAFLGGT
Subjt:  APSALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGT

Query:  SFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
        SFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Subjt:  SFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG

Query:  RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL
        RLDGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVIRKIEDAL
Subjt:  RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL

Query:  STN
        STN
Subjt:  STN

A0A1S3BH83 uncharacterized protein LOC103489633 isoform X14.17e-26190.57Show/hide
Query:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLER--LQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADS
        MAI SPPKL ISSS L  Q F + I FHF QK P GI +HFHL+R   QRLL L RALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEP+NDSA A S
Subjt:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLER--LQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADS

Query:  APSALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGT
        APSA+ N RLSGWERDWEVLDTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEG+RDVTPSVLES+TGL+V+KLSPKKWGLSGSS YALIAFLGGT
Subjt:  APSALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGT

Query:  SFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
        SFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Subjt:  SFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG

Query:  RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL
        RLDGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVIR+IEDAL
Subjt:  RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL

Query:  STN
        STN
Subjt:  STN

A0A5A7U732 Uncharacterized protein4.17e-26190.57Show/hide
Query:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLER--LQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADS
        MAI SPPKL ISSS L  Q F + I FHF QK P GI +HFHL+R   QRLL L RALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEP+NDSA A S
Subjt:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLER--LQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADS

Query:  APSALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGT
        APSA+ N RLSGWERDWEVLDTCLNADDMKLVANAY FL+DRGFLPNFGKCRNIVLEG+RDVTPSVLES+TGL+V+KLSPKKWGLSGSS YALIAFLGGT
Subjt:  APSALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGT

Query:  SFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
        SFLLS+DIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG
Subjt:  SFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEG

Query:  RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL
        RLDGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKA+ESGSSLSVVIR+IEDAL
Subjt:  RLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL

Query:  STN
        STN
Subjt:  STN

A0A6J1D1P2 uncharacterized protein LOC1110167831.97e-293100Show/hide
Query:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAP
        MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAP
Subjt:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAP

Query:  SALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSF
        SALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSF
Subjt:  SALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSF

Query:  LLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
        LLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
Subjt:  LLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL

Query:  DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALST
        DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALST
Subjt:  DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALST

Query:  NG
        NG
Subjt:  NG

A0A6J1HZW5 uncharacterized protein LOC111468437 isoform X11.97e-25689.53Show/hide
Query:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAP
        M+I SPPKL IS S L FQ F   + FHF QK   GI  HFHL+R QRLL LPRA+REWQ+YEEAVKRKDLAEALRFLESF R+SAIEP NDSA ADSAP
Subjt:  MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAP

Query:  SALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSF
        SAL NPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEG RDVTPSVLES+TGL+V+KLSPKKWGLSGSS YALIA LGGTSF
Subjt:  SALRNPRLSGWERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSF

Query:  LLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
        LLS+DIDIRPNL ALLGLAFLDSILLGGTCLAQISS WPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL
Subjt:  LLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRL

Query:  DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALST
        DGTSFDRYCM+LFAGIAAEALVYGEAEGGENDENLFRSIC+LLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIR++E+ALST
Subjt:  DGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALST

Query:  N
        N
Subjt:  N

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G56180.1 unknown protein2.1e-14565.59Show/hide
Query:  SPPKLQISSSSLYFQPFRHQISFHF--LQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSA
        SPP L+  S S     F  QI F    +Q    G  R   L R       P ALREW++YE+AVKRKDLA ALRFL+S + D   + V     A      
Subjt:  SPPKLQISSSSLYFQPFRHQISFHF--LQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSA

Query:  LRNPRLSG-----WERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGG
            +LSG      ERDW+VLD CLNADDM+LV +A+ FL++RG L NFGK  +IVLEG R+VTP+VL+S+TGL+VTKLSPKKWGLSG SS AL A LGG
Subjt:  LRNPRLSG-----WERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGG

Query:  TSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAE
         S+LLS++ID+RPNL  +LGLA+LDS+ LGGTCLAQ+S YWPP++RRI+VHEAGHLL AYLMGCPIRGVILDP+VAMQMG+QGQAGTQFWD+KM S +AE
Subjt:  TSFLLSRDIDIRPNLLALLGLAFLDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAE

Query:  GRLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDA
        GRL G+SFDRY M+LFAGIAAEALVYGEAEGGENDENLFRSI +LL+PPLSVAQMSNQARW+VLQSYNLLKWHK AH+ AV+AL+ GS LS+VIR+IE+A
Subjt:  GRLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDA

Query:  LSTN
        +S++
Subjt:  LSTN

AT2G21960.1 unknown protein1.1e-2135.33Show/hide
Query:  LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRLDGTSFDRYCMILFAGIAAEALVYGEAEGGE
        ++  S+++P Y+ RI  HEA H L AYL+G PI G  LD          G+      DE++A  +  G+LD    DR   +  AG+AAE L Y +  G  
Subjt:  LAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRLDGTSFDRYCMILFAGIAAEALVYGEAEGGE

Query:  NDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDA
         D    +      QP +S  Q  N  RWAVL S +LLK +K  H+  + A+   +S+   I+ IE A
Subjt:  NDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDA

AT5G27290.1 unknown protein1.0e-1934.38Show/hide
Query:  YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM--GIQGQAGTQFWDEKMASNLAEGRLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRS
        Y  R++ HEAGH L AYL+G   RG  L  + A+Q    +  QAG+ F D +    +  G++  T  +R+  I  AG+A E L+YG AEGG +D +    
Subjt:  YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM--GIQGQAGTQFWDEKMASNLAEGRLDGTSFDRYCMILFAGIAAEALVYGEAEGGENDENLFRS

Query:  ICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL
        +   L    +  +  +Q RW+VL +  LL+ H+ A     +A+  G S+   I+ IED++
Subjt:  ICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDAL

AT5G27290.2 unknown protein3.7e-0936.9Show/hide
Query:  YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM--GIQGQAGTQFWDEKMASNLAEGRLDGTSFDRYCMILFAGIAAEALV
        Y  R++ HEAGH L AYL+G   RG  L  + A+Q    +  QAG+ F D +    +  G++  T  +R+  I  AG+A E L+
Subjt:  YRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQM--GIQGQAGTQFWDEKMASNLAEGRLDGTSFDRYCMILFAGIAAEALV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTATCCCGAGTCCTCCCAAACTCCAGATTTCCTCTTCTTCTCTCTATTTCCAACCTTTTCGGCACCAAATTTCCTTCCATTTCCTGCAAAAAACCCCTCGTGGAAT
CACTAGACATTTCCATTTAGAACGCCTTCAGCGTCTCCTCCATCTGCCCAGAGCTCTTCGTGAATGGCAAGACTACGAAGAGGCAGTGAAGCGCAAGGACCTCGCAGAAG
CTCTTAGGTTTCTCGAGTCCTTTGACAGAGATAGCGCAATCGAACCCGTTAATGATTCAGCCGCTGCTGATTCAGCTCCGTCCGCTCTTCGAAATCCACGGTTGTCTGGC
TGGGAGCGGGACTGGGAGGTACTCGACACTTGTTTGAATGCGGATGATATGAAGCTTGTTGCCAATGCTTATGGGTTTCTCAGGGATAGAGGATTTTTGCCCAATTTTGG
AAAATGCAGGAACATTGTTTTGGAGGGTCGAAGAGACGTCACACCATCTGTGTTGGAATCTTCGACTGGATTACAAGTGACCAAGTTGTCTCCAAAGAAGTGGGGTCTTT
CAGGCAGCTCTAGTTACGCTTTGATTGCCTTTCTTGGTGGAACATCATTTCTGCTCTCACGGGACATTGATATTAGGCCGAACCTTTTGGCACTGCTGGGGCTTGCATTT
TTGGACTCTATCCTCCTTGGTGGTACGTGTCTAGCGCAAATCTCTAGCTATTGGCCACCATATAGGCGTCGAATCCTTGTACATGAAGCTGGACATCTACTGACTGCTTA
TCTCATGGGTTGCCCAATTCGTGGAGTTATTTTGGATCCAATTGTTGCCATGCAAATGGGGATACAGGGACAGGCAGGTACCCAGTTTTGGGATGAAAAAATGGCAAGCA
ACCTTGCTGAAGGACGTTTGGATGGTACTTCTTTTGACAGGTACTGCATGATCCTTTTTGCGGGCATTGCAGCTGAAGCTCTTGTTTACGGTGAAGCAGAGGGTGGAGAG
AATGATGAAAATTTGTTTAGAAGTATCTGCATTCTTTTGCAACCGCCACTATCTGTTGCGCAGATGTCAAATCAAGCAAGGTGGGCTGTTCTACAATCTTACAATCTGCT
GAAGTGGCACAAACATGCACACCAAGTTGCCGTTAAAGCTTTGGAAAGTGGAAGCAGTCTCAGTGTTGTAATTAGGAAAATCGAGGATGCTTTGTCAACCAATGGATGA
mRNA sequenceShow/hide mRNA sequence
GGTCTGTTCGGATTGCATAAACAAACATTGTTGAATTTTATCTACCAAACCGGGCCGTATTTGCTTCGTTATGGAAGAGCAAAATGCGAAGGGTGAATAATAAGAAATCA
GTCAATCACAAATTAATCTAAATCCACCAGATCCAGATGAAGAACGAATCTAAACCCAAGTGGTAAACGTACCAGAATTAGAGCCCTGTATCTATCTCTATCTATGCTTC
ACCACCACCAGGATTATCCACAGAATGAGCCGAGCAGAGGAACAATATATAGGTTTGAACCACTGCACATCGCCCACAGTCACACAACAACTCCAGATGGTTGAAAGTTG
AGAAACAGTTGATGAGAAGTAAAAAGCCAATATCCATCACTAAAACTAAATCAATCCTTATCCAGATTGGTTTACAAGCGAAGTAATATGATGGAACATTTAGACATGCT
CGCTCAGGAGCCACGAGTTGACAGGAAGAGCCAGCAGCTGCAGCGTTGTTGGTTGATTTGATTTGAATGTCTGCCTTCTATTTCTACCACCGCAACCGGTCGGTTATGGC
CTTATGGACCAAAACGACACATCGTTCCATGGTCCACGGCCCCTTAACTTAACCCAGTCCTTCGTGTAAAAGTAAAACTCCTCTATTTTGACGAGTTCCCCAGATTCTCA
AAAACTCCACTTTTGCACCTACTCCGTCTCATGGCTATCCCGAGTCCTCCCAAACTCCAGATTTCCTCTTCTTCTCTCTATTTCCAACCTTTTCGGCACCAAATTTCCTT
CCATTTCCTGCAAAAAACCCCTCGTGGAATCACTAGACATTTCCATTTAGAACGCCTTCAGCGTCTCCTCCATCTGCCCAGAGCTCTTCGTGAATGGCAAGACTACGAAG
AGGCAGTGAAGCGCAAGGACCTCGCAGAAGCTCTTAGGTTTCTCGAGTCCTTTGACAGAGATAGCGCAATCGAACCCGTTAATGATTCAGCCGCTGCTGATTCAGCTCCG
TCCGCTCTTCGAAATCCACGGTTGTCTGGCTGGGAGCGGGACTGGGAGGTACTCGACACTTGTTTGAATGCGGATGATATGAAGCTTGTTGCCAATGCTTATGGGTTTCT
CAGGGATAGAGGATTTTTGCCCAATTTTGGAAAATGCAGGAACATTGTTTTGGAGGGTCGAAGAGACGTCACACCATCTGTGTTGGAATCTTCGACTGGATTACAAGTGA
CCAAGTTGTCTCCAAAGAAGTGGGGTCTTTCAGGCAGCTCTAGTTACGCTTTGATTGCCTTTCTTGGTGGAACATCATTTCTGCTCTCACGGGACATTGATATTAGGCCG
AACCTTTTGGCACTGCTGGGGCTTGCATTTTTGGACTCTATCCTCCTTGGTGGTACGTGTCTAGCGCAAATCTCTAGCTATTGGCCACCATATAGGCGTCGAATCCTTGT
ACATGAAGCTGGACATCTACTGACTGCTTATCTCATGGGTTGCCCAATTCGTGGAGTTATTTTGGATCCAATTGTTGCCATGCAAATGGGGATACAGGGACAGGCAGGTA
CCCAGTTTTGGGATGAAAAAATGGCAAGCAACCTTGCTGAAGGACGTTTGGATGGTACTTCTTTTGACAGGTACTGCATGATCCTTTTTGCGGGCATTGCAGCTGAAGCT
CTTGTTTACGGTGAAGCAGAGGGTGGAGAGAATGATGAAAATTTGTTTAGAAGTATCTGCATTCTTTTGCAACCGCCACTATCTGTTGCGCAGATGTCAAATCAAGCAAG
GTGGGCTGTTCTACAATCTTACAATCTGCTGAAGTGGCACAAACATGCACACCAAGTTGCCGTTAAAGCTTTGGAAAGTGGAAGCAGTCTCAGTGTTGTAATTAGGAAAA
TCGAGGATGCTTTGTCAACCAATGGATGAAAAGTGGAAACAACAACATATGCGTTTCATTTCTCTTCACGTCCCTGTGGAAGGTAATTACTCGGATGCAGGGGTAGCAGT
CTTACTGATTAATAAGCGGGGTAATAATATATCAGTTCGAGAGTCACAGTCTCTTCCTTCAATCATGTCGGCTACCTGCTTATTGGCCAGTTCTTTCGAGGATATGTGTG
TTCAATGCCCACGTTTTGAATCTTAGGAAAAAGTGACACTGGCTGGAAGGTAGAGCATTAAAGGAGTTAATCATGAAACTTGACTCAGATTAGACTTTTGTTGTGTATAA
CCTTCACTTCTCTGCTGCACCGCAGTTTCAAATTTCCGAAACTTTCCTTGCCTGAAATCTTTTCCTGGCAGTTCTCTACACTGCCCATATATAGAATAGAGTGAGTGAGT
TACACACAAAAGGCCAACCTTGCCCATCTATCGTCTATTCCATGTACTTAATTTGGTTGAAGACTTTGTATATATGGCAGAATTCCTCCTGTAAGCCACCGGTTTACCTG
GCTTTGTATTGCAGAAATGACATCATAATGCATTTTTGCTGTGTTAGATCACTGTGCATTTTGTTTCAGAATGGGACAGTTCCTTGCTTGTGAACATAAGGGCATCCAAT
TGAATTATTTATTCTGATGAGGATCTAAAATTAGCATATTTCAATCACTTAAGCTCCATCAAAATGATGAAACCCATATTTTCATCTGGATCAAAACCATCGTTCCAATG
TACCAATAAGGGCATATTCATGGTCTCTGCGCCAGGATGAACACGTCAAAGCGGTTACCTCATATTGCAGCATAAGCAGGACTGACTTAAAGTCGCTGGATTTAGTGGTG
GATTTGTTATCTGCGGCACACCAAATCCTTTTGGAAGATTGGCCTTTCAAAAAAGGTGTTCTTTTGAAATAGCGACCCTTCATGGATAACATGGTGACTACTGACTGATG
GATTAGGTGCAAGACTTAAAAGTTGAATTAGTTGGCCTAAACTTTTGACAGAGTGAGCAAGTTATTTGTATAAGGTACATTTTTGAAAGAGTCGGCCTTTTCAATTCATG
ATGAAGACTAGAATTATTTTCTGCGCTGCGGTATGCTTGGCCTTCCTAGTTGTTATTCTTTTGGCTCTGCTTTCGCCGGTCCCCAACAAAAAGCAGTCGAAGCACGGCAG
AAAACCATCGTGGGCGGACCTGTCCCTCTACATTCAGCAGCCACATTCTACAGGAAATGCTAAATCTAACAATATGCAGCCCGTACCAAGATCCGATTCTGGGGTTTTCG
TATTCCTACGAACGCTCACGGAGGGACCTGAGAACACTTCCCGGATTGTTGGAAATGCTCGAGGTTTCATTATTCCTAACGAACAGTTTGCTCATTCTTCATTCAATGTC
ATCTATCTCAGTTTCGACACACCGGAATATTCAGGCAGCCTGAGCGTGCACGCCAAGCATATTGGACATGAGAATAGAGAACTGGCGGTGGTAGGGGGGACAGGTTCTTT
TGCTTTTGCACAAGGGATAGCTATTTTTCTACAGACAGATGGGCAGGCATCTGTTACAGATACAACTTATCATTTAAAGCTTCAACTTCAATTCCCCAAATGACAGTGCA
AAGTAGTCCCACATGCTTCTTTCTTTTGTGATATATAAAGCACAATTTTGTGTACCAATGGGGATGCATGACCATGTTCCTCGTCATTCCAGTTGTAAAAACTGCTTGGC
TAAGACATGCTTTGGTATTCAATAATTATATGGCCATGGGTTGCTCCTAAATACACACTGCATACGTTTCGCAGCATCTGGAACTTGAGATGCCATTGATTATTTGTTTA
TTTATTTTAATTATTTTTAGTTTTCTGCCTCAAACTGTTTTTAGTAATATGATGAAGTAACTGACTTTGTTACAATGAACCCTTTTGACAACTGGTTTATATCCATGATG
G
Protein sequenceShow/hide protein sequence
MAIPSPPKLQISSSSLYFQPFRHQISFHFLQKTPRGITRHFHLERLQRLLHLPRALREWQDYEEAVKRKDLAEALRFLESFDRDSAIEPVNDSAAADSAPSALRNPRLSG
WERDWEVLDTCLNADDMKLVANAYGFLRDRGFLPNFGKCRNIVLEGRRDVTPSVLESSTGLQVTKLSPKKWGLSGSSSYALIAFLGGTSFLLSRDIDIRPNLLALLGLAF
LDSILLGGTCLAQISSYWPPYRRRILVHEAGHLLTAYLMGCPIRGVILDPIVAMQMGIQGQAGTQFWDEKMASNLAEGRLDGTSFDRYCMILFAGIAAEALVYGEAEGGE
NDENLFRSICILLQPPLSVAQMSNQARWAVLQSYNLLKWHKHAHQVAVKALESGSSLSVVIRKIEDALSTNG