; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS009933 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS009933
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionDUF4378 domain-containing protein
Genome locationscaffold943_1:119123..121558
RNA-Seq ExpressionMS009933
SyntenyMS009933
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008462543.1 PREDICTED: uncharacterized protein LOC103500875 [Cucumis melo]8.1e-22682.41Show/hide
Query:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV
        MMAQKHLH+LLE+DQEPFHLN+YIAEKRVNLKRVSPK+ LQV KRKPIST SIF GNFCRNACFTSF PSPD RKSPLFEF SPARN    SPNAIFLH+
Subjt:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV

Query:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE
        PARTA LLLEAALKIHKQKSS K KK+QIKNQG ARFGSVLKRLTLRNRN NR +EACGSG DLASF QRKSSIRR+  QGETSS NGRSSYGFWSE+NE
Subjt:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE

Query:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF
        E  SMDLGTSCSSQSEDSEETSVAY G DYCESPFRFVLQRSPS+GCRTPDF SPA SPCRRNKED  I   ESL KFQV EDEEDKEQCSPVS+LD PF
Subjt:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF

Query:  DDSYDERHDDRVRD---RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEV
        DDSYDE H +R RD     E+YD+ECSYA VQRTKQQLLNKLRRFERLADLDPIELEKIMV+E+  E +Y+YF NEECEYY   VQW NENDIEWFVKEV
Subjt:  DDSYDERHDDRVRD---RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEV

Query:  ASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEE
        AS+ + CKS++FLPQD+RKLV DLIAEEEAD+ + NTREEVI+RVC RLELWKEVEFNTIDMMVEEDL+KEV EWK+NQEQRGEAA DLELAIFSLLVEE
Subjt:  ASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEE

Query:  LAVELA
        LAVELA
Subjt:  LAVELA

XP_022143695.1 uncharacterized protein LOC111013540 [Momordica charantia]7.5e-28099.6Show/hide
Query:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARNSPNAIFLHVPART
        MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARNSPNAIFLHVPART
Subjt:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARNSPNAIFLHVPART

Query:  AALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNEEERS
        AALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNEEERS
Subjt:  AALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNEEERS

Query:  MDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPFDDSY
        MDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTP FQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPFDDSY
Subjt:  MDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPFDDSY

Query:  DERHDDRVRDRVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEVASDTSSC
        DERHDDRVRDRVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFS+EECEYYKSPVQWHNENDIEWFVKEVASDTSSC
Subjt:  DERHDDRVRDRVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEVASDTSSC

Query:  KSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEELAVELAP
        KSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEELAVELAP
Subjt:  KSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEELAVELAP

XP_022925872.1 uncharacterized protein LOC111433152 isoform X1 [Cucurbita moschata]1.4e-21778.04Show/hide
Query:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV
        MM  KHLHQLLEEDQEPFHLN+YIAEKRVNLKRVS K+DLQV KRKPIST SIF GNFC+NACFTSFQPSPD RKSPLF+F SPAR+    SPNAIFLH+
Subjt:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV

Query:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE
        PARTAALLLEAALKIHKQKSS K KKTQIKNQG ARFGSVLKRLTLRNRN NR++  CG G +LASFGQRKSS+RR + QGETSS+NGRSSYGFWSE+NE
Subjt:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE

Query:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF
        E RSMDLGTSCSSQSEDSEETSVAY G DYCESPFRFVLQRSPS+GCRTPDF SPA SPC R KED+ ++  ESLKK Q  +DEEDKEQCSPVS+LD PF
Subjt:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF

Query:  DDSYDERHDDRVRD-------RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWF
        D SYDE H DR RD         EDY LECSYA VQRTKQQLLNKLRRFE+LADLDPIELEK+M++E+  E D+DYF+NEECEYY    Q +NEN+IE F
Subjt:  DDSYDERHDDRVRD-------RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWF

Query:  VKEVASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSL
        VKEVA   + CKS+ FLP+DMRKLV DL++EEEAD+ N  TRE+VIQRVCKRLE+WKEV+FNTIDMMVEEDL+KEVDEWKKNQ QRGE A DLE+AIFSL
Subjt:  VKEVASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSL

Query:  LVEELAVELA
        LVEELAVEL+
Subjt:  LVEELAVELA

XP_031744144.1 uncharacterized protein LOC101207103 [Cucumis sativus]9.6e-22782.41Show/hide
Query:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV
        MMAQKHLH+LLE+DQEPFHLN+YIAEKRVNLKRVSPK+ LQV KRKPIST SIF GNFCRNACFTSF PSPD RKSPLFEF SPARN    SPNAIFLH+
Subjt:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV

Query:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE
        PARTA LLLEAALKIHKQKSS K KK+QIKNQG ARFGSVLKRLTLRNRN NR++EACGSG DLASFGQRKSSIRR+  QGETSS NGRSSYGFWSE+NE
Subjt:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE

Query:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF
        E  SMDLGTSCSSQSEDSEETSVAY G DYCESPFRFVLQRSPS+GCRTPDF SPA SPC RNKED  +   ESL KFQV EDEEDKEQCSPVS+LD PF
Subjt:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF

Query:  DDSYDERHDDRVRD---RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEV
        DDSYDE H DR RD     EDYD+ECSYA VQRTKQQLLNKLRRFERLADLDPIELEKIM++E+Q E +Y+YF N ECEYY   VQW NENDIEWFV+EV
Subjt:  DDSYDERHDDRVRD---RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEV

Query:  ASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEE
        ASD + CKS++FLPQDMRKLV DL+AEEEAD+ + NTREEVIQRVC RLELWKEVEFNTIDMMVEEDL+KEV EWK+NQEQR EAA DLELAIFSLLVEE
Subjt:  ASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEE

Query:  LAVELA
        LAVELA
Subjt:  LAVELA

XP_038881414.1 uncharacterized protein LOC120072951 [Benincasa hispida]6.4e-23183.3Show/hide
Query:  MAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHVP
        MAQKHLH+LLEEDQEPFHLN+YIAEKRVNLKRVSPK+ LQV KRKPIST SIF GNFCRNACFTSF PSPD RKSPLFEF SPARN    SPNAIFLH+P
Subjt:  MAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHVP

Query:  ARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNEE
        ARTA LLLEAALKIHKQKSS K KK+QIKNQG ARFGSVLKRLTLRNRN NR++EACGSG DLASFGQRKSSIRR++ QGETSSYNGRSSYGFWSE+NEE
Subjt:  ARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNEE

Query:  ERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPFD
         RSMDLGTSCSSQSEDSEETSVAY G DYCESPFRFVLQRSPS+GCRTPDF SPA SPCRRNKED+ +D  E L KFQV EDEEDKEQCSPVS+LD PFD
Subjt:  ERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPFD

Query:  DSYDERHDDRVRDR-VEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEVASD
        DSYDE HDDR RDR  E+YDLECSYA VQRTKQQLLNKLRRFERLADLDPIELEKIM++E+  E +Y+Y  NEECEYY   V+W NEN IEWFVKEVA++
Subjt:  DSYDERHDDRVRDR-VEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEVASD

Query:  TSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEELAV
         + CKS++F+P+DMRKLV DLIAEEEAD+ N +TREEVIQRVCKRLELWKEVEFNTIDMMVEEDL+KEV EWK+NQEQRGEAA DLELAIFSLLVEELAV
Subjt:  TSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEELAV

Query:  ELA
        ELA
Subjt:  ELA

TrEMBL top hitse value%identityAlignment
A0A0A0KFA1 Uncharacterized protein4.6e-22782.41Show/hide
Query:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV
        MMAQKHLH+LLE+DQEPFHLN+YIAEKRVNLKRVSPK+ LQV KRKPIST SIF GNFCRNACFTSF PSPD RKSPLFEF SPARN    SPNAIFLH+
Subjt:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV

Query:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE
        PARTA LLLEAALKIHKQKSS K KK+QIKNQG ARFGSVLKRLTLRNRN NR++EACGSG DLASFGQRKSSIRR+  QGETSS NGRSSYGFWSE+NE
Subjt:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE

Query:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF
        E  SMDLGTSCSSQSEDSEETSVAY G DYCESPFRFVLQRSPS+GCRTPDF SPA SPC RNKED  +   ESL KFQV EDEEDKEQCSPVS+LD PF
Subjt:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF

Query:  DDSYDERHDDRVRD---RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEV
        DDSYDE H DR RD     EDYD+ECSYA VQRTKQQLLNKLRRFERLADLDPIELEKIM++E+Q E +Y+YF N ECEYY   VQW NENDIEWFV+EV
Subjt:  DDSYDERHDDRVRD---RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEV

Query:  ASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEE
        ASD + CKS++FLPQDMRKLV DL+AEEEAD+ + NTREEVIQRVC RLELWKEVEFNTIDMMVEEDL+KEV EWK+NQEQR EAA DLELAIFSLLVEE
Subjt:  ASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEE

Query:  LAVELA
        LAVELA
Subjt:  LAVELA

A0A1S3CHP7 uncharacterized protein LOC1035008753.9e-22682.41Show/hide
Query:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV
        MMAQKHLH+LLE+DQEPFHLN+YIAEKRVNLKRVSPK+ LQV KRKPIST SIF GNFCRNACFTSF PSPD RKSPLFEF SPARN    SPNAIFLH+
Subjt:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV

Query:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE
        PARTA LLLEAALKIHKQKSS K KK+QIKNQG ARFGSVLKRLTLRNRN NR +EACGSG DLASF QRKSSIRR+  QGETSS NGRSSYGFWSE+NE
Subjt:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE

Query:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF
        E  SMDLGTSCSSQSEDSEETSVAY G DYCESPFRFVLQRSPS+GCRTPDF SPA SPCRRNKED  I   ESL KFQV EDEEDKEQCSPVS+LD PF
Subjt:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF

Query:  DDSYDERHDDRVRD---RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEV
        DDSYDE H +R RD     E+YD+ECSYA VQRTKQQLLNKLRRFERLADLDPIELEKIMV+E+  E +Y+YF NEECEYY   VQW NENDIEWFVKEV
Subjt:  DDSYDERHDDRVRD---RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEV

Query:  ASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEE
        AS+ + CKS++FLPQD+RKLV DLIAEEEAD+ + NTREEVI+RVC RLELWKEVEFNTIDMMVEEDL+KEV EWK+NQEQRGEAA DLELAIFSLLVEE
Subjt:  ASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEE

Query:  LAVELA
        LAVELA
Subjt:  LAVELA

A0A5A7SKT4 Histone-lysine N-methyltransferase SETD1B-like3.9e-22682.41Show/hide
Query:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV
        MMAQKHLH+LLE+DQEPFHLN+YIAEKRVNLKRVSPK+ LQV KRKPIST SIF GNFCRNACFTSF PSPD RKSPLFEF SPARN    SPNAIFLH+
Subjt:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV

Query:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE
        PARTA LLLEAALKIHKQKSS K KK+QIKNQG ARFGSVLKRLTLRNRN NR +EACGSG DLASF QRKSSIRR+  QGETSS NGRSSYGFWSE+NE
Subjt:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE

Query:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF
        E  SMDLGTSCSSQSEDSEETSVAY G DYCESPFRFVLQRSPS+GCRTPDF SPA SPCRRNKED  I   ESL KFQV EDEEDKEQCSPVS+LD PF
Subjt:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF

Query:  DDSYDERHDDRVRD---RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEV
        DDSYDE H +R RD     E+YD+ECSYA VQRTKQQLLNKLRRFERLADLDPIELEKIMV+E+  E +Y+YF NEECEYY   VQW NENDIEWFVKEV
Subjt:  DDSYDERHDDRVRD---RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEV

Query:  ASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEE
        AS+ + CKS++FLPQD+RKLV DLIAEEEAD+ + NTREEVI+RVC RLELWKEVEFNTIDMMVEEDL+KEV EWK+NQEQRGEAA DLELAIFSLLVEE
Subjt:  ASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEE

Query:  LAVELA
        LAVELA
Subjt:  LAVELA

A0A6J1CPH7 uncharacterized protein LOC1110135403.6e-28099.6Show/hide
Query:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARNSPNAIFLHVPART
        MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARNSPNAIFLHVPART
Subjt:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARNSPNAIFLHVPART

Query:  AALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNEEERS
        AALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNEEERS
Subjt:  AALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNEEERS

Query:  MDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPFDDSY
        MDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTP FQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPFDDSY
Subjt:  MDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPFDDSY

Query:  DERHDDRVRDRVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEVASDTSSC
        DERHDDRVRDRVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFS+EECEYYKSPVQWHNENDIEWFVKEVASDTSSC
Subjt:  DERHDDRVRDRVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEVASDTSSC

Query:  KSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEELAVELAP
        KSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEELAVELAP
Subjt:  KSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEELAVELAP

A0A6J1ECT2 uncharacterized protein LOC111433152 isoform X16.7e-21878.04Show/hide
Query:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV
        MM  KHLHQLLEEDQEPFHLN+YIAEKRVNLKRVS K+DLQV KRKPIST SIF GNFC+NACFTSFQPSPD RKSPLF+F SPAR+    SPNAIFLH+
Subjt:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARN----SPNAIFLHV

Query:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE
        PARTAALLLEAALKIHKQKSS K KKTQIKNQG ARFGSVLKRLTLRNRN NR++  CG G +LASFGQRKSS+RR + QGETSS+NGRSSYGFWSE+NE
Subjt:  PARTAALLLEAALKIHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNE

Query:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF
        E RSMDLGTSCSSQSEDSEETSVAY G DYCESPFRFVLQRSPS+GCRTPDF SPA SPC R KED+ ++  ESLKK Q  +DEEDKEQCSPVS+LD PF
Subjt:  EERSMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPF

Query:  DDSYDERHDDRVRD-------RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWF
        D SYDE H DR RD         EDY LECSYA VQRTKQQLLNKLRRFE+LADLDPIELEK+M++E+  E D+DYF+NEECEYY    Q +NEN+IE F
Subjt:  DDSYDERHDDRVRD-------RVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWF

Query:  VKEVASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSL
        VKEVA   + CKS+ FLP+DMRKLV DL++EEEAD+ N  TRE+VIQRVCKRLE+WKEV+FNTIDMMVEEDL+KEVDEWKKNQ QRGE A DLE+AIFSL
Subjt:  VKEVASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSL

Query:  LVEELAVELA
        LVEELAVEL+
Subjt:  LVEELAVELA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G36420.1 unknown protein1.5e-6036.71Show/hide
Query:  QKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNF-CRNACFTSFQPSPDLRKSPLFEFHSPARNS--PNAIFLHVPART
        +KHLH+ LE+DQEPFHLN YI     NL+     SD++V KRK  +  +   G F C N+CF +   SPD RKSPLFE  SP +       +FL +PART
Subjt:  QKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNF-CRNACFTSFQPSPDLRKSPLFEFHSPARNS--PNAIFLHVPART

Query:  AALLLEAALKIHKQKS-SPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNEEER
        AA+LL+AA +I KQ+S   K  K + +  G   FGSVLK LT R     R   A G+   L    +  SS RR                         ER
Subjt:  AALLLEAALKIHKQKS-SPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNEEER

Query:  SMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSP-SYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVE----DEEDKEQCSPVSILDT
         +++   C                  +CESPF FVLQ +P S G +TP F S A SP RR+ ED+  D  ESL+K +  E    +EEDKEQCSPVS+LD 
Subjt:  SMDLGTSCSSQSEDSEETSVAYLGGDYCESPFRFVLQRSP-SYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVE----DEEDKEQCSPVSILDT

Query:  PFDDSYDERHDDRVRDRVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEVA
          ++  DE H     D     +L CS+  VQR K++LL KLRRFE+LA LDP+ELE  M +E+  E + +Y  +EE       ++ ++ ++    V E  
Subjt:  PFDDSYDERHDDRVRDRVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEVA

Query:  SDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEEL
        +  S C                  AE+E  ++N   +++       R+ L  E +   +D +V +DL++E  EW ++  +  EA  DLE +IF +L++E 
Subjt:  SDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEEL

Query:  AVEL
        + EL
Subjt:  AVEL

AT5G03670.1 unknown protein6.2e-7540.97Show/hide
Query:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARNSPNAIFLHVPART
        M +Q+HL  LLEEDQEPF L SYI+++R  +   +  + LQV KR+PIS  +     FCRNACF S + SPD +KSPLFE  SP R S NAIF+++PART
Subjt:  MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARNSPNAIFLHVPART

Query:  AALLLEAALKIHKQKSSPKIKKTQIKNQGLA--RFGSVLKRLTLRNRNTNRQSEACG--SGGDLASFGQRKSSIRRKLT-------------------QG
        A++LLEAA++I KQ S  ++ KT+ +N G A   FGSVLK+LT R +      +  G  S   +    + +S + RK+                      
Subjt:  AALLLEAALKIHKQKSSPKIKKTQIKNQGLA--RFGSVLKRLTLRNRNTNRQSEACG--SGGDLASFGQRKSSIRRKLT-------------------QG

Query:  ETSSYNGRSSYGFWSES-NEEERSMDL----GTSCSSQSEDSEETSVAYLGGD------YCESPFRFVLQRSPSY-GCRTPDFQSPAISPCRRNKE-DKT
        ET      SS G WSES    ERS D+      S SS+S  S+E ++   G D      +CESPF FVLQ  PS  G RTP+F SPA SP     E +K 
Subjt:  ETSSYNGRSSYGFWSES-NEEERSMDL----GTSCSSQSEDSEETSVAYLGGD------YCESPFRFVLQRSPSY-GCRTPDFQSPAISPCRRNKE-DKT

Query:  IDGGESLKKFQVVEDEEDKEQCSPVSILDTPFDDSYDERHDDRVRDRVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYD
            E LKK ++ E+EE+KEQ SPVS+LD PF D  ++ H       ++D ++  S+ +VQ+ K  LL KL RFE+LA LDP+ELEK M D++  E +  
Subjt:  IDGGESLKKFQVVEDEEDKEQCSPVSILDTPFDDSYDERHDDRVRDRVEDYDLECSYAAVQRTKQQLLNKLRRFERLADLDPIELEKIMVDEQQYERDYD

Query:  YFSNEECEYYKSPVQWHNENDIEWFVKEVASDTSSCKSQRFLPQDMRKLVIDLIAEE-EADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKK
            EE E  KS   +H E   +  +K    +         +P+ +  L+ DL AEE  +D         V +RVC+RL  W++VE NTIDMMVE D + 
Subjt:  YFSNEECEYYKSPVQWHNENDIEWFVKEVASDTSSCKSQRFLPQDMRKLVIDLIAEE-EADQRNRNTREEVIQRVCKRLELWKEVEFNTIDMMVEEDLKK

Query:  E-VDEWK-KNQEQRGEAAIDLELAIFSLLVEELAVEL
        E +  W+ KN     E  +D+E  IF  LVEEL+ ++
Subjt:  E-VDEWK-KNQEQRGEAAIDLELAIFSLLVEELAVEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGGCTCAAAAGCACTTGCACCAGCTGCTTGAAGAGGATCAAGAACCCTTTCATTTGAACAGCTACATTGCGGAGAAACGTGTTAATCTCAAAAGGGTTTCTCCTAA
ATCCGATTTGCAAGTCCACAAACGAAAACCCATCTCCACAACTTCAATTTTCCAGGGAAATTTCTGCAGGAATGCTTGTTTTACGTCCTTCCAGCCCTCGCCGGACCTTA
GGAAATCGCCGCTCTTTGAGTTTCATTCCCCGGCTAGAAACAGCCCCAATGCTATTTTCCTCCATGTCCCGGCCAGGACTGCCGCTCTGCTTCTTGAAGCCGCTCTCAAG
ATTCATAAACAGAAATCGTCTCCCAAAATTAAAAAGACCCAGATTAAGAATCAAGGGCTTGCGCGGTTTGGGTCGGTTCTAAAGAGATTAACTCTTCGAAATCGAAACAC
CAACCGTCAATCTGAAGCTTGCGGTAGTGGAGGGGATTTGGCGTCGTTTGGGCAAAGAAAAAGCTCCATTCGAAGGAAATTAACGCAGGGTGAGACCAGCTCCTACAATG
GAAGGTCTAGCTATGGCTTCTGGTCGGAGAGCAACGAAGAAGAAAGATCAATGGATTTGGGGACTTCGTGCAGTAGCCAATCTGAGGATTCAGAGGAGACTTCTGTTGCT
TATTTGGGGGGAGATTACTGCGAAAGCCCTTTTCGATTTGTTCTCCAGCGAAGCCCGTCCTACGGTTGTCGGACGCCGGATTTCCAGTCGCCGGCGATCTCTCCCTGTCG
CCGTAACAAAGAGGACAAAACGATTGACGGTGGAGAAAGCTTGAAGAAATTTCAGGTGGTAGAAGATGAAGAAGATAAGGAGCAATGTAGTCCTGTGTCTATATTGGACA
CTCCTTTTGATGACAGTTACGATGAACGGCATGACGACCGGGTGAGGGACAGGGTCGAAGATTACGATTTGGAATGCAGCTATGCAGCTGTCCAAAGAACAAAGCAGCAA
CTATTAAACAAGCTTCGCAGATTCGAGCGACTCGCAGACTTGGATCCAATTGAACTCGAGAAAATAATGGTAGACGAACAACAATACGAGAGAGATTACGACTACTTTAG
TAATGAAGAATGTGAATATTACAAGTCACCAGTTCAGTGGCATAATGAAAATGACATCGAATGGTTTGTGAAAGAGGTTGCGAGCGATACAAGCTCTTGCAAATCCCAAC
GATTCCTCCCTCAAGACATGAGGAAACTCGTCATAGATCTCATTGCAGAAGAAGAGGCAGATCAAAGAAATCGCAACACGAGAGAGGAGGTGATACAAAGGGTTTGCAAG
AGGTTGGAGCTGTGGAAAGAGGTGGAATTCAACACCATAGACATGATGGTGGAAGAAGATTTGAAGAAGGAAGTTGATGAGTGGAAGAAAAACCAGGAGCAGAGAGGAGA
GGCAGCCATTGATTTGGAGCTTGCAATCTTCAGCCTGCTGGTGGAGGAATTGGCAGTGGAACTTGCTCCT
mRNA sequenceShow/hide mRNA sequence
ATGATGGCTCAAAAGCACTTGCACCAGCTGCTTGAAGAGGATCAAGAACCCTTTCATTTGAACAGCTACATTGCGGAGAAACGTGTTAATCTCAAAAGGGTTTCTCCTAA
ATCCGATTTGCAAGTCCACAAACGAAAACCCATCTCCACAACTTCAATTTTCCAGGGAAATTTCTGCAGGAATGCTTGTTTTACGTCCTTCCAGCCCTCGCCGGACCTTA
GGAAATCGCCGCTCTTTGAGTTTCATTCCCCGGCTAGAAACAGCCCCAATGCTATTTTCCTCCATGTCCCGGCCAGGACTGCCGCTCTGCTTCTTGAAGCCGCTCTCAAG
ATTCATAAACAGAAATCGTCTCCCAAAATTAAAAAGACCCAGATTAAGAATCAAGGGCTTGCGCGGTTTGGGTCGGTTCTAAAGAGATTAACTCTTCGAAATCGAAACAC
CAACCGTCAATCTGAAGCTTGCGGTAGTGGAGGGGATTTGGCGTCGTTTGGGCAAAGAAAAAGCTCCATTCGAAGGAAATTAACGCAGGGTGAGACCAGCTCCTACAATG
GAAGGTCTAGCTATGGCTTCTGGTCGGAGAGCAACGAAGAAGAAAGATCAATGGATTTGGGGACTTCGTGCAGTAGCCAATCTGAGGATTCAGAGGAGACTTCTGTTGCT
TATTTGGGGGGAGATTACTGCGAAAGCCCTTTTCGATTTGTTCTCCAGCGAAGCCCGTCCTACGGTTGTCGGACGCCGGATTTCCAGTCGCCGGCGATCTCTCCCTGTCG
CCGTAACAAAGAGGACAAAACGATTGACGGTGGAGAAAGCTTGAAGAAATTTCAGGTGGTAGAAGATGAAGAAGATAAGGAGCAATGTAGTCCTGTGTCTATATTGGACA
CTCCTTTTGATGACAGTTACGATGAACGGCATGACGACCGGGTGAGGGACAGGGTCGAAGATTACGATTTGGAATGCAGCTATGCAGCTGTCCAAAGAACAAAGCAGCAA
CTATTAAACAAGCTTCGCAGATTCGAGCGACTCGCAGACTTGGATCCAATTGAACTCGAGAAAATAATGGTAGACGAACAACAATACGAGAGAGATTACGACTACTTTAG
TAATGAAGAATGTGAATATTACAAGTCACCAGTTCAGTGGCATAATGAAAATGACATCGAATGGTTTGTGAAAGAGGTTGCGAGCGATACAAGCTCTTGCAAATCCCAAC
GATTCCTCCCTCAAGACATGAGGAAACTCGTCATAGATCTCATTGCAGAAGAAGAGGCAGATCAAAGAAATCGCAACACGAGAGAGGAGGTGATACAAAGGGTTTGCAAG
AGGTTGGAGCTGTGGAAAGAGGTGGAATTCAACACCATAGACATGATGGTGGAAGAAGATTTGAAGAAGGAAGTTGATGAGTGGAAGAAAAACCAGGAGCAGAGAGGAGA
GGCAGCCATTGATTTGGAGCTTGCAATCTTCAGCCTGCTGGTGGAGGAATTGGCAGTGGAACTTGCTCCT
Protein sequenceShow/hide protein sequence
MMAQKHLHQLLEEDQEPFHLNSYIAEKRVNLKRVSPKSDLQVHKRKPISTTSIFQGNFCRNACFTSFQPSPDLRKSPLFEFHSPARNSPNAIFLHVPARTAALLLEAALK
IHKQKSSPKIKKTQIKNQGLARFGSVLKRLTLRNRNTNRQSEACGSGGDLASFGQRKSSIRRKLTQGETSSYNGRSSYGFWSESNEEERSMDLGTSCSSQSEDSEETSVA
YLGGDYCESPFRFVLQRSPSYGCRTPDFQSPAISPCRRNKEDKTIDGGESLKKFQVVEDEEDKEQCSPVSILDTPFDDSYDERHDDRVRDRVEDYDLECSYAAVQRTKQQ
LLNKLRRFERLADLDPIELEKIMVDEQQYERDYDYFSNEECEYYKSPVQWHNENDIEWFVKEVASDTSSCKSQRFLPQDMRKLVIDLIAEEEADQRNRNTREEVIQRVCK
RLELWKEVEFNTIDMMVEEDLKKEVDEWKKNQEQRGEAAIDLELAIFSLLVEELAVELAP