; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS004233 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS004233
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionWPP domain-associated protein
Genome locationscaffold92:1012963..1014666
RNA-Seq ExpressionMS004233
SyntenyMS004233
Gene Ontology termsNA
InterPro domainsIPR037490 - WPP domain-associated protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6601161.1 hypothetical protein SDJN03_06394, partial [Cucurbita argyrosperma subsp. sororia]1.7e-20371.43Show/hide
Query:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
        M+ IFGVID  F+VSIVDSTMM IVHRAMDKAH RVKS EGV+ERLHEISKFYELSVMQLDGCIKFV+EETDSHNPES HEEVLAGLAEIRNRLQRRLYE
Subjt:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPE-ENKRNH
        SELAILQKDREL DRF SESKLRQALE TE+ELVSSQEDLE  R+RSAGSSNLS   GEDDNRDGEFCELKDSVDRQVWKIREKLE DDY P+ +N+RNH
Subjt:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPE-ENKRNH

Query:  CMNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCE
        C+ND+KVEEMGSDID+LKETLD+AFGKMQSAIF S+MGPIEQQ+KSSIENDIIS+ L GFVRD QEDLEAE RRKE  Q+SVS NEHW+ LMNE  GLCE
Subjt:  CMNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCE

Query:  DLKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSLE
        +LKPL I QNE QPQ  E+ D                +EYGIN +E ELE+EG HDVAKM++NH          AEE + L+ E+L     R   P SL+
Subjt:  DLKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSLE

Query:  SRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEEI
        SR + VLEK ENL ILNA++NKI GQ+   +EEDIP E  EQ+FTE  RQKSDV TL D+WGKMH+L++EEN G I+NQI ML  +RE+ +FQN+MMEEI
Subjt:  SRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEEI

Query:  YITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYY
        + T+F+G+ E+F NDLS WELEI ISDGICR FIR+MFNQ +ETMESYK E  IKDD+Y+
Subjt:  YITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYY

KAG7031963.1 WPP domain-associated protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-20271.25Show/hide
Query:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
        M+ IFGVID  F+VSIVDSTMM IVHRAMDKAH RVKS EGV+ERLHEISKFYELSVMQLDGCIKFV+EETDSHNPES HEEVLAGLAEIRNRLQRRLYE
Subjt:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPE-ENKRNH
        SELAILQKDREL DRF SESKLRQALE TE+ELVSSQEDLE  R+RSAGSSNLS   GEDDNRDGEFCELKDSVDRQVWKIREKLE DDY P+ +N+RNH
Subjt:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPE-ENKRNH

Query:  CMNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCE
        C+ND+KVEEMGSDID+LKETLD+AFGKMQSAIF S+MGPIEQQ+KSSIENDIIS+ L GFVRD QEDLEAE RRKE  Q+SVS NEHW+ LMNE  GLCE
Subjt:  CMNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCE

Query:  DLKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSLE
        +LKPL I QNE QPQ  E+ D                +EYGIN +E ELE+EG HDVAKM++N          +AEE + L+ E+L     R  +P SL+
Subjt:  DLKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSLE

Query:  SRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEEI
        SR + VLEK ENL ILNA++NKI GQ+   +EEDIP E  EQ+FTE  RQKSDV TL D+WGKMH+L++EEN G I+NQI ML  +RE+ +FQN+MMEEI
Subjt:  SRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEEI

Query:  YITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYY
        + T+F+G+ E+F NDLS WELEI ISDGICR FIR+MFNQ +ETMESYK E  IKDD+Y+
Subjt:  YITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYY

XP_022139213.1 uncharacterized protein LOC111010182 [Momordica charantia]4.9e-30496.83Show/hide
Query:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
        MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCI FVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
Subjt:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPEENKRNHC
        SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPEENKRNHC
Subjt:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPEENKRNHC

Query:  MNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCED
        MNDVKVEE+GSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCED
Subjt:  MNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCED

Query:  LKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSLES
        LKPLIIRQNETQPQDGEECDISDFGSRSPKR EKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLR+EILGLSSRRGGNPVSLES
Subjt:  LKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSLES

Query:  RIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEEIY
        RIQRVLEKQEN+IILNAKVNKIFGQHG VNEEDIPLERKEQ+FTETDRQKSDVDTLTDVWGKMHKLQDEE TGQIRNQISMLMQEREEKEFQN+MMEEIY
Subjt:  RIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEEIY

Query:  ITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD
        ITIFKGLIERF N+L SWELEIQISDGICRDFIRNMFNQQNE MESYK E HIKDD+YYGICRD I+D
Subjt:  ITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD

XP_022985013.1 uncharacterized protein LOC111483104 [Cucurbita maxima]3.2e-20271.48Show/hide
Query:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
        M+ IFGVID  F+VSIVDSTMM IVHRAMDKAH RVKS EGV+ERLHEISKFYELSVMQLDGCIKFVQEETDSHNPES HEEVLAGLAEIRNRLQRRLYE
Subjt:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPE-ENKRNH
        SELAILQKDREL DRF SESKLRQALE TE+ELVSSQEDLE  R+RSAGSSNLS   GEDDNRDGEFCELKDSVDRQVWKIREKLE DDYEP+ +N+RNH
Subjt:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPE-ENKRNH

Query:  CMNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCE
        C+NDVKVEEMGSDID+LKETLD+AFGKMQSAIF S+MGPIEQQ+KSSIENDIIS+ L GFVRD QEDLEAE R+KE  Q+SVS NEHW+ LMNE  GLCE
Subjt:  CMNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCE

Query:  DLKPLIIRQNETQPQDGEECDIS-DFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSL
        +LKPL I QNE QPQ  EE     D   R         +EYGIN +E ELE++G HDVAKM++N          +AEE   LR+E+L   SR      SL
Subjt:  DLKPLIIRQNETQPQDGEECDIS-DFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSL

Query:  ESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEE
        +SR Q VLEK ENL ILNA++NKI GQ+   +EEDIP E  +Q+FTE  RQKSDV TL D+WGKMH+L++EEN G I+NQI M   +RE+ +FQN+M EE
Subjt:  ESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEE

Query:  IYITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYY
        IY T+F+GL E+F NDLS WELEI ISDGICR FIR+MF+Q +ETMESY  E  IKDD+Y+
Subjt:  IYITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYY

XP_038891653.1 uncharacterized protein LOC120081046 [Benincasa hispida]7.6e-21272.51Show/hide
Query:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
        M+ IFGVID RF+VSIVDSTMM IVHRAMDKAH RVKSREGV+ERLHEISKFYELSVMQLDGCIKFVQEETD+ NPES HEEVLAGLAEIRNRLQRRLYE
Subjt:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDD-NRDGEFCELKDSVDRQVWKIREKLEVDDYEPE-ENKRN
        SELAILQKDREL DRFESE KLRQALE TERELVSSQEDLE+ER+RSAGSSNLS   GEDD +RDGEF ELKDSVDRQVWKI+EKLE DD EP+ + +RN
Subjt:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDD-NRDGEFCELKDSVDRQVWKIREKLEVDDYEPE-ENKRN

Query:  HCMNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLC
        HC+NDV+VEEMGSDID+LKETLD+AFGKMQSAIF SEMGPIEQQ+KSSIENDIISI L+GF RD QEDLEAE  RKEK ++SV+LN HW+DLMNEVTGLC
Subjt:  HCMNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLC

Query:  EDLKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNS--------AEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREIL----G
        EDLKPL I QNE QPQ GE C+I DFGSRSPKREEK+S        +EYGIN N  ELEDE           HES+I ++S EA + ++L+ E+L     
Subjt:  EDLKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNS--------AEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREIL----G

Query:  LSSRRGGNPVSLESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQER
        LSSRR  +   L+SR Q VL   ENL+I  AKVNKI GQ+G  NEEDIPLE+KEQ+FTE  RQKSDVD+L DVWGKMH+LQDEEN G I+NQI +L QER
Subjt:  LSSRRGGNPVSLESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQER

Query:  EEKEFQNVMMEEIYITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD
        E+ EFQN+MMEEIYIT+F+GL E+F NDL+  E EI I+DGICRD IRN FNQ ++TMES+K E  IKDD+Y+ + ++ +KD
Subjt:  EEKEFQNVMMEEIYITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD

TrEMBL top hitse value%identityAlignment
A0A1S3BGE6 uncharacterized protein LOC1034895676.5e-17764.76Show/hide
Query:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
        M+ IFG+IDG+F++SIVDSTMM IVHRAMDKAH RVKSREGV+ERLHEISKFYELSVMQLDGCIKFVQEETD+HNPE+ HEEVLAGLAEIRNRLQRRLYE
Subjt:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDD-NRDGEFCELKDSVDRQVWKIREKLEV-DDYEPE-ENKR
        SELAILQKDREL DR ESE KLRQALEITERELVSSQEDLE+ER+RSAGSSNLS   GEDD NRDGEF E+K           EK E  DDYEP+ + KR
Subjt:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDD-NRDGEFCELKDSVDRQVWKIREKLEV-DDYEPE-ENKR

Query:  NHCMNDV-KVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTG
        N C+NDV +VEEMGSDID+LKETLD+AFGKM SAI  SEMG IEQQ+KSSIENDIISI L+GFV+D QEDLEAEV RKEKQ   VS N+ W+DLMNEV G
Subjt:  NHCMNDV-KVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTG

Query:  LCEDLKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREIL----GLSSRRG
        L EDLKP +I QNE Q     EC+I DF                                       ES+I +KS EA EQ +L  E+L     LS RR 
Subjt:  LCEDLKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREIL----GLSSRRG

Query:  GNPVSLESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQ
         +P SL+ R Q +LE+ EN +ILNA VNK   Q+   +EEDIPLE+ EQ+F E  +QKSDVDTL DVWGKMH+LQDEEN+G I+NQI  L QERE++EFQ
Subjt:  GNPVSLESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQ

Query:  NVMMEEIYITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD
        N+M EE YIT+ +GL E+F +DLSSWELEI ISDGI RD IR+MFNQ +ETM+S   E  IKDD+Y+ + ++ ++D
Subjt:  NVMMEEIYITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD

A0A5D3CG51 WPP domain-associated protein isoform X26.5e-17764.76Show/hide
Query:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
        M+ IFG+IDG+F++SIVDSTMM IVHRAMDKAH RVKSREGV+ERLHEISKFYELSVMQLDGCIKFVQEETD+HNPE+ HEEVLAGLAEIRNRLQRRLYE
Subjt:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDD-NRDGEFCELKDSVDRQVWKIREKLEV-DDYEPE-ENKR
        SELAILQKDREL DR ESE KLRQALEITERELVSSQEDLE+ER+RSAGSSNLS   GEDD NRDGEF E+K           EK E  DDYEP+ + KR
Subjt:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDD-NRDGEFCELKDSVDRQVWKIREKLEV-DDYEPE-ENKR

Query:  NHCMNDV-KVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTG
        N C+NDV +VEEMGSDID+LKETLD+AFGKM SAI  SEMG IEQQ+KSSIENDIISI L+GFV+D QEDLEAEV RKEKQ   VS N+ W+DLMNEV G
Subjt:  NHCMNDV-KVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTG

Query:  LCEDLKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREIL----GLSSRRG
        L EDLKP +I QNE Q     EC+I DF                                       ES+I +KS EA EQ +L  E+L     LS RR 
Subjt:  LCEDLKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREIL----GLSSRRG

Query:  GNPVSLESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQ
         +P SL+ R Q +LE+ EN +ILNA VNK   Q+   +EEDIPLE+ EQ+F E  +QKSDVDTL DVWGKMH+LQDEEN+G I+NQI  L QERE++EFQ
Subjt:  GNPVSLESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQ

Query:  NVMMEEIYITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD
        N+M EE YIT+ +GL E+F +DLSSWELEI ISDGI RD IR+MFNQ +ETM+S   E  IKDD+Y+ + ++ ++D
Subjt:  NVMMEEIYITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD

A0A6J1CF63 uncharacterized protein LOC1110101822.4e-30496.83Show/hide
Query:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
        MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCI FVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
Subjt:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPEENKRNHC
        SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPEENKRNHC
Subjt:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPEENKRNHC

Query:  MNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCED
        MNDVKVEE+GSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCED
Subjt:  MNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCED

Query:  LKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSLES
        LKPLIIRQNETQPQDGEECDISDFGSRSPKR EKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLR+EILGLSSRRGGNPVSLES
Subjt:  LKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSLES

Query:  RIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEEIY
        RIQRVLEKQEN+IILNAKVNKIFGQHG VNEEDIPLERKEQ+FTETDRQKSDVDTLTDVWGKMHKLQDEE TGQIRNQISMLMQEREEKEFQN+MMEEIY
Subjt:  RIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEEIY

Query:  ITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD
        ITIFKGLIERF N+L SWELEIQISDGICRDFIRNMFNQQNE MESYK E HIKDD+YYGICRD I+D
Subjt:  ITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD

A0A6J1GZ55 uncharacterized protein LOC1114584751.9e-20070.89Show/hide
Query:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
        M+ IFGVID  F+VSIVDSTMM IVHRAMDKAH RVKS EGV+ERLHEISKFYELSVMQLDGCI FVQEETDSHNPES HEEVLAGLAEIRNRLQRRLYE
Subjt:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPE-ENKRNH
        SELAILQKDREL DRF SESKLRQALE TE+ELVSSQEDLE  R+RSAGSSNLS   GEDDNRDGEFCELKDSVDRQVWKIREKLE DDY P+ +N+RNH
Subjt:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPE-ENKRNH

Query:  CMNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCE
        C+ND+KVEEMGSDID+LKETLD+AFGKMQSAIF S+MGPIEQQ+KSSIENDIIS+ L GFVRD QEDLEAE RRKE  Q+SVS NEHW+ LMNE  GLCE
Subjt:  CMNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCE

Query:  DLKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSLE
         LKPL I QNE QPQ  E+ D                +EYGIN +E ELE+EG HDVAKM++N          +AEE + L+ E+L     R  +P SL+
Subjt:  DLKPLIIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSLE

Query:  SRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEEI
        SR + VLEK ENL ILNA++NKI GQ+   +EEDIP E  EQ+  E  RQKSDV TL D+WGKMH+L++EEN G I+NQI ML  +RE+ +FQN++MEEI
Subjt:  SRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEEI

Query:  YITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYY
        Y T+F+GL E+F NDLS WELE  ISDGICR FIR+MFNQ +ETMESYK E  IKDD+Y+
Subjt:  YITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYY

A0A6J1JCB6 uncharacterized protein LOC1114831041.5e-20271.48Show/hide
Query:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
        M+ IFGVID  F+VSIVDSTMM IVHRAMDKAH RVKS EGV+ERLHEISKFYELSVMQLDGCIKFVQEETDSHNPES HEEVLAGLAEIRNRLQRRLYE
Subjt:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPE-ENKRNH
        SELAILQKDREL DRF SESKLRQALE TE+ELVSSQEDLE  R+RSAGSSNLS   GEDDNRDGEFCELKDSVDRQVWKIREKLE DDYEP+ +N+RNH
Subjt:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPE-ENKRNH

Query:  CMNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCE
        C+NDVKVEEMGSDID+LKETLD+AFGKMQSAIF S+MGPIEQQ+KSSIENDIIS+ L GFVRD QEDLEAE R+KE  Q+SVS NEHW+ LMNE  GLCE
Subjt:  CMNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCE

Query:  DLKPLIIRQNETQPQDGEECDIS-DFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSL
        +LKPL I QNE QPQ  EE     D   R         +EYGIN +E ELE++G HDVAKM++N          +AEE   LR+E+L   SR      SL
Subjt:  DLKPLIIRQNETQPQDGEECDIS-DFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSL

Query:  ESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEE
        +SR Q VLEK ENL ILNA++NKI GQ+   +EEDIP E  +Q+FTE  RQKSDV TL D+WGKMH+L++EEN G I+NQI M   +RE+ +FQN+M EE
Subjt:  ESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEE

Query:  IYITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYY
        IY T+F+GL E+F NDLS WELEI ISDGICR FIR+MF+Q +ETMESY  E  IKDD+Y+
Subjt:  IYITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G14990.1 BEST Arabidopsis thaliana protein match is: myosin heavy chain-related (TAIR:AT2G34730.1)5.6e-5631.42Show/hide
Query:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
        M++I   ++G+ + S+ DSTMM +V +AMDKAH ++K++ G+L RL+ IS FYEL+V+QL+ C+ FV +ETD    ES HEEV+  L EI++RL  RL E
Subjt:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPEENKRNHC
        +E+AIL+KDR+L +  E++  LR  LE  E ELV  Q   ++ER R        H    D  ++ EF ELK SVD+QV  +R+KLE +  E         
Subjt:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPEENKRNHC

Query:  MNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCED
        +     +    DID+LK T+D+AF KM  AIF SE+GPIEQ  + SIE D +++ ++GF+   +E +E         ++ + + ++ +   + V  +  +
Subjt:  MNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCED

Query:  LKPL------IIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAE----EQIRLRREILGLSSR
        L+ L      II    + P+       +   S S   E  +  E     + +E +D  +  V+K+I++HES+I  KS E      E I+ ++   G SS+
Subjt:  LKPL------IIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAE----EQIRLRREILGLSSR

Query:  RGGNPVSLESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSD-------VDTLTDVWGKMHKLQDEENTGQIRNQISMLM
        R          I  ++   ++L+ LN K+                    E LF + D  + +        D L DVW KM K     N+    N I    
Subjt:  RGGNPVSLESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSD-------VDTLTDVWGKMHKLQDEENTGQIRNQISMLM

Query:  QEREEKEFQNVMMEEIYITIFKGL-IERFRNDLSSWELEIQI------SDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD
        +E+E+ E + +++E+ Y+T+ KGL  +   N+  + E E +I      S+  C D + N+  +++  +     EF  + ++ + I  +L+++
Subjt:  QEREEKEFQNVMMEEIYITIFKGL-IERFRNDLSSWELEIQI------SDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD

AT5G14990.2 unknown protein5.6e-5631.42Show/hide
Query:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE
        M++I   ++G+ + S+ DSTMM +V +AMDKAH ++K++ G+L RL+ IS FYEL+V+QL+ C+ FV +ETD    ES HEEV+  L EI++RL  RL E
Subjt:  MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYE

Query:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPEENKRNHC
        +E+AIL+KDR+L +  E++  LR  LE  E ELV  Q   ++ER R        H    D  ++ EF ELK SVD+QV  +R+KLE +  E         
Subjt:  SELAILQKDRELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPEENKRNHC

Query:  MNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCED
        +     +    DID+LK T+D+AF KM  AIF SE+GPIEQ  + SIE D +++ ++GF+   +E +E         ++ + + ++ +   + V  +  +
Subjt:  MNDVKVEEMGSDIDMLKETLDMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCED

Query:  LKPL------IIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAE----EQIRLRREILGLSSR
        L+ L      II    + P+       +   S S   E  +  E     + +E +D  +  V+K+I++HES+I  KS E      E I+ ++   G SS+
Subjt:  LKPL------IIRQNETQPQDGEECDISDFGSRSPKREEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAE----EQIRLRREILGLSSR

Query:  RGGNPVSLESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSD-------VDTLTDVWGKMHKLQDEENTGQIRNQISMLM
        R          I  ++   ++L+ LN K+                    E LF + D  + +        D L DVW KM K     N+    N I    
Subjt:  RGGNPVSLESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKEQLFTETDRQKSD-------VDTLTDVWGKMHKLQDEENTGQIRNQISMLM

Query:  QEREEKEFQNVMMEEIYITIFKGL-IERFRNDLSSWELEIQI------SDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD
        +E+E+ E + +++E+ Y+T+ KGL  +   N+  + E E +I      S+  C D + N+  +++  +     EF  + ++ + I  +L+++
Subjt:  QEREEKEFQNVMMEEIYITIFKGL-IERFRNDLSSWELEIQI------SDGICRDFIRNMFNQQNETMESYKNEFHIKDDMYYGICRDLIKD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGAAATTTTTGGGGTGATTGATGGCAGATTTAGAGTGTCGATAGTAGATTCAACCATGATGTCGATTGTGCATCGCGCAATGGATAAAGCCCACGGAAGAGTCAA
GTCCAGAGAAGGGGTACTAGAGAGACTACATGAGATATCAAAGTTTTACGAGTTGTCAGTAATGCAGTTGGATGGCTGTATCAAGTTCGTTCAAGAAGAAACTGACAGTC
ACAATCCAGAGAGCGGCCACGAAGAAGTGCTTGCTGGGCTGGCAGAAATCAGAAACCGCCTCCAACGCCGCCTCTATGAATCAGAGCTCGCCATCCTACAGAAGGATAGG
GAGTTGAGAGATCGATTTGAGAGCGAGTCGAAGTTAAGGCAGGCTTTGGAAATTACGGAGAGAGAATTGGTTTCTTCGCAAGAAGATCTTGAGATTGAAAGAACAAGGAG
TGCGGGGAGTTCCAATCTCAGCCATCAGTCAGGCGAAGATGACAATAGAGATGGAGAGTTCTGTGAGCTGAAAGATTCGGTGGATCGACAGGTTTGGAAGATCAGAGAGA
AACTCGAGGTTGATGATTATGAGCCTGAGGAGAACAAACGGAATCACTGTATGAACGATGTAAAAGTTGAAGAGATGGGATCTGACATTGATATGTTGAAGGAAACGCTG
GACATGGCTTTTGGAAAGATGCAGAGTGCCATTTTCTATTCGGAGATGGGACCAATAGAGCAGCAAATAAAATCTAGTATTGAGAATGACATAATATCTATAAATCTCAG
GGGATTTGTGAGAGATTCCCAAGAGGATTTAGAAGCAGAAGTGAGAAGGAAAGAGAAGCAGCAAATTTCCGTTTCGTTGAATGAACATTGGACAGATTTAATGAATGAAG
TTACAGGCTTGTGTGAGGATCTCAAGCCTCTCATCATTAGGCAAAATGAAACGCAGCCCCAAGATGGAGAAGAATGTGACATTTCGGATTTTGGGTCAAGATCACCAAAA
AGAGAGGAGAAGAATTCTGCAGAATATGGGATAAATATAAATGAAAAGGAGCTAGAAGATGAGGGAAGCCATGATGTTGCTAAGATGATAGAGAACCACGAGTCGGTAAT
TTCAGAAAAGAGTGCAGAAGCAGAAGAACAGATTCGATTGAGGCGAGAAATTTTAGGCCTGTCATCAAGGAGAGGGGGAAATCCTGTAAGCCTGGAAAGTAGGATACAAA
GAGTACTAGAAAAACAAGAGAATCTAATTATTTTGAATGCTAAGGTTAACAAAATTTTTGGCCAGCATGGAGTTGTTAATGAAGAAGACATTCCTCTAGAGAGAAAGGAG
CAATTATTTACAGAAACTGATAGGCAGAAATCAGATGTTGATACTTTGACAGATGTATGGGGCAAGATGCATAAACTGCAGGATGAAGAAAACACAGGACAAATACGAAA
CCAAATAAGCATGTTAATGCAAGAAAGAGAGGAGAAAGAATTTCAAAACGTAATGATGGAGGAGATTTATATCACTATATTCAAAGGTTTGATAGAAAGGTTTCGCAACG
ATTTGAGTAGTTGGGAATTGGAGATCCAGATTTCAGATGGTATATGCAGAGATTTCATTAGGAATATGTTCAATCAGCAGAATGAGACCATGGAAAGTTACAAGAATGAA
TTCCACATAAAAGATGACATGTATTATGGTATATGCAGAGATCTCATTAAGGAT
mRNA sequenceShow/hide mRNA sequence
ATGGAGGAAATTTTTGGGGTGATTGATGGCAGATTTAGAGTGTCGATAGTAGATTCAACCATGATGTCGATTGTGCATCGCGCAATGGATAAAGCCCACGGAAGAGTCAA
GTCCAGAGAAGGGGTACTAGAGAGACTACATGAGATATCAAAGTTTTACGAGTTGTCAGTAATGCAGTTGGATGGCTGTATCAAGTTCGTTCAAGAAGAAACTGACAGTC
ACAATCCAGAGAGCGGCCACGAAGAAGTGCTTGCTGGGCTGGCAGAAATCAGAAACCGCCTCCAACGCCGCCTCTATGAATCAGAGCTCGCCATCCTACAGAAGGATAGG
GAGTTGAGAGATCGATTTGAGAGCGAGTCGAAGTTAAGGCAGGCTTTGGAAATTACGGAGAGAGAATTGGTTTCTTCGCAAGAAGATCTTGAGATTGAAAGAACAAGGAG
TGCGGGGAGTTCCAATCTCAGCCATCAGTCAGGCGAAGATGACAATAGAGATGGAGAGTTCTGTGAGCTGAAAGATTCGGTGGATCGACAGGTTTGGAAGATCAGAGAGA
AACTCGAGGTTGATGATTATGAGCCTGAGGAGAACAAACGGAATCACTGTATGAACGATGTAAAAGTTGAAGAGATGGGATCTGACATTGATATGTTGAAGGAAACGCTG
GACATGGCTTTTGGAAAGATGCAGAGTGCCATTTTCTATTCGGAGATGGGACCAATAGAGCAGCAAATAAAATCTAGTATTGAGAATGACATAATATCTATAAATCTCAG
GGGATTTGTGAGAGATTCCCAAGAGGATTTAGAAGCAGAAGTGAGAAGGAAAGAGAAGCAGCAAATTTCCGTTTCGTTGAATGAACATTGGACAGATTTAATGAATGAAG
TTACAGGCTTGTGTGAGGATCTCAAGCCTCTCATCATTAGGCAAAATGAAACGCAGCCCCAAGATGGAGAAGAATGTGACATTTCGGATTTTGGGTCAAGATCACCAAAA
AGAGAGGAGAAGAATTCTGCAGAATATGGGATAAATATAAATGAAAAGGAGCTAGAAGATGAGGGAAGCCATGATGTTGCTAAGATGATAGAGAACCACGAGTCGGTAAT
TTCAGAAAAGAGTGCAGAAGCAGAAGAACAGATTCGATTGAGGCGAGAAATTTTAGGCCTGTCATCAAGGAGAGGGGGAAATCCTGTAAGCCTGGAAAGTAGGATACAAA
GAGTACTAGAAAAACAAGAGAATCTAATTATTTTGAATGCTAAGGTTAACAAAATTTTTGGCCAGCATGGAGTTGTTAATGAAGAAGACATTCCTCTAGAGAGAAAGGAG
CAATTATTTACAGAAACTGATAGGCAGAAATCAGATGTTGATACTTTGACAGATGTATGGGGCAAGATGCATAAACTGCAGGATGAAGAAAACACAGGACAAATACGAAA
CCAAATAAGCATGTTAATGCAAGAAAGAGAGGAGAAAGAATTTCAAAACGTAATGATGGAGGAGATTTATATCACTATATTCAAAGGTTTGATAGAAAGGTTTCGCAACG
ATTTGAGTAGTTGGGAATTGGAGATCCAGATTTCAGATGGTATATGCAGAGATTTCATTAGGAATATGTTCAATCAGCAGAATGAGACCATGGAAAGTTACAAGAATGAA
TTCCACATAAAAGATGACATGTATTATGGTATATGCAGAGATCTCATTAAGGAT
Protein sequenceShow/hide protein sequence
MEEIFGVIDGRFRVSIVDSTMMSIVHRAMDKAHGRVKSREGVLERLHEISKFYELSVMQLDGCIKFVQEETDSHNPESGHEEVLAGLAEIRNRLQRRLYESELAILQKDR
ELRDRFESESKLRQALEITERELVSSQEDLEIERTRSAGSSNLSHQSGEDDNRDGEFCELKDSVDRQVWKIREKLEVDDYEPEENKRNHCMNDVKVEEMGSDIDMLKETL
DMAFGKMQSAIFYSEMGPIEQQIKSSIENDIISINLRGFVRDSQEDLEAEVRRKEKQQISVSLNEHWTDLMNEVTGLCEDLKPLIIRQNETQPQDGEECDISDFGSRSPK
REEKNSAEYGININEKELEDEGSHDVAKMIENHESVISEKSAEAEEQIRLRREILGLSSRRGGNPVSLESRIQRVLEKQENLIILNAKVNKIFGQHGVVNEEDIPLERKE
QLFTETDRQKSDVDTLTDVWGKMHKLQDEENTGQIRNQISMLMQEREEKEFQNVMMEEIYITIFKGLIERFRNDLSSWELEIQISDGICRDFIRNMFNQQNETMESYKNE
FHIKDDMYYGICRDLIKD