; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G005140 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G005140
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCG_Chr05:4961124..4967487
RNA-Seq ExpressionClCG05G005140
SyntenyClCG05G005140
Gene Ontology termsGO:0016125 - sterol metabolic process (biological process)
GO:0019287 - isopentenyl diphosphate biosynthetic process, mevalonate pathway (biological process)
GO:0019288 - isopentenyl diphosphate biosynthetic process, methylerythritol 4-phosphate pathway (biological process)
GO:0048364 - root development (biological process)
GO:0050790 - regulation of catalytic activity (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0034046 - poly(G) binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR007011 - Late embryogenesis abundant protein, SMP subgroup domain
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6582300.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]3.2e-16688.52Show/hide
Query:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
        MP LSPNSLASLVE A+S+RSSLLGR AHAQILKTL+TP PAFLYNHLVNMYAKLD L+SAELIL+LAPCRSVVTWT+LIAGSVQNG FASALLHFSDML
Subjt:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML

Query:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL
        SDCVRPNDFTFPC FKA+TGLRMA+TG Q+H LAVKEGLINDVFVGCS FDMYSKL  L+DAYK+F EMPHRNLETWNAYISNSV HGRPEDSA AFIEL
Subjt:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL

Query:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
        LR GGKPDSITFCAF NACSDKLGLEPGCQLHGFIIRSG GQNVS+SNGLIDFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKA CLFLR
Subjt:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR

Query:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA
        ARKEDIKPTDFMVSSVLCA AGLSEIE G +
Subjt:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA

XP_008438671.1 PREDICTED: pentatricopeptide repeat-containing protein At4g14850 [Cucumis melo]1.1e-17191.24Show/hide
Query:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
        MP LS NSLAS+VELAVSVRSSLLGRAAHAQILKTL+TP PAFLYNHLVNMYAKLDHL+SA+LIL+LAPCRSVVTWTALIAGSVQNGCF SALLHFSDML
Subjt:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML

Query:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL
        SDCVRPNDFTFPC  KA+TGLRM +TG QLH LAVKEGLINDVFVGCSVFDMYSKL FLNDAYK+FDEMP RNLETWNAYI+NSV HGRPEDSA AFIEL
Subjt:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL

Query:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
        LRVG KPDSITFCAF NACSDKLGL PGCQLHGF+IRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Subjt:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR

Query:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA
        ARKEDI+PTDFMVSSVLCACAGLSEIEFG +
Subjt:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA

XP_022956070.1 pentatricopeptide repeat-containing protein At4g14850 [Cucurbita moschata]3.2e-16688.52Show/hide
Query:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
        MP LSPNSLASLVE A+S+RSSLLGR AHAQILKTL+TP PAFLYNHLVNMYAKLD L+SAELIL+LAPCRSVVTWT+LIAGSVQNG FASALLHFSDML
Subjt:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML

Query:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL
        SDCVRPNDFTFPC FKA+TGLRMA+TG Q+H LAVKEGLINDVFVGCS FDMYSKL  L+DAYK+F EMPHRNLETWNAYISNSV HGRPEDSA AFIEL
Subjt:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL

Query:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
        LR GGKPDSITFCAF NACSDKLGLEPGCQLHGFIIRSG GQNVS+SNGLIDFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKA CLFLR
Subjt:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR

Query:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA
        ARKEDIKPTDFMVSSVLCA AGLSEIE G +
Subjt:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA

XP_031738596.1 pentatricopeptide repeat-containing protein At4g14850 [Cucumis sativus]1.6e-17392.15Show/hide
Query:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
        MP LS NSLAS+VELAVSVRSSLLGRAAHAQILKTL+TP PAFLYNHLVNMYAKLDHL+SA+LIL+LAPCRSVVTWTALIAGSVQNGCF SALLHFSDML
Subjt:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML

Query:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL
        SDCVRPNDFTFPC  KA+TGLRM  TG QLH LAVKEGLINDVFVGCSVFDMYSKL FLNDAYK+FDEMPHRNLETWNAYISNSV HGRPEDS  AFIEL
Subjt:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL

Query:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
        LRVGGKPDSITFCAF NACSDKLGL PGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Subjt:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR

Query:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA
        ARKEDI+PTDFMVSSVLCACAGLSEIEFG +
Subjt:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA

XP_038881355.1 pentatricopeptide repeat-containing protein At4g14850 [Benincasa hispida]9.3e-17493.05Show/hide
Query:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
        MP LSPNSLASLVELAVSVRSSLLGRAAHAQILKTL+TPLPAFLYNHLVNMYAK DHL+SA+LIL+LAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
Subjt:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML

Query:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL
        SDCVRPNDFTFPC  KA+TGLRMA+TG QLH LAVKEGLINDVFVGCSVFDMYSKL  L+DAYK+FDEMPHRNLET NAYISNSV HGRPEDSA AFIEL
Subjt:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL

Query:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
        LRVG KPDSITFCAFFNACSDKLGL PGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Subjt:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR

Query:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA
        ARKEDIKPTDFMVSSVLCACAGLSEIEFG +
Subjt:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA

TrEMBL top hitse value%identityAlignment
A0A0A0L4T8 Uncharacterized protein7.7e-17492.15Show/hide
Query:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
        MP LS NSLAS+VELAVSVRSSLLGRAAHAQILKTL+TP PAFLYNHLVNMYAKLDHL+SA+LIL+LAPCRSVVTWTALIAGSVQNGCF SALLHFSDML
Subjt:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML

Query:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL
        SDCVRPNDFTFPC  KA+TGLRM  TG QLH LAVKEGLINDVFVGCSVFDMYSKL FLNDAYK+FDEMPHRNLETWNAYISNSV HGRPEDS  AFIEL
Subjt:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL

Query:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
        LRVGGKPDSITFCAF NACSDKLGL PGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Subjt:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR

Query:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA
        ARKEDI+PTDFMVSSVLCACAGLSEIEFG +
Subjt:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA

A0A1S3AXN0 pentatricopeptide repeat-containing protein At4g148505.5e-17291.24Show/hide
Query:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
        MP LS NSLAS+VELAVSVRSSLLGRAAHAQILKTL+TP PAFLYNHLVNMYAKLDHL+SA+LIL+LAPCRSVVTWTALIAGSVQNGCF SALLHFSDML
Subjt:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML

Query:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL
        SDCVRPNDFTFPC  KA+TGLRM +TG QLH LAVKEGLINDVFVGCSVFDMYSKL FLNDAYK+FDEMP RNLETWNAYI+NSV HGRPEDSA AFIEL
Subjt:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL

Query:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
        LRVG KPDSITFCAF NACSDKLGL PGCQLHGF+IRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Subjt:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR

Query:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA
        ARKEDI+PTDFMVSSVLCACAGLSEIEFG +
Subjt:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA

A0A5A7U206 Pentatricopeptide repeat-containing protein5.5e-17291.24Show/hide
Query:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
        MP LS NSLAS+VELAVSVRSSLLGRAAHAQILKTL+TP PAFLYNHLVNMYAKLDHL+SA+LIL+LAPCRSVVTWTALIAGSVQNGCF SALLHFSDML
Subjt:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML

Query:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL
        SDCVRPNDFTFPC  KA+TGLRM +TG QLH LAVKEGLINDVFVGCSVFDMYSKL FLNDAYK+FDEMP RNLETWNAYI+NSV HGRPEDSA AFIEL
Subjt:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL

Query:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
        LRVG KPDSITFCAF NACSDKLGL PGCQLHGF+IRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
Subjt:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR

Query:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA
        ARKEDI+PTDFMVSSVLCACAGLSEIEFG +
Subjt:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA

A0A6J1GWT9 pentatricopeptide repeat-containing protein At4g148501.6e-16688.52Show/hide
Query:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
        MP LSPNSLASLVE A+S+RSSLLGR AHAQILKTL+TP PAFLYNHLVNMYAKLD L+SAELIL+LAPCRSVVTWT+LIAGSVQNG FASALLHFSDML
Subjt:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML

Query:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL
        SDCVRPNDFTFPC FKA+TGLRMA+TG Q+H LAVKEGLINDVFVGCS FDMYSKL  L+DAYK+F EMPHRNLETWNAYISNSV HGRPEDSA AFIEL
Subjt:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL

Query:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
        LR GGKPDSITFCAF NACSDKLGLEPGCQLHGFIIRSG GQNVS+SNGLIDFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKA CLFLR
Subjt:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR

Query:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA
        ARKEDIKPTDFMVSSVLCA AGLSEIE G +
Subjt:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA

A0A6J1IQR6 pentatricopeptide repeat-containing protein At4g148502.2e-16588.22Show/hide
Query:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
        MP LSPNSLASLVELA+S+RSSLLGR AHAQILKTL+TP PAFLYNHLVNMYAKLD L+SAELIL+LAPCRSVVTWT+LIAGSVQNG F+SALLHFSDML
Subjt:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML

Query:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL
        SDCVRPNDFTFPC  KA+TGLRMA+TG QLH LAVKEGLINDVFVGCS FDMYSKL  L+DAYK+F EMPHRNLETWNAYISNSV HGRPEDSA AFIEL
Subjt:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL

Query:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
        LR GGKPDSITFCAF NACSDKLGLEPGCQLHGFIIRSG GQNVS+SNGLIDFYGKCGEV CSE++FDRMGERNSVSWSSLIAAYVQNNEEEKA CLFLR
Subjt:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR

Query:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA
        ARKE IKPTDFMVSSVLCA AGLSEIE G +
Subjt:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA

SwissProt top hitse value%identityAlignment
P09444 Late embryogenesis abundant protein D-342.3e-8264.89Show/hide
Query:  QEQPRRDEKLQQPHQNDEQLETVKYGDVFNVSGDLAANPIAPEDARMMGSAETRVLGQVPQAGPADVMRAAAAHNVQVGLLSSRDVSDVAKNQGINISET
        Q QPRR    QQP    E  E +KYGDVFNVSG+LA  PIAP+DA MM +AET+VLGQ  + G A VM+AAA  N QVG++   D++D+A  QG+ ++ET
Subjt:  QEQPRRDEKLQQPHQNDEQLETVKYGDVFNVSGDLAANPIAPEDARMMGSAETRVLGQVPQAGPADVMRAAAAHNVQVGLLSSRDVSDVAKNQGINISET

Query:  DVPGARLVTECVAGQVVGQYLDTT--MASGVGTPEQNVITIGQALEAACQTIGNKPVERSDAAAIQAAEVRATGNNVISPGGLAATAQAAATFNARMDRD
        DV G R++TE VAGQVVGQY+  T  M S VG   QN ITIG+ALEA  +T G+KPV++SDAAA+QAAEVRATG+NVI PGGLAATAQ+AA  NA +DRD
Subjt:  DVPGARLVTECVAGQVVGQYLDTT--MASGVGTPEQNVITIGQALEAACQTIGNKPVERSDAAAIQAAEVRATGNNVISPGGLAATAQAAATFNARMDRD

Query:  EDKIKLSYVLTGATEKLTTDKAVSRKDAEGVVSAELRNNPSLTAQPGGVAASIAAAARLNED
        E+KIKL+ VLTGAT KL  DKAV+R+DAEGVVSAELRNNP++   PGGVAAS+AAAARLNE+
Subjt:  EDKIKLSYVLTGATEKLTTDKAVSRKDAEGVVSAELRNNPSLTAQPGGVAASIAAAARLNED

Q0WSH6 Pentatricopeptide repeat-containing protein At4g148503.1e-11159.21Show/hide
Query:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
        M LLS ++L  L++ A+S  S  LGR  HA+I+KTL +P P FL N+L+NMY+KLDH +SA L+L+L P R+VV+WT+LI+G  QNG F++AL+ F +M 
Subjt:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML

Query:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL
         + V PNDFTFPCAFKA   LR+ VTG Q+H LAVK G I DVFVGCS FDMY K    +DA K+FDE+P RNLETWNA+ISNSV  GRP ++  AFIE 
Subjt:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL

Query:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
         R+ G P+SITFCAF NACSD L L  G QLHG ++RSG+  +VSV NGLIDFYGKC ++  SE++F  MG +N+VSW SL+AAYVQN+E+EKAS L+LR
Subjt:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR

Query:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA
        +RK+ ++ +DFM+SSVL ACAG++ +E G +
Subjt:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial6.3e-4835.14Show/hide
Query:  GRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTGLRMA
        GR  HA IL+++       + N L+NMYAK   L+ A  + +  P R  VTWT LI+G  Q+     ALL F+ ML     PN+FT     KA    R  
Subjt:  GRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTGLRMA

Query:  VTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLG
          G QLHG  VK G  ++V VG ++ D+Y++   ++DA  +FD +  RN  +WNA I+        E +   F  +LR G +P   ++ + F ACS    
Subjt:  VTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLG

Query:  LEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTDFMVSSVLCACAGLS
        LE G  +H ++I+SG        N L+D Y K G +  +  +FDR+ +R+ VSW+SL+ AY Q+   ++A   F   R+  I+P +    SVL AC+   
Subjt:  LEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTDFMVSSVLCACAGLS

Query:  EIEFGHAGEVEEG
             H+G ++EG
Subjt:  EIEFGHAGEVEEG

Q9LJ95 Late embryogenesis abundant protein 324.8e-6455.51Show/hide
Query:  QEQPRRDEKLQQPHQNDEQLETVKYGDVFNVSGDLAANPIAPEDARMMGSAETRVLGQVPQAGPADVMRAAAAHNVQVGLLSSRDVSDVAKNQGINISET
        QEQPRR              E VKYGDVF VSG+LA  PIAPEDA+MM SAET V G   + GPA VM++AA  N++ G +   D +++   +G  + +T
Subjt:  QEQPRRDEKLQQPHQNDEQLETVKYGDVFNVSGDLAANPIAPEDARMMGSAETRVLGQVPQAGPADVMRAAAAHNVQVGLLSSRDVSDVAKNQGINISET

Query:  DVPGARLVTECVAGQVVGQYLD-TTMASGVGTPE---QNVITIGQALEAACQTIGNKPVERSDAAAIQAAEVRATGNNVISPGGLAATAQAAATFNARMD
         VP A + TE V GQVVGQ+++   + +   T E   Q+ ITIG+ALEA  +T GNKPV++SDAAAIQAAE+RA+G NVI+  G+AA+AQ+AA  NA +D
Subjt:  DVPGARLVTECVAGQVVGQYLD-TTMASGVGTPE---QNVITIGQALEAACQTIGNKPVERSDAAAIQAAEVRATGNNVISPGGLAATAQAAATFNARMD

Query:  RDEDKIKLSYVLTGATEKLTTDKAVSRKDAEGVVSAELRNNPSLTAQPGGVAASIAAAARLNE
        RDE KIKL  VLTGA  KL+ D+AV+R+DAEGVVSAE+RNNP L   PGGVAAS+  AARLNE
Subjt:  RDEDKIKLSYVLTGATEKLTTDKAVSRKDAEGVVSAELRNNPSLTAQPGGVAASIAAAARLNE

Q9LJ97 Late embryogenesis abundant protein 319.6e-7360.47Show/hide
Query:  LQQPHQNDEQLETVKYGDVFNVSGDLAANPIAPEDARMMGSAETRVLGQVPQAGPADVMRAAAAHNVQVGLLSSRDVSDVAKNQGINISETDVPGARLVT
        + Q  Q     E V YGDVF VSG+LA  PIAPEDA MM +AETRV G   + G A VM++AA  N + G +   D +D+A  +G+ +++TDVPGAR+ T
Subjt:  LQQPHQNDEQLETVKYGDVFNVSGDLAANPIAPEDARMMGSAETRVLGQVPQAGPADVMRAAAAHNVQVGLLSSRDVSDVAKNQGINISETDVPGARLVT

Query:  ECVAGQVVGQYLD--------TTMASGVGTPEQNVITIGQALEAACQTIGNKPVERSDAAAIQAAEVRATGNNVISPGGLAATAQAAATFNARMDRDEDK
        E V GQVVGQY++           A  VG   Q+ ITIG+ALEA  QT GNKPV++SDAAAIQAAEVRA G NVI+PGG+AA+AQ+AA  NA +DRDEDK
Subjt:  ECVAGQVVGQYLD--------TTMASGVGTPEQNVITIGQALEAACQTIGNKPVERSDAAAIQAAEVRATGNNVISPGGLAATAQAAATFNARMDRDEDK

Query:  IKLSYVLTGATEKLTTDKAVSRKDAEGVVSAELRNNPSLTAQPGGVAASIAAAARLNE
        IKL  VL GAT KL  DKAV+R+DAEGVVSAELRNNP+L+  PGGVAASI AAARLNE
Subjt:  IKLSYVLTGATEKLTTDKAVSRKDAEGVVSAELRNNPSLTAQPGGVAASIAAAARLNE

Arabidopsis top hitse value%identityAlignment
AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein3.0e-4529.63Show/hide
Query:  FLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLIND
        + +N +V    KL  LD A+ + +  P R   TW ++++G  Q+     AL +F+ M  +    N+++F     A +GL     G Q+H L  K   ++D
Subjt:  FLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLIND

Query:  VFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQ
        V++G ++ DMYSK   +NDA ++FDEM  RN+ +WN+ I+    +G   ++   F  +L    +PD +T  +  +AC+    ++ G ++HG ++++   +
Subjt:  VFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQ

Query:  N-VSVSNGLIDFYGKCGEVECSEMVFD-------------------------------RMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTD
        N + +SN  +D Y KC  ++ +  +FD                               +M ERN VSW++LIA Y QN E E+A  LF   ++E + PT 
Subjt:  N-VSVSNGLIDFYGKCGEVECSEMVFD-------------------------------RMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTD

Query:  FMVSSVLCACAGLSEIEFGHAGEV
        +  +++L ACA L+E+  G    V
Subjt:  FMVSSVLCACAGLSEIEFGHAGEV

AT3G22490.1 Seed maturation protein6.9e-7460.47Show/hide
Query:  LQQPHQNDEQLETVKYGDVFNVSGDLAANPIAPEDARMMGSAETRVLGQVPQAGPADVMRAAAAHNVQVGLLSSRDVSDVAKNQGINISETDVPGARLVT
        + Q  Q     E V YGDVF VSG+LA  PIAPEDA MM +AETRV G   + G A VM++AA  N + G +   D +D+A  +G+ +++TDVPGAR+ T
Subjt:  LQQPHQNDEQLETVKYGDVFNVSGDLAANPIAPEDARMMGSAETRVLGQVPQAGPADVMRAAAAHNVQVGLLSSRDVSDVAKNQGINISETDVPGARLVT

Query:  ECVAGQVVGQYLD--------TTMASGVGTPEQNVITIGQALEAACQTIGNKPVERSDAAAIQAAEVRATGNNVISPGGLAATAQAAATFNARMDRDEDK
        E V GQVVGQY++           A  VG   Q+ ITIG+ALEA  QT GNKPV++SDAAAIQAAEVRA G NVI+PGG+AA+AQ+AA  NA +DRDEDK
Subjt:  ECVAGQVVGQYLD--------TTMASGVGTPEQNVITIGQALEAACQTIGNKPVERSDAAAIQAAEVRATGNNVISPGGLAATAQAAATFNARMDRDEDK

Query:  IKLSYVLTGATEKLTTDKAVSRKDAEGVVSAELRNNPSLTAQPGGVAASIAAAARLNE
        IKL  VL GAT KL  DKAV+R+DAEGVVSAELRNNP+L+  PGGVAASI AAARLNE
Subjt:  IKLSYVLTGATEKLTTDKAVSRKDAEGVVSAELRNNPSLTAQPGGVAASIAAAARLNE

AT3G22500.1 Seed maturation protein3.4e-6555.51Show/hide
Query:  QEQPRRDEKLQQPHQNDEQLETVKYGDVFNVSGDLAANPIAPEDARMMGSAETRVLGQVPQAGPADVMRAAAAHNVQVGLLSSRDVSDVAKNQGINISET
        QEQPRR              E VKYGDVF VSG+LA  PIAPEDA+MM SAET V G   + GPA VM++AA  N++ G +   D +++   +G  + +T
Subjt:  QEQPRRDEKLQQPHQNDEQLETVKYGDVFNVSGDLAANPIAPEDARMMGSAETRVLGQVPQAGPADVMRAAAAHNVQVGLLSSRDVSDVAKNQGINISET

Query:  DVPGARLVTECVAGQVVGQYLD-TTMASGVGTPE---QNVITIGQALEAACQTIGNKPVERSDAAAIQAAEVRATGNNVISPGGLAATAQAAATFNARMD
         VP A + TE V GQVVGQ+++   + +   T E   Q+ ITIG+ALEA  +T GNKPV++SDAAAIQAAE+RA+G NVI+  G+AA+AQ+AA  NA +D
Subjt:  DVPGARLVTECVAGQVVGQYLD-TTMASGVGTPE---QNVITIGQALEAACQTIGNKPVERSDAAAIQAAEVRATGNNVISPGGLAATAQAAATFNARMD

Query:  RDEDKIKLSYVLTGATEKLTTDKAVSRKDAEGVVSAELRNNPSLTAQPGGVAASIAAAARLNE
        RDE KIKL  VLTGA  KL+ D+AV+R+DAEGVVSAE+RNNP L   PGGVAAS+  AARLNE
Subjt:  RDEDKIKLSYVLTGATEKLTTDKAVSRKDAEGVVSAELRNNPSLTAQPGGVAASIAAAARLNE

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.5e-4935.14Show/hide
Query:  GRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTGLRMA
        GR  HA IL+++       + N L+NMYAK   L+ A  + +  P R  VTWT LI+G  Q+     ALL F+ ML     PN+FT     KA    R  
Subjt:  GRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFTFPCAFKATTGLRMA

Query:  VTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLG
          G QLHG  VK G  ++V VG ++ D+Y++   ++DA  +FD +  RN  +WNA I+        E +   F  +LR G +P   ++ + F ACS    
Subjt:  VTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACSDKLG

Query:  LEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTDFMVSSVLCACAGLS
        LE G  +H ++I+SG        N L+D Y K G +  +  +FDR+ +R+ VSW+SL+ AY Q+   ++A   F   R+  I+P +    SVL AC+   
Subjt:  LEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTDFMVSSVLCACAGLS

Query:  EIEFGHAGEVEEG
             H+G ++EG
Subjt:  EIEFGHAGEVEEG

AT4G14850.1 Pentatricopeptide repeat (PPR) superfamily protein2.2e-11259.21Show/hide
Query:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML
        M LLS ++L  L++ A+S  S  LGR  HA+I+KTL +P P FL N+L+NMY+KLDH +SA L+L+L P R+VV+WT+LI+G  QNG F++AL+ F +M 
Subjt:  MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDML

Query:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL
         + V PNDFTFPCAFKA   LR+ VTG Q+H LAVK G I DVFVGCS FDMY K    +DA K+FDE+P RNLETWNA+ISNSV  GRP ++  AFIE 
Subjt:  SDCVRPNDFTFPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIEL

Query:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR
         R+ G P+SITFCAF NACSD L L  G QLHG ++RSG+  +VSV NGLIDFYGKC ++  SE++F  MG +N+VSW SL+AAYVQN+E+EKAS L+LR
Subjt:  LRVGGKPDSITFCAFFNACSDKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLR

Query:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA
        +RK+ ++ +DFM+SSVL ACAG++ +E G +
Subjt:  ARKEDIKPTDFMVSSVLCACAGLSEIEFGHA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCGCTTCTCTCGCCAAACTCGCTCGCTTCACTGGTCGAATTGGCCGTATCGGTTCGTTCTTCTCTTCTTGGCCGAGCCGCCCACGCCCAAATTCTCAAAACCCTAAG
AACCCCTCTTCCAGCCTTCCTCTACAACCACCTCGTGAACATGTACGCTAAACTCGATCATCTTGACTCGGCCGAACTCATCCTCAAACTCGCCCCTTGCCGTTCTGTCG
TCACTTGGACCGCCCTCATCGCTGGTTCCGTCCAAAACGGCTGTTTTGCTTCCGCTCTGCTTCACTTCTCCGACATGCTAAGTGACTGTGTTCGACCCAATGATTTCACT
TTCCCTTGCGCGTTCAAGGCCACCACTGGCCTTCGCATGGCCGTGACAGGCACACAGCTACACGGACTTGCGGTTAAGGAGGGATTAATAAACGATGTCTTCGTCGGGTG
CAGTGTCTTCGACATGTACAGCAAATTGAGCTTTCTTAATGACGCTTACAAGATATTTGATGAAATGCCTCATCGAAACCTCGAGACGTGGAATGCGTATATATCCAATT
CCGTGCACCATGGGCGACCTGAAGACTCTGCCAGTGCATTTATTGAGCTACTTCGGGTTGGTGGGAAGCCAGATTCCATAACATTTTGTGCTTTTTTCAATGCGTGTTCA
GACAAACTAGGCTTGGAGCCTGGGTGTCAGCTTCATGGGTTCATTATTAGAAGTGGGTATGGGCAGAATGTCTCTGTTTCAAATGGGTTGATTGATTTTTATGGGAAATG
TGGGGAAGTTGAATGTTCTGAGATGGTTTTTGACAGAATGGGAGAGCGGAACAGCGTATCTTGGTCCTCTTTGATAGCTGCTTACGTTCAAAACAATGAGGAGGAGAAGG
CTTCCTGCTTATTCTTGCGAGCGAGGAAAGAAGATATCAAACCAACTGATTTTATGGTATCAAGTGTGCTTTGTGCCTGTGCTGGTCTTTCAGAAATCGAGTTTGGACAT
GCTGGGGAAGTTGAGGAAGGAGATGCAGGAGGCTACTGGTTGCATTGCAGACACCAAGAGCAGCCACGCAGGGACGAAAAACTGCAGCAGCCTCACCAGAACGACGAGCA
GCTGGAAACCGTCAAGTATGGGGACGTTTTCAACGTCTCGGGTGACCTGGCTGCGAACCCCATCGCACCTGAGGATGCGCGCATGATGGGTAGTGCTGAAACGAGGGTGT
TGGGGCAAGTGCCTCAGGCTGGTCCGGCTGATGTCATGCGAGCCGCCGCCGCTCATAATGTCCAAGTTGGTCTTCTTAGTAGCCGTGATGTTAGCGATGTTGCTAAGAAT
CAAGGCATTAATATCAGCGAGACCGATGTTCCCGGAGCCCGTCTCGTTACTGAATGCGTTGCCGGACAGGTTGTTGGACAGTATTTGGACACGACGATGGCGAGTGGGGT
AGGAACGCCGGAGCAGAATGTAATCACGATTGGACAAGCCCTGGAAGCTGCATGTCAAACGATAGGAAACAAGCCGGTGGAACGAAGTGATGCTGCAGCAATTCAAGCCG
CAGAGGTCCGAGCAACCGGCAACAATGTCATAAGCCCAGGTGGGCTTGCCGCCACTGCTCAGGCGGCAGCAACTTTCAATGCCAGAATGGATCGAGACGAGGACAAGATC
AAGCTCAGCTATGTCTTAACGGGCGCAACTGAAAAACTGACGACAGACAAGGCGGTGAGCCGGAAGGATGCGGAGGGGGTGGTGAGCGCAGAGCTGAGGAACAATCCAAG
CCTGACGGCACAACCAGGTGGGGTGGCAGCATCCATTGCTGCTGCTGCGAGACTGAATGAGGACGATGCAGAGGGGATGGCGAGCGCAGAGCCGAAGAACAATCCAAGCC
CGACGACACACCCAGATGGGTTGGCGGCCTCCATCATCGCCAGCTCGGAACTGAATGAGGGGGGTGCAGGTATATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCGCTTCTCTCGCCAAACTCGCTCGCTTCACTGGTCGAATTGGCCGTATCGGTTCGTTCTTCTCTTCTTGGCCGAGCCGCCCACGCCCAAATTCTCAAAACCCTAAG
AACCCCTCTTCCAGCCTTCCTCTACAACCACCTCGTGAACATGTACGCTAAACTCGATCATCTTGACTCGGCCGAACTCATCCTCAAACTCGCCCCTTGCCGTTCTGTCG
TCACTTGGACCGCCCTCATCGCTGGTTCCGTCCAAAACGGCTGTTTTGCTTCCGCTCTGCTTCACTTCTCCGACATGCTAAGTGACTGTGTTCGACCCAATGATTTCACT
TTCCCTTGCGCGTTCAAGGCCACCACTGGCCTTCGCATGGCCGTGACAGGCACACAGCTACACGGACTTGCGGTTAAGGAGGGATTAATAAACGATGTCTTCGTCGGGTG
CAGTGTCTTCGACATGTACAGCAAATTGAGCTTTCTTAATGACGCTTACAAGATATTTGATGAAATGCCTCATCGAAACCTCGAGACGTGGAATGCGTATATATCCAATT
CCGTGCACCATGGGCGACCTGAAGACTCTGCCAGTGCATTTATTGAGCTACTTCGGGTTGGTGGGAAGCCAGATTCCATAACATTTTGTGCTTTTTTCAATGCGTGTTCA
GACAAACTAGGCTTGGAGCCTGGGTGTCAGCTTCATGGGTTCATTATTAGAAGTGGGTATGGGCAGAATGTCTCTGTTTCAAATGGGTTGATTGATTTTTATGGGAAATG
TGGGGAAGTTGAATGTTCTGAGATGGTTTTTGACAGAATGGGAGAGCGGAACAGCGTATCTTGGTCCTCTTTGATAGCTGCTTACGTTCAAAACAATGAGGAGGAGAAGG
CTTCCTGCTTATTCTTGCGAGCGAGGAAAGAAGATATCAAACCAACTGATTTTATGGTATCAAGTGTGCTTTGTGCCTGTGCTGGTCTTTCAGAAATCGAGTTTGGACAT
GCTGGGGAAGTTGAGGAAGGAGATGCAGGAGGCTACTGGTTGCATTGCAGACACCAAGAGCAGCCACGCAGGGACGAAAAACTGCAGCAGCCTCACCAGAACGACGAGCA
GCTGGAAACCGTCAAGTATGGGGACGTTTTCAACGTCTCGGGTGACCTGGCTGCGAACCCCATCGCACCTGAGGATGCGCGCATGATGGGTAGTGCTGAAACGAGGGTGT
TGGGGCAAGTGCCTCAGGCTGGTCCGGCTGATGTCATGCGAGCCGCCGCCGCTCATAATGTCCAAGTTGGTCTTCTTAGTAGCCGTGATGTTAGCGATGTTGCTAAGAAT
CAAGGCATTAATATCAGCGAGACCGATGTTCCCGGAGCCCGTCTCGTTACTGAATGCGTTGCCGGACAGGTTGTTGGACAGTATTTGGACACGACGATGGCGAGTGGGGT
AGGAACGCCGGAGCAGAATGTAATCACGATTGGACAAGCCCTGGAAGCTGCATGTCAAACGATAGGAAACAAGCCGGTGGAACGAAGTGATGCTGCAGCAATTCAAGCCG
CAGAGGTCCGAGCAACCGGCAACAATGTCATAAGCCCAGGTGGGCTTGCCGCCACTGCTCAGGCGGCAGCAACTTTCAATGCCAGAATGGATCGAGACGAGGACAAGATC
AAGCTCAGCTATGTCTTAACGGGCGCAACTGAAAAACTGACGACAGACAAGGCGGTGAGCCGGAAGGATGCGGAGGGGGTGGTGAGCGCAGAGCTGAGGAACAATCCAAG
CCTGACGGCACAACCAGGTGGGGTGGCAGCATCCATTGCTGCTGCTGCGAGACTGAATGAGGACGATGCAGAGGGGATGGCGAGCGCAGAGCCGAAGAACAATCCAAGCC
CGACGACACACCCAGATGGGTTGGCGGCCTCCATCATCGCCAGCTCGGAACTGAATGAGGGGGGTGCAGGTATATGAAAGGGTAAGGGCAGCTTAACTGAGAGCTTTGAG
TGAGTGTTTAGGTGTATGAGAAAATAACAGCCAGCTTAAGTGAGAGTTGTGAGTGAGTCTTTTGTTGGTTTGATCAGGAATTTTGTTTTTCTGATACCCTTATTTGGGTC
CTTAAAAACAAAACTCTGCTCCACAGCAAAGCAG
Protein sequenceShow/hide protein sequence
MPLLSPNSLASLVELAVSVRSSLLGRAAHAQILKTLRTPLPAFLYNHLVNMYAKLDHLDSAELILKLAPCRSVVTWTALIAGSVQNGCFASALLHFSDMLSDCVRPNDFT
FPCAFKATTGLRMAVTGTQLHGLAVKEGLINDVFVGCSVFDMYSKLSFLNDAYKIFDEMPHRNLETWNAYISNSVHHGRPEDSASAFIELLRVGGKPDSITFCAFFNACS
DKLGLEPGCQLHGFIIRSGYGQNVSVSNGLIDFYGKCGEVECSEMVFDRMGERNSVSWSSLIAAYVQNNEEEKASCLFLRARKEDIKPTDFMVSSVLCACAGLSEIEFGH
AGEVEEGDAGGYWLHCRHQEQPRRDEKLQQPHQNDEQLETVKYGDVFNVSGDLAANPIAPEDARMMGSAETRVLGQVPQAGPADVMRAAAAHNVQVGLLSSRDVSDVAKN
QGINISETDVPGARLVTECVAGQVVGQYLDTTMASGVGTPEQNVITIGQALEAACQTIGNKPVERSDAAAIQAAEVRATGNNVISPGGLAATAQAAATFNARMDRDEDKI
KLSYVLTGATEKLTTDKAVSRKDAEGVVSAELRNNPSLTAQPGGVAASIAAAARLNEDDAEGMASAEPKNNPSPTTHPDGLAASIIASSELNEGGAGI