; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10018808 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10018808
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr04:8985784..8987298
RNA-Seq ExpressionHG10018808
SyntenyHG10018808
Gene Ontology termsGO:0032544 - plastid translation (biological process)
GO:0043489 - RNA stabilization (biological process)
GO:0009536 - plastid (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571877.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]1.7e-24484.52Show/hide
Query:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF
        MVPKSI F+H FLSNRITSSF TISSILTYSTQPNLN       SH+AI + SLQSQ LEQSL+SFKLM+L+G+ PSS SFNNVLGLLAKSG+L +TWWF
Subjt:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF

Query:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD
        FTEFLGRTHFD YSFGITIKAFC+NGNVSKGFELL+QMER+GLSPNVVIYTILIDACCKNGDIEQAKVLFS+MNDLGLV NQYTYT MINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT
        GFEL+EKMKLVGV PSLYTYN+LINEYCRDGKLSIAFK+FDEMST GVSCNVVTY ILIGGLCR RQ+SKAERL E MKQ  INPT RT+NLLMDG CN 
Subjt:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT

Query:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH
        GKL+KAL YFD+LKLIG +PTSV+YNILIAGFSKAGNSSVVSELVREMEDRG+SPSKVTYTILMDAF+RSDDVEKA Q+F LMKK+GSVPDQ+TYGVL+H
Subjt:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH

Query:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC
        GLCMKGNMV+A+KLY SMVEM++EPNDVIYN MINGYCKECNSYKALKFL+EMV KG TPSLTSY STIEVL NEGKS EAK LLKEMIEAGL PSESLC
Subjt:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC

Query:  SKVG
         K+G
Subjt:  SKVG

XP_008455127.1 PREDICTED: pentatricopeptide repeat-containing protein At4g11690 [Cucumis melo]1.4e-24985.71Show/hide
Query:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF
        MVPKSIGF++PFLSNRITSSFFTISS+LTYSTQ NLNS+SV G  HDA+IN SLQS  LEQSL SFKLMVLKG+SPSS+SFNNVL LLAKSGNLDRTWWF
Subjt:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF

Query:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD
        FTE+LGRT FDVYSFGITIKAFCENGNVSKGFELL+QME MG+SPNV IYTILI+ACCKNGDI+QAKV+FSRM+DLGL A+QY YTVMINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT
        GFELYEKMKL+GV P+LYTYNSLI EYCRDGKLS+AFKLFDE+S RGV+CN VTYNILIGGLCR  QV KAE LLE MK+AHINPT RTFNLLMDGLCNT
Subjt:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT

Query:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH
        GKLDKALSY DKLKLIGQSPT VTYNILI+GFSK GNSSVVSELVREMEDRGISPSKVTYTILMDAF+RSDD+EKAY+MF LMK+IG VPDQ+TYGVLIH
Subjt:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH

Query:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC
        GLC++GNMVEA+KLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVK G TP++ SY STI+VLC +GKSIEAK LLKEM EAGLKP ESL 
Subjt:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC

Query:  SKVG
        SKVG
Subjt:  SKVG

XP_022953086.1 pentatricopeptide repeat-containing protein At4g11690 [Cucurbita moschata]2.3e-24484.52Show/hide
Query:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF
        MVPKSI F+H FLSNRITSSF TISSILTYSTQPNLN       SH+AI + SLQSQ LEQSL+SFKLM+L+G+ PS+ SFNNVLGLLAKSG+L +TWWF
Subjt:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF

Query:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD
        FTEFLGRTHFD YSFGITIKAFC+NGNVSKGFELL+QMER+GLSPNVVIYTILIDACCKNGDIEQAKVLFS+MNDLGLVANQYTYT MINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT
        GFEL+EKMKLVGV PSLYTYN+LINEYCRDGKLSIAFK+FDEMST GVSCNVVTY ILIGGLCR RQ++KAERL E MKQ  INPT RT+NLLMDG CN 
Subjt:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT

Query:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH
        GKL+KAL YFD+LKLIG +PTSV+YNILIAGFSKAGNSSVVSELVREMEDRG+SPSKVTYTILMDAF+RSDDVEKA Q+F LMKK+GSVPDQ+TYGVL+H
Subjt:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH

Query:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC
        GLCMKGNMVEA+KLY SMVEM++EPNDVIYN MINGYCKECNSYKALKFL+EMV KG TPSLTSY STIEVL NEGKS EAK LLKEMIEAGL PSESLC
Subjt:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC

Query:  SKVG
         K+G
Subjt:  SKVG

XP_022971940.1 pentatricopeptide repeat-containing protein At4g11690 [Cucurbita maxima]1.1e-24986.71Show/hide
Query:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF
        MVPKSIGFLH FLSNRITSSFFTISSILTYSTQPNLN       SH+AIIN SLQSQLLEQSLHSFKLM+L+G+ PSS SFNNVLGLLAKSG+L +TW F
Subjt:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF

Query:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD
        FTEFLGRT FD YSFGITIKAFC+NGNVSKGFELL+QMER+GLSPNVVIYTILIDACCKNGDIEQAKVLFS+MNDLG VANQYTYT MINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT
        GFELYEKMKLVGV PSLYTYN+LINEYCRDGKLSIAFK+FDEMSTRGVSCNVVTY ILIGGLCR RQ+SKAERL E MKQ HINPT RT+NLLMDG CN 
Subjt:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT

Query:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH
        GKL+KAL YFDKLKLIG +PTSV+YNILIAGFSKAGNS+VVSELVREMEDRGISPSKVTYTILMDAF+RSDDVEKA Q+F LMKK+GSVPDQ+TYGVL+H
Subjt:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH

Query:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC
        GLCMKGNMVEA+KLYKSMVEM++EPNDVIYN MINGYCKECNSYKALKFL+EMV KG TPSL SYSSTIEVLCNEGKS EAK LLKEMIEAGL PSESL 
Subjt:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC

Query:  SKVG
         K+G
Subjt:  SKVG

XP_038887006.1 pentatricopeptide repeat-containing protein At4g11690 [Benincasa hispida]6.6e-26090.08Show/hide
Query:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF
        MV KSIGF++PFL NRITSSFFT SSILTYSTQPNLN DS  G SH AIIN SLQSQ LEQSLH+FKLMVLKGY PSS SFNNVLG LAKSGNL RTWWF
Subjt:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF

Query:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD
        F+EFL RT FDVYSFGITIKAFCENGN+SKGF+LL+QMERMGLS NVVIYTILIDACCKNGDIEQAKVLFSRM+DLGLVAN YTYTVMINGFFKKGY+KD
Subjt:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT
        GFELYEKMKLVGV P+LYTYNSLINEYCRDGKLS+AFKLFDEMSTRGVSCNV+TYNILIGGLCR RQVSKAE LLE+MKQAHINPT RTFNLL+DGLCNT
Subjt:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT

Query:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH
        GKLDKALSYFDK+KLIGQSPTSVTYNILIAGFSK GNSSVVSELVREMEDRGISPSKVTYTILM AF+RSDDVEKAY+MFRLMKKIGSVPDQ+TYGVLIH
Subjt:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH

Query:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC
        GLCMKGNMVEA+KLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKG TPS+ SYS TI VLCNEGKS EAKHLLKEMIEAGLKPSESLC
Subjt:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC

Query:  SKVG
         KVG
Subjt:  SKVG

TrEMBL top hitse value%identityAlignment
A0A1S3C0B2 pentatricopeptide repeat-containing protein At4g116906.7e-25085.71Show/hide
Query:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF
        MVPKSIGF++PFLSNRITSSFFTISS+LTYSTQ NLNS+SV G  HDA+IN SLQS  LEQSL SFKLMVLKG+SPSS+SFNNVL LLAKSGNLDRTWWF
Subjt:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF

Query:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD
        FTE+LGRT FDVYSFGITIKAFCENGNVSKGFELL+QME MG+SPNV IYTILI+ACCKNGDI+QAKV+FSRM+DLGL A+QY YTVMINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT
        GFELYEKMKL+GV P+LYTYNSLI EYCRDGKLS+AFKLFDE+S RGV+CN VTYNILIGGLCR  QV KAE LLE MK+AHINPT RTFNLLMDGLCNT
Subjt:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT

Query:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH
        GKLDKALSY DKLKLIGQSPT VTYNILI+GFSK GNSSVVSELVREMEDRGISPSKVTYTILMDAF+RSDD+EKAY+MF LMK+IG VPDQ+TYGVLIH
Subjt:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH

Query:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC
        GLC++GNMVEA+KLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVK G TP++ SY STI+VLC +GKSIEAK LLKEM EAGLKP ESL 
Subjt:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC

Query:  SKVG
        SKVG
Subjt:  SKVG

A0A5D3C6B4 Pentatricopeptide repeat-containing protein6.7e-25085.71Show/hide
Query:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF
        MVPKSIGF++PFLSNRITSSFFTISS+LTYSTQ NLNS+SV G  HDA+IN SLQS  LEQSL SFKLMVLKG+SPSS+SFNNVL LLAKSGNLDRTWWF
Subjt:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF

Query:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD
        FTE+LGRT FDVYSFGITIKAFCENGNVSKGFELL+QME MG+SPNV IYTILI+ACCKNGDI+QAKV+FSRM+DLGL A+QY YTVMINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT
        GFELYEKMKL+GV P+LYTYNSLI EYCRDGKLS+AFKLFDE+S RGV+CN VTYNILIGGLCR  QV KAE LLE MK+AHINPT RTFNLLMDGLCNT
Subjt:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT

Query:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH
        GKLDKALSY DKLKLIGQSPT VTYNILI+GFSK GNSSVVSELVREMEDRGISPSKVTYTILMDAF+RSDD+EKAY+MF LMK+IG VPDQ+TYGVLIH
Subjt:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH

Query:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC
        GLC++GNMVEA+KLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVK G TP++ SY STI+VLC +GKSIEAK LLKEM EAGLKP ESL 
Subjt:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC

Query:  SKVG
        SKVG
Subjt:  SKVG

A0A6J1C8X2 pentatricopeptide repeat-containing protein At4g116902.1e-23580.58Show/hide
Query:  MVPKSIGFLHPF----------------LSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNV
        MVPK  GF+H F                LSNRI S FFTISSILT+ST+ NLN+  + G  H+AII+  +QSQL EQSL+ FKLMVLKG  PSS SFNNV
Subjt:  MVPKSIGFLHPF----------------LSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNV

Query:  LGLLAKSGNLDRTWWFFTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYT
        LGLL KSG+L++ W FF EFLGRTHFDVYSFGI IKAFCE GNVSKGFELL+QMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFS+M DLGLVANQYT
Subjt:  LGLLAKSGNLDRTWWFFTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYT

Query:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHIN
        YTVMING FKKG KKDGFELYEKM L+GVFPS+YTYNSLINEYCRDG L +AFKLFDEM TRGVSCNVVTYNILIGGLCRNRQV KAE LLE+MK AHIN
Subjt:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHIN

Query:  PTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMK
        P+  TFNLLMDG CN GK DKALSYFD+LKLIG SPTSVTYNILIAGFSKAGNS+VV ELVREMEDRGISPSKVTYTILMDAF+RSDDV KA QMF LMK
Subjt:  PTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMK

Query:  KIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHL
        K+GSVPDQYTYGVL+HGLCMKGNMVEA+KLYKSM+E HL+PNDVIYNTMINGYCKECNSYKALKFL+EMV+KG TPS  SYSSTIEVLC +GKS EAK L
Subjt:  KIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHL

Query:  LKEMIEAGLKPSESLCSKVG
        LKEMIE GL PSESLC +VG
Subjt:  LKEMIEAGLKPSESLCSKVG

A0A6J1GM09 pentatricopeptide repeat-containing protein At4g116901.1e-24484.52Show/hide
Query:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF
        MVPKSI F+H FLSNRITSSF TISSILTYSTQPNLN       SH+AI + SLQSQ LEQSL+SFKLM+L+G+ PS+ SFNNVLGLLAKSG+L +TWWF
Subjt:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF

Query:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD
        FTEFLGRTHFD YSFGITIKAFC+NGNVSKGFELL+QMER+GLSPNVVIYTILIDACCKNGDIEQAKVLFS+MNDLGLVANQYTYT MINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT
        GFEL+EKMKLVGV PSLYTYN+LINEYCRDGKLSIAFK+FDEMST GVSCNVVTY ILIGGLCR RQ++KAERL E MKQ  INPT RT+NLLMDG CN 
Subjt:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT

Query:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH
        GKL+KAL YFD+LKLIG +PTSV+YNILIAGFSKAGNSSVVSELVREMEDRG+SPSKVTYTILMDAF+RSDDVEKA Q+F LMKK+GSVPDQ+TYGVL+H
Subjt:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH

Query:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC
        GLCMKGNMVEA+KLY SMVEM++EPNDVIYN MINGYCKECNSYKALKFL+EMV KG TPSLTSY STIEVL NEGKS EAK LLKEMIEAGL PSESLC
Subjt:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC

Query:  SKVG
         K+G
Subjt:  SKVG

A0A6J1I748 pentatricopeptide repeat-containing protein At4g116905.1e-25086.71Show/hide
Query:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF
        MVPKSIGFLH FLSNRITSSFFTISSILTYSTQPNLN       SH+AIIN SLQSQLLEQSLHSFKLM+L+G+ PSS SFNNVLGLLAKSG+L +TW F
Subjt:  MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWF

Query:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD
        FTEFLGRT FD YSFGITIKAFC+NGNVSKGFELL+QMER+GLSPNVVIYTILIDACCKNGDIEQAKVLFS+MNDLG VANQYTYT MINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT
        GFELYEKMKLVGV PSLYTYN+LINEYCRDGKLSIAFK+FDEMSTRGVSCNVVTY ILIGGLCR RQ+SKAERL E MKQ HINPT RT+NLLMDG CN 
Subjt:  GFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNT

Query:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH
        GKL+KAL YFDKLKLIG +PTSV+YNILIAGFSKAGNS+VVSELVREMEDRGISPSKVTYTILMDAF+RSDDVEKA Q+F LMKK+GSVPDQ+TYGVL+H
Subjt:  GKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIH

Query:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC
        GLCMKGNMVEA+KLYKSMVEM++EPNDVIYN MINGYCKECNSYKALKFL+EMV KG TPSL SYSSTIEVLCNEGKS EAK LLKEMIEAGL PSESL 
Subjt:  GLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLC

Query:  SKVG
         K+G
Subjt:  SKVG

SwissProt top hitse value%identityAlignment
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial1.8e-6630.63Show/hide
Query:  SHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHF--DVYSFGITIKAFCENGNVSKGFELLSQMERMG
        S++ +I+   Q   ++++ H   LM LKGY+P   S++ V+    + G LD+ W    E + R     + Y +G  I   C    +++  E  S+M R G
Subjt:  SHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHF--DVYSFGITIKAFCENGNVSKGFELLSQMERMG

Query:  LSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDE
        + P+ V+YT LID  CK GDI  A   F  M+   +  +  TYT +I+GF + G   +  +L+ +M   G+ P   T+  LIN YC+ G +  AF++ + 
Subjt:  LSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDE

Query:  MSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVS
        M   G S NVVTY  LI GLC+   +  A  LL EM +  + P I T+N +++GLC +G +++A+    + +  G +  +VTY  L+  + K+G      
Subjt:  MSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVS

Query:  ELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECN
        E+++EM  +G+ P+ VT+ +LM+ F     +E   ++   M   G  P+  T+  L+   C++ N+  A  +YK M    + P+   Y  ++ G+CK  N
Subjt:  ELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECN

Query:  SYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESL
          +A    +EM  KG + S+++YS  I+      K +EA+ +  +M   GL   + +
Subjt:  SYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESL

Q3EDF8 Pentatricopeptide repeat-containing protein At1g099005.0e-6130.61Show/hide
Query:  GYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHF--DVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLF
        G  P   ++N ++    K+G ++         L R     DV ++   +++ C++G + +  E+L +M +    P+V+ YTILI+A C++  +  A  L 
Subjt:  GYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHF--DVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLF

Query:  SRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSK
          M D G   +  TY V++NG  K+G   +  +    M   G  P++ T+N ++   C  G+   A KL  +M  +G S +VVT+NILI  LCR   + +
Subjt:  SRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSK

Query:  AERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRS
        A  +LE+M Q    P   ++N L+ G C   K+D+A+ Y +++   G  P  VTYN ++    K G      E++ ++  +G SP  +TY  ++D   ++
Subjt:  AERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRS

Query:  DDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIE
            KA ++   M+     PD  TY  L+ GL  +G + EA K +     M + PN V +N+++ G CK   + +A+ FL  M+ +G  P+ TSY+  IE
Subjt:  DDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIE

Query:  VLCNEGKSIEAKHLLKEMIEAGLKPSES
         L  EG + EA  LL E+   GL    S
Subjt:  VLCNEGKSIEAKHLLKEMIEAGLKPSES

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397101.9e-6332.45Show/hide
Query:  DAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKS-GNLDRTWWFFTEFL-GRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLS
        D ++ +  +  L++++L    L    G+ P   S+N VL    +S  N+      F E L  +   +V+++ I I+ FC  GN+     L  +ME  G  
Subjt:  DAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKS-GNLDRTWWFFTEFL-GRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLS

Query:  PNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMS
        PNVV Y  LID  CK   I+    L   M   GL  N  +Y V+ING  ++G  K+   +  +M   G      TYN+LI  YC++G    A  +  EM 
Subjt:  PNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMS

Query:  TRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSEL
          G++ +V+TY  LI  +C+   +++A   L++M+   + P  RT+  L+DG    G +++A     ++   G SP+ VTYN LI G    G       +
Subjt:  TRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSEL

Query:  VREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSY
        + +M+++G+SP  V+Y+ ++  F RS DV++A ++ R M + G  PD  TY  LI G C +    EA  LY+ M+ + L P++  Y  +IN YC E +  
Subjt:  VREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSY

Query:  KALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSE
        KAL+   EMV+KG  P + +YS  I  L  + ++ EAK LL ++      PS+
Subjt:  KALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSE

Q9SXD1 Pentatricopeptide repeat-containing protein At1g62670, mitochondrial6.0e-6228.48Show/hide
Query:  AIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGR-THFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPN
        +++N    S+ + +++     M + GY P++ +FN ++  L                + +    D+ ++G+ +   C+ G+    F LL++ME+  L P 
Subjt:  AIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGR-THFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPN

Query:  VVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTR
        V+IY  +ID  CK   ++ A  LF  M   G+  N  TY+ +I+     G   D   L   M    + P ++T+++LI+ + ++GKL  A KL+DEM  R
Subjt:  VVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTR

Query:  GVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVR
         +  ++VTY+ LI G C + ++ +A+++ E M   H  P + T+N L+ G C   ++++ +  F ++   G    +VTYNILI G  +AG+  +  E+ +
Subjt:  GVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVR

Query:  EMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKA
        EM   G+ P+ +TY  L+D   ++  +EKA  +F  +++    P  YTY ++I G+C  G + +   L+ ++    ++P+ V YNTMI+G+C++ +  +A
Subjt:  EMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKA

Query:  LKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAG
            +EM + G  P+   Y++ I     +G    +  L+KEM   G
Subjt:  LKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAG

Q9T0D6 Pentatricopeptide repeat-containing protein At4g116901.9e-14553.16Show/hide
Query:  LSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHFDV
        +S +I S FFT SS+L Y T+    +       ++ IIN+ +QSQ L  S+  F  MV  G+ P S  FN +L  +  S + ++ W FF E   +   DV
Subjt:  LSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHFDV

Query:  YSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVG
        YSFGI IK  CE G + K F+LL ++   G SPNVVIYT LID CCK G+IE+AK LF  M  LGLVAN+ TYTV+ING FK G KK GFE+YEKM+  G
Subjt:  YSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVG

Query:  VFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDK
        VFP+LYTYN ++N+ C+DG+   AF++FDEM  RGVSCN+VTYN LIGGLCR  ++++A +++++MK   INP + T+N L+DG C  GKL KALS    
Subjt:  VFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDK

Query:  LKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAA
        LK  G SP+ VTYNIL++GF + G++S  +++V+EME+RGI PSKVTYTIL+D F RSD++EKA Q+   M+++G VPD +TY VLIHG C+KG M EA+
Subjt:  LKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAA

Query:  KLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLCSKV
        +L+KSMVE + EPN+VIYNTMI GYCKE +SY+ALK L+EM +K   P++ SY   IEVLC E KS EA+ L+++MI++G+ PS S+ S +
Subjt:  KLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLCSKV

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.3e-6730.63Show/hide
Query:  SHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHF--DVYSFGITIKAFCENGNVSKGFELLSQMERMG
        S++ +I+   Q   ++++ H   LM LKGY+P   S++ V+    + G LD+ W    E + R     + Y +G  I   C    +++  E  S+M R G
Subjt:  SHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHF--DVYSFGITIKAFCENGNVSKGFELLSQMERMG

Query:  LSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDE
        + P+ V+YT LID  CK GDI  A   F  M+   +  +  TYT +I+GF + G   +  +L+ +M   G+ P   T+  LIN YC+ G +  AF++ + 
Subjt:  LSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDE

Query:  MSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVS
        M   G S NVVTY  LI GLC+   +  A  LL EM +  + P I T+N +++GLC +G +++A+    + +  G +  +VTY  L+  + K+G      
Subjt:  MSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVS

Query:  ELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECN
        E+++EM  +G+ P+ VT+ +LM+ F     +E   ++   M   G  P+  T+  L+   C++ N+  A  +YK M    + P+   Y  ++ G+CK  N
Subjt:  ELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECN

Query:  SYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESL
          +A    +EM  KG + S+++YS  I+      K +EA+ +  +M   GL   + +
Subjt:  SYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESL

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein1.3e-6730.63Show/hide
Query:  SHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHF--DVYSFGITIKAFCENGNVSKGFELLSQMERMG
        S++ +I+   Q   ++++ H   LM LKGY+P   S++ V+    + G LD+ W    E + R     + Y +G  I   C    +++  E  S+M R G
Subjt:  SHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHF--DVYSFGITIKAFCENGNVSKGFELLSQMERMG

Query:  LSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDE
        + P+ V+YT LID  CK GDI  A   F  M+   +  +  TYT +I+GF + G   +  +L+ +M   G+ P   T+  LIN YC+ G +  AF++ + 
Subjt:  LSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDE

Query:  MSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVS
        M   G S NVVTY  LI GLC+   +  A  LL EM +  + P I T+N +++GLC +G +++A+    + +  G +  +VTY  L+  + K+G      
Subjt:  MSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVS

Query:  ELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECN
        E+++EM  +G+ P+ VT+ +LM+ F     +E   ++   M   G  P+  T+  L+   C++ N+  A  +YK M    + P+   Y  ++ G+CK  N
Subjt:  ELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECN

Query:  SYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESL
          +A    +EM  KG + S+++YS  I+      K +EA+ +  +M   GL   + +
Subjt:  SYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESL

AT1G62670.1 rna processing factor 24.2e-6328.48Show/hide
Query:  AIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGR-THFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPN
        +++N    S+ + +++     M + GY P++ +FN ++  L                + +    D+ ++G+ +   C+ G+    F LL++ME+  L P 
Subjt:  AIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGR-THFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPN

Query:  VVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTR
        V+IY  +ID  CK   ++ A  LF  M   G+  N  TY+ +I+     G   D   L   M    + P ++T+++LI+ + ++GKL  A KL+DEM  R
Subjt:  VVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTR

Query:  GVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVR
         +  ++VTY+ LI G C + ++ +A+++ E M   H  P + T+N L+ G C   ++++ +  F ++   G    +VTYNILI G  +AG+  +  E+ +
Subjt:  GVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVR

Query:  EMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKA
        EM   G+ P+ +TY  L+D   ++  +EKA  +F  +++    P  YTY ++I G+C  G + +   L+ ++    ++P+ V YNTMI+G+C++ +  +A
Subjt:  EMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKA

Query:  LKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAG
            +EM + G  P+   Y++ I     +G    +  L+KEM   G
Subjt:  LKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAG

AT4G11690.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.4e-14653.16Show/hide
Query:  LSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHFDV
        +S +I S FFT SS+L Y T+    +       ++ IIN+ +QSQ L  S+  F  MV  G+ P S  FN +L  +  S + ++ W FF E   +   DV
Subjt:  LSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHFDV

Query:  YSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVG
        YSFGI IK  CE G + K F+LL ++   G SPNVVIYT LID CCK G+IE+AK LF  M  LGLVAN+ TYTV+ING FK G KK GFE+YEKM+  G
Subjt:  YSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVG

Query:  VFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDK
        VFP+LYTYN ++N+ C+DG+   AF++FDEM  RGVSCN+VTYN LIGGLCR  ++++A +++++MK   INP + T+N L+DG C  GKL KALS    
Subjt:  VFPSLYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDK

Query:  LKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAA
        LK  G SP+ VTYNIL++GF + G++S  +++V+EME+RGI PSKVTYTIL+D F RSD++EKA Q+   M+++G VPD +TY VLIHG C+KG M EA+
Subjt:  LKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAA

Query:  KLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLCSKV
        +L+KSMVE + EPN+VIYNTMI GYCKE +SY+ALK L+EM +K   P++ SY   IEVLC E KS EA+ L+++MI++G+ PS S+ S +
Subjt:  KLYKSMVEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLCSKV

AT5G39710.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-6432.45Show/hide
Query:  DAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKS-GNLDRTWWFFTEFL-GRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLS
        D ++ +  +  L++++L    L    G+ P   S+N VL    +S  N+      F E L  +   +V+++ I I+ FC  GN+     L  +ME  G  
Subjt:  DAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKS-GNLDRTWWFFTEFL-GRTHFDVYSFGITIKAFCENGNVSKGFELLSQMERMGLS

Query:  PNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMS
        PNVV Y  LID  CK   I+    L   M   GL  N  +Y V+ING  ++G  K+   +  +M   G      TYN+LI  YC++G    A  +  EM 
Subjt:  PNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTYNSLINEYCRDGKLSIAFKLFDEMS

Query:  TRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSEL
          G++ +V+TY  LI  +C+   +++A   L++M+   + P  RT+  L+DG    G +++A     ++   G SP+ VTYN LI G    G       +
Subjt:  TRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIAGFSKAGNSSVVSEL

Query:  VREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSY
        + +M+++G+SP  V+Y+ ++  F RS DV++A ++ R M + G  PD  TY  LI G C +    EA  LY+ M+ + L P++  Y  +IN YC E +  
Subjt:  VREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKECNSY

Query:  KALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSE
        KAL+   EMV+KG  P + +YS  I  L  + ++ EAK LL ++      PS+
Subjt:  KALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCCCAAATCCATTGGCTTCCTGCACCCATTTCTTTCTAATCGGATCACCTCGTCTTTCTTCACTATTTCTTCCATTTTAACCTATTCAACACAACCAAATCTGAA
TTCTGATTCAGTCTCTGGCCATTCTCATGATGCAATTATCAATACTTCTCTTCAATCTCAACTATTAGAACAGTCCCTTCACAGTTTTAAACTAATGGTCCTTAAAGGGT
ATTCTCCCAGTTCATTCTCTTTCAATAATGTATTGGGTTTACTTGCCAAATCAGGCAATTTGGATAGAACTTGGTGGTTTTTCACTGAATTTCTGGGGAGGACTCACTTT
GATGTGTATAGTTTTGGGATTACCATTAAAGCCTTTTGCGAAAATGGCAATGTAAGTAAAGGTTTTGAGCTTTTGTCTCAAATGGAGAGGATGGGTTTGTCTCCTAATGT
TGTTATATACACTATCTTGATTGATGCTTGTTGCAAAAATGGTGACATTGAGCAGGCTAAAGTATTGTTTTCTAGGATGAATGATCTTGGTTTGGTTGCTAACCAATATA
CTTATACTGTCATGATCAATGGATTTTTCAAGAAAGGTTATAAGAAAGATGGTTTTGAGCTTTATGAGAAGATGAAGCTTGTTGGGGTGTTTCCCAGTTTATATACTTAC
AACAGTCTTATTAATGAATATTGTAGGGATGGAAAGTTGAGCATTGCATTTAAGTTGTTTGATGAAATGTCTACAAGAGGGGTGTCATGTAATGTAGTCACATACAATAT
TCTAATTGGTGGGTTATGTCGTAATAGACAAGTGTCGAAAGCTGAACGGCTATTAGAAGAAATGAAACAAGCTCATATAAATCCAACTATTAGAACATTTAACCTGTTGA
TGGATGGGTTGTGTAACACTGGAAAGTTGGACAAGGCCTTAAGTTATTTTGATAAGCTGAAGTTGATTGGTCAGTCTCCAACTTCAGTGACCTACAACATTTTAATTGCA
GGTTTCTCTAAAGCAGGAAATTCTTCTGTAGTTTCAGAGTTAGTGAGAGAGATGGAGGACAGAGGCATTTCTCCCTCTAAAGTGACATACACAATTCTGATGGATGCATT
TATCCGATCCGATGATGTGGAGAAAGCCTATCAGATGTTTCGTCTCATGAAGAAAATTGGTTCGGTCCCCGATCAGTATACCTACGGTGTCCTAATTCATGGTTTGTGTA
TGAAAGGTAATATGGTAGAGGCAGCAAAACTATACAAATCCATGGTAGAGATGCATTTGGAGCCTAATGATGTTATCTATAATACAATGATAAATGGGTACTGTAAAGAG
TGCAACTCTTACAAGGCCTTGAAGTTTCTTGAAGAGATGGTTAAAAAAGGAGAAACTCCAAGTCTGACTAGTTACAGTTCCACCATTGAAGTCCTCTGCAACGAAGGGAA
GTCGATCGAGGCAAAACATTTACTTAAAGAGATGATTGAAGCCGGGTTGAAGCCATCAGAATCTCTCTGTAGTAAAGTTGGTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTGCCCAAATCCATTGGCTTCCTGCACCCATTTCTTTCTAATCGGATCACCTCGTCTTTCTTCACTATTTCTTCCATTTTAACCTATTCAACACAACCAAATCTGAA
TTCTGATTCAGTCTCTGGCCATTCTCATGATGCAATTATCAATACTTCTCTTCAATCTCAACTATTAGAACAGTCCCTTCACAGTTTTAAACTAATGGTCCTTAAAGGGT
ATTCTCCCAGTTCATTCTCTTTCAATAATGTATTGGGTTTACTTGCCAAATCAGGCAATTTGGATAGAACTTGGTGGTTTTTCACTGAATTTCTGGGGAGGACTCACTTT
GATGTGTATAGTTTTGGGATTACCATTAAAGCCTTTTGCGAAAATGGCAATGTAAGTAAAGGTTTTGAGCTTTTGTCTCAAATGGAGAGGATGGGTTTGTCTCCTAATGT
TGTTATATACACTATCTTGATTGATGCTTGTTGCAAAAATGGTGACATTGAGCAGGCTAAAGTATTGTTTTCTAGGATGAATGATCTTGGTTTGGTTGCTAACCAATATA
CTTATACTGTCATGATCAATGGATTTTTCAAGAAAGGTTATAAGAAAGATGGTTTTGAGCTTTATGAGAAGATGAAGCTTGTTGGGGTGTTTCCCAGTTTATATACTTAC
AACAGTCTTATTAATGAATATTGTAGGGATGGAAAGTTGAGCATTGCATTTAAGTTGTTTGATGAAATGTCTACAAGAGGGGTGTCATGTAATGTAGTCACATACAATAT
TCTAATTGGTGGGTTATGTCGTAATAGACAAGTGTCGAAAGCTGAACGGCTATTAGAAGAAATGAAACAAGCTCATATAAATCCAACTATTAGAACATTTAACCTGTTGA
TGGATGGGTTGTGTAACACTGGAAAGTTGGACAAGGCCTTAAGTTATTTTGATAAGCTGAAGTTGATTGGTCAGTCTCCAACTTCAGTGACCTACAACATTTTAATTGCA
GGTTTCTCTAAAGCAGGAAATTCTTCTGTAGTTTCAGAGTTAGTGAGAGAGATGGAGGACAGAGGCATTTCTCCCTCTAAAGTGACATACACAATTCTGATGGATGCATT
TATCCGATCCGATGATGTGGAGAAAGCCTATCAGATGTTTCGTCTCATGAAGAAAATTGGTTCGGTCCCCGATCAGTATACCTACGGTGTCCTAATTCATGGTTTGTGTA
TGAAAGGTAATATGGTAGAGGCAGCAAAACTATACAAATCCATGGTAGAGATGCATTTGGAGCCTAATGATGTTATCTATAATACAATGATAAATGGGTACTGTAAAGAG
TGCAACTCTTACAAGGCCTTGAAGTTTCTTGAAGAGATGGTTAAAAAAGGAGAAACTCCAAGTCTGACTAGTTACAGTTCCACCATTGAAGTCCTCTGCAACGAAGGGAA
GTCGATCGAGGCAAAACATTTACTTAAAGAGATGATTGAAGCCGGGTTGAAGCCATCAGAATCTCTCTGTAGTAAAGTTGGTTAA
Protein sequenceShow/hide protein sequence
MVPKSIGFLHPFLSNRITSSFFTISSILTYSTQPNLNSDSVSGHSHDAIINTSLQSQLLEQSLHSFKLMVLKGYSPSSFSFNNVLGLLAKSGNLDRTWWFFTEFLGRTHF
DVYSFGITIKAFCENGNVSKGFELLSQMERMGLSPNVVIYTILIDACCKNGDIEQAKVLFSRMNDLGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSLYTY
NSLINEYCRDGKLSIAFKLFDEMSTRGVSCNVVTYNILIGGLCRNRQVSKAERLLEEMKQAHINPTIRTFNLLMDGLCNTGKLDKALSYFDKLKLIGQSPTSVTYNILIA
GFSKAGNSSVVSELVREMEDRGISPSKVTYTILMDAFIRSDDVEKAYQMFRLMKKIGSVPDQYTYGVLIHGLCMKGNMVEAAKLYKSMVEMHLEPNDVIYNTMINGYCKE
CNSYKALKFLEEMVKKGETPSLTSYSSTIEVLCNEGKSIEAKHLLKEMIEAGLKPSESLCSKVG