; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021558 (gene) of Snake gourd v1 genome

Gene IDTan0021558
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG02:23193600..23195986
RNA-Seq ExpressionTan0021558
SyntenyTan0021558
Gene Ontology termsGO:0032544 - plastid translation (biological process)
GO:0043489 - RNA stabilization (biological process)
GO:0009536 - plastid (cellular component)
GO:0003729 - mRNA binding (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571877.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]2.6e-23282.93Show/hide
Query:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF
        MVPKSI FV  FLSNRI SSF TISSILTYSTQPNLN       S+EA +SAS+QSQ  EQSL++FKL +L+G+CPSS SFNN+LGLLAKSG+LHKTW F
Subjt:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF

Query:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD
        FTEFLGRTHFD YSFGI IKAFC+NGNVSKGFELLAQMER+  SPN VIYTILIDACCK GDIE+AKVLFS+M+D+GLV NQYTYT MINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI
        GFEL+EKMKLVGV PS+YTYN+LINEYCRDGKLSIAFK+FDEMST GVS NVVTY ILIGGLCRKRQ+SKAE+L E+MKQ RINPTTRT+NLLMDGFCNI
Subjt:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI

Query:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH
        GKL+KAL YFDELKLIG +PTSV+YNILIAGFSKAGNSSVVSELVREME RG+SPSKVTYTI+MDAFVRSDDVEKASQ+FHLMKKVG VPDQHTYGVLMH
Subjt:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH

Query:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
        GLCMKGNMV+ASKLY SM+EM++EPNDVIYN MINGYCKECNSYKALKFL+EMVNKG+TP+L SY STIEVL N GKSTEAK LL+E+I  G
Subjt:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

XP_022953086.1 pentatricopeptide repeat-containing protein At4g11690 [Cucurbita moschata]2.0e-23282.93Show/hide
Query:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF
        MVPKSI FV  FLSNRI SSF TISSILTYSTQPNLN       S+EA +SAS+QSQ  EQSL++FKL +L+G+CPS+ SFNN+LGLLAKSG+LHKTW F
Subjt:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF

Query:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD
        FTEFLGRTHFD YSFGI IKAFC+NGNVSKGFELLAQMER+  SPN VIYTILIDACCK GDIE+AKVLFS+M+D+GLVANQYTYT MINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI
        GFEL+EKMKLVGV PS+YTYN+LINEYCRDGKLSIAFK+FDEMST GVS NVVTY ILIGGLCRKRQ++KAE+L E+MKQ RINPTTRT+NLLMDGFCNI
Subjt:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI

Query:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH
        GKL+KAL YFDELKLIG +PTSV+YNILIAGFSKAGNSSVVSELVREME RG+SPSKVTYTI+MDAFVRSDDVEKASQ+FHLMKKVG VPDQHTYGVLMH
Subjt:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH

Query:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
        GLCMKGNMVEASKLY SM+EM++EPNDVIYN MINGYCKECNSYKALKFL+EMVNKG+TP+L SY STIEVL N GKSTEAK LL+E+I  G
Subjt:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

XP_022971940.1 pentatricopeptide repeat-containing protein At4g11690 [Cucurbita maxima]3.8e-23683.94Show/hide
Query:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF
        MVPKSIGF+  FLSNRI SSFFTISSILTYSTQPNLN       S+EA  +AS+QSQL EQSLH+FKL +L+G+CPSS SFNN+LGLLAKSG+LHKTW F
Subjt:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF

Query:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD
        FTEFLGRT FD YSFGI IKAFC+NGNVSKGFELLAQMER+  SPN VIYTILIDACCK GDIE+AKVLFS+M+D+G VANQYTYT MINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI
        GFELYEKMKLVGV PS+YTYN+LINEYCRDGKLSIAFK+FDEMSTRGVS NVVTY ILIGGLCRKRQ+SKAE+L E+MKQ  INPTTRT+NLLMDGFCNI
Subjt:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI

Query:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH
        GKL+KAL YFD+LKLIG +PTSV+YNILIAGFSKAGNS+VVSELVREME RGISPSKVTYTI+MDAFVRSDDVEKASQ+FHLMKKVG VPDQHTYGVLMH
Subjt:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH

Query:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
        GLCMKGNMVEASKLYKSM+EM++EPNDVIYN MINGYCKECNSYKALKFL+EMVNKG+TP+LASY+STIEVLCN GKSTEAK LL+E+I  G
Subjt:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

XP_023511558.1 pentatricopeptide repeat-containing protein At4g11690 [Cucurbita pepo subsp. pepo]2.0e-23283.33Show/hide
Query:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF
        MVPKSIG V  FLSNR  SSFFTISSILTYSTQPNLN       S+EA +SAS+QSQ  EQSL++FKL +L+G CPSS SFNN+LGLLAKSG+LHKTW F
Subjt:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF

Query:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD
        FTEFLGRTHFD YSFGI IKAFCENGNVSKGFELLAQMER+  SPN VIYTILIDACCK GDIE+AKVLFS+M+D+GLVANQYTYT MINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI
        GFE +EKMKLVGV PS+YTYN+LINEYCRDGKLSIAFK+FDEMST GVS NVVTY ILIGGLCRKRQ+SKAE+L E+MKQ RINPTTRT+NLLMDGFCNI
Subjt:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI

Query:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH
        GKL+KAL YFDELKLIG +PTSV+YNILIAGFSKAGNS+VVSELVREME RGISPSKVTYTI+MDAFVRSDDVEKASQ+ HLMKKVG VPDQHTYGVLMH
Subjt:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH

Query:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
        GLCMKGNMVEASKLY SM+EM++EPNDVIYN MINGYCKECNSYKALKFL+EMVNKG+TP+LASY+STIEVL N GKSTEAK LL+E+I  G
Subjt:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

XP_038887006.1 pentatricopeptide repeat-containing protein At4g11690 [Benincasa hispida]3.2e-23584.15Show/hide
Query:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF
        MV KSIGFV  FL NRI SSFFT SSILTYSTQPNLN DS    S+ A  +AS+QSQ  EQSLHNFKL VLKGYCPSS+SFNN+LG LAKSGNLH+TW F
Subjt:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF

Query:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD
        F+EFL RT FDVYSFGI IKAFCENGN+SKGF+LLAQMERM  S N VIYTILIDACCK GDIE+AKVLFSRMDD+GLVAN YTYTVMINGFFKKGY+KD
Subjt:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI
        GFELYEKMKLVGV P++YTYNSLINEYCRDGKLS+AFKLFDEMSTRGVS NV+TYNILIGGLCRKRQVSKAE LLE+MKQA INPTTRTFNLL+DG CN 
Subjt:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI

Query:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH
        GKLDKALSYFD++KLIGQSPTSVTYNILIAGFSK GNSSVVSELVREME RGISPSKVTYTI+M AFVRSDDVEKA +MF LMKK+G VPDQHTYGVL+H
Subjt:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH

Query:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
        GLCMKGNMVEASKLYKSM+EMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMV KG+TP++ASY+ TI VLCN GKSTEAK LL+E+I  G
Subjt:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

TrEMBL top hitse value%identityAlignment
A0A1S3C0B2 pentatricopeptide repeat-containing protein At4g116901.6e-22480.28Show/hide
Query:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF
        MVPKSIGFV  FLSNRI SSFFTISS+LTYSTQ NLNS+SV    ++A  +AS+QS   EQSL +FKL VLKG+ PSS SFNN+L LLAKSGNL +TW F
Subjt:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF

Query:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD
        FTE+LGRT FDVYSFGI IKAFCENGNVSKGFELLAQME M  SPN  IYTILI+ACCK GDI++AKV+FSRMDD+GL A+QY YTVMINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI
        GFELYEKMKL+GV P++YTYNSLI EYCRDGKLS+AFKLFDE+S RGV+ N VTYNILIGGLCRK QV KAE LLE+MK+A INPTTRTFNLLMDG CN 
Subjt:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI

Query:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH
        GKLDKALSY D+LKLIGQSPT VTYNILI+GFSK GNSSVVSELVREME RGISPSKVTYTI+MDAFVRSDD+EKA +MFHLMK++GLVPDQHTYGVL+H
Subjt:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH

Query:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
        GLC++GNMVEASKLYKSM+EMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMV  G+TPN+ASY STI+VLC +GKS EAK LL+E+   G
Subjt:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

A0A5D3C6B4 Pentatricopeptide repeat-containing protein1.6e-22480.28Show/hide
Query:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF
        MVPKSIGFV  FLSNRI SSFFTISS+LTYSTQ NLNS+SV    ++A  +AS+QS   EQSL +FKL VLKG+ PSS SFNN+L LLAKSGNL +TW F
Subjt:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF

Query:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD
        FTE+LGRT FDVYSFGI IKAFCENGNVSKGFELLAQME M  SPN  IYTILI+ACCK GDI++AKV+FSRMDD+GL A+QY YTVMINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI
        GFELYEKMKL+GV P++YTYNSLI EYCRDGKLS+AFKLFDE+S RGV+ N VTYNILIGGLCRK QV KAE LLE+MK+A INPTTRTFNLLMDG CN 
Subjt:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI

Query:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH
        GKLDKALSY D+LKLIGQSPT VTYNILI+GFSK GNSSVVSELVREME RGISPSKVTYTI+MDAFVRSDD+EKA +MFHLMK++GLVPDQHTYGVL+H
Subjt:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH

Query:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
        GLC++GNMVEASKLYKSM+EMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMV  G+TPN+ASY STI+VLC +GKS EAK LL+E+   G
Subjt:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

A0A6J1C8X2 pentatricopeptide repeat-containing protein At4g116906.0e-22780.51Show/hide
Query:  MVPKSIGFVRRF----------------LSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNI
        MVPK  GFV  F                LSNRI S FFTISSILT+ST+ NLN+  +    +EA  SA VQSQL EQSL++FKL VLKG CPSSNSFNN+
Subjt:  MVPKSIGFVRRF----------------LSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNI

Query:  LGLLAKSGNLHKTWLFFTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYT
        LGLL KSG+L+K W FF EFLGRTHFDVYSFGIMIKAFCE GNVSKGFELLAQMERM  SPN VIYTILIDACCK GDIE+AKVLFS+M D+GLVANQYT
Subjt:  LGLLAKSGNLHKTWLFFTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYT

Query:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARIN
        YTVMING FKKG KKDGFELYEKM L+GVFPSVYTYNSLINEYCRDG L +AFKLFDEM TRGVS NVVTYNILIGGLCR RQV KAE LLE+MK A IN
Subjt:  YTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARIN

Query:  PTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMK
        P+T TFNLLMDGFCN+GK DKALSYFDELKLIG SPTSVTYNILIAGFSKAGNS+VV ELVREME RGISPSKVTYTI+MDAFVRSDDV KASQMFHLMK
Subjt:  PTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMK

Query:  KVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDL
        KVG VPDQ+TYGVLMHGLCMKGNMVEASKLYKSMIE HL+PNDVIYNTMINGYCKECNSYKALKFL+EMV KGMTP+ ASY+STIEVLC +GKSTEAK+L
Subjt:  KVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDL

Query:  LEEIIRPG
        L+E+I  G
Subjt:  LEEIIRPG

A0A6J1GM09 pentatricopeptide repeat-containing protein At4g116909.5e-23382.93Show/hide
Query:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF
        MVPKSI FV  FLSNRI SSF TISSILTYSTQPNLN       S+EA +SAS+QSQ  EQSL++FKL +L+G+CPS+ SFNN+LGLLAKSG+LHKTW F
Subjt:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF

Query:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD
        FTEFLGRTHFD YSFGI IKAFC+NGNVSKGFELLAQMER+  SPN VIYTILIDACCK GDIE+AKVLFS+M+D+GLVANQYTYT MINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI
        GFEL+EKMKLVGV PS+YTYN+LINEYCRDGKLSIAFK+FDEMST GVS NVVTY ILIGGLCRKRQ++KAE+L E+MKQ RINPTTRT+NLLMDGFCNI
Subjt:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI

Query:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH
        GKL+KAL YFDELKLIG +PTSV+YNILIAGFSKAGNSSVVSELVREME RG+SPSKVTYTI+MDAFVRSDDVEKASQ+FHLMKKVG VPDQHTYGVLMH
Subjt:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH

Query:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
        GLCMKGNMVEASKLY SM+EM++EPNDVIYN MINGYCKECNSYKALKFL+EMVNKG+TP+L SY STIEVL N GKSTEAK LL+E+I  G
Subjt:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

A0A6J1I748 pentatricopeptide repeat-containing protein At4g116901.8e-23683.94Show/hide
Query:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF
        MVPKSIGF+  FLSNRI SSFFTISSILTYSTQPNLN       S+EA  +AS+QSQL EQSLH+FKL +L+G+CPSS SFNN+LGLLAKSG+LHKTW F
Subjt:  MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLF

Query:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD
        FTEFLGRT FD YSFGI IKAFC+NGNVSKGFELLAQMER+  SPN VIYTILIDACCK GDIE+AKVLFS+M+D+G VANQYTYT MINGFFKKGYKKD
Subjt:  FTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKD

Query:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI
        GFELYEKMKLVGV PS+YTYN+LINEYCRDGKLSIAFK+FDEMSTRGVS NVVTY ILIGGLCRKRQ+SKAE+L E+MKQ  INPTTRT+NLLMDGFCNI
Subjt:  GFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNI

Query:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH
        GKL+KAL YFD+LKLIG +PTSV+YNILIAGFSKAGNS+VVSELVREME RGISPSKVTYTI+MDAFVRSDDVEKASQ+FHLMKKVG VPDQHTYGVLMH
Subjt:  GKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMH

Query:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
        GLCMKGNMVEASKLYKSM+EM++EPNDVIYN MINGYCKECNSYKALKFL+EMVNKG+TP+LASY+STIEVLCN GKSTEAK LL+E+I  G
Subjt:  GLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

SwissProt top hitse value%identityAlignment
Q0WVK7 Pentatricopeptide repeat-containing protein At1g05670, mitochondrial2.2e-6129.2Show/hide
Query:  EQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHF--DVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDAC
        +++ H   L  LKGY P   S++ ++    + G L K W    E + R     + Y +G +I   C    +++  E  ++M R    P+ V+YT LID  
Subjt:  EQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHF--DVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDAC

Query:  CKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNI
        CK+GDI  A   F  M    +  +  TYT +I+GF + G   +  +L+ +M   G+ P   T+  LIN YC+ G +  AF++ + M   G S NVVTY  
Subjt:  CKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNI

Query:  LIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSK
        LI GLC++  +  A +LL +M +  + P   T+N +++G C  G +++A+    E +  G +  +VTY  L+  + K+G      E+++EM  +G+ P+ 
Subjt:  LIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSK

Query:  VTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKG
        VT+ ++M+ F     +E   ++ + M   G+ P+  T+  L+   C++ N+  A+ +YK M    + P+   Y  ++ G+CK  N  +A    +EM  KG
Subjt:  VTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKG

Query:  MTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
         + ++++Y+  I+      K  EA+++ +++ R G
Subjt:  MTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

Q3EDF8 Pentatricopeptide repeat-containing protein At1g099001.3e-6131.44Show/hide
Query:  VLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVL
        ++ GYC  +   NN L +L +                    DV ++  ++++ C++G + +  E+L +M +    P+ + YTILI+A C+   +  A  L
Subjt:  VLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVL

Query:  FSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVS
           M D G   +  TY V++NG  K+G   +  +    M   G  P+V T+N ++   C  G+   A KL  +M  +G S +VVT+NILI  LCRK  + 
Subjt:  FSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVS

Query:  KAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVR
        +A  +LEKM Q    P + ++N L+ GFC   K+D+A+ Y + +   G  P  VTYN ++    K G      E++ ++  +G SP  +TY  ++D   +
Subjt:  KAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVR

Query:  SDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTI
        +    KA ++   M+   L PD  TY  L+ GL  +G + EA K +     M + PN V +N+++ G CK   + +A+ FL  M+N+G  PN  SY   I
Subjt:  SDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTI

Query:  EVLCNNGKSTEAKDLLEEIIRPG
        E L   G + EA +LL E+   G
Subjt:  EVLCNNGKSTEAKDLLEEIIRPG

Q9ASZ8 Pentatricopeptide repeat-containing protein At1g126201.1e-6032.7Show/hide
Query:  GYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHF--DVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLF
        G+ P+  + N ++  L  +G +    L     +  T F  +  ++G ++K  C++G  +   ELL +ME      +AV Y+I+ID  CK G ++ A  LF
Subjt:  GYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHF--DVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLF

Query:  SRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSK
        + M+  G  A+   YT +I GF   G   DG +L   M    + P V  +++LI+ + ++GKL  A +L  EM  RG+S + VTY  LI G C++ Q+ K
Subjt:  SRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSK

Query:  AEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRS
        A  +L+ M      P  RTFN+L++G+C    +D  L  F ++ L G    +VTYN LI GF + G   V  EL +EM  R + P  V+Y I++D    +
Subjt:  AEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRS

Query:  DDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIE
         + EKA ++F  ++K  +  D   Y +++HG+C    + +A  L+ S+    ++P+   YN MI G CK+ +  +A     +M   G +PN  +YN  I 
Subjt:  DDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIE

Query:  VLCNNGKSTEAKDLLEEIIRPG
             G +T++  L+EEI R G
Subjt:  VLCNNGKSTEAKDLLEEIIRPG

Q9FIX3 Pentatricopeptide repeat-containing protein At5g397104.2e-6032.59Show/hide
Query:  DSVYSLSNEANSSASVQSQLS--EQSLHNFKLAVLKGYCPSSNSFNNILGLLAKS-GNLHKTWLFFTEFL-GRTHFDVYSFGIMIKAFCENGNVSKGFEL
        D  YS S+  +      S+LS  +++L    LA   G+ P   S+N +L    +S  N+      F E L  +   +V+++ I+I+ FC  GN+     L
Subjt:  DSVYSLSNEANSSASVQSQLS--EQSLHNFKLAVLKGYCPSSNSFNNILGLLAKS-GNLHKTWLFFTEFL-GRTHFDVYSFGIMIKAFCENGNVSKGFEL

Query:  LAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLS
          +ME     PN V Y  LID  CK   I+    L   M   GL  N  +Y V+ING  ++G  K+   +  +M   G      TYN+LI  YC++G   
Subjt:  LAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLS

Query:  IAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSK
         A  +  EM   G++ +V+TY  LI  +C+   +++A + L++M+   + P  RT+  L+DGF   G +++A     E+   G SP+ VTYN LI G   
Subjt:  IAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSK

Query:  AGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMI
         G       ++ +M+ +G+SP  V+Y+ ++  F RS DV++A ++   M + G+ PD  TY  L+ G C +    EA  LY+ M+ + L P++  Y  +I
Subjt:  AGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMI

Query:  NGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLL
        N YC E +  KAL+   EMV KG+ P++ +Y+  I  L    ++ EAK LL
Subjt:  NGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLL

Q9T0D6 Pentatricopeptide repeat-containing protein At4g116904.8e-14153.54Show/hide
Query:  LSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHFDV
        +S +I S FFT SS+L Y T+    S + + L  E   ++ VQSQ    S+  F   V  G+ P SN FN +L  +  S + ++ W FF E   +   DV
Subjt:  LSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHFDV

Query:  YSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVG
        YSFGI+IK  CE G + K F+LL ++    FSPN VIYT LID CCKKG+IEKAK LF  M  +GLVAN+ TYTV+ING FK G KK GFE+YEKM+  G
Subjt:  YSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVG

Query:  VFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDE
        VFP++YTYN ++N+ C+DG+   AF++FDEM  RGVS N+VTYN LIGGLCR+ ++++A +++++MK   INP   T+N L+DGFC +GKL KALS   +
Subjt:  VFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDE

Query:  LKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEAS
        LK  G SP+ VTYNIL++GF + G++S  +++V+EME RGI PSKVTYTI++D F RSD++EKA Q+   M+++GLVPD HTY VL+HG C+KG M EAS
Subjt:  LKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEAS

Query:  KLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
        +L+KSM+E + EPN+VIYNTMI GYCKE +SY+ALK L+EM  K + PN+ASY   IEVLC   KS EA+ L+E++I  G
Subjt:  KLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

Arabidopsis top hitse value%identityAlignment
AT1G05670.1 Pentatricopeptide repeat (PPR-like) superfamily protein1.6e-6229.2Show/hide
Query:  EQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHF--DVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDAC
        +++ H   L  LKGY P   S++ ++    + G L K W    E + R     + Y +G +I   C    +++  E  ++M R    P+ V+YT LID  
Subjt:  EQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHF--DVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDAC

Query:  CKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNI
        CK+GDI  A   F  M    +  +  TYT +I+GF + G   +  +L+ +M   G+ P   T+  LIN YC+ G +  AF++ + M   G S NVVTY  
Subjt:  CKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNI

Query:  LIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSK
        LI GLC++  +  A +LL +M +  + P   T+N +++G C  G +++A+    E +  G +  +VTY  L+  + K+G      E+++EM  +G+ P+ 
Subjt:  LIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSK

Query:  VTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKG
        VT+ ++M+ F     +E   ++ + M   G+ P+  T+  L+   C++ N+  A+ +YK M    + P+   Y  ++ G+CK  N  +A    +EM  KG
Subjt:  VTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKG

Query:  MTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
         + ++++Y+  I+      K  EA+++ +++ R G
Subjt:  MTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

AT1G05670.2 Pentatricopeptide repeat (PPR-like) superfamily protein1.6e-6229.2Show/hide
Query:  EQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHF--DVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDAC
        +++ H   L  LKGY P   S++ ++    + G L K W    E + R     + Y +G +I   C    +++  E  ++M R    P+ V+YT LID  
Subjt:  EQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHF--DVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDAC

Query:  CKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNI
        CK+GDI  A   F  M    +  +  TYT +I+GF + G   +  +L+ +M   G+ P   T+  LIN YC+ G +  AF++ + M   G S NVVTY  
Subjt:  CKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNI

Query:  LIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSK
        LI GLC++  +  A +LL +M +  + P   T+N +++G C  G +++A+    E +  G +  +VTY  L+  + K+G      E+++EM  +G+ P+ 
Subjt:  LIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSK

Query:  VTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKG
        VT+ ++M+ F     +E   ++ + M   G+ P+  T+  L+   C++ N+  A+ +YK M    + P+   Y  ++ G+CK  N  +A    +EM  KG
Subjt:  VTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKG

Query:  MTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
         + ++++Y+  I+      K  EA+++ +++ R G
Subjt:  MTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG

AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein9.2e-6331.44Show/hide
Query:  VLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVL
        ++ GYC  +   NN L +L +                    DV ++  ++++ C++G + +  E+L +M +    P+ + YTILI+A C+   +  A  L
Subjt:  VLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHFDVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVL

Query:  FSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVS
           M D G   +  TY V++NG  K+G   +  +    M   G  P+V T+N ++   C  G+   A KL  +M  +G S +VVT+NILI  LCRK  + 
Subjt:  FSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVS

Query:  KAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVR
        +A  +LEKM Q    P + ++N L+ GFC   K+D+A+ Y + +   G  P  VTYN ++    K G      E++ ++  +G SP  +TY  ++D   +
Subjt:  KAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVR

Query:  SDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTI
        +    KA ++   M+   L PD  TY  L+ GL  +G + EA K +     M + PN V +N+++ G CK   + +A+ FL  M+N+G  PN  SY   I
Subjt:  SDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTI

Query:  EVLCNNGKSTEAKDLLEEIIRPG
        E L   G + EA +LL E+   G
Subjt:  EVLCNNGKSTEAKDLLEEIIRPG

AT1G12620.1 Pentatricopeptide repeat (PPR) superfamily protein7.8e-6232.7Show/hide
Query:  GYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHF--DVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLF
        G+ P+  + N ++  L  +G +    L     +  T F  +  ++G ++K  C++G  +   ELL +ME      +AV Y+I+ID  CK G ++ A  LF
Subjt:  GYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHF--DVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLF

Query:  SRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSK
        + M+  G  A+   YT +I GF   G   DG +L   M    + P V  +++LI+ + ++GKL  A +L  EM  RG+S + VTY  LI G C++ Q+ K
Subjt:  SRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSK

Query:  AEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRS
        A  +L+ M      P  RTFN+L++G+C    +D  L  F ++ L G    +VTYN LI GF + G   V  EL +EM  R + P  V+Y I++D    +
Subjt:  AEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRS

Query:  DDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIE
         + EKA ++F  ++K  +  D   Y +++HG+C    + +A  L+ S+    ++P+   YN MI G CK+ +  +A     +M   G +PN  +YN  I 
Subjt:  DDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIE

Query:  VLCNNGKSTEAKDLLEEIIRPG
             G +T++  L+EEI R G
Subjt:  VLCNNGKSTEAKDLLEEIIRPG

AT4G11690.1 Pentatricopeptide repeat (PPR-like) superfamily protein3.4e-14253.54Show/hide
Query:  LSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHFDV
        +S +I S FFT SS+L Y T+    S + + L  E   ++ VQSQ    S+  F   V  G+ P SN FN +L  +  S + ++ W FF E   +   DV
Subjt:  LSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHFDV

Query:  YSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVG
        YSFGI+IK  CE G + K F+LL ++    FSPN VIYT LID CCKKG+IEKAK LF  M  +GLVAN+ TYTV+ING FK G KK GFE+YEKM+  G
Subjt:  YSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVG

Query:  VFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDE
        VFP++YTYN ++N+ C+DG+   AF++FDEM  RGVS N+VTYN LIGGLCR+ ++++A +++++MK   INP   T+N L+DGFC +GKL KALS   +
Subjt:  VFPSVYTYNSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDE

Query:  LKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEAS
        LK  G SP+ VTYNIL++GF + G++S  +++V+EME RGI PSKVTYTI++D F RSD++EKA Q+   M+++GLVPD HTY VL+HG C+KG M EAS
Subjt:  LKLIGQSPTSVTYNILIAGFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEAS

Query:  KLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG
        +L+KSM+E + EPN+VIYNTMI GYCKE +SY+ALK L+EM  K + PN+ASY   IEVLC   KS EA+ L+E++I  G
Subjt:  KLYKSMIEMHLEPNDVIYNTMINGYCKECNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCCCAAATCCATTGGCTTCGTGCGCCGATTTCTTTCTAATCGGATCGCCTCGTCTTTCTTCACCATTTCTTCCATTTTAACCTATTCAACACAACCAAATCTGAA
TTCTGATTCAGTCTATAGCCTTTCTAATGAAGCAAATAGCAGTGCTTCAGTTCAATCTCAACTATCAGAACAATCCCTTCACAATTTTAAACTAGCGGTTCTTAAAGGGT
ATTGTCCGAGTTCAAACTCTTTCAATAATATTTTGGGTTTACTTGCTAAATCAGGCAATTTGCATAAAACTTGGTTGTTTTTCACTGAATTTTTGGGGAGGACTCACTTT
GATGTGTATAGTTTTGGGATCATGATTAAAGCCTTTTGTGAAAATGGCAATGTAAGTAAAGGTTTTGAGCTTTTGGCTCAAATGGAAAGGATGAGCTTCTCTCCTAATGC
TGTTATATACACTATTTTGATTGATGCCTGTTGCAAAAAAGGTGACATTGAGAAGGCTAAAGTATTGTTTTCTAGGATGGATGACATTGGTTTGGTTGCTAACCAATATA
CTTATACTGTGATGATCAATGGTTTTTTTAAGAAAGGATATAAGAAGGATGGTTTTGAGCTTTATGAGAAGATGAAGCTTGTTGGGGTGTTTCCTAGTGTATATACTTAC
AACAGTCTTATTAATGAGTATTGTAGAGATGGGAAGTTGAGTATTGCATTTAAGTTGTTCGATGAAATGTCTACAAGAGGGGTGTCGTCTAACGTAGTCACGTACAATAT
CCTAATTGGTGGGCTATGCCGTAAGAGACAAGTGTCGAAAGCTGAACAGTTGTTAGAAAAAATGAAACAAGCTCGTATAAACCCAACTACTAGAACATTTAACCTGTTGA
TGGATGGGTTCTGTAACATTGGAAAGTTGGACAAGGCCTTGAGTTATTTTGATGAGCTGAAGTTGATTGGCCAGTCTCCAACTTCAGTGACCTACAACATTTTAATTGCA
GGTTTTTCTAAAGCAGGAAATTCTTCTGTAGTTTCTGAGTTAGTGAGAGAGATGGAGGTCAGAGGCATTTCTCCCTCTAAGGTAACATATACAATTATGATGGATGCCTT
TGTTCGTTCTGATGATGTCGAAAAAGCCTCTCAAATGTTTCATCTCATGAAGAAAGTCGGTTTGGTCCCAGATCAGCATACCTATGGTGTCCTAATGCATGGTTTGTGTA
TGAAAGGCAATATGGTAGAGGCATCAAAACTGTACAAATCAATGATAGAGATGCATCTGGAGCCTAATGATGTCATCTATAATACAATGATAAATGGGTATTGCAAAGAG
TGCAACTCCTACAAGGCCTTGAAGTTTCTTGAAGAAATGGTTAACAAAGGAATGACTCCAAATCTGGCTAGTTACAATTCTACCATTGAAGTCCTCTGCAACAACGGGAA
GTCAACCGAGGCGAAAGATTTACTTGAAGAGATTATTAGGCCGGGTTGA
mRNA sequenceShow/hide mRNA sequence
GTCTACTAGGGCACCATTTCAATTTCATCCAATTTTAGCTGCAAATGAATGTGCTTTAGGGCTTCTGAGGAGACGTTGGGCTGAGATCAACAGCAGAGCTGGATGGATGG
AATTGGATTTACAAAAACATGTAATTCTGAATCTGAAGTAATCTCCTCGTTGAAATGGCGAGATTTTTCACCCGATTGAAGTTAGATATCAACTTGTTCTTCCATTCTTG
ATTGTAATCGGCCACGACGCTCTCAAGTTTCTCCGTCGGTCTCAGAATCATCACTTCCAATCGCCATTGTTGTTGGAGAGTCTCTGATTCCTCTTGTTGATAATCTTAAC
ATCATTGTTCCGCTGATCTGACGGTAAATTCTGAATGTCAATTCCTTAGTGTCTGTTTTTCTTCGTAAGTTGTTGTGCATGATGTAGTCTCTTCTTTATGGTTCCCAAAT
CCATTGGCTTCGTGCGCCGATTTCTTTCTAATCGGATCGCCTCGTCTTTCTTCACCATTTCTTCCATTTTAACCTATTCAACACAACCAAATCTGAATTCTGATTCAGTC
TATAGCCTTTCTAATGAAGCAAATAGCAGTGCTTCAGTTCAATCTCAACTATCAGAACAATCCCTTCACAATTTTAAACTAGCGGTTCTTAAAGGGTATTGTCCGAGTTC
AAACTCTTTCAATAATATTTTGGGTTTACTTGCTAAATCAGGCAATTTGCATAAAACTTGGTTGTTTTTCACTGAATTTTTGGGGAGGACTCACTTTGATGTGTATAGTT
TTGGGATCATGATTAAAGCCTTTTGTGAAAATGGCAATGTAAGTAAAGGTTTTGAGCTTTTGGCTCAAATGGAAAGGATGAGCTTCTCTCCTAATGCTGTTATATACACT
ATTTTGATTGATGCCTGTTGCAAAAAAGGTGACATTGAGAAGGCTAAAGTATTGTTTTCTAGGATGGATGACATTGGTTTGGTTGCTAACCAATATACTTATACTGTGAT
GATCAATGGTTTTTTTAAGAAAGGATATAAGAAGGATGGTTTTGAGCTTTATGAGAAGATGAAGCTTGTTGGGGTGTTTCCTAGTGTATATACTTACAACAGTCTTATTA
ATGAGTATTGTAGAGATGGGAAGTTGAGTATTGCATTTAAGTTGTTCGATGAAATGTCTACAAGAGGGGTGTCGTCTAACGTAGTCACGTACAATATCCTAATTGGTGGG
CTATGCCGTAAGAGACAAGTGTCGAAAGCTGAACAGTTGTTAGAAAAAATGAAACAAGCTCGTATAAACCCAACTACTAGAACATTTAACCTGTTGATGGATGGGTTCTG
TAACATTGGAAAGTTGGACAAGGCCTTGAGTTATTTTGATGAGCTGAAGTTGATTGGCCAGTCTCCAACTTCAGTGACCTACAACATTTTAATTGCAGGTTTTTCTAAAG
CAGGAAATTCTTCTGTAGTTTCTGAGTTAGTGAGAGAGATGGAGGTCAGAGGCATTTCTCCCTCTAAGGTAACATATACAATTATGATGGATGCCTTTGTTCGTTCTGAT
GATGTCGAAAAAGCCTCTCAAATGTTTCATCTCATGAAGAAAGTCGGTTTGGTCCCAGATCAGCATACCTATGGTGTCCTAATGCATGGTTTGTGTATGAAAGGCAATAT
GGTAGAGGCATCAAAACTGTACAAATCAATGATAGAGATGCATCTGGAGCCTAATGATGTCATCTATAATACAATGATAAATGGGTATTGCAAAGAGTGCAACTCCTACA
AGGCCTTGAAGTTTCTTGAAGAAATGGTTAACAAAGGAATGACTCCAAATCTGGCTAGTTACAATTCTACCATTGAAGTCCTCTGCAACAACGGGAAGTCAACCGAGGCG
AAAGATTTACTTGAAGAGATTATTAGGCCGGGTTGAACCCGTCAGAATCTCTCTACAGTAGGGTTGGTTAAGCCAAATCTTGTGCATAAAAATATGGGCACGTTGAGTGA
GGCAAAATGGTTGTGTAAGATTGCATTGCAATTATAAGATATCTGTGGCTGTCTTGAATTGGAAGCTAAATTTTATCATTGGACAAGGGGATGTACATTGGCTGCCAGTT
TTGAAGCTTTAGATTTTTTAGGATCAATTGTAGGTTTCATATGTGGATCTCGTTTTCCTTGTTTGGTATATTGGCGATTAGTGGTTATTAGAATGTCGTTATAAATGGGT
TAAATACATAAATAGCCTACTTTGCACACCATTTTGCAAAAATGGCACATCATTGCAATTTTTATCAGCTTTAGCCATTTGAAGCCATCTGGAAGTTAATTGTAGGTAGT
ATATTTCTAATTTACCCTTTCATTTATTATGTAAACAATAAAAAAACATTTTTTTTTTTTTTTAATGGTAACTACCA
Protein sequenceShow/hide protein sequence
MVPKSIGFVRRFLSNRIASSFFTISSILTYSTQPNLNSDSVYSLSNEANSSASVQSQLSEQSLHNFKLAVLKGYCPSSNSFNNILGLLAKSGNLHKTWLFFTEFLGRTHF
DVYSFGIMIKAFCENGNVSKGFELLAQMERMSFSPNAVIYTILIDACCKKGDIEKAKVLFSRMDDIGLVANQYTYTVMINGFFKKGYKKDGFELYEKMKLVGVFPSVYTY
NSLINEYCRDGKLSIAFKLFDEMSTRGVSSNVVTYNILIGGLCRKRQVSKAEQLLEKMKQARINPTTRTFNLLMDGFCNIGKLDKALSYFDELKLIGQSPTSVTYNILIA
GFSKAGNSSVVSELVREMEVRGISPSKVTYTIMMDAFVRSDDVEKASQMFHLMKKVGLVPDQHTYGVLMHGLCMKGNMVEASKLYKSMIEMHLEPNDVIYNTMINGYCKE
CNSYKALKFLEEMVNKGMTPNLASYNSTIEVLCNNGKSTEAKDLLEEIIRPG