; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036394 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036394
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationscaffold5:41388112..41391148
RNA-Seq ExpressionSpg036394
SyntenySpg036394
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8652726.1 hypothetical protein Csa_014106 [Cucumis sativus]2.4e-25686.03Show/hide
Query:  MDLKVLKKIHMLQVQSEVRKFDALRKRGTMGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAET
        MD K+L KIH+LQVQSEVR+ D+L KR TMGSKAMFKWA+TVTP+HV+QLI+AERDI KAL+IFDSATAEY+NGFKHDL+TF LMISKL+SANQFRLAET
Subjt:  MDLKVLKKIHMLQVQSEVRKFDALRKRGTMGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAET

Query:  LLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGT
        LLDRMKEEK DVTEDI LSICRAYGRIHKPLDSIR+FHKMQDFHCKPTEKSYI+V AILVEENQLK AFRFYR MRKMGIPPTV SLNVLIKAFCKN+GT
Subjt:  LLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGT

Query:  MDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGF
        MDKAMH+FR MSNHGC+PDSYTYGTLINGLCR  +IVEAKELLQEMETK CSPSVVTYTS+IHGLCQLNNVDEAM LLEDM  K IEPNVFTYS+LMDGF
Subjt:  MDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGF

Query:  CKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSL
        CK GHS RARD+LELM+QKRLRPNMISYSTL+NGLC EGK+NEALEI DRMKLQG  PDAGLYGKIVN LCD+SRF+EAANFLDEMV+CGI PNR+TWSL
Subjt:  CKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSL

Query:  HVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLM
        HVRTHNRVIHGLCTIN+SNRAFQL+LSV TRGISITVDTF+SLLKCFC K+DL KTSRILDEMVINGCIP+ EMWS MVNCFCD+RKACDAMKL+QLQLM
Subjt:  HVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLM

Query:  N
        +
Subjt:  N

KAG7019600.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]1.8e-25691.1Show/hide
Query:  MGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHK
        MGSKAMFKWA+TVTP+HVEQL++AERDINKALLIFDSATAEY+NGFKHDL+TFRLMI KLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGR+H+
Subjt:  MGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHK

Query:  PLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLING
        PLDSIR+FHKMQDFHCKPTEKSYI+VFAILVEENQLKLAFRFYRYMRK+GIPPTVASLNVLIKA CKN+GTMDKAM++FREMSN GC+PDSYTYGTLING
Subjt:  PLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLING

Query:  LCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS
        LCR GNIVEAKELLQEME K CSPSV+TYTS+IHGLCQLNNVDEAM LLEDMM KGIEPNVFTYS+LMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS
Subjt:  LCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS

Query:  TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQ
        TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVN LCD+ RF+EAANFLDEMV+CGITPNRVTWSLHVRTHNRVIHGLCT+NDSNRAFQL+LSV 
Subjt:  TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQ

Query:  TRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN
        TRGIS+TVDTFDSLLKCFC KRDL K SRILDEMVINGCIPEREMWS +VNCFCDQRKACDAMKL+QL+LMN
Subjt:  TRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN

XP_022139830.1 pentatricopeptide repeat-containing protein At5g46100 isoform X1 [Momordica charantia]8.4e-26290.63Show/hide
Query:  MLQVQSEVRKFDALRKRGTMGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKF
        MLQVQSEV KFDALRKRGTMGSKAMFKWA+TVTPSHVEQLI+AERDINKALLIFDSAT+EY+NGFKHDL+TFRLMISKLVSANQFR AETLLDRM EEKF
Subjt:  MLQVQSEVRKFDALRKRGTMGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKF

Query:  DVTEDIFLSICRAYGRIHKPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFRE
        DVTEDIFL+ICRAYGR+HKPLDSIRIFHKM+DF CKPTEKSYITVFAILVEENQLKLA RFYRYMRKMG PPTVASLNVLIKAFCKN+GTMDKAMHI RE
Subjt:  DVTEDIFLSICRAYGRIHKPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFRE

Query:  MSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRAR
        MSNHGC+PDSYTYGTLINGLC+ G IVEAKELLQEMETK CSPSVVTYTSLIHGLCQLNNVDEA+ LLEDMMGKGIEPNVFTYS+LMDGFCKAGHS RAR
Subjt:  MSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRAR

Query:  DLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIH
        DLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGL PDAGLYGKIVNGLCD SRF+EAANFLDEMV+ GITPNRVTWSLHVRTHNRVI 
Subjt:  DLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIH

Query:  GLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN
        GLCTINDS+RAFQL+LSVQTRGISITVDTFD LLKCFC KRDL KT RILDEMVINGCIP+RE+WS +VNCFCDQRK CDA+KL+QL+LM+
Subjt:  GLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN

XP_023519166.1 pentatricopeptide repeat-containing protein At5g46100 [Cucurbita pepo subsp. pepo]5.3e-25691.31Show/hide
Query:  MGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHK
        MGSKAMFKWA+TVTP+HVEQLI+AERDINKALLIFDSATAEY+NGFKHDL+TFRLMI KLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIH+
Subjt:  MGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHK

Query:  PLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLING
        PLDSIR+FHKMQDFHCKPTEKSYI+VFAILVEENQL LAFRFYRYMRK+GIPPTVASLNVLIKA CKN+GTMDKAM++FREMSN GC+PDSYTYGTLING
Subjt:  PLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLING

Query:  LCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS
        LCR GNIVEAKELLQEME K CSPSV+TYTS+IHGLCQLNNVDEAM LLEDMM KGIEPNVFTYS+LMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS
Subjt:  LCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS

Query:  TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQ
        TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVN LCD+ RF+EAANFLDEMV+CGITPNRVTWSLHVRTHNRVIHGLCT+NDSNRAFQL+LSV 
Subjt:  TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQ

Query:  TRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN
        TRGIS+TVDTFDSLLKCFC KRDL K SRILDEMVINGCIPEREMWS +VNCFCDQRKACDAMKL+QL+LMN
Subjt:  TRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN

XP_031736238.1 pentatricopeptide repeat-containing protein At5g46100 isoform X1 [Cucumis sativus]6.5e-26284.15Show/hide
Query:  LMYLLASFNELQTIEQ--CTTIIFLAWETMDLKVLKKIHMLQVQSEVRKFDALRKRGTMGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEY
        LMYLLAS  ELQT ++  CT II +    MD K+L KIH+LQVQSEVR+ D+L KR TMGSKAMFKWA+TVTP+HV+QLI+AERDI KAL+IFDSATAEY
Subjt:  LMYLLASFNELQTIEQ--CTTIIFLAWETMDLKVLKKIHMLQVQSEVRKFDALRKRGTMGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEY

Query:  SNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRF
        +NGFKHDL+TF LMISKL+SANQFRLAETLLDRMKEEK DVTEDI LSICRAYGRIHKPLDSIR+FHKMQDFHCKPTEKSYI+V AILVEENQLK AFRF
Subjt:  SNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRF

Query:  YRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNV
        YR MRKMGIPPTV SLNVLIKAFCKN+GTMDKAMH+FR MSNHGC+PDSYTYGTLINGLCR  +IVEAKELLQEMETK CSPSVVTYTS+IHGLCQLNNV
Subjt:  YRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNV

Query:  DEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLC
        DEAM LLEDM  K IEPNVFTYS+LMDGFCK GHS RARD+LELM+QKRLRPNMISYSTL+NGLC EGK+NEALEI DRMKLQG  PDAGLYGKIVN LC
Subjt:  DEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLC

Query:  DISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPE
        D+SRF+EAANFLDEMV+CGI PNR+TWSLHVRTHNRVIHGLCTIN+SNRAFQL+LSV TRGISITVDTF+SLLKCFC K+DL KTSRILDEMVINGCIP+
Subjt:  DISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPE

Query:  REMWSIMVNCFCDQRKACDAMKLVQLQLMN
         EMWS MVNCFCD+RKACDAMKL+QLQLM+
Subjt:  REMWSIMVNCFCDQRKACDAMKLVQLQLMN

TrEMBL top hitse value%identityAlignment
A0A5D3DS89 Pentatricopeptide repeat-containing protein5.9e-25386.76Show/hide
Query:  MLQVQSEVRKFDALRKRGTMGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKF
        M QVQSEVR+ D+L KRGTMGSKAMFKWA+TVTP+HV+QLI+AERDI KAL+IFDSATAEY+NGFKHD++TF LMISKL+SANQFRLAE LLDRMKEEK 
Subjt:  MLQVQSEVRKFDALRKRGTMGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKF

Query:  DVTEDIFLSICRAYGRIHKPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFRE
        DVTEDI LSICRAYGRIHKPLDSIR+FHKM DFHCKPTEKSYI+V AILVEENQLKLAFRFYR MRKMGIPPTV SLNVLIKAFCKN+GTMDKAMH+FR 
Subjt:  DVTEDIFLSICRAYGRIHKPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFRE

Query:  MSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRAR
        MSNHG +PDSYTYGTLINGLCR GNIVEAKELLQEMETK CSPSV+TYTS+IHGLCQLNNVDEA+ LLEDM  K IEPNVFTYS+LMDGFCKAGHS RAR
Subjt:  MSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRAR

Query:  DLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIH
        D+L LMVQKRLRPNMISYSTL+NGLC EGK+NEALEI DRMKLQGL PDAGLYGKIVN LCD+SRF+EAANFLDEMV+CGI PNR+TWSLHVRTHNRVIH
Subjt:  DLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIH

Query:  GLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN
        GLCTINDSNRAFQL+LSV TRGISITVDTF+SLLKCFC KRDL KTSRILDEMVINGCIP+ EMWS MVNCFCD+RKACDAMKL+QLQLM+
Subjt:  GLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN

A0A6J1CDW1 pentatricopeptide repeat-containing protein At5g46100 isoform X14.1e-26290.63Show/hide
Query:  MLQVQSEVRKFDALRKRGTMGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKF
        MLQVQSEV KFDALRKRGTMGSKAMFKWA+TVTPSHVEQLI+AERDINKALLIFDSAT+EY+NGFKHDL+TFRLMISKLVSANQFR AETLLDRM EEKF
Subjt:  MLQVQSEVRKFDALRKRGTMGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKF

Query:  DVTEDIFLSICRAYGRIHKPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFRE
        DVTEDIFL+ICRAYGR+HKPLDSIRIFHKM+DF CKPTEKSYITVFAILVEENQLKLA RFYRYMRKMG PPTVASLNVLIKAFCKN+GTMDKAMHI RE
Subjt:  DVTEDIFLSICRAYGRIHKPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFRE

Query:  MSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRAR
        MSNHGC+PDSYTYGTLINGLC+ G IVEAKELLQEMETK CSPSVVTYTSLIHGLCQLNNVDEA+ LLEDMMGKGIEPNVFTYS+LMDGFCKAGHS RAR
Subjt:  MSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRAR

Query:  DLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIH
        DLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGL PDAGLYGKIVNGLCD SRF+EAANFLDEMV+ GITPNRVTWSLHVRTHNRVI 
Subjt:  DLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIH

Query:  GLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN
        GLCTINDS+RAFQL+LSVQTRGISITVDTFD LLKCFC KRDL KT RILDEMVINGCIP+RE+WS +VNCFCDQRK CDA+KL+QL+LM+
Subjt:  GLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN

A0A6J1CF14 pentatricopeptide repeat-containing protein At5g46100 isoform X21.3e-25290.47Show/hide
Query:  MGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHK
        MGSKAMFKWA+TVTPSHVEQLI+AERDINKALLIFDSAT+EY+NGFKHDL+TFRLMISKLVSANQFR AETLLDRM EEKFDVTEDIFL+ICRAYGR+HK
Subjt:  MGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHK

Query:  PLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLING
        PLDSIRIFHKM+DF CKPTEKSYITVFAILVEENQLKLA RFYRYMRKMG PPTVASLNVLIKAFCKN+GTMDKAMHI REMSNHGC+PDSYTYGTLING
Subjt:  PLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLING

Query:  LCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS
        LC+ G IVEAKELLQEMETK CSPSVVTYTSLIHGLCQLNNVDEA+ LLEDMMGKGIEPNVFTYS+LMDGFCKAGHS RARDLLELMVQKRLRPNMISYS
Subjt:  LCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS

Query:  TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQ
        TLINGLCKEGKLNEALEILDRMKLQGL PDAGLYGKIVNGLCD SRF+EAANFLDEMV+ GITPNRVTWSLHVRTHNRVI GLCTINDS+RAFQL+LSVQ
Subjt:  TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQ

Query:  TRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN
        TRGISITVDTFD LLKCFC KRDL KT RILDEMVINGCIP+RE+WS +VNCFCDQRK CDA+KL+QL+LM+
Subjt:  TRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN

A0A6J1EKD3 pentatricopeptide repeat-containing protein At5g461001.7e-25590.89Show/hide
Query:  MGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHK
        MGSKAMFKWA+TVTP+HVEQL++AERDINKALLIFDSATAEY+NGFKHDL+TFRLMI KLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGR+H+
Subjt:  MGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHK

Query:  PLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLING
        PLDSIR+FHKMQDFHCKPTEKSYI+VFAILVEENQLKLAFRFYRYMRK+GIPPTVASLNVLIKA CKN+GTMDKAM++FREMSN GC+PDSYTYGTLING
Subjt:  PLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLING

Query:  LCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS
        LCR GNIVEAKELLQEME K CSPSV+TYTS+IHGLCQLNNVDEAM LLEDMM KGIEPNVFTYS+LMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS
Subjt:  LCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS

Query:  TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQ
        TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVN LCD+ RF+EAANFLDEMV+CGITPNRVTWSLHVRTHNRVIHGLCT+NDSNRAFQL+LSV 
Subjt:  TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQ

Query:  TRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN
        TRGIS+TVDTFDSLLKCFC KRDL K SRILDEMVINGCIPEREMWS +VN FCDQRKACDAMKL+QL+LMN
Subjt:  TRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN

A0A6J1KLZ3 pentatricopeptide repeat-containing protein At5g461003.3e-25691.31Show/hide
Query:  MGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHK
        MGSKAMFKWA+TVTP+HVEQLI+AERDINKALLIFDSATAEY+NGFKHDL+TFRLMI KLVSANQFRLAETLLDRMKEEK DVTEDIFLSICRAYGRIH+
Subjt:  MGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHK

Query:  PLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLING
        PLDSIR+FHKMQDFHCKPTEKSYI+VFAILVEENQLKLAFRFYRYMRK+GIPPTVASLNVLIKA CKN+GTMDKAM++FREMSN GC+PDSYTYGTLING
Subjt:  PLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLING

Query:  LCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS
        LCR GNIVEAKELLQEME K CSPSVVTYTS+IHGLCQLNNVDEAM LLEDMM KGIEPNVFTYS+LMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS
Subjt:  LCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYS

Query:  TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQ
        TLINGLCKEGK+NEALEILDRMKLQGLTPDAGLYGKIVN LCD+ RF+EAANFLDEMV+CGITPNRVTWSLHVRTHNRVIHGLCT+NDSNRAFQL+LSV 
Subjt:  TLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQ

Query:  TRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN
        TRGIS+TVDTFDSLLKCFC KRDL K SRILDEMVINGCIPEREMWS +VNCFCDQRKACDAMKL+QL+LMN
Subjt:  TRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN

SwissProt top hitse value%identityAlignment
O49436 Pentatricopeptide repeat-containing protein At4g200909.3e-6232.79Show/hide
Query:  TAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDSIRIFHKMQD-FHCKPTEKSYITVFAILVEENQLK
        +A     FK    T   MI    ++  F   E LL R++ E   + E  F+ + RAYG+ H P  ++ +FH+M D F CK + KS+ +V  +++ E    
Subjt:  TAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDSIRIFHKMQD-FHCKPTEKSYITVFAILVEENQLK

Query:  LAFRFYRYM----RKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLI
            FY Y+      M I P   S N++IKA CK    +D+A+ +FR M    C PD YTY TL++GLC+   I EA  LL EM+++ CSPS V Y  LI
Subjt:  LAFRFYRYM----RKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLI

Query:  HGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGL
         GLC+  ++     L+++M  KG  PN  TY+TL+ G C  G   +A  LLE MV  +  PN ++Y TLINGL K+ +  +A+ +L  M+ +G   +  +
Subjt:  HGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGL

Query:  YGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDE
        Y  +++GL    +  EA +   +M   G  PN V +S+       ++ GLC     N A ++   +   G      T+ SL+K F       +  ++  E
Subjt:  YGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDE

Query:  MVINGCIPEREMWSIMVNCFCDQRKACDAM
        M   GC   +  +S++++  C   +  +AM
Subjt:  MVINGCIPEREMWSIMVNCFCDQRKACDAM

Q9CA58 Putative pentatricopeptide repeat-containing protein At1g745801.4e-6230.42Show/hide
Query:  PSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFD-VTEDIFLSICRAYGRIHKPLDSIRIFHKMQD
        P HV  +I+ ++D  KAL +F+S   E   GFKH L T+R +I KL    +F   E +L  M+E   + + E +++   + YGR  K  +++ +F +M  
Subjt:  PSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFD-VTEDIFLSICRAYGRIHKPLDSIRIFHKMQD

Query:  FHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAKEL
        + C+PT  SY  + ++LV+      A + Y  MR  GI P V S  + +K+FCK T     A+ +   MS+ GC+ +   Y T++ G        E  EL
Subjt:  FHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAKEL

Query:  LQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGKLN
          +M     S  + T+  L+  LC+  +V E   LL+ ++ +G+ PN+FTY+  + G C+ G    A  ++  ++++  +P++I+Y+ LI GLCK  K  
Subjt:  LQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGKLN

Query:  EALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDS
        EA   L +M  +GL PD+  Y  ++ G C     + A   + + V  G  P++        T+  +I GLC   ++NRA  LF     +GI   V  +++
Subjt:  EALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDS

Query:  LLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQL
        L+K    +  + + +++ +EM   G IPE + ++I+VN  C      DA  LV++ +
Subjt:  LLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQL

Q9FMF6 Pentatricopeptide repeat-containing protein At5g64320, mitochondrial8.4e-6329.16Show/hide
Query:  VTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDSIRIFHKMQ
        +TP  + +L+    +++ ++ +F    ++  NG++H  D ++++I KL +  +F+  + LL +MK+E     E +F+SI R Y +   P  + R+  +M+
Subjt:  VTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDSIRIFHKMQ

Query:  D-FHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAK
        + + C+PT KSY  V  ILV  N  K+A   +  M    IPPT+ +  V++KAFC     +D A+ + R+M+ HGC P+S  Y TLI+ L +   + EA 
Subjt:  D-FHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAK

Query:  ELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLL---------------------------
        +LL+EM    C P   T+  +I GLC+ + ++EA  ++  M+ +G  P+  TY  LM+G CK G    A+DL                            
Subjt:  ELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLL---------------------------

Query:  -----ELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRV
             +++    + P++ +Y++LI G  KEG +  ALE+L  M+ +G  P+   Y  +V+G C + +  EA N L+EM   G+ PN V +       N +
Subjt:  -----ELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRV

Query:  IHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLV
        I   C  +    A ++F  +  +G    V TF+SL+   C   ++     +L +M+  G +     ++ ++N F  + +  +A KLV
Subjt:  IHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLV

Q9FNL2 Pentatricopeptide repeat-containing protein At5g461004.6e-17060.61Show/hide
Query:  MGSKA-MFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIH
        MGSK  MFKW++ +TPS V +L+RAE+D+ K++ +FDSATAEY+NG+ HD  +F  M+ +LVSAN+F+ AE L+ RMK E   V+EDI LSICR YGR+H
Subjt:  MGSKA-MFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIH

Query:  KPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLIN
        +P DS+R+FHKM+DF C P++K+Y+TV AILVEENQL LAF+FY+ MR++G+PPTVASLNVLIKA C+N GT+D  + IF EM   GC PDSYTYGTLI+
Subjt:  KPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLIN

Query:  GLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISY
        GLCR G I EAK+L  EM  K+C+P+VVTYTSLI+GLC   NVDEAM  LE+M  KGIEPNVFTYS+LMDG CK G SL+A +L E+M+ +  RPNM++Y
Subjt:  GLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISY

Query:  STLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSV
        +TLI GLCKE K+ EA+E+LDRM LQGL PDAGLYGK+++G C IS+FREAANFLDEM++ GITPNR+TW++HV+T N V+ GLC  N  +RAF L+LS+
Subjt:  STLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSV

Query:  QTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQ
        ++RGIS+ V+T +SL+KC C K +  K  +++DE+V +GCIP +  W +++    D+
Subjt:  QTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQ

Q9M302 Pentatricopeptide repeat-containing protein At3g488103.3e-5930.09Show/hide
Query:  TVTPSHVE-------QLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDS
        T +P+H E       + +R E  +  AL  F S     SN FKH   TF +MI KL    Q    + LL +MK + F  +ED+F+S+   Y ++     +
Subjt:  TVTPSHVE-------QLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDS

Query:  IRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRS
        + +F+++++F C P+ K Y  V   L+ EN++++ +  YR M++ G  P V + NVL+KA CKN   +D A  +  EMSN GC PD+ +Y T+I+ +C  
Subjt:  IRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRS

Query:  GNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLIN
        G + E +EL +  E     P V  Y +LI+GLC+ ++   A  L+ +M+ KGI PNV +YSTL++  C +G    A   L  M+++   PN+ + S+L+ 
Subjt:  GNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLIN

Query:  GLCKEGKLNEALEILDRM-KLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRG
        G    G   +AL++ ++M +  GL P+   Y  +V G C      +A +    M   G +PN       +RT+  +I+G       + A  ++  + T G
Subjt:  GLCKEGKLNEALEILDRM-KLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRG

Query:  ISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCD
            V  + ++++  C      +   +++ M    C P    ++  +   CD
Subjt:  ISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCD

Arabidopsis top hitse value%identityAlignment
AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein1.0e-6330.42Show/hide
Query:  PSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFD-VTEDIFLSICRAYGRIHKPLDSIRIFHKMQD
        P HV  +I+ ++D  KAL +F+S   E   GFKH L T+R +I KL    +F   E +L  M+E   + + E +++   + YGR  K  +++ +F +M  
Subjt:  PSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFD-VTEDIFLSICRAYGRIHKPLDSIRIFHKMQD

Query:  FHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAKEL
        + C+PT  SY  + ++LV+      A + Y  MR  GI P V S  + +K+FCK T     A+ +   MS+ GC+ +   Y T++ G        E  EL
Subjt:  FHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAKEL

Query:  LQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGKLN
          +M     S  + T+  L+  LC+  +V E   LL+ ++ +G+ PN+FTY+  + G C+ G    A  ++  ++++  +P++I+Y+ LI GLCK  K  
Subjt:  LQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGKLN

Query:  EALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDS
        EA   L +M  +GL PD+  Y  ++ G C     + A   + + V  G  P++        T+  +I GLC   ++NRA  LF     +GI   V  +++
Subjt:  EALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDS

Query:  LLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQL
        L+K    +  + + +++ +EM   G IPE + ++I+VN  C      DA  LV++ +
Subjt:  LLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQL

AT3G48810.1 Pentatricopeptide repeat (PPR) superfamily protein2.3e-6030.09Show/hide
Query:  TVTPSHVE-------QLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDS
        T +P+H E       + +R E  +  AL  F S     SN FKH   TF +MI KL    Q    + LL +MK + F  +ED+F+S+   Y ++     +
Subjt:  TVTPSHVE-------QLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDS

Query:  IRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRS
        + +F+++++F C P+ K Y  V   L+ EN++++ +  YR M++ G  P V + NVL+KA CKN   +D A  +  EMSN GC PD+ +Y T+I+ +C  
Subjt:  IRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRS

Query:  GNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLIN
        G + E +EL +  E     P V  Y +LI+GLC+ ++   A  L+ +M+ KGI PNV +YSTL++  C +G    A   L  M+++   PN+ + S+L+ 
Subjt:  GNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLIN

Query:  GLCKEGKLNEALEILDRM-KLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRG
        G    G   +AL++ ++M +  GL P+   Y  +V G C      +A +    M   G +PN       +RT+  +I+G       + A  ++  + T G
Subjt:  GLCKEGKLNEALEILDRM-KLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRG

Query:  ISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCD
            V  + ++++  C      +   +++ M    C P    ++  +   CD
Subjt:  ISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCD

AT4G20090.1 Pentatricopeptide repeat (PPR) superfamily protein6.6e-6332.79Show/hide
Query:  TAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDSIRIFHKMQD-FHCKPTEKSYITVFAILVEENQLK
        +A     FK    T   MI    ++  F   E LL R++ E   + E  F+ + RAYG+ H P  ++ +FH+M D F CK + KS+ +V  +++ E    
Subjt:  TAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDSIRIFHKMQD-FHCKPTEKSYITVFAILVEENQLK

Query:  LAFRFYRYM----RKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLI
            FY Y+      M I P   S N++IKA CK    +D+A+ +FR M    C PD YTY TL++GLC+   I EA  LL EM+++ CSPS V Y  LI
Subjt:  LAFRFYRYM----RKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLI

Query:  HGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGL
         GLC+  ++     L+++M  KG  PN  TY+TL+ G C  G   +A  LLE MV  +  PN ++Y TLINGL K+ +  +A+ +L  M+ +G   +  +
Subjt:  HGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGL

Query:  YGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDE
        Y  +++GL    +  EA +   +M   G  PN V +S+       ++ GLC     N A ++   +   G      T+ SL+K F       +  ++  E
Subjt:  YGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDE

Query:  MVINGCIPEREMWSIMVNCFCDQRKACDAM
        M   GC   +  +S++++  C   +  +AM
Subjt:  MVINGCIPEREMWSIMVNCFCDQRKACDAM

AT5G46100.1 Pentatricopeptide repeat (PPR) superfamily protein3.3e-17160.61Show/hide
Query:  MGSKA-MFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIH
        MGSK  MFKW++ +TPS V +L+RAE+D+ K++ +FDSATAEY+NG+ HD  +F  M+ +LVSAN+F+ AE L+ RMK E   V+EDI LSICR YGR+H
Subjt:  MGSKA-MFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIH

Query:  KPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLIN
        +P DS+R+FHKM+DF C P++K+Y+TV AILVEENQL LAF+FY+ MR++G+PPTVASLNVLIKA C+N GT+D  + IF EM   GC PDSYTYGTLI+
Subjt:  KPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLIN

Query:  GLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISY
        GLCR G I EAK+L  EM  K+C+P+VVTYTSLI+GLC   NVDEAM  LE+M  KGIEPNVFTYS+LMDG CK G SL+A +L E+M+ +  RPNM++Y
Subjt:  GLCRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISY

Query:  STLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSV
        +TLI GLCKE K+ EA+E+LDRM LQGL PDAGLYGK+++G C IS+FREAANFLDEM++ GITPNR+TW++HV+T N V+ GLC  N  +RAF L+LS+
Subjt:  STLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSV

Query:  QTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQ
        ++RGIS+ V+T +SL+KC C K +  K  +++DE+V +GCIP +  W +++    D+
Subjt:  QTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQ

AT5G64320.1 Pentatricopeptide repeat (PPR) superfamily protein6.0e-6429.16Show/hide
Query:  VTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDSIRIFHKMQ
        +TP  + +L+    +++ ++ +F    ++  NG++H  D ++++I KL +  +F+  + LL +MK+E     E +F+SI R Y +   P  + R+  +M+
Subjt:  VTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSICRAYGRIHKPLDSIRIFHKMQ

Query:  D-FHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAK
        + + C+PT KSY  V  ILV  N  K+A   +  M    IPPT+ +  V++KAFC     +D A+ + R+M+ HGC P+S  Y TLI+ L +   + EA 
Subjt:  D-FHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGLCRSGNIVEAK

Query:  ELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLL---------------------------
        +LL+EM    C P   T+  +I GLC+ + ++EA  ++  M+ +G  P+  TY  LM+G CK G    A+DL                            
Subjt:  ELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLL---------------------------

Query:  -----ELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRV
             +++    + P++ +Y++LI G  KEG +  ALE+L  M+ +G  P+   Y  +V+G C + +  EA N L+EM   G+ PN V +       N +
Subjt:  -----ELMVQKRLRPNMISYSTLINGLCKEGKLNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRV

Query:  IHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLV
        I   C  +    A ++F  +  +G    V TF+SL+   C   ++     +L +M+  G +     ++ ++N F  + +  +A KLV
Subjt:  IHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTKRDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATGTCAACGTCGTTGCTGGGAACTCTCGGTCGCCATCGCCGCCGCTGGTAATCGGAGATGGGCTCGTTTCAGCAACGCCCTACGAAACTGCCGTCTTTCAACCTC
CATCCGATTCAGAGTTCAGCTAGAACCATTGCCCGGCCCCTTGCAGAGGATTCTTCAATTTACCTCCTCCAATCTCTCACTTCACATTAGCCTGCACTCCGTCGGCAGTC
TGATGTATTTGTTGGCATCTTTTAATGAACTTCAAACTATTGAACAGTGCACTACAATTATTTTTCTTGCATGGGAAACTATGGACTTGAAAGTATTGAAAAAGATTCAC
ATGCTTCAGGTTCAATCGGAAGTTCGAAAGTTTGATGCTTTAAGAAAGAGAGGAACAATGGGCAGTAAAGCCATGTTTAAATGGGCAAGAACAGTCACGCCTTCTCATGT
TGAACAGCTAATTCGAGCAGAACGAGACATAAACAAGGCACTTCTCATATTTGACTCTGCGACAGCTGAGTATTCAAATGGTTTCAAGCACGATCTCGATACTTTTAGGC
TCATGATTAGTAAGTTAGTTTCTGCAAACCAGTTCAGGTTAGCAGAAACACTTCTTGATAGGATGAAGGAAGAGAAATTTGATGTCACTGAGGATATATTTCTCTCTATT
TGTAGGGCTTATGGTCGTATCCATAAGCCATTGGATTCCATTAGAATTTTTCACAAAATGCAGGATTTCCATTGCAAGCCTACAGAGAAATCTTACATTACAGTGTTTGC
CATTCTTGTTGAAGAAAATCAACTAAAATTGGCCTTTAGATTTTATAGGTATATGAGAAAAATGGGTATTCCCCCTACTGTAGCTTCTCTTAATGTTCTAATCAAAGCCT
TTTGCAAGAATACTGGAACTATGGATAAAGCAATGCACATATTCCGTGAAATGTCTAATCATGGGTGTAAACCTGATTCATATACATATGGAACTCTGATCAATGGGTTA
TGTAGATCGGGAAACATTGTCGAAGCAAAGGAATTATTGCAAGAGATGGAGACAAAAAATTGTTCACCTTCTGTCGTCACCTATACTTCATTGATACATGGTTTGTGTCA
GTTGAACAATGTGGATGAAGCAATGCTATTACTTGAAGATATGATGGGCAAGGGTATTGAACCTAATGTGTTTACTTACAGTACCCTAATGGATGGATTTTGCAAGGCTG
GTCATTCTTTGCGAGCTAGAGACCTCTTGGAGCTGATGGTTCAAAAACGACTGAGGCCCAACATGATTAGTTATAGTACATTGATTAATGGACTTTGTAAAGAAGGAAAA
CTAAACGAAGCTTTAGAGATTCTCGACAGAATGAAACTTCAAGGTTTGACACCAGATGCTGGGTTGTATGGGAAAATAGTGAATGGCCTCTGTGATATTAGCAGATTCCG
AGAAGCTGCAAACTTCTTGGATGAGATGGTCGTTTGTGGGATCACACCTAATAGAGTAACATGGAGCCTTCATGTCAGGACTCATAACAGAGTAATTCACGGCCTCTGCA
CTATCAACGATTCAAATCGTGCATTTCAGTTGTTTCTTAGCGTGCAGACACGTGGTATTAGTATCACTGTTGATACTTTTGATTCTTTATTAAAATGCTTCTGTACCAAA
AGAGATCTTCATAAAACTTCTAGAATTCTGGATGAGATGGTGATTAATGGATGTATTCCTGAGAGAGAAATGTGGAGTATCATGGTTAATTGTTTTTGTGATCAAAGAAA
AGCTTGTGATGCTATGAAATTGGTGCAACTTCAGTTGATGAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAATGTCAACGTCGTTGCTGGGAACTCTCGGTCGCCATCGCCGCCGCTGGTAATCGGAGATGGGCTCGTTTCAGCAACGCCCTACGAAACTGCCGTCTTTCAACCTC
CATCCGATTCAGAGTTCAGCTAGAACCATTGCCCGGCCCCTTGCAGAGGATTCTTCAATTTACCTCCTCCAATCTCTCACTTCACATTAGCCTGCACTCCGTCGGCAGTC
TGATGTATTTGTTGGCATCTTTTAATGAACTTCAAACTATTGAACAGTGCACTACAATTATTTTTCTTGCATGGGAAACTATGGACTTGAAAGTATTGAAAAAGATTCAC
ATGCTTCAGGTTCAATCGGAAGTTCGAAAGTTTGATGCTTTAAGAAAGAGAGGAACAATGGGCAGTAAAGCCATGTTTAAATGGGCAAGAACAGTCACGCCTTCTCATGT
TGAACAGCTAATTCGAGCAGAACGAGACATAAACAAGGCACTTCTCATATTTGACTCTGCGACAGCTGAGTATTCAAATGGTTTCAAGCACGATCTCGATACTTTTAGGC
TCATGATTAGTAAGTTAGTTTCTGCAAACCAGTTCAGGTTAGCAGAAACACTTCTTGATAGGATGAAGGAAGAGAAATTTGATGTCACTGAGGATATATTTCTCTCTATT
TGTAGGGCTTATGGTCGTATCCATAAGCCATTGGATTCCATTAGAATTTTTCACAAAATGCAGGATTTCCATTGCAAGCCTACAGAGAAATCTTACATTACAGTGTTTGC
CATTCTTGTTGAAGAAAATCAACTAAAATTGGCCTTTAGATTTTATAGGTATATGAGAAAAATGGGTATTCCCCCTACTGTAGCTTCTCTTAATGTTCTAATCAAAGCCT
TTTGCAAGAATACTGGAACTATGGATAAAGCAATGCACATATTCCGTGAAATGTCTAATCATGGGTGTAAACCTGATTCATATACATATGGAACTCTGATCAATGGGTTA
TGTAGATCGGGAAACATTGTCGAAGCAAAGGAATTATTGCAAGAGATGGAGACAAAAAATTGTTCACCTTCTGTCGTCACCTATACTTCATTGATACATGGTTTGTGTCA
GTTGAACAATGTGGATGAAGCAATGCTATTACTTGAAGATATGATGGGCAAGGGTATTGAACCTAATGTGTTTACTTACAGTACCCTAATGGATGGATTTTGCAAGGCTG
GTCATTCTTTGCGAGCTAGAGACCTCTTGGAGCTGATGGTTCAAAAACGACTGAGGCCCAACATGATTAGTTATAGTACATTGATTAATGGACTTTGTAAAGAAGGAAAA
CTAAACGAAGCTTTAGAGATTCTCGACAGAATGAAACTTCAAGGTTTGACACCAGATGCTGGGTTGTATGGGAAAATAGTGAATGGCCTCTGTGATATTAGCAGATTCCG
AGAAGCTGCAAACTTCTTGGATGAGATGGTCGTTTGTGGGATCACACCTAATAGAGTAACATGGAGCCTTCATGTCAGGACTCATAACAGAGTAATTCACGGCCTCTGCA
CTATCAACGATTCAAATCGTGCATTTCAGTTGTTTCTTAGCGTGCAGACACGTGGTATTAGTATCACTGTTGATACTTTTGATTCTTTATTAAAATGCTTCTGTACCAAA
AGAGATCTTCATAAAACTTCTAGAATTCTGGATGAGATGGTGATTAATGGATGTATTCCTGAGAGAGAAATGTGGAGTATCATGGTTAATTGTTTTTGTGATCAAAGAAA
AGCTTGTGATGCTATGAAATTGGTGCAACTTCAGTTGATGAATTGA
Protein sequenceShow/hide protein sequence
MKCQRRCWELSVAIAAAGNRRWARFSNALRNCRLSTSIRFRVQLEPLPGPLQRILQFTSSNLSLHISLHSVGSLMYLLASFNELQTIEQCTTIIFLAWETMDLKVLKKIH
MLQVQSEVRKFDALRKRGTMGSKAMFKWARTVTPSHVEQLIRAERDINKALLIFDSATAEYSNGFKHDLDTFRLMISKLVSANQFRLAETLLDRMKEEKFDVTEDIFLSI
CRAYGRIHKPLDSIRIFHKMQDFHCKPTEKSYITVFAILVEENQLKLAFRFYRYMRKMGIPPTVASLNVLIKAFCKNTGTMDKAMHIFREMSNHGCKPDSYTYGTLINGL
CRSGNIVEAKELLQEMETKNCSPSVVTYTSLIHGLCQLNNVDEAMLLLEDMMGKGIEPNVFTYSTLMDGFCKAGHSLRARDLLELMVQKRLRPNMISYSTLINGLCKEGK
LNEALEILDRMKLQGLTPDAGLYGKIVNGLCDISRFREAANFLDEMVVCGITPNRVTWSLHVRTHNRVIHGLCTINDSNRAFQLFLSVQTRGISITVDTFDSLLKCFCTK
RDLHKTSRILDEMVINGCIPEREMWSIMVNCFCDQRKACDAMKLVQLQLMN