; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040689 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040689
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr13:7334131..7337081
RNA-Seq ExpressionLag0040689
SyntenyLag0040689
Gene Ontology termsGO:0080156 - mitochondrial mRNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022139117.1 pentatricopeptide repeat-containing protein At3g12770 [Momordica charantia]0.0e+0090.17Show/hide
Query:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV
        MS+HSFSL LSLSSLSTA SK+AATS EALL RKHLDQLYVQLIVSGL+KC FLVIKFV ACLHL D+ YAHKAF EVLEPDILLWNA+IKGY TQNN+ 
Subjt:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV

Query:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD
         GA++LYT+MQVSGVHPDCFTFLYVLKACGGMS+E IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQ S +R+VFD+L +RTVVSWTSIISGYVQNGD
Subjt:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD

Query:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG
        P EALSVFK+MRQSN+K DWIALVSV TAYTDMEDLGQGK+IHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNL+LWNAMISGYAKNG
Subjt:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG

Query:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH
        Y EEAIKLFREMISKNIR DSVTVRSAILAGAQVGSL+LARWLDGYI KSEYRDDTFVNTALIDMYAKCGSIYFA  VFDRMVDKDVVLWS MIMGYGLH
Subjt:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH

Query:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK
        GHG+EAI+LYN MKQ GV PNDVTFVGLLTACKN+GLVKEGW+LFH+++DHGIEPHHQHYSCVVDLLGRAG LNQAYDFIM+MPIKPGVSVWGALLSACK
Subjt:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK

Query:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG
        IHRQV+LGEIAAEQLF LDPYNTGHYVQLSNLYASAHLW+HVANVRLMMTQKGLNKDLGHSSIEING L+TFHVGDRSHPRSKEIFEELDRLE+RLKAAG
Subjt:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG

Query:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        Y+PHM+SVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

XP_022941625.1 pentatricopeptide repeat-containing protein At3g12770 isoform X1 [Cucurbita moschata]0.0e+0089.11Show/hide
Query:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLR------RKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYT
        MS+HSFSL LSLSSLSTALSK AATS EALLR      RKHLDQLYVQLIVSGL KCGFLVIKFV ACLHLRD+NYAHK F EVLEPDILLWN IIKGY 
Subjt:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLR------RKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYT

Query:  TQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISG
        TQNN+ AGAIR+Y DMQVSGV+PDCFTFLYVLKACGGMSVEGIGKQMH QTFKYG GSNVFVQNSLVSMYA+FGQ SS+R+VFD+LHNRTVVSWTSIISG
Subjt:  TQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISG

Query:  YVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMIS
        YVQNGDPV+AL VFK+MR+S +K DWI LVSV TAYTDMEDLGQGKAIH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMIS
Subjt:  YVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMIS

Query:  GYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMI
        GYAKNGY EEAI+LFR+MISKNI  DSVTVRSAILA AQ GSLELARWLDGYI KSEYRDD FVNTALIDM+AKCGSI FAR VFDRMVDKDVVLWS MI
Subjt:  GYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMI

Query:  MGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGA
        MGYGLHGHGQEAI LYN MKQ+GVCPN+VTFVGLLTACKN+GLVKEGWELFHQM+DHGIEPHHQHYSCVVDLLGRAG LN+AYDFIMSMPIKPGVSVWGA
Subjt:  MGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGA

Query:  LLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEK
        LLS CKIHRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLW+HV NVRLMMTQKGLNKDLGHSSIEING L+TFHVGDRSHPRSKEIFEELDRLE+
Subjt:  LLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEK

Query:  RLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        RLKAAGYV HM+SVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREII+RDAKRFH+FKDG CSCGDFW
Subjt:  RLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

XP_022941627.1 pentatricopeptide repeat-containing protein At3g12770 isoform X2 [Cucurbita moschata]0.0e+0089.88Show/hide
Query:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV
        MS+HSFSL LSLSSLSTALSK AATS EALLRRKHLDQLYVQLIVSGL KCGFLVIKFV ACLHLRD+NYAHK F EVLEPDILLWN IIKGY TQNN+ 
Subjt:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV

Query:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD
        AGAIR+Y DMQVSGV+PDCFTFLYVLKACGGMSVEGIGKQMH QTFKYG GSNVFVQNSLVSMYA+FGQ SS+R+VFD+LHNRTVVSWTSIISGYVQNGD
Subjt:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD

Query:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG
        PV+AL VFK+MR+S +K DWI LVSV TAYTDMEDLGQGKAIH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISGYAKNG
Subjt:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG

Query:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH
        Y EEAI+LFR+MISKNI  DSVTVRSAILA AQ GSLELARWLDGYI KSEYRDD FVNTALIDM+AKCGSI FAR VFDRMVDKDVVLWS MIMGYGLH
Subjt:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH

Query:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK
        GHGQEAI LYN MKQ+GVCPN+VTFVGLLTACKN+GLVKEGWELFHQM+DHGIEPHHQHYSCVVDLLGRAG LN+AYDFIMSMPIKPGVSVWGALLS CK
Subjt:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK

Query:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG
        IHRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLW+HV NVRLMMTQKGLNKDLGHSSIEING L+TFHVGDRSHPRSKEIFEELDRLE+RLKAAG
Subjt:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG

Query:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        YV HM+SVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREII+RDAKRFH+FKDG CSCGDFW
Subjt:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

XP_022988451.1 pentatricopeptide repeat-containing protein At3g12770 isoform X2 [Cucurbita maxima]0.0e+0089.74Show/hide
Query:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV
        MS+HSFSL LSL+SLSTALSK AATS EALLRRKHLDQLYVQLIVSGL KCGFLVIKFV ACLHLRD+NYAHK F EVLEPDILLWN IIKGY TQNN+ 
Subjt:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV

Query:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD
        AGAIR+Y DMQVSGV+PDCFTFLYVLKACGGMSVEGIGKQMH QTFKYGFGSNVFVQNSLVSMYA++GQ SS+R+VFD+LHNRTVVSWTSIISGYVQNGD
Subjt:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD

Query:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG
        P++AL VFK+MRQS +K DWI LVSV TAYTDMEDLGQGKAIH LVTKLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMISGYAKNG
Subjt:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG

Query:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH
        Y EEAI+LFR+MISKNI  DSVTVRSAILA AQVGSLELARWLDGYI KSEYRDD FVNTALIDM+AKCGSI FARSVFDRMVDKD+V WS MIMGYGLH
Subjt:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH

Query:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK
        GHGQEAI LYN MKQ+G+ PNDVTFVGLLTACKN+GLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAG LN+AYDFIMSMPIKPGVSVWGALLS CK
Subjt:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK

Query:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG
        IHRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLW+ VANVRLMMTQKGLNKDLGHSSIEING L+TFHVGDRSHPRSKEIFEELDRLE+RLKAAG
Subjt:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG

Query:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        YV HM+SVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRD KRFHHFKDG CSCGDFW
Subjt:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

XP_038892943.1 pentatricopeptide repeat-containing protein At3g12770 [Benincasa hispida]0.0e+0089.45Show/hide
Query:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV
        MS+HSFSL LSLSSLS+ALSK A TSHEA LRRKHLDQLYVQLIVSGL+KCGFL+IKFV +CLH  D+NYAHKAF EV+EPDILLWNAIIKGY TQ N+ 
Subjt:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV

Query:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD
         GAIR+Y DMQ+S V+P+CFTFLYVLKAC GMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQ SS+R+VFD+LH+RTVVSWTSIISGYVQNGD
Subjt:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD

Query:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG
        PVEAL +FKEMRQ N+K DWI LVSV TAYTD+EDLGQGK+IHGLVTKLGLEFEPDIV+SLTTMYAK G VE+ARFFFNQMEKPNLILWNAMISGYAKNG
Subjt:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG

Query:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH
        Y EEAIKLF EMISKNIR DSVTVRSAILAGAQVGSL+LARWLD YI +SEYRDDTFVNT+L+DMYAKCGSIYFAR VFDRMV KDVVLWS MIMGYGLH
Subjt:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH

Query:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK
        GHGQEAI+ YNEMKQAGVCPNDVTFVGLLTACKN+GLVKEGWELFHQMQD+GIEPHHQHYSCVVDLLGRAG LNQAYDFIMSMP+KPGVSVWGALLSACK
Subjt:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK

Query:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG
        IHR+VRLGEIAAEQLFILDPYN G++VQLSNLYASAHLW HVANVRLMMTQKGLNKDLGHSSI+ING L+TFHVGDRSHPRSKEIFEELDRLEKRLKAAG
Subjt:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG

Query:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        YVPHM+SVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDG CSCGDFW
Subjt:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

TrEMBL top hitse value%identityAlignment
A0A6J1CD43 pentatricopeptide repeat-containing protein At3g127700.0e+0090.17Show/hide
Query:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV
        MS+HSFSL LSLSSLSTA SK+AATS EALL RKHLDQLYVQLIVSGL+KC FLVIKFV ACLHL D+ YAHKAF EVLEPDILLWNA+IKGY TQNN+ 
Subjt:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV

Query:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD
         GA++LYT+MQVSGVHPDCFTFLYVLKACGGMS+E IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQ S +R+VFD+L +RTVVSWTSIISGYVQNGD
Subjt:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD

Query:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG
        P EALSVFK+MRQSN+K DWIALVSV TAYTDMEDLGQGK+IHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNL+LWNAMISGYAKNG
Subjt:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG

Query:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH
        Y EEAIKLFREMISKNIR DSVTVRSAILAGAQVGSL+LARWLDGYI KSEYRDDTFVNTALIDMYAKCGSIYFA  VFDRMVDKDVVLWS MIMGYGLH
Subjt:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH

Query:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK
        GHG+EAI+LYN MKQ GV PNDVTFVGLLTACKN+GLVKEGW+LFH+++DHGIEPHHQHYSCVVDLLGRAG LNQAYDFIM+MPIKPGVSVWGALLSACK
Subjt:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK

Query:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG
        IHRQV+LGEIAAEQLF LDPYNTGHYVQLSNLYASAHLW+HVANVRLMMTQKGLNKDLGHSSIEING L+TFHVGDRSHPRSKEIFEELDRLE+RLKAAG
Subjt:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG

Query:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        Y+PHM+SVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

A0A6J1FLM1 pentatricopeptide repeat-containing protein At3g12770 isoform X10.0e+0089.11Show/hide
Query:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLR------RKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYT
        MS+HSFSL LSLSSLSTALSK AATS EALLR      RKHLDQLYVQLIVSGL KCGFLVIKFV ACLHLRD+NYAHK F EVLEPDILLWN IIKGY 
Subjt:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLR------RKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYT

Query:  TQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISG
        TQNN+ AGAIR+Y DMQVSGV+PDCFTFLYVLKACGGMSVEGIGKQMH QTFKYG GSNVFVQNSLVSMYA+FGQ SS+R+VFD+LHNRTVVSWTSIISG
Subjt:  TQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISG

Query:  YVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMIS
        YVQNGDPV+AL VFK+MR+S +K DWI LVSV TAYTDMEDLGQGKAIH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMIS
Subjt:  YVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMIS

Query:  GYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMI
        GYAKNGY EEAI+LFR+MISKNI  DSVTVRSAILA AQ GSLELARWLDGYI KSEYRDD FVNTALIDM+AKCGSI FAR VFDRMVDKDVVLWS MI
Subjt:  GYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMI

Query:  MGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGA
        MGYGLHGHGQEAI LYN MKQ+GVCPN+VTFVGLLTACKN+GLVKEGWELFHQM+DHGIEPHHQHYSCVVDLLGRAG LN+AYDFIMSMPIKPGVSVWGA
Subjt:  MGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGA

Query:  LLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEK
        LLS CKIHRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLW+HV NVRLMMTQKGLNKDLGHSSIEING L+TFHVGDRSHPRSKEIFEELDRLE+
Subjt:  LLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEK

Query:  RLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        RLKAAGYV HM+SVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREII+RDAKRFH+FKDG CSCGDFW
Subjt:  RLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

A0A6J1FN08 pentatricopeptide repeat-containing protein At3g12770 isoform X20.0e+0089.88Show/hide
Query:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV
        MS+HSFSL LSLSSLSTALSK AATS EALLRRKHLDQLYVQLIVSGL KCGFLVIKFV ACLHLRD+NYAHK F EVLEPDILLWN IIKGY TQNN+ 
Subjt:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV

Query:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD
        AGAIR+Y DMQVSGV+PDCFTFLYVLKACGGMSVEGIGKQMH QTFKYG GSNVFVQNSLVSMYA+FGQ SS+R+VFD+LHNRTVVSWTSIISGYVQNGD
Subjt:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD

Query:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG
        PV+AL VFK+MR+S +K DWI LVSV TAYTDMEDLGQGKAIH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISGYAKNG
Subjt:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG

Query:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH
        Y EEAI+LFR+MISKNI  DSVTVRSAILA AQ GSLELARWLDGYI KSEYRDD FVNTALIDM+AKCGSI FAR VFDRMVDKDVVLWS MIMGYGLH
Subjt:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH

Query:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK
        GHGQEAI LYN MKQ+GVCPN+VTFVGLLTACKN+GLVKEGWELFHQM+DHGIEPHHQHYSCVVDLLGRAG LN+AYDFIMSMPIKPGVSVWGALLS CK
Subjt:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK

Query:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG
        IHRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLW+HV NVRLMMTQKGLNKDLGHSSIEING L+TFHVGDRSHPRSKEIFEELDRLE+RLKAAG
Subjt:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG

Query:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        YV HM+SVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREII+RDAKRFH+FKDG CSCGDFW
Subjt:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

A0A6J1JD09 pentatricopeptide repeat-containing protein At3g12770 isoform X10.0e+0088.97Show/hide
Query:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLR------RKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYT
        MS+HSFSL LSL+SLSTALSK AATS EALLR      RKHLDQLYVQLIVSGL KCGFLVIKFV ACLHLRD+NYAHK F EVLEPDILLWN IIKGY 
Subjt:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLR------RKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYT

Query:  TQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISG
        TQNN+ AGAIR+Y DMQVSGV+PDCFTFLYVLKACGGMSVEGIGKQMH QTFKYGFGSNVFVQNSLVSMYA++GQ SS+R+VFD+LHNRTVVSWTSIISG
Subjt:  TQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISG

Query:  YVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMIS
        YVQNGDP++AL VFK+MRQS +K DWI LVSV TAYTDMEDLGQGKAIH LVTKLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMIS
Subjt:  YVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMIS

Query:  GYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMI
        GYAKNGY EEAI+LFR+MISKNI  DSVTVRSAILA AQVGSLELARWLDGYI KSEYRDD FVNTALIDM+AKCGSI FARSVFDRMVDKD+V WS MI
Subjt:  GYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMI

Query:  MGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGA
        MGYGLHGHGQEAI LYN MKQ+G+ PNDVTFVGLLTACKN+GLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAG LN+AYDFIMSMPIKPGVSVWGA
Subjt:  MGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGA

Query:  LLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEK
        LLS CKIHRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLW+ VANVRLMMTQKGLNKDLGHSSIEING L+TFHVGDRSHPRSKEIFEELDRLE+
Subjt:  LLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEK

Query:  RLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        RLKAAGYV HM+SVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRD KRFHHFKDG CSCGDFW
Subjt:  RLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

A0A6J1JH82 pentatricopeptide repeat-containing protein At3g12770 isoform X20.0e+0089.74Show/hide
Query:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV
        MS+HSFSL LSL+SLSTALSK AATS EALLRRKHLDQLYVQLIVSGL KCGFLVIKFV ACLHLRD+NYAHK F EVLEPDILLWN IIKGY TQNN+ 
Subjt:  MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVV

Query:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD
        AGAIR+Y DMQVSGV+PDCFTFLYVLKACGGMSVEGIGKQMH QTFKYGFGSNVFVQNSLVSMYA++GQ SS+R+VFD+LHNRTVVSWTSIISGYVQNGD
Subjt:  AGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGD

Query:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG
        P++AL VFK+MRQS +K DWI LVSV TAYTDMEDLGQGKAIH LVTKLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMISGYAKNG
Subjt:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG

Query:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH
        Y EEAI+LFR+MISKNI  DSVTVRSAILA AQVGSLELARWLDGYI KSEYRDD FVNTALIDM+AKCGSI FARSVFDRMVDKD+V WS MIMGYGLH
Subjt:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH

Query:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK
        GHGQEAI LYN MKQ+G+ PNDVTFVGLLTACKN+GLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAG LN+AYDFIMSMPIKPGVSVWGALLS CK
Subjt:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK

Query:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG
        IHRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLW+ VANVRLMMTQKGLNKDLGHSSIEING L+TFHVGDRSHPRSKEIFEELDRLE+RLKAAG
Subjt:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG

Query:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        YV HM+SVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRD KRFHHFKDG CSCGDFW
Subjt:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic2.2e-15540.7Show/hide
Query:  QLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGI
        +++  L+ SG +   F +          R +N A K F  + E D++ WN I+ GY +QN +   A+ +   M    + P   T + VL A   + +  +
Subjt:  QLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGI

Query:  GKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLG
        GK++HG   + GF S V +  +LV MYAK G + ++R +FD +  R VVSW S+I  YVQN +P EA+ +F++M    +KP  ++++    A  D+ DL 
Subjt:  GKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLG

Query:  QGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSL
        +G+ IH L  +LGL+    +V SL +MY K  +V+ A   F +++   L+ WNAMI G+A+NG   +A+  F +M S+ ++ D+ T  S I A A++   
Subjt:  QGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSL

Query:  ELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGL
          A+W+ G + +S    + FV TAL+DMYAKCG+I  AR +FD M ++ V  W+ MI GYG HG G+ A+ L+ EM++  + PN VTF+ +++AC ++GL
Subjt:  ELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGL

Query:  VKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASA
        V+ G + F+ M +++ IE    HY  +VDLLGRAG LN+A+DFIM MP+KP V+V+GA+L AC+IH+ V   E AAE+LF L+P + G++V L+N+Y +A
Subjt:  VKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASA

Query:  HLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIIS
         +W+ V  VR+ M ++GL K  G S +EI  ++ +F  G  +HP SK+I+  L++L   +K AGYVP  + VL  + ++  E+ L  HSE+LA+++G+++
Subjt:  HLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIIS

Query:  TAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        T  GTT+ + KNLR C +CH+A K IS +  REI++RD +RFHHFK+G CSCGD+W
Subjt:  TAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127705.8e-24157.37Show/hide
Query:  MHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAG
        +HS S + SL         + + +H+A L+     Q++ +L+V GL   GFL+ K + A     DI +A + F ++  P I  WNAII+GY ++NN    
Subjt:  MHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAG

Query:  AIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFD--RLHNRTVVSWTSIISGYVQNGD
        A+ +Y++MQ++ V PD FTF ++LKAC G+S   +G+ +H Q F+ GF ++VFVQN L+++YAK  ++ S+R VF+   L  RT+VSWT+I+S Y QNG+
Subjt:  AIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFD--RLHNRTVVSWTSIISGYVQNGD

Query:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG
        P+EAL +F +MR+ ++KPDW+ALVSV  A+T ++DL QG++IH  V K+GLE EPD+++SL TMYAK GQV  A+  F++M+ PNLILWNAMISGYAKNG
Subjt:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG

Query:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH
        Y  EAI +F EMI+K++R D++++ SAI A AQVGSLE AR +  Y+ +S+YRDD F+++ALIDM+AKCGS+  AR VFDR +D+DVV+WS MI+GYGLH
Subjt:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH

Query:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK
        G  +EAISLY  M++ GV PNDVTF+GLL AC ++G+V+EGW  F++M DH I P  QHY+CV+DLLGRAG+L+QAY+ I  MP++PGV+VWGALLSACK
Subjt:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK

Query:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG
         HR V LGE AA+QLF +DP NTGHYVQLSNLYA+A LWD VA VR+ M +KGLNKD+G S +E+ G+L+ F VGD+SHPR +EI  +++ +E RLK  G
Subjt:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG

Query:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        +V + D+ LHDLN EE EETLC+HSER+A+AYG+IST  GT LRITKNLRACVNCH+A KLISKLVDREI++RD  RFHHFKDG CSCGD+W
Subjt:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

Q9LW32 Pentatricopeptide repeat-containing protein At3g26782, mitochondrial1.2e-14842.03Show/hide
Query:  DILLWNAIIKGYTTQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLH
        D+  WN++I       +  A A+  ++ M+   ++P   +F   +KAC  +     GKQ H Q F +G+ S++FV ++L+ MY+  G++  +R VFD + 
Subjt:  DILLWNAIIKGYTTQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLH

Query:  NRTVVSWTSIISGYVQNGDPVEALSVFKEM------RQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQ--VEV
         R +VSWTS+I GY  NG+ ++A+S+FK++          M  D + LVSV +A + +   G  ++IH  V K G +    +  +L   YAK G+  V V
Subjt:  NRTVVSWTSIISGYVQNGDPVEALSVFKEM------RQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQ--VEV

Query:  ARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMI-SKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSI
        AR  F+Q+   + + +N+++S YA++G   EA ++FR ++ +K +  +++T+ + +LA +  G+L + + +   + +    DD  V T++IDMY KCG +
Subjt:  ARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMI-SKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSI

Query:  YFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDH-GIEPHHQHYSCVVDLLGRAG
          AR  FDRM +K+V  W+ MI GYG+HGH  +A+ L+  M  +GV PN +TFV +L AC +AGL  EGW  F+ M+   G+EP  +HY C+VDLLGRAG
Subjt:  YFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDH-GIEPHHQHYSCVVDLLGRAG

Query:  NLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQT
         L +AYD I  M +KP   +W +LL+AC+IH+ V L EI+  +LF LD  N G+Y+ LS++YA A  W  V  VR++M  +GL K  G S +E+NG++  
Subjt:  NLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQT

Query:  FHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREII
        F +GD  HP+ ++I+E L  L ++L  AGYV +  SV HD++ EE E TL  HSE+LA+A+GI++T PG+T+ + KNLR C +CH+ IKLISK+VDRE +
Subjt:  FHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREII

Query:  IRDAKRFHHFKDGDCSCGDFW
        +RDAKRFHHFKDG CSCGD+W
Subjt:  IRDAKRFHHFKDGDCSCGDFW

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233306.3e-14739Show/hide
Query:  SLSTALSKVAATSHEALLRRKHLDQLYVQLI-VSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDMQV
        S S AL K    +   +  +    QL+ Q I    L+     ++  +    +L+ ++ A   F  +  P +L W ++I+ +T Q ++ + A+  + +M+ 
Subjt:  SLSTALSKVAATSHEALLRRKHLDQLYVQLI-VSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDMQV

Query:  SGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAK-------------------------------------FGQISSSRM
        SG  PD   F  VLK+C  M     G+ +HG   + G   +++  N+L++MYAK                                     FG I S R 
Subjt:  SGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAK-------------------------------------FGQISSSRM

Query:  VFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVAR
        VF+ +  + VVS+ +II+GY Q+G   +AL + +EM  +++KPD   L SV   +++  D+ +GK IHG V + G++ +  I  SL  MYAK  ++E + 
Subjt:  VFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVAR

Query:  FFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFA
          F+++   + I WN++++GY +NG   EA++LFR+M++  ++  +V   S I A A + +L L + L GY+ +  +  + F+ +AL+DMY+KCG+I  A
Subjt:  FFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFA

Query:  RSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGNLN
        R +FDRM   D V W+ +IMG+ LHGHG EA+SL+ EMK+ GV PN V FV +LTAC + GLV E W  F+ M + +G+    +HY+ V DLLGRAG L 
Subjt:  RSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGNLN

Query:  QAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHV
        +AY+FI  M ++P  SVW  LLS+C +H+ + L E  AE++F +D  N G YV + N+YAS   W  +A +RL M +KGL K    S IE+  K   F  
Subjt:  QAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHV

Query:  GDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRD
        GDRSHP   +I E L  + ++++  GYV     VLHD++ E   E L  HSERLAVA+GII+T PGTT+R+TKN+R C +CH AIK ISK+ +REII+RD
Subjt:  GDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRD

Query:  AKRFHHFKDGDCSCGDFW
          RFHHF  G+CSCGD+W
Subjt:  AKRFHHFKDGDCSCGDFW

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic3.7e-14739.97Show/hide
Query:  DQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEG
        +QL+  ++ SG  +   +    V   L  + ++ A K F E+ E D++ WN+II GY + N +    + ++  M VSG+  D  T + V   C    +  
Subjt:  DQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEG

Query:  IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDL
        +G+ +H    K  F       N+L+ MY+K G + S++ VF  + +R+VVS+TS+I+GY + G   EA+ +F+EM +  + PD   + +V         L
Subjt:  IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDL

Query:  GQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMI-SKNIRADSVTVRSAILAGAQVG
         +GK +H  + +  L F+  +  +L  MYAK G ++ A   F++M   ++I WN +I GY+KN Y  EA+ LF  ++  K    D  TV   + A A + 
Subjt:  GQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMI-SKNIRADSVTVRSAILAGAQVG

Query:  SLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNA
        + +  R + GYI ++ Y  D  V  +L+DMYAKCG++  A  +FD +  KD+V W+VMI GYG+HG G+EAI+L+N+M+QAG+  ++++FV LL AC ++
Subjt:  SLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNA

Query:  GLVKEGWELFHQMQ-DHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYA
        GLV EGW  F+ M+ +  IEP  +HY+C+VD+L R G+L +AY FI +MPI P  ++WGALL  C+IH  V+L E  AE++F L+P NTG+YV ++N+YA
Subjt:  GLVKEGWELFHQMQ-DHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYA

Query:  SAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGI
         A  W+ V  +R  + Q+GL K+ G S IEI G++  F  GD S+P ++ I   L ++  R+   GY P     L D    E EE LC HSE+LA+A GI
Subjt:  SAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGI

Query:  ISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        IS+  G  +R+TKNLR C +CH   K +SKL  REI++RD+ RFH FKDG CSC  FW
Subjt:  ISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein1.5e-15640.7Show/hide
Query:  QLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGI
        +++  L+ SG +   F +          R +N A K F  + E D++ WN I+ GY +QN +   A+ +   M    + P   T + VL A   + +  +
Subjt:  QLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGI

Query:  GKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLG
        GK++HG   + GF S V +  +LV MYAK G + ++R +FD +  R VVSW S+I  YVQN +P EA+ +F++M    +KP  ++++    A  D+ DL 
Subjt:  GKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLG

Query:  QGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSL
        +G+ IH L  +LGL+    +V SL +MY K  +V+ A   F +++   L+ WNAMI G+A+NG   +A+  F +M S+ ++ D+ T  S I A A++   
Subjt:  QGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSL

Query:  ELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGL
          A+W+ G + +S    + FV TAL+DMYAKCG+I  AR +FD M ++ V  W+ MI GYG HG G+ A+ L+ EM++  + PN VTF+ +++AC ++GL
Subjt:  ELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGL

Query:  VKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASA
        V+ G + F+ M +++ IE    HY  +VDLLGRAG LN+A+DFIM MP+KP V+V+GA+L AC+IH+ V   E AAE+LF L+P + G++V L+N+Y +A
Subjt:  VKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASA

Query:  HLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIIS
         +W+ V  VR+ M ++GL K  G S +EI  ++ +F  G  +HP SK+I+  L++L   +K AGYVP  + VL  + ++  E+ L  HSE+LA+++G+++
Subjt:  HLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIIS

Query:  TAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        T  GTT+ + KNLR C +CH+A K IS +  REI++RD +RFHHFK+G CSCGD+W
Subjt:  TAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

AT3G12770.1 mitochondrial editing factor 224.1e-24257.37Show/hide
Query:  MHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAG
        +HS S + SL         + + +H+A L+     Q++ +L+V GL   GFL+ K + A     DI +A + F ++  P I  WNAII+GY ++NN    
Subjt:  MHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAG

Query:  AIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFD--RLHNRTVVSWTSIISGYVQNGD
        A+ +Y++MQ++ V PD FTF ++LKAC G+S   +G+ +H Q F+ GF ++VFVQN L+++YAK  ++ S+R VF+   L  RT+VSWT+I+S Y QNG+
Subjt:  AIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFD--RLHNRTVVSWTSIISGYVQNGD

Query:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG
        P+EAL +F +MR+ ++KPDW+ALVSV  A+T ++DL QG++IH  V K+GLE EPD+++SL TMYAK GQV  A+  F++M+ PNLILWNAMISGYAKNG
Subjt:  PVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNG

Query:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH
        Y  EAI +F EMI+K++R D++++ SAI A AQVGSLE AR +  Y+ +S+YRDD F+++ALIDM+AKCGS+  AR VFDR +D+DVV+WS MI+GYGLH
Subjt:  YCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLH

Query:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK
        G  +EAISLY  M++ GV PNDVTF+GLL AC ++G+V+EGW  F++M DH I P  QHY+CV+DLLGRAG+L+QAY+ I  MP++PGV+VWGALLSACK
Subjt:  GHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACK

Query:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG
         HR V LGE AA+QLF +DP NTGHYVQLSNLYA+A LWD VA VR+ M +KGLNKD+G S +E+ G+L+ F VGD+SHPR +EI  +++ +E RLK  G
Subjt:  IHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAG

Query:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        +V + D+ LHDLN EE EETLC+HSER+A+AYG+IST  GT LRITKNLRACVNCH+A KLISKLVDREI++RD  RFHHFKDG CSCGD+W
Subjt:  YVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein4.5e-14839Show/hide
Query:  SLSTALSKVAATSHEALLRRKHLDQLYVQLI-VSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDMQV
        S S AL K    +   +  +    QL+ Q I    L+     ++  +    +L+ ++ A   F  +  P +L W ++I+ +T Q ++ + A+  + +M+ 
Subjt:  SLSTALSKVAATSHEALLRRKHLDQLYVQLI-VSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDMQV

Query:  SGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAK-------------------------------------FGQISSSRM
        SG  PD   F  VLK+C  M     G+ +HG   + G   +++  N+L++MYAK                                     FG I S R 
Subjt:  SGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAK-------------------------------------FGQISSSRM

Query:  VFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVAR
        VF+ +  + VVS+ +II+GY Q+G   +AL + +EM  +++KPD   L SV   +++  D+ +GK IHG V + G++ +  I  SL  MYAK  ++E + 
Subjt:  VFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVAR

Query:  FFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFA
          F+++   + I WN++++GY +NG   EA++LFR+M++  ++  +V   S I A A + +L L + L GY+ +  +  + F+ +AL+DMY+KCG+I  A
Subjt:  FFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFA

Query:  RSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGNLN
        R +FDRM   D V W+ +IMG+ LHGHG EA+SL+ EMK+ GV PN V FV +LTAC + GLV E W  F+ M + +G+    +HY+ V DLLGRAG L 
Subjt:  RSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGNLN

Query:  QAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHV
        +AY+FI  M ++P  SVW  LLS+C +H+ + L E  AE++F +D  N G YV + N+YAS   W  +A +RL M +KGL K    S IE+  K   F  
Subjt:  QAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHV

Query:  GDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRD
        GDRSHP   +I E L  + ++++  GYV     VLHD++ E   E L  HSERLAVA+GII+T PGTT+R+TKN+R C +CH AIK ISK+ +REII+RD
Subjt:  GDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRD

Query:  AKRFHHFKDGDCSCGDFW
          RFHHF  G+CSCGD+W
Subjt:  AKRFHHFKDGDCSCGDFW

AT3G26782.1 Tetratricopeptide repeat (TPR)-like superfamily protein8.2e-15042.03Show/hide
Query:  DILLWNAIIKGYTTQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLH
        D+  WN++I       +  A A+  ++ M+   ++P   +F   +KAC  +     GKQ H Q F +G+ S++FV ++L+ MY+  G++  +R VFD + 
Subjt:  DILLWNAIIKGYTTQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLH

Query:  NRTVVSWTSIISGYVQNGDPVEALSVFKEM------RQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQ--VEV
         R +VSWTS+I GY  NG+ ++A+S+FK++          M  D + LVSV +A + +   G  ++IH  V K G +    +  +L   YAK G+  V V
Subjt:  NRTVVSWTSIISGYVQNGDPVEALSVFKEM------RQSNMKPDWIALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQ--VEV

Query:  ARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMI-SKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSI
        AR  F+Q+   + + +N+++S YA++G   EA ++FR ++ +K +  +++T+ + +LA +  G+L + + +   + +    DD  V T++IDMY KCG +
Subjt:  ARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMI-SKNIRADSVTVRSAILAGAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSI

Query:  YFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDH-GIEPHHQHYSCVVDLLGRAG
          AR  FDRM +K+V  W+ MI GYG+HGH  +A+ L+  M  +GV PN +TFV +L AC +AGL  EGW  F+ M+   G+EP  +HY C+VDLLGRAG
Subjt:  YFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKEGWELFHQMQDH-GIEPHHQHYSCVVDLLGRAG

Query:  NLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQT
         L +AYD I  M +KP   +W +LL+AC+IH+ V L EI+  +LF LD  N G+Y+ LS++YA A  W  V  VR++M  +GL K  G S +E+NG++  
Subjt:  NLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQT

Query:  FHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREII
        F +GD  HP+ ++I+E L  L ++L  AGYV +  SV HD++ EE E TL  HSE+LA+A+GI++T PG+T+ + KNLR C +CH+ IKLISK+VDRE +
Subjt:  FHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREII

Query:  IRDAKRFHHFKDGDCSCGDFW
        +RDAKRFHHFKDG CSCGD+W
Subjt:  IRDAKRFHHFKDGDCSCGDFW

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein2.6e-14839.97Show/hide
Query:  DQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEG
        +QL+  ++ SG  +   +    V   L  + ++ A K F E+ E D++ WN+II GY + N +    + ++  M VSG+  D  T + V   C    +  
Subjt:  DQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDMQVSGVHPDCFTFLYVLKACGGMSVEG

Query:  IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDL
        +G+ +H    K  F       N+L+ MY+K G + S++ VF  + +R+VVS+TS+I+GY + G   EA+ +F+EM +  + PD   + +V         L
Subjt:  IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDWIALVSVTTAYTDMEDL

Query:  GQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMI-SKNIRADSVTVRSAILAGAQVG
         +GK +H  + +  L F+  +  +L  MYAK G ++ A   F++M   ++I WN +I GY+KN Y  EA+ LF  ++  K    D  TV   + A A + 
Subjt:  GQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMI-SKNIRADSVTVRSAILAGAQVG

Query:  SLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNA
        + +  R + GYI ++ Y  D  V  +L+DMYAKCG++  A  +FD +  KD+V W+VMI GYG+HG G+EAI+L+N+M+QAG+  ++++FV LL AC ++
Subjt:  SLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNA

Query:  GLVKEGWELFHQMQ-DHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYA
        GLV EGW  F+ M+ +  IEP  +HY+C+VD+L R G+L +AY FI +MPI P  ++WGALL  C+IH  V+L E  AE++F L+P NTG+YV ++N+YA
Subjt:  GLVKEGWELFHQMQ-DHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYA

Query:  SAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGI
         A  W+ V  +R  + Q+GL K+ G S IEI G++  F  GD S+P ++ I   L ++  R+   GY P     L D    E EE LC HSE+LA+A GI
Subjt:  SAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGI

Query:  ISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW
        IS+  G  +R+TKNLR C +CH   K +SKL  REI++RD+ RFH FKDG CSC  FW
Subjt:  ISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGDCSCGDFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCATGCATTCTTTTTCGCTCTTTCTCTCCTTGTCGTCGCTATCTACAGCTCTCTCAAAGGTGGCTGCAACCTCGCATGAGGCTTTATTGAGGAGGAAGCATTTGGA
TCAATTATACGTCCAGTTAATTGTGTCTGGACTAAACAAGTGCGGTTTCTTGGTGATCAAATTTGTCAAGGCATGTTTGCATCTCAGAGATATTAACTATGCGCACAAGG
CTTTTCATGAAGTCTTGGAACCGGATATTTTGTTGTGGAATGCCATCATAAAGGGTTACACCACTCAGAATAATGTTGTTGCTGGTGCTATCAGATTGTATACCGATATG
CAAGTATCAGGGGTGCACCCAGATTGCTTCACATTTTTGTATGTGCTTAAAGCATGCGGTGGCATGTCAGTCGAAGGAATAGGTAAACAGATGCATGGACAGACATTTAA
GTATGGCTTTGGATCAAATGTTTTTGTGCAGAACAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAATCTCATCTTCTAGGATGGTGTTTGATAGGTTACATAATAGAA
CTGTTGTTTCATGGACTTCCATCATTTCAGGGTATGTTCAGAATGGTGATCCAGTGGAAGCATTGAGTGTTTTCAAAGAAATGAGACAAAGTAATATGAAACCTGATTGG
ATTGCCCTTGTTAGTGTTACGACAGCATATACAGACATGGAGGATTTGGGACAAGGAAAGGCCATTCATGGCTTAGTGACTAAATTGGGTCTAGAATTTGAACCGGATAT
CGTGGTATCACTCACTACCATGTATGCCAAACGTGGACAGGTGGAAGTTGCCAGGTTTTTCTTTAATCAGATGGAAAAACCGAATTTAATTTTGTGGAATGCTATGATTT
CTGGTTATGCAAAAAATGGATATTGTGAAGAAGCAATCAAGCTATTCCGTGAGATGATATCAAAAAATATCAGAGCCGATTCTGTTACTGTGAGGTCTGCTATTCTAGCC
GGTGCCCAAGTGGGGTCTCTTGAACTTGCAAGATGGTTGGATGGTTATATCTGTAAGAGTGAGTACAGAGATGATACTTTTGTGAACACAGCCCTTATAGATATGTATGC
AAAATGTGGAAGCATATATTTTGCTCGTAGTGTTTTCGATAGAATGGTTGATAAAGACGTTGTTTTATGGAGTGTTATGATTATGGGGTATGGATTACATGGTCATGGAC
AAGAAGCCATCAGCCTTTACAATGAAATGAAGCAAGCTGGAGTTTGTCCAAATGACGTTACTTTCGTTGGTCTTCTCACAGCTTGCAAAAATGCAGGTCTTGTAAAAGAG
GGATGGGAGCTTTTCCACCAGATGCAAGACCATGGGATTGAACCGCATCATCAGCATTACTCTTGTGTGGTCGATCTTCTAGGACGTGCTGGTAATTTGAATCAAGCTTA
TGATTTTATTATGAGCATGCCAATTAAACCTGGAGTTAGTGTTTGGGGGGCACTTCTAAGTGCGTGCAAGATCCATCGACAAGTGAGGTTGGGAGAAATTGCTGCAGAAC
AGCTTTTCATATTAGATCCATATAATACAGGGCATTATGTGCAACTCTCAAACCTATATGCTTCTGCCCATTTATGGGATCACGTGGCAAATGTTCGATTAATGATGACG
CAGAAAGGACTGAACAAAGACCTCGGACATAGTTCTATTGAGATAAATGGAAAACTTCAAACTTTTCACGTCGGAGATAGATCACATCCTAGATCAAAGGAAATTTTTGA
AGAGCTTGATAGATTAGAGAAAAGATTAAAAGCTGCTGGTTATGTCCCTCATATGGATTCTGTTCTACATGACTTGAATCATGAGGAGATTGAGGAAACCCTTTGTAACC
ACAGTGAGAGGTTAGCAGTTGCTTATGGCATCATCAGTACTGCTCCTGGAACTACACTTAGAATAACCAAGAATCTCCGAGCATGCGTTAATTGCCATTCAGCGATAAAG
CTTATATCAAAGCTCGTCGATAGGGAAATAATTATTCGAGATGCGAAACGTTTTCATCATTTCAAAGATGGAGATTGTTCATGTGGAGATTTTTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCATGCATTCTTTTTCGCTCTTTCTCTCCTTGTCGTCGCTATCTACAGCTCTCTCAAAGGTGGCTGCAACCTCGCATGAGGCTTTATTGAGGAGGAAGCATTTGGA
TCAATTATACGTCCAGTTAATTGTGTCTGGACTAAACAAGTGCGGTTTCTTGGTGATCAAATTTGTCAAGGCATGTTTGCATCTCAGAGATATTAACTATGCGCACAAGG
CTTTTCATGAAGTCTTGGAACCGGATATTTTGTTGTGGAATGCCATCATAAAGGGTTACACCACTCAGAATAATGTTGTTGCTGGTGCTATCAGATTGTATACCGATATG
CAAGTATCAGGGGTGCACCCAGATTGCTTCACATTTTTGTATGTGCTTAAAGCATGCGGTGGCATGTCAGTCGAAGGAATAGGTAAACAGATGCATGGACAGACATTTAA
GTATGGCTTTGGATCAAATGTTTTTGTGCAGAACAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAATCTCATCTTCTAGGATGGTGTTTGATAGGTTACATAATAGAA
CTGTTGTTTCATGGACTTCCATCATTTCAGGGTATGTTCAGAATGGTGATCCAGTGGAAGCATTGAGTGTTTTCAAAGAAATGAGACAAAGTAATATGAAACCTGATTGG
ATTGCCCTTGTTAGTGTTACGACAGCATATACAGACATGGAGGATTTGGGACAAGGAAAGGCCATTCATGGCTTAGTGACTAAATTGGGTCTAGAATTTGAACCGGATAT
CGTGGTATCACTCACTACCATGTATGCCAAACGTGGACAGGTGGAAGTTGCCAGGTTTTTCTTTAATCAGATGGAAAAACCGAATTTAATTTTGTGGAATGCTATGATTT
CTGGTTATGCAAAAAATGGATATTGTGAAGAAGCAATCAAGCTATTCCGTGAGATGATATCAAAAAATATCAGAGCCGATTCTGTTACTGTGAGGTCTGCTATTCTAGCC
GGTGCCCAAGTGGGGTCTCTTGAACTTGCAAGATGGTTGGATGGTTATATCTGTAAGAGTGAGTACAGAGATGATACTTTTGTGAACACAGCCCTTATAGATATGTATGC
AAAATGTGGAAGCATATATTTTGCTCGTAGTGTTTTCGATAGAATGGTTGATAAAGACGTTGTTTTATGGAGTGTTATGATTATGGGGTATGGATTACATGGTCATGGAC
AAGAAGCCATCAGCCTTTACAATGAAATGAAGCAAGCTGGAGTTTGTCCAAATGACGTTACTTTCGTTGGTCTTCTCACAGCTTGCAAAAATGCAGGTCTTGTAAAAGAG
GGATGGGAGCTTTTCCACCAGATGCAAGACCATGGGATTGAACCGCATCATCAGCATTACTCTTGTGTGGTCGATCTTCTAGGACGTGCTGGTAATTTGAATCAAGCTTA
TGATTTTATTATGAGCATGCCAATTAAACCTGGAGTTAGTGTTTGGGGGGCACTTCTAAGTGCGTGCAAGATCCATCGACAAGTGAGGTTGGGAGAAATTGCTGCAGAAC
AGCTTTTCATATTAGATCCATATAATACAGGGCATTATGTGCAACTCTCAAACCTATATGCTTCTGCCCATTTATGGGATCACGTGGCAAATGTTCGATTAATGATGACG
CAGAAAGGACTGAACAAAGACCTCGGACATAGTTCTATTGAGATAAATGGAAAACTTCAAACTTTTCACGTCGGAGATAGATCACATCCTAGATCAAAGGAAATTTTTGA
AGAGCTTGATAGATTAGAGAAAAGATTAAAAGCTGCTGGTTATGTCCCTCATATGGATTCTGTTCTACATGACTTGAATCATGAGGAGATTGAGGAAACCCTTTGTAACC
ACAGTGAGAGGTTAGCAGTTGCTTATGGCATCATCAGTACTGCTCCTGGAACTACACTTAGAATAACCAAGAATCTCCGAGCATGCGTTAATTGCCATTCAGCGATAAAG
CTTATATCAAAGCTCGTCGATAGGGAAATAATTATTCGAGATGCGAAACGTTTTCATCATTTCAAAGATGGAGATTGTTCATGTGGAGATTTTTGGTGA
Protein sequenceShow/hide protein sequence
MSMHSFSLFLSLSSLSTALSKVAATSHEALLRRKHLDQLYVQLIVSGLNKCGFLVIKFVKACLHLRDINYAHKAFHEVLEPDILLWNAIIKGYTTQNNVVAGAIRLYTDM
QVSGVHPDCFTFLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQISSSRMVFDRLHNRTVVSWTSIISGYVQNGDPVEALSVFKEMRQSNMKPDW
IALVSVTTAYTDMEDLGQGKAIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYCEEAIKLFREMISKNIRADSVTVRSAILA
GAQVGSLELARWLDGYICKSEYRDDTFVNTALIDMYAKCGSIYFARSVFDRMVDKDVVLWSVMIMGYGLHGHGQEAISLYNEMKQAGVCPNDVTFVGLLTACKNAGLVKE
GWELFHQMQDHGIEPHHQHYSCVVDLLGRAGNLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMT
QKGLNKDLGHSSIEINGKLQTFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPHMDSVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIK
LISKLVDREIIIRDAKRFHHFKDGDCSCGDFW