; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Csor.00g018690 (gene) of Silver-seed gourd (wild; sororia) v1 genome

Gene IDCsor.00g018690
OrganismCucurbita argyrosperma subsp. sororia (Silver-seed gourd (wild; sororia) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCsor_Chr04:1374797..1377963
RNA-Seq ExpressionCsor.00g018690
SyntenyCsor.00g018690
Gene Ontology termsGO:0080156 - mitochondrial mRNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600109.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0100Show/hide
Query:  MAILLSFNDYGVHLRRPPPYPLRLSSCCNRTASSLLGLSAMSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVN
        MAILLSFNDYGVHLRRPPPYPLRLSSCCNRTASSLLGLSAMSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVN
Subjt:  MAILLSFNDYGVHLRRPPPYPLRLSSCCNRTASSLLGLSAMSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVN

Query:  ACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLV
        ACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLV
Subjt:  ACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLV

Query:  SMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSL
        SMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSL
Subjt:  SMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSL

Query:  TNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTA
        TNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTA
Subjt:  TNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTA

Query:  LIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYS
        LIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYS
Subjt:  LIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYS

Query:  CVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHS
        CVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHS
Subjt:  CVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHS

Query:  SIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKL
        SIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKL
Subjt:  SIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKL

Query:  ISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        ISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
Subjt:  ISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

KAG7030779.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma]0.099.42Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA
        MSLHSFSLSLSLSS+STALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA

Query:  GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDP
        GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGL SNVFVQNSLVSMYARFGQTSSARLVFDKLH RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDP

Query:  VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY
        VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY
Subjt:  VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY

Query:  GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI
        HGQEAIDLYNRMKQSGV PNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI
Subjt:  HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI

Query:  HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
Subjt:  VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

XP_022941625.1 pentatricopeptide repeat-containing protein At3g12770 isoform X1 [Cucurbita moschata]0.097.99Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLR------RKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYT
        MSLHSFSLSLSLSSLSTALSKAAATSQEALLR      RKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYT
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLR------RKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYT

Query:  QNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGY
        QNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGL SNVFVQNSLVSMYARFGQTSSARLVFDKLH RTVVSWTSIISGY
Subjt:  QNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGY

Query:  VQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISG
        VQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISG
Subjt:  VQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISG

Query:  YAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIM
        YAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIM
Subjt:  YAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIM

Query:  GYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGAL
        GYGLHGHGQEAIDLYNRMKQSGV PN+VTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGAL
Subjt:  GYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGAL

Query:  LSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
        LSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
Subjt:  LSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR

Query:  LKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        LKAAGYVAHMESVLHDLN EEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV+REIIVRDAKRFH+FKDGVCSCGDFW
Subjt:  LKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

XP_022941627.1 pentatricopeptide repeat-containing protein At3g12770 isoform X2 [Cucurbita moschata]0.098.84Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA
        MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA

Query:  GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDP
        GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGL SNVFVQNSLVSMYARFGQTSSARLVFDKLH RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDP

Query:  VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY
        VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY
Subjt:  VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY

Query:  GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI
        HGQEAIDLYNRMKQSGV PN+VTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI
Subjt:  HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI

Query:  HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        VAHMESVLHDLN EEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV+REIIVRDAKRFH+FKDGVCSCGDFW
Subjt:  VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

XP_023525471.1 pentatricopeptide repeat-containing protein At3g12770 [Cucurbita pepo subsp. pepo]0.098.55Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA
        MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQL VSGLYKCGFLVIKF+NACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA

Query:  GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDP
        GAIRMYKDMQVSGVNPDCFTFLYVLKAC GMSVEGIGKQMHSQTFKYGL SNVFVQNSLVSMYARFGQTSSARLVFDKLH RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDP

Query:  VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY
        VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY
Subjt:  VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY

Query:  GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFAR VFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI
        HGQEAIDLYNRMKQSGV PNDVTFVGLLTACKNSGLVKEGWELFHQMRD+GIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI
Subjt:  HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI

Query:  HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV+REIIVRDAKRFHHFKDGVCSCGDFW
Subjt:  VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

TrEMBL top hitse value%identityAlignment
A0A6J1CD43 pentatricopeptide repeat-containing protein At3g127700.089.15Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA
        MSLHSFSLSLSLSSLSTA SK AATSQEALL RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHL DV YAHK FREVLEPDILLWN +IKGYTQNNIF 
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA

Query:  GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDP
        GA+++Y +MQVSGV+PDCFTFLYVLKACGGMS+E IGKQMH QTFKYG  SNVFVQNSLVSMYA+FGQTS AR+VFDKL  RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDP

Query:  VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY
         +AL VFK MR+S VKLDWI LVSV+TAYTDMEDLGQGK+IH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY

Query:  GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKNI VDSVTVRSAILA AQ GSL+LARWLDGYISKSEYRDD FVNTALIDM+AKCGSI FA  VFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI
        HG+EAI+LYN MKQ GV PNDVTFVGLLTACKNSGLVKEGW+LFH++RDHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIM+MPIKPGVSVWGALLS CKI
Subjt:  HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI

Query:  HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQV+LGEIAAEQLF LDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        + HMESVLHDLN EEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLV+REII+RDAKRFH FKDG CSCGDFW
Subjt:  VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

A0A6J1FLM1 pentatricopeptide repeat-containing protein At3g12770 isoform X10.097.99Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLR------RKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYT
        MSLHSFSLSLSLSSLSTALSKAAATSQEALLR      RKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYT
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLR------RKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYT

Query:  QNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGY
        QNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGL SNVFVQNSLVSMYARFGQTSSARLVFDKLH RTVVSWTSIISGY
Subjt:  QNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGY

Query:  VQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISG
        VQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISG
Subjt:  VQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISG

Query:  YAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIM
        YAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIM
Subjt:  YAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIM

Query:  GYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGAL
        GYGLHGHGQEAIDLYNRMKQSGV PN+VTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGAL
Subjt:  GYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGAL

Query:  LSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
        LSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
Subjt:  LSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR

Query:  LKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        LKAAGYVAHMESVLHDLN EEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV+REIIVRDAKRFH+FKDGVCSCGDFW
Subjt:  LKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

A0A6J1FN08 pentatricopeptide repeat-containing protein At3g12770 isoform X20.098.84Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA
        MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA

Query:  GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDP
        GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGL SNVFVQNSLVSMYARFGQTSSARLVFDKLH RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDP

Query:  VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY
        VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY
Subjt:  VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY

Query:  GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI
        HGQEAIDLYNRMKQSGV PN+VTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI
Subjt:  HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI

Query:  HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        VAHMESVLHDLN EEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV+REIIVRDAKRFH+FKDGVCSCGDFW
Subjt:  VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

A0A6J1JD09 pentatricopeptide repeat-containing protein At3g12770 isoform X10.095.84Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLR------RKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYT
        MSLHSFSLSLSL+SLSTALSKAAATSQEALLR      RKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYT
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLR------RKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYT

Query:  QNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGY
        QNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYG  SNVFVQNSLVSMYAR+GQTSSARLVFDKLH RTVVSWTSIISGY
Subjt:  QNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGY

Query:  VQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISG
        VQNGDP+DALRVFKDMR+STVKLDWIVLVSV+TAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAK GRVE+ARFFFNQMEKPNLLLWNAMISG
Subjt:  VQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISG

Query:  YAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIM
        YAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQ GSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFAR VFDRMVDKD+V WSAMIM
Subjt:  YAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIM

Query:  GYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGAL
        GYGLHGHGQEAIDLYNRMKQSG+ PNDVTFVGLLTACKNSGLVKEGWELFHQM+DHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGAL
Subjt:  GYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGAL

Query:  LSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
        LSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWN V NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
Subjt:  LSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR

Query:  LKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        LKAAGYVAHMESVLHDLN EEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV+REII+RD KRFHHFKDGVCSCGDFW
Subjt:  LKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

A0A6J1JH82 pentatricopeptide repeat-containing protein At3g12770 isoform X20.096.67Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA
        MSLHSFSLSLSL+SLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFA

Query:  GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDP
        GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYG  SNVFVQNSLVSMYAR+GQTSSARLVFDKLH RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDP

Query:  VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY
        +DALRVFKDMR+STVKLDWIVLVSV+TAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAK GRVE+ARFFFNQMEKPNLLLWNAMISGYAKNGY
Subjt:  VDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGY

Query:  GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAIELFRKMISKNIGVDSVTVRSAILAVAQ GSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFAR VFDRMVDKD+V WSAMIMGYGLHG
Subjt:  GEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI
        HGQEAIDLYNRMKQSG+ PNDVTFVGLLTACKNSGLVKEGWELFHQM+DHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI
Subjt:  HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKI

Query:  HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWN V NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        VAHMESVLHDLN EEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV+REII+RD KRFHHFKDGVCSCGDFW
Subjt:  VAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic3.3e-15441.53Show/hide
Query:  QLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIG
        +++  L+ SG     F +    N     R VN A KVF  + E D++ WN I+ GY+QN +   A+ M K M    + P   T + VL A   + +  +G
Subjt:  QLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIG

Query:  KQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQ
        K++H    + G  S V +  +LV MYA+ G   +AR +FD +  R VVSW S+I  YVQN +P +A+ +F+ M    VK   + ++  + A  D+ DL +
Subjt:  KQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQ

Query:  GKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLE
        G+ IH L  +LGL+    +V SL +MY KC  V+ A   F +++   L+ WNAMI G+A+NG   +A+  F +M S+ +  D+ T  S I A+A+     
Subjt:  GKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLE

Query:  LARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLV
         A+W+ G + +S    +VFV TAL+DM+AKCG+I  AR +FD M ++ V  W+AMI GYG HG G+ A++L+  M++  + PN VTF+ +++AC +SGLV
Subjt:  LARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLV

Query:  KEGWELFHQMRD-HGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAH
        + G + F+ M++ + IE    HY  +VDLLGRAG LN A+DFIM MP+KP V+V+GA+L  C+IH+ V   E AAE+LF L+P + G++V L+N+Y +A 
Subjt:  KEGWELFHQMRD-HGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAH

Query:  LWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIIST
        +W  V  VR+ M ++GL K  G S +EI   + +F  G  +HP SK+I+  L++L   +K AGYV     VL   ND + E+ L  HSE+LA+++G+++T
Subjt:  LWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIIST

Query:  APGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
          GTT+ + KNLR C +CH+A K IS +  REI+VRD +RFHHFK+G CSCGD+W
Subjt:  APGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.1e-14437.83Show/hide
Query:  LYVQLIVSGLYKCGFLVIKFVNACL---HLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEG
        ++ Q+I  GL+   + + K +  C+   H   + YA  VF+ + EP++L+WN + +G+  ++    A+++Y  M   G+ P+ +TF +VLK+C       
Subjt:  LYVQLIVSGLYKCGFLVIKFVNACL---HLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEG

Query:  IGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYR-------------------------------TVVSWTSIISGYVQNGDPVDAL
         G+Q+H    K G   +++V  SL+SMY + G+   A  VFDK  +R                                VVSW ++ISGY + G+  +AL
Subjt:  IGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYR-------------------------------TVVSWTSIISGYVQNGDPVDAL

Query:  RVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEA
         +FKDM ++ V+ D   +V+VV+A      +  G+ +H  +   G      IV +L ++Y+KCG +E A   F ++   +++ WN +I GY      +EA
Subjt:  RVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEA

Query:  IELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISK--SEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHG
        + LF++M+      + VT+ S + A A  G++++ RW+  YI K      +   + T+LIDM+AKCG I  A  VF+ ++ K +  W+AMI G+ +HG  
Subjt:  IELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISK--SEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHG

Query:  QEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQM-RDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIH
          + DL++RM++ G+ P+D+TFVGLL+AC +SG++  G  +F  M +D+ + P  +HY C++DLLG +G    A + I  M ++P   +W +LL  CK+H
Subjt:  QEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQM-RDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIH

Query:  RQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYV
          V LGE  AE L  ++P N G YV LSN+YASA  WN V   R ++  KG+ K  G SSIEI+  +  F +GD+ HPR++EI+  L+ +E  L+ AG+V
Subjt:  RQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYV

Query:  AHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
             VL ++ +E  E  L +HSE+LA+A+G+IST PGT L I KNLR C NCH A KLISK+  REII RD  RFHHF+DGVCSC D+W
Subjt:  AHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127706.1e-24157.81Show/hide
Query:  EALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKA
        ++   +  L Q++ +L+V GL   GFL+ K ++A     D+ +A +VF ++  P I  WN II+GY++NN F  A+ MY +MQ++ V+PD FTF ++LKA
Subjt:  EALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKA

Query:  CGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFD--KLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSV
        C G+S   +G+ +H+Q F+ G  ++VFVQN L+++YA+  +  SAR VF+   L  RT+VSWT+I+S Y QNG+P++AL +F  MR+  VK DW+ LVSV
Subjt:  CGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFD--KLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSV

Query:  VTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMISKNIGVDSVTVRS
        + A+T ++DL QG++IH+ V K+GLE EPD+++SL  MYAKCG+V  A+  F++M+ PNL+LWNAMISGYAKNGY  EAI++F +MI+K++  D++++ S
Subjt:  VTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMISKNIGVDSVTVRS

Query:  AILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFV
        AI A AQ GSLE AR +  Y+ +S+YRDDVF+++ALIDM AKCGS+  AR VFDR +D+DVV+WSAMI+GYGLHG  +EAI LY  M++ GVHPNDVTF+
Subjt:  AILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFV

Query:  GLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHY
        GLL AC +SG+V+EGW  F++M DH I P  QHY+CV+DLLGRAG+L++AY+ I  MP++PGV+VWGALLS CK HR V LGE AA+QLF +DP NTGHY
Subjt:  GLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHY

Query:  VQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSE
        VQLSNLYA+A LW+ V  VR+ M +KGLNKD+G S +E+ G LE F VGD+SHPR +EI  +++ +E RLK  G+VA+ ++ LHDLNDEE EETLC+HSE
Subjt:  VQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSE

Query:  RLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        R+A+AYG+IST  GT LRITKNLRACVNCH+A KLISKLV+REI+VRD  RFHHFKDGVCSCGD+W
Subjt:  RLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

Q9LW32 Pentatricopeptide repeat-containing protein At3g26782, mitochondrial9.6e-15442.97Show/hide
Query:  REVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLV
        R V + D+  WN +I    ++   A A+  +  M+   + P   +F   +KAC  +     GKQ H Q F +G +S++FV ++L+ MY+  G+   AR V
Subjt:  REVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLV

Query:  FDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDM------RRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGR
        FD++  R +VSWTS+I GY  NG+ +DA+ +FKD+          + LD + LVSV++A + +   G  ++IHS V K G +    +  +L + YAK G 
Subjt:  FDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDM------RRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGR

Query:  --VEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMI-SKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHA
          V VAR  F+Q+   + + +N+++S YA++G   EA E+FR+++ +K +  +++T+ + +LAV+ +G+L + + +   + +    DDV V T++IDM+ 
Subjt:  --VEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMI-SKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHA

Query:  KCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH-GIEPHHQHYSCVVDL
        KCG +  AR  FDRM +K+V  W+AMI GYG+HGH  +A++L+  M  SGV PN +TFV +L AC ++GL  EGW  F+ M+   G+EP  +HY C+VDL
Subjt:  KCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH-GIEPHHQHYSCVVDL

Query:  LGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEIN
        LGRAG+L +AYD I  M +KP   +W +LL+ C+IH+ V L EI+  +LF LD  N G+Y+ LS++YA A  W  V  VR++M  +GL K  G S +E+N
Subjt:  LGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEIN

Query:  GNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV
        G +  F +GD  HP+ ++I+E L  L R+L  AGYV++  SV HD+++EE E TL  HSE+LA+A+GI++T PG+T+ + KNLR C +CH+ IKLISK+V
Subjt:  GNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV

Query:  NREIIVRDAKRFHHFKDGVCSCGDFW
        +RE +VRDAKRFHHFKDG CSCGD+W
Subjt:  NREIIVRDAKRFHHFKDGVCSCGDFW

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic1.3e-14539.73Show/hide
Query:  DQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGI
        +QL+  ++ SG  +   +    V   L  + V+ A KVF E+ E D++ WN II GY  N +    + ++  M VSG+  D  T + V   C    +  +
Subjt:  DQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGI

Query:  GKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLG
        G+ +HS   K          N+L+ MY++ G   SA+ VF ++  R+VVS+TS+I+GY + G   +A+++F++M    +  D   + +V+        L 
Subjt:  GKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLG

Query:  QGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMI-SKNIGVDSVTVRSAILAVAQAGS
        +GK +H  + +  L F+  +  +L +MYAKCG ++ A   F++M   +++ WN +I GY+KN Y  EA+ LF  ++  K    D  TV   + A A   +
Subjt:  QGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMI-SKNIGVDSVTVRSAILAVAQAGS

Query:  LELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSG
         +  R + GYI ++ Y  D  V  +L+DM+AKCG++  A  +FD +  KD+V W+ MI GYG+HG G+EAI L+N+M+Q+G+  ++++FV LL AC +SG
Subjt:  LELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSG

Query:  LVKEGWELFHQMR-DHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYAS
        LV EGW  F+ MR +  IEP  +HY+C+VD+L R G L +AY FI +MPI P  ++WGALL GC+IH  V+L E  AE++F L+P NTG+YV ++N+YA 
Subjt:  LVKEGWELFHQMR-DHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYAS

Query:  AHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGII
        A  W  V+ +R  + Q+GL K+ G S IEI G +  F  GD S+P ++ I   L ++  R+   GY    +  L D  + E EE LC HSE+LA+A GII
Subjt:  AHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGII

Query:  STAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        S+  G  +R+TKNLR C +CH   K +SKL  REI++RD+ RFH FKDG CSC  FW
Subjt:  STAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.6e-14637.83Show/hide
Query:  LYVQLIVSGLYKCGFLVIKFVNACL---HLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEG
        ++ Q+I  GL+   + + K +  C+   H   + YA  VF+ + EP++L+WN + +G+  ++    A+++Y  M   G+ P+ +TF +VLK+C       
Subjt:  LYVQLIVSGLYKCGFLVIKFVNACL---HLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEG

Query:  IGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYR-------------------------------TVVSWTSIISGYVQNGDPVDAL
         G+Q+H    K G   +++V  SL+SMY + G+   A  VFDK  +R                                VVSW ++ISGY + G+  +AL
Subjt:  IGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYR-------------------------------TVVSWTSIISGYVQNGDPVDAL

Query:  RVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEA
         +FKDM ++ V+ D   +V+VV+A      +  G+ +H  +   G      IV +L ++Y+KCG +E A   F ++   +++ WN +I GY      +EA
Subjt:  RVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEA

Query:  IELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISK--SEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHG
        + LF++M+      + VT+ S + A A  G++++ RW+  YI K      +   + T+LIDM+AKCG I  A  VF+ ++ K +  W+AMI G+ +HG  
Subjt:  IELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISK--SEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHG

Query:  QEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQM-RDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIH
          + DL++RM++ G+ P+D+TFVGLL+AC +SG++  G  +F  M +D+ + P  +HY C++DLLG +G    A + I  M ++P   +W +LL  CK+H
Subjt:  QEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQM-RDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIH

Query:  RQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYV
          V LGE  AE L  ++P N G YV LSN+YASA  WN V   R ++  KG+ K  G SSIEI+  +  F +GD+ HPR++EI+  L+ +E  L+ AG+V
Subjt:  RQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYV

Query:  AHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
             VL ++ +E  E  L +HSE+LA+A+G+IST PGT L I KNLR C NCH A KLISK+  REII RD  RFHHF+DGVCSC D+W
Subjt:  AHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein2.3e-15541.53Show/hide
Query:  QLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIG
        +++  L+ SG     F +    N     R VN A KVF  + E D++ WN I+ GY+QN +   A+ M K M    + P   T + VL A   + +  +G
Subjt:  QLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIG

Query:  KQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQ
        K++H    + G  S V +  +LV MYA+ G   +AR +FD +  R VVSW S+I  YVQN +P +A+ +F+ M    VK   + ++  + A  D+ DL +
Subjt:  KQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQ

Query:  GKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLE
        G+ IH L  +LGL+    +V SL +MY KC  V+ A   F +++   L+ WNAMI G+A+NG   +A+  F +M S+ +  D+ T  S I A+A+     
Subjt:  GKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLE

Query:  LARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLV
         A+W+ G + +S    +VFV TAL+DM+AKCG+I  AR +FD M ++ V  W+AMI GYG HG G+ A++L+  M++  + PN VTF+ +++AC +SGLV
Subjt:  LARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLV

Query:  KEGWELFHQMRD-HGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAH
        + G + F+ M++ + IE    HY  +VDLLGRAG LN A+DFIM MP+KP V+V+GA+L  C+IH+ V   E AAE+LF L+P + G++V L+N+Y +A 
Subjt:  KEGWELFHQMRD-HGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAH

Query:  LWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIIST
        +W  V  VR+ M ++GL K  G S +EI   + +F  G  +HP SK+I+  L++L   +K AGYV     VL   ND + E+ L  HSE+LA+++G+++T
Subjt:  LWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIIST

Query:  APGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
          GTT+ + KNLR C +CH+A K IS +  REI+VRD +RFHHFK+G CSCGD+W
Subjt:  APGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

AT3G12770.1 mitochondrial editing factor 224.3e-24257.81Show/hide
Query:  EALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKA
        ++   +  L Q++ +L+V GL   GFL+ K ++A     D+ +A +VF ++  P I  WN II+GY++NN F  A+ MY +MQ++ V+PD FTF ++LKA
Subjt:  EALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKA

Query:  CGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFD--KLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSV
        C G+S   +G+ +H+Q F+ G  ++VFVQN L+++YA+  +  SAR VF+   L  RT+VSWT+I+S Y QNG+P++AL +F  MR+  VK DW+ LVSV
Subjt:  CGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFD--KLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSV

Query:  VTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMISKNIGVDSVTVRS
        + A+T ++DL QG++IH+ V K+GLE EPD+++SL  MYAKCG+V  A+  F++M+ PNL+LWNAMISGYAKNGY  EAI++F +MI+K++  D++++ S
Subjt:  VTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMISKNIGVDSVTVRS

Query:  AILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFV
        AI A AQ GSLE AR +  Y+ +S+YRDDVF+++ALIDM AKCGS+  AR VFDR +D+DVV+WSAMI+GYGLHG  +EAI LY  M++ GVHPNDVTF+
Subjt:  AILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFV

Query:  GLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHY
        GLL AC +SG+V+EGW  F++M DH I P  QHY+CV+DLLGRAG+L++AY+ I  MP++PGV+VWGALLS CK HR V LGE AA+QLF +DP NTGHY
Subjt:  GLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHY

Query:  VQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSE
        VQLSNLYA+A LW+ V  VR+ M +KGLNKD+G S +E+ G LE F VGD+SHPR +EI  +++ +E RLK  G+VA+ ++ LHDLNDEE EETLC+HSE
Subjt:  VQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSE

Query:  RLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        R+A+AYG+IST  GT LRITKNLRACVNCH+A KLISKLV+REI+VRD  RFHHFKDGVCSCGD+W
Subjt:  RLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW

AT3G26782.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.8e-15542.97Show/hide
Query:  REVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLV
        R V + D+  WN +I    ++   A A+  +  M+   + P   +F   +KAC  +     GKQ H Q F +G +S++FV ++L+ MY+  G+   AR V
Subjt:  REVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLV

Query:  FDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDM------RRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGR
        FD++  R +VSWTS+I GY  NG+ +DA+ +FKD+          + LD + LVSV++A + +   G  ++IHS V K G +    +  +L + YAK G 
Subjt:  FDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDM------RRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGR

Query:  --VEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMI-SKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHA
          V VAR  F+Q+   + + +N+++S YA++G   EA E+FR+++ +K +  +++T+ + +LAV+ +G+L + + +   + +    DDV V T++IDM+ 
Subjt:  --VEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMI-SKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHA

Query:  KCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH-GIEPHHQHYSCVVDL
        KCG +  AR  FDRM +K+V  W+AMI GYG+HGH  +A++L+  M  SGV PN +TFV +L AC ++GL  EGW  F+ M+   G+EP  +HY C+VDL
Subjt:  KCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH-GIEPHHQHYSCVVDL

Query:  LGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEIN
        LGRAG+L +AYD I  M +KP   +W +LL+ C+IH+ V L EI+  +LF LD  N G+Y+ LS++YA A  W  V  VR++M  +GL K  G S +E+N
Subjt:  LGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEIN

Query:  GNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV
        G +  F +GD  HP+ ++I+E L  L R+L  AGYV++  SV HD+++EE E TL  HSE+LA+A+GI++T PG+T+ + KNLR C +CH+ IKLISK+V
Subjt:  GNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV

Query:  NREIIVRDAKRFHHFKDGVCSCGDFW
        +RE +VRDAKRFHHFKDG CSCGD+W
Subjt:  NREIIVRDAKRFHHFKDGVCSCGDFW

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein8.9e-14739.73Show/hide
Query:  DQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGI
        +QL+  ++ SG  +   +    V   L  + V+ A KVF E+ E D++ WN II GY  N +    + ++  M VSG+  D  T + V   C    +  +
Subjt:  DQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNYAHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGI

Query:  GKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLG
        G+ +HS   K          N+L+ MY++ G   SA+ VF ++  R+VVS+TS+I+GY + G   +A+++F++M    +  D   + +V+        L 
Subjt:  GKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLHYRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLG

Query:  QGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMI-SKNIGVDSVTVRSAILAVAQAGS
        +GK +H  + +  L F+  +  +L +MYAKCG ++ A   F++M   +++ WN +I GY+KN Y  EA+ LF  ++  K    D  TV   + A A   +
Subjt:  QGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNAMISGYAKNGYGEEAIELFRKMI-SKNIGVDSVTVRSAILAVAQAGS

Query:  LELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSG
         +  R + GYI ++ Y  D  V  +L+DM+AKCG++  A  +FD +  KD+V W+ MI GYG+HG G+EAI L+N+M+Q+G+  ++++FV LL AC +SG
Subjt:  LELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHGHGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSG

Query:  LVKEGWELFHQMR-DHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYAS
        LV EGW  F+ MR +  IEP  +HY+C+VD+L R G L +AY FI +MPI P  ++WGALL GC+IH  V+L E  AE++F L+P NTG+YV ++N+YA 
Subjt:  LVKEGWELFHQMR-DHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIAAEQLFLLDPYNTGHYVQLSNLYAS

Query:  AHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGII
        A  W  V+ +R  + Q+GL K+ G S IEI G +  F  GD S+P ++ I   L ++  R+   GY    +  L D  + E EE LC HSE+LA+A GII
Subjt:  AHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETLCNHSERLAVAYGII

Query:  STAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW
        S+  G  +R+TKNLR C +CH   K +SKL  REI++RD+ RFH FKDG CSC  FW
Subjt:  STAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATTCTCCTCAGTTTCAACGATTATGGGGTCCATCTTCGCCGGCCGCCGCCATATCCCCTCCGTCTCTCAAGCTGCTGCAATCGTACAGCTTCGTCGCTCCTCGG
ACTTTCAGCCATGTCTTTGCATTCATTTTCGCTCTCTCTCTCCTTGTCGTCGCTATCTACAGCTCTCTCAAAGGCGGCGGCAACCTCGCAGGAGGCTTTATTGAGGAGGA
AGCATTTGGATCAATTATACGTCCAGTTAATTGTGTCTGGGCTATACAAGTGTGGTTTCTTGGTTATCAAATTTGTCAATGCATGTTTGCATCTCAGAGATGTTAACTAC
GCGCATAAGGTTTTTCGTGAAGTCTTAGAACCAGATATCTTGTTGTGGAATGGCATCATAAAGGGCTACACTCAGAACAATATTTTTGCTGGGGCTATCAGAATGTATAA
GGATATGCAAGTGTCAGGGGTGAACCCAGATTGCTTCACATTTTTGTATGTGCTTAAAGCGTGCGGTGGAATGTCGGTCGAAGGAATAGGTAAACAGATGCATAGCCAGA
CGTTTAAATATGGCCTTAGATCAAATGTGTTTGTGCAGAACAGTCTTGTGTCAATGTATGCTAGATTTGGCCAAACCTCATCTGCTAGGCTCGTCTTCGATAAGTTACAT
TATCGAACTGTTGTTTCGTGGACGTCCATCATTTCTGGGTACGTTCAGAATGGCGATCCCGTGGACGCGTTGAGAGTTTTCAAAGATATGAGGCGAAGTACTGTGAAACT
TGATTGGATTGTCCTTGTTAGTGTTGTGACAGCCTACACAGACATGGAGGATTTGGGGCAAGGGAAAGCCATTCATAGCTTAGTGACTAAATTGGGTCTAGAATTCGAAC
CCGACATAGTGGTCTCGCTCACTAACATGTATGCTAAATGTGGACGGGTGGAAGTTGCTAGATTTTTCTTTAATCAGATGGAAAAACCAAATTTACTTTTGTGGAATGCT
ATGATTTCTGGTTATGCAAAAAATGGATATGGTGAAGAAGCAATCGAGCTATTCCGTAAGATGATTTCAAAGAATATTGGGGTCGATTCTGTTACTGTGAGGTCTGCTAT
TCTAGCCGTTGCCCAAGCGGGGTCTCTTGAACTAGCAAGATGGTTGGATGGTTATATCTCTAAGAGTGAGTACCGAGACGATGTTTTCGTGAACACAGCCCTTATAGATA
TGCATGCAAAATGTGGAAGCATATGTTTTGCTCGTGGTGTTTTCGATAGAATGGTCGATAAAGACGTCGTCTTATGGAGTGCTATGATTATGGGGTATGGATTACACGGT
CATGGACAAGAAGCCATCGACCTTTACAACAGAATGAAGCAATCTGGAGTTCATCCGAACGATGTTACTTTTGTTGGCCTTCTCACAGCATGTAAAAACTCGGGTCTTGT
AAAAGAGGGATGGGAGCTTTTCCACCAGATGCGAGACCACGGGATTGAACCGCATCACCAGCATTACTCTTGCGTGGTCGATCTTCTGGGACGTGCAGGCTATTTGAATC
GAGCTTATGATTTTATTATGAGCATGCCCATTAAACCTGGAGTTAGTGTTTGGGGGGCACTTTTAAGTGGATGTAAGATCCATCGTCAAGTGAGGTTGGGAGAGATAGCT
GCAGAACAGCTTTTCTTATTAGATCCATATAATACAGGTCATTATGTACAACTCTCAAACTTATATGCTTCTGCCCATTTATGGAACCACGTGAGGAACGTTCGATTAAT
GATGACGCAGAAAGGACTGAACAAGGACCTCGGACATAGTTCGATTGAGATCAACGGAAATCTCGAAACGTTCCATGTTGGAGATAGATCACATCCGAGATCGAAGGAAA
TCTTTGAAGAACTTGATAGATTGGAGAGGAGATTAAAGGCAGCTGGTTATGTTGCTCATATGGAATCTGTTCTACATGACTTGAATGATGAGGAGATTGAGGAAACTCTT
TGTAACCATAGTGAGAGGTTAGCAGTTGCTTATGGCATCATCAGTACTGCTCCTGGAACTACACTTAGAATAACGAAAAATCTCCGTGCGTGCGTTAATTGTCATTCGGC
GATAAAGCTAATATCGAAGCTTGTCAATAGGGAAATAATTGTTCGAGATGCGAAACGCTTTCATCATTTCAAAGATGGAGTTTGTTCGTGCGGAGATTTTTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGATTCTCCTCAGTTTCAACGATTATGGGGTCCATCTTCGCCGGCCGCCGCCATATCCCCTCCGTCTCTCAAGCTGCTGCAATCGTACAGCTTCGTCGCTCCTCGG
ACTTTCAGCCATGTCTTTGCATTCATTTTCGCTCTCTCTCTCCTTGTCGTCGCTATCTACAGCTCTCTCAAAGGCGGCGGCAACCTCGCAGGAGGCTTTATTGAGGAGGA
AGCATTTGGATCAATTATACGTCCAGTTAATTGTGTCTGGGCTATACAAGTGTGGTTTCTTGGTTATCAAATTTGTCAATGCATGTTTGCATCTCAGAGATGTTAACTAC
GCGCATAAGGTTTTTCGTGAAGTCTTAGAACCAGATATCTTGTTGTGGAATGGCATCATAAAGGGCTACACTCAGAACAATATTTTTGCTGGGGCTATCAGAATGTATAA
GGATATGCAAGTGTCAGGGGTGAACCCAGATTGCTTCACATTTTTGTATGTGCTTAAAGCGTGCGGTGGAATGTCGGTCGAAGGAATAGGTAAACAGATGCATAGCCAGA
CGTTTAAATATGGCCTTAGATCAAATGTGTTTGTGCAGAACAGTCTTGTGTCAATGTATGCTAGATTTGGCCAAACCTCATCTGCTAGGCTCGTCTTCGATAAGTTACAT
TATCGAACTGTTGTTTCGTGGACGTCCATCATTTCTGGGTACGTTCAGAATGGCGATCCCGTGGACGCGTTGAGAGTTTTCAAAGATATGAGGCGAAGTACTGTGAAACT
TGATTGGATTGTCCTTGTTAGTGTTGTGACAGCCTACACAGACATGGAGGATTTGGGGCAAGGGAAAGCCATTCATAGCTTAGTGACTAAATTGGGTCTAGAATTCGAAC
CCGACATAGTGGTCTCGCTCACTAACATGTATGCTAAATGTGGACGGGTGGAAGTTGCTAGATTTTTCTTTAATCAGATGGAAAAACCAAATTTACTTTTGTGGAATGCT
ATGATTTCTGGTTATGCAAAAAATGGATATGGTGAAGAAGCAATCGAGCTATTCCGTAAGATGATTTCAAAGAATATTGGGGTCGATTCTGTTACTGTGAGGTCTGCTAT
TCTAGCCGTTGCCCAAGCGGGGTCTCTTGAACTAGCAAGATGGTTGGATGGTTATATCTCTAAGAGTGAGTACCGAGACGATGTTTTCGTGAACACAGCCCTTATAGATA
TGCATGCAAAATGTGGAAGCATATGTTTTGCTCGTGGTGTTTTCGATAGAATGGTCGATAAAGACGTCGTCTTATGGAGTGCTATGATTATGGGGTATGGATTACACGGT
CATGGACAAGAAGCCATCGACCTTTACAACAGAATGAAGCAATCTGGAGTTCATCCGAACGATGTTACTTTTGTTGGCCTTCTCACAGCATGTAAAAACTCGGGTCTTGT
AAAAGAGGGATGGGAGCTTTTCCACCAGATGCGAGACCACGGGATTGAACCGCATCACCAGCATTACTCTTGCGTGGTCGATCTTCTGGGACGTGCAGGCTATTTGAATC
GAGCTTATGATTTTATTATGAGCATGCCCATTAAACCTGGAGTTAGTGTTTGGGGGGCACTTTTAAGTGGATGTAAGATCCATCGTCAAGTGAGGTTGGGAGAGATAGCT
GCAGAACAGCTTTTCTTATTAGATCCATATAATACAGGTCATTATGTACAACTCTCAAACTTATATGCTTCTGCCCATTTATGGAACCACGTGAGGAACGTTCGATTAAT
GATGACGCAGAAAGGACTGAACAAGGACCTCGGACATAGTTCGATTGAGATCAACGGAAATCTCGAAACGTTCCATGTTGGAGATAGATCACATCCGAGATCGAAGGAAA
TCTTTGAAGAACTTGATAGATTGGAGAGGAGATTAAAGGCAGCTGGTTATGTTGCTCATATGGAATCTGTTCTACATGACTTGAATGATGAGGAGATTGAGGAAACTCTT
TGTAACCATAGTGAGAGGTTAGCAGTTGCTTATGGCATCATCAGTACTGCTCCTGGAACTACACTTAGAATAACGAAAAATCTCCGTGCGTGCGTTAATTGTCATTCGGC
GATAAAGCTAATATCGAAGCTTGTCAATAGGGAAATAATTGTTCGAGATGCGAAACGCTTTCATCATTTCAAAGATGGAGTTTGTTCGTGCGGAGATTTTTGGTGA
Protein sequenceShow/hide protein sequence
MAILLSFNDYGVHLRRPPPYPLRLSSCCNRTASSLLGLSAMSLHSFSLSLSLSSLSTALSKAAATSQEALLRRKHLDQLYVQLIVSGLYKCGFLVIKFVNACLHLRDVNY
AHKVFREVLEPDILLWNGIIKGYTQNNIFAGAIRMYKDMQVSGVNPDCFTFLYVLKACGGMSVEGIGKQMHSQTFKYGLRSNVFVQNSLVSMYARFGQTSSARLVFDKLH
YRTVVSWTSIISGYVQNGDPVDALRVFKDMRRSTVKLDWIVLVSVVTAYTDMEDLGQGKAIHSLVTKLGLEFEPDIVVSLTNMYAKCGRVEVARFFFNQMEKPNLLLWNA
MISGYAKNGYGEEAIELFRKMISKNIGVDSVTVRSAILAVAQAGSLELARWLDGYISKSEYRDDVFVNTALIDMHAKCGSICFARGVFDRMVDKDVVLWSAMIMGYGLHG
HGQEAIDLYNRMKQSGVHPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNRAYDFIMSMPIKPGVSVWGALLSGCKIHRQVRLGEIA
AEQLFLLDPYNTGHYVQLSNLYASAHLWNHVRNVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVAHMESVLHDLNDEEIEETL
CNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVNREIIVRDAKRFHHFKDGVCSCGDFW