; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0021103 (gene) of Chayote v1 genome

Gene IDSed0021103
OrganismSechium edule (Chayote v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG04:39753817..39756823
RNA-Seq ExpressionSed0021103
SyntenySed0021103
Gene Ontology termsGO:0080156 - mitochondrial mRNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030779.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0087.41Show/hide
Query:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA
        MS H+FSLSL  SS+++ALSK AATSQEALLRRK LDQLYVQL+VSGL++CGFLVIKFVNACLHLRDVNYAHK FREVLEPDILLWN IIKGYTQN+ FA
Subjt:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA

Query:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP
        GAIRMY DMQ+S V+P+CFT LYVLKACGGMSVEGIGKQMH QTFKYG GSNV+VQNSLVSMYA+FGQTSSAR+VFDKLH+RT+VSWT+IISGYVQNGDP
Subjt:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP

Query:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        V+ALRVFK MR+S VK DWI LVSV+TAYTDMEDLGQGK IH LV+KLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG
        GEE+I++FR+MIS +I VDSVTV SAILA AQ GSL+LARWLD YISKSEYRD+ +VNTALID++ KCGSI FAR VFDR+VDKDVV+WSAMIMGYGLHG
Subjt:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG

Query:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI
        HGQEAI+LYN MKQ+GV PNDV FVGLLTACKNSGLVKEGWELFHQMR+HGIEPHHQHYSCVV+LLGR GYLN AYDFIM+MPIKPGVSVWGALLS CKI
Subjt:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLG+SSIEINGNLET HVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V HMESVLHDLN EEIEETLCNHSERLAVAYGIISTAPGTTLRIT NLRACV+CHSAIKLISKLV+REII+RDAKRFHHFKDGVCSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_022139117.1 pentatricopeptide repeat-containing protein At3g12770 [Momordica charantia]0.0e+0088.86Show/hide
Query:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA
        MS H+FSLSL  SSL++A SK+AATSQEALL RK LDQLYVQL+VSGLH+C FLVIKFVNACLHL DV YAHKAFREVLEPDILLWNA+IKGYTQN+ F 
Subjt:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA

Query:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP
        GA+++YT+MQ+S VHP+CFT LYVLKACGGMS+E IGKQMHGQTFKYGFGSNV+VQNSLVSMYAKFGQTS AR+VFDKL DRT+VSWT+IISGYVQNGDP
Subjt:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP

Query:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY
         EAL VFK MRQSNVK DWIALVSVMTAYTDMEDLGQGK IHGLV+KLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG
        GEE+IK+FREMIS +IRVDSVTV SAILAGAQVGSLDLARWLD YISKSEYRD+T+VNTALID+Y KCGSIYFA +VFDR+VDKDVV+WSAMIMGYGLHG
Subjt:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG

Query:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI
        HG+EAINLYNAMKQ GV PNDV FVGLLTACKNSGLVKEGW+LFH++R+HGIEPHHQHYSCVV+LLGR GYLN+AYDFIMNMPIKPGVSVWGALLS+CKI
Subjt:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQV+LGEIAAEQLF LDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLG+SSIEINGNLET HVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        +PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRITNNLRACV+CHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_022941627.1 pentatricopeptide repeat-containing protein At3g12770 isoform X2 [Cucurbita moschata]0.0e+0087.55Show/hide
Query:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA
        MS H+FSLSL  SSL++ALSK AATSQEALLRRK LDQLYVQL+VSGL++CGFLVIKFVNACLHLRDVNYAHK FREVLEPDILLWN IIKGYTQN+ FA
Subjt:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA

Query:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP
        GAIRMY DMQ+S V+P+CFT LYVLKACGGMSVEGIGKQMH QTFKYG GSNV+VQNSLVSMYA+FGQTSSAR+VFDKLH+RT+VSWT+IISGYVQNGDP
Subjt:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP

Query:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        V+ALRVFK MR+S VK DWI LVSV+TAYTDMEDLGQGK IH LV+KLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG
        GEE+I++FR+MIS +I VDSVTV SAILA AQ GSL+LARWLD YISKSEYRD+ +VNTALID++ KCGSI FAR VFDR+VDKDVV+WSAMIMGYGLHG
Subjt:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG

Query:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI
        HGQEAI+LYN MKQ+GV PN+V FVGLLTACKNSGLVKEGWELFHQMR+HGIEPHHQHYSCVV+LLGR GYLN AYDFIM+MPIKPGVSVWGALLS CKI
Subjt:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLG+SSIEINGNLET HVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRIT NLRACV+CHSAIKLISKLVDREII+RDAKRFH+FKDGVCSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_022988451.1 pentatricopeptide repeat-containing protein At3g12770 isoform X2 [Cucurbita maxima]0.0e+0087.41Show/hide
Query:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA
        MS H+FSLSL  +SL++ALSK AATSQEALLRRK LDQLYVQL+VSGL++CGFLVIKFVNACLHLRDVNYAHK FREVLEPDILLWN IIKGYTQN+ FA
Subjt:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA

Query:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP
        GAIRMY DMQ+S V+P+CFT LYVLKACGGMSVEGIGKQMH QTFKYGFGSNV+VQNSLVSMYA++GQTSSAR+VFDKLH+RT+VSWT+IISGYVQNGDP
Subjt:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP

Query:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        ++ALRVFK MRQS VK DWI LVSVMTAYTDMEDLGQGK IH LV+KLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG
        GEE+I++FR+MIS +I VDSVTV SAILA AQVGSL+LARWLD YISKSEYRD+ +VNTALID++ KCGSI FAR VFDR+VDKD+V WSAMIMGYGLHG
Subjt:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG

Query:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI
        HGQEAI+LYN MKQ+G+ PNDV FVGLLTACKNSGLVKEGWELFHQM++HGIEPHHQHYSCVV+LLGR GYLN AYDFIM+MPIKPGVSVWGALLS CKI
Subjt:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLWN VANVRLMMTQKGLNKDLG+SSIEINGNLET HVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRIT NLRACV+CHSAIKLISKLVDREIIIRD KRFHHFKDGVCSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_038892943.1 pentatricopeptide repeat-containing protein At3g12770 [Benincasa hispida]0.0e+0087.41Show/hide
Query:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA
        MS H+FSLSL  SSL+SALSK A TS EA LRRK LDQLYVQL+VSGLH+CGFL+IKFVN+CLH  DVNYAHKAFREV+EPDILLWNAIIKGYTQ +   
Subjt:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA

Query:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP
        GAIRMY DMQMS V+PNCFT LYVLKAC GMSVEGIGKQMHGQTFKYGFGSNV+VQNSLVSMYAKFGQTSSARIVFDKLHDRT+VSWT+IISGYVQNGDP
Subjt:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP

Query:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        VEAL++FK MRQ NVK DWI LVSVMTAYTD+EDLGQGK IHGLV+KLGLEFEPDIV+SLTTMYAK G VE+ARFFFNQMEKPNLILWNAMISGYAKNGY
Subjt:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG
        GEE+IK+F EMIS +IRVDSVTV SAILAGAQVGSL LARWLD+YIS+SEYRD+T+VNT+L+D+Y KCGSIYFAR VFDR+V KDVV+WSAMIMGYGLHG
Subjt:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG

Query:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI
        HGQEAIN YN MKQAGV PNDV FVGLLTACKNSGLVKEGWELFHQM+++GIEPHHQHYSCVV+LLGR GYLN+AYDFIM+MP+KPGVSVWGALLS+CKI
Subjt:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HR+VRLGEIAAEQLF+LDPYN G++VQLSNLYASAHLW HVANVRLMMTQKGLNKDLG+SSI+INGNLET HVGDRSHPRSKEIFEELDRLE+RLKAAGY
Subjt:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRIT NLRACV+CHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

TrEMBL top hitse value%identityAlignment
A0A6J1CD43 pentatricopeptide repeat-containing protein At3g127700.0e+0088.86Show/hide
Query:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA
        MS H+FSLSL  SSL++A SK+AATSQEALL RK LDQLYVQL+VSGLH+C FLVIKFVNACLHL DV YAHKAFREVLEPDILLWNA+IKGYTQN+ F 
Subjt:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA

Query:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP
        GA+++YT+MQ+S VHP+CFT LYVLKACGGMS+E IGKQMHGQTFKYGFGSNV+VQNSLVSMYAKFGQTS AR+VFDKL DRT+VSWT+IISGYVQNGDP
Subjt:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP

Query:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY
         EAL VFK MRQSNVK DWIALVSVMTAYTDMEDLGQGK IHGLV+KLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG
        GEE+IK+FREMIS +IRVDSVTV SAILAGAQVGSLDLARWLD YISKSEYRD+T+VNTALID+Y KCGSIYFA +VFDR+VDKDVV+WSAMIMGYGLHG
Subjt:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG

Query:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI
        HG+EAINLYNAMKQ GV PNDV FVGLLTACKNSGLVKEGW+LFH++R+HGIEPHHQHYSCVV+LLGR GYLN+AYDFIMNMPIKPGVSVWGALLS+CKI
Subjt:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQV+LGEIAAEQLF LDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLG+SSIEINGNLET HVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        +PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRITNNLRACV+CHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1FLM1 pentatricopeptide repeat-containing protein At3g12770 isoform X10.0e+0086.8Show/hide
Query:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLR------RKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYT
        MS H+FSLSL  SSL++ALSK AATSQEALLR      RK LDQLYVQL+VSGL++CGFLVIKFVNACLHLRDVNYAHK FREVLEPDILLWN IIKGYT
Subjt:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLR------RKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYT

Query:  QNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGY
        QN+ FAGAIRMY DMQ+S V+P+CFT LYVLKACGGMSVEGIGKQMH QTFKYG GSNV+VQNSLVSMYA+FGQTSSAR+VFDKLH+RT+VSWT+IISGY
Subjt:  QNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGY

Query:  VQNGDPVEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISG
        VQNGDPV+ALRVFK MR+S VK DWI LVSV+TAYTDMEDLGQGK IH LV+KLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISG
Subjt:  VQNGDPVEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISG

Query:  YAKNGYGEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIM
        YAKNGYGEE+I++FR+MIS +I VDSVTV SAILA AQ GSL+LARWLD YISKSEYRD+ +VNTALID++ KCGSI FAR VFDR+VDKDVV+WSAMIM
Subjt:  YAKNGYGEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIM

Query:  GYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGAL
        GYGLHGHGQEAI+LYN MKQ+GV PN+V FVGLLTACKNSGLVKEGWELFHQMR+HGIEPHHQHYSCVV+LLGR GYLN AYDFIM+MPIKPGVSVWGAL
Subjt:  GYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGAL

Query:  LSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERR
        LS CKIHRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLG+SSIEINGNLET HVGDRSHPRSKEIFEELDRLERR
Subjt:  LSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERR

Query:  LKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        LKAAGYV HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRIT NLRACV+CHSAIKLISKLVDREII+RDAKRFH+FKDGVCSCGDFW
Subjt:  LKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1FN08 pentatricopeptide repeat-containing protein At3g12770 isoform X20.0e+0087.55Show/hide
Query:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA
        MS H+FSLSL  SSL++ALSK AATSQEALLRRK LDQLYVQL+VSGL++CGFLVIKFVNACLHLRDVNYAHK FREVLEPDILLWN IIKGYTQN+ FA
Subjt:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA

Query:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP
        GAIRMY DMQ+S V+P+CFT LYVLKACGGMSVEGIGKQMH QTFKYG GSNV+VQNSLVSMYA+FGQTSSAR+VFDKLH+RT+VSWT+IISGYVQNGDP
Subjt:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP

Query:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        V+ALRVFK MR+S VK DWI LVSV+TAYTDMEDLGQGK IH LV+KLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG
        GEE+I++FR+MIS +I VDSVTV SAILA AQ GSL+LARWLD YISKSEYRD+ +VNTALID++ KCGSI FAR VFDR+VDKDVV+WSAMIMGYGLHG
Subjt:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG

Query:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI
        HGQEAI+LYN MKQ+GV PN+V FVGLLTACKNSGLVKEGWELFHQMR+HGIEPHHQHYSCVV+LLGR GYLN AYDFIM+MPIKPGVSVWGALLS CKI
Subjt:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLG+SSIEINGNLET HVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRIT NLRACV+CHSAIKLISKLVDREII+RDAKRFH+FKDGVCSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1JD09 pentatricopeptide repeat-containing protein At3g12770 isoform X10.0e+0086.66Show/hide
Query:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLR------RKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYT
        MS H+FSLSL  +SL++ALSK AATSQEALLR      RK LDQLYVQL+VSGL++CGFLVIKFVNACLHLRDVNYAHK FREVLEPDILLWN IIKGYT
Subjt:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLR------RKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYT

Query:  QNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGY
        QN+ FAGAIRMY DMQ+S V+P+CFT LYVLKACGGMSVEGIGKQMH QTFKYGFGSNV+VQNSLVSMYA++GQTSSAR+VFDKLH+RT+VSWT+IISGY
Subjt:  QNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGY

Query:  VQNGDPVEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISG
        VQNGDP++ALRVFK MRQS VK DWI LVSVMTAYTDMEDLGQGK IH LV+KLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMISG
Subjt:  VQNGDPVEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISG

Query:  YAKNGYGEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIM
        YAKNGYGEE+I++FR+MIS +I VDSVTV SAILA AQVGSL+LARWLD YISKSEYRD+ +VNTALID++ KCGSI FAR VFDR+VDKD+V WSAMIM
Subjt:  YAKNGYGEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIM

Query:  GYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGAL
        GYGLHGHGQEAI+LYN MKQ+G+ PNDV FVGLLTACKNSGLVKEGWELFHQM++HGIEPHHQHYSCVV+LLGR GYLN AYDFIM+MPIKPGVSVWGAL
Subjt:  GYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGAL

Query:  LSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERR
        LS CKIHRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLWN VANVRLMMTQKGLNKDLG+SSIEINGNLET HVGDRSHPRSKEIFEELDRLERR
Subjt:  LSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERR

Query:  LKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        LKAAGYV HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRIT NLRACV+CHSAIKLISKLVDREIIIRD KRFHHFKDGVCSCGDFW
Subjt:  LKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1JH82 pentatricopeptide repeat-containing protein At3g12770 isoform X20.0e+0087.41Show/hide
Query:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA
        MS H+FSLSL  +SL++ALSK AATSQEALLRRK LDQLYVQL+VSGL++CGFLVIKFVNACLHLRDVNYAHK FREVLEPDILLWN IIKGYTQN+ FA
Subjt:  MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFA

Query:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP
        GAIRMY DMQ+S V+P+CFT LYVLKACGGMSVEGIGKQMH QTFKYGFGSNV+VQNSLVSMYA++GQTSSAR+VFDKLH+RT+VSWT+IISGYVQNGDP
Subjt:  GAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDP

Query:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        ++ALRVFK MRQS VK DWI LVSVMTAYTDMEDLGQGK IH LV+KLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG
        GEE+I++FR+MIS +I VDSVTV SAILA AQVGSL+LARWLD YISKSEYRD+ +VNTALID++ KCGSI FAR VFDR+VDKD+V WSAMIMGYGLHG
Subjt:  GEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHG

Query:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI
        HGQEAI+LYN MKQ+G+ PNDV FVGLLTACKNSGLVKEGWELFHQM++HGIEPHHQHYSCVV+LLGR GYLN AYDFIM+MPIKPGVSVWGALLS CKI
Subjt:  HGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAAEQLF+LDPYNTGHYVQLSNLYASAHLWN VANVRLMMTQKGLNKDLG+SSIEINGNLET HVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRIT NLRACV+CHSAIKLISKLVDREIIIRD KRFHHFKDGVCSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic7.2e-14337.95Show/hide
Query:  EALLRRKQLDQLYVQLVVSGLHRCGFLVIK-FVNACL-HLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDM-QMSRVHPNCFTLLYV
        E  +  +QL Q +  ++ +G     +   K F  A L     + YA K F E+ +P+   WN +I+ Y        +I  + DM   S+ +PN +T  ++
Subjt:  EALLRRKQLDQLYVQLVVSGLHRCGFLVIK-FVNACL-HLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDM-QMSRVHPNCFTLLYV

Query:  LKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWIALVS
        +KA   +S   +G+ +HG   K   GS+V+V NSL+  Y   G   SA  VF  + ++ +VSW ++I+G+VQ G P +AL +FK M   +VK   + +V 
Subjt:  LKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWIALVS

Query:  VMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFF-------------------------------NQMEKPNLILWNAMIS
        V++A   + +L  G+ +   + +  +     +  ++  MY K G +E A+  F                               N M + +++ WNA+IS
Subjt:  VMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFF-------------------------------NQMEKPNLILWNAMIS

Query:  GYAKNGYGEESIKIFREM-ISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAM
         Y +NG   E++ +F E+ +  +++++ +T++S + A AQVG+L+L RW+ +YI K   R N +V +ALI +Y KCG +  +R VF+ +  +DV VWSAM
Subjt:  GYAKNGYGEESIKIFREM-ISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAM

Query:  IMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMR-NHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVW
        I G  +HG G EA++++  M++A V PN V F  +  AC ++GLV E   LFHQM  N+GI P  +HY+C+V++LGR+GYL +A  FI  MPI P  SVW
Subjt:  IMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMR-NHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVW

Query:  GALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRL
        GALL +CKIH  + L E+A  +L  L+P N G +V LSN+YA    W +V+ +R  M   GL K+ G SSIEI+G +     GD +HP S++++ +L  +
Subjt:  GALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRL

Query:  ERRLKAAGYVPHMESVLHDLNHEEIEETLCN-HSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDF
          +LK+ GY P +  VL  +  EE++E   N HSE+LA+ YG+IST     +R+  NLR C DCHS  KLIS+L DREII+RD  RFHHF++G CSC DF
Subjt:  ERRLKAAGYVPHMESVLHDLNHEEIEETLCN-HSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDF

Query:  W
        W
Subjt:  W

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic5.9e-15340.31Show/hide
Query:  QLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIG
        +++  LV SG     F +    N     R VN A K F  + E D++ WN I+ GY+QN     A+ M   M    + P+  T++ VL A   + +  +G
Subjt:  QLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIG

Query:  KQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQ
        K++HG   + GF S V +  +LV MYAK G   +AR +FD + +R +VSW ++I  YVQN +P EA+ +F+ M    VKP  ++++  + A  D+ DL +
Subjt:  KQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQ

Query:  GKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLD
        G+ IH L  +LGL+    +V SL +MY K  +V+ A   F +++   L+ WNAMI G+A+NG   +++  F +M S +++ D+ T +S I A A++    
Subjt:  GKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLD

Query:  LARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLV
         A+W+   + +S    N +V TAL+D+Y KCG+I  AR++FD + ++ V  W+AMI GYG HG G+ A+ L+  M++  + PN V F+ +++AC +SGLV
Subjt:  LARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLV

Query:  KEGWELFHQMR-NHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAH
        + G + F+ M+ N+ IE    HY  +V+LLGR G LNEA+DFIM MP+KP V+V+GA+L +C+IH+ V   E AAE+LF L+P + G++V L+N+Y +A 
Subjt:  KEGWELFHQMR-NHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAH

Query:  LWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIIST
        +W  V  VR+ M ++GL K  G S +EI   + +   G  +HP SK+I+  L++L   +K AGYVP    VL  + ++  E+ L  HSE+LA+++G+++T
Subjt:  LWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIIST

Query:  APGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
          GTT+ +  NLR C DCH+A K IS +  REI++RD +RFHHFK+G CSCGD+W
Subjt:  APGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.6e-14237.54Show/hide
Query:  LYVQLVVSGLHRCGFLVIKFVNACL---HLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEG
        ++ Q++  GLH   + + K +  C+   H   + YA   F+ + EP++L+WN + +G+  +S    A+++Y  M    + PN +T  +VLK+C       
Subjt:  LYVQLVVSGLHRCGFLVIKFVNACL---HLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEG

Query:  IGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFD---------------------------KLHD----RTIVSWTAIISGYVQNGDPVEAL
         G+Q+HG   K G   ++YV  SL+SMY + G+   A  VFD                           KL D    + +VSW A+ISGY + G+  EAL
Subjt:  IGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFD---------------------------KLHD----RTIVSWTAIISGYVQNGDPVEAL

Query:  RVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEES
         +FK M ++NV+PD   +V+V++A      +  G+ +H  +   G      IV +L  +Y+K G++E A   F ++   ++I WN +I GY      +E+
Subjt:  RVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEES

Query:  IKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISK--SEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHG
        + +F+EM+ +    + VT++S + A A +G++D+ RW+  YI K      + + + T+LID+Y KCG I  A  VF+ I+ K +  W+AMI G+ +HG  
Subjt:  IKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISK--SEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHG

Query:  QEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQM-RNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIH
          + +L++ M++ G+ P+D+ FVGLL+AC +SG++  G  +F  M +++ + P  +HY C+++LLG +G   EA + I  M ++P   +W +LL +CK+H
Subjt:  QEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQM-RNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIH

Query:  RQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYV
          V LGE  AE L  ++P N G YV LSN+YASA  WN VA  R ++  KG+ K  G SSIEI+  +    +GD+ HPR++EI+  L+ +E  L+ AG+V
Subjt:  RQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYV

Query:  PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        P    VL ++  E  E  L +HSE+LA+A+G+IST PGT L I  NLR C +CH A KLISK+  REII RD  RFHHF+DGVCSC D+W
Subjt:  PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127701.3e-23757.21Show/hide
Query:  EALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKA
        ++   + QL Q++ +L+V GL   GFL+ K ++A     D+ +A + F ++  P I  WNAII+GY++N+ F  A+ MY++MQ++RV P+ FT  ++LKA
Subjt:  EALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKA

Query:  CGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFD--KLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWIALVSV
        C G+S   +G+ +H Q F+ GF ++V+VQN L+++YAK  +  SAR VF+   L +RTIVSWTAI+S Y QNG+P+EAL +F  MR+ +VKPDW+ALVSV
Subjt:  CGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFD--KLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWIALVSV

Query:  MTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTSIRVDSVTVMS
        + A+T ++DL QG+ IH  V K+GLE EPD+++SL TMYAK GQV  A+  F++M+ PNLILWNAMISGYAKNGY  E+I +F EMI+  +R D++++ S
Subjt:  MTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTSIRVDSVTVMS

Query:  AILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFV
        AI A AQVGSL+ AR +  Y+ +S+YRD+ ++++ALID++ KCGS+  AR+VFDR +D+DVVVWSAMI+GYGLHG  +EAI+LY AM++ GVHPNDV F+
Subjt:  AILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFV

Query:  GLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHY
        GLL AC +SG+V+EGW  F++M +H I P  QHY+CV++LLGR G+L++AY+ I  MP++PGV+VWGALLS+CK HR V LGE AA+QLF +DP NTGHY
Subjt:  GLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHY

Query:  VQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSE
        VQLSNLYA+A LW+ VA VR+ M +KGLNKD+G S +E+ G LE   VGD+SHPR +EI  +++ +E RLK  G+V + ++ LHDLN EE EETLC+HSE
Subjt:  VQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSE

Query:  RLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        R+A+AYG+IST  GT LRIT NLRACV+CH+A KLISKLVDREI++RD  RFHHFKDGVCSCGD+W
Subjt:  RLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Q9LW32 Pentatricopeptide repeat-containing protein At3g26782, mitochondrial5.0e-14440.26Show/hide
Query:  REVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIV
        R V + D+  WN++I    ++   A A+  ++ M+   ++P   +    +KAC  +     GKQ H Q F +G+ S+++V ++L+ MY+  G+   AR V
Subjt:  REVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIV

Query:  FDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAM------RQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQ
        FD++  R IVSWT++I GY  NG+ ++A+ +FK +          +  D + LVSV++A + +   G  + IH  V K G +    +  +L   YAK G+
Subjt:  FDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAM------RQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQ

Query:  --VEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTS-IRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYL
          V VAR  F+Q+   + + +N+++S YA++G   E+ ++FR ++    +  +++T+ + +LA +  G+L + + + + + +    D+  V T++ID+Y 
Subjt:  --VEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTS-IRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYL

Query:  KCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNH-GIEPHHQHYSCVVNL
        KCG +  AR  FDR+ +K+V  W+AMI GYG+HGH  +A+ L+ AM  +GV PN + FV +L AC ++GL  EGW  F+ M+   G+EP  +HY C+V+L
Subjt:  KCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNH-GIEPHHQHYSCVVNL

Query:  LGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEIN
        LGR G+L +AYD I  M +KP   +W +LL++C+IH+ V L EI+  +LF LD  N G+Y+ LS++YA A  W  V  VR++M  +GL K  G+S +E+N
Subjt:  LGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEIN

Query:  GNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLV
        G +    +GD  HP+ ++I+E L  L R+L  AGYV +  SV HD++ EE E TL  HSE+LA+A+GI++T PG+T+ +  NLR C DCH+ IKLISK+V
Subjt:  GNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLV

Query:  DREIIIRDAKRFHHFKDGVCSCGDFW
        DRE ++RDAKRFHHFKDG CSCGD+W
Subjt:  DREIIIRDAKRFHHFKDGVCSCGDFW

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.1e-14337.54Show/hide
Query:  LYVQLVVSGLHRCGFLVIKFVNACL---HLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEG
        ++ Q++  GLH   + + K +  C+   H   + YA   F+ + EP++L+WN + +G+  +S    A+++Y  M    + PN +T  +VLK+C       
Subjt:  LYVQLVVSGLHRCGFLVIKFVNACL---HLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEG

Query:  IGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFD---------------------------KLHD----RTIVSWTAIISGYVQNGDPVEAL
         G+Q+HG   K G   ++YV  SL+SMY + G+   A  VFD                           KL D    + +VSW A+ISGY + G+  EAL
Subjt:  IGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFD---------------------------KLHD----RTIVSWTAIISGYVQNGDPVEAL

Query:  RVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEES
         +FK M ++NV+PD   +V+V++A      +  G+ +H  +   G      IV +L  +Y+K G++E A   F ++   ++I WN +I GY      +E+
Subjt:  RVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEES

Query:  IKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISK--SEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHG
        + +F+EM+ +    + VT++S + A A +G++D+ RW+  YI K      + + + T+LID+Y KCG I  A  VF+ I+ K +  W+AMI G+ +HG  
Subjt:  IKIFREMISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISK--SEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHG

Query:  QEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQM-RNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIH
          + +L++ M++ G+ P+D+ FVGLL+AC +SG++  G  +F  M +++ + P  +HY C+++LLG +G   EA + I  M ++P   +W +LL +CK+H
Subjt:  QEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQM-RNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIH

Query:  RQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYV
          V LGE  AE L  ++P N G YV LSN+YASA  WN VA  R ++  KG+ K  G SSIEI+  +    +GD+ HPR++EI+  L+ +E  L+ AG+V
Subjt:  RQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYV

Query:  PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        P    VL ++  E  E  L +HSE+LA+A+G+IST PGT L I  NLR C +CH A KLISK+  REII RD  RFHHF+DGVCSC D+W
Subjt:  PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein4.2e-15440.31Show/hide
Query:  QLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIG
        +++  LV SG     F +    N     R VN A K F  + E D++ WN I+ GY+QN     A+ M   M    + P+  T++ VL A   + +  +G
Subjt:  QLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIG

Query:  KQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQ
        K++HG   + GF S V +  +LV MYAK G   +AR +FD + +R +VSW ++I  YVQN +P EA+ +F+ M    VKP  ++++  + A  D+ DL +
Subjt:  KQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWIALVSVMTAYTDMEDLGQ

Query:  GKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLD
        G+ IH L  +LGL+    +V SL +MY K  +V+ A   F +++   L+ WNAMI G+A+NG   +++  F +M S +++ D+ T +S I A A++    
Subjt:  GKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTSIRVDSVTVMSAILAGAQVGSLD

Query:  LARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLV
         A+W+   + +S    N +V TAL+D+Y KCG+I  AR++FD + ++ V  W+AMI GYG HG G+ A+ L+  M++  + PN V F+ +++AC +SGLV
Subjt:  LARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLV

Query:  KEGWELFHQMR-NHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAH
        + G + F+ M+ N+ IE    HY  +V+LLGR G LNEA+DFIM MP+KP V+V+GA+L +C+IH+ V   E AAE+LF L+P + G++V L+N+Y +A 
Subjt:  KEGWELFHQMR-NHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAH

Query:  LWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIIST
        +W  V  VR+ M ++GL K  G S +EI   + +   G  +HP SK+I+  L++L   +K AGYVP    VL  + ++  E+ L  HSE+LA+++G+++T
Subjt:  LWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIIST

Query:  APGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
          GTT+ +  NLR C DCH+A K IS +  REI++RD +RFHHFK+G CSCGD+W
Subjt:  APGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.1e-14437.95Show/hide
Query:  EALLRRKQLDQLYVQLVVSGLHRCGFLVIK-FVNACL-HLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDM-QMSRVHPNCFTLLYV
        E  +  +QL Q +  ++ +G     +   K F  A L     + YA K F E+ +P+   WN +I+ Y        +I  + DM   S+ +PN +T  ++
Subjt:  EALLRRKQLDQLYVQLVVSGLHRCGFLVIK-FVNACL-HLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDM-QMSRVHPNCFTLLYV

Query:  LKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWIALVS
        +KA   +S   +G+ +HG   K   GS+V+V NSL+  Y   G   SA  VF  + ++ +VSW ++I+G+VQ G P +AL +FK M   +VK   + +V 
Subjt:  LKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWIALVS

Query:  VMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFF-------------------------------NQMEKPNLILWNAMIS
        V++A   + +L  G+ +   + +  +     +  ++  MY K G +E A+  F                               N M + +++ WNA+IS
Subjt:  VMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFF-------------------------------NQMEKPNLILWNAMIS

Query:  GYAKNGYGEESIKIFREM-ISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAM
         Y +NG   E++ +F E+ +  +++++ +T++S + A AQVG+L+L RW+ +YI K   R N +V +ALI +Y KCG +  +R VF+ +  +DV VWSAM
Subjt:  GYAKNGYGEESIKIFREM-ISTSIRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAM

Query:  IMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMR-NHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVW
        I G  +HG G EA++++  M++A V PN V F  +  AC ++GLV E   LFHQM  N+GI P  +HY+C+V++LGR+GYL +A  FI  MPI P  SVW
Subjt:  IMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMR-NHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVW

Query:  GALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRL
        GALL +CKIH  + L E+A  +L  L+P N G +V LSN+YA    W +V+ +R  M   GL K+ G SSIEI+G +     GD +HP S++++ +L  +
Subjt:  GALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRL

Query:  ERRLKAAGYVPHMESVLHDLNHEEIEETLCN-HSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDF
          +LK+ GY P +  VL  +  EE++E   N HSE+LA+ YG+IST     +R+  NLR C DCHS  KLIS+L DREII+RD  RFHHF++G CSC DF
Subjt:  ERRLKAAGYVPHMESVLHDLNHEEIEETLCN-HSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDF

Query:  W
        W
Subjt:  W

AT3G12770.1 mitochondrial editing factor 229.4e-23957.21Show/hide
Query:  EALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKA
        ++   + QL Q++ +L+V GL   GFL+ K ++A     D+ +A + F ++  P I  WNAII+GY++N+ F  A+ MY++MQ++RV P+ FT  ++LKA
Subjt:  EALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKA

Query:  CGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFD--KLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWIALVSV
        C G+S   +G+ +H Q F+ GF ++V+VQN L+++YAK  +  SAR VF+   L +RTIVSWTAI+S Y QNG+P+EAL +F  MR+ +VKPDW+ALVSV
Subjt:  CGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFD--KLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWIALVSV

Query:  MTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTSIRVDSVTVMS
        + A+T ++DL QG+ IH  V K+GLE EPD+++SL TMYAK GQV  A+  F++M+ PNLILWNAMISGYAKNGY  E+I +F EMI+  +R D++++ S
Subjt:  MTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTSIRVDSVTVMS

Query:  AILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFV
        AI A AQVGSL+ AR +  Y+ +S+YRD+ ++++ALID++ KCGS+  AR+VFDR +D+DVVVWSAMI+GYGLHG  +EAI+LY AM++ GVHPNDV F+
Subjt:  AILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFV

Query:  GLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHY
        GLL AC +SG+V+EGW  F++M +H I P  QHY+CV++LLGR G+L++AY+ I  MP++PGV+VWGALLS+CK HR V LGE AA+QLF +DP NTGHY
Subjt:  GLLTACKNSGLVKEGWELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHY

Query:  VQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSE
        VQLSNLYA+A LW+ VA VR+ M +KGLNKD+G S +E+ G LE   VGD+SHPR +EI  +++ +E RLK  G+V + ++ LHDLN EE EETLC+HSE
Subjt:  VQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSE

Query:  RLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        R+A+AYG+IST  GT LRIT NLRACV+CH+A KLISKLVDREI++RD  RFHHFKDGVCSCGD+W
Subjt:  RLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

AT3G26782.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.6e-14540.26Show/hide
Query:  REVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIV
        R V + D+  WN++I    ++   A A+  ++ M+   ++P   +    +KAC  +     GKQ H Q F +G+ S+++V ++L+ MY+  G+   AR V
Subjt:  REVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQMSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIV

Query:  FDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAM------RQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQ
        FD++  R IVSWT++I GY  NG+ ++A+ +FK +          +  D + LVSV++A + +   G  + IH  V K G +    +  +L   YAK G+
Subjt:  FDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAM------RQSNVKPDWIALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQ

Query:  --VEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTS-IRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYL
          V VAR  F+Q+   + + +N+++S YA++G   E+ ++FR ++    +  +++T+ + +LA +  G+L + + + + + +    D+  V T++ID+Y 
Subjt:  --VEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTS-IRVDSVTVMSAILAGAQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYL

Query:  KCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNH-GIEPHHQHYSCVVNL
        KCG +  AR  FDR+ +K+V  W+AMI GYG+HGH  +A+ L+ AM  +GV PN + FV +L AC ++GL  EGW  F+ M+   G+EP  +HY C+V+L
Subjt:  KCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEGWELFHQMRNH-GIEPHHQHYSCVVNL

Query:  LGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEIN
        LGR G+L +AYD I  M +KP   +W +LL++C+IH+ V L EI+  +LF LD  N G+Y+ LS++YA A  W  V  VR++M  +GL K  G+S +E+N
Subjt:  LGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGYSSIEIN

Query:  GNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLV
        G +    +GD  HP+ ++I+E L  L R+L  AGYV +  SV HD++ EE E TL  HSE+LA+A+GI++T PG+T+ +  NLR C DCH+ IKLISK+V
Subjt:  GNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKLISKLV

Query:  DREIIIRDAKRFHHFKDGVCSCGDFW
        DRE ++RDAKRFHHFKDG CSCGD+W
Subjt:  DREIIIRDAKRFHHFKDGVCSCGDFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCGCATGCGTTTTCGCTCTCTCTCTTCTCGTCATCGCTAACTTCAGCTCTCTCAAAGGTGGCGGCAACCTCGCAAGAGGCTCTATTGAGGAGGAAGCAATTGGA
TCAATTATACGTCCAGTTAGTCGTGTCAGGACTACACAGGTGCGGTTTCTTGGTGATCAAATTTGTCAATGCATGTTTGCATCTTAGAGATGTTAACTATGCGCACAAAG
CTTTTCGTGAAGTCTTAGAACCAGATATTTTGTTGTGGAATGCCATCATAAAGGGTTACACTCAGAATAGTACTTTTGCTGGTGCTATCAGGATGTATACAGATATGCAA
ATGTCAAGGGTGCATCCAAACTGTTTCACACTTTTGTATGTGCTTAAAGCATGTGGTGGAATGTCAGTCGAAGGAATAGGTAAACAGATGCATGGCCAGACATTTAAATA
TGGCTTTGGATCAAATGTTTATGTTCAGAACAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAACCTCATCTGCTAGGATCGTTTTTGATAAGTTACATGATAGAACTA
TTGTTTCATGGACTGCCATCATTTCTGGGTATGTTCAGAATGGTGATCCTGTGGAAGCATTGAGGGTTTTCAAAGCAATGAGACAAAGTAATGTCAAACCTGATTGGATT
GCTCTTGTTAGTGTTATGACAGCATATACAGACATGGAGGATTTAGGACAAGGAAAGGTCATTCATGGCTTAGTGTCGAAATTGGGTCTAGAATTCGAACCCGATATAGT
GGTTTCACTCACTACCATGTATGCAAAACGTGGACAGGTGGAAGTCGCTAGATTTTTCTTTAATCAGATGGAAAAACCAAATTTAATTTTGTGGAACGCTATGATTTCTG
GTTATGCAAAAAATGGATATGGTGAAGAATCAATCAAGATATTCCGTGAAATGATTTCAACAAGTATCAGGGTTGATTCAGTTACAGTGATGTCTGCTATTCTAGCAGGT
GCCCAAGTGGGGTCTCTTGATCTAGCAAGATGGTTGGATAATTATATCTCTAAGAGTGAGTATAGAGACAATACTTATGTAAACACGGCCCTTATAGATATCTATCTAAA
ATGTGGAAGCATATATTTTGCTCGTATTGTTTTCGATAGAATTGTCGATAAAGATGTTGTCGTATGGAGTGCAATGATTATGGGGTATGGATTACATGGTCATGGGCAAG
AAGCCATTAACCTTTACAATGCAATGAAGCAAGCTGGAGTTCATCCAAACGATGTTATTTTTGTTGGCCTTCTCACAGCTTGCAAAAATTCAGGACTTGTAAAAGAGGGA
TGGGAACTTTTCCACCAGATGCGAAATCACGGGATTGAACCGCATCACCAGCATTACTCTTGCGTGGTCAATCTTCTAGGACGTACTGGCTACTTGAATGAAGCTTATGA
TTTTATTATGAATATGCCAATTAAACCTGGAGTTAGTGTTTGGGGGGCTCTTCTGAGTTCATGCAAGATCCATCGCCAAGTGAGGCTAGGAGAAATTGCTGCAGAACAGC
TTTTCGTATTAGATCCATACAATACAGGGCATTATGTGCAACTTTCAAACTTATATGCTTCTGCCCACTTATGGAATCACGTGGCGAACGTTCGACTAATGATGACGCAG
AAAGGACTGAACAAGGACCTTGGATATAGTTCTATCGAGATCAATGGAAATCTCGAAACATTGCATGTTGGAGATAGATCACATCCTAGATCAAAGGAAATTTTTGAGGA
GCTTGATAGATTGGAGAGGAGATTAAAAGCAGCCGGTTATGTTCCTCATATGGAATCTGTTCTACATGACTTGAATCATGAGGAGATTGAGGAAACTCTTTGCAATCACA
GTGAAAGGCTAGCAGTTGCTTATGGCATCATAAGTACTGCCCCGGGAACTACACTTCGAATAACCAATAATCTTCGAGCTTGCGTTGATTGCCATTCAGCGATAAAGCTA
ATATCGAAGCTTGTTGATAGGGAAATAATTATTCGAGATGCGAAGCGTTTTCATCATTTCAAAGATGGAGTTTGTTCATGCGGAGATTTTTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCCGCATGCGTTTTCGCTCTCTCTCTTCTCGTCATCGCTAACTTCAGCTCTCTCAAAGGTGGCGGCAACCTCGCAAGAGGCTCTATTGAGGAGGAAGCAATTGGA
TCAATTATACGTCCAGTTAGTCGTGTCAGGACTACACAGGTGCGGTTTCTTGGTGATCAAATTTGTCAATGCATGTTTGCATCTTAGAGATGTTAACTATGCGCACAAAG
CTTTTCGTGAAGTCTTAGAACCAGATATTTTGTTGTGGAATGCCATCATAAAGGGTTACACTCAGAATAGTACTTTTGCTGGTGCTATCAGGATGTATACAGATATGCAA
ATGTCAAGGGTGCATCCAAACTGTTTCACACTTTTGTATGTGCTTAAAGCATGTGGTGGAATGTCAGTCGAAGGAATAGGTAAACAGATGCATGGCCAGACATTTAAATA
TGGCTTTGGATCAAATGTTTATGTTCAGAACAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAACCTCATCTGCTAGGATCGTTTTTGATAAGTTACATGATAGAACTA
TTGTTTCATGGACTGCCATCATTTCTGGGTATGTTCAGAATGGTGATCCTGTGGAAGCATTGAGGGTTTTCAAAGCAATGAGACAAAGTAATGTCAAACCTGATTGGATT
GCTCTTGTTAGTGTTATGACAGCATATACAGACATGGAGGATTTAGGACAAGGAAAGGTCATTCATGGCTTAGTGTCGAAATTGGGTCTAGAATTCGAACCCGATATAGT
GGTTTCACTCACTACCATGTATGCAAAACGTGGACAGGTGGAAGTCGCTAGATTTTTCTTTAATCAGATGGAAAAACCAAATTTAATTTTGTGGAACGCTATGATTTCTG
GTTATGCAAAAAATGGATATGGTGAAGAATCAATCAAGATATTCCGTGAAATGATTTCAACAAGTATCAGGGTTGATTCAGTTACAGTGATGTCTGCTATTCTAGCAGGT
GCCCAAGTGGGGTCTCTTGATCTAGCAAGATGGTTGGATAATTATATCTCTAAGAGTGAGTATAGAGACAATACTTATGTAAACACGGCCCTTATAGATATCTATCTAAA
ATGTGGAAGCATATATTTTGCTCGTATTGTTTTCGATAGAATTGTCGATAAAGATGTTGTCGTATGGAGTGCAATGATTATGGGGTATGGATTACATGGTCATGGGCAAG
AAGCCATTAACCTTTACAATGCAATGAAGCAAGCTGGAGTTCATCCAAACGATGTTATTTTTGTTGGCCTTCTCACAGCTTGCAAAAATTCAGGACTTGTAAAAGAGGGA
TGGGAACTTTTCCACCAGATGCGAAATCACGGGATTGAACCGCATCACCAGCATTACTCTTGCGTGGTCAATCTTCTAGGACGTACTGGCTACTTGAATGAAGCTTATGA
TTTTATTATGAATATGCCAATTAAACCTGGAGTTAGTGTTTGGGGGGCTCTTCTGAGTTCATGCAAGATCCATCGCCAAGTGAGGCTAGGAGAAATTGCTGCAGAACAGC
TTTTCGTATTAGATCCATACAATACAGGGCATTATGTGCAACTTTCAAACTTATATGCTTCTGCCCACTTATGGAATCACGTGGCGAACGTTCGACTAATGATGACGCAG
AAAGGACTGAACAAGGACCTTGGATATAGTTCTATCGAGATCAATGGAAATCTCGAAACATTGCATGTTGGAGATAGATCACATCCTAGATCAAAGGAAATTTTTGAGGA
GCTTGATAGATTGGAGAGGAGATTAAAAGCAGCCGGTTATGTTCCTCATATGGAATCTGTTCTACATGACTTGAATCATGAGGAGATTGAGGAAACTCTTTGCAATCACA
GTGAAAGGCTAGCAGTTGCTTATGGCATCATAAGTACTGCCCCGGGAACTACACTTCGAATAACCAATAATCTTCGAGCTTGCGTTGATTGCCATTCAGCGATAAAGCTA
ATATCGAAGCTTGTTGATAGGGAAATAATTATTCGAGATGCGAAGCGTTTTCATCATTTCAAAGATGGAGTTTGTTCATGCGGAGATTTTTGGTGAAGCCTGGTTGGTAT
TTTGTATTTACTTTAACCTTACACTTACTGATTTTTTTTAAAAAATCCCATTCGCATTAACCTAACCGTGATAGAACACAATTTTCTAACAGTGA
Protein sequenceShow/hide protein sequence
MSPHAFSLSLFSSSLTSALSKVAATSQEALLRRKQLDQLYVQLVVSGLHRCGFLVIKFVNACLHLRDVNYAHKAFREVLEPDILLWNAIIKGYTQNSTFAGAIRMYTDMQ
MSRVHPNCFTLLYVLKACGGMSVEGIGKQMHGQTFKYGFGSNVYVQNSLVSMYAKFGQTSSARIVFDKLHDRTIVSWTAIISGYVQNGDPVEALRVFKAMRQSNVKPDWI
ALVSVMTAYTDMEDLGQGKVIHGLVSKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEESIKIFREMISTSIRVDSVTVMSAILAG
AQVGSLDLARWLDNYISKSEYRDNTYVNTALIDIYLKCGSIYFARIVFDRIVDKDVVVWSAMIMGYGLHGHGQEAINLYNAMKQAGVHPNDVIFVGLLTACKNSGLVKEG
WELFHQMRNHGIEPHHQHYSCVVNLLGRTGYLNEAYDFIMNMPIKPGVSVWGALLSSCKIHRQVRLGEIAAEQLFVLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQ
KGLNKDLGYSSIEINGNLETLHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITNNLRACVDCHSAIKL
ISKLVDREIIIRDAKRFHHFKDGVCSCGDFW