; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0019819 (gene) of Snake gourd v1 genome

Gene IDTan0019819
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG04:3325662..3338063
RNA-Seq ExpressionTan0019819
SyntenyTan0019819
Gene Ontology termsGO:0080156 - mitochondrial mRNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7030779.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0090.3Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA
        MSLHSFSLSLSLSS+STALSKAAATSQEALL RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHLRDVNYAHK F EVLE DILLWN IIKGYTQNNIFA
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA

Query:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GAIRMY DMQVSGV P+CFTFLY+LKACGGMSVEGIGKQMH Q FKYG GSNVFVQNSLVSMYA+FGQT SAR+VFDKLH+RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        V+ALRVFK+MR+S +K DWI LVSV+TAYTDMEDLGQGK IH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKN+ +DSVTVRSAILA AQ GS++LARWLDGYISKSEY+DD+FVNTALIDM+AKCGSI FAR VFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI
        HGQEAI LY  MKQ+GV PND+TFV LLTACKNSG VKEGWELFHQM+DHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGALLS CKI
Subjt:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAA+QLF+LDPYNTGHYVQLSNLYASAHLW+HV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V HMESVLHDLN EEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRAC+NCHSAIKLISKLV+REII+RDAKRFHHFKDGVCSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_022139117.1 pentatricopeptide repeat-containing protein At3g12770 [Momordica charantia]0.0e+0090.59Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA
        MSLHSFSLSLSLSSLSTA SK AATSQEALL RKHLDQLYVQLIVSGLHKC FLVIKFVNACLHL DV YAHKAF EVLE DILLWNA+IKGYTQNNIF 
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA

Query:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GA+++YT+MQVSGV P+CFTFLY+LKACGGMS+E IGKQMHGQ FKYGFGSNVFVQNSLVSMYAKFGQT  AR+VFDKL DRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY
         EAL VFK+MRQSN+K DWIALVSVMTAYTDMEDLGQGK IHGLVTKLGLEFEPDIVVSLTTMYAKRG+VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAIKLFREMISKN+R+DSVTVRSAILAGAQVGS+DLARWLDGYISKSEY+DD FVNTALIDMYAKCGSI+FA +VFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI
        HG+EAI+LY  MKQ GV PND+TFV LLTACKNSG VKEGW+LFH+++DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIM+MPIKPGVSVWGALLS+CKI
Subjt:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQV+LGEIAA+QLF LDPYNTGHYVQLSNLYASAHLW+HVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        +PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRAC+NCHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_022941627.1 pentatricopeptide repeat-containing protein At3g12770 isoform X2 [Cucurbita moschata]0.0e+0090.45Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA
        MSLHSFSLSLSLSSLSTALSKAAATSQEALL RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHLRDVNYAHK F EVLE DILLWN IIKGYTQNNIFA
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA

Query:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GAIRMY DMQVSGV P+CFTFLY+LKACGGMSVEGIGKQMH Q FKYG GSNVFVQNSLVSMYA+FGQT SAR+VFDKLH+RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        V+ALRVFK+MR+S +K DWI LVSV+TAYTDMEDLGQGK IH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKN+ +DSVTVRSAILA AQ GS++LARWLDGYISKSEY+DD+FVNTALIDM+AKCGSI FAR VFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI
        HGQEAI LY  MKQ+GV PN++TFV LLTACKNSG VKEGWELFHQM+DHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGALLS CKI
Subjt:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAA+QLF+LDPYNTGHYVQLSNLYASAHLW+HV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREII+RDAKRFH+FKDGVCSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_022988437.1 pentatricopeptide repeat-containing protein At3g12770 isoform X1 [Cucurbita maxima]0.0e+0089.67Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-------RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYT
        MSLHSFSLSLSL+SLSTALSKAAATSQEALL       RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHLRDVNYAHK F EVLE DILLWN IIKGYT
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-------RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYT

Query:  QNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGY
        QNNIFAGAIRMY DMQVSGV P+CFTFLY+LKACGGMSVEGIGKQMH Q FKYGFGSNVFVQNSLVSMYA++GQT SAR+VFDKLH+RTVVSWTSIISGY
Subjt:  QNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGY

Query:  VQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISG
        VQNGDP++ALRVFK+MRQS +K DWI LVSVMTAYTDMEDLGQGK IH LVTKLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMISG
Subjt:  VQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISG

Query:  YAKNGYGEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIM
        YAKNGYGEEAI+LFR+MISKN+ +DSVTVRSAILA AQVGS++LARWLDGYISKSEY+DD+FVNTALIDM+AKCGSI FAR VFDRMVDKD+V WSAMIM
Subjt:  YAKNGYGEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIM

Query:  GYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGAL
        GYGLHGHGQEAI LY  MKQ+G+ PND+TFV LLTACKNSG VKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGAL
Subjt:  GYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGAL

Query:  LSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
        LS CKIHRQVRLGEIAA+QLF+LDPYNTGHYVQLSNLYASAHLW+ VANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
Subjt:  LSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR

Query:  LKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        LKAAGYV HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREIIIRD KRFHHFKDGVCSCGDFW
Subjt:  LKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_022988451.1 pentatricopeptide repeat-containing protein At3g12770 isoform X2 [Cucurbita maxima]0.0e+0090.45Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA
        MSLHSFSLSLSL+SLSTALSKAAATSQEALL RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHLRDVNYAHK F EVLE DILLWN IIKGYTQNNIFA
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA

Query:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GAIRMY DMQVSGV P+CFTFLY+LKACGGMSVEGIGKQMH Q FKYGFGSNVFVQNSLVSMYA++GQT SAR+VFDKLH+RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        ++ALRVFK+MRQS +K DWI LVSVMTAYTDMEDLGQGK IH LVTKLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKN+ +DSVTVRSAILA AQVGS++LARWLDGYISKSEY+DD+FVNTALIDM+AKCGSI FAR VFDRMVDKD+V WSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI
        HGQEAI LY  MKQ+G+ PND+TFV LLTACKNSG VKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGALLS CKI
Subjt:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAA+QLF+LDPYNTGHYVQLSNLYASAHLW+ VANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREIIIRD KRFHHFKDGVCSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

TrEMBL top hitse value%identityAlignment
A0A6J1CD43 pentatricopeptide repeat-containing protein At3g127700.0e+0090.59Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA
        MSLHSFSLSLSLSSLSTA SK AATSQEALL RKHLDQLYVQLIVSGLHKC FLVIKFVNACLHL DV YAHKAF EVLE DILLWNA+IKGYTQNNIF 
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA

Query:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GA+++YT+MQVSGV P+CFTFLY+LKACGGMS+E IGKQMHGQ FKYGFGSNVFVQNSLVSMYAKFGQT  AR+VFDKL DRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY
         EAL VFK+MRQSN+K DWIALVSVMTAYTDMEDLGQGK IHGLVTKLGLEFEPDIVVSLTTMYAKRG+VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAIKLFREMISKN+R+DSVTVRSAILAGAQVGS+DLARWLDGYISKSEY+DD FVNTALIDMYAKCGSI+FA +VFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI
        HG+EAI+LY  MKQ GV PND+TFV LLTACKNSG VKEGW+LFH+++DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIM+MPIKPGVSVWGALLS+CKI
Subjt:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQV+LGEIAA+QLF LDPYNTGHYVQLSNLYASAHLW+HVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        +PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRAC+NCHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1FLM1 pentatricopeptide repeat-containing protein At3g12770 isoform X10.0e+0089.67Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-------RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYT
        MSLHSFSLSLSLSSLSTALSKAAATSQEALL       RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHLRDVNYAHK F EVLE DILLWN IIKGYT
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-------RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYT

Query:  QNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGY
        QNNIFAGAIRMY DMQVSGV P+CFTFLY+LKACGGMSVEGIGKQMH Q FKYG GSNVFVQNSLVSMYA+FGQT SAR+VFDKLH+RTVVSWTSIISGY
Subjt:  QNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGY

Query:  VQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISG
        VQNGDPV+ALRVFK+MR+S +K DWI LVSV+TAYTDMEDLGQGK IH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISG
Subjt:  VQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISG

Query:  YAKNGYGEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIM
        YAKNGYGEEAI+LFR+MISKN+ +DSVTVRSAILA AQ GS++LARWLDGYISKSEY+DD+FVNTALIDM+AKCGSI FAR VFDRMVDKDVVLWSAMIM
Subjt:  YAKNGYGEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIM

Query:  GYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGAL
        GYGLHGHGQEAI LY  MKQ+GV PN++TFV LLTACKNSG VKEGWELFHQM+DHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGAL
Subjt:  GYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGAL

Query:  LSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
        LS CKIHRQVRLGEIAA+QLF+LDPYNTGHYVQLSNLYASAHLW+HV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
Subjt:  LSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR

Query:  LKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        LKAAGYV HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREII+RDAKRFH+FKDGVCSCGDFW
Subjt:  LKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1FN08 pentatricopeptide repeat-containing protein At3g12770 isoform X20.0e+0090.45Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA
        MSLHSFSLSLSLSSLSTALSKAAATSQEALL RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHLRDVNYAHK F EVLE DILLWN IIKGYTQNNIFA
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA

Query:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GAIRMY DMQVSGV P+CFTFLY+LKACGGMSVEGIGKQMH Q FKYG GSNVFVQNSLVSMYA+FGQT SAR+VFDKLH+RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        V+ALRVFK+MR+S +K DWI LVSV+TAYTDMEDLGQGK IH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKN+ +DSVTVRSAILA AQ GS++LARWLDGYISKSEY+DD+FVNTALIDM+AKCGSI FAR VFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI
        HGQEAI LY  MKQ+GV PN++TFV LLTACKNSG VKEGWELFHQM+DHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGALLS CKI
Subjt:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAA+QLF+LDPYNTGHYVQLSNLYASAHLW+HV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREII+RDAKRFH+FKDGVCSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1JD09 pentatricopeptide repeat-containing protein At3g12770 isoform X10.0e+0089.67Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-------RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYT
        MSLHSFSLSLSL+SLSTALSKAAATSQEALL       RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHLRDVNYAHK F EVLE DILLWN IIKGYT
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-------RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYT

Query:  QNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGY
        QNNIFAGAIRMY DMQVSGV P+CFTFLY+LKACGGMSVEGIGKQMH Q FKYGFGSNVFVQNSLVSMYA++GQT SAR+VFDKLH+RTVVSWTSIISGY
Subjt:  QNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGY

Query:  VQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISG
        VQNGDP++ALRVFK+MRQS +K DWI LVSVMTAYTDMEDLGQGK IH LVTKLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMISG
Subjt:  VQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISG

Query:  YAKNGYGEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIM
        YAKNGYGEEAI+LFR+MISKN+ +DSVTVRSAILA AQVGS++LARWLDGYISKSEY+DD+FVNTALIDM+AKCGSI FAR VFDRMVDKD+V WSAMIM
Subjt:  YAKNGYGEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIM

Query:  GYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGAL
        GYGLHGHGQEAI LY  MKQ+G+ PND+TFV LLTACKNSG VKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGAL
Subjt:  GYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGAL

Query:  LSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
        LS CKIHRQVRLGEIAA+QLF+LDPYNTGHYVQLSNLYASAHLW+ VANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
Subjt:  LSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR

Query:  LKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        LKAAGYV HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREIIIRD KRFHHFKDGVCSCGDFW
Subjt:  LKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1JH82 pentatricopeptide repeat-containing protein At3g12770 isoform X20.0e+0090.45Show/hide
Query:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA
        MSLHSFSLSLSL+SLSTALSKAAATSQEALL RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHLRDVNYAHK F EVLE DILLWN IIKGYTQNNIFA
Subjt:  MSLHSFSLSLSLSSLSTALSKAAATSQEALL-RKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFA

Query:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GAIRMY DMQVSGV P+CFTFLY+LKACGGMSVEGIGKQMH Q FKYGFGSNVFVQNSLVSMYA++GQT SAR+VFDKLH+RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        ++ALRVFK+MRQS +K DWI LVSVMTAYTDMEDLGQGK IH LVTKLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKN+ +DSVTVRSAILA AQVGS++LARWLDGYISKSEY+DD+FVNTALIDM+AKCGSI FAR VFDRMVDKD+V WSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI
        HGQEAI LY  MKQ+G+ PND+TFV LLTACKNSG VKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGALLS CKI
Subjt:  HGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKI

Query:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVRLGEIAA+QLF+LDPYNTGHYVQLSNLYASAHLW+ VANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREIIIRD KRFHHFKDGVCSCGDFW
Subjt:  VPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

SwissProt top hitse value%identityAlignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic3.3e-15640.74Show/hide
Query:  TALSKAAATSQEALLRKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPN
        T L K      E  + K +  L   L+ SG     F +    N     R VN A K F  + E D++ WN I+ GY+QN +   A+ M   M    ++P+
Subjt:  TALSKAAATSQEALLRKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPN

Query:  CFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKP
          T + +L A   + +  +GK++HG A + GF S V +  +LV MYAK G   +AR +FD + +R VVSW S+I  YVQN +P EA+ +F++M    +KP
Subjt:  CFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKP

Query:  DWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVR
          ++++  + A  D+ DL +G+ IH L  +LGL+    +V SL +MY K  +V+ A   F +++   L+ WNAMI G+A+NG   +A+  F +M S+ V+
Subjt:  DWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVR

Query:  LDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGV
         D+ T  S I A A++     A+W+ G + +S    ++FV TAL+DMYAKCG+I  AR++FD M ++ V  W+AMI GYG HG G+ A+ L++ M++  +
Subjt:  LDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGV

Query:  HPNDITFVALLTACKNSGFVKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFI
         PN +TF+++++AC +SG V+ G + F+ M +++ IE    HY  +VDLLGRAG LN+A+DFIM MP+KP V+V+GA+L +C+IH+ V   E AA++LF 
Subjt:  HPNDITFVALLTACKNSGFVKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFI

Query:  LDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEI
        L+P + G++V L+N+Y +A +W+ V  VR+ M ++GL K  G S +EI   + +F  G  +HP SK+I+  L++L   +K AGYVP    VL  + ++  
Subjt:  LDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEI

Query:  EETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        E+ L  HSE+LA+++G+++T  GTT+ + KNLR C +CH+A K IS +  REI++RD +RFHHFK+G CSCGD+W
Subjt:  EETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127702.3e-24257.91Show/hide
Query:  AATSQEALLRKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPNCFTFLY
        A+    A  +  L Q++ +L+V GL    FL+ K ++A     D+ +A + F ++    I  WNAII+GY++NN F  A+ MY++MQ++ V P+ FTF +
Subjt:  AATSQEALLRKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPNCFTFLY

Query:  MLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIA
        +LKAC G+S   +G+ +H Q F+ GF ++VFVQN L+++YAK  +  SAR VF+   L +RT+VSWT+I+S Y QNG+P+EAL +F +MR+ ++KPDW+A
Subjt:  MLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIA

Query:  LVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVRLDSV
        LVSV+ A+T ++DL QG+ IH  V K+GLE EPD+++SL TMYAK G+V  A+  F++M+ PNLILWNAMISGYAKNGY  EAI +F EMI+K+VR D++
Subjt:  LVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVRLDSV

Query:  TVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPND
        ++ SAI A AQVGS++ AR +  Y+ +S+Y+DD+F+++ALIDM+AKCGS+  AR+VFDR +D+DVV+WSAMI+GYGLHG  +EAISLY+ M++ GVHPND
Subjt:  TVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPND

Query:  ITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYN
        +TF+ LL AC +SG V+EGW  F++M DH I P  QHY+CV+DLLGRAG+L+QAY+ I  MP++PGV+VWGALLS+CK HR V LGE AA+QLF +DP N
Subjt:  ITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYN

Query:  TGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLC
        TGHYVQLSNLYA+A LWD VA VR+ M +KGLNKD+G S +E+ G LE F VGD+SHPR +EI  +++ +E RLK  G+V + ++ LHDLN EE EETLC
Subjt:  TGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLC

Query:  NHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        +HSER+A+AYG+IST  GT LRITKNLRAC+NCH+A KLISKLVDREI++RD  RFHHFKDGVCSCGD+W
Subjt:  NHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Q9LW32 Pentatricopeptide repeat-containing protein At3g26782, mitochondrial7.9e-15042.12Show/hide
Query:  ESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKL
        ++D+  WN++I    ++   A A+  ++ M+   + P   +F   +KAC  +     GKQ H QAF +G+ S++FV ++L+ MY+  G+   AR VFD++
Subjt:  ESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKL

Query:  HDRTVVSWTSIISGYVQNGDPVEALRVFKEM------RQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGK--VE
          R +VSWTS+I GY  NG+ ++A+ +FK++          M  D + LVSV++A + +   G  + IH  V K G +    +  +L   YAK G+  V 
Subjt:  HDRTVVSWTSIISGYVQNGDPVEALRVFKEM------RQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGK--VE

Query:  VARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGS
        VAR  F+Q+   + + +N+++S YA++G   EA ++FR ++ +K V  +++T+ + +LA +  G++ + + +   + +   +DD+ V T++IDMY KCG 
Subjt:  VARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGS

Query:  IHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDH-GIEPHHQHYSCVVDLLGRA
        +  AR  FDRM +K+V  W+AMI GYG+HGH  +A+ L+  M  +GV PN ITFV++L AC ++G   EGW  F+ M+   G+EP  +HY C+VDLLGRA
Subjt:  IHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDH-GIEPHHQHYSCVVDLLGRA

Query:  GYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLE
        G+L +AYD I  M +KP   +W +LL++C+IH+ V L EI+  +LF LD  N G+Y+ LS++YA A  W  V  VR++M  +GL K  G S +E+NG + 
Subjt:  GYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLE

Query:  TFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREI
         F +GD  HP+ ++I+E L  L R+L  AGYV +  SV HD++ EE E TL  HSE+LA+A+GI++T PG+T+ + KNLR C +CH+ IKLISK+VDRE 
Subjt:  TFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREI

Query:  IIRDAKRFHHFKDGVCSCGDFW
        ++RDAKRFHHFKDG CSCGD+W
Subjt:  IIRDAKRFHHFKDGVCSCGDFW

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233302.2e-14439.19Show/hide
Query:  SLSTALSKAAATSQEALLRK-HLDQLYVQLI--VSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQV
        S S AL K    +   +  K    QL+ Q I   S  H  + +VI       +L+ ++ A   F  +    +L W ++I+ +T  ++F+ A+  + +M+ 
Subjt:  SLSTALSKAAATSQEALLRK-HLDQLYVQLI--VSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQV

Query:  SGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAK---FGQTLSARIVFDKLHDRT-------------------------
        SG  P+   F  +LK+C  M     G+ +HG   + G   +++  N+L++MYAK    G  +S   VFD++  RT                         
Subjt:  SGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAK---FGQTLSARIVFDKLHDRT-------------------------

Query:  --------VVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARF
                VVS+ +II+GY Q+G   +ALR+ +EM  +++KPD   L SV+  +++  D+ +GK IHG V + G++ +  I  SL  MYAK  ++E +  
Subjt:  --------VVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARF

Query:  FFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFAR
         F+++   + I WN++++GY +NG   EA++LFR+M++  V+  +V   S I A A + ++ L + L GY+ +  +  +IF+ +AL+DMY+KCG+I  AR
Subjt:  FFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFAR

Query:  IVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGYLNQ
         +FDRM   D V W+A+IMG+ LHGHG EA+SL++ MK+ GV PN + FVA+LTAC + G V E W  F+ M + +G+    +HY+ V DLLGRAG L +
Subjt:  IVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGYLNQ

Query:  AYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVG
        AY+FI  M ++P  SVW  LLSSC +H+ + L E  A+++F +D  N G YV + N+YAS   W  +A +RL M +KGL K    S IE+      F  G
Subjt:  AYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVG

Query:  DRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDA
        DRSHP   +I E L  +  +++  GYV     VLHD++ E   E L  HSERLAVA+GII+T PGTT+R+TKN+R C +CH AIK ISK+ +REII+RD 
Subjt:  DRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDA

Query:  KRFHHFKDGVCSCGDFW
         RFHHF  G CSCGD+W
Subjt:  KRFHHFKDGVCSCGDFW

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic2.6e-14540.03Show/hide
Query:  DQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGI
        +QL+  ++ SG  + + +    V   L  + V+ A K F E+ E D++ WN+II GY  N +    + ++  M VSG++ +  T + +   C    +  +
Subjt:  DQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGI

Query:  GKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLG
        G+ +H    K  F       N+L+ MY+K G   SA+ VF ++ DR+VVS+TS+I+GY + G   EA+++F+EM +  + PD   + +V+        L 
Subjt:  GKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLG

Query:  QGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNVRLDSVTVRSAILAGAQVGS
        +GK +H  + +  L F+  +  +L  MYAK G ++ A   F++M   ++I WN +I GY+KN Y  EA+ LF  ++  K    D  TV   + A A + +
Subjt:  QGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNVRLDSVTVRSAILAGAQVGS

Query:  IDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSG
         D  R + GYI ++ Y  D  V  +L+DMYAKCG++  A ++FD +  KD+V W+ MI GYG+HG G+EAI+L+  M+QAG+  ++I+FV+LL AC +SG
Subjt:  IDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSG

Query:  FVKEGWELFHQMQ-DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYAS
         V EGW  F+ M+ +  IEP  +HY+C+VD+L R G L +AY FI +MPI P  ++WGALL  C+IH  V+L E  A+++F L+P NTG+YV ++N+YA 
Subjt:  FVKEGWELFHQMQ-DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYAS

Query:  AHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGII
        A  W+ V  +R  + Q+GL K+ G S IEI G +  F  GD S+P ++ I   L ++  R+   GY P  +  L D    E EE LC HSE+LA+A GII
Subjt:  AHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGII

Query:  STAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        S+  G  +R+TKNLR C +CH   K +SKL  REI++RD+ RFH FKDG CSC  FW
Subjt:  STAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Arabidopsis top hitse value%identityAlignment
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein2.4e-15740.74Show/hide
Query:  TALSKAAATSQEALLRKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPN
        T L K      E  + K +  L   L+ SG     F +    N     R VN A K F  + E D++ WN I+ GY+QN +   A+ M   M    ++P+
Subjt:  TALSKAAATSQEALLRKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPN

Query:  CFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKP
          T + +L A   + +  +GK++HG A + GF S V +  +LV MYAK G   +AR +FD + +R VVSW S+I  YVQN +P EA+ +F++M    +KP
Subjt:  CFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKP

Query:  DWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVR
          ++++  + A  D+ DL +G+ IH L  +LGL+    +V SL +MY K  +V+ A   F +++   L+ WNAMI G+A+NG   +A+  F +M S+ V+
Subjt:  DWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVR

Query:  LDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGV
         D+ T  S I A A++     A+W+ G + +S    ++FV TAL+DMYAKCG+I  AR++FD M ++ V  W+AMI GYG HG G+ A+ L++ M++  +
Subjt:  LDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGV

Query:  HPNDITFVALLTACKNSGFVKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFI
         PN +TF+++++AC +SG V+ G + F+ M +++ IE    HY  +VDLLGRAG LN+A+DFIM MP+KP V+V+GA+L +C+IH+ V   E AA++LF 
Subjt:  HPNDITFVALLTACKNSGFVKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFI

Query:  LDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEI
        L+P + G++V L+N+Y +A +W+ V  VR+ M ++GL K  G S +EI   + +F  G  +HP SK+I+  L++L   +K AGYVP    VL  + ++  
Subjt:  LDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEI

Query:  EETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        E+ L  HSE+LA+++G+++T  GTT+ + KNLR C +CH+A K IS +  REI++RD +RFHHFK+G CSCGD+W
Subjt:  EETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

AT3G12770.1 mitochondrial editing factor 221.7e-24357.91Show/hide
Query:  AATSQEALLRKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPNCFTFLY
        A+    A  +  L Q++ +L+V GL    FL+ K ++A     D+ +A + F ++    I  WNAII+GY++NN F  A+ MY++MQ++ V P+ FTF +
Subjt:  AATSQEALLRKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPNCFTFLY

Query:  MLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIA
        +LKAC G+S   +G+ +H Q F+ GF ++VFVQN L+++YAK  +  SAR VF+   L +RT+VSWT+I+S Y QNG+P+EAL +F +MR+ ++KPDW+A
Subjt:  MLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIA

Query:  LVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVRLDSV
        LVSV+ A+T ++DL QG+ IH  V K+GLE EPD+++SL TMYAK G+V  A+  F++M+ PNLILWNAMISGYAKNGY  EAI +F EMI+K+VR D++
Subjt:  LVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVRLDSV

Query:  TVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPND
        ++ SAI A AQVGS++ AR +  Y+ +S+Y+DD+F+++ALIDM+AKCGS+  AR+VFDR +D+DVV+WSAMI+GYGLHG  +EAISLY+ M++ GVHPND
Subjt:  TVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPND

Query:  ITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYN
        +TF+ LL AC +SG V+EGW  F++M DH I P  QHY+CV+DLLGRAG+L+QAY+ I  MP++PGV+VWGALLS+CK HR V LGE AA+QLF +DP N
Subjt:  ITFVALLTACKNSGFVKEGWELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYN

Query:  TGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLC
        TGHYVQLSNLYA+A LWD VA VR+ M +KGLNKD+G S +E+ G LE F VGD+SHPR +EI  +++ +E RLK  G+V + ++ LHDLN EE EETLC
Subjt:  TGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLC

Query:  NHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        +HSER+A+AYG+IST  GT LRITKNLRAC+NCH+A KLISKLVDREI++RD  RFHHFKDGVCSCGD+W
Subjt:  NHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-14539.19Show/hide
Query:  SLSTALSKAAATSQEALLRK-HLDQLYVQLI--VSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQV
        S S AL K    +   +  K    QL+ Q I   S  H  + +VI       +L+ ++ A   F  +    +L W ++I+ +T  ++F+ A+  + +M+ 
Subjt:  SLSTALSKAAATSQEALLRK-HLDQLYVQLI--VSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQV

Query:  SGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAK---FGQTLSARIVFDKLHDRT-------------------------
        SG  P+   F  +LK+C  M     G+ +HG   + G   +++  N+L++MYAK    G  +S   VFD++  RT                         
Subjt:  SGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAK---FGQTLSARIVFDKLHDRT-------------------------

Query:  --------VVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARF
                VVS+ +II+GY Q+G   +ALR+ +EM  +++KPD   L SV+  +++  D+ +GK IHG V + G++ +  I  SL  MYAK  ++E +  
Subjt:  --------VVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARF

Query:  FFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFAR
         F+++   + I WN++++GY +NG   EA++LFR+M++  V+  +V   S I A A + ++ L + L GY+ +  +  +IF+ +AL+DMY+KCG+I  AR
Subjt:  FFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFAR

Query:  IVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGYLNQ
         +FDRM   D V W+A+IMG+ LHGHG EA+SL++ MK+ GV PN + FVA+LTAC + G V E W  F+ M + +G+    +HY+ V DLLGRAG L +
Subjt:  IVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQM-QDHGIEPHHQHYSCVVDLLGRAGYLNQ

Query:  AYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVG
        AY+FI  M ++P  SVW  LLSSC +H+ + L E  A+++F +D  N G YV + N+YAS   W  +A +RL M +KGL K    S IE+      F  G
Subjt:  AYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVG

Query:  DRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDA
        DRSHP   +I E L  +  +++  GYV     VLHD++ E   E L  HSERLAVA+GII+T PGTT+R+TKN+R C +CH AIK ISK+ +REII+RD 
Subjt:  DRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDA

Query:  KRFHHFKDGVCSCGDFW
         RFHHF  G CSCGD+W
Subjt:  KRFHHFKDGVCSCGDFW

AT3G26782.1 Tetratricopeptide repeat (TPR)-like superfamily protein5.6e-15142.12Show/hide
Query:  ESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKL
        ++D+  WN++I    ++   A A+  ++ M+   + P   +F   +KAC  +     GKQ H QAF +G+ S++FV ++L+ MY+  G+   AR VFD++
Subjt:  ESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKL

Query:  HDRTVVSWTSIISGYVQNGDPVEALRVFKEM------RQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGK--VE
          R +VSWTS+I GY  NG+ ++A+ +FK++          M  D + LVSV++A + +   G  + IH  V K G +    +  +L   YAK G+  V 
Subjt:  HDRTVVSWTSIISGYVQNGDPVEALRVFKEM------RQSNMKPDWIALVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGK--VE

Query:  VARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGS
        VAR  F+Q+   + + +N+++S YA++G   EA ++FR ++ +K V  +++T+ + +LA +  G++ + + +   + +   +DD+ V T++IDMY KCG 
Subjt:  VARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNVRLDSVTVRSAILAGAQVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGS

Query:  IHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDH-GIEPHHQHYSCVVDLLGRA
        +  AR  FDRM +K+V  W+AMI GYG+HGH  +A+ L+  M  +GV PN ITFV++L AC ++G   EGW  F+ M+   G+EP  +HY C+VDLLGRA
Subjt:  IHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGWELFHQMQDH-GIEPHHQHYSCVVDLLGRA

Query:  GYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLE
        G+L +AYD I  M +KP   +W +LL++C+IH+ V L EI+  +LF LD  N G+Y+ LS++YA A  W  V  VR++M  +GL K  G S +E+NG + 
Subjt:  GYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLE

Query:  TFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREI
         F +GD  HP+ ++I+E L  L R+L  AGYV +  SV HD++ EE E TL  HSE+LA+A+GI++T PG+T+ + KNLR C +CH+ IKLISK+VDRE 
Subjt:  TFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLISKLVDREI

Query:  IIRDAKRFHHFKDGVCSCGDFW
        ++RDAKRFHHFKDG CSCGD+W
Subjt:  IIRDAKRFHHFKDGVCSCGDFW

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein1.9e-14640.03Show/hide
Query:  DQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGI
        +QL+  ++ SG  + + +    V   L  + V+ A K F E+ E D++ WN+II GY  N +    + ++  M VSG++ +  T + +   C    +  +
Subjt:  DQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQVSGVQPNCFTFLYMLKACGGMSVEGI

Query:  GKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLG
        G+ +H    K  F       N+L+ MY+K G   SA+ VF ++ DR+VVS+TS+I+GY + G   EA+++F+EM +  + PD   + +V+        L 
Subjt:  GKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIALVSVMTAYTDMEDLG

Query:  QGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNVRLDSVTVRSAILAGAQVGS
        +GK +H  + +  L F+  +  +L  MYAK G ++ A   F++M   ++I WN +I GY+KN Y  EA+ LF  ++  K    D  TV   + A A + +
Subjt:  QGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNVRLDSVTVRSAILAGAQVGS

Query:  IDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSG
         D  R + GYI ++ Y  D  V  +L+DMYAKCG++  A ++FD +  KD+V W+ MI GYG+HG G+EAI+L+  M+QAG+  ++I+FV+LL AC +SG
Subjt:  IDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSG

Query:  FVKEGWELFHQMQ-DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYAS
         V EGW  F+ M+ +  IEP  +HY+C+VD+L R G L +AY FI +MPI P  ++WGALL  C+IH  V+L E  A+++F L+P NTG+YV ++N+YA 
Subjt:  FVKEGWELFHQMQ-DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYAS

Query:  AHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGII
        A  W+ V  +R  + Q+GL K+ G S IEI G +  F  GD S+P ++ I   L ++  R+   GY P  +  L D    E EE LC HSE+LA+A GII
Subjt:  AHLWDHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGII

Query:  STAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        S+  G  +R+TKNLR C +CH   K +SKL  REI++RD+ RFH FKDG CSC  FW
Subjt:  STAPGTTLRITKNLRACINCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTTGCATTCATTTTCGCTCTCTCTCTCCTTGTCGTCGCTATCTACCGCTCTCTCAAAGGCGGCGGCAACCTCACAAGAGGCTTTATTGAGGAAGCATTTGGATCA
ATTATATGTCCAGTTAATTGTGTCTGGACTACACAAGTGCAGTTTCTTGGTGATCAAATTTGTCAATGCATGTCTGCATCTCAGAGATGTTAACTACGCGCACAAGGCTT
TTTGTGAAGTCTTAGAATCGGACATTTTGTTGTGGAATGCCATCATAAAGGGTTACACTCAAAATAATATTTTTGCTGGTGCTATCAGAATGTATACAGATATGCAAGTG
TCAGGCGTGCAACCAAATTGCTTCACATTTTTGTATATGCTTAAAGCATGCGGTGGAATGTCAGTTGAAGGAATAGGTAAACAGATGCATGGCCAGGCATTTAAATATGG
CTTTGGATCAAATGTTTTTGTGCAGAACAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAACCTTATCTGCTAGGATTGTTTTTGATAAGTTACATGATAGAACTGTTG
TTTCGTGGACTTCCATCATTTCTGGGTATGTTCAGAATGGTGATCCTGTGGAAGCATTGAGAGTTTTCAAAGAAATGAGACAAAGTAATATGAAACCTGATTGGATAGCC
CTTGTTAGTGTTATGACAGCTTATACAGACATGGAGGATTTGGGACAAGGAAAGGGCATTCATGGCTTAGTGACTAAATTGGGTCTAGAATTTGAACCAGATATAGTGGT
ATCACTCACTACCATGTATGCAAAACGTGGAAAAGTGGAAGTTGCCAGATTTTTCTTTAATCAGATGGAAAAACCAAATTTAATATTGTGGAATGCTATGATTTCTGGTT
ATGCAAAAAATGGATATGGTGAAGAAGCCATCAAGCTATTCCGTGAGATGATTTCAAAAAATGTAAGGCTTGATTCTGTTACTGTGAGGTCTGCTATTCTAGCCGGTGCC
CAAGTGGGATCTATTGATCTAGCAAGATGGCTGGATGGTTATATTTCTAAGAGTGAGTACAAGGATGATATTTTTGTAAACACAGCTCTTATAGATATGTATGCAAAATG
TGGAAGCATACATTTTGCTCGTATTGTTTTTGATAGAATGGTTGATAAAGACGTTGTCTTATGGAGTGCAATGATTATGGGGTATGGATTACATGGTCATGGACAAGAAG
CCATAAGTCTTTACAAGGGAATGAAGCAAGCTGGAGTTCATCCGAACGACATCACTTTTGTTGCCCTTCTCACAGCTTGCAAAAATTCAGGTTTTGTAAAAGAGGGATGG
GAGCTTTTCCACCAGATGCAAGACCATGGGATTGAACCGCATCACCAACACTACTCTTGCGTGGTGGATCTTCTAGGACGTGCAGGCTATTTGAATCAAGCGTATGATTT
CATTATGAGCATGCCAATTAAACCTGGAGTTAGTGTTTGGGGGGCTCTTCTGAGTTCATGCAAGATCCATCGCCAAGTGAGGCTGGGAGAAATTGCTGCAAAACAGCTCT
TCATATTAGATCCATATAATACAGGGCATTATGTGCAACTTTCAAACCTATATGCTTCTGCCCATTTATGGGATCATGTGGCAAACGTTCGATTAATGATGACGCAGAAA
GGACTGAACAAGGACCTCGGACATAGTTCCATTGAGATCAACGGAAATCTTGAGACGTTTCATGTTGGAGATAGATCACATCCTAGATCAAAGGAGATCTTTGAAGAGCT
TGATAGATTGGAGAGGAGATTAAAAGCAGCTGGTTATGTTCCTCATATGGAATCTGTTCTACATGACTTGAATCATGAAGAGATTGAGGAAACTCTTTGTAATCATAGTG
AGAGGCTTGCAGTTGCTTATGGCATCATCAGTACTGCTCCTGGAACTACACTTAGAATAACGAAGAATCTCCGAGCATGCATTAATTGCCATTCAGCGATAAAGCTTATA
TCGAAGCTTGTCGATAGGGAAATAATTATTCGAGACGCGAAACGTTTTCACCATTTCAAAGATGGAGTATGTTCATGTGGAGATTTTTGGTGA
mRNA sequenceShow/hide mRNA sequence
AAATTCGAAAAAGAATTGAGATTAATTTTTTTTCACAGTGGTTAAAAGAAACAGAGCATTTTTTTTTTCGAATTGAAAACGCGACGGCGATTCTCCTCTGTTCCAACAAT
TAGGGGTTCCGTCTTCGCCGGCCGCCGCCACATCACTGCCGTCTCTTAAGCGCTGCAATCGCACAGCTACATCGGAACTTGCTTCTGCGGAGAACTTCTTCACCGGAAAG
ATAGGGACTTGACGGAGCTGGCTTTTGCAGCTCCTCGCACTTCCAGTCATGTCTTTGCATTCATTTTCGCTCTCTCTCTCCTTGTCGTCGCTATCTACCGCTCTCTCAAA
GGCGGCGGCAACCTCACAAGAGGCTTTATTGAGGAAGCATTTGGATCAATTATATGTCCAGTTAATTGTGTCTGGACTACACAAGTGCAGTTTCTTGGTGATCAAATTTG
TCAATGCATGTCTGCATCTCAGAGATGTTAACTACGCGCACAAGGCTTTTTGTGAAGTCTTAGAATCGGACATTTTGTTGTGGAATGCCATCATAAAGGGTTACACTCAA
AATAATATTTTTGCTGGTGCTATCAGAATGTATACAGATATGCAAGTGTCAGGCGTGCAACCAAATTGCTTCACATTTTTGTATATGCTTAAAGCATGCGGTGGAATGTC
AGTTGAAGGAATAGGTAAACAGATGCATGGCCAGGCATTTAAATATGGCTTTGGATCAAATGTTTTTGTGCAGAACAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAA
CCTTATCTGCTAGGATTGTTTTTGATAAGTTACATGATAGAACTGTTGTTTCGTGGACTTCCATCATTTCTGGGTATGTTCAGAATGGTGATCCTGTGGAAGCATTGAGA
GTTTTCAAAGAAATGAGACAAAGTAATATGAAACCTGATTGGATAGCCCTTGTTAGTGTTATGACAGCTTATACAGACATGGAGGATTTGGGACAAGGAAAGGGCATTCA
TGGCTTAGTGACTAAATTGGGTCTAGAATTTGAACCAGATATAGTGGTATCACTCACTACCATGTATGCAAAACGTGGAAAAGTGGAAGTTGCCAGATTTTTCTTTAATC
AGATGGAAAAACCAAATTTAATATTGTGGAATGCTATGATTTCTGGTTATGCAAAAAATGGATATGGTGAAGAAGCCATCAAGCTATTCCGTGAGATGATTTCAAAAAAT
GTAAGGCTTGATTCTGTTACTGTGAGGTCTGCTATTCTAGCCGGTGCCCAAGTGGGATCTATTGATCTAGCAAGATGGCTGGATGGTTATATTTCTAAGAGTGAGTACAA
GGATGATATTTTTGTAAACACAGCTCTTATAGATATGTATGCAAAATGTGGAAGCATACATTTTGCTCGTATTGTTTTTGATAGAATGGTTGATAAAGACGTTGTCTTAT
GGAGTGCAATGATTATGGGGTATGGATTACATGGTCATGGACAAGAAGCCATAAGTCTTTACAAGGGAATGAAGCAAGCTGGAGTTCATCCGAACGACATCACTTTTGTT
GCCCTTCTCACAGCTTGCAAAAATTCAGGTTTTGTAAAAGAGGGATGGGAGCTTTTCCACCAGATGCAAGACCATGGGATTGAACCGCATCACCAACACTACTCTTGCGT
GGTGGATCTTCTAGGACGTGCAGGCTATTTGAATCAAGCGTATGATTTCATTATGAGCATGCCAATTAAACCTGGAGTTAGTGTTTGGGGGGCTCTTCTGAGTTCATGCA
AGATCCATCGCCAAGTGAGGCTGGGAGAAATTGCTGCAAAACAGCTCTTCATATTAGATCCATATAATACAGGGCATTATGTGCAACTTTCAAACCTATATGCTTCTGCC
CATTTATGGGATCATGTGGCAAACGTTCGATTAATGATGACGCAGAAAGGACTGAACAAGGACCTCGGACATAGTTCCATTGAGATCAACGGAAATCTTGAGACGTTTCA
TGTTGGAGATAGATCACATCCTAGATCAAAGGAGATCTTTGAAGAGCTTGATAGATTGGAGAGGAGATTAAAAGCAGCTGGTTATGTTCCTCATATGGAATCTGTTCTAC
ATGACTTGAATCATGAAGAGATTGAGGAAACTCTTTGTAATCATAGTGAGAGGCTTGCAGTTGCTTATGGCATCATCAGTACTGCTCCTGGAACTACACTTAGAATAACG
AAGAATCTCCGAGCATGCATTAATTGCCATTCAGCGATAAAGCTTATATCGAAGCTTGTCGATAGGGAAATAATTATTCGAGACGCGAAACGTTTTCACCATTTCAAAGA
TGGAGTATGTTCATGTGGAGATTTTTGGTGAAATCGGATCTGCTGGATTGGACACTGAGGTCTACGACTTGGTGGCCTTTGGGGTCGGCTGTTTTGTGGAATCCCATGGA
TTTGCTGAGGAATCGCTCTTCTATGACCGCTGTGAGGTACAGGTACCTGAGCACAACACCTTGCCTCGATGGTCTCTCAGCACCCACATGAGCCCTCCTCGTCCATGGTT
ATTGCTCCAGGCTGCATCCGTGTCAAATTTCCACCAATTGATCAGTGGAGAAGCCAGTTGTTACTCATCCTCTGTATTTGGGCTTGATTAAGGAGGATATGTGGTTTGGC
ATTGTGATTTGTATGTATTAGCGATCAAGATTGAGCCCTTACAAATGCTCAAAGCTTTTTCAAAGTTGTAGAAGTCAATTGAGTTTGAATAGTAAAGCTAATTGGAGTGG
AATGAAGTTTCTGAAAATGTGTTGTTAAGACAAACTATTCAAAAGGTGGTAATGAGGTATATTTTTTTTGGAGAAAGTTTTATATTGCTATTTGCTTGCAGAAAAATGGT
CTATTCTATTATACAATTATTTATTGTGTTATCAAAGACTCGATCATCTTGGTCAGAGTCATTCACTTCAATTTCTACTAAGCAAACTGCAACTCACAGGCTATAGTCAA
TATAAACATTCCAACCATGTGTTATTAGCCTGTAGTTTTGTGATAGCTCCCTGTATGTTCAGCTGACATCTTAAATGTACCAAAGATTAAGGAAACACACTGCAATCGAG
TCAGGTACTTGCCAAAATTATTTGACTTGTAGTTCATCTATTGTCAAATTAGGCTTGATGAAGATGCATTCAACATAAGATGATGAAAGTTTGAAACTCTTCACTTTACA
CTTTTCGAATAAAAGTTCATATTATATATTTATTTATGTTGTTCTTGACAATTTTTTTGTTTGTAGGTTGATAAAAGAACTACTTCACAACAATGGAATGTCTTGGTTAC
AATACAACTTGAGTTATTTGAATTTCTGATTACAACATATTTTATAATTATTGAATGGTGTATTAATGAAGTTTATATATTTTTTTTATCATCTTAAGTTAATATATTCA
TTACTATATCTATTTTGTGGGGTA
Protein sequenceShow/hide protein sequence
MSLHSFSLSLSLSSLSTALSKAAATSQEALLRKHLDQLYVQLIVSGLHKCSFLVIKFVNACLHLRDVNYAHKAFCEVLESDILLWNAIIKGYTQNNIFAGAIRMYTDMQV
SGVQPNCFTFLYMLKACGGMSVEGIGKQMHGQAFKYGFGSNVFVQNSLVSMYAKFGQTLSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALRVFKEMRQSNMKPDWIA
LVSVMTAYTDMEDLGQGKGIHGLVTKLGLEFEPDIVVSLTTMYAKRGKVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNVRLDSVTVRSAILAGA
QVGSIDLARWLDGYISKSEYKDDIFVNTALIDMYAKCGSIHFARIVFDRMVDKDVVLWSAMIMGYGLHGHGQEAISLYKGMKQAGVHPNDITFVALLTACKNSGFVKEGW
ELFHQMQDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSSCKIHRQVRLGEIAAKQLFILDPYNTGHYVQLSNLYASAHLWDHVANVRLMMTQK
GLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYVPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTTLRITKNLRACINCHSAIKLI
SKLVDREIIIRDAKRFHHFKDGVCSCGDFW