CuGenDBv2

Moc09g34670 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g34670
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr9:26546181..26549046
RNA-Seq Expression: Moc09g34670
Synteny: Moc09g34670
Gene Ontology terms: GO:0080156 - mitochondrial mRNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology
GenBank top hits | e value | % identity | Alignment
KAG7030779.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma] | 0.0e+00 | 89.29
Query:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV
        MSLHSFSLSLSLSS+STA SK AATSQEALL RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHL DV YAHK FREVLEPDILLWN +IKGYTQNNIF 
Subjt:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV

Query:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP
        GA+++Y +MQVSGV+PDCFTFLYVLKACGGMS+E IGKQMH QTFKYG GSNVFVQNSLVSMYA+FGQTS AR+VFDKL +RTVVSWTSIISGYVQNGDP
Subjt:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP

Query:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY
         +AL VFK MR+S VKLDWI LVSV+TAYTDMEDLGQGK+IH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKNI VDSVTVRSAILA AQ GSL+LARWLDGYISKSEYRDD FVNTALIDM+AKCGSI FA  VFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI
        HG+EAI+LYN MKQ GVRPNDVTFVGLLTACKNSGLVKEGW+LFH++RDHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIM+MPIKPGVSVWGALLS CKI
Subjt:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI

Query:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQV+LGEIAAEQLF LDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        + HMESVLHDLN EEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLV+REII+RDAKRFH FKDG CSCGDFW
Subjt:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

XP_022139117.1 pentatricopeptide repeat-containing protein At3g12770 [Momordica charantia] | 0.0e+00 | 100
Query:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV
        MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV
Subjt:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV

Query:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP
        GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP
Subjt:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP

Query:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY
        AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY
Subjt:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI
        HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI
Subjt:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI

Query:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
Subjt:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

XP_022941627.1 pentatricopeptide repeat-containing protein At3g12770 isoform X2 [Cucurbita moschata] | 0.0e+00 | 89.44
Query:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV
        MSLHSFSLSLSLSSLSTA SK AATSQEALL RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHL DV YAHK FREVLEPDILLWN +IKGYTQNNIF 
Subjt:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV

Query:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP
        GA+++Y +MQVSGV+PDCFTFLYVLKACGGMS+E IGKQMH QTFKYG GSNVFVQNSLVSMYA+FGQTS AR+VFDKL +RTVVSWTSIISGYVQNGDP
Subjt:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP

Query:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY
         +AL VFK MR+S VKLDWI LVSV+TAYTDMEDLGQGK+IH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKNI VDSVTVRSAILA AQ GSL+LARWLDGYISKSEYRDD FVNTALIDM+AKCGSI FA  VFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI
        HG+EAI+LYN MKQ GV PN+VTFVGLLTACKNSGLVKEGW+LFH++RDHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIM+MPIKPGVSVWGALLS CKI
Subjt:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI

Query:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQV+LGEIAAEQLF LDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        + HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREII+RDAKRFH FKDG CSCGDFW
Subjt:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

XP_022988451.1 pentatricopeptide repeat-containing protein At3g12770 isoform X2 [Cucurbita maxima] | 0.0e+00 | 89.29
Query:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV
        MSLHSFSLSLSL+SLSTA SK AATSQEALL RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHL DV YAHK FREVLEPDILLWN +IKGYTQNNIF 
Subjt:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV

Query:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP
        GA+++Y +MQVSGV+PDCFTFLYVLKACGGMS+E IGKQMH QTFKYGFGSNVFVQNSLVSMYA++GQTS AR+VFDKL +RTVVSWTSIISGYVQNGDP
Subjt:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP

Query:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY
         +AL VFK MRQS VKLDWI LVSVMTAYTDMEDLGQGK+IH LVTKLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKNI VDSVTVRSAILA AQVGSL+LARWLDGYISKSEYRDD FVNTALIDM+AKCGSI FA  VFDRMVDKD+V WSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI
        HG+EAI+LYN MKQ G+RPNDVTFVGLLTACKNSGLVKEGW+LFH+++DHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIM+MPIKPGVSVWGALLS CKI
Subjt:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI

Query:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQV+LGEIAAEQLF LDPYNTGHYVQLSNLYASAHLWN VANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        + HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREIIIRD KRFH FKDG CSCGDFW
Subjt:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

XP_038892943.1 pentatricopeptide repeat-containing protein At3g12770 [Benincasa hispida] | 0.0e+00 | 88.57
Query:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV
        MSLHSFSLSLSLSSLS+A SK A TS EA L RKHLDQLYVQLIVSGLHKC FL+IKFVN+CLH GDV YAHKAFREV+EPDILLWNA+IKGYTQ NI  
Subjt:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV

Query:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP
        GA+++Y +MQ+S V+P+CFTFLYVLKAC GMS+E IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTS AR+VFDKL DRTVVSWTSIISGYVQNGDP
Subjt:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP

Query:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY
         EAL +FK+MRQ NVKLDWI LVSVMTAYTD+EDLGQGKSIHGLVTKLGLEFEPDIV+SLTTMYAK G VE+ARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAIKLF EMISKNIRVDSVTVRSAILAGAQVGSL LARWLD YIS+SEYRDDTFVNT+L+DMYAKCGSIYFA  VFDRMV KDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI
        HG+EAIN YN MKQ GV PNDVTFVGLLTACKNSGLVKEGW+LFH+++D+GIEPHHQHYSCVVDLLGRAGYLNQAYDFIM+MP+KPGVSVWGALLSACKI
Subjt:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI

Query:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HR+V+LGEIAAEQLF LDPYN G++VQLSNLYASAHLW HVANVRLMMTQKGLNKDLGHSSI+INGNLETFHVGDRSHPRSKEIFEELDRLE+RLKAAGY
Subjt:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        +PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CD43 pentatricopeptide repeat-containing protein At3g12770 | 0.0e+00 | 100
Query:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV
        MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV
Subjt:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV

Query:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP
        GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP
Subjt:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP

Query:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY
        AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY
Subjt:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI
        HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI
Subjt:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI

Query:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
Subjt:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

A0A6J1FLM1 pentatricopeptide repeat-containing protein At3g12770 isoform X1 | 0.0e+00 | 88.67
Query:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLG------RKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYT
        MSLHSFSLSLSLSSLSTA SK AATSQEALL       RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHL DV YAHK FREVLEPDILLWN +IKGYT
Subjt:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLG------RKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYT

Query:  QNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGY
        QNNIF GA+++Y +MQVSGV+PDCFTFLYVLKACGGMS+E IGKQMH QTFKYG GSNVFVQNSLVSMYA+FGQTS AR+VFDKL +RTVVSWTSIISGY
Subjt:  QNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGY

Query:  VQNGDPAEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISG
        VQNGDP +AL VFK MR+S VKLDWI LVSV+TAYTDMEDLGQGK+IH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISG
Subjt:  VQNGDPAEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISG

Query:  YAKNGYGEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIM
        YAKNGYGEEAI+LFR+MISKNI VDSVTVRSAILA AQ GSL+LARWLDGYISKSEYRDD FVNTALIDM+AKCGSI FA  VFDRMVDKDVVLWSAMIM
Subjt:  YAKNGYGEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIM

Query:  GYGLHGHGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGAL
        GYGLHGHG+EAI+LYN MKQ GV PN+VTFVGLLTACKNSGLVKEGW+LFH++RDHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIM+MPIKPGVSVWGAL
Subjt:  GYGLHGHGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGAL

Query:  LSACKIHRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
        LS CKIHRQV+LGEIAAEQLF LDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
Subjt:  LSACKIHRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR

Query:  LKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        LKAAGY+ HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREII+RDAKRFH FKDG CSCGDFW
Subjt:  LKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

A0A6J1FN08 pentatricopeptide repeat-containing protein At3g12770 isoform X2 | 0.0e+00 | 89.44
Query:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV
        MSLHSFSLSLSLSSLSTA SK AATSQEALL RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHL DV YAHK FREVLEPDILLWN +IKGYTQNNIF 
Subjt:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV

Query:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP
        GA+++Y +MQVSGV+PDCFTFLYVLKACGGMS+E IGKQMH QTFKYG GSNVFVQNSLVSMYA+FGQTS AR+VFDKL +RTVVSWTSIISGYVQNGDP
Subjt:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP

Query:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY
         +AL VFK MR+S VKLDWI LVSV+TAYTDMEDLGQGK+IH LVTKLGLEFEPDIVVSLT MYAK G+VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKNI VDSVTVRSAILA AQ GSL+LARWLDGYISKSEYRDD FVNTALIDM+AKCGSI FA  VFDRMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI
        HG+EAI+LYN MKQ GV PN+VTFVGLLTACKNSGLVKEGW+LFH++RDHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIM+MPIKPGVSVWGALLS CKI
Subjt:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI

Query:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQV+LGEIAAEQLF LDPYNTGHYVQLSNLYASAHLWNHV NVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        + HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREII+RDAKRFH FKDG CSCGDFW
Subjt:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

A0A6J1JD09 pentatricopeptide repeat-containing protein At3g12770 isoform X1 | 0.0e+00 | 88.52
Query:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLG------RKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYT
        MSLHSFSLSLSL+SLSTA SK AATSQEALL       RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHL DV YAHK FREVLEPDILLWN +IKGYT
Subjt:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLG------RKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYT

Query:  QNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGY
        QNNIF GA+++Y +MQVSGV+PDCFTFLYVLKACGGMS+E IGKQMH QTFKYGFGSNVFVQNSLVSMYA++GQTS AR+VFDKL +RTVVSWTSIISGY
Subjt:  QNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGY

Query:  VQNGDPAEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISG
        VQNGDP +AL VFK MRQS VKLDWI LVSVMTAYTDMEDLGQGK+IH LVTKLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMISG
Subjt:  VQNGDPAEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISG

Query:  YAKNGYGEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIM
        YAKNGYGEEAI+LFR+MISKNI VDSVTVRSAILA AQVGSL+LARWLDGYISKSEYRDD FVNTALIDM+AKCGSI FA  VFDRMVDKD+V WSAMIM
Subjt:  YAKNGYGEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIM

Query:  GYGLHGHGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGAL
        GYGLHGHG+EAI+LYN MKQ G+RPNDVTFVGLLTACKNSGLVKEGW+LFH+++DHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIM+MPIKPGVSVWGAL
Subjt:  GYGLHGHGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGAL

Query:  LSACKIHRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
        LS CKIHRQV+LGEIAAEQLF LDPYNTGHYVQLSNLYASAHLWN VANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR
Subjt:  LSACKIHRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERR

Query:  LKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        LKAAGY+ HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREIIIRD KRFH FKDG CSCGDFW
Subjt:  LKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

A0A6J1JH82 pentatricopeptide repeat-containing protein At3g12770 isoform X2 | 0.0e+00 | 89.29
Query:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV
        MSLHSFSLSLSL+SLSTA SK AATSQEALL RKHLDQLYVQLIVSGL+KC FLVIKFVNACLHL DV YAHK FREVLEPDILLWN +IKGYTQNNIF 
Subjt:  MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFV

Query:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP
        GA+++Y +MQVSGV+PDCFTFLYVLKACGGMS+E IGKQMH QTFKYGFGSNVFVQNSLVSMYA++GQTS AR+VFDKL +RTVVSWTSIISGYVQNGDP
Subjt:  GAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDP

Query:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY
         +AL VFK MRQS VKLDWI LVSVMTAYTDMEDLGQGK+IH LVTKLGLEFEPDIVVSLT MYAK G+VE+ARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  AEALSVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKNI VDSVTVRSAILA AQVGSL+LARWLDGYISKSEYRDD FVNTALIDM+AKCGSI FA  VFDRMVDKD+V WSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHG

Query:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI
        HG+EAI+LYN MKQ G+RPNDVTFVGLLTACKNSGLVKEGW+LFH+++DHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIM+MPIKPGVSVWGALLS CKI
Subjt:  HGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKI

Query:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
        HRQV+LGEIAAEQLF LDPYNTGHYVQLSNLYASAHLWN VANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY
Subjt:  HRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGY

Query:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        + HMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREIIIRD KRFH FKDG CSCGDFW
Subjt:  IPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

SwissProt top hits | e value | % identity | Alignment
Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic | 7.7e-153 | 40.09
Query:  TAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHP
        T   K+     E  +G+    +++  L+ SG     F +    N       V  A K F  + E D++ WN ++ GY+QN +   A+++   M    + P
Subjt:  TAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHP

Query:  DCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDPAEALSVFKKMRQSNVK
           T + VL A   + +  +GK++HG   + GF S V +  +LV MYAK G    AR +FD + +R VVSW S+I  YVQN +P EA+ +F+KM    VK
Subjt:  DCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDPAEALSVFKKMRQSNVK

Query:  LDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMISKNI
           ++++  + A  D+ DL +G+ IH L  +LGL+    +V SL +MY K  +V+ A   F +++   LV WNAMI G+A+NG   +A+  F +M S+ +
Subjt:  LDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMISKNI

Query:  RVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVG
        + D+ T  S I A A++     A+W+ G + +S    + FV TAL+DMYAKCG+I  A L+FD M ++ V  W+AMI GYG HG GK A+ L+  M++  
Subjt:  RVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVG

Query:  VRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRD-HGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLF
        ++PN VTF+ +++AC +SGLV+ G   F+ +++ + IE    HY  +VDLLGRAG LN+A+DFIM MP+KP V+V+GA+L AC+IH+ V   E AAE+LF
Subjt:  VRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRD-HGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLF

Query:  SLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEE
         L+P + G++V L+N+Y +A +W  V  VR+ M ++GL K  G S +EI   + +F  G  +HP SK+I+  L++L   +K AGY+P    VL  + ++ 
Subjt:  SLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEE

Query:  IEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
         E+ L  HSE+LA+++G+++T  GT + +  NLR C +CH+A K IS +  REI++RD +RFH FK+GACSCGD+W
Subjt:  IEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic | 2.0e-145 | 38.12
Query:  LYVQLIVSGLHKCRFLVIKFVNACL---HLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEE
        ++ Q+I  GLH   + + K +  C+   H   + YA   F+ + EP++L+WN + +G+  ++  V A+KLY  M   G+ P+ +TF +VLK+C      +
Subjt:  LYVQLIVSGLHKCRFLVIKFVNACL---HLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEE

Query:  IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDR-------------------------------TVVSWTSIISGYVQNGDPAEAL
         G+Q+HG   K G   +++V  SL+SMY + G+   A  VFDK   R                                VVSW ++ISGY + G+  EAL
Subjt:  IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDR-------------------------------TVVSWTSIISGYVQNGDPAEAL

Query:  SVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEA
         +FK M ++NV+ D   +V+V++A      +  G+ +H  +   G      IV +L  +Y+K G++E A   F ++   +++ WN +I GY      +EA
Subjt:  SVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEA

Query:  IKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISK--SEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHG
        + LF+EM+      + VT+ S + A A +G++D+ RW+  YI K      + + + T+LIDMYAKCG I  A  VF+ ++ K +  W+AMI G+ +HG  
Subjt:  IKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISK--SEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHG

Query:  KEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEI-RDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIH
          + +L++ M+++G++P+D+TFVGLL+AC +SG++  G  +F  + +D+ + P  +HY C++DLLG +G   +A + I  M ++P   +W +LL ACK+H
Subjt:  KEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEI-RDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIH

Query:  RQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYI
          V+LGE  AE L  ++P N G YV LSN+YASA  WN VA  R ++  KG+ K  G SSIEI+  +  F +GD+ HPR++EI+  L+ +E  L+ AG++
Subjt:  RQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYI

Query:  PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        P    VL ++  E  E  L +HSE+LA+A+G+IST PGT L I  NLR C NCH A KLISK+  REII RD  RFH F+DG CSC D+W
Subjt:  PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g12770 | 3.0e-242 | 58.55
Query:  RKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMS
        +  L Q++ +L+V GL    FL+ K ++A    GD+ +A + F ++  P I  WNA+I+GY++NN F  A+ +Y+ MQ++ V PD FTF ++LKAC G+S
Subjt:  RKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMS

Query:  IEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFD--KLQDRTVVSWTSIISGYVQNGDPAEALSVFKKMRQSNVKLDWIALVSVMTAYT
          ++G+ +H Q F+ GF ++VFVQN L+++YAK  +   AR VF+   L +RT+VSWT+I+S Y QNG+P EAL +F +MR+ +VK DW+ALVSV+ A+T
Subjt:  IEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFD--KLQDRTVVSWTSIISGYVQNGDPAEALSVFKKMRQSNVKLDWIALVSVMTAYT

Query:  DMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSAILAG
         ++DL QG+SIH  V K+GLE EPD+++SL TMYAK GQV  A+  F++M+ PNL+LWNAMISGYAKNGY  EAI +F EMI+K++R D++++ SAI A 
Subjt:  DMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSAILAG

Query:  AQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVGVRPNDVTFVGLLTA
        AQVGSL+ AR +  Y+ +S+YRDD F+++ALIDM+AKCGS+  A LVFDR +D+DVV+WSAMI+GYGLHG  +EAI+LY AM++ GV PNDVTF+GLL A
Subjt:  AQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVGVRPNDVTFVGLLTA

Query:  CKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLFSLDPYNTGHYVQLSN
        C +SG+V+EGW  F+ + DH I P  QHY+CV+DLLGRAG+L+QAY+ I  MP++PGV+VWGALLSACK HR V+LGE AA+QLFS+DP NTGHYVQLSN
Subjt:  CKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLFSLDPYNTGHYVQLSN

Query:  LYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVA
        LYA+A LW+ VA VR+ M +KGLNKD+G S +E+ G LE F VGD+SHPR +EI  +++ +E RLK  G++ + ++ LHDLN EE EETLC+HSER+A+A
Subjt:  LYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVA

Query:  YGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        YG+IST  GTPLRIT NLRACVNCH+A KLISKLVDREI++RD  RFH FKDG CSCGD+W
Subjt:  YGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

Q9LW32 Pentatricopeptide repeat-containing protein At3g26782, mitochondrial | 2.3e-149 | 42.01
Query:  REVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVV
        R V + D+  WN+VI    ++     A+  ++ M+   ++P   +F   +KAC  +     GKQ H Q F +G+ S++FV ++L+ MY+  G+   AR V
Subjt:  REVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVV

Query:  FDKLQDRTVVSWTSIISGYVQNGDPAEALSVFKKM------RQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQ
        FD++  R +VSWTS+I GY  NG+  +A+S+FK +          + LD + LVSV++A + +   G  +SIH  V K G +    +  +L   YAK G+
Subjt:  FDKLQDRTVVSWTSIISGYVQNGDPAEALSVFKKM------RQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQ

Query:  --VEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMI-SKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYA
          V VAR  F+Q+   + V +N+++S YA++G   EA ++FR ++ +K +  +++T+ + +LA +  G+L + + +   + +    DD  V T++IDMY 
Subjt:  --VEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMI-SKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYA

Query:  KCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDH-GIEPHHQHYSCVVDL
        KCG +  A   FDRM +K+V  W+AMI GYG+HGH  +A+ L+ AM   GVRPN +TFV +L AC ++GL  EGW  F+ ++   G+EP  +HY C+VDL
Subjt:  KCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDH-GIEPHHQHYSCVVDL

Query:  LGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEIN
        LGRAG+L +AYD I  M +KP   +W +LL+AC+IH+ V+L EI+  +LF LD  N G+Y+ LS++YA A  W  V  VR++M  +GL K  G S +E+N
Subjt:  LGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEIN

Query:  GNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLV
        G +  F +GD  HP+ ++I+E L  L R+L  AGY+ +  SV HD++ EE E TL  HSE+LA+A+GI++T PG+ + +  NLR C +CH+ IKLISK+V
Subjt:  GNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLV

Query:  DREIIIRDAKRFHRFKDGACSCGDFW
        DRE ++RDAKRFH FKDG CSCGD+W
Subjt:  DREIIIRDAKRFHRFKDGACSCGDFW

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic (e value: 1.6e-145; % identity: 39.27)
Query:  SLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTE
        S + S +S +FS L +            +QL+  ++ SG  +   +    V   L    V  A K F E+ E D++ WN++I GY  N +    + ++ +
Subjt:  SLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTE

Query:  MQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDPAEALSVFK
        M VSG+  D  T + V   C    +  +G+ +H    K  F       N+L+ MY+K G    A+ VF ++ DR+VVS+TS+I+GY + G   EA+ +F+
Subjt:  MQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDPAEALSVFK

Query:  KMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLF
        +M +  +  D   + +V+        L +GK +H  + +  L F+  +  +L  MYAK G ++ A   F++M   +++ WN +I GY+KN Y  EA+ LF
Subjt:  KMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLF

Query:  REMI-SKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAIN
          ++  K    D  TV   + A A + + D  R + GYI ++ Y  D  V  +L+DMYAKCG++  A ++FD +  KD+V W+ MI GYG+HG GKEAI 
Subjt:  REMI-SKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAIN

Query:  LYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIR-DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKL
        L+N M+Q G+  ++++FV LL AC +SGLV EGW  F+ +R +  IEP  +HY+C+VD+L R G L +AY FI NMPI P  ++WGALL  C+IH  VKL
Subjt:  LYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIR-DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKL

Query:  GEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMES
         E  AE++F L+P NTG+YV ++N+YA A  W  V  +R  + Q+GL K+ G S IEI G +  F  GD S+P ++ I   L ++  R+   GY P  + 
Subjt:  GEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMES

Query:  VLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
         L D    E EE LC HSE+LA+A GIIS+  G  +R+T NLR C +CH   K +SKL  REI++RD+ RFH+FKDG CSC  FW
Subjt:  VLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.4e-146; % identity: 38.12)
Query:  LYVQLIVSGLHKCRFLVIKFVNACL---HLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEE
        ++ Q+I  GLH   + + K +  C+   H   + YA   F+ + EP++L+WN + +G+  ++  V A+KLY  M   G+ P+ +TF +VLK+C      +
Subjt:  LYVQLIVSGLHKCRFLVIKFVNACL---HLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEE

Query:  IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDR-------------------------------TVVSWTSIISGYVQNGDPAEAL
         G+Q+HG   K G   +++V  SL+SMY + G+   A  VFDK   R                                VVSW ++ISGY + G+  EAL
Subjt:  IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDR-------------------------------TVVSWTSIISGYVQNGDPAEAL

Query:  SVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEA
         +FK M ++NV+ D   +V+V++A      +  G+ +H  +   G      IV +L  +Y+K G++E A   F ++   +++ WN +I GY      +EA
Subjt:  SVFKKMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEA

Query:  IKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISK--SEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHG
        + LF+EM+      + VT+ S + A A +G++D+ RW+  YI K      + + + T+LIDMYAKCG I  A  VF+ ++ K +  W+AMI G+ +HG  
Subjt:  IKLFREMISKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISK--SEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHG

Query:  KEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEI-RDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIH
          + +L++ M+++G++P+D+TFVGLL+AC +SG++  G  +F  + +D+ + P  +HY C++DLLG +G   +A + I  M ++P   +W +LL ACK+H
Subjt:  KEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEI-RDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIH

Query:  RQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYI
          V+LGE  AE L  ++P N G YV LSN+YASA  WN VA  R ++  KG+ K  G SSIEI+  +  F +GD+ HPR++EI+  L+ +E  L+ AG++
Subjt:  RQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYI

Query:  PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        P    VL ++  E  E  L +HSE+LA+A+G+IST PGT L I  NLR C NCH A KLISK+  REII RD  RFH F+DG CSC D+W
Subjt:  PHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 5.5e-154; % identity: 40.09)
Query:  TAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHP
        T   K+     E  +G+    +++  L+ SG     F +    N       V  A K F  + E D++ WN ++ GY+QN +   A+++   M    + P
Subjt:  TAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHP

Query:  DCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDPAEALSVFKKMRQSNVK
           T + VL A   + +  +GK++HG   + GF S V +  +LV MYAK G    AR +FD + +R VVSW S+I  YVQN +P EA+ +F+KM    VK
Subjt:  DCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDPAEALSVFKKMRQSNVK

Query:  LDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMISKNI
           ++++  + A  D+ DL +G+ IH L  +LGL+    +V SL +MY K  +V+ A   F +++   LV WNAMI G+A+NG   +A+  F +M S+ +
Subjt:  LDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMISKNI

Query:  RVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVG
        + D+ T  S I A A++     A+W+ G + +S    + FV TAL+DMYAKCG+I  A L+FD M ++ V  W+AMI GYG HG GK A+ L+  M++  
Subjt:  RVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVG

Query:  VRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRD-HGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLF
        ++PN VTF+ +++AC +SGLV+ G   F+ +++ + IE    HY  +VDLLGRAG LN+A+DFIM MP+KP V+V+GA+L AC+IH+ V   E AAE+LF
Subjt:  VRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRD-HGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLF

Query:  SLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEE
         L+P + G++V L+N+Y +A +W  V  VR+ M ++GL K  G S +EI   + +F  G  +HP SK+I+  L++L   +K AGY+P    VL  + ++ 
Subjt:  SLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEE

Query:  IEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
         E+ L  HSE+LA+++G+++T  GT + +  NLR C +CH+A K IS +  REI++RD +RFH FK+GACSCGD+W
Subjt:  IEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

AT3G12770.1 mitochondrial editing factor 22 (e value: 2.2e-243; % identity: 58.55)
Query:  RKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMS
        +  L Q++ +L+V GL    FL+ K ++A    GD+ +A + F ++  P I  WNA+I+GY++NN F  A+ +Y+ MQ++ V PD FTF ++LKAC G+S
Subjt:  RKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMS

Query:  IEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFD--KLQDRTVVSWTSIISGYVQNGDPAEALSVFKKMRQSNVKLDWIALVSVMTAYT
          ++G+ +H Q F+ GF ++VFVQN L+++YAK  +   AR VF+   L +RT+VSWT+I+S Y QNG+P EAL +F +MR+ +VK DW+ALVSV+ A+T
Subjt:  IEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFD--KLQDRTVVSWTSIISGYVQNGDPAEALSVFKKMRQSNVKLDWIALVSVMTAYT

Query:  DMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSAILAG
         ++DL QG+SIH  V K+GLE EPD+++SL TMYAK GQV  A+  F++M+ PNL+LWNAMISGYAKNGY  EAI +F EMI+K++R D++++ SAI A 
Subjt:  DMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSAILAG

Query:  AQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVGVRPNDVTFVGLLTA
        AQVGSL+ AR +  Y+ +S+YRDD F+++ALIDM+AKCGS+  A LVFDR +D+DVV+WSAMI+GYGLHG  +EAI+LY AM++ GV PNDVTF+GLL A
Subjt:  AQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVGVRPNDVTFVGLLTA

Query:  CKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLFSLDPYNTGHYVQLSN
        C +SG+V+EGW  F+ + DH I P  QHY+CV+DLLGRAG+L+QAY+ I  MP++PGV+VWGALLSACK HR V+LGE AA+QLFS+DP NTGHYVQLSN
Subjt:  CKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLFSLDPYNTGHYVQLSN

Query:  LYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVA
        LYA+A LW+ VA VR+ M +KGLNKD+G S +E+ G LE F VGD+SHPR +EI  +++ +E RLK  G++ + ++ LHDLN EE EETLC+HSER+A+A
Subjt:  LYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVA

Query:  YGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
        YG+IST  GTPLRIT NLRACVNCH+A KLISKLVDREI++RD  RFH FKDG CSCGD+W
Subjt:  YGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW

AT3G26782.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.6e-150; % identity: 42.01)
Query:  REVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVV
        R V + D+  WN+VI    ++     A+  ++ M+   ++P   +F   +KAC  +     GKQ H Q F +G+ S++FV ++L+ MY+  G+   AR V
Subjt:  REVLEPDILLWNAVIKGYTQNNIFVGAVKLYTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVV

Query:  FDKLQDRTVVSWTSIISGYVQNGDPAEALSVFKKM------RQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQ
        FD++  R +VSWTS+I GY  NG+  +A+S+FK +          + LD + LVSV++A + +   G  +SIH  V K G +    +  +L   YAK G+
Subjt:  FDKLQDRTVVSWTSIISGYVQNGDPAEALSVFKKM------RQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQ

Query:  --VEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMI-SKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYA
          V VAR  F+Q+   + V +N+++S YA++G   EA ++FR ++ +K +  +++T+ + +LA +  G+L + + +   + +    DD  V T++IDMY 
Subjt:  --VEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMI-SKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYA

Query:  KCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDH-GIEPHHQHYSCVVDL
        KCG +  A   FDRM +K+V  W+AMI GYG+HGH  +A+ L+ AM   GVRPN +TFV +L AC ++GL  EGW  F+ ++   G+EP  +HY C+VDL
Subjt:  KCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIRDH-GIEPHHQHYSCVVDL

Query:  LGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEIN
        LGRAG+L +AYD I  M +KP   +W +LL+AC+IH+ V+L EI+  +LF LD  N G+Y+ LS++YA A  W  V  VR++M  +GL K  G S +E+N
Subjt:  LGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEIN

Query:  GNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLV
        G +  F +GD  HP+ ++I+E L  L R+L  AGY+ +  SV HD++ EE E TL  HSE+LA+A+GI++T PG+ + +  NLR C +CH+ IKLISK+V
Subjt:  GNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLV

Query:  DREIIIRDAKRFHRFKDGACSCGDFW
        DRE ++RDAKRFH FKDG CSCGD+W
Subjt:  DREIIIRDAKRFHRFKDGACSCGDFW

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 1.1e-146; % identity: 39.27)
Query:  SLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTE
        S + S +S +FS L +            +QL+  ++ SG  +   +    V   L    V  A K F E+ E D++ WN++I GY  N +    + ++ +
Subjt:  SLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKLYTE

Query:  MQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDPAEALSVFK
        M VSG+  D  T + V   C    +  +G+ +H    K  F       N+L+ MY+K G    A+ VF ++ DR+VVS+TS+I+GY + G   EA+ +F+
Subjt:  MQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDPAEALSVFK

Query:  KMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLF
        +M +  +  D   + +V+        L +GK +H  + +  L F+  +  +L  MYAK G ++ A   F++M   +++ WN +I GY+KN Y  EA+ LF
Subjt:  KMRQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLF

Query:  REMI-SKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAIN
          ++  K    D  TV   + A A + + D  R + GYI ++ Y  D  V  +L+DMYAKCG++  A ++FD +  KD+V W+ MI GYG+HG GKEAI 
Subjt:  REMI-SKNIRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAIN

Query:  LYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIR-DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKL
        L+N M+Q G+  ++++FV LL AC +SGLV EGW  F+ +R +  IEP  +HY+C+VD+L R G L +AY FI NMPI P  ++WGALL  C+IH  VKL
Subjt:  LYNAMKQVGVRPNDVTFVGLLTACKNSGLVKEGWDLFHEIR-DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKL

Query:  GEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMES
         E  AE++F L+P NTG+YV ++N+YA A  W  V  +R  + Q+GL K+ G S IEI G +  F  GD S+P ++ I   L ++  R+   GY P  + 
Subjt:  GEIAAEQLFSLDPYNTGHYVQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMES

Query:  VLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW
         L D    E EE LC HSE+LA+A GIIS+  G  +R+T NLR C +CH   K +SKL  REI++RD+ RFH+FKDG CSC  FW
Subjt:  VLHDLNHEEIEETLCNHSERLAVAYGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW


Sequences
CDS sequence
ATGTCTCTGCATTCGTTTTCCCTCTCTCTCTCCTTGTCCTCACTGTCTACAGCTTTCTCAAAGTTGGCGGCAACCTCGCAAGAGGCTTTATTGGGGAGGAAGCAT
TTGGATCAATTATACGTTCAGTTAATTGTGTCTGGATTACACAAGTGTCGTTTCTTGGTAATCAAATTTGTCAATGCATGCTTGCATCTTGGAGACGTTTACTAT
GCGCACAAGGCTTTTCGTGAAGTCTTAGAACCAGATATTTTGTTGTGGAATGCAGTCATAAAGGGCTATACTCAGAATAATATTTTTGTCGGTGCTGTCAAATTG
TATACAGAGATGCAAGTATCAGGGGTCCACCCAGATTGCTTCACATTTTTGTACGTGCTTAAAGCATGCGGTGGAATGTCTATTGAAGAAATAGGTAAACAGATG
CATGGCCAGACATTTAAATATGGCTTTGGATCAAATGTTTTTGTGCAGAACAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAACCTCACCTGCGAGGGTCGTG
TTTGATAAGTTGCAAGATAGAACTGTTGTTTCGTGGACTTCCATCATTTCTGGGTATGTTCAGAATGGTGATCCTGCAGAAGCATTGAGTGTTTTCAAAAAAATG
AGACAAAGTAATGTGAAACTTGATTGGATTGCTCTTGTTAGTGTAATGACAGCATATACAGACATGGAAGATTTGGGACAAGGAAAGTCCATTCATGGCTTAGTG
ACTAAATTGGGTCTAGAATTTGAACCCGATATTGTGGTATCGCTCACTACCATGTATGCAAAACGTGGACAGGTGGAAGTTGCCAGATTTTTCTTTAATCAGATG
GAAAAACCAAATTTAGTTTTGTGGAATGCTATGATTTCTGGTTATGCAAAAAATGGGTATGGTGAAGAAGCAATAAAGCTATTTCGTGAGATGATTTCAAAAAAT
ATCAGGGTAGATTCTGTTACTGTGAGGTCTGCTATTCTAGCTGGTGCCCAAGTGGGGTCCCTTGATCTAGCGAGATGGTTGGATGGTTATATCTCTAAGAGTGAG
TACAGGGACGATACTTTTGTAAACACGGCCCTTATAGATATGTATGCAAAATGTGGAAGCATATATTTTGCTAGTCTTGTTTTTGATAGGATGGTTGATAAAGAT
GTTGTTTTATGGAGTGCAATGATTATGGGGTATGGATTACATGGCCATGGAAAAGAAGCCATCAATCTTTACAATGCAATGAAGCAAGTCGGAGTTCGTCCAAAC
GATGTTACTTTTGTTGGCCTTCTCACAGCTTGCAAAAATTCAGGTTTGGTAAAAGAGGGATGGGATCTTTTCCATGAGATACGGGACCATGGGATTGAGCCACAT
CACCAGCATTACTCTTGTGTGGTCGATCTTCTAGGACGTGCAGGCTACTTGAATCAAGCTTATGACTTTATTATGAATATGCCGATTAAACCTGGAGTCAGTGTT
TGGGGGGCACTTCTGAGTGCGTGCAAGATCCATCGCCAAGTGAAGTTGGGAGAAATTGCTGCAGAACAGCTTTTCTCATTAGATCCATATAATACAGGGCATTAT
GTGCAGCTCTCAAACCTATATGCTTCTGCTCATTTATGGAATCACGTGGCAAACGTACGATTAATGATGACGCAGAAAGGACTGAACAAGGACCTTGGACATAGT
TCTATTGAGATCAATGGGAATCTCGAAACATTCCACGTTGGAGATAGATCACATCCTAGATCAAAGGAAATCTTTGAAGAGCTTGATAGACTGGAAAGGAGACTA
AAAGCGGCTGGTTATATTCCTCATATGGAATCTGTTCTGCACGACTTGAATCATGAGGAGATTGAGGAAACTCTTTGTAATCACAGTGAGAGGCTAGCAGTTGCT
TATGGCATCATCAGTACTGCTCCTGGAACTCCACTTCGAATAACGAACAATCTCCGAGCATGTGTTAATTGCCATTCAGCCATAAAGCTTATATCGAAGCTTGTC
GATAGGGAAATAATTATTCGAGATGCGAAACGTTTTCATCGCTTCAAAGATGGAGCTTGTTCATGTGGAGATTTTTGGTGA
mRNA sequence
ATGTCTCTGCATTCGTTTTCCCTCTCTCTCTCCTTGTCCTCACTGTCTACAGCTTTCTCAAAGTTGGCGGCAACCTCGCAAGAGGCTTTATTGGGGAGGAAGCAT
TTGGATCAATTATACGTTCAGTTAATTGTGTCTGGATTACACAAGTGTCGTTTCTTGGTAATCAAATTTGTCAATGCATGCTTGCATCTTGGAGACGTTTACTAT
GCGCACAAGGCTTTTCGTGAAGTCTTAGAACCAGATATTTTGTTGTGGAATGCAGTCATAAAGGGCTATACTCAGAATAATATTTTTGTCGGTGCTGTCAAATTG
TATACAGAGATGCAAGTATCAGGGGTCCACCCAGATTGCTTCACATTTTTGTACGTGCTTAAAGCATGCGGTGGAATGTCTATTGAAGAAATAGGTAAACAGATG
CATGGCCAGACATTTAAATATGGCTTTGGATCAAATGTTTTTGTGCAGAACAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAACCTCACCTGCGAGGGTCGTG
TTTGATAAGTTGCAAGATAGAACTGTTGTTTCGTGGACTTCCATCATTTCTGGGTATGTTCAGAATGGTGATCCTGCAGAAGCATTGAGTGTTTTCAAAAAAATG
AGACAAAGTAATGTGAAACTTGATTGGATTGCTCTTGTTAGTGTAATGACAGCATATACAGACATGGAAGATTTGGGACAAGGAAAGTCCATTCATGGCTTAGTG
ACTAAATTGGGTCTAGAATTTGAACCCGATATTGTGGTATCGCTCACTACCATGTATGCAAAACGTGGACAGGTGGAAGTTGCCAGATTTTTCTTTAATCAGATG
GAAAAACCAAATTTAGTTTTGTGGAATGCTATGATTTCTGGTTATGCAAAAAATGGGTATGGTGAAGAAGCAATAAAGCTATTTCGTGAGATGATTTCAAAAAAT
ATCAGGGTAGATTCTGTTACTGTGAGGTCTGCTATTCTAGCTGGTGCCCAAGTGGGGTCCCTTGATCTAGCGAGATGGTTGGATGGTTATATCTCTAAGAGTGAG
TACAGGGACGATACTTTTGTAAACACGGCCCTTATAGATATGTATGCAAAATGTGGAAGCATATATTTTGCTAGTCTTGTTTTTGATAGGATGGTTGATAAAGAT
GTTGTTTTATGGAGTGCAATGATTATGGGGTATGGATTACATGGCCATGGAAAAGAAGCCATCAATCTTTACAATGCAATGAAGCAAGTCGGAGTTCGTCCAAAC
GATGTTACTTTTGTTGGCCTTCTCACAGCTTGCAAAAATTCAGGTTTGGTAAAAGAGGGATGGGATCTTTTCCATGAGATACGGGACCATGGGATTGAGCCACAT
CACCAGCATTACTCTTGTGTGGTCGATCTTCTAGGACGTGCAGGCTACTTGAATCAAGCTTATGACTTTATTATGAATATGCCGATTAAACCTGGAGTCAGTGTT
TGGGGGGCACTTCTGAGTGCGTGCAAGATCCATCGCCAAGTGAAGTTGGGAGAAATTGCTGCAGAACAGCTTTTCTCATTAGATCCATATAATACAGGGCATTAT
GTGCAGCTCTCAAACCTATATGCTTCTGCTCATTTATGGAATCACGTGGCAAACGTACGATTAATGATGACGCAGAAAGGACTGAACAAGGACCTTGGACATAGT
TCTATTGAGATCAATGGGAATCTCGAAACATTCCACGTTGGAGATAGATCACATCCTAGATCAAAGGAAATCTTTGAAGAGCTTGATAGACTGGAAAGGAGACTA
AAAGCGGCTGGTTATATTCCTCATATGGAATCTGTTCTGCACGACTTGAATCATGAGGAGATTGAGGAAACTCTTTGTAATCACAGTGAGAGGCTAGCAGTTGCT
TATGGCATCATCAGTACTGCTCCTGGAACTCCACTTCGAATAACGAACAATCTCCGAGCATGTGTTAATTGCCATTCAGCCATAAAGCTTATATCGAAGCTTGTC
GATAGGGAAATAATTATTCGAGATGCGAAACGTTTTCATCGCTTCAAAGATGGAGCTTGTTCATGTGGAGATTTTTGGTGA
Protein sequence
MSLHSFSLSLSLSSLSTAFSKLAATSQEALLGRKHLDQLYVQLIVSGLHKCRFLVIKFVNACLHLGDVYYAHKAFREVLEPDILLWNAVIKGYTQNNIFVGAVKL
YTEMQVSGVHPDCFTFLYVLKACGGMSIEEIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSPARVVFDKLQDRTVVSWTSIISGYVQNGDPAEALSVFKKM
RQSNVKLDWIALVSVMTAYTDMEDLGQGKSIHGLVTKLGLEFEPDIVVSLTTMYAKRGQVEVARFFFNQMEKPNLVLWNAMISGYAKNGYGEEAIKLFREMISKN
IRVDSVTVRSAILAGAQVGSLDLARWLDGYISKSEYRDDTFVNTALIDMYAKCGSIYFASLVFDRMVDKDVVLWSAMIMGYGLHGHGKEAINLYNAMKQVGVRPN
DVTFVGLLTACKNSGLVKEGWDLFHEIRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMNMPIKPGVSVWGALLSACKIHRQVKLGEIAAEQLFSLDPYNTGHY
VQLSNLYASAHLWNHVANVRLMMTQKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLERRLKAAGYIPHMESVLHDLNHEEIEETLCNHSERLAVA
YGIISTAPGTPLRITNNLRACVNCHSAIKLISKLVDREIIIRDAKRFHRFKDGACSCGDFW