CuGenDBv2

CcUC02G031040 (gene) of Watermelon (PI 537277) v1 genome

Gene ID: CcUC02G031040
Organism: Citrullus colocynthis (Watermelon (PI 537277) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: CicolChr02:26588825..26591906
RNA-Seq Expression: CcUC02G031040
Synteny: CcUC02G031040
Gene Ontology terms:
GO:0080156 - mitochondrial mRNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology
GenBank top hits (e value, % identity, alignment)
KAG7030779.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 87.84
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSLSLSS+S+ALSK+A TS E  LRRKHLDQLYVQLIVSGL+KCGFLVIKFVNACLH  DVNYAHK F EVL+PDILLWN I+KGYTQ NI A
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GAIRMY DMQ+S V+P+CFTFLYVLKACGG  V+GIGKQMH QTFKYG GSNVFVQNSLVSMYA+FGQTSSAR+VFDKLH+RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        V+AL+VFK+MR   VK DWI LVSV+TAYTD+EDLGQG++IH LVTKLGLEFEPDIV+SLT MYAK G VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKNI VDSVTVRS ILA AQ  SLELARWLDGY+SKSEYRDD FVNTALIDM+AKCGSI FAR VF+RMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYN MKQ+G+RPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGALLS CKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY
        HRQVRLGEIAAEQLF+LDPYNTG++VQLSNLYASAHLW HV NVRLMMTQKGLNKDLGHSSIE+NGNLE FHVGDRSHPR+KEIFEELDRLE+RLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V   ESVLHDLNDEEIEE+LCNHSERLAVAYG ISTAPGTTLRITKNLRACVNCHSAIKLISKLV+REII+RDAKRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_008445864.1 PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Cucumis melo] | e value: 0.0e+00 | % identity: 90.16
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSL LSSLSSALSKS ITSHE SLRRKHLDQ+YVQLIVSGLHKC +LVIKFVNACLHFGDVNYAHKAFCEV +PDI LWNAI+KGY QKNIV 
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        G IRMYMDMQ+S+VHPNCFTFLYVLKACGGT V+ +GKQ+HG TFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        +EALKVFKEMR CNVKPDWIALVSVMTAYTDVED+GQG+SIHGLVTKLGLEFEPDIVISLTTMYAKRG VEVARFFF++MEKPNLILWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG
        GEEAIKLFREMISKNIRVDS+T+RS ILA AQ+ SLELA WLDGY+SKSEYRDDTFVNTAL+DMYAKCGSIY AR VF+R+ +KDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYNEMKQAG+ PND TF+GLLTACKNSGLVKEGWELFHQM +HGIEPHHQHYSC+VDLLGRAGYLNQAYDFIMSMPIKPGV+VWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY
        HR+VRLGEIAA+QLFILDPYNTG++VQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIE+NG+LE FHVGDRSHPR+KEIFEELDRLEKRLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        VP  ESVLHDLN EEIEE+LCNHSERLAVAYG +STAPGTTLRITKNLRACVNCHSAIK+ISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_011654911.1 pentatricopeptide repeat-containing protein At3g12770 [Cucumis sativus] | e value: 0.0e+00 | % identity: 89.87
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSL LSSLSSALSKS IT HE SLRRKHLDQ+YVQLIVSGLHKC FL+IKF+NACLHFGDVNYAHKAF EV +PDILLWNAI+KGYTQKNIV 
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
          IRMYMDMQ+S+VHPNCFTFLYVLKACGGT V+GIGKQ+HGQTFKYGFGSNVFVQNSLVSMYAKFGQ S ARIVFDKLHDRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        +EAL VFKEMR CNVKPDWIALVSVMTAYT+VEDLGQG+SIHGLVTKLGLEFEPDIVISLTTMYAKRG VEVARFFFN+MEKPNLILWNAMISGYA NGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG
        GEEAIKLFREMI+KNIRVDS+T+RS +LA+AQ+ SLELARWLDGY+SKSEYRDDTFVNT LIDMYAKCGSIY AR VF+R+ DKDVVLWS MIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYNEMKQAG+ PND TF+GLLTACKNSGLVKEGWELFH M DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY
        HR+VRLGEIAAEQLFILDPYNTG++VQLSNLYASAHLWT VANVRLMMTQKGLNKDLGHSSIE+NGNLE F VGDRSHP++KEIFEELDRLEKRLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        VP  ESVLHDLN EEIEE+LC+HSERLAVAYG ISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_022139117.1 pentatricopeptide repeat-containing protein At3g12770 [Momordica charantia] | e value: 0.0e+00 | % identity: 87.99
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSLSLSSLS+A SK A TS E  L RKHLDQLYVQLIVSGLHKC FLVIKFVNACLH GDV YAHKAF EVL+PDILLWNA++KGYTQ NI  
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GA+++Y +MQ+S VHP+CFTFLYVLKACGG  ++ IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTS AR+VFDKL DRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
         EAL VFK+MR  NVK DWIALVSVMTAYTD+EDLGQG+SIHGLVTKLGLEFEPDIV+SLTTMYAKRG VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG
        GEEAIKLFREMISKNIRVDSVTVRS ILA AQ+ SL+LARWLDGY+SKSEYRDDTFVNTALIDMYAKCGSIYFA  VF+RMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HG+EAI+LYN MKQ G+RPNDVTFVGLLTACKNSGLVKEGW+LFH++RDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIM+MPIKPGVSVWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY
        HRQV+LGEIAAEQLF LDPYNTG++VQLSNLYASAHLW HVANVRLMMTQKGLNKDLGHSSIE+NGNLE FHVGDRSHPR+KEIFEELDRLE+RLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        +P  ESVLHDLN EEIEE+LCNHSERLAVAYG ISTAPGT LRIT NLRACVNCHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_038892943.1 pentatricopeptide repeat-containing protein At3g12770 [Benincasa hispida] | e value: 0.0e+00 | % identity: 91.9
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSLSLSSLSSALSKSAITSHE SLRRKHLDQLYVQLIVSGLHKCGFL+IKFVN+CLHFGDVNYAHKAF EV++PDILLWNAI+KGYTQKNI  
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GAIRMYMDMQMS V+PNCFTFLYVLKAC G  V+GIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        VEALK+FKEMR CNVK DWI LVSVMTAYTDVEDLGQG+SIHGLVTKLGLEFEPDIVISLTTMYAK G VE+ARFFFNQMEKPNLILWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG
        GEEAIKLF EMISKNIRVDSVTVRS ILA AQ+ SL+LARWLD Y+S+SEYRDDTFVNT+L+DMYAKCGSIYFAR VF+RMV KDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI+ YNEMKQAG+ PNDVTFVGLLTACKNSGLVKEGWELFHQM+D+GIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMP+KPGVSVWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY
        HR+VRLGEIAAEQLFILDPYN GYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSI++NGNLE FHVGDRSHPR+KEIFEELDRLEKRLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        VP  ESVLHDLN EEIEE+LCNHSERLAVAYG ISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KLB9 DYW_deaminase domain-containing protein | e value: 0.0e+00 | % identity: 89.87
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSL LSSLSSALSKS IT HE SLRRKHLDQ+YVQLIVSGLHKC FL+IKF+NACLHFGDVNYAHKAF EV +PDILLWNAI+KGYTQKNIV 
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
          IRMYMDMQ+S+VHPNCFTFLYVLKACGGT V+GIGKQ+HGQTFKYGFGSNVFVQNSLVSMYAKFGQ S ARIVFDKLHDRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        +EAL VFKEMR CNVKPDWIALVSVMTAYT+VEDLGQG+SIHGLVTKLGLEFEPDIVISLTTMYAKRG VEVARFFFN+MEKPNLILWNAMISGYA NGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG
        GEEAIKLFREMI+KNIRVDS+T+RS +LA+AQ+ SLELARWLDGY+SKSEYRDDTFVNT LIDMYAKCGSIY AR VF+R+ DKDVVLWS MIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYNEMKQAG+ PND TF+GLLTACKNSGLVKEGWELFH M DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY
        HR+VRLGEIAAEQLFILDPYNTG++VQLSNLYASAHLWT VANVRLMMTQKGLNKDLGHSSIE+NGNLE F VGDRSHP++KEIFEELDRLEKRLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        VP  ESVLHDLN EEIEE+LC+HSERLAVAYG ISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A1S3BEJ1 pentatricopeptide repeat-containing protein At3g12770 | e value: 0.0e+00 | % identity: 90.16
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSL LSSLSSALSKS ITSHE SLRRKHLDQ+YVQLIVSGLHKC +LVIKFVNACLHFGDVNYAHKAFCEV +PDI LWNAI+KGY QKNIV 
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        G IRMYMDMQ+S+VHPNCFTFLYVLKACGGT V+ +GKQ+HG TFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        +EALKVFKEMR CNVKPDWIALVSVMTAYTDVED+GQG+SIHGLVTKLGLEFEPDIVISLTTMYAKRG VEVARFFF++MEKPNLILWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG
        GEEAIKLFREMISKNIRVDS+T+RS ILA AQ+ SLELA WLDGY+SKSEYRDDTFVNTAL+DMYAKCGSIY AR VF+R+ +KDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYNEMKQAG+ PND TF+GLLTACKNSGLVKEGWELFHQM +HGIEPHHQHYSC+VDLLGRAGYLNQAYDFIMSMPIKPGV+VWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY
        HR+VRLGEIAA+QLFILDPYNTG++VQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIE+NG+LE FHVGDRSHPR+KEIFEELDRLEKRLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        VP  ESVLHDLN EEIEE+LCNHSERLAVAYG +STAPGTTLRITKNLRACVNCHSAIK+ISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A5A7SVV3 Pentatricopeptide repeat-containing protein | e value: 0.0e+00 | % identity: 90.12
Query:  KHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLV
        KHLDQ+YVQLIVSGLHKC +LVIKFVNACLHFGDVNYAHKAFCEV +PDI LWNAI+KGY QKNIV G IRMYMDMQ+S+VHPNCFTFLYVLKACGGT V
Subjt:  KHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLV

Query:  DGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYTDVE
        + +GKQ+HG TFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP+EALKVFKEMR CNVKPDWIALVSVMTAYTDVE
Subjt:  DGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYTDVE

Query:  DLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQI
        D+GQG+SIHGLVTKLGLEFEPDIVISLTTMYAKRG VEVARFFF++MEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDS+T+RS ILA AQ+
Subjt:  DLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQI

Query:  RSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKN
         SLELA WLDGY+SKSEYRDDTFVNTAL+DMYAKCGSIY AR VF+R+ +KDVVLWSAMIMGYGLHGHGQEAI LYNEMKQAG+ PND TF+GLLTACKN
Subjt:  RSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKN

Query:  SGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYA
        SGLVKEGWELFHQM +HGIEPHHQHYSC+VDLLGRAGYLNQAYDFIMSMPIKPGV+VWGALLSACKIHR+VRLGEIAA+QLFILDPYNTG++VQLSNLYA
Subjt:  SGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYA

Query:  SAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGS
        SAHLWTHVANVRLMMTQKGLNKDLGHSSIE+NG+LE FHVGDRSHPR+KEIFEELDRLEKRLK AGYVP  ESVLHDLN EEIEE+LCNHSERLAVAYG 
Subjt:  SAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGS

Query:  ISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        +STAPGTTLRITKNLRACVNCHSAIK+ISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
Subjt:  ISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1CD43 pentatricopeptide repeat-containing protein At3g12770 | e value: 0.0e+00 | % identity: 87.99
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSLSLSSLS+A SK A TS E  L RKHLDQLYVQLIVSGLHKC FLVIKFVNACLH GDV YAHKAF EVL+PDILLWNA++KGYTQ NI  
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GA+++Y +MQ+S VHP+CFTFLYVLKACGG  ++ IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTS AR+VFDKL DRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
         EAL VFK+MR  NVK DWIALVSVMTAYTD+EDLGQG+SIHGLVTKLGLEFEPDIV+SLTTMYAKRG VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG
        GEEAIKLFREMISKNIRVDSVTVRS ILA AQ+ SL+LARWLDGY+SKSEYRDDTFVNTALIDMYAKCGSIYFA  VF+RMVDKDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HG+EAI+LYN MKQ G+RPNDVTFVGLLTACKNSGLVKEGW+LFH++RDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIM+MPIKPGVSVWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY
        HRQV+LGEIAAEQLF LDPYNTG++VQLSNLYASAHLW HVANVRLMMTQKGLNKDLGHSSIE+NGNLE FHVGDRSHPR+KEIFEELDRLE+RLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        +P  ESVLHDLN EEIEE+LCNHSERLAVAYG ISTAPGT LRIT NLRACVNCHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1JH82 pentatricopeptide repeat-containing protein At3g12770 isoform X2 | e value: 0.0e+00 | % identity: 87.55
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSLSL+SLS+ALSK+A TS E  LRRKHLDQLYVQLIVSGL+KCGFLVIKFVNACLH  DVNYAHK F EVL+PDILLWN I+KGYTQ NI A
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GAIRMY DMQ+S V+P+CFTFLYVLKACGG  V+GIGKQMH QTFKYGFGSNVFVQNSLVSMYA++GQTSSAR+VFDKLH+RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        ++AL+VFK+MR   VK DWI LVSVMTAYTD+EDLGQG++IH LVTKLGLEFEPDIV+SLT MYAK G VE+ARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKNI VDSVTVRS ILA AQ+ SLELARWLDGY+SKSEYRDD FVNTALIDM+AKCGSI FARSVF+RMVDKD+V WSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYN MKQ+GIRPNDVTFVGLLTACKNSGLVKEGWELFHQM+DHGIEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGALLS CKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY
        HRQVRLGEIAAEQLF+LDPYNTG++VQLSNLYASAHLW  VANVRLMMTQKGLNKDLGHSSIE+NGNLE FHVGDRSHPR+KEIFEELDRLE+RLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V   ESVLHDLN EEIEE+LCNHSERLAVAYG ISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRD KRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

SwissProt top hits (e value, % identity, alignment)
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic | e value: 1.3e-147 | % identity: 39.57
Query:  VSLRRKHLDQLYVQLIVSGLHKCGFLVIK-FVNACL-HFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDM-QMSRVHPNCFTFLYVL
        VSLR+  L Q +  +I +G     +   K F  A L  F  + YA K F E+  P+   WN +++ Y        +I  ++DM   S+ +PN +TF +++
Subjt:  VSLRRKHLDQLYVQLIVSGLHKCGFLVIK-FVNACL-HFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDM-QMSRVHPNCFTFLYVL

Query:  KACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSV
        KA        +G+ +HG   K   GS+VFV NSL+  Y   G   SA  VF  + ++ VVSW S+I+G+VQ G P +AL++FK+M   +VK   + +V V
Subjt:  KACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSV

Query:  MTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFF-------------------------------NQMEKPNLILWNAMISG
        ++A   + +L  GR +   + +  +     +  ++  MY K G +E A+  F                               N M + +++ WNA+IS 
Subjt:  MTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFF-------------------------------NQMEKPNLILWNAMISG

Query:  YAKNGYGEEAIKLFREM-ISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMI
        Y +NG   EA+ +F E+ + KN++++ +T+ ST+ A AQ+ +LEL RW+  Y+ K   R +  V +ALI MY+KCG +  +R VFN +  +DV +WSAMI
Subjt:  YAKNGYGEEAIKLFREM-ISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMI

Query:  MGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMR-DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWG
         G  +HG G EA+ ++ +M++A ++PN VTF  +  AC ++GLV E   LFHQM  ++GI P  +HY+C+VD+LGR+GYL +A  FI +MPI P  SVWG
Subjt:  MGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMR-DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWG

Query:  ALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLE
        ALL ACKIH  + L E+A  +L  L+P N G HV LSN+YA    W +V+ +R  M   GL K+ G SSIE++G +  F  GD +HP +++++ +L  + 
Subjt:  ALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLE

Query:  KRLKEAGYVPDNESVLHDLNDEEI-EESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        ++LK  GY P+   VL  + +EE+ E+SL  HSE+LA+ YG IST     +R+ KNLR C +CHS  KLIS+L DREII+RD  RFHHF++G CSC DFW
Subjt:  KRLKEAGYVPDNESVLHDLNDEEI-EESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic | e value: 1.4e-157 | % identity: 42.44
Query:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIG
        +++  L+ SG     F +    N       VN A K F  + + D++ WN IV GY+Q  +   A+ M   M    + P+  T + VL A     +  +G
Subjt:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIG

Query:  KQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQ
        K++HG   + GF S V +  +LV MYAK G   +AR +FD + +R VVSW S+I  YVQN +P EA+ +F++M    VKP  ++++  + A  D+ DL +
Subjt:  KQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQ

Query:  GRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLE
        GR IH L  +LGL+    +V SL +MY K   V+ A   F +++   L+ WNAMI G+A+NG   +A+  F +M S+ ++ D+ T  S I A A++    
Subjt:  GRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLE

Query:  LARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLV
         A+W+ G V +S    + FV TAL+DMYAKCG+I  AR +F+ M ++ V  W+AMI GYG HG G+ A+ L+ EM++  I+PN VTF+ +++AC +SGLV
Subjt:  LARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLV

Query:  KEGWELFHQMRD-HGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAH
        + G + F+ M++ + IE    HY  +VDLLGRAG LN+A+DFIM MP+KP V+V+GA+L AC+IH+ V   E AAE+LF L+P + GYHV L+N+Y +A 
Subjt:  KEGWELFHQMRD-HGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAH

Query:  LWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGSIST
        +W  V  VR+ M ++GL K  G S +E+   +  F  G  +HP +K+I+  L++L   +KEAGYVPD   VL   ND + E+ L  HSE+LA+++G ++T
Subjt:  LWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGSIST

Query:  APGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
          GTT+ + KNLR C +CH+A K IS +  REI++RD +RFHHFK+G CSCGD+W
Subjt:  APGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g12770 | e value: 6.8e-242 | % identity: 59.15
Query:  RKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTL
        +  L Q++ +L+V GL   GFL+ K ++A   FGD+ +A + F ++  P I  WNAI++GY++ N    A+ MY +MQ++RV P+ FTF ++LKAC G  
Subjt:  RKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTL

Query:  VDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYT
           +G+ +H Q F+ GF ++VFVQN L+++YAK  +  SAR VF+   L +RT+VSWT+I+S Y QNG+P+EAL++F +MR  +VKPDW+ALVSV+ A+T
Subjt:  VDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYT

Query:  DVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAA
         ++DL QGRSIH  V K+GLE EPD++ISL TMYAK G V  A+  F++M+ PNLILWNAMISGYAKNGY  EAI +F EMI+K++R D++++ S I A 
Subjt:  DVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAA

Query:  AQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTA
        AQ+ SLE AR +  YV +S+YRDD F+++ALIDM+AKCGS+  AR VF+R +D+DVV+WSAMI+GYGLHG  +EAISLY  M++ G+ PNDVTF+GLL A
Subjt:  AQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTA

Query:  CKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSN
        C +SG+V+EGW  F++M DH I P  QHY+CV+DLLGRAG+L+QAY+ I  MP++PGV+VWGALLSACK HR V LGE AA+QLF +DP NTG++VQLSN
Subjt:  CKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSN

Query:  LYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVA
        LYA+A LW  VA VR+ M +KGLNKD+G S +EV G LE F VGD+SHPR +EI  +++ +E RLKE G+V + ++ LHDLNDEE EE+LC+HSER+A+A
Subjt:  LYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVA

Query:  YGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        YG IST  GT LRITKNLRACVNCH+A KLISKLVDREI++RD  RFHHFKDGVCSCGD+W
Subjt:  YGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Q9LW32 Pentatricopeptide repeat-containing protein At3g26782, mitochondrial | e value: 2.6e-148 | % identity: 41.77
Query:  DILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHD
        D+  WN+++    +    A A+  +  M+   ++P   +F   +KAC        GKQ H Q F +G+ S++FV ++L+ MY+  G+   AR VFD++  
Subjt:  DILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHD

Query:  RTVVSWTSIISGYVQNGDPVEALKVFKEM------RHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAK--RGWVEVA
        R +VSWTS+I GY  NG+ ++A+ +FK++          +  D + LVSV++A + V   G   SIH  V K G +    +  +L   YAK   G V VA
Subjt:  RTVVSWTSIISGYVQNGDPVEALKVFKEM------RHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAK--RGWVEVA

Query:  RFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIY
        R  F+Q+   + + +N+++S YA++G   EA ++FR ++ +K +  +++T+ + +LA +   +L + + +   V +    DD  V T++IDMY KCG + 
Subjt:  RFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIY

Query:  FARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH-GIEPHHQHYSCVVDLLGRAGY
         AR  F+RM +K+V  W+AMI GYG+HGH  +A+ L+  M  +G+RPN +TFV +L AC ++GL  EGW  F+ M+   G+EP  +HY C+VDLLGRAG+
Subjt:  FARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH-GIEPHHQHYSCVVDLLGRAGY

Query:  LNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMF
        L +AYD I  M +KP   +W +LL+AC+IH+ V L EI+  +LF LD  N GY++ LS++YA A  W  V  VR++M  +GL K  G S +E+NG + +F
Subjt:  LNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMF

Query:  HVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIII
         +GD  HP+ ++I+E L  L ++L EAGYV +  SV HD+++EE E +L  HSE+LA+A+G ++T PG+T+ + KNLR C +CH+ IKLISK+VDRE ++
Subjt:  HVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIII

Query:  RDAKRFHHFKDGVCSCGDFW
        RDAKRFHHFKDG CSCGD+W
Subjt:  RDAKRFHHFKDGVCSCGDFW

Q9SUH6 Pentatricopeptide repeat-containing protein At4g30700 (e value: 1.3e-147; identity: 37.2%)
Query:  HLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSR-VHPNCFTFLYVLKACGGTLV
        HL Q + Q+I+ G      L+ K        G + YA   F  V  PD+ L+N +++G++       ++ ++  ++ S  + PN  T+ + + A  G   
Subjt:  HLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSR-VHPNCFTFLYVLKACGGTLV

Query:  DGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM-RHCNVKPDWIALVSVMTAYTDV
        D  G+ +HGQ    G  S + + +++V MY KF +   AR VFD++ ++  + W ++ISGY +N   VE+++VF+++      + D   L+ ++ A  ++
Subjt:  DGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM-RHCNVKPDWIALVSVMTAYTDV

Query:  EDLGQGRSIHGLVTKLGL----------------------------EF-EPDIV----------------------------------------------
        ++L  G  IH L TK G                             EF +PDIV                                              
Subjt:  EDLGQGRSIHGLVTKLGL----------------------------EF-EPDIV----------------------------------------------

Query:  -----------------------ISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQIR
                                +LTT+Y+K   +E AR  F++  + +L  WNAMISGY +NG  E+AI LFREM       + VT+   + A AQ+ 
Subjt:  -----------------------ISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQIR

Query:  SLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNS
        +L L +W+   V  +++    +V+TALI MYAKCGSI  AR +F+ M  K+ V W+ MI GYGLHG GQEA++++ EM  +GI P  VTF+ +L AC ++
Subjt:  SLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNS

Query:  GLVKEGWELFHQM-RDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYA
        GLVKEG E+F+ M   +G EP  +HY+C+VD+LGRAG+L +A  FI +M I+PG SVW  LL AC+IH+   L    +E+LF LDP N GYHV LSN+++
Subjt:  GLVKEGWELFHQM-RDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYA

Query:  SAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGS
        +   +   A VR    ++ L K  G++ IE+     +F  GD+SHP+ KEI+E+L++LE +++EAGY P+ E  LHD+ +EE E  +  HSERLA+A+G 
Subjt:  SAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGS

Query:  ISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        I+T PGT +RI KNLR C++CH+  KLISK+ +R I++RDA RFHHFKDGVCSCGD+W
Subjt:  ISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Arabidopsis top hits (e value, % identity, alignment)
AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 9.6e-159; identity: 42.44%)
Query:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIG
        +++  L+ SG     F +    N       VN A K F  + + D++ WN IV GY+Q  +   A+ M   M    + P+  T + VL A     +  +G
Subjt:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIG

Query:  KQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQ
        K++HG   + GF S V +  +LV MYAK G   +AR +FD + +R VVSW S+I  YVQN +P EA+ +F++M    VKP  ++++  + A  D+ DL +
Subjt:  KQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQ

Query:  GRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLE
        GR IH L  +LGL+    +V SL +MY K   V+ A   F +++   L+ WNAMI G+A+NG   +A+  F +M S+ ++ D+ T  S I A A++    
Subjt:  GRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLE

Query:  LARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLV
         A+W+ G V +S    + FV TAL+DMYAKCG+I  AR +F+ M ++ V  W+AMI GYG HG G+ A+ L+ EM++  I+PN VTF+ +++AC +SGLV
Subjt:  LARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLV

Query:  KEGWELFHQMRD-HGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAH
        + G + F+ M++ + IE    HY  +VDLLGRAG LN+A+DFIM MP+KP V+V+GA+L AC+IH+ V   E AAE+LF L+P + GYHV L+N+Y +A 
Subjt:  KEGWELFHQMRD-HGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAH

Query:  LWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGSIST
        +W  V  VR+ M ++GL K  G S +E+   +  F  G  +HP +K+I+  L++L   +KEAGYVPD   VL   ND + E+ L  HSE+LA+++G ++T
Subjt:  LWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGSIST

Query:  APGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
          GTT+ + KNLR C +CH+A K IS +  REI++RD +RFHHFK+G CSCGD+W
Subjt:  APGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 9.0e-149; identity: 39.57%)
Query:  VSLRRKHLDQLYVQLIVSGLHKCGFLVIK-FVNACL-HFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDM-QMSRVHPNCFTFLYVL
        VSLR+  L Q +  +I +G     +   K F  A L  F  + YA K F E+  P+   WN +++ Y        +I  ++DM   S+ +PN +TF +++
Subjt:  VSLRRKHLDQLYVQLIVSGLHKCGFLVIK-FVNACL-HFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDM-QMSRVHPNCFTFLYVL

Query:  KACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSV
        KA        +G+ +HG   K   GS+VFV NSL+  Y   G   SA  VF  + ++ VVSW S+I+G+VQ G P +AL++FK+M   +VK   + +V V
Subjt:  KACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSV

Query:  MTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFF-------------------------------NQMEKPNLILWNAMISG
        ++A   + +L  GR +   + +  +     +  ++  MY K G +E A+  F                               N M + +++ WNA+IS 
Subjt:  MTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFF-------------------------------NQMEKPNLILWNAMISG

Query:  YAKNGYGEEAIKLFREM-ISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMI
        Y +NG   EA+ +F E+ + KN++++ +T+ ST+ A AQ+ +LEL RW+  Y+ K   R +  V +ALI MY+KCG +  +R VFN +  +DV +WSAMI
Subjt:  YAKNGYGEEAIKLFREM-ISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMI

Query:  MGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMR-DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWG
         G  +HG G EA+ ++ +M++A ++PN VTF  +  AC ++GLV E   LFHQM  ++GI P  +HY+C+VD+LGR+GYL +A  FI +MPI P  SVWG
Subjt:  MGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMR-DHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWG

Query:  ALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLE
        ALL ACKIH  + L E+A  +L  L+P N G HV LSN+YA    W +V+ +R  M   GL K+ G SSIE++G +  F  GD +HP +++++ +L  + 
Subjt:  ALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLE

Query:  KRLKEAGYVPDNESVLHDLNDEEI-EESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        ++LK  GY P+   VL  + +EE+ E+SL  HSE+LA+ YG IST     +R+ KNLR C +CHS  KLIS+L DREII+RD  RFHHF++G CSC DFW
Subjt:  KRLKEAGYVPDNESVLHDLNDEEI-EESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

AT3G12770.1 mitochondrial editing factor 22 (e value: 4.8e-243; identity: 59.15%)
Query:  RKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTL
        +  L Q++ +L+V GL   GFL+ K ++A   FGD+ +A + F ++  P I  WNAI++GY++ N    A+ MY +MQ++RV P+ FTF ++LKAC G  
Subjt:  RKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTL

Query:  VDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYT
           +G+ +H Q F+ GF ++VFVQN L+++YAK  +  SAR VF+   L +RT+VSWT+I+S Y QNG+P+EAL++F +MR  +VKPDW+ALVSV+ A+T
Subjt:  VDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYT

Query:  DVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAA
         ++DL QGRSIH  V K+GLE EPD++ISL TMYAK G V  A+  F++M+ PNLILWNAMISGYAKNGY  EAI +F EMI+K++R D++++ S I A 
Subjt:  DVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAA

Query:  AQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTA
        AQ+ SLE AR +  YV +S+YRDD F+++ALIDM+AKCGS+  AR VF+R +D+DVV+WSAMI+GYGLHG  +EAISLY  M++ G+ PNDVTF+GLL A
Subjt:  AQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTA

Query:  CKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSN
        C +SG+V+EGW  F++M DH I P  QHY+CV+DLLGRAG+L+QAY+ I  MP++PGV+VWGALLSACK HR V LGE AA+QLF +DP NTG++VQLSN
Subjt:  CKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSN

Query:  LYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVA
        LYA+A LW  VA VR+ M +KGLNKD+G S +EV G LE F VGD+SHPR +EI  +++ +E RLKE G+V + ++ LHDLNDEE EE+LC+HSER+A+A
Subjt:  LYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVA

Query:  YGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        YG IST  GT LRITKNLRACVNCH+A KLISKLVDREI++RD  RFHHFKDGVCSCGD+W
Subjt:  YGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

AT3G26782.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.8e-149; identity: 41.77%)
Query:  DILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHD
        D+  WN+++    +    A A+  +  M+   ++P   +F   +KAC        GKQ H Q F +G+ S++FV ++L+ MY+  G+   AR VFD++  
Subjt:  DILLWNAIVKGYTQKNIVAGAIRMYMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHD

Query:  RTVVSWTSIISGYVQNGDPVEALKVFKEM------RHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAK--RGWVEVA
        R +VSWTS+I GY  NG+ ++A+ +FK++          +  D + LVSV++A + V   G   SIH  V K G +    +  +L   YAK   G V VA
Subjt:  RTVVSWTSIISGYVQNGDPVEALKVFKEM------RHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAK--RGWVEVA

Query:  RFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIY
        R  F+Q+   + + +N+++S YA++G   EA ++FR ++ +K +  +++T+ + +LA +   +L + + +   V +    DD  V T++IDMY KCG + 
Subjt:  RFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIY

Query:  FARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH-GIEPHHQHYSCVVDLLGRAGY
         AR  F+RM +K+V  W+AMI GYG+HGH  +A+ L+  M  +G+RPN +TFV +L AC ++GL  EGW  F+ M+   G+EP  +HY C+VDLLGRAG+
Subjt:  FARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH-GIEPHHQHYSCVVDLLGRAGY

Query:  LNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMF
        L +AYD I  M +KP   +W +LL+AC+IH+ V L EI+  +LF LD  N GY++ LS++YA A  W  V  VR++M  +GL K  G S +E+NG + +F
Subjt:  LNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMF

Query:  HVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIII
         +GD  HP+ ++I+E L  L ++L EAGYV +  SV HD+++EE E +L  HSE+LA+A+G ++T PG+T+ + KNLR C +CH+ IKLISK+VDRE ++
Subjt:  HVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIII

Query:  RDAKRFHHFKDGVCSCGDFW
        RDAKRFHHFKDG CSCGD+W
Subjt:  RDAKRFHHFKDGVCSCGDFW

AT4G30700.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 9.0e-149; identity: 37.2%)
Query:  HLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSR-VHPNCFTFLYVLKACGGTLV
        HL Q + Q+I+ G      L+ K        G + YA   F  V  PD+ L+N +++G++       ++ ++  ++ S  + PN  T+ + + A  G   
Subjt:  HLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRMYMDMQMSR-VHPNCFTFLYVLKACGGTLV

Query:  DGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM-RHCNVKPDWIALVSVMTAYTDV
        D  G+ +HGQ    G  S + + +++V MY KF +   AR VFD++ ++  + W ++ISGY +N   VE+++VF+++      + D   L+ ++ A  ++
Subjt:  DGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM-RHCNVKPDWIALVSVMTAYTDV

Query:  EDLGQGRSIHGLVTKLGL----------------------------EF-EPDIV----------------------------------------------
        ++L  G  IH L TK G                             EF +PDIV                                              
Subjt:  EDLGQGRSIHGLVTKLGL----------------------------EF-EPDIV----------------------------------------------

Query:  -----------------------ISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQIR
                                +LTT+Y+K   +E AR  F++  + +L  WNAMISGY +NG  E+AI LFREM       + VT+   + A AQ+ 
Subjt:  -----------------------ISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQIR

Query:  SLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNS
        +L L +W+   V  +++    +V+TALI MYAKCGSI  AR +F+ M  K+ V W+ MI GYGLHG GQEA++++ EM  +GI P  VTF+ +L AC ++
Subjt:  SLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNS

Query:  GLVKEGWELFHQM-RDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYA
        GLVKEG E+F+ M   +G EP  +HY+C+VD+LGRAG+L +A  FI +M I+PG SVW  LL AC+IH+   L    +E+LF LDP N GYHV LSN+++
Subjt:  GLVKEGWELFHQM-RDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYA

Query:  SAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGS
        +   +   A VR    ++ L K  G++ IE+     +F  GD+SHP+ KEI+E+L++LE +++EAGY P+ E  LHD+ +EE E  +  HSERLA+A+G 
Subjt:  SAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGS

Query:  ISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        I+T PGT +RI KNLR C++CH+  KLISK+ +R I++RDA RFHHFKDGVCSCGD+W
Subjt:  ISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW


Sequences
CDS sequence
ATGTCTTTGCATTCTTTTTCGCTTTCTCTCTCCTTGTCGTCACTCTCATCAGCTCTCTCAAAGTCGGCGATAACCTCGCATGAGGTTTCATTAAGGAGGAAGCAT
TTGGATCAATTATACGTCCAGTTAATTGTGTCTGGACTACACAAGTGTGGTTTCCTGGTGATCAAATTTGTCAATGCTTGTTTGCATTTCGGAGATGTTAACTAC
GCACACAAGGCTTTTTGCGAAGTCTTAGACCCGGATATTTTGTTGTGGAATGCCATTGTAAAGGGTTACACTCAGAAGAATATTGTTGCTGGTGCTATCAGAATG
TATATGGATATGCAAATGTCACGGGTGCACCCGAATTGCTTCACATTTTTGTATGTGCTTAAAGCATGCGGTGGAACATTAGTTGACGGAATAGGTAAACAGATG
CATGGCCAGACATTTAAATATGGCTTTGGATCAAATGTTTTTGTGCAGAATAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAACCTCGTCTGCTAGGATCGTG
TTTGATAAGCTGCATGATAGGACAGTTGTTTCATGGACTTCCATCATTTCTGGGTATGTTCAGAATGGTGATCCCGTGGAAGCATTGAAAGTTTTCAAAGAAATG
AGACATTGTAATGTAAAGCCTGATTGGATTGCCCTTGTTAGCGTCATGACAGCATATACGGATGTGGAAGATTTGGGACAAGGAAGGTCCATTCATGGTTTAGTA
ACTAAATTGGGTCTAGAATTTGAACCCGATATAGTGATATCACTCACTACTATGTATGCAAAACGTGGATGGGTAGAAGTTGCCAGATTTTTCTTTAATCAGATG
GAAAAACCGAATTTAATATTGTGGAATGCTATGATTTCTGGCTATGCAAAAAATGGATATGGTGAAGAAGCAATCAAACTATTCCGCGAGATGATTTCCAAAAAT
ATCAGGGTTGATTCTGTTACTGTGAGGTCTACTATTCTAGCCGCTGCCCAAATCAGGTCTCTTGAACTAGCAAGATGGTTGGATGGTTATGTCTCTAAGAGTGAG
TACAGAGATGATACTTTTGTAAACACGGCCCTTATAGATATGTATGCAAAATGCGGAAGCATATATTTTGCTCGTAGTGTTTTCAATAGAATGGTCGATAAAGAC
GTTGTCTTATGGAGTGCAATGATTATGGGGTATGGATTACATGGTCATGGACAAGAAGCCATCAGCCTTTACAATGAAATGAAGCAAGCTGGAATTCGTCCAAAC
GATGTTACTTTTGTTGGTCTTCTCACAGCATGCAAAAATTCAGGTCTTGTAAAAGAGGGATGGGAGCTTTTCCACCAGATGCGAGACCACGGGATTGAACCACAT
CACCAGCATTACTCTTGCGTGGTCGATCTTCTAGGACGTGCAGGCTATTTGAATCAAGCTTATGATTTCATTATGAGCATGCCAATTAAACCTGGAGTTAGTGTT
TGGGGGGCTCTTCTGAGTGCTTGCAAGATCCATCGCCAAGTAAGGTTGGGAGAAATTGCTGCAGAACAACTTTTCATATTAGATCCATATAATACAGGGTATCAT
GTGCAACTCTCAAACCTCTATGCTTCTGCCCATTTGTGGACTCACGTGGCTAACGTTCGATTAATGATGACACAAAAAGGACTGAACAAGGACCTCGGACATAGT
TCTATTGAGGTGAACGGAAATCTCGAAATGTTTCATGTTGGAGATAGATCACATCCCAGAGCAAAGGAAATTTTTGAAGAGCTTGATAGATTAGAGAAAAGATTA
AAAGAAGCCGGTTATGTTCCTGATAATGAATCTGTTCTACATGACTTGAATGATGAGGAGATTGAGGAAAGTCTTTGTAACCACAGTGAGAGGCTAGCAGTTGCT
TATGGTAGCATTAGTACTGCTCCTGGAACTACACTTAGAATAACCAAGAATCTCCGAGCATGTGTTAATTGCCATTCAGCGATAAAGCTTATATCAAAGCTTGTC
GATAGGGAAATAATTATTCGAGACGCAAAACGTTTTCATCATTTCAAAGATGGAGTTTGTTCATGTGGAGATTTCTGGTGA
mRNA sequence
ATGTCTTTGCATTCTTTTTCGCTTTCTCTCTCCTTGTCGTCACTCTCATCAGCTCTCTCAAAGTCGGCGATAACCTCGCATGAGGTTTCATTAAGGAGGAAGCAT
TTGGATCAATTATACGTCCAGTTAATTGTGTCTGGACTACACAAGTGTGGTTTCCTGGTGATCAAATTTGTCAATGCTTGTTTGCATTTCGGAGATGTTAACTAC
GCACACAAGGCTTTTTGCGAAGTCTTAGACCCGGATATTTTGTTGTGGAATGCCATTGTAAAGGGTTACACTCAGAAGAATATTGTTGCTGGTGCTATCAGAATG
TATATGGATATGCAAATGTCACGGGTGCACCCGAATTGCTTCACATTTTTGTATGTGCTTAAAGCATGCGGTGGAACATTAGTTGACGGAATAGGTAAACAGATG
CATGGCCAGACATTTAAATATGGCTTTGGATCAAATGTTTTTGTGCAGAATAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAACCTCGTCTGCTAGGATCGTG
TTTGATAAGCTGCATGATAGGACAGTTGTTTCATGGACTTCCATCATTTCTGGGTATGTTCAGAATGGTGATCCCGTGGAAGCATTGAAAGTTTTCAAAGAAATG
AGACATTGTAATGTAAAGCCTGATTGGATTGCCCTTGTTAGCGTCATGACAGCATATACGGATGTGGAAGATTTGGGACAAGGAAGGTCCATTCATGGTTTAGTA
ACTAAATTGGGTCTAGAATTTGAACCCGATATAGTGATATCACTCACTACTATGTATGCAAAACGTGGATGGGTAGAAGTTGCCAGATTTTTCTTTAATCAGATG
GAAAAACCGAATTTAATATTGTGGAATGCTATGATTTCTGGCTATGCAAAAAATGGATATGGTGAAGAAGCAATCAAACTATTCCGCGAGATGATTTCCAAAAAT
ATCAGGGTTGATTCTGTTACTGTGAGGTCTACTATTCTAGCCGCTGCCCAAATCAGGTCTCTTGAACTAGCAAGATGGTTGGATGGTTATGTCTCTAAGAGTGAG
TACAGAGATGATACTTTTGTAAACACGGCCCTTATAGATATGTATGCAAAATGCGGAAGCATATATTTTGCTCGTAGTGTTTTCAATAGAATGGTCGATAAAGAC
GTTGTCTTATGGAGTGCAATGATTATGGGGTATGGATTACATGGTCATGGACAAGAAGCCATCAGCCTTTACAATGAAATGAAGCAAGCTGGAATTCGTCCAAAC
GATGTTACTTTTGTTGGTCTTCTCACAGCATGCAAAAATTCAGGTCTTGTAAAAGAGGGATGGGAGCTTTTCCACCAGATGCGAGACCACGGGATTGAACCACAT
CACCAGCATTACTCTTGCGTGGTCGATCTTCTAGGACGTGCAGGCTATTTGAATCAAGCTTATGATTTCATTATGAGCATGCCAATTAAACCTGGAGTTAGTGTT
TGGGGGGCTCTTCTGAGTGCTTGCAAGATCCATCGCCAAGTAAGGTTGGGAGAAATTGCTGCAGAACAACTTTTCATATTAGATCCATATAATACAGGGTATCAT
GTGCAACTCTCAAACCTCTATGCTTCTGCCCATTTGTGGACTCACGTGGCTAACGTTCGATTAATGATGACACAAAAAGGACTGAACAAGGACCTCGGACATAGT
TCTATTGAGGTGAACGGAAATCTCGAAATGTTTCATGTTGGAGATAGATCACATCCCAGAGCAAAGGAAATTTTTGAAGAGCTTGATAGATTAGAGAAAAGATTA
AAAGAAGCCGGTTATGTTCCTGATAATGAATCTGTTCTACATGACTTGAATGATGAGGAGATTGAGGAAAGTCTTTGTAACCACAGTGAGAGGCTAGCAGTTGCT
TATGGTAGCATTAGTACTGCTCCTGGAACTACACTTAGAATAACCAAGAATCTCCGAGCATGTGTTAATTGCCATTCAGCGATAAAGCTTATATCAAAGCTTGTC
GATAGGGAAATAATTATTCGAGACGCAAAACGTTTTCATCATTTCAAAGATGGAGTTTGTTCATGTGGAGATTTCTGGTGA
Protein sequence
MSLHSFSLSLSLSSLSSALSKSAITSHEVSLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFCEVLDPDILLWNAIVKGYTQKNIVAGAIRM
YMDMQMSRVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM
RHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKN
IRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVDKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPN
DVTFVGLLTACKNSGLVKEGWELFHQMRDHGIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYH
VQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRAKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVA
YGSISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW