CuGenDBv2

Clc02G14800 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc02G14800
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: ClcChr02:27321195..27327343
RNA-Seq Expression: Clc02G14800
Synteny: Clc02G14800
Gene Ontology terms:
GO:0080156 - mitochondrial mRNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology
GenBank top hits (e value, % identity, alignment)
KAG7030779.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma]; e value: 0.0e+00; % identity: 88.13
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSLSLSS+S+ALSK+A TS EA LRRKHLDQLYVQLIVSGL+KCGFLVIKFVNACLH  DVNYAHK FREVLEPDILLWN I+KGYTQ NI A
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GAIRMY D+Q+S V+P+CFTFLYVLKACGG  V+GIGKQMH QTFKYG GSNVFVQNSLVSMYA+FGQTSSAR+VFDKLH+RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        V+AL+VFK+MR   VK DWI LVSV+TAYTD+EDLGQG++IH LVTKLGLEFEPDIV+SLT MYAK G VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKNI VDSVTVRS ILA AQ  SLELARWLDGY+SKSEYRDD FVNTALIDM+AKCGSI FAR VF+RMV KDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYN MKQ+G+RPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH IEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGALLS CKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY
        HRQVRLGEIAAEQLF+LDPYNTG++VQLSNLYASAHLW HV NVRLMMTQKGLNKDLGHSSIE+NGNLE FHVGDRSHPRSKEIFEELDRLE+RLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V   ESVLHDLNDEEIEE+LCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV+REII+RDAKRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_008445864.1 PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Cucumis melo]; e value: 0.0e+00; % identity: 90.45
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSL LSSLSSALSKS ITSHEASLRRKHLDQ+YVQLIVSGLHKC +LVIKFVNACLHFGDVNYAHKAF EV EPDI LWNAI+KGY QKNIV 
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        G IRMYMD+Q+SQVHPNCFTFLYVLKACGGT V+ +GKQ+HG TFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        +EALKVFKEMR CNVKPDWIALVSVMTAYTDVED+GQG+SIHGLVTKLGLEFEPDIVISLTTMYAKRG VEVARFFF++MEKPNLILWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG
        GEEAIKLFREMISKNIRVDS+T+RS ILA AQ+ SLELA WLDGY+SKSEYRDDTFVNTAL+DMYAKCGSIY AR VF+R+  KDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYNEMKQAG+ PND TF+GLLTACKNSGLVKEGWELFHQM +H IEPHHQHYSC+VDLLGRAGYLNQAYDFIMSMPIKPGV+VWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY
        HR+VRLGEIAA+QLFILDPYNTG++VQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIE+NG+LE FHVGDRSHPRSKEIFEELDRLEKRLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        VP  ESVLHDLN EEIEE+LCNHSERLAVAYGI+STAPGTTLRITKNLRACVNCHSAIK+ISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_011654911.1 pentatricopeptide repeat-containing protein At3g12770 [Cucumis sativus]; e value: 0.0e+00; % identity: 90.3
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSL LSSLSSALSKS IT HEASLRRKHLDQ+YVQLIVSGLHKC FL+IKF+NACLHFGDVNYAHKAFREV EPDILLWNAI+KGYTQKNIV 
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
          IRMYMD+Q+SQVHPNCFTFLYVLKACGGT V+GIGKQ+HGQTFKYGFGSNVFVQNSLVSMYAKFGQ S ARIVFDKLHDRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        +EAL VFKEMR CNVKPDWIALVSVMTAYT+VEDLGQG+SIHGLVTKLGLEFEPDIVISLTTMYAKRG VEVARFFFN+MEKPNLILWNAMISGYA NGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG
        GEEAIKLFREMI+KNIRVDS+T+RS +LA+AQ+ SLELARWLDGY+SKSEYRDDTFVNT LIDMYAKCGSIY AR VF+R+  KDVVLWS MIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYNEMKQAG+ PND TF+GLLTACKNSGLVKEGWELFH M DH IEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY
        HR+VRLGEIAAEQLFILDPYNTG++VQLSNLYASAHLWT VANVRLMMTQKGLNKDLGHSSIE+NGNLE F VGDRSHP+SKEIFEELDRLEKRLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        VP  ESVLHDLN EEIEE+LC+HSERLAVAYGIISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_022139117.1 pentatricopeptide repeat-containing protein At3g12770 [Momordica charantia]; e value: 0.0e+00; % identity: 88.28
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSLSLSSLS+A SK A TS EA L RKHLDQLYVQLIVSGLHKC FLVIKFVNACLH GDV YAHKAFREVLEPDILLWNA++KGYTQ NI  
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GA+++Y ++Q+S VHP+CFTFLYVLKACGG  ++ IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTS AR+VFDKL DRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
         EAL VFK+MR  NVK DWIALVSVMTAYTD+EDLGQG+SIHGLVTKLGLEFEPDIV+SLTTMYAKRG VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG
        GEEAIKLFREMISKNIRVDSVTVRS ILA AQ+ SL+LARWLDGY+SKSEYRDDTFVNTALIDMYAKCGSIYFA  VF+RMV KDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HG+EAI+LYN MKQ G+RPNDVTFVGLLTACKNSGLVKEGW+LFH++RDH IEPHHQHYSCVVDLLGRAGYLNQAYDFIM+MPIKPGVSVWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY
        HRQV+LGEIAAEQLF LDPYNTG++VQLSNLYASAHLW HVANVRLMMTQKGLNKDLGHSSIE+NGNLE FHVGDRSHPRSKEIFEELDRLE+RLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        +P  ESVLHDLN EEIEE+LCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

XP_038892943.1 pentatricopeptide repeat-containing protein At3g12770 [Benincasa hispida]; e value: 0.0e+00; % identity: 92.33
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFL+IKFVN+CLHFGDVNYAHKAFREV+EPDILLWNAI+KGYTQKNI  
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GAIRMYMD+QMS V+PNCFTFLYVLKAC G  V+GIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        VEALK+FKEMR CNVK DWI LVSVMTAYTDVEDLGQG+SIHGLVTKLGLEFEPDIVISLTTMYAK G VE+ARFFFNQMEKPNLILWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG
        GEEAIKLF EMISKNIRVDSVTVRS ILA AQ+ SL+LARWLD Y+S+SEYRDDTFVNT+L+DMYAKCGSIYFAR VF+RMV KDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI+ YNEMKQAG+ PNDVTFVGLLTACKNSGLVKEGWELFHQM+D+ IEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMP+KPGVSVWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY
        HR+VRLGEIAAEQLFILDPYN GYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSI++NGNLE FHVGDRSHPRSKEIFEELDRLEKRLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        VP  ESVLHDLN EEIEE+LCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KLB9 DYW_deaminase domain-containing protein; e value: 0.0e+00; % identity: 90.3
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSL LSSLSSALSKS IT HEASLRRKHLDQ+YVQLIVSGLHKC FL+IKF+NACLHFGDVNYAHKAFREV EPDILLWNAI+KGYTQKNIV 
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
          IRMYMD+Q+SQVHPNCFTFLYVLKACGGT V+GIGKQ+HGQTFKYGFGSNVFVQNSLVSMYAKFGQ S ARIVFDKLHDRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        +EAL VFKEMR CNVKPDWIALVSVMTAYT+VEDLGQG+SIHGLVTKLGLEFEPDIVISLTTMYAKRG VEVARFFFN+MEKPNLILWNAMISGYA NGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG
        GEEAIKLFREMI+KNIRVDS+T+RS +LA+AQ+ SLELARWLDGY+SKSEYRDDTFVNT LIDMYAKCGSIY AR VF+R+  KDVVLWS MIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYNEMKQAG+ PND TF+GLLTACKNSGLVKEGWELFH M DH IEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY
        HR+VRLGEIAAEQLFILDPYNTG++VQLSNLYASAHLWT VANVRLMMTQKGLNKDLGHSSIE+NGNLE F VGDRSHP+SKEIFEELDRLEKRLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        VP  ESVLHDLN EEIEE+LC+HSERLAVAYGIISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A1S3BEJ1 pentatricopeptide repeat-containing protein At3g12770; e value: 0.0e+00; % identity: 90.45
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSL LSSLSSALSKS ITSHEASLRRKHLDQ+YVQLIVSGLHKC +LVIKFVNACLHFGDVNYAHKAF EV EPDI LWNAI+KGY QKNIV 
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        G IRMYMD+Q+SQVHPNCFTFLYVLKACGGT V+ +GKQ+HG TFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        +EALKVFKEMR CNVKPDWIALVSVMTAYTDVED+GQG+SIHGLVTKLGLEFEPDIVISLTTMYAKRG VEVARFFF++MEKPNLILWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG
        GEEAIKLFREMISKNIRVDS+T+RS ILA AQ+ SLELA WLDGY+SKSEYRDDTFVNTAL+DMYAKCGSIY AR VF+R+  KDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYNEMKQAG+ PND TF+GLLTACKNSGLVKEGWELFHQM +H IEPHHQHYSC+VDLLGRAGYLNQAYDFIMSMPIKPGV+VWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY
        HR+VRLGEIAA+QLFILDPYNTG++VQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIE+NG+LE FHVGDRSHPRSKEIFEELDRLEKRLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        VP  ESVLHDLN EEIEE+LCNHSERLAVAYGI+STAPGTTLRITKNLRACVNCHSAIK+ISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1CD43 pentatricopeptide repeat-containing protein At3g12770; e value: 0.0e+00; % identity: 88.28
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSLSLSSLS+A SK A TS EA L RKHLDQLYVQLIVSGLHKC FLVIKFVNACLH GDV YAHKAFREVLEPDILLWNA++KGYTQ NI  
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GA+++Y ++Q+S VHP+CFTFLYVLKACGG  ++ IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTS AR+VFDKL DRTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
         EAL VFK+MR  NVK DWIALVSVMTAYTD+EDLGQG+SIHGLVTKLGLEFEPDIV+SLTTMYAKRG VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG
        GEEAIKLFREMISKNIRVDSVTVRS ILA AQ+ SL+LARWLDGY+SKSEYRDDTFVNTALIDMYAKCGSIYFA  VF+RMV KDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HG+EAI+LYN MKQ G+RPNDVTFVGLLTACKNSGLVKEGW+LFH++RDH IEPHHQHYSCVVDLLGRAGYLNQAYDFIM+MPIKPGVSVWGALLSACKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY
        HRQV+LGEIAAEQLF LDPYNTG++VQLSNLYASAHLW HVANVRLMMTQKGLNKDLGHSSIE+NGNLE FHVGDRSHPRSKEIFEELDRLE+RLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        +P  ESVLHDLN EEIEE+LCNHSERLAVAYGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1FN08 pentatricopeptide repeat-containing protein At3g12770 isoform X2; e value: 0.0e+00; % identity: 87.84
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSLSLSSLS+ALSK+A TS EA LRRKHLDQLYVQLIVSGL+KCGFLVIKFVNACLH  DVNYAHK FREVLEPDILLWN I+KGYTQ NI A
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GAIRMY D+Q+S V+P+CFTFLYVLKACGG  V+GIGKQMH QTFKYG GSNVFVQNSLVSMYA+FGQTSSAR+VFDKLH+RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        V+AL+VFK+MR   VK DWI LVSV+TAYTD+EDLGQG++IH LVTKLGLEFEPDIV+SLT MYAK G VEVARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKNI VDSVTVRS ILA AQ  SLELARWLDGY+SKSEYRDD FVNTALIDM+AKCGSI FAR VF+RMV KDVVLWSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYN MKQ+G+ PN+VTFVGLLTACKNSGLVKEGWELFHQMRDH IEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGALLS CKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY
        HRQVRLGEIAAEQLF+LDPYNTG++VQLSNLYASAHLW HV NVRLMMTQKGLNKDLGHSSIE+NGNLE FHVGDRSHPRSKEIFEELDRLE+RLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V   ESVLHDLN EEIEE+LCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREII+RDAKRFH+FKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

A0A6J1JH82 pentatricopeptide repeat-containing protein At3g12770 isoform X2; e value: 0.0e+00; % identity: 87.84
Query:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA
        MSLHSFSLSLSL+SLS+ALSK+A TS EA LRRKHLDQLYVQLIVSGL+KCGFLVIKFVNACLH  DVNYAHK FREVLEPDILLWN I+KGYTQ NI A
Subjt:  MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVA

Query:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP
        GAIRMY D+Q+S V+P+CFTFLYVLKACGG  V+GIGKQMH QTFKYGFGSNVFVQNSLVSMYA++GQTSSAR+VFDKLH+RTVVSWTSIISGYVQNGDP
Subjt:  GAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP

Query:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY
        ++AL+VFK+MR   VK DWI LVSVMTAYTD+EDLGQG++IH LVTKLGLEFEPDIV+SLT MYAK G VE+ARFFFNQMEKPNL+LWNAMISGYAKNGY
Subjt:  VEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGY

Query:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG
        GEEAI+LFR+MISKNI VDSVTVRS ILA AQ+ SLELARWLDGY+SKSEYRDD FVNTALIDM+AKCGSI FARSVF+RMV KD+V WSAMIMGYGLHG
Subjt:  GEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHG

Query:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI
        HGQEAI LYN MKQ+GIRPNDVTFVGLLTACKNSGLVKEGWELFHQM+DH IEPHHQHYSCVVDLLGRAGYLN+AYDFIMSMPIKPGVSVWGALLS CKI
Subjt:  HGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKI

Query:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY
        HRQVRLGEIAAEQLF+LDPYNTG++VQLSNLYASAHLW  VANVRLMMTQKGLNKDLGHSSIE+NGNLE FHVGDRSHPRSKEIFEELDRLE+RLK AGY
Subjt:  HRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGY

Query:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        V   ESVLHDLN EEIEE+LCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRD KRFHHFKDGVCSCGDFW
Subjt:  VPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

SwissProt top hits (e value, % identity, alignment)
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic; e value: 1.3e-147; % identity: 39.09
Query:  EASLRRKHLDQLYVQLIVSGLHKCGFLVIK-FVNACL-HFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMD-IQMSQVHPNCFTFLYV
        E  +  + L Q +  +I +G     +   K F  A L  F  + YA K F E+ +P+   WN +++ Y        +I  ++D +  SQ +PN +TF ++
Subjt:  EASLRRKHLDQLYVQLIVSGLHKCGFLVIK-FVNACL-HFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMD-IQMSQVHPNCFTFLYV

Query:  LKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVS
        +KA        +G+ +HG   K   GS+VFV NSL+  Y   G   SA  VF  + ++ VVSW S+I+G+VQ G P +AL++FK+M   +VK   + +V 
Subjt:  LKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVS

Query:  VMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFF-------------------------------NQMEKPNLILWNAMIS
        V++A   + +L  GR +   + +  +     +  ++  MY K G +E A+  F                               N M + +++ WNA+IS
Subjt:  VMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFF-------------------------------NQMEKPNLILWNAMIS

Query:  GYAKNGYGEEAIKLFREM-ISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAM
         Y +NG   EA+ +F E+ + KN++++ +T+ ST+ A AQ+ +LEL RW+  Y+ K   R +  V +ALI MY+KCG +  +R VFN +  +DV +WSAM
Subjt:  GYAKNGYGEEAIKLFREM-ISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAM

Query:  IMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMR-DHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVW
        I G  +HG G EA+ ++ +M++A ++PN VTF  +  AC ++GLV E   LFHQM  ++ I P  +HY+C+VD+LGR+GYL +A  FI +MPI P  SVW
Subjt:  IMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMR-DHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVW

Query:  GALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRL
        GALL ACKIH  + L E+A  +L  L+P N G HV LSN+YA    W +V+ +R  M   GL K+ G SSIE++G +  F  GD +HP S++++ +L  +
Subjt:  GALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRL

Query:  EKRLKEAGYVPDNESVLHDLNDEEI-EESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDF
         ++LK  GY P+   VL  + +EE+ E+SL  HSE+LA+ YG+IST     +R+ KNLR C +CHS  KLIS+L DREII+RD  RFHHF++G CSC DF
Subjt:  EKRLKEAGYVPDNESVLHDLNDEEI-EESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDF

Query:  W
        W
Subjt:  W

Q3E6Q1 Pentatricopeptide repeat-containing protein At1g11290, chloroplastic; e value: 1.0e-157; % identity: 42.6
Query:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIG
        +++  L+ SG     F +    N       VN A K F  + E D++ WN IV GY+Q  +   A+ M   +    + P+  T + VL A     +  +G
Subjt:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIG

Query:  KQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQ
        K++HG   + GF S V +  +LV MYAK G   +AR +FD + +R VVSW S+I  YVQN +P EA+ +F++M    VKP  ++++  + A  D+ DL +
Subjt:  KQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQ

Query:  GRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLE
        GR IH L  +LGL+    +V SL +MY K   V+ A   F +++   L+ WNAMI G+A+NG   +A+  F +M S+ ++ D+ T  S I A A++    
Subjt:  GRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLE

Query:  LARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLV
         A+W+ G V +S    + FV TAL+DMYAKCG+I  AR +F+ M  + V  W+AMI GYG HG G+ A+ L+ EM++  I+PN VTF+ +++AC +SGLV
Subjt:  LARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLV

Query:  KEGWELFHQMRD-HRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAH
        + G + F+ M++ + IE    HY  +VDLLGRAG LN+A+DFIM MP+KP V+V+GA+L AC+IH+ V   E AAE+LF L+P + GYHV L+N+Y +A 
Subjt:  KEGWELFHQMRD-HRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAH

Query:  LWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGIIST
        +W  V  VR+ M ++GL K  G S +E+   +  F  G  +HP SK+I+  L++L   +KEAGYVPD   VL   ND + E+ L  HSE+LA+++G+++T
Subjt:  LWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGIIST

Query:  APGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
          GTT+ + KNLR C +CH+A K IS +  REI++RD +RFHHFK+G CSCGD+W
Subjt:  APGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic; e value: 2.2e-147; % identity: 38.55
Query:  LYVQLIVSGLHKCGFLVIKFVNACL---HFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDG
        ++ Q+I  GLH   + + K +  C+   HF  + YA   F+ + EP++L+WN + +G+   +    A+++Y+ +    + PN +TF +VLK+C  +    
Subjt:  LYVQLIVSGLHKCGFLVIKFVNACL---HFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDG

Query:  IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFD---------------------------KLHD----RTVVSWTSIISGYVQNGDPVEAL
         G+Q+HG   K G   +++V  SL+SMY + G+   A  VFD                           KL D    + VVSW ++ISGY + G+  EAL
Subjt:  IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFD---------------------------KLHD----RTVVSWTSIISGYVQNGDPVEAL

Query:  KVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEA
        ++FK+M   NV+PD   +V+V++A      +  GR +H  +   G      IV +L  +Y+K G +E A   F ++   ++I WN +I GY      +EA
Subjt:  KVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEA

Query:  IKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSK--SEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHG
        + LF+EM+      + VT+ S + A A + ++++ RW+  Y+ K      + + + T+LIDMYAKCG I  A  VFN ++ K +  W+AMI G+ +HG  
Subjt:  IKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSK--SEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHG

Query:  QEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQM-RDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIH
          +  L++ M++ GI+P+D+TFVGLL+AC +SG++  G  +F  M +D+++ P  +HY C++DLLG +G   +A + I  M ++P   +W +LL ACK+H
Subjt:  QEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQM-RDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIH

Query:  RQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYV
          V LGE  AE L  ++P N G +V LSN+YASA  W  VA  R ++  KG+ K  G SSIE++  +  F +GD+ HPR++EI+  L+ +E  L++AG+V
Subjt:  RQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYV

Query:  PDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        PD   VL ++ +E  E +L +HSE+LA+A+G+IST PGT L I KNLR C NCH A KLISK+  REII RD  RFHHF+DGVCSC D+W
Subjt:  PDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g12770; e value: 1.5e-241; % identity: 58.26
Query:  EASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKA
        +++  +  L Q++ +L+V GL   GFL+ K ++A   FGD+ +A + F ++  P I  WNAI++GY++ N    A+ MY ++Q+++V P+ FTF ++LKA
Subjt:  EASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKA

Query:  CGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSV
        C G     +G+ +H Q F+ GF ++VFVQN L+++YAK  +  SAR VF+   L +RT+VSWT+I+S Y QNG+P+EAL++F +MR  +VKPDW+ALVSV
Subjt:  CGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSV

Query:  MTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRS
        + A+T ++DL QGRSIH  V K+GLE EPD++ISL TMYAK G V  A+  F++M+ PNLILWNAMISGYAKNGY  EAI +F EMI+K++R D++++ S
Subjt:  MTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRS

Query:  TILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFV
         I A AQ+ SLE AR +  YV +S+YRDD F+++ALIDM+AKCGS+  AR VF+R + +DVV+WSAMI+GYGLHG  +EAISLY  M++ G+ PNDVTF+
Subjt:  TILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFV

Query:  GLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYH
        GLL AC +SG+V+EGW  F++M DH+I P  QHY+CV+DLLGRAG+L+QAY+ I  MP++PGV+VWGALLSACK HR V LGE AA+QLF +DP NTG++
Subjt:  GLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYH

Query:  VQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSE
        VQLSNLYA+A LW  VA VR+ M +KGLNKD+G S +EV G LE F VGD+SHPR +EI  +++ +E RLKE G+V + ++ LHDLNDEE EE+LC+HSE
Subjt:  VQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSE

Query:  RLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        R+A+AYG+IST  GT LRITKNLRACVNCH+A KLISKLVDREI++RD  RFHHFKDGVCSCGD+W
Subjt:  RLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

Q9LW32 Pentatricopeptide repeat-containing protein At3g26782, mitochondrial    e value: 5.7e-148    %identity: 41.53
Query:  REVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIV
        R V + D+  WN+++    +    A A+  +  ++   ++P   +F   +KAC        GKQ H Q F +G+ S++FV ++L+ MY+  G+   AR V
Subjt:  REVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIV

Query:  FDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM------RHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAK--R
        FD++  R +VSWTS+I GY  NG+ ++A+ +FK++          +  D + LVSV++A + V   G   SIH  V K G +    +  +L   YAK   
Subjt:  FDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM------RHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAK--R

Query:  GWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYA
        G V VAR  F+Q+   + + +N+++S YA++G   EA ++FR ++ +K +  +++T+ + +LA +   +L + + +   V +    DD  V T++IDMY 
Subjt:  GWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYA

Query:  KCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH-RIEPHHQHYSCVVDL
        KCG +  AR  F+RM  K+V  W+AMI GYG+HGH  +A+ L+  M  +G+RPN +TFV +L AC ++GL  EGW  F+ M+    +EP  +HY C+VDL
Subjt:  KCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH-RIEPHHQHYSCVVDL

Query:  LGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVN
        LGRAG+L +AYD I  M +KP   +W +LL+AC+IH+ V L EI+  +LF LD  N GY++ LS++YA A  W  V  VR++M  +GL K  G S +E+N
Subjt:  LGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVN

Query:  GNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV
        G + +F +GD  HP+ ++I+E L  L ++L EAGYV +  SV HD+++EE E +L  HSE+LA+A+GI++T PG+T+ + KNLR C +CH+ IKLISK+V
Subjt:  GNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV

Query:  DREIIIRDAKRFHHFKDGVCSCGDFW
        DRE ++RDAKRFHHFKDG CSCGD+W
Subjt:  DREIIIRDAKRFHHFKDGVCSCGDFW

Arabidopsis top hits    e value    %identity    Alignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein    e value: 1.5e-148    %identity: 38.55
Query:  LYVQLIVSGLHKCGFLVIKFVNACL---HFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDG
        ++ Q+I  GLH   + + K +  C+   HF  + YA   F+ + EP++L+WN + +G+   +    A+++Y+ +    + PN +TF +VLK+C  +    
Subjt:  LYVQLIVSGLHKCGFLVIKFVNACL---HFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDG

Query:  IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFD---------------------------KLHD----RTVVSWTSIISGYVQNGDPVEAL
         G+Q+HG   K G   +++V  SL+SMY + G+   A  VFD                           KL D    + VVSW ++ISGY + G+  EAL
Subjt:  IGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFD---------------------------KLHD----RTVVSWTSIISGYVQNGDPVEAL

Query:  KVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEA
        ++FK+M   NV+PD   +V+V++A      +  GR +H  +   G      IV +L  +Y+K G +E A   F ++   ++I WN +I GY      +EA
Subjt:  KVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEA

Query:  IKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSK--SEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHG
        + LF+EM+      + VT+ S + A A + ++++ RW+  Y+ K      + + + T+LIDMYAKCG I  A  VFN ++ K +  W+AMI G+ +HG  
Subjt:  IKLFREMISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSK--SEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHG

Query:  QEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQM-RDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIH
          +  L++ M++ GI+P+D+TFVGLL+AC +SG++  G  +F  M +D+++ P  +HY C++DLLG +G   +A + I  M ++P   +W +LL ACK+H
Subjt:  QEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQM-RDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIH

Query:  RQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYV
          V LGE  AE L  ++P N G +V LSN+YASA  W  VA  R ++  KG+ K  G SSIE++  +  F +GD+ HPR++EI+  L+ +E  L++AG+V
Subjt:  RQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYV

Query:  PDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        PD   VL ++ +E  E +L +HSE+LA+A+G+IST PGT L I KNLR C NCH A KLISK+  REII RD  RFHHF+DGVCSC D+W
Subjt:  PDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

AT1G11290.1 Pentatricopeptide repeat (PPR) superfamily protein    e value: 7.4e-159    %identity: 42.6
Query:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIG
        +++  L+ SG     F +    N       VN A K F  + E D++ WN IV GY+Q  +   A+ M   +    + P+  T + VL A     +  +G
Subjt:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIG

Query:  KQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQ
        K++HG   + GF S V +  +LV MYAK G   +AR +FD + +R VVSW S+I  YVQN +P EA+ +F++M    VKP  ++++  + A  D+ DL +
Subjt:  KQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSVMTAYTDVEDLGQ

Query:  GRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLE
        GR IH L  +LGL+    +V SL +MY K   V+ A   F +++   L+ WNAMI G+A+NG   +A+  F +M S+ ++ D+ T  S I A A++    
Subjt:  GRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAAAQIRSLE

Query:  LARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLV
         A+W+ G V +S    + FV TAL+DMYAKCG+I  AR +F+ M  + V  W+AMI GYG HG G+ A+ L+ EM++  I+PN VTF+ +++AC +SGLV
Subjt:  LARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLV

Query:  KEGWELFHQMRD-HRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAH
        + G + F+ M++ + IE    HY  +VDLLGRAG LN+A+DFIM MP+KP V+V+GA+L AC+IH+ V   E AAE+LF L+P + GYHV L+N+Y +A 
Subjt:  KEGWELFHQMRD-HRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAH

Query:  LWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGIIST
        +W  V  VR+ M ++GL K  G S +E+   +  F  G  +HP SK+I+  L++L   +KEAGYVPD   VL   ND + E+ L  HSE+LA+++G+++T
Subjt:  LWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGIIST

Query:  APGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
          GTT+ + KNLR C +CH+A K IS +  REI++RD +RFHHFK+G CSCGD+W
Subjt:  APGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein    e value: 9.0e-149    %identity: 39.09
Query:  EASLRRKHLDQLYVQLIVSGLHKCGFLVIK-FVNACL-HFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMD-IQMSQVHPNCFTFLYV
        E  +  + L Q +  +I +G     +   K F  A L  F  + YA K F E+ +P+   WN +++ Y        +I  ++D +  SQ +PN +TF ++
Subjt:  EASLRRKHLDQLYVQLIVSGLHKCGFLVIK-FVNACL-HFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMD-IQMSQVHPNCFTFLYV

Query:  LKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVS
        +KA        +G+ +HG   K   GS+VFV NSL+  Y   G   SA  VF  + ++ VVSW S+I+G+VQ G P +AL++FK+M   +VK   + +V 
Subjt:  LKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVS

Query:  VMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFF-------------------------------NQMEKPNLILWNAMIS
        V++A   + +L  GR +   + +  +     +  ++  MY K G +E A+  F                               N M + +++ WNA+IS
Subjt:  VMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFF-------------------------------NQMEKPNLILWNAMIS

Query:  GYAKNGYGEEAIKLFREM-ISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAM
         Y +NG   EA+ +F E+ + KN++++ +T+ ST+ A AQ+ +LEL RW+  Y+ K   R +  V +ALI MY+KCG +  +R VFN +  +DV +WSAM
Subjt:  GYAKNGYGEEAIKLFREM-ISKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAM

Query:  IMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMR-DHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVW
        I G  +HG G EA+ ++ +M++A ++PN VTF  +  AC ++GLV E   LFHQM  ++ I P  +HY+C+VD+LGR+GYL +A  FI +MPI P  SVW
Subjt:  IMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMR-DHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVW

Query:  GALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRL
        GALL ACKIH  + L E+A  +L  L+P N G HV LSN+YA    W +V+ +R  M   GL K+ G SSIE++G +  F  GD +HP S++++ +L  +
Subjt:  GALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRL

Query:  EKRLKEAGYVPDNESVLHDLNDEEI-EESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDF
         ++LK  GY P+   VL  + +EE+ E+SL  HSE+LA+ YG+IST     +R+ KNLR C +CHS  KLIS+L DREII+RD  RFHHF++G CSC DF
Subjt:  EKRLKEAGYVPDNESVLHDLNDEEI-EESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDF

Query:  W
        W
Subjt:  W

AT3G12770.1 mitochondrial editing factor 22    e value: 1.1e-242    %identity: 58.26
Query:  EASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKA
        +++  +  L Q++ +L+V GL   GFL+ K ++A   FGD+ +A + F ++  P I  WNAI++GY++ N    A+ MY ++Q+++V P+ FTF ++LKA
Subjt:  EASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKA

Query:  CGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSV
        C G     +G+ +H Q F+ GF ++VFVQN L+++YAK  +  SAR VF+   L +RT+VSWT+I+S Y QNG+P+EAL++F +MR  +VKPDW+ALVSV
Subjt:  CGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWIALVSV

Query:  MTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRS
        + A+T ++DL QGRSIH  V K+GLE EPD++ISL TMYAK G V  A+  F++M+ PNLILWNAMISGYAKNGY  EAI +F EMI+K++R D++++ S
Subjt:  MTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRS

Query:  TILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFV
         I A AQ+ SLE AR +  YV +S+YRDD F+++ALIDM+AKCGS+  AR VF+R + +DVV+WSAMI+GYGLHG  +EAISLY  M++ G+ PNDVTF+
Subjt:  TILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFV

Query:  GLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYH
        GLL AC +SG+V+EGW  F++M DH+I P  QHY+CV+DLLGRAG+L+QAY+ I  MP++PGV+VWGALLSACK HR V LGE AA+QLF +DP NTG++
Subjt:  GLLTACKNSGLVKEGWELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYH

Query:  VQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSE
        VQLSNLYA+A LW  VA VR+ M +KGLNKD+G S +EV G LE F VGD+SHPR +EI  +++ +E RLKE G+V + ++ LHDLNDEE EE+LC+HSE
Subjt:  VQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSE

Query:  RLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
        R+A+AYG+IST  GT LRITKNLRACVNCH+A KLISKLVDREI++RD  RFHHFKDGVCSCGD+W
Subjt:  RLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGVCSCGDFW

AT3G26782.1 Tetratricopeptide repeat (TPR)-like superfamily protein    e value: 4.1e-149    %identity: 41.53
Query:  REVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIV
        R V + D+  WN+++    +    A A+  +  ++   ++P   +F   +KAC        GKQ H Q F +G+ S++FV ++L+ MY+  G+   AR V
Subjt:  REVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQMSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIV

Query:  FDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM------RHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAK--R
        FD++  R +VSWTS+I GY  NG+ ++A+ +FK++          +  D + LVSV++A + V   G   SIH  V K G +    +  +L   YAK   
Subjt:  FDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM------RHCNVKPDWIALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAK--R

Query:  GWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYA
        G V VAR  F+Q+   + + +N+++S YA++G   EA ++FR ++ +K +  +++T+ + +LA +   +L + + +   V +    DD  V T++IDMY 
Subjt:  GWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMI-SKNIRVDSVTVRSTILAAAQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYA

Query:  KCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH-RIEPHHQHYSCVVDL
        KCG +  AR  F+RM  K+V  W+AMI GYG+HGH  +A+ L+  M  +G+RPN +TFV +L AC ++GL  EGW  F+ M+    +EP  +HY C+VDL
Subjt:  KCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEGWELFHQMRDH-RIEPHHQHYSCVVDL

Query:  LGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVN
        LGRAG+L +AYD I  M +KP   +W +LL+AC+IH+ V L EI+  +LF LD  N GY++ LS++YA A  W  V  VR++M  +GL K  G S +E+N
Subjt:  LGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQKGLNKDLGHSSIEVN

Query:  GNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV
        G + +F +GD  HP+ ++I+E L  L ++L EAGYV +  SV HD+++EE E +L  HSE+LA+A+GI++T PG+T+ + KNLR C +CH+ IKLISK+V
Subjt:  GNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLV

Query:  DREIIIRDAKRFHHFKDGVCSCGDFW
        DRE ++RDAKRFHHFKDG CSCGD+W
Subjt:  DREIIIRDAKRFHHFKDGVCSCGDFW
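
The middle line of each alignment block marks identical residues with the residue letter, conservative substitutions with "+", and mismatches or gaps with a space. As a rough illustration, the sketch below counts these for a single block, using the last block of the AT3G26782.1 alignment as input. The function name `block_identity` is hypothetical and not part of any BLAST tool. The %identity column is presumably computed over the whole alignment, so a single-block figure will generally differ from it.

```python
def block_identity(query, midline):
    """Return (identities, positives, aligned_length) for one alignment block."""
    # Text dumps often strip trailing spaces, so pad the midline to full width.
    midline = midline.ljust(len(query))
    # A letter in the midline means query and subject carry the same residue.
    identities = sum(1 for m in midline if m.isalpha())
    # "+" marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "DREIIIRDAKRFHHFKDGVCSCGDFW"
midline = "DRE ++RDAKRFHHFKDG CSCGD+W"
ident, pos, length = block_identity(query, midline)
print(f"{ident}/{length} identical ({100 * ident / length:.2f}%), {pos}/{length} positive")
# prints: 21/26 identical (80.77%), 24/26 positive
```

Summing the three counts over every block of a hit before dividing would give a figure for the full alignment.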


Sequences
CDS sequence
ATGTCTTTGCATTCGTTTTCGCTTTCTCTCTCCTTGTCATCACTCTCATCAGCTCTCTCAAAGTCGGCGATAACCTCGCATGAGGCTTCATTAAGGAGGAAGCATTTGGA
TCAATTATACGTCCAGTTAATTGTGTCTGGACTACACAAGTGTGGTTTCTTGGTGATCAAATTTGTCAATGCTTGTTTGCATTTCGGAGATGTTAACTACGCACACAAGG
CTTTTCGCGAAGTCTTAGAACCGGATATTTTGTTGTGGAATGCCATCGTAAAGGGTTACACTCAGAAGAATATTGTTGCTGGTGCTATCAGAATGTATATGGATATACAA
ATGTCACAGGTGCACCCGAATTGCTTCACATTTTTGTATGTGCTTAAAGCATGCGGTGGAACATTAGTTGACGGAATAGGTAAACAGATGCATGGCCAGACATTTAAATA
TGGCTTTGGATCAAATGTTTTTGTGCAGAATAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAACCTCGTCTGCTAGGATCGTGTTTGATAAGCTGCATGATAGGACAG
TTGTTTCATGGACTTCCATCATTTCTGGGTACGTTCAGAATGGTGATCCCGTGGAAGCATTGAAAGTTTTCAAAGAAATGAGACATTGTAATGTAAAGCCTGATTGGATT
GCCCTTGTTAGCGTCATGACAGCATATACGGATGTGGAAGATTTGGGACAAGGAAGGTCCATTCATGGTTTAGTAACTAAATTGGGTCTAGAATTTGAACCCGATATAGT
GATATCGCTCACTACTATGTATGCAAAACGTGGATGGGTAGAAGTTGCCAGATTTTTCTTTAATCAGATGGAAAAACCAAATTTAATATTGTGGAATGCTATGATTTCTG
GCTATGCAAAAAATGGATATGGTGAAGAAGCAATCAAACTATTCCGCGAGATGATTTCCAAAAATATCAGGGTTGATTCTGTTACTGTGAGGTCTACTATTCTAGCCGCT
GCCCAAATCAGGTCTCTTGAACTAGCAAGATGGTTGGATGGTTATGTCTCTAAGAGTGAGTACAGAGACGATACTTTTGTAAACACGGCCCTTATAGATATGTATGCAAA
ATGCGGAAGCATATATTTTGCTCGTAGTGTTTTCAATAGAATGGTCGGTAAAGACGTTGTCTTATGGAGTGCAATGATTATGGGGTATGGATTACATGGTCATGGACAAG
AAGCCATCAGCCTTTACAATGAAATGAAGCAAGCTGGAATTCGTCCAAACGATGTTACTTTTGTTGGTCTTCTCACAGCATGCAAAAATTCAGGTCTTGTAAAAGAGGGA
TGGGAGCTTTTCCACCAGATGCGAGACCACAGGATTGAACCACATCACCAGCATTACTCTTGCGTGGTCGATCTTCTAGGACGTGCAGGCTATTTGAATCAAGCTTATGA
TTTCATTATGAGCATGCCAATTAAACCTGGAGTTAGTGTTTGGGGGGCTCTTCTGAGTGCTTGCAAGATCCATCGCCAAGTAAGGTTGGGAGAAATTGCTGCAGAACAAC
TTTTCATATTAGATCCATATAATACAGGGTATCATGTGCAACTCTCAAACCTCTATGCTTCTGCCCATTTATGGACTCACGTGGCTAACGTTCGATTAATGATGACACAA
AAAGGACTGAACAAGGACCTCGGACATAGTTCTATCGAGGTAAACGGAAATCTCGAAATGTTTCATGTTGGCGATAGATCACATCCCAGATCAAAGGAAATTTTTGAAGA
GCTTGATAGATTAGAGAAAAGATTAAAAGAAGCTGGTTATGTTCCTGATAATGAATCTGTTCTACATGACTTGAATGATGAGGAGATTGAGGAAAGTCTTTGTAACCACA
GTGAGAGACTAGCAGTTGCTTATGGTATCATTAGTACTGCTCCTGGAACTACACTTAGAATAACCAAGAATCTCCGAGCATGCGTTAATTGCCATTCAGCGATAAAGCTT
ATATCAAAGCTTGTCGATAGGGAAATAATTATTCGAGACGCAAAACGTTTTCACCATTTCAAAGATGGAGTTTGTTCATGTGGAGATTTCTGGTGA
mRNA sequence
CCCACTTCAAAAATTGAACAACTCTTTTGTTTCTTTATATTTTTTGGTTCCAGATTCGAAGTAATTGAGACGATTGGTGCAATCTACTTACGTCTAAAGAACGTTTGGGC
TGAGGTCAAGCAATTGTTTTTCGGATTGAAAAAAGCGATGGCGATTTTCCTCTGTTCCAACAGTTATGGGGTCCAACTTCGTCGGCCGCCGCTACATCACCGTCGCCCCC
CAATCTGCTGCAATCGCACAGCTTTATCCGAGTTTTCTTCCCCGGAGAACCTCCTCTCCGGAGAGGCAGGGATTTGACAGAGCTGGCTTTAGCAGCTCCTCGCACCTCCA
CCCATGTCTTTGCATTCGTTTTCGCTTTCTCTCTCCTTGTCATCACTCTCATCAGCTCTCTCAAAGTCGGCGATAACCTCGCATGAGGCTTCATTAAGGAGGAAGCATTT
GGATCAATTATACGTCCAGTTAATTGTGTCTGGACTACACAAGTGTGGTTTCTTGGTGATCAAATTTGTCAATGCTTGTTTGCATTTCGGAGATGTTAACTACGCACACA
AGGCTTTTCGCGAAGTCTTAGAACCGGATATTTTGTTGTGGAATGCCATCGTAAAGGGTTACACTCAGAAGAATATTGTTGCTGGTGCTATCAGAATGTATATGGATATA
CAAATGTCACAGGTGCACCCGAATTGCTTCACATTTTTGTATGTGCTTAAAGCATGCGGTGGAACATTAGTTGACGGAATAGGTAAACAGATGCATGGCCAGACATTTAA
ATATGGCTTTGGATCAAATGTTTTTGTGCAGAATAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAACCTCGTCTGCTAGGATCGTGTTTGATAAGCTGCATGATAGGA
CAGTTGTTTCATGGACTTCCATCATTTCTGGGTACGTTCAGAATGGTGATCCCGTGGAAGCATTGAAAGTTTTCAAAGAAATGAGACATTGTAATGTAAAGCCTGATTGG
ATTGCCCTTGTTAGCGTCATGACAGCATATACGGATGTGGAAGATTTGGGACAAGGAAGGTCCATTCATGGTTTAGTAACTAAATTGGGTCTAGAATTTGAACCCGATAT
AGTGATATCGCTCACTACTATGTATGCAAAACGTGGATGGGTAGAAGTTGCCAGATTTTTCTTTAATCAGATGGAAAAACCAAATTTAATATTGTGGAATGCTATGATTT
CTGGCTATGCAAAAAATGGATATGGTGAAGAAGCAATCAAACTATTCCGCGAGATGATTTCCAAAAATATCAGGGTTGATTCTGTTACTGTGAGGTCTACTATTCTAGCC
GCTGCCCAAATCAGGTCTCTTGAACTAGCAAGATGGTTGGATGGTTATGTCTCTAAGAGTGAGTACAGAGACGATACTTTTGTAAACACGGCCCTTATAGATATGTATGC
AAAATGCGGAAGCATATATTTTGCTCGTAGTGTTTTCAATAGAATGGTCGGTAAAGACGTTGTCTTATGGAGTGCAATGATTATGGGGTATGGATTACATGGTCATGGAC
AAGAAGCCATCAGCCTTTACAATGAAATGAAGCAAGCTGGAATTCGTCCAAACGATGTTACTTTTGTTGGTCTTCTCACAGCATGCAAAAATTCAGGTCTTGTAAAAGAG
GGATGGGAGCTTTTCCACCAGATGCGAGACCACAGGATTGAACCACATCACCAGCATTACTCTTGCGTGGTCGATCTTCTAGGACGTGCAGGCTATTTGAATCAAGCTTA
TGATTTCATTATGAGCATGCCAATTAAACCTGGAGTTAGTGTTTGGGGGGCTCTTCTGAGTGCTTGCAAGATCCATCGCCAAGTAAGGTTGGGAGAAATTGCTGCAGAAC
AACTTTTCATATTAGATCCATATAATACAGGGTATCATGTGCAACTCTCAAACCTCTATGCTTCTGCCCATTTATGGACTCACGTGGCTAACGTTCGATTAATGATGACA
CAAAAAGGACTGAACAAGGACCTCGGACATAGTTCTATCGAGGTAAACGGAAATCTCGAAATGTTTCATGTTGGCGATAGATCACATCCCAGATCAAAGGAAATTTTTGA
AGAGCTTGATAGATTAGAGAAAAGATTAAAAGAAGCTGGTTATGTTCCTGATAATGAATCTGTTCTACATGACTTGAATGATGAGGAGATTGAGGAAAGTCTTTGTAACC
ACAGTGAGAGACTAGCAGTTGCTTATGGTATCATTAGTACTGCTCCTGGAACTACACTTAGAATAACCAAGAATCTCCGAGCATGCGTTAATTGCCATTCAGCGATAAAG
CTTATATCAAAGCTTGTCGATAGGGAAATAATTATTCGAGACGCAAAACGTTTTCACCATTTCAAAGATGGAGTTTGTTCATGTGGAGATTTCTGGTGAAGCCTGTCTAA
CACATGGGCTTGCGTAAACCTTCCTGGACACTGTAGTAATTAAGCGAACTTCATGGTTCTTCATGTAGATCAATTTCATTCAATAAGGGAACACCTATGATAATAGATGA
TATAGAGGATGAACAAGTATGTATCCAAGAAGAATGAAAACCAGAACTAGGGGTCCCTACTGTAGAACCATTTTACAAACGTGTATTTGGAAACATTTGTGAGGATTTTG
TTCCTCTTCAATCGTTATGCCACAAGATGTGGAACAAAATAAGGATGTTAATTGTTGTGTATGATGATTTTCATCAACCTGATATTGGAAGAGAGATACCTGAACAATAA
ATGGTATTGGTCAAAGTATATTTTGTTTAATA
Protein sequence
MSLHSFSLSLSLSSLSSALSKSAITSHEASLRRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIVKGYTQKNIVAGAIRMYMDIQ
MSQVHPNCFTFLYVLKACGGTLVDGIGKQMHGQTFKYGFGSNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRHCNVKPDWI
ALVSVMTAYTDVEDLGQGRSIHGLVTKLGLEFEPDIVISLTTMYAKRGWVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFREMISKNIRVDSVTVRSTILAA
AQIRSLELARWLDGYVSKSEYRDDTFVNTALIDMYAKCGSIYFARSVFNRMVGKDVVLWSAMIMGYGLHGHGQEAISLYNEMKQAGIRPNDVTFVGLLTACKNSGLVKEG
WELFHQMRDHRIEPHHQHYSCVVDLLGRAGYLNQAYDFIMSMPIKPGVSVWGALLSACKIHRQVRLGEIAAEQLFILDPYNTGYHVQLSNLYASAHLWTHVANVRLMMTQ
KGLNKDLGHSSIEVNGNLEMFHVGDRSHPRSKEIFEELDRLEKRLKEAGYVPDNESVLHDLNDEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKL
ISKLVDREIIIRDAKRFHHFKDGVCSCGDFW
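
The CDS and protein sequences listed above should correspond codon by codon. The sketch below checks this on the first ten codons and the last six codons of the CDS, assuming the standard genetic code. The helper name `translate` and the two excerpt variables are illustrative only.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (first base slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTCTTTGCATTCGTTTTCGCTTTCTCTC"  # first 30 nt of the CDS above
cds_end = "TGTGGAGATTTCTGGTGA"                # last 18 nt, including the stop codon

print(translate(cds_start))  # MSLHSFSLSL
print(translate(cds_end))    # CGDFW
```

Both outputs match the start (MSLHSFSLSL) and end (CGDFW) of the protein sequence shown above. Joining all CDS lines into one string and passing it to `translate` would extend the same check to the full sequence.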