; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10014979 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10014979
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr02:22578761..22584429
RNA-Seq ExpressionHG10014979
SyntenyHG10014979
Gene Ontology termsGO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034056.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]1.4e-21060.33Show/hide
Query:  PPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNC
        P K  +GSNL PLS KHLDQ+YVQLIVSGLHKC +LVIKFVNACLHFGDVNYAHKAF EV EPDI LWNAIIKGY QKNIV G IRMY DMQ+S+VHPNC
Subjt:  PPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNC

Query:  FTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPD
        FTFLYVLKACGGT VE LGKQ+HG TFKYGFG+NVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP+EALKVFKEMRQCNVKPD
Subjt:  FTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPD

Query:  WIALVSVMTAYTDVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRV
        WIALVSVMTAYTDVED+GQGKS+HGLVTK G+EFEPDIVISLTTMYAKRGLVEVARFFF++MEKPNLILWNAMISGYAKNGYGEEAIKLFR+MISKNIRV
Subjt:  WIALVSVMTAYTDVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRV

Query:  DSVT------------------------------------------------------------------------------------------------
        DS+T                                                                                                
Subjt:  DSVT------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEE
                                        KGLNKDLGHSSIEING+LETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVP  ESVLHDLNHEEIEE
Subjt:  --------------------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEE

Query:  SLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        +LCNHSERLAVAYGI+STAPGTTLRITKNLRACVNCHSAIK+ISKLVDREIIIRDAKRFHHFKDG+CSCGDFW
Subjt:  SLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

XP_008445864.1 PREDICTED: pentatricopeptide repeat-containing protein At3g12770 [Cucumis melo]9.5e-20760.36Show/hide
Query:  LSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGG
        L RKHLDQ+YVQLIVSGLHKC +LVIKFVNACLHFGDVNYAHKAF EV EPDI LWNAIIKGY QKNIV G IRMY DMQ+S+VHPNCFTFLYVLKACGG
Subjt:  LSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGG

Query:  TLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT
        T VE LGKQ+HG TFKYGFG+NVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP+EALKVFKEMRQCNVKPDWIALVSVMTAYT
Subjt:  TLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT

Query:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------
        DVED+GQGKS+HGLVTK G+EFEPDIVISLTTMYAKRGLVEVARFFF++MEKPNLILWNAMISGYAKNGYGEEAIKLFR+MISKNIRVDS+T        
Subjt:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA
                            KGLNKDLGHSSIEING+LETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVP  ESVLHDLNHEEIEE+LCNHSERLAVA
Subjt:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA

Query:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        YGI+STAPGTTLRITKNLRACVNCHSAIK+ISKLVDREIIIRDAKRFHHFKDG+CSCGDFW
Subjt:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

XP_011654911.1 pentatricopeptide repeat-containing protein At3g12770 [Cucumis sativus]1.6e-20658.37Show/hide
Query:  FAFSLLVVTHISSL-KSFAPLHLEPLPPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKG
        F+ SLL+ +  S+L KS   LH         E S    L RKHLDQ+YVQLIVSGLHKC FL+IKF+NACLHFGDVNYAHKAFREV EPDILLWNAIIKG
Subjt:  FAFSLLVVTHISSL-KSFAPLHLEPLPPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKG

Query:  YTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIIS
        YTQKNIV   IRMY DMQ+S+VHPNCFTFLYVLKACGGT VEG+GKQ+HGQTFKYGFG+NVFVQNSLVSMYAKFGQ S ARIVFDKLHDRTVVSWTSIIS
Subjt:  YTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIIS

Query:  GYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYTDVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMI
        GYVQNGDP+EAL VFKEMRQCNVKPDWIALVSVMTAYT+VEDLGQGKS+HGLVTK G+EFEPDIVISLTTMYAKRGLVEVARFFFN+MEKPNLILWNAMI
Subjt:  GYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYTDVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMI

Query:  SGYAKNGYGEEAIKLFRKMISKNIRVDSVT----------------------------------------------------------------------
        SGYA NGYGEEAIKLFR+MI+KNIRVDS+T                                                                      
Subjt:  SGYAKNGYGEEAIKLFRKMISKNIRVDSVT----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLE
                                                                  KGLNKDLGHSSIEINGNLETF VGDRSHP+SKEIFEELDRLE
Subjt:  ----------------------------------------------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLE

Query:  KRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        KRLKAAGYVP  ESVLHDLNHEEIEE+LC+HSERLAVAYGIISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREIIIRDAKRFHHFKDG+CSCGDFW
Subjt:  KRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

XP_022139117.1 pentatricopeptide repeat-containing protein At3g12770 [Momordica charantia]1.0e-20058.55Show/hide
Query:  LSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGG
        L RKHLDQLYVQLIVSGLHKC FLVIKFVNACLH GDV YAHKAFREVLEPDILLWNA+IKGYTQ NI  GA+++YT+MQ+S VHP+CFTFLYVLKACGG
Subjt:  LSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGG

Query:  TLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT
          +E +GKQMHGQTFKYGFG+NVFVQNSLVSMYAKFGQTS AR+VFDKL DRTVVSWTSIISGYVQNGDP EAL VFK+MRQ NVK DWIALVSVMTAYT
Subjt:  TLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT

Query:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------
        D+EDLGQGKS+HGLVTK G+EFEPDIV+SLTTMYAKRG VEVARFFFNQMEKPNL+LWNAMISGYAKNGYGEEAIKLFR+MISKNIRVDSVT        
Subjt:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA
                            KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLE+RLKAAGY+P  ESVLHDLNHEEIEE+LCNHSERLAVA
Subjt:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA

Query:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        YGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

XP_038892943.1 pentatricopeptide repeat-containing protein At3g12770 [Benincasa hispida]1.1e-21061.12Show/hide
Query:  LSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGG
        L RKHLDQLYVQLIVSGLHKCGFL+IKFVN+CLHFGDVNYAHKAFREV+EPDILLWNAIIKGYTQKNI  GAIRMY DMQMS V+PNCFTFLYVLKAC G
Subjt:  LSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGG

Query:  TLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT
          VEG+GKQMHGQTFKYGFG+NVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALK+FKEMRQCNVK DWI LVSVMTAYT
Subjt:  TLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT

Query:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------
        DVEDLGQGKS+HGLVTK G+EFEPDIVISLTTMYAK GLVE+ARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLF +MISKNIRVDSVT        
Subjt:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA
                            KGLNKDLGHSSI+INGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVP  ESVLHDLNHEEIEE+LCNHSERLAVA
Subjt:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA

Query:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDG+CSCGDFW
Subjt:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

TrEMBL top hitse value%identityAlignment
A0A0A0KLB9 DYW_deaminase domain-containing protein7.9e-20758.37Show/hide
Query:  FAFSLLVVTHISSL-KSFAPLHLEPLPPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKG
        F+ SLL+ +  S+L KS   LH         E S    L RKHLDQ+YVQLIVSGLHKC FL+IKF+NACLHFGDVNYAHKAFREV EPDILLWNAIIKG
Subjt:  FAFSLLVVTHISSL-KSFAPLHLEPLPPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKG

Query:  YTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIIS
        YTQKNIV   IRMY DMQ+S+VHPNCFTFLYVLKACGGT VEG+GKQ+HGQTFKYGFG+NVFVQNSLVSMYAKFGQ S ARIVFDKLHDRTVVSWTSIIS
Subjt:  YTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIIS

Query:  GYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYTDVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMI
        GYVQNGDP+EAL VFKEMRQCNVKPDWIALVSVMTAYT+VEDLGQGKS+HGLVTK G+EFEPDIVISLTTMYAKRGLVEVARFFFN+MEKPNLILWNAMI
Subjt:  GYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYTDVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMI

Query:  SGYAKNGYGEEAIKLFRKMISKNIRVDSVT----------------------------------------------------------------------
        SGYA NGYGEEAIKLFR+MI+KNIRVDS+T                                                                      
Subjt:  SGYAKNGYGEEAIKLFRKMISKNIRVDSVT----------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLE
                                                                  KGLNKDLGHSSIEINGNLETF VGDRSHP+SKEIFEELDRLE
Subjt:  ----------------------------------------------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLE

Query:  KRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        KRLKAAGYVP  ESVLHDLNHEEIEE+LC+HSERLAVAYGIISTAPGTTLRITKNLRAC+NCHSAIKLISKLVDREIIIRDAKRFHHFKDG+CSCGDFW
Subjt:  KRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

A0A1S3BEJ1 pentatricopeptide repeat-containing protein At3g127704.6e-20760.36Show/hide
Query:  LSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGG
        L RKHLDQ+YVQLIVSGLHKC +LVIKFVNACLHFGDVNYAHKAF EV EPDI LWNAIIKGY QKNIV G IRMY DMQ+S+VHPNCFTFLYVLKACGG
Subjt:  LSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGG

Query:  TLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT
        T VE LGKQ+HG TFKYGFG+NVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP+EALKVFKEMRQCNVKPDWIALVSVMTAYT
Subjt:  TLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT

Query:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------
        DVED+GQGKS+HGLVTK G+EFEPDIVISLTTMYAKRGLVEVARFFF++MEKPNLILWNAMISGYAKNGYGEEAIKLFR+MISKNIRVDS+T        
Subjt:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA
                            KGLNKDLGHSSIEING+LETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVP  ESVLHDLNHEEIEE+LCNHSERLAVA
Subjt:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA

Query:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        YGI+STAPGTTLRITKNLRACVNCHSAIK+ISKLVDREIIIRDAKRFHHFKDG+CSCGDFW
Subjt:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

A0A5A7SVV3 Pentatricopeptide repeat-containing protein6.9e-21160.33Show/hide
Query:  PPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNC
        P K  +GSNL PLS KHLDQ+YVQLIVSGLHKC +LVIKFVNACLHFGDVNYAHKAF EV EPDI LWNAIIKGY QKNIV G IRMY DMQ+S+VHPNC
Subjt:  PPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNC

Query:  FTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPD
        FTFLYVLKACGGT VE LGKQ+HG TFKYGFG+NVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDP+EALKVFKEMRQCNVKPD
Subjt:  FTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPD

Query:  WIALVSVMTAYTDVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRV
        WIALVSVMTAYTDVED+GQGKS+HGLVTK G+EFEPDIVISLTTMYAKRGLVEVARFFF++MEKPNLILWNAMISGYAKNGYGEEAIKLFR+MISKNIRV
Subjt:  WIALVSVMTAYTDVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRV

Query:  DSVT------------------------------------------------------------------------------------------------
        DS+T                                                                                                
Subjt:  DSVT------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEE
                                        KGLNKDLGHSSIEING+LETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVP  ESVLHDLNHEEIEE
Subjt:  --------------------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEE

Query:  SLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        +LCNHSERLAVAYGI+STAPGTTLRITKNLRACVNCHSAIK+ISKLVDREIIIRDAKRFHHFKDG+CSCGDFW
Subjt:  SLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

A0A6J1CD43 pentatricopeptide repeat-containing protein At3g127704.9e-20158.55Show/hide
Query:  LSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGG
        L RKHLDQLYVQLIVSGLHKC FLVIKFVNACLH GDV YAHKAFREVLEPDILLWNA+IKGYTQ NI  GA+++YT+MQ+S VHP+CFTFLYVLKACGG
Subjt:  LSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGG

Query:  TLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT
          +E +GKQMHGQTFKYGFG+NVFVQNSLVSMYAKFGQTS AR+VFDKL DRTVVSWTSIISGYVQNGDP EAL VFK+MRQ NVK DWIALVSVMTAYT
Subjt:  TLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT

Query:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------
        D+EDLGQGKS+HGLVTK G+EFEPDIV+SLTTMYAKRG VEVARFFFNQMEKPNL+LWNAMISGYAKNGYGEEAIKLFR+MISKNIRVDSVT        
Subjt:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA
                            KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLE+RLKAAGY+P  ESVLHDLNHEEIEE+LCNHSERLAVA
Subjt:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA

Query:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        YGIISTAPGT LRIT NLRACVNCHSAIKLISKLVDREIIIRDAKRFH FKDG CSCGDFW
Subjt:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

A0A6J1JH82 pentatricopeptide repeat-containing protein At3g12770 isoform X23.0e-19857.38Show/hide
Query:  KALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFT
        KA   S    L RKHLDQLYVQLIVSGL+KCGFLVIKFVNACLH  DVNYAHK FREVLEPDILLWN IIKGYTQ NI AGAIRMY DMQ+S V+P+CFT
Subjt:  KALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFT

Query:  FLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWI
        FLYVLKACGG  VEG+GKQMH QTFKYGFG+NVFVQNSLVSMYA++GQTSSAR+VFDKLH+RTVVSWTSIISGYVQNGDP++AL+VFK+MRQ  VK DWI
Subjt:  FLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWI

Query:  ALVSVMTAYTDVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDS
         LVSVMTAYTD+EDLGQGK++H LVTK G+EFEPDIV+SLT MYAK G VE+ARFFFNQMEKPNL+LWNAMISGYAKNGYGEEAI+LFRKMISKNI VDS
Subjt:  ALVSVMTAYTDVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDS

Query:  VT--------------------------------------------------------------------------------------------------
        VT                                                                                                  
Subjt:  VT--------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESL
                                      KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLE+RLKAAGYV   ESVLHDLNHEEIEE+L
Subjt:  ------------------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESL

Query:  CNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        CNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRD KRFHHFKDG+CSCGDFW
Subjt:  CNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

SwissProt top hitse value%identityAlignment
O81767 Pentatricopeptide repeat-containing protein At4g339902.7e-7129.91Show/hide
Query:  IVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVEGLG-----K
        I  GL    F+  K ++    FG +    K F  +   D++ WN+IIK Y        AI ++ +M++SR+ P+C T + +      +++  LG     +
Subjt:  IVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVEGLG-----K

Query:  QMHGQTFKYG-FGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM-RQCNVKPDWIALVSVMTAYTDVEDLG
         + G T + G F  ++ + N++V MYAK G   SAR VF+ L +  V+SW +IISGY QNG   EA++++  M  +  +  +    VSV+ A +    L 
Subjt:  QMHGQTFKYG-FGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM-RQCNVKPDWIALVSVMTAYTDVEDLG

Query:  QGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------------
        QG  +HG + K+G+  +  +V SL  MY K G +E A   F Q+ + N + WN +I+ +  +G+GE+A+ LF++M+ + ++ D +T              
Subjt:  QGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIIST
                      KGL K  G SS+E++  +E F+ G+++HP  +E++ EL  L+ +LK  GYVPD+  VL D+  +E E  L +HSERLA+A+ +I+T
Subjt:  --------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIIST

Query:  APGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
           TT+RI KNLR C +CHS  K ISK+ +REII+RD+ RFHHFK+G+CSCGD+W
Subjt:  APGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

Q9LTV8 Pentatricopeptide repeat-containing protein At3g127704.1e-12036.46Show/hide
Query:  RKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTL
        +  L Q++ +L+V GL   GFL+ K ++A   FGD+ +A + F ++  P I  WNAII+GY++ N    A+ MY++MQ++RV P+ FTF ++LKAC G  
Subjt:  RKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTL

Query:  VEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT
           +G+ +H Q F+ GF  +VFVQN L+++YAK  +  SAR VF+   L +RT+VSWT+I+S Y QNG+P+EAL++F +MR+ +VKPDW+ALVSV+ A+T
Subjt:  VEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT

Query:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------
         ++DL QG+S+H  V K G+E EPD++ISL TMYAK G V  A+  F++M+ PNLILWNAMISGYAKNGY  EAI +F +MI+K++R D+++        
Subjt:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA
                            KGLNKD+G S +E+ G LE F VGD+SHPR +EI  +++ +E RLK  G+V + ++ LHDLN EE EE+LC+HSER+A+A
Subjt:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA

Query:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        YG+IST  GT LRITKNLRACVNCH+A KLISKLVDREI++RD  RFHHFKDG+CSCGD+W
Subjt:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g233309.0e-7533.26Show/hide
Query:  LPPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGD-----------VNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMY
        L P +   S++ P+  +++D      ++ G    G+++ K +++ ++ G            +  + + F  +   D + WN+++ GY Q      A+R++
Subjt:  LPPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGD-----------VNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMY

Query:  TDMQMSRVHPNCFTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKV
          M  ++V P    F  V+ AC       LGKQ+HG   + GFG+N+F+ ++LV MY+K G   +AR +FD+++    VSWT+II G+  +G   EA+ +
Subjt:  TDMQMSRVHPNCFTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKV

Query:  FKEMRQCNVKPDWIALVSVMTAYTDVEDLGQGKSVHGLVTK-SGVEFEPDIVISLTTMYAKRGLVEVARFFFNQM-EKPNLILWNAMISGYAKNGYGEEA
        F+EM++  VKP+ +A V+V+TA + V  + +       +TK  G+  E +   ++  +  + G +E A  F ++M  +P   +W+ ++S  + +   E A
Subjt:  FKEMRQCNVKPDWIALVSVMTAYTDVEDLGQGKSVHGLVTK-SGVEFEPDIVISLTTMYAKRGLVEVARFFFNQM-EKPNLILWNAMISGYAKNGYGEEA

Query:  IKLFRKMIS-------------------------KNIRVDSVTKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESV
         K+  K+ +                           +R+    KGL K    S IE+      F  GDRSHP   +I E L  + ++++  GYV D   V
Subjt:  IKLFRKMIS-------------------------KNIRVDSVTKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESV

Query:  LHDLNHEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        LHD++ E   E L  HSERLAVA+GII+T PGTT+R+TKN+R C +CH AIK ISK+ +REII+RD  RFHHF  G CSCGD+W
Subjt:  LHDLNHEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

Q9SUH6 Pentatricopeptide repeat-containing protein At4g307004.6e-7128.88Show/hide
Query:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVEGLG
        Q++     +G +   +++  F++     G +      FRE  +PDI+ +NA+I GYT       ++ ++ ++ +S       T + ++   G  +   L 
Subjt:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVEGLG

Query:  KQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYTDVEDLGQ
          +HG   K  F ++  V  +L ++Y+K  +  SAR +FD+  ++++ SW ++ISGY QNG   +A+ +F+EM++    P+ + +  +++A   +  L  
Subjt:  KQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYTDVEDLGQ

Query:  GKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT---------------
        GK VH LV  +  E    +  +L  MYAK G +  AR  F+ M K N + WN MISGY  +G G+EA+ +F +M++  I    VT               
Subjt:  GKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIISTA
                     + L K  G++ IEI      F  GD+SHP+ KEI+E+L++LE +++ AGY P+ E  LHD+  EE E  +  HSERLA+A+G+I+T 
Subjt:  -------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIISTA

Query:  PGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        PGT +RI KNLR C++CH+  KLISK+ +R I++RDA RFHHFKDG+CSCGD+W
Subjt:  PGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g276104.2e-7232.97Show/hide
Query:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVE-GL
        +++ Q++ +   +   +    ++A +  G V  A K F  + + DI+ W+A++ GY Q      AI+M+ ++    + PN FTF  +L  C  T    G 
Subjt:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVE-GL

Query:  GKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYTDVEDLG
        GKQ HG   K    +++ V ++L++MYAK G   SA  VF +  ++ +VSW S+ISGY Q+G  ++AL VFKEM++  VK D +  + V  A T    + 
Subjt:  GKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYTDVEDLG

Query:  QGKSVHGLVTKS-GVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKP-NLILWNAMISG----------------------------------YAKNGY
        +G+    ++ +   +    +    +  +Y++ G +E A      M  P    +W  +++                                   YA++G 
Subjt:  QGKSVHGLVTKS-GVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKP-NLILWNAMISG----------------------------------YAKNGY

Query:  GEEAIKLFRKMISKNIRVDSVTKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLA
         +E  K+ + M  +N++         K+ G+S IE+     +F  GDRSHP   +I+ +L+ L  RLK  GY PD   VL D++ E  E  L  HSERLA
Subjt:  GEEAIKLFRKMISKNIRVDSVTKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLA

Query:  VAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHF-KDGICSCGDFW
        +A+G+I+T  G+ L I KNLR C +CH  IKLI+K+ +REI++RD+ RFHHF  DG+CSCGDFW
Subjt:  VAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHF-KDGICSCGDFW

Arabidopsis top hitse value%identityAlignment
AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.0e-7332.97Show/hide
Query:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVE-GL
        +++ Q++ +   +   +    ++A +  G V  A K F  + + DI+ W+A++ GY Q      AI+M+ ++    + PN FTF  +L  C  T    G 
Subjt:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVE-GL

Query:  GKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYTDVEDLG
        GKQ HG   K    +++ V ++L++MYAK G   SA  VF +  ++ +VSW S+ISGY Q+G  ++AL VFKEM++  VK D +  + V  A T    + 
Subjt:  GKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYTDVEDLG

Query:  QGKSVHGLVTKS-GVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKP-NLILWNAMISG----------------------------------YAKNGY
        +G+    ++ +   +    +    +  +Y++ G +E A      M  P    +W  +++                                   YA++G 
Subjt:  QGKSVHGLVTKS-GVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKP-NLILWNAMISG----------------------------------YAKNGY

Query:  GEEAIKLFRKMISKNIRVDSVTKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLA
         +E  K+ + M  +N++         K+ G+S IE+     +F  GDRSHP   +I+ +L+ L  RLK  GY PD   VL D++ E  E  L  HSERLA
Subjt:  GEEAIKLFRKMISKNIRVDSVTKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLA

Query:  VAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHF-KDGICSCGDFW
        +A+G+I+T  G+ L I KNLR C +CH  IKLI+K+ +REI++RD+ RFHHF  DG+CSCGDFW
Subjt:  VAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHF-KDGICSCGDFW

AT3G12770.1 mitochondrial editing factor 222.9e-12136.46Show/hide
Query:  RKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTL
        +  L Q++ +L+V GL   GFL+ K ++A   FGD+ +A + F ++  P I  WNAII+GY++ N    A+ MY++MQ++RV P+ FTF ++LKAC G  
Subjt:  RKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTL

Query:  VEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT
           +G+ +H Q F+ GF  +VFVQN L+++YAK  +  SAR VF+   L +RT+VSWT+I+S Y QNG+P+EAL++F +MR+ +VKPDW+ALVSV+ A+T
Subjt:  VEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFD--KLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYT

Query:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------
         ++DL QG+S+H  V K G+E EPD++ISL TMYAK G V  A+  F++M+ PNLILWNAMISGYAKNGY  EAI +F +MI+K++R D+++        
Subjt:  DVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA
                            KGLNKD+G S +E+ G LE F VGD+SHPR +EI  +++ +E RLK  G+V + ++ LHDLN EE EE+LC+HSER+A+A
Subjt:  --------------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVA

Query:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        YG+IST  GT LRITKNLRACVNCH+A KLISKLVDREI++RD  RFHHFKDG+CSCGD+W
Subjt:  YGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.4e-7633.26Show/hide
Query:  LPPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGD-----------VNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMY
        L P +   S++ P+  +++D      ++ G    G+++ K +++ ++ G            +  + + F  +   D + WN+++ GY Q      A+R++
Subjt:  LPPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGD-----------VNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMY

Query:  TDMQMSRVHPNCFTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKV
          M  ++V P    F  V+ AC       LGKQ+HG   + GFG+N+F+ ++LV MY+K G   +AR +FD+++    VSWT+II G+  +G   EA+ +
Subjt:  TDMQMSRVHPNCFTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKV

Query:  FKEMRQCNVKPDWIALVSVMTAYTDVEDLGQGKSVHGLVTK-SGVEFEPDIVISLTTMYAKRGLVEVARFFFNQM-EKPNLILWNAMISGYAKNGYGEEA
        F+EM++  VKP+ +A V+V+TA + V  + +       +TK  G+  E +   ++  +  + G +E A  F ++M  +P   +W+ ++S  + +   E A
Subjt:  FKEMRQCNVKPDWIALVSVMTAYTDVEDLGQGKSVHGLVTK-SGVEFEPDIVISLTTMYAKRGLVEVARFFFNQM-EKPNLILWNAMISGYAKNGYGEEA

Query:  IKLFRKMIS-------------------------KNIRVDSVTKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESV
         K+  K+ +                           +R+    KGL K    S IE+      F  GDRSHP   +I E L  + ++++  GYV D   V
Subjt:  IKLFRKMIS-------------------------KNIRVDSVTKGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESV

Query:  LHDLNHEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        LHD++ E   E L  HSERLAVA+GII+T PGTT+R+TKN+R C +CH AIK ISK+ +REII+RD  RFHHF  G CSCGD+W
Subjt:  LHDLNHEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

AT4G30700.1 Pentatricopeptide repeat (PPR) superfamily protein3.3e-7228.88Show/hide
Query:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVEGLG
        Q++     +G +   +++  F++     G +      FRE  +PDI+ +NA+I GYT       ++ ++ ++ +S       T + ++   G  +   L 
Subjt:  QLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVEGLG

Query:  KQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYTDVEDLGQ
          +HG   K  F ++  V  +L ++Y+K  +  SAR +FD+  ++++ SW ++ISGY QNG   +A+ +F+EM++    P+ + +  +++A   +  L  
Subjt:  KQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVMTAYTDVEDLGQ

Query:  GKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT---------------
        GK VH LV  +  E    +  +L  MYAK G +  AR  F+ M K N + WN MISGY  +G G+EA+ +F +M++  I    VT               
Subjt:  GKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT---------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIISTA
                     + L K  G++ IEI      F  GD+SHP+ KEI+E+L++LE +++ AGY P+ E  LHD+  EE E  +  HSERLA+A+G+I+T 
Subjt:  -------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIISTA

Query:  PGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
        PGT +RI KNLR C++CH+  KLISK+ +R I++RDA RFHHFKDG+CSCGD+W
Subjt:  PGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW

AT4G33990.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.9e-7229.91Show/hide
Query:  IVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVEGLG-----K
        I  GL    F+  K ++    FG +    K F  +   D++ WN+IIK Y        AI ++ +M++SR+ P+C T + +      +++  LG     +
Subjt:  IVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHPNCFTFLYVLKACGGTLVEGLG-----K

Query:  QMHGQTFKYG-FGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM-RQCNVKPDWIALVSVMTAYTDVEDLG
         + G T + G F  ++ + N++V MYAK G   SAR VF+ L +  V+SW +IISGY QNG   EA++++  M  +  +  +    VSV+ A +    L 
Subjt:  QMHGQTFKYG-FGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEM-RQCNVKPDWIALVSVMTAYTDVEDLG

Query:  QGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------------
        QG  +HG + K+G+  +  +V SL  MY K G +E A   F Q+ + N + WN +I+ +  +G+GE+A+ LF++M+ + ++ D +T              
Subjt:  QGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVT--------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIIST
                      KGL K  G SS+E++  +E F+ G+++HP  +E++ EL  L+ +LK  GYVPD+  VL D+  +E E  L +HSERLA+A+ +I+T
Subjt:  --------------KGLNKDLGHSSIEINGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIIST

Query:  APGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW
           TT+RI KNLR C +CHS  K ISK+ +REII+RD+ RFHHFK+G+CSCGD+W
Subjt:  APGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDAKRFHHFKDGICSCGDFW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCTGCAGAAGGGTTGAGGAATCATCATGGTGGTGACCAACTTCGAAGAGGCAGAGCAAGTTTTTTTCGGATTGAAAAACGCGATGGTGATTTTCCTCTGTTCCA
ACGGTTATGGGGTCCAACTTCGCCGGCCGCCGCCACATCACCGCCGCCTCCCAAGCTGCTGCAATCGCACAGCTTCATTCGAGTTTTCTTCCCCGGAGAACCTCCTCACC
GGAGAGACAGGGATTTGACAGAGCTGGCTTCAGCAGCTCCTCGCACCTCCACCCATGTCTTTGCATTCGTTTTCGCTTTCTCTCTCCTTGTCGTCACTCACATCAGCTCT
CTCAAAAGCTTTGCTCCTCTACACCTGGAACCTCTGCCGCCCAAGGCACTCGAAGGTTCAAACTTGGAGCCTCTAAGCCGGAAGCATTTGGATCAATTATACGTCCAGTT
AATTGTGTCTGGACTACACAAGTGTGGTTTCTTGGTGATCAAATTTGTCAATGCATGCTTGCATTTCGGAGATGTTAACTACGCACACAAGGCTTTTCGTGAAGTTTTAG
AACCGGATATATTGTTGTGGAATGCCATCATAAAGGGTTACACTCAGAAGAATATTGTTGCTGGTGCTATCAGAATGTATACGGATATGCAAATGTCACGGGTGCACCCT
AATTGCTTCACATTTTTGTATGTGCTTAAAGCATGCGGTGGAACATTAGTTGAAGGATTAGGTAAACAGATGCATGGCCAGACATTTAAATATGGCTTTGGAACAAATGT
TTTTGTGCAGAATAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAACCTCATCTGCTAGGATCGTGTTTGATAAGCTGCATGATAGAACTGTTGTTTCATGGACTTCCA
TCATTTCTGGGTATGTTCAGAATGGTGATCCCGTGGAAGCATTGAAAGTTTTCAAAGAAATGAGGCAATGTAATGTAAAGCCTGATTGGATTGCCCTTGTTAGCGTCATG
ACAGCGTATACGGACGTGGAAGATTTGGGACAAGGAAAGTCCGTTCATGGTTTAGTGACTAAATCGGGTGTAGAATTTGAACCCGACATAGTGATATCACTCACTACTAT
GTATGCAAAACGTGGATTGGTGGAAGTTGCCAGATTTTTCTTTAATCAGATGGAAAAACCAAATTTAATTTTGTGGAATGCTATGATTTCTGGCTATGCAAAAAATGGAT
ATGGTGAAGAAGCAATCAAGCTATTCCGCAAGATGATTTCCAAAAATATCAGGGTTGATTCTGTTACTAAAGGACTGAATAAGGACCTCGGACATAGTTCTATCGAGATC
AATGGAAATCTCGAAACGTTTCATGTTGGAGATAGATCACATCCCAGATCAAAGGAAATTTTTGAAGAGCTTGATAGATTAGAGAAAAGATTAAAAGCAGCTGGTTATGT
TCCTGATAATGAATCTGTTCTACATGACTTGAATCATGAGGAAATTGAGGAAAGTCTTTGTAATCACAGTGAGAGGCTAGCAGTTGCTTATGGTATCATCAGTACTGCTC
CTGGAACTACACTTAGAATAACCAAGAATCTCCGAGCTTGCGTTAACTGTCATTCAGCGATAAAGCTTATATCGAAGCTTGTCGATAGGGAAATAATTATTCGAGACGCG
AAACGTTTTCACCATTTCAAAGATGGAATTTGTTCATGTGGAGACTTTTGGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAGCTGCAGAAGGGTTGAGGAATCATCATGGTGGTGACCAACTTCGAAGAGGCAGAGCAAGTTTTTTTCGGATTGAAAAACGCGATGGTGATTTTCCTCTGTTCCA
ACGGTTATGGGGTCCAACTTCGCCGGCCGCCGCCACATCACCGCCGCCTCCCAAGCTGCTGCAATCGCACAGCTTCATTCGAGTTTTCTTCCCCGGAGAACCTCCTCACC
GGAGAGACAGGGATTTGACAGAGCTGGCTTCAGCAGCTCCTCGCACCTCCACCCATGTCTTTGCATTCGTTTTCGCTTTCTCTCTCCTTGTCGTCACTCACATCAGCTCT
CTCAAAAGCTTTGCTCCTCTACACCTGGAACCTCTGCCGCCCAAGGCACTCGAAGGTTCAAACTTGGAGCCTCTAAGCCGGAAGCATTTGGATCAATTATACGTCCAGTT
AATTGTGTCTGGACTACACAAGTGTGGTTTCTTGGTGATCAAATTTGTCAATGCATGCTTGCATTTCGGAGATGTTAACTACGCACACAAGGCTTTTCGTGAAGTTTTAG
AACCGGATATATTGTTGTGGAATGCCATCATAAAGGGTTACACTCAGAAGAATATTGTTGCTGGTGCTATCAGAATGTATACGGATATGCAAATGTCACGGGTGCACCCT
AATTGCTTCACATTTTTGTATGTGCTTAAAGCATGCGGTGGAACATTAGTTGAAGGATTAGGTAAACAGATGCATGGCCAGACATTTAAATATGGCTTTGGAACAAATGT
TTTTGTGCAGAATAGTCTTGTGTCAATGTATGCTAAATTTGGTCAAACCTCATCTGCTAGGATCGTGTTTGATAAGCTGCATGATAGAACTGTTGTTTCATGGACTTCCA
TCATTTCTGGGTATGTTCAGAATGGTGATCCCGTGGAAGCATTGAAAGTTTTCAAAGAAATGAGGCAATGTAATGTAAAGCCTGATTGGATTGCCCTTGTTAGCGTCATG
ACAGCGTATACGGACGTGGAAGATTTGGGACAAGGAAAGTCCGTTCATGGTTTAGTGACTAAATCGGGTGTAGAATTTGAACCCGACATAGTGATATCACTCACTACTAT
GTATGCAAAACGTGGATTGGTGGAAGTTGCCAGATTTTTCTTTAATCAGATGGAAAAACCAAATTTAATTTTGTGGAATGCTATGATTTCTGGCTATGCAAAAAATGGAT
ATGGTGAAGAAGCAATCAAGCTATTCCGCAAGATGATTTCCAAAAATATCAGGGTTGATTCTGTTACTAAAGGACTGAATAAGGACCTCGGACATAGTTCTATCGAGATC
AATGGAAATCTCGAAACGTTTCATGTTGGAGATAGATCACATCCCAGATCAAAGGAAATTTTTGAAGAGCTTGATAGATTAGAGAAAAGATTAAAAGCAGCTGGTTATGT
TCCTGATAATGAATCTGTTCTACATGACTTGAATCATGAGGAAATTGAGGAAAGTCTTTGTAATCACAGTGAGAGGCTAGCAGTTGCTTATGGTATCATCAGTACTGCTC
CTGGAACTACACTTAGAATAACCAAGAATCTCCGAGCTTGCGTTAACTGTCATTCAGCGATAAAGCTTATATCGAAGCTTGTCGATAGGGAAATAATTATTCGAGACGCG
AAACGTTTTCACCATTTCAAAGATGGAATTTGTTCATGTGGAGACTTTTGGTGA
Protein sequenceShow/hide protein sequence
MAAAEGLRNHHGGDQLRRGRASFFRIEKRDGDFPLFQRLWGPTSPAAATSPPPPKLLQSHSFIRVFFPGEPPHRRDRDLTELASAAPRTSTHVFAFVFAFSLLVVTHISS
LKSFAPLHLEPLPPKALEGSNLEPLSRKHLDQLYVQLIVSGLHKCGFLVIKFVNACLHFGDVNYAHKAFREVLEPDILLWNAIIKGYTQKNIVAGAIRMYTDMQMSRVHP
NCFTFLYVLKACGGTLVEGLGKQMHGQTFKYGFGTNVFVQNSLVSMYAKFGQTSSARIVFDKLHDRTVVSWTSIISGYVQNGDPVEALKVFKEMRQCNVKPDWIALVSVM
TAYTDVEDLGQGKSVHGLVTKSGVEFEPDIVISLTTMYAKRGLVEVARFFFNQMEKPNLILWNAMISGYAKNGYGEEAIKLFRKMISKNIRVDSVTKGLNKDLGHSSIEI
NGNLETFHVGDRSHPRSKEIFEELDRLEKRLKAAGYVPDNESVLHDLNHEEIEESLCNHSERLAVAYGIISTAPGTTLRITKNLRACVNCHSAIKLISKLVDREIIIRDA
KRFHHFKDGICSCGDFW