CuGenDBv2

Sgr015439 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr015439
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: pentatricopeptide repeat-containing protein At4g18975, chloroplastic
Genome location: tig00003651:365664..386833
RNA-Seq Expression: Sgr015439
Synteny: Sgr015439
Gene Ontology terms: NA
InterPro domains: NA
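
The genome location above follows a `contig:start..end` pattern. As a minimal sketch, it can be split into its parts as shown below. The names `Location` and `parse_location` are hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location string."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Split a location such as 'tig00003651:365664..386833' into its fields."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return Location(contig, int(start), int(end))


loc = parse_location("tig00003651:365664..386833")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene region on tig00003651 spans 21170 positions.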


Homology
GenBank top hits (e value, % identity, alignment)
KAG6571702.1 Mediator of RNA polymerase II transcription subunit 15a, partial [Cucurbita argyrosperma subsp. sororia] (e value: 2.2e-20; % identity: 60.36)
Query:  EMGSIAMKNTRRLISLEENAQLILPFALKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKST
        ++ S+  +  R +IS+    ++ L   +KLFKDLEAFGRKPPEKSIVQR+ADA EMLGL+EEKERVL KY  LFTDE+KGSIKKY        K K KST
Subjt:  EMGSIAMKNTRRLISLEENAQLILPFALKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKST

Query:  KGNRDSGHLMK
        KGN+D+ +LMK
Subjt:  KGNRDSGHLMK

XP_022154414.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Momordica charantia] (e value: 3.2e-96; % identity: 41.7)
Query:  GQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWE
        GQ+M+LGVSRLQVGNSCYCTMVQAQMC+QLAD+DMKNKDVNNSKALCQ SEQ+   MRKHQIGENV RKDKINFLV TLV+L+ SKEAVYGALDAWVAWE
Subjt:  GQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWE

Query:  QNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSFH
        QNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTMRVY QLIRALDMDHRAEE+HKFWVMKIG DLHSVPWQLCRSM+SIYYRN MLD+LV    
Subjt:  QNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSFH

Query:  RRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGSSRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSGM
                                                                                                            
Subjt:  RRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGSSRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSGM

Query:  ELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDEDEDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGASK
                                                                                                            
Subjt:  ELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDEDEDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGASK

Query:  HENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFEPLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDLE
                                                                                                     KLFKDLE
Subjt:  HENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFEPLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDLE

Query:  AFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKGNRDSGHLMKAQ
        AFGRKPPEKSIVQR+ADAYEMLGL EEKERVLEKYKDLFTDERKG I+KY ++SFEKSK++ K TK ++D+G L K Q
Subjt:  AFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKGNRDSGHLMKAQ

XP_022154416.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X3 [Momordica charantia] (e value: 3.2e-96; % identity: 41.7)
Query:  GQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWE
        GQ+M+LGVSRLQVGNSCYCTMVQAQMC+QLAD+DMKNKDVNNSKALCQ SEQ+   MRKHQIGENV RKDKINFLV TLV+L+ SKEAVYGALDAWVAWE
Subjt:  GQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWE

Query:  QNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSFH
        QNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTMRVY QLIRALDMDHRAEE+HKFWVMKIG DLHSVPWQLCRSM+SIYYRN MLD+LV    
Subjt:  QNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSFH

Query:  RRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGSSRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSGM
                                                                                                            
Subjt:  RRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGSSRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSGM

Query:  ELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDEDEDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGASK
                                                                                                            
Subjt:  ELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDEDEDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGASK

Query:  HENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFEPLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDLE
                                                                                                     KLFKDLE
Subjt:  HENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFEPLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDLE

Query:  AFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKGNRDSGHLMKAQ
        AFGRKPPEKSIVQR+ADAYEMLGL EEKERVLEKYKDLFTDERKG I+KY ++SFEKSK++ K TK ++D+G L K Q
Subjt:  AFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKGNRDSGHLMKAQ

XP_022967610.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita maxima] (e value: 5.9e-90; % identity: 83)
Query:  VGQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAW
        VGQVMELGV++LQ+GNSCYCTM+Q QM K+ ADKDM +KDVNNSK L QTSE+++  +RKHQIGENV RKDKINFLVNTL++L+DSKEAVYGALDAWVAW
Subjt:  VGQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAW

Query:  EQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSF
        EQ+FPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTTM VY QLIRALDMDHRAEEAHKFWVMKIG+DLHSVPWQLCRSM+SIYYRN ML+ LVK F
Subjt:  EQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSF

XP_022967610.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita maxima] (e value: 4.9e-20; % identity: 60.36)
Query:  EMGSIAMKNTRRLISLEENAQLILPFALKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKST
        ++ S+  +  R +IS+    ++ L   +KLFKDLEAFGRKPPEKSIVQR+ADA EMLGL+EEKERVL KY  LFTDE+KGSIKKY        K K KST
Subjt:  EMGSIAMKNTRRLISLEENAQLILPFALKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKST

Query:  KGNRDSGHLMK
        KGN+D+  LMK
Subjt:  KGNRDSGHLMK

XP_022967610.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Cucurbita maxima] (e value: 1.7e-89; % identity: 82)
Query:  VGQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAW
        VGQ+MELGV++LQ+GNSCYCTM+Q QM K+ ADKDM +KDVNNSK L QTSE+++  +RKHQIGENV RKDKI+FLVNTL++L+DSKEAVYGALDAWVAW
Subjt:  VGQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAW

Query:  EQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSF
        EQ+FPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTTM VY QLIRALDMDHRAEEAHKFWVMKIG+DLHSVPWQLCRSM+SIYYRN ML+ LVK F
Subjt:  EQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSF

XP_038887984.1 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 [Benincasa hispida] (e value: 7.0e-91; % identity: 40.41)
Query:  VGQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAW
        VGQ+MELGVSRLQVG+ CYCTM+Q QM KQLA KD+KNKD NNSKAL QTSEQ++  +RKHQIG+NVPRKDKINFLVNTL++L+DSKEAVYGALDAWVAW
Subjt:  VGQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAW

Query:  EQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSF
        EQ+FPI SLK  L  LEKEQQWHRVVQVIKWMLSKGQGTTM VY QLIRALDMDHRAEEAHKFWVMKIG+DLHSVPWQLCRSM++IYYRN ML+ LV   
Subjt:  EQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSF

Query:  HRRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGSSRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSG
                                                                                                            
Subjt:  HRRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGSSRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSG

Query:  MELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDEDEDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGAS
                                                                                                            
Subjt:  MELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDEDEDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGAS

Query:  KHENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFEPLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDL
                                                                                                      KLFKDL
Subjt:  KHENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFEPLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDL

Query:  EAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKGNRDSGHLMKAQ
        EAFGRKPPEKSIVQR+ADA E+LGLLEEKERVL KYK LFTDE++GSIKKYKRVSFEKSK K KSTK   D+ +LMKAQ
Subjt:  EAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKGNRDSGHLMKAQ

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DM10 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 (e value: 1.6e-96; % identity: 41.7)
Query:  GQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWE
        GQ+M+LGVSRLQVGNSCYCTMVQAQMC+QLAD+DMKNKDVNNSKALCQ SEQ+   MRKHQIGENV RKDKINFLV TLV+L+ SKEAVYGALDAWVAWE
Subjt:  GQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWE

Query:  QNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSFH
        QNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTMRVY QLIRALDMDHRAEE+HKFWVMKIG DLHSVPWQLCRSM+SIYYRN MLD+LV    
Subjt:  QNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSFH

Query:  RRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGSSRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSGM
                                                                                                            
Subjt:  RRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGSSRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSGM

Query:  ELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDEDEDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGASK
                                                                                                            
Subjt:  ELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDEDEDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGASK

Query:  HENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFEPLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDLE
                                                                                                     KLFKDLE
Subjt:  HENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFEPLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDLE

Query:  AFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKGNRDSGHLMKAQ
        AFGRKPPEKSIVQR+ADAYEMLGL EEKERVLEKYKDLFTDERKG I+KY ++SFEKSK++ K TK ++D+G L K Q
Subjt:  AFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKGNRDSGHLMKAQ

A0A6J1DNN5 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X3 (e value: 1.6e-96; % identity: 41.7)
Query:  GQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWE
        GQ+M+LGVSRLQVGNSCYCTMVQAQMC+QLAD+DMKNKDVNNSKALCQ SEQ+   MRKHQIGENV RKDKINFLV TLV+L+ SKEAVYGALDAWVAWE
Subjt:  GQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWE

Query:  QNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSFH
        QNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTMRVY QLIRALDMDHRAEE+HKFWVMKIG DLHSVPWQLCRSM+SIYYRN MLD+LV    
Subjt:  QNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSFH

Query:  RRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGSSRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSGM
                                                                                                            
Subjt:  RRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGSSRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSGM

Query:  ELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDEDEDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGASK
                                                                                                            
Subjt:  ELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDEDEDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGASK

Query:  HENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFEPLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDLE
                                                                                                     KLFKDLE
Subjt:  HENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFEPLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDLE

Query:  AFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKGNRDSGHLMKAQ
        AFGRKPPEKSIVQR+ADAYEMLGL EEKERVLEKYKDLFTDERKG I+KY ++SFEKSK++ K TK ++D+G L K Q
Subjt:  AFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKGNRDSGHLMKAQ

A0A6J1HGC4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 (e value: 9.1e-20; % identity: 59.46)
Query:  EMGSIAMKNTRRLISLEENAQLILPFALKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKST
        ++ S+  +  R +IS+    ++ L   +KLFK+LEAFGRKPPEKSIVQR+ADA EMLGL+EEKERVL KY  LFTDE+KGSIKKY        K K KST
Subjt:  EMGSIAMKNTRRLISLEENAQLILPFALKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKST

Query:  KGNRDSGHLMK
        KGN+D+  LMK
Subjt:  KGNRDSGHLMK

A0A6J1HGC4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 (e value: 4.6e-88; % identity: 39.97)
Query:  GQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWE
        GQ+M+LGVSRLQVGNSCYCTMVQAQMC+QLAD+DMKNK           SEQ+   MRKHQIGENV RKDKINFLV TLV+L+ SKEAVYGALDAWVAWE
Subjt:  GQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWE

Query:  QNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSFH
        QNFPIASLKQ LA LEKEQQWHRVVQVIKWMLSKGQGTTMRVY QLIRALDMDHRAEE+HKFWVMKIG DLHSVPWQLCRSM+SIYYRN MLD+LV    
Subjt:  QNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSFH

Query:  RRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGSSRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSGM
                                                                                                            
Subjt:  RRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGSSRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSGM

Query:  ELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDEDEDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGASK
                                                                                                            
Subjt:  ELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDEDEDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGASK

Query:  HENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFEPLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDLE
                                                                                                     KLFKDLE
Subjt:  HENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFEPLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDLE

Query:  AFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKGNRDSGHLMKAQ
        AFGRKPPEKSIVQR+ADAYEMLGL EEKERVLEKYKDLFTDERKG I+KY ++SFEKSK++ K TK ++D+G L K Q
Subjt:  AFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKGNRDSGHLMKAQ

A0A6J1HUZ4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 (e value: 2.9e-90; % identity: 83)
Query:  VGQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAW
        VGQVMELGV++LQ+GNSCYCTM+Q QM K+ ADKDM +KDVNNSK L QTSE+++  +RKHQIGENV RKDKINFLVNTL++L+DSKEAVYGALDAWVAW
Subjt:  VGQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAW

Query:  EQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSF
        EQ+FPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTTM VY QLIRALDMDHRAEEAHKFWVMKIG+DLHSVPWQLCRSM+SIYYRN ML+ LVK F
Subjt:  EQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSF

A0A6J1HUZ4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 (e value: 2.4e-20; % identity: 60.36)
Query:  EMGSIAMKNTRRLISLEENAQLILPFALKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKST
        ++ S+  +  R +IS+    ++ L   +KLFKDLEAFGRKPPEKSIVQR+ADA EMLGL+EEKERVL KY  LFTDE+KGSIKKY        K K KST
Subjt:  EMGSIAMKNTRRLISLEENAQLILPFALKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKST

Query:  KGNRDSGHLMK
        KGN+D+  LMK
Subjt:  KGNRDSGHLMK

A0A6J1HUZ4 pentatricopeptide repeat-containing protein At4g18975, chloroplastic isoform X1 (e value: 2.4e-89; % identity: 81.5)
Query:  VGQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAW
        VGQ+MELGV++LQ+GNSCYCTM+Q QM K+  DKDM +KDVNNSK L QTSE+++  +RKHQIGENV RKDKI+FLVNTL++L+DSKEAVYGALDAWVAW
Subjt:  VGQVMELGVSRLQVGNSCYCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAW

Query:  EQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSF
        EQ+FPIASLK ALA LEKE QWHRVVQVIKWMLSKGQGTTM VY QLIRALDMDHRAEEAHKFWVMKIG+DLHSVPWQLCRSM+SIYYRN ML+ LVK F
Subjt:  EQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSF

SwissProt top hits (e value, % identity, alignment)
Q2V3H0 Pentatricopeptide repeat-containing protein At4g18975, chloroplastic (e value: 5.1e-28; % identity: 48.8)
Query:  LVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSV
        LV  L  L + KEAVYGAL+ WVAWE  FPI +  +AL  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + T   S+
Subjt:  LVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSV

Query:  PWQLCRSMVSIYYRNNMLDSLVKSF
        P +L   M+++Y  +++ D +++ F
Subjt:  PWQLCRSMVSIYYRNNMLDSLVKSF

Q8LG95 Pentatricopeptide repeat-containing protein At4g21190 (e value: 2.6e-24; % identity: 42.4)
Query:  LVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSV
        ++  +  L + KE VYGALD+++AWE  FP+  +K+AL  LE E++W +++QV KWMLSKGQG TM  Y+ L+ AL  D+R +EA + W       L   
Subjt:  LVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSV

Query:  PWQLCRSMVSIYYRNNMLDSLVKSF
        P +    M+SIYY+ +M   L + F
Subjt:  PWQLCRSMVSIYYRNNMLDSLVKSF

Arabidopsis top hits (e value, % identity, alignment)
AT1G04590.1 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G21190.1) (e value: 2.3e-60; % identity: 67.27)
Query:  MKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSK
        +KN+D  +      + + + E  RKHQIGEN+P+KDKI FLVNTL++++D+KEAVYGALDAWVAWE+NFPIASLK  +A+LEKE QWHR+VQVIKW+LSK
Subjt:  MKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSK

Query:  GQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSF
        GQG TM  Y QLIRALDMD RAEEAH  W  K+G DLHSVPWQLC  M+ IY+RNNML  LVK F
Subjt:  GQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSF

AT1G04590.1 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G21190.1) (e value: 3.4e-11; % identity: 67.39)
Query:  LKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDL
        +KLFKDLE++ RKPP+K IVQ +ADAYE+LG+L+EKERV+ KY  L
Subjt:  LKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDL

AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4) (e value: 6.8e-60; % identity: 67.48)
Query:  MKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSK
        +KN+D  +      + + + E  RKHQIGEN+P+KDKI FLVNTL++++D+KEAVYGALDAWVAWE+NFPIASLK  +A+LEKE QWHR+VQVIKW+LSK
Subjt:  MKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSK

Query:  GQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVK
        GQG TM  Y QLIRALDMD RAEEAH  W  K+G DLHSVPWQLC  M+ IY+RNNML  LVK
Subjt:  GQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVK

AT1G04590.2 BEST Arabidopsis thaliana protein match is: Pentatricopeptide repeat (PPR) superfamily protein (TAIR:AT4G18975.4) (e value: 2.6e-11; % identity: 67.39)
Query:  LKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDL
        +KLFKDLE++ RKPP+K IVQ +ADAYE+LG+L+EKERV+ KY  L
Subjt:  LKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDL

AT4G18975.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.6e-29; % identity: 48.8)
Query:  LVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSV
        LV  L  L + KEAVYGAL+ WVAWE  FPI +  +AL  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + T   S+
Subjt:  LVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSV

Query:  PWQLCRSMVSIYYRNNMLDSLVKSF
        P +L   M+++Y  +++ D +++ F
Subjt:  PWQLCRSMVSIYYRNNMLDSLVKSF

AT4G18975.2 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.6e-29; % identity: 48.8)
Query:  LVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSV
        LV  L  L + KEAVYGAL+ WVAWE  FPI +  +AL  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + T   S+
Subjt:  LVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSV

Query:  PWQLCRSMVSIYYRNNMLDSLVKSF
        P +L   M+++Y  +++ D +++ F
Subjt:  PWQLCRSMVSIYYRNNMLDSLVKSF

AT4G18975.3 Pentatricopeptide repeat (PPR) superfamily protein (e value: 3.6e-29; % identity: 48.8)
Query:  LVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSV
        LV  L  L + KEAVYGAL+ WVAWE  FPI +  +AL  L K  QWHRV+Q+ KWMLSKGQG TM  Y  L+ A DMD RA+EA   W M + T   S+
Subjt:  LVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQVIKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSV

Query:  PWQLCRSMVSIYYRNNMLDSLVKSF
        P +L   M+++Y  +++ D +++ F
Subjt:  PWQLCRSMVSIYYRNNMLDSLVKSF
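
The unlabeled middle row of each alignment block appears to follow the usual BLAST-style midline convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. Assuming that convention holds here, the sketch below counts identities and positives for a single alignment row. The function name `midline_stats` is hypothetical, and the example uses only the short final row of the first GenBank hit, so its ratio is not the whole-hit % identity reported in the table.

```python
def midline_stats(midline: str, aligned_length: int) -> tuple[int, int]:
    """Return (identities, positives) for one BLAST-style midline row.

    Assumes letters mark identical residues and '+' marks conservative
    substitutions; trailing spaces lost in extraction are padded back.
    """
    padded = midline.ljust(aligned_length)
    identities = sum(ch.isalpha() for ch in padded)
    positives = identities + padded.count("+")
    return identities, positives


query_row = "KGNRDSGHLMK"    # final Query row of the first GenBank hit
midline_row = "KGN+D+ +LMK"  # midline printed beneath it

identities, positives = midline_stats(midline_row, len(query_row))
print(identities, positives, len(query_row))  # 7 10 11
```

Summing these counts over every row of a hit, and dividing by the total aligned length, would give a per-hit identity figure comparable to the reported one, provided the midline whitespace survived extraction intact.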


Sequences
CDS sequence
ATGACCGGAATTCATGGAGAGTCGCCGCCGGTAAACTTTTTCGGGCAAAGGACAGATGTCTTTAATCTCCCAGACCCCAAGGCAGGTTTTCTTAGACCGTTTCCCCGTGC
TAGAGGGCCCCACTGTCCTTCACCAGCTCTCTGTAATCCCTTACCTGACCTATACAGTACACCTTTTGACTTATCCAGCCCCCATTGTTTAGATGCCATTGATGAAGCCT
CTGATTTCACTCAACCTCACAGTGACCCACCACCATCCAAAAAGGACGGCCTCACAATTTTTGCTGATAAATTGGGAGAAAAGGTGTACACACCGAAAAAAAACCCACTT
GTTGATTTCAAACCTACCAATGACTCTCCTACCTTTTGCTCCAAACCCAACCGCCCCACCTCCACCCTACCTATCCTCTTACTCGACCCGATTAAGCTCCTTGCCAGCCC
ACACAAATTAGCCGTTAACATCTCAAAAAGGAAAATTTTCCTCGTTGCTGGTACCAAACACTCTACCAATCCCCCTCTTCTTGACCCAGAGAGGCCTTTACCCAAGGGCA
GTCTTCAGAAAGAACCTGTGGGCAGCCTCAAGACTTTGCGCAAATGGATGAACATTGTAGGACAAGTAATGGAGCTTGGAGTCAGCAGGCTGCAAGTTGGGAACTCTTGT
TACTGTACAATGGTACAAGCTCAAATGTGTAAACAGCTTGCTGATAAAGATATGAAAAATAAGGATGTTAACAATAGCAAAGCTTTGTGCCAGACTTCAGAGCAAGATGT
TGAAGGCATGAGAAAGCACCAAATTGGGGAAAATGTTCCACGGAAGGACAAAATTAACTTCCTTGTGAACACGCTTGTCAATTTGAAAGATAGTAAGGAGGCTGTTTATG
GCGCTCTTGATGCCTGGGTTGCATGGGAGCAAAACTTTCCAATAGCATCCCTTAAGCAGGCATTGGCTGCTCTTGAGAAGGAACAGCAGTGGCATAGAGTTGTTCAGGTA
ATCAAATGGATGCTAAGCAAGGGGCAGGGAACCACAATGAGAGTCTATTGGCAGTTAATACGGGCTTTAGACATGGACCATCGAGCAGAAGAAGCACACAAGTTTTGGGT
CATGAAAATTGGTACGGATCTACATTCAGTCCCTTGGCAGTTGTGCAGAAGCATGGTATCAATATACTATCGTAATAACATGCTAGACAGTCTTGTAAAGAGTTTCCATA
GGAGGGCCAAGTTGATTGCCCTCTGGTTGGGAAAGTCTGGAGGTGGATCTTCTTCCCTCGGATTGGCCAAGTTTAGGTGGAGAGAAGAGGGGCAAAGATCAATGGGTTCT
TCCAGGTTGCAAGGGTGGTTGGAAAGGGTCATGGTAGAAGAAGGTGGGGTCGAAGACAGTGGGTATGGGCTGGAGATGTTGTCTTTTGTATCAGGGTACAAGGGGTTGTT
GGTAGTAGAAAACTTGGTACCCGCGGATAAGGAAGGTAAGGAGCTAGGTCTAGTAAGGGGCCGGGTTGGATCGGGTATGGAGTTGTTTATATGGGTCGAACTGTTAGCAT
GGTCTCAATGGCTGACTTTTGTGGGTACTATCCCAGTAGTCCATCAGGTGGTGCAGTTGGAAGTAGTGAAAAATGGGTCGACTTGTACTGGTATTGGAGTTGATGAGGAT
GAAGATGGAAGCCAGATTTCCTCAAGGATGAAGCCAATGCTGTTAGCTCTGACATTTATGGATGCTTCCATTAAATTCATACGAGAAAGGGGCCCTCATGATTCTGTACC
ATTCGTCGTGGAAGCATTTACGGATGACAATAATGGAGGAGCATCGAAGCATGAGAATAATAAGCTAGCTTTAATGGAGGTAGGTAGAGCATGTGAGTCATATAAAGGAA
GGGAGGGTGGGGACGATTGGAAGCTCATTAAAAGGCTGCGAAAGGCTTTCCATCCTTTTCTATTGTTTTCGGCTGTAAAAGACTCCATTTTGAAGCAAATTGAGTTCGAA
CCTCTTTCTGTGACTAATGACGATGAGATGGGTTCTATTGCTATGAAGAATACTCGAAGATTGATCTCGTTGGAAGAAAATGCTCAGCTAATCTTGCCTTTTGCACTCAA
GCTTTTTAAGGATCTTGAAGCTTTTGGACGTAAACCTCCAGAAAAATCAATAGTGCAGAGGATAGCAGATGCCTATGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGG
TGTTAGAGAAGTACAAAGACCTTTTTACAGATGAAAGGAAAGGGTCCATCAAGAAATATAAGAGGGTTTCGTTTGAGAAATCGAAGAAAAAAGGAAAATCGACAAAGGGC
AATAGAGACAGTGGCCATCTTATGAAGGCTCAATGA
mRNA sequence
ATGACCGGAATTCATGGAGAGTCGCCGCCGGTAAACTTTTTCGGGCAAAGGACAGATGTCTTTAATCTCCCAGACCCCAAGGCAGGTTTTCTTAGACCGTTTCCCCGTGC
TAGAGGGCCCCACTGTCCTTCACCAGCTCTCTGTAATCCCTTACCTGACCTATACAGTACACCTTTTGACTTATCCAGCCCCCATTGTTTAGATGCCATTGATGAAGCCT
CTGATTTCACTCAACCTCACAGTGACCCACCACCATCCAAAAAGGACGGCCTCACAATTTTTGCTGATAAATTGGGAGAAAAGGTGTACACACCGAAAAAAAACCCACTT
GTTGATTTCAAACCTACCAATGACTCTCCTACCTTTTGCTCCAAACCCAACCGCCCCACCTCCACCCTACCTATCCTCTTACTCGACCCGATTAAGCTCCTTGCCAGCCC
ACACAAATTAGCCGTTAACATCTCAAAAAGGAAAATTTTCCTCGTTGCTGGTACCAAACACTCTACCAATCCCCCTCTTCTTGACCCAGAGAGGCCTTTACCCAAGGGCA
GTCTTCAGAAAGAACCTGTGGGCAGCCTCAAGACTTTGCGCAAATGGATGAACATTGTAGGACAAGTAATGGAGCTTGGAGTCAGCAGGCTGCAAGTTGGGAACTCTTGT
TACTGTACAATGGTACAAGCTCAAATGTGTAAACAGCTTGCTGATAAAGATATGAAAAATAAGGATGTTAACAATAGCAAAGCTTTGTGCCAGACTTCAGAGCAAGATGT
TGAAGGCATGAGAAAGCACCAAATTGGGGAAAATGTTCCACGGAAGGACAAAATTAACTTCCTTGTGAACACGCTTGTCAATTTGAAAGATAGTAAGGAGGCTGTTTATG
GCGCTCTTGATGCCTGGGTTGCATGGGAGCAAAACTTTCCAATAGCATCCCTTAAGCAGGCATTGGCTGCTCTTGAGAAGGAACAGCAGTGGCATAGAGTTGTTCAGGTA
ATCAAATGGATGCTAAGCAAGGGGCAGGGAACCACAATGAGAGTCTATTGGCAGTTAATACGGGCTTTAGACATGGACCATCGAGCAGAAGAAGCACACAAGTTTTGGGT
CATGAAAATTGGTACGGATCTACATTCAGTCCCTTGGCAGTTGTGCAGAAGCATGGTATCAATATACTATCGTAATAACATGCTAGACAGTCTTGTAAAGAGTTTCCATA
GGAGGGCCAAGTTGATTGCCCTCTGGTTGGGAAAGTCTGGAGGTGGATCTTCTTCCCTCGGATTGGCCAAGTTTAGGTGGAGAGAAGAGGGGCAAAGATCAATGGGTTCT
TCCAGGTTGCAAGGGTGGTTGGAAAGGGTCATGGTAGAAGAAGGTGGGGTCGAAGACAGTGGGTATGGGCTGGAGATGTTGTCTTTTGTATCAGGGTACAAGGGGTTGTT
GGTAGTAGAAAACTTGGTACCCGCGGATAAGGAAGGTAAGGAGCTAGGTCTAGTAAGGGGCCGGGTTGGATCGGGTATGGAGTTGTTTATATGGGTCGAACTGTTAGCAT
GGTCTCAATGGCTGACTTTTGTGGGTACTATCCCAGTAGTCCATCAGGTGGTGCAGTTGGAAGTAGTGAAAAATGGGTCGACTTGTACTGGTATTGGAGTTGATGAGGAT
GAAGATGGAAGCCAGATTTCCTCAAGGATGAAGCCAATGCTGTTAGCTCTGACATTTATGGATGCTTCCATTAAATTCATACGAGAAAGGGGCCCTCATGATTCTGTACC
ATTCGTCGTGGAAGCATTTACGGATGACAATAATGGAGGAGCATCGAAGCATGAGAATAATAAGCTAGCTTTAATGGAGGTAGGTAGAGCATGTGAGTCATATAAAGGAA
GGGAGGGTGGGGACGATTGGAAGCTCATTAAAAGGCTGCGAAAGGCTTTCCATCCTTTTCTATTGTTTTCGGCTGTAAAAGACTCCATTTTGAAGCAAATTGAGTTCGAA
CCTCTTTCTGTGACTAATGACGATGAGATGGGTTCTATTGCTATGAAGAATACTCGAAGATTGATCTCGTTGGAAGAAAATGCTCAGCTAATCTTGCCTTTTGCACTCAA
GCTTTTTAAGGATCTTGAAGCTTTTGGACGTAAACCTCCAGAAAAATCAATAGTGCAGAGGATAGCAGATGCCTATGAGATGCTAGGCTTGCTTGAAGAGAAAGAGAGGG
TGTTAGAGAAGTACAAAGACCTTTTTACAGATGAAAGGAAAGGGTCCATCAAGAAATATAAGAGGGTTTCGTTTGAGAAATCGAAGAAAAAAGGAAAATCGACAAAGGGC
AATAGAGACAGTGGCCATCTTATGAAGGCTCAATGA
Protein sequence
MTGIHGESPPVNFFGQRTDVFNLPDPKAGFLRPFPRARGPHCPSPALCNPLPDLYSTPFDLSSPHCLDAIDEASDFTQPHSDPPPSKKDGLTIFADKLGEKVYTPKKNPL
VDFKPTNDSPTFCSKPNRPTSTLPILLLDPIKLLASPHKLAVNISKRKIFLVAGTKHSTNPPLLDPERPLPKGSLQKEPVGSLKTLRKWMNIVGQVMELGVSRLQVGNSC
YCTMVQAQMCKQLADKDMKNKDVNNSKALCQTSEQDVEGMRKHQIGENVPRKDKINFLVNTLVNLKDSKEAVYGALDAWVAWEQNFPIASLKQALAALEKEQQWHRVVQV
IKWMLSKGQGTTMRVYWQLIRALDMDHRAEEAHKFWVMKIGTDLHSVPWQLCRSMVSIYYRNNMLDSLVKSFHRRAKLIALWLGKSGGGSSSLGLAKFRWREEGQRSMGS
SRLQGWLERVMVEEGGVEDSGYGLEMLSFVSGYKGLLVVENLVPADKEGKELGLVRGRVGSGMELFIWVELLAWSQWLTFVGTIPVVHQVVQLEVVKNGSTCTGIGVDED
EDGSQISSRMKPMLLALTFMDASIKFIRERGPHDSVPFVVEAFTDDNNGGASKHENNKLALMEVGRACESYKGREGGDDWKLIKRLRKAFHPFLLFSAVKDSILKQIEFE
PLSVTNDDEMGSIAMKNTRRLISLEENAQLILPFALKLFKDLEAFGRKPPEKSIVQRIADAYEMLGLLEEKERVLEKYKDLFTDERKGSIKKYKRVSFEKSKKKGKSTKG
NRDSGHLMKAQ
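
The protein sequence is the translation of the CDS sequence. As a minimal sketch of that relationship, the code below translates a nucleotide string with the standard genetic code, which is an assumption (the record does not name a translation table). The names `CODON_TABLE` and `translate` are hypothetical, and the two example strings are the first 21 and last 12 nucleotides of the CDS above.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS (line breaks allowed) up to the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for pos in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGACCGGAATTCATGGAGAG"))  # MTGIHGE
print(translate("AAGGCTCAATGA"))           # KAQ
```

Both fragments agree with the start (`MTGIHGE`) and end (`KAQ`) of the protein sequence listed above. Because `translate` strips whitespace, the full multi-line CDS can be passed in unchanged.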