; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g04830 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g04830
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr5:3334666..3337168
RNA-Seq ExpressionMoc05g04830
SyntenyMoc05g04830
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022147487.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Momordica charantia]3.9e-300100Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
        MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP

Query:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
        GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
Subjt:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE

Query:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
        MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
Subjt:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH

Query:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
        AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
Subjt:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
        VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH

Query:  SQNEG
        SQNEG
Subjt:  SQNEG

XP_022147489.1 pentatricopeptide repeat-containing protein At5g08510 isoform X2 [Momordica charantia]4.9e-303100Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
        MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP

Query:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
        GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
Subjt:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE

Query:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
        MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
Subjt:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH

Query:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
        AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
Subjt:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
        VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH

Query:  SQNEGMNPH
        SQNEGMNPH
Subjt:  SQNEGMNPH

XP_022979295.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita maxima]4.8e-26687.72Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
        MNQLKQIHAY LR+GVDYTKFLIEKLLQIPNLPYAC LFDLIPKPSVFLYNKFIQ++SS G  HRCW LYYQMC QGCSPN HSFTFLF ACAS  N +P
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP

Query:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
        GQMLH+HFCKSGFASDVFALTALLDMY KLG+L+SARQLFDEMPVRDIPTWNSM+AGY+RSG MGAALELFD+MP+RNV+SWTALISGY+QNGKYAKALE
Subjt:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE

Query:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
        MFLRLENE+G KPNEVT+ASVLPACAQLGALDIG+RIE YAR NGFFKNLYVSNAILEVHARCGNIEEAR+VFDEIGSKRNLCSWNTMIMGLAVHGRC  
Subjt:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH

Query:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
        A++LYDQML QR RPDDVTF+GLLLACTHGGMVAKGRQ+FESME KFQIAPKLEHYGCLVDLLGRAGE+EEAY+LIQ+MPM PDSVIWGALLGACSFHG+
Subjt:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
        VEL EVAAESLFKLEPWNPGNYVILSNIYASAGDW GVAR+RK MKGGHI KRAG SYIEVGDGIHEFIVEDRSH KSDEIYALLH IY+IIK     LH
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH

Query:  SQNEG
        SQNEG
Subjt:  SQNEG

XP_023527365.1 pentatricopeptide repeat-containing protein At5g08510 [Cucurbita pepo subsp. pepo]8.1e-26687.33Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
        MNQLKQIHAY LR+GVDYTKFLIEKLLQIPNLPYAC LFDLIPKPSVFLYNKFIQ++SS G HHRCW LYYQMC QGCSPN HSFTFLF ACAS  N +P
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP

Query:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
        GQMLH+HFCKSGFASDVFALTALLDMY KLG+L+SARQLFDE PVRDIPTWNSM+AGY+RSG MGAAL+LFD+MP RNV+SWTALISGY+QNGKYAKALE
Subjt:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE

Query:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
        MFLRLENE+G KPNEVT+ASVLPACA LGALDIG+RIEAYAR NGFFKNLYVSNAILEVHARCGNIEEAR+VFDEIGSKRNLCSWNTMIMGLAVHGRC  
Subjt:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH

Query:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
        A++LYDQML QR RPDDVTF+GLLLACTHGGMVAKGRQLFESME KFQIAPKLEHYGCLVDLLGRAGE+EEAY+LIQ+MPM+PDSVIWGALLGACSFHG+
Subjt:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
        VEL EVAAESLFKLEPWNPGNYVILSNIYASAGDW GVAR+RK MKGGHI KRAG SYIEVGDGIHEFIVEDRSH KSDEIYALLH +Y+IIK     LH
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH

Query:  SQNEG
        +QNEG
Subjt:  SQNEG

XP_038877076.1 pentatricopeptide repeat-containing protein At5g08510-like [Benincasa hispida]4.0e-26587.52Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
        M QLKQIHAY LR+G+DYTKFLIEKLLQ PNLPYAC LFDLIP+PSVFLYNKFIQ++SS GQ HRCW LYYQMC QGCSPNQHSFTFLFAACASL N +P
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP

Query:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
        GQMLH+HFCKSGFASDVFA TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSM+AGY+RSGDM AA +LFD+MPVR+VVSWT LISGY+QNGKYAKALE
Subjt:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE

Query:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFK-NLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCS
        +FLRLENE+GIKPNEVT+ASVLPACAQLGALDIG+RIEAYARNNGFFK N YVSNAILEVHARCGNI EAR+VFDEIGSKRNLCSWNTMIMGLAVHGRCS
Subjt:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFK-NLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCS

Query:  HAMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHG
         A++LYDQML +R+RPDDVTF+GLLLACTHGGMVA+GRQ+FESMES FQIAPKLEHYGCLVDLLGRAGEL+EAY LIQ MPM PDSVIWGALLGACSFHG
Subjt:  HAMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHG

Query:  SVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPAL
        +VEL EVAAESLF LEPWNPGNYVILSNIYASAGDW GVARLRK MKGG ITKRAGYSYIEVGDGIHEFIVEDRSHL+S EIYALLHGIY IIKL K   
Subjt:  SVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPAL

Query:  HSQNE
        H+QNE
Subjt:  HSQNE

TrEMBL top hitse value%identityAlignment
A0A0A0LY28 Uncharacterized protein1.7e-26185.32Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
        MNQLKQIHAY LR+G+D+TKFLIEKLLQ+P+LPYAC LFD IPKPSV+LYNKFIQ++SS G  HRCW LY QMC QGCSPNQ+SFTFLF ACASL NV+P
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP

Query:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
        GQMLH+HFCKSGFASD+FA+TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNS++AGY+RSG M AALELF++MPVRNV+SWTALISGY+QNGKYAKALE
Subjt:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE

Query:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
        MF+ LENE+G KPNEV++ASVLPAC+QLGALDIG+RIEAYARNNGFFKN YVSNA+LE+HARCGNIEEA+QVFDEIGSKRNLCSWNTMIMGLAVHGRC  
Subjt:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH

Query:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
        A++LYDQML +++RPDDVTF+GLLLACTHGGMVA+GRQLFESMESKFQ+APKLEHYGCLVDLLGRAGEL+EAYNLIQ MPM PDSVIWG LLGACSFHG+
Subjt:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
        VEL EVAAESLFKLEPWNPGNYVILSNIYA AGDW GVARLRK MKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKS EIYALLH IY IIKL K   H
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH

Query:  SQNE
          NE
Subjt:  SQNE

A0A6J1D160 pentatricopeptide repeat-containing protein At5g08510 isoform X22.4e-303100Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
        MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP

Query:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
        GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
Subjt:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE

Query:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
        MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
Subjt:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH

Query:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
        AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
Subjt:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
        VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH

Query:  SQNEGMNPH
        SQNEGMNPH
Subjt:  SQNEGMNPH

A0A6J1D2G9 pentatricopeptide repeat-containing protein At5g08510 isoform X11.9e-300100Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
        MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP

Query:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
        GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
Subjt:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE

Query:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
        MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
Subjt:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH

Query:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
        AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
Subjt:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
        VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH

Query:  SQNEG
        SQNEG
Subjt:  SQNEG

A0A6J1E8Q2 pentatricopeptide repeat-containing protein At5g08510 isoform X17.4e-26586.93Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
        MNQLKQIHAY LR+GVDYTKFLI+KLLQIPNLPYAC LFDLIPKPSVFLYNKFIQ++SS G HHRCW LYYQMC QGCSPN HSFTFLF ACAS  N +P
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP

Query:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
        GQMLH+HF KSGFASDVFALTALLDMY KLG+L+SARQLFDEMPVRDIPTWNSM+AGY+RSG MGAALELFD+MP RNV+SWTALISGY+QNGKYAKALE
Subjt:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE

Query:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
        MFLRLENE+G KPNEVT+ASVLPACA LGALDIG+RIEAYAR NGFFKNLYVSNAILEVHARCGNIEEAR+VFDEIGSKRNLCSWNTMIMGLAVHGRC  
Subjt:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH

Query:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
        A++LYDQML QR RPDDVTF+GLLLACTHGGMVAKGRQLFESME KFQIAPKLEHYGCLVDLLGRAGE+EEAY+LIQ+MPM+PDSVIWGALLGACSFHG+
Subjt:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
        VEL EVAAESLFKLEPWNPGNYVILSNIYASAGDW GVAR RK MKGGH+ KRAG SYIEVGDGIHEF+VEDRSH KSDEIYALLH +Y+IIK     LH
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH

Query:  SQNEG
        +QNEG
Subjt:  SQNEG

A0A6J1ISU4 pentatricopeptide repeat-containing protein At5g08510 isoform X12.3e-26687.72Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
        MNQLKQIHAY LR+GVDYTKFLIEKLLQIPNLPYAC LFDLIPKPSVFLYNKFIQ++SS G  HRCW LYYQMC QGCSPN HSFTFLF ACAS  N +P
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP

Query:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
        GQMLH+HFCKSGFASDVFALTALLDMY KLG+L+SARQLFDEMPVRDIPTWNSM+AGY+RSG MGAALELFD+MP+RNV+SWTALISGY+QNGKYAKALE
Subjt:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE

Query:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
        MFLRLENE+G KPNEVT+ASVLPACAQLGALDIG+RIE YAR NGFFKNLYVSNAILEVHARCGNIEEAR+VFDEIGSKRNLCSWNTMIMGLAVHGRC  
Subjt:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH

Query:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
        A++LYDQML QR RPDDVTF+GLLLACTHGGMVAKGRQ+FESME KFQIAPKLEHYGCLVDLLGRAGE+EEAY+LIQ+MPM PDSVIWGALLGACSFHG+
Subjt:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH
        VEL EVAAESLFKLEPWNPGNYVILSNIYASAGDW GVAR+RK MKGGHI KRAG SYIEVGDGIHEFIVEDRSH KSDEIYALLH IY+IIK     LH
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALH

Query:  SQNEG
        SQNEG
Subjt:  SQNEG

SwissProt top hitse value%identityAlignment
Q9C501 Pentatricopeptide repeat-containing protein At1g333502.8e-11241.04Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKL-----LQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHH--RCWSLYYQMCRQGC-SPNQHSFTFLFAAC
        +N LKQ+ ++ + SG+ ++ FL  KL     L++ NL YA  +FD    P+  LY   + +YSSS   H    +S +  M  +    PN   +  +  + 
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKL-----LQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHH--RCWSLYYQMCRQGC-SPNQHSFTFLFAAC

Query:  ASLQNVFPGQMLHAHFCKSGFASDVFALTALLDMYA-KLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQ
          L + F   ++H H  KSGF   V   TALL  YA  +  +  ARQLFDEM  R++ +W +M++GY+RSGD+  A+ LF+ MP R+V SW A+++  +Q
Subjt:  ASLQNVFPGQMLHAHFCKSGFASDVFALTALLDMYA-KLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQ

Query:  NGKYAKALEMFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMG
        NG + +A+ +F R+ NE  I+PNEVTV  VL ACAQ G L + + I A+A       +++VSN++++++ +CGN+EEA  VF ++ SK++L +WN+MI  
Subjt:  NGKYAKALEMFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMG

Query:  LAVHGRCSHAMELYDQML---TQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIW
         A+HGR   A+ ++++M+      I+PD +TFIGLL ACTHGG+V+KGR  F+ M ++F I P++EHYGCL+DLLGRAG  +EA  ++ TM M  D  IW
Subjt:  LAVHGRCSHAMELYDQML---TQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIW

Query:  GALLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGI
        G+LL AC  HG ++LAEVA ++L  L P N G   +++N+Y   G+W    R RK +K  +  K  G+S IE+ + +H+F   D+SH +++EIY +L  +
Subjt:  GALLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGI

Query:  YS
         S
Subjt:  YS

Q9FFG8 Pentatricopeptide repeat-containing protein At5g442302.5e-10840.45Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQ------IPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACAS
        +NQ+KQIH + LR G+D + +++ KL++      +P  PYA  + + +   + FL+   I+ Y+  G+     ++Y  M ++  +P   +F+ L  AC +
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQ------IPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACAS

Query:  LQNVFPGQMLHAH-FCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNG
        ++++  G+  HA  F   GF   V+    ++DMY K   +  AR++FDEMP RD+ +W  ++A Y+R G+M  A ELF+ +P +++V+WTA+++G++QN 
Subjt:  LQNVFPGQMLHAH-FCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNG

Query:  KYAKALEMFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGF--FKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMG
        K  +ALE F R+E + GI+ +EVTVA  + ACAQLGA     R    A+ +G+    ++ + +A+++++++CGN+EEA  VF  + +K N+ ++++MI+G
Subjt:  KYAKALEMFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGF--FKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMG

Query:  LAVHGRCSHAMELYDQMLTQ-RIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGA
        LA HGR   A+ L+  M+TQ  I+P+ VTF+G L+AC+H G+V +GRQ+F+SM   F + P  +HY C+VDLLGR G L+EA  LI+TM + P   +WGA
Subjt:  LAVHGRCSHAMELYDQMLTQ-RIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGA

Query:  LLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDG-IHEFIVEDRSHLKSDEI
        LLGAC  H + E+AE+AAE LF+LEP   GNY++LSN+YASAGDW GV R+RK +K   + K    S++   +G +H+F   + +H  S++I
Subjt:  LLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDG-IHEFIVEDRSHLKSDEI

Q9FNN7 Pentatricopeptide repeat-containing protein At5g085101.1e-17257.37Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
        MN +KQ+HA+ LR+GVD TK L+++LL IPNL YA  LFD       FLYNK IQ+Y    Q H    LY  +   G  P+ H+F F+FAA AS  +  P
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP

Query:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
         ++LH+ F +SGF SD F  T L+  YAKLG L  AR++FDEM  RD+P WN+M+ GY R GDM AA+ELFD MP +NV SWT +ISG+SQNG Y++AL+
Subjt:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE

Query:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
        MFL +E ++ +KPN +TV SVLPACA LG L+IGRR+E YAR NGFF N+YV NA +E++++CG I+ A+++F+E+G++RNLCSWN+MI  LA HG+   
Subjt:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH

Query:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
        A+ L+ QML +  +PD VTF+GLLLAC HGGMV KG++LF+SME   +I+PKLEHYGC++DLLGR G+L+EAY+LI+TMPM PD+V+WG LLGACSFHG+
Subjt:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPAL
        VE+AE+A+E+LFKLEP NPGN VI+SNIYA+   W GV R+RK MK   +TK AGYSY +EVG  +H+F VED+SH +S EIY +L  I+  +KL+K   
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPAL

Query:  HS
         S
Subjt:  HS

Q9LS72 Pentatricopeptide repeat-containing protein At3g292309.8e-11338.99Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNL----PYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQ
        +NQ+KQ+HA  +R  +     +  KL+   +L      A  +F+ + +P+V L N  I++++ + Q ++ + ++ +M R G   +  ++ FL  AC+   
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNL----PYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQ

Query:  NVFPGQMLHAHFCKSGFASDVFALTALLDMYA---------------------------------KLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGD
         +   +M+H H  K G +SD++   AL+D Y+                                 K G LR AR+LFDEMP RD+ +WN+M+ GY+R  +
Subjt:  NVFPGQMLHAHFCKSGFASDVFALTALLDMYA---------------------------------KLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGD

Query:  MGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALEMF------------------------LRLENER--------GIKPNEVTVASVLPACAQLGAL
        M  A ELF++MP RN VSW+ ++ GYS+ G    A  MF                        L  E +R        G+K +   V S+L AC + G L
Subjt:  MGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALEMF------------------------LRLENER--------GIKPNEVTVASVLPACAQLGAL

Query:  DIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSHAMELYDQMLTQRIRPDDVTFIGLLLACTHGG
         +G RI +  + +    N YV NA+L+++A+CGN+++A  VF++I  K++L SWNTM+ GL VHG    A+EL+ +M  + IRPD VTFI +L +C H G
Subjt:  DIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSHAMELYDQMLTQRIRPDDVTFIGLLLACTHGG

Query:  MVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYAS
        ++ +G   F SME  + + P++EHYGCLVDLLGR G L+EA  ++QTMPM P+ VIWGALLGAC  H  V++A+   ++L KL+P +PGNY +LSNIYA+
Subjt:  MVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYAS

Query:  AGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALL
        A DW GVA +R  MK   + K +G S +E+ DGIHEF V D+SH KSD+IY +L
Subjt:  AGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALL

Q9SIL5 Pentatricopeptide repeat-containing protein At2g205409.8e-11341.84Show/hide
Query:  NQLKQIHAYGLRSGVDYTKFLIEKLL----QIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCS-PNQHSFTFLFAACASLQ
        N+ K+I+A  +  G+  + F++ K++    +I ++ YA  LF+ +  P+VFLYN  I++Y+ +  +     +Y Q+ R+    P++ +F F+F +CASL 
Subjt:  NQLKQIHAYGLRSGVDYTKFLIEKLL----QIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCS-PNQHSFTFLFAACASLQ

Query:  NVFPGQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYA
        + + G+ +H H CK G    V    AL+DMY K   L  A ++FDEM  RD+ +WNS+++GY+R G M  A  LF  M  + +VSWTA+ISGY+  G Y 
Subjt:  NVFPGQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYA

Query:  KALEMFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHG
        +A++ F  ++   GI+P+E+++ SVLP+CAQLG+L++G+ I  YA   GF K   V NA++E++++CG I +A Q+F ++  K ++ SW+TMI G A HG
Subjt:  KALEMFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHG

Query:  RCSHAMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACS
            A+E +++M   +++P+ +TF+GLL AC+H GM  +G + F+ M   +QI PK+EHYGCL+D+L RAG+LE A  + +TMPM PDS IWG+LL +C 
Subjt:  RCSHAMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACS

Query:  FHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRS
          G++++A VA + L +LEP + GNYV+L+NIYA  G W  V+RLRK ++  ++ K  G S IEV + + EF+  D S
Subjt:  FHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRS

Arabidopsis top hitse value%identityAlignment
AT1G33350.1 Pentatricopeptide repeat (PPR) superfamily protein2.0e-11341.04Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKL-----LQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHH--RCWSLYYQMCRQGC-SPNQHSFTFLFAAC
        +N LKQ+ ++ + SG+ ++ FL  KL     L++ NL YA  +FD    P+  LY   + +YSSS   H    +S +  M  +    PN   +  +  + 
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKL-----LQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHH--RCWSLYYQMCRQGC-SPNQHSFTFLFAAC

Query:  ASLQNVFPGQMLHAHFCKSGFASDVFALTALLDMYA-KLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQ
          L + F   ++H H  KSGF   V   TALL  YA  +  +  ARQLFDEM  R++ +W +M++GY+RSGD+  A+ LF+ MP R+V SW A+++  +Q
Subjt:  ASLQNVFPGQMLHAHFCKSGFASDVFALTALLDMYA-KLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQ

Query:  NGKYAKALEMFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMG
        NG + +A+ +F R+ NE  I+PNEVTV  VL ACAQ G L + + I A+A       +++VSN++++++ +CGN+EEA  VF ++ SK++L +WN+MI  
Subjt:  NGKYAKALEMFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMG

Query:  LAVHGRCSHAMELYDQML---TQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIW
         A+HGR   A+ ++++M+      I+PD +TFIGLL ACTHGG+V+KGR  F+ M ++F I P++EHYGCL+DLLGRAG  +EA  ++ TM M  D  IW
Subjt:  LAVHGRCSHAMELYDQML---TQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIW

Query:  GALLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGI
        G+LL AC  HG ++LAEVA ++L  L P N G   +++N+Y   G+W    R RK +K  +  K  G+S IE+ + +H+F   D+SH +++EIY +L  +
Subjt:  GALLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGI

Query:  YS
         S
Subjt:  YS

AT2G20540.1 mitochondrial editing factor 216.9e-11441.84Show/hide
Query:  NQLKQIHAYGLRSGVDYTKFLIEKLL----QIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCS-PNQHSFTFLFAACASLQ
        N+ K+I+A  +  G+  + F++ K++    +I ++ YA  LF+ +  P+VFLYN  I++Y+ +  +     +Y Q+ R+    P++ +F F+F +CASL 
Subjt:  NQLKQIHAYGLRSGVDYTKFLIEKLL----QIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCS-PNQHSFTFLFAACASLQ

Query:  NVFPGQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYA
        + + G+ +H H CK G    V    AL+DMY K   L  A ++FDEM  RD+ +WNS+++GY+R G M  A  LF  M  + +VSWTA+ISGY+  G Y 
Subjt:  NVFPGQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYA

Query:  KALEMFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHG
        +A++ F  ++   GI+P+E+++ SVLP+CAQLG+L++G+ I  YA   GF K   V NA++E++++CG I +A Q+F ++  K ++ SW+TMI G A HG
Subjt:  KALEMFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHG

Query:  RCSHAMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACS
            A+E +++M   +++P+ +TF+GLL AC+H GM  +G + F+ M   +QI PK+EHYGCL+D+L RAG+LE A  + +TMPM PDS IWG+LL +C 
Subjt:  RCSHAMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACS

Query:  FHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRS
          G++++A VA + L +LEP + GNYV+L+NIYA  G W  V+RLRK ++  ++ K  G S IEV + + EF+  D S
Subjt:  FHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRS

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.9e-11438.99Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNL----PYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQ
        +NQ+KQ+HA  +R  +     +  KL+   +L      A  +F+ + +P+V L N  I++++ + Q ++ + ++ +M R G   +  ++ FL  AC+   
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNL----PYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQ

Query:  NVFPGQMLHAHFCKSGFASDVFALTALLDMYA---------------------------------KLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGD
         +   +M+H H  K G +SD++   AL+D Y+                                 K G LR AR+LFDEMP RD+ +WN+M+ GY+R  +
Subjt:  NVFPGQMLHAHFCKSGFASDVFALTALLDMYA---------------------------------KLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGD

Query:  MGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALEMF------------------------LRLENER--------GIKPNEVTVASVLPACAQLGAL
        M  A ELF++MP RN VSW+ ++ GYS+ G    A  MF                        L  E +R        G+K +   V S+L AC + G L
Subjt:  MGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALEMF------------------------LRLENER--------GIKPNEVTVASVLPACAQLGAL

Query:  DIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSHAMELYDQMLTQRIRPDDVTFIGLLLACTHGG
         +G RI +  + +    N YV NA+L+++A+CGN+++A  VF++I  K++L SWNTM+ GL VHG    A+EL+ +M  + IRPD VTFI +L +C H G
Subjt:  DIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSHAMELYDQMLTQRIRPDDVTFIGLLLACTHGG

Query:  MVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYAS
        ++ +G   F SME  + + P++EHYGCLVDLLGR G L+EA  ++QTMPM P+ VIWGALLGAC  H  V++A+   ++L KL+P +PGNY +LSNIYA+
Subjt:  MVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYAS

Query:  AGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALL
        A DW GVA +R  MK   + K +G S +E+ DGIHEF V D+SH KSD+IY +L
Subjt:  AGDWRGVARLRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALL

AT5G08510.1 Pentatricopeptide repeat (PPR) superfamily protein7.8e-17457.37Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP
        MN +KQ+HA+ LR+GVD TK L+++LL IPNL YA  LFD       FLYNK IQ+Y    Q H    LY  +   G  P+ H+F F+FAA AS  +  P
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFP

Query:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE
         ++LH+ F +SGF SD F  T L+  YAKLG L  AR++FDEM  RD+P WN+M+ GY R GDM AA+ELFD MP +NV SWT +ISG+SQNG Y++AL+
Subjt:  GQMLHAHFCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALE

Query:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH
        MFL +E ++ +KPN +TV SVLPACA LG L+IGRR+E YAR NGFF N+YV NA +E++++CG I+ A+++F+E+G++RNLCSWN+MI  LA HG+   
Subjt:  MFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSH

Query:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS
        A+ L+ QML +  +PD VTF+GLLLAC HGGMV KG++LF+SME   +I+PKLEHYGC++DLLGR G+L+EAY+LI+TMPM PD+V+WG LLGACSFHG+
Subjt:  AMELYDQMLTQRIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGS

Query:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPAL
        VE+AE+A+E+LFKLEP NPGN VI+SNIYA+   W GV R+RK MK   +TK AGYSY +EVG  +H+F VED+SH +S EIY +L  I+  +KL+K   
Subjt:  VELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPAL

Query:  HS
         S
Subjt:  HS

AT5G44230.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-10940.45Show/hide
Query:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQ------IPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACAS
        +NQ+KQIH + LR G+D + +++ KL++      +P  PYA  + + +   + FL+   I+ Y+  G+     ++Y  M ++  +P   +F+ L  AC +
Subjt:  MNQLKQIHAYGLRSGVDYTKFLIEKLLQ------IPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACAS

Query:  LQNVFPGQMLHAH-FCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNG
        ++++  G+  HA  F   GF   V+    ++DMY K   +  AR++FDEMP RD+ +W  ++A Y+R G+M  A ELF+ +P +++V+WTA+++G++QN 
Subjt:  LQNVFPGQMLHAH-FCKSGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNG

Query:  KYAKALEMFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGF--FKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMG
        K  +ALE F R+E + GI+ +EVTVA  + ACAQLGA     R    A+ +G+    ++ + +A+++++++CGN+EEA  VF  + +K N+ ++++MI+G
Subjt:  KYAKALEMFLRLENERGIKPNEVTVASVLPACAQLGALDIGRRIEAYARNNGF--FKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMG

Query:  LAVHGRCSHAMELYDQMLTQ-RIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGA
        LA HGR   A+ L+  M+TQ  I+P+ VTF+G L+AC+H G+V +GRQ+F+SM   F + P  +HY C+VDLLGR G L+EA  LI+TM + P   +WGA
Subjt:  LAVHGRCSHAMELYDQMLTQ-RIRPDDVTFIGLLLACTHGGMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGA

Query:  LLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDG-IHEFIVEDRSHLKSDEI
        LLGAC  H + E+AE+AAE LF+LEP   GNY++LSN+YASAGDW GV R+RK +K   + K    S++   +G +H+F   + +H  S++I
Subjt:  LLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVARLRKTMKGGHITKRAGYSYIEVGDG-IHEFIVEDRSHLKSDEI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAACCAATTGAAGCAAATTCATGCGTACGGCCTCAGAAGCGGCGTAGATTACACAAAATTCCTCATCGAGAAACTCCTCCAAATCCCAAATTTGCCATATGCTTGCGC
CCTCTTCGACCTCATTCCGAAGCCCTCTGTTTTTCTCTACAACAAGTTCATTCAATCGTATTCTTCCTCTGGTCAGCACCACCGATGTTGGTCGCTTTACTACCAAATGT
GCCGGCAAGGCTGCTCACCGAACCAGCACTCCTTCACTTTTCTCTTTGCCGCCTGCGCTTCGCTTCAAAATGTTTTCCCAGGTCAGATGCTTCATGCCCATTTCTGTAAG
TCGGGATTTGCCTCAGATGTATTTGCCTTAACGGCGTTGTTGGACATGTACGCGAAATTGGGTATGTTGAGGTCTGCCCGCCAACTGTTCGATGAAATGCCTGTTAGAGA
TATACCCACTTGGAACTCGATGGTTGCTGGCTATTCAAGGTCCGGGGACATGGGGGCAGCGTTAGAATTGTTCGACCGAATGCCTGTAAGAAATGTGGTCTCGTGGACCG
CATTGATATCTGGGTATTCGCAGAATGGCAAGTATGCCAAGGCACTGGAGATGTTTCTGAGATTGGAAAATGAGAGAGGCATTAAACCAAATGAGGTTACCGTAGCTAGT
GTTCTTCCTGCCTGTGCTCAGCTTGGGGCGTTGGATATTGGGAGGAGGATTGAAGCATACGCACGAAATAATGGTTTTTTCAAAAACTTGTACGTAAGTAATGCTATACT
GGAAGTGCATGCCAGGTGCGGGAATATTGAGGAAGCTAGACAAGTTTTTGATGAGATTGGAAGCAAAAGAAACTTGTGCTCGTGGAATACCATGATTATGGGATTGGCTG
TCCATGGAAGATGCAGCCATGCTATGGAGCTTTATGATCAAATGCTGACACAAAGAATAAGACCTGATGACGTTACATTCATAGGTCTTCTCTTGGCTTGCACTCACGGG
GGCATGGTTGCGAAAGGCCGACAACTCTTCGAATCAATGGAAAGTAAGTTTCAAATTGCTCCTAAATTAGAGCACTATGGCTGCTTGGTTGATCTATTAGGCAGGGCTGG
AGAACTAGAAGAAGCTTACAATCTCATTCAAACCATGCCTATGGTTCCTGATTCTGTAATTTGGGGAGCTCTTCTGGGAGCTTGCAGCTTCCATGGCAGTGTCGAATTAG
CTGAAGTAGCAGCTGAGTCTCTCTTCAAGCTCGAGCCGTGGAACCCTGGAAATTATGTCATTCTCTCGAACATTTATGCATCGGCTGGTGATTGGCGCGGAGTTGCGAGG
CTGAGGAAGACGATGAAGGGAGGACATATTACAAAGAGAGCAGGGTATAGTTATATTGAAGTGGGAGATGGTATTCATGAGTTCATTGTAGAAGATAGATCACATCTGAA
GAGTGATGAAATATATGCTTTACTTCATGGAATTTATTCAATTATTAAACTTCAGAAGCCTGCACTTCATAGTCAGAATGAAGGAATGAATCCCCACTAA
mRNA sequenceShow/hide mRNA sequence
ATGAACCAATTGAAGCAAATTCATGCGTACGGCCTCAGAAGCGGCGTAGATTACACAAAATTCCTCATCGAGAAACTCCTCCAAATCCCAAATTTGCCATATGCTTGCGC
CCTCTTCGACCTCATTCCGAAGCCCTCTGTTTTTCTCTACAACAAGTTCATTCAATCGTATTCTTCCTCTGGTCAGCACCACCGATGTTGGTCGCTTTACTACCAAATGT
GCCGGCAAGGCTGCTCACCGAACCAGCACTCCTTCACTTTTCTCTTTGCCGCCTGCGCTTCGCTTCAAAATGTTTTCCCAGGTCAGATGCTTCATGCCCATTTCTGTAAG
TCGGGATTTGCCTCAGATGTATTTGCCTTAACGGCGTTGTTGGACATGTACGCGAAATTGGGTATGTTGAGGTCTGCCCGCCAACTGTTCGATGAAATGCCTGTTAGAGA
TATACCCACTTGGAACTCGATGGTTGCTGGCTATTCAAGGTCCGGGGACATGGGGGCAGCGTTAGAATTGTTCGACCGAATGCCTGTAAGAAATGTGGTCTCGTGGACCG
CATTGATATCTGGGTATTCGCAGAATGGCAAGTATGCCAAGGCACTGGAGATGTTTCTGAGATTGGAAAATGAGAGAGGCATTAAACCAAATGAGGTTACCGTAGCTAGT
GTTCTTCCTGCCTGTGCTCAGCTTGGGGCGTTGGATATTGGGAGGAGGATTGAAGCATACGCACGAAATAATGGTTTTTTCAAAAACTTGTACGTAAGTAATGCTATACT
GGAAGTGCATGCCAGGTGCGGGAATATTGAGGAAGCTAGACAAGTTTTTGATGAGATTGGAAGCAAAAGAAACTTGTGCTCGTGGAATACCATGATTATGGGATTGGCTG
TCCATGGAAGATGCAGCCATGCTATGGAGCTTTATGATCAAATGCTGACACAAAGAATAAGACCTGATGACGTTACATTCATAGGTCTTCTCTTGGCTTGCACTCACGGG
GGCATGGTTGCGAAAGGCCGACAACTCTTCGAATCAATGGAAAGTAAGTTTCAAATTGCTCCTAAATTAGAGCACTATGGCTGCTTGGTTGATCTATTAGGCAGGGCTGG
AGAACTAGAAGAAGCTTACAATCTCATTCAAACCATGCCTATGGTTCCTGATTCTGTAATTTGGGGAGCTCTTCTGGGAGCTTGCAGCTTCCATGGCAGTGTCGAATTAG
CTGAAGTAGCAGCTGAGTCTCTCTTCAAGCTCGAGCCGTGGAACCCTGGAAATTATGTCATTCTCTCGAACATTTATGCATCGGCTGGTGATTGGCGCGGAGTTGCGAGG
CTGAGGAAGACGATGAAGGGAGGACATATTACAAAGAGAGCAGGGTATAGTTATATTGAAGTGGGAGATGGTATTCATGAGTTCATTGTAGAAGATAGATCACATCTGAA
GAGTGATGAAATATATGCTTTACTTCATGGAATTTATTCAATTATTAAACTTCAGAAGCCTGCACTTCATAGTCAGAATGAAGGAATGAATCCCCACTAA
Protein sequenceShow/hide protein sequence
MNQLKQIHAYGLRSGVDYTKFLIEKLLQIPNLPYACALFDLIPKPSVFLYNKFIQSYSSSGQHHRCWSLYYQMCRQGCSPNQHSFTFLFAACASLQNVFPGQMLHAHFCK
SGFASDVFALTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSMVAGYSRSGDMGAALELFDRMPVRNVVSWTALISGYSQNGKYAKALEMFLRLENERGIKPNEVTVAS
VLPACAQLGALDIGRRIEAYARNNGFFKNLYVSNAILEVHARCGNIEEARQVFDEIGSKRNLCSWNTMIMGLAVHGRCSHAMELYDQMLTQRIRPDDVTFIGLLLACTHG
GMVAKGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELEEAYNLIQTMPMVPDSVIWGALLGACSFHGSVELAEVAAESLFKLEPWNPGNYVILSNIYASAGDWRGVAR
LRKTMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSDEIYALLHGIYSIIKLQKPALHSQNEGMNPH