; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034133 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034133
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr3:4712704..4715871
RNA-Seq ExpressionLag0034133
SyntenyLag0034133
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589516.1 Pentatricopeptide repeat-containing protein, chloroplastic/mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]2.2e-26587.4Show/hide
Query:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS
        MI +N   SNL  SL S+++ KFPHFQLL PKFI SSDRKT T   PNG  F+KEQR LSLFKQCS VKDLNQ+HAR+IQSGFDQNLFV+GKLIEFC+VS
Subjt:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS

Query:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY
          GDMNYAV VFN IENPDGFLWNTMIRGFGRIS L KAFEFYKRMLEKGIAADNFTFSFLLKI GQLGSIMLGKQLHV ILKLGL+SHVYVRNTLIHMY
Subjt:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY

Query:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM
        GRLKD+K+ARNLFDEMP+P LVAWNTVIDCHVSCGMYNEAL LFLQMLQ GV+PDEATLVVTVSACSALGALDFGRWVHSHVK NDRGKTIAVFNSLIDM
Subjt:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM

Query:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM
        YAKCGAVEDAREMFNA   KNIVTWNTMIM LATHGD EDALTLFS MLAEK+ TPD VTFLGVLCACNHGGKVEEGRRYFDLMTK  NIQPT+KHYGSM
Subjt:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM

Query:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY
        VDILGRAGFVEEAYQLIRSMPMECNA+IWRTLLAACR+HGNVKLGERVRSHVLE+EPDHSSDYVLLAN+YASSGQWNEMMK+RKSMQRKGVQKPEPGNS+
Subjt:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY

Query:  LEINPQRKLELETVGN
        LEINP RKLE+ETV N
Subjt:  LEINPQRKLELETVGN

XP_022135209.1 pentatricopeptide repeat-containing protein At2g02980, chloroplastic-like [Momordica charantia]1.3e-26585.93Show/hide
Query:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFI---SSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFC
        MI I  R SN  +SLCSQI+ KFPHF LLFPKFI    SSD +  T  RPNG DFAKE+R+LSLFKQCSTVKDLNQ+HAR++Q+GFDQNLFV+ KLIEFC
Subjt:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFI---SSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFC

Query:  SVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLI
        +VSD GDMNYAVL FN IENPDGFLWNTMIRGFGRISKLQ+AFEFYKRMLEKGI+ADNFTFSFLLKISGQ+GS++LGKQLHV ILKLGLESHVYVRNTLI
Subjt:  SVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLI

Query:  HMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSL
        HMYG LKDIK ARNLF EMP+PDLVAWNTVIDCHVSCGMY EALGLF +M Q G++PDEATLVVTVSACSALGALD GRWVHSHVK NDRGKT+AVFNSL
Subjt:  HMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSL

Query:  IDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHY
        IDMYAKCGAVEDA ++FN +G KN VTWNTMIMALATHGDAEDALTLFSKML EKLE PDDVTFLGVLCACNHGG+VEEGRRYFDLMTK FN+QPTLKHY
Subjt:  IDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHY

Query:  GSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPG
        GSMVD+LGRAGFVEEAYQLIRSMPMECNA+IWRTLLAACR+HGNVKLGERVRSHVLE+EPDHSSDYVLLANMYA+SGQWNEMMKIRKSMQRKGVQKPEPG
Subjt:  GSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPG

Query:  NSYLEINPQRKLELETVGN
        NSYLEINP RKLE+ETV N
Subjt:  NSYLEINPQRKLELETVGN

XP_022987183.1 pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial-like [Cucurbita maxima]2.2e-26587.21Show/hide
Query:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS
        MI +N   SNL LSLCS+++ KFPHFQLL PK+I SSD KT T T PNGS F+KEQR LSLFKQCS VKDLNQ+HAR IQSGFDQNLFV+GKLIEFC+VS
Subjt:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS

Query:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY
          GDMNYAV VFN IENPDGFLWNTMIRGFGRIS L KAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIML KQLH  ILKLGL+SHVYVRNTLIHMY
Subjt:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY

Query:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM
        GRLKD+K+ARNLFDEMP+P LVAWNTVIDCHVSCGMYNEAL LFLQMLQ GV+PDEATLVVTVSACSALGALDFGRWVHSHVK NDRGKTIAVFNSLIDM
Subjt:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM

Query:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM
        YAKCGAVEDAREMFNA   KNIVTWNTMIM LATHGD EDALTLFS MLAEK+ TPD VTFLGVLCACNHGGKVEEGRRYFDLMTK  +IQPT+KHYGSM
Subjt:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM

Query:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY
        VDILGRAGFVEEAYQLIRSMPMECNA+IWRTLLAACR+HGNVKLGERVRSHVLE+EPDHSSDYVLLAN+YASSGQWNEMMK+RKSMQRKGVQKPEPGNS+
Subjt:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY

Query:  LEINPQRKLELETVGN
        LEINP RKLE++TV N
Subjt:  LEINPQRKLELETVGN

XP_023516362.1 pentatricopeptide repeat-containing protein At4g21065-like [Cucurbita pepo subsp. pepo]1.3e-26587.4Show/hide
Query:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS
        MI +N   SNL  SLCS+++ KFPHFQLL PKFI SSDRKT T   PNGS F+KEQR LSLFKQCS VKDLNQVHAR I+SGFDQNLFV+GKLIEFC+VS
Subjt:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS

Query:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY
          GDMNYAV VFN IENPDGFLWNTMIRGFGRIS L KAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHV ILKLGL+SHVYVRNTLIHMY
Subjt:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY

Query:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM
        G LKD+K+ARNLFDEMP+P LVAWNTVIDCHVSCGMYNEAL LFLQMLQ GV+PDEATLVVTVSACSALGALDFGRWVHSHVK NDRGKTIAVFNSLIDM
Subjt:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM

Query:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM
        YAKCGAVE+AREMFNA   KNIVTWNTMIM LATHGD EDALTLFS MLAEK+ TPD VTFLGVLCACNHGGKVEEGRRYFDLMTK  NIQPT+KHYGSM
Subjt:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM

Query:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY
        VDILGRAGFVEEAYQLIRSMPMECNA+IWRTLLAACR+HGNVKLGERVRSHVLE+EPDHSSDYVLLAN+YASSGQWNEMMK+RKSMQRKGVQKPEPGNS+
Subjt:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY

Query:  LEINPQRKLELETVGN
        LEINP RKLE++TV N
Subjt:  LEINPQRKLELETVGN

XP_038880037.1 pentatricopeptide repeat-containing protein At4g21065-like [Benincasa hispida]4.3e-26689.29Show/hide
Query:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS
        MI I SR +NL LSL SQ+KFKFPHFQLL PKF+ SSDRKTAT T PNGS FAKEQR LSL KQCSTVKDLNQ+HAR+IQ GFDQNLFV+GKLIEFC+VS
Subjt:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS

Query:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY
        + GDMNYAV+VFNGIENPDGFLWNTMIRGFGRISKL KAFEFYKRMLEKGIAADNFTFSFLLKISGQ GSIMLGKQLHV ILK+GL+SHVYVRNTLIHMY
Subjt:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY

Query:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM
        GRLKD+KIARNLFDEMP+PDLVAWN VIDCHVSCGMY+EALGLFLQMLQ GV+PDEATLVVTVSACSALGALDFGRWVHSHVK ND+GKTIAV NSLIDM
Subjt:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM

Query:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM
        YAKCGAVEDAREMFNA+ DKN+VTWNTMIM LATHGDAEDALTLFS MLA+K+ETPD VTFLGVLCACNHGGKVEEGR YF LMTK FNIQPT+KHYGSM
Subjt:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM

Query:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY
        VDILGRAGFVEEAYQLIRSMPMECNA+IWRTLLAACR+HGNVKLGERVRSHVLE+EPDHSSDYVLLANMYASSGQWNEM+K RKSMQRKGVQKPEPGNSY
Subjt:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY

Query:  LEIN
        LEIN
Subjt:  LEIN

TrEMBL top hitse value%identityAlignment
A0A1S3BWU9 pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial9.5e-25183.96Show/hide
Query:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS
        MI IN R  NL LSLCS  K KF  FQL+ P+FI +S  KTAT T PNG  FAKEQ ++ LFKQCST+K LNQ+HAR+I+ GFDQNLFV+GKLI+FC+VS
Subjt:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS

Query:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY
        D GDMNYAV VF+ IENPDGFLWNTMIRGFGRI KL  AFEFYKRMLEKGI ADNFTFSFLLKI+GQLGSIMLGKQLHV ILKLGL+SHVYVRNTLIHMY
Subjt:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY

Query:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM
        GRLKD+ IARNLFDE+P+ DLVAWN VIDCHVSCGMYNEAL LFLQMLQ GV+PDEATLVVTVSACSALGALDFGRW+HSHV  NDRGKT AVFNSLIDM
Subjt:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM

Query:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM
        YAKCGAV+ AREMFNA+  KN+VTWNTMIM LATHGDAEDALTLFS ML EK+ETPD VTFL VLCACNHGGKVEEGRRYFDLMTK FNIQPTLKHYGSM
Subjt:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM

Query:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY
        VDILGRAGFVEEAYQLI+SMPMECNA+IWRTLLAAC++HGNVKLGERV SHVLE+EPDHSSDYVLLANMYASSGQWNEM+K RKSMQ+KGV+KPEPGNSY
Subjt:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY

Query:  LEINP
        LEINP
Subjt:  LEINP

A0A5A7UQ05 Pentatricopeptide repeat-containing protein9.5e-25183.96Show/hide
Query:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS
        MI IN R  NL LSLCS  K KF  FQL+ P+FI +S  KTAT T PNG  FAKEQ ++ LFKQCST+K LNQ+HAR+I+ GFDQNLFV+GKLI+FC+VS
Subjt:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS

Query:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY
        D GDMNYAV VF+ IENPDGFLWNTMIRGFGRI KL  AFEFYKRMLEKGI ADNFTFSFLLKI+GQLGSIMLGKQLHV ILKLGL+SHVYVRNTLIHMY
Subjt:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY

Query:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM
        GRLKD+ IARNLFDE+P+ DLVAWN VIDCHVSCGMYNEAL LFLQMLQ GV+PDEATLVVTVSACSALGALDFGRW+HSHV  NDRGKT AVFNSLIDM
Subjt:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM

Query:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM
        YAKCGAV+ AREMFNA+  KN+VTWNTMIM LATHGDAEDALTLFS ML EK+ETPD VTFL VLCACNHGGKVEEGRRYFDLMTK FNIQPTLKHYGSM
Subjt:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM

Query:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY
        VDILGRAGFVEEAYQLI+SMPMECNA+IWRTLLAAC++HGNVKLGERV SHVLE+EPDHSSDYVLLANMYASSGQWNEM+K RKSMQ+KGV+KPEPGNSY
Subjt:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY

Query:  LEINP
        LEINP
Subjt:  LEINP

A0A6J1C0S9 pentatricopeptide repeat-containing protein At2g02980, chloroplastic-like6.1e-26685.93Show/hide
Query:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFI---SSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFC
        MI I  R SN  +SLCSQI+ KFPHF LLFPKFI    SSD +  T  RPNG DFAKE+R+LSLFKQCSTVKDLNQ+HAR++Q+GFDQNLFV+ KLIEFC
Subjt:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFI---SSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFC

Query:  SVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLI
        +VSD GDMNYAVL FN IENPDGFLWNTMIRGFGRISKLQ+AFEFYKRMLEKGI+ADNFTFSFLLKISGQ+GS++LGKQLHV ILKLGLESHVYVRNTLI
Subjt:  SVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLI

Query:  HMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSL
        HMYG LKDIK ARNLF EMP+PDLVAWNTVIDCHVSCGMY EALGLF +M Q G++PDEATLVVTVSACSALGALD GRWVHSHVK NDRGKT+AVFNSL
Subjt:  HMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSL

Query:  IDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHY
        IDMYAKCGAVEDA ++FN +G KN VTWNTMIMALATHGDAEDALTLFSKML EKLE PDDVTFLGVLCACNHGG+VEEGRRYFDLMTK FN+QPTLKHY
Subjt:  IDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHY

Query:  GSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPG
        GSMVD+LGRAGFVEEAYQLIRSMPMECNA+IWRTLLAACR+HGNVKLGERVRSHVLE+EPDHSSDYVLLANMYA+SGQWNEMMKIRKSMQRKGVQKPEPG
Subjt:  GSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPG

Query:  NSYLEINPQRKLELETVGN
        NSYLEINP RKLE+ETV N
Subjt:  NSYLEINPQRKLELETVGN

A0A6J1E6Y6 pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial-like3.4e-26487.02Show/hide
Query:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS
        MI +N   SNL  SL S+++ KFPHFQLL PKFI SSDRKT T   PNG  F+KEQR LSLFKQCS VKDLNQ+HAR+IQSGFDQNLFV+GKLIEFC+VS
Subjt:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS

Query:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY
          GDMNYAV VFN IENPDGFLWNTMIRGFGRIS L KAFEFYKRMLEKGIAADNFTFSFLLKI GQLGSIMLGKQLHV ILKLGL+SHVYVRNTLIHMY
Subjt:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY

Query:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM
        G LKD+K+ARNLFDEMP+P LVAWNTVIDCHVSCGMYNEAL LFLQMLQ GV+PDEATLVVTVSACSALGALDFGRWVHSHVK ND GKTIAVFNSLIDM
Subjt:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM

Query:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM
        YAKCGAVEDAREMFNA   KNIVTWNTMIM LATHGD EDALTLFS MLAEK+ TPD VTFLGVLCACNHGGKVEEGRRYFDLMTK  NIQPT+KHYGSM
Subjt:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM

Query:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY
        VDILGRAGFVEEAYQLIRSMPMECNA+IWRTLLAACR+HGNVKLGERVRSHVLE+EPDHSSDYVLLAN+YASSGQWNEMMK+RKSMQRKGVQKPEPGNS+
Subjt:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY

Query:  LEINPQRKLELETVGN
        LEINP RKLE+ETV N
Subjt:  LEINPQRKLELETVGN

A0A6J1JI60 pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial-like1.0e-26587.21Show/hide
Query:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS
        MI +N   SNL LSLCS+++ KFPHFQLL PK+I SSD KT T T PNGS F+KEQR LSLFKQCS VKDLNQ+HAR IQSGFDQNLFV+GKLIEFC+VS
Subjt:  MIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS

Query:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY
          GDMNYAV VFN IENPDGFLWNTMIRGFGRIS L KAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIML KQLH  ILKLGL+SHVYVRNTLIHMY
Subjt:  DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMY

Query:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM
        GRLKD+K+ARNLFDEMP+P LVAWNTVIDCHVSCGMYNEAL LFLQMLQ GV+PDEATLVVTVSACSALGALDFGRWVHSHVK NDRGKTIAVFNSLIDM
Subjt:  GRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDM

Query:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM
        YAKCGAVEDAREMFNA   KNIVTWNTMIM LATHGD EDALTLFS MLAEK+ TPD VTFLGVLCACNHGGKVEEGRRYFDLMTK  +IQPT+KHYGSM
Subjt:  YAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSM

Query:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY
        VDILGRAGFVEEAYQLIRSMPMECNA+IWRTLLAACR+HGNVKLGERVRSHVLE+EPDHSSDYVLLAN+YASSGQWNEMMK+RKSMQRKGVQKPEPGNS+
Subjt:  VDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSY

Query:  LEINPQRKLELETVGN
        LEINP RKLE++TV N
Subjt:  LEINPQRKLELETVGN

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210651.7e-10042.08Show/hide
Query:  STVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS--DRGDMNYAVLVFNGIENP-DGFLWNTMIRGFGRISKLQKAFEFYKRMLEKG-IAADNFTFSFL
        S++  L Q+HA  I+ G   +   +GK + F  VS      M+YA  VF+ IE P + F+WNT+IRG+  I     AF  Y+ M   G +  D  T+ FL
Subjt:  STVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS--DRGDMNYAVLVFNGIENP-DGFLWNTMIRGFGRISKLQKAFEFYKRMLEKG-IAADNFTFSFL

Query:  LKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVV
        +K    +  + LG+ +H  +++ G  S +YV+N+L+H+Y    D+  A  +FD+MP  DLVAWN+VI+     G   EAL L+ +M   G++PD  T+V 
Subjt:  LKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVV

Query:  TVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTF
         +SAC+ +GAL  G+ VH ++      + +   N L+D+YA+CG VE+A+ +F+ + DKN V+W ++I+ LA +G  ++A+ LF  M + +   P ++TF
Subjt:  TVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTF

Query:  LGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSS
        +G+L AC+H G V+EG  YF  M +++ I+P ++H+G MVD+L RAG V++AY+ I+SMPM+ N +IWRTLL AC VHG+  L E  R  +L+LEP+HS 
Subjt:  LGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSS

Query:  DYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSYLEI
        DYVLL+NMYAS  +W+++ KIRK M R GV+K  PG+S +E+
Subjt:  DYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSYLEI

Q0WQW5 Pentatricopeptide repeat-containing protein At1g59720, chloroplastic/mitochondrial3.9e-10040.77Show/hide
Query:  QRILSLFKQCSTVKDLNQVHARVIQSGFDQ---NLFVVGKLIEFCSVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGR-ISKLQKAFEFYKRMLEKGI
        QRI SL + CS +  L Q+HA  +++ + +    LF+ GK+++    S   D+NYA  VF+ IEN   F+WNT+IR     +S+ ++AF  Y++MLE+G 
Subjt:  QRILSLFKQCSTVKDLNQVHARVIQSGFDQ---NLFVVGKLIEFCSVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGR-ISKLQKAFEFYKRMLEKGI

Query:  AA-DNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQC
        ++ D  TF F+LK    +     GKQ+H  I+K G    VYV N LIH+YG    + +AR +FDEMP   LV+WN++ID  V  G Y+ AL LF +M Q 
Subjt:  AA-DNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQC

Query:  GVQPDEATLVVTVSACSALGALDFGRWVHSHVKGN---DRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSK
          +PD  T+   +SAC+ LG+L  G W H+ +      D    + V NSLI+MY KCG++  A ++F  +  +++ +WN MI+  ATHG AE+A+  F +
Subjt:  GVQPDEATLVVTVSACSALGALDFGRWVHSHVKGN---DRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSK

Query:  MLAEKLET-PDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLL-AACRVHGNVKLG
        M+ ++    P+ VTF+G+L ACNH G V +GR+YFD+M + + I+P L+HYG +VD++ RAG++ EA  ++ SMPM+ +A+IWR+LL A C+   +V+L 
Subjt:  MLAEKLET-PDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLL-AACRVHGNVKLG

Query:  ERVRSHVLELEPDHSSD-------YVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSYLEIN
        E +  +++  + D+ S        YVLL+ +YAS+ +WN++  +RK M   G++K EPG S +EIN
Subjt:  ERVRSHVLELEPDHSSD-------YVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSYLEIN

Q8LK93 Pentatricopeptide repeat-containing protein At2g02980, chloroplastic3.0e-10040.13Show/hide
Query:  SSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS-DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRIS
        SS     T T+ +  D    Q  + L  +C+++++L Q+ A  I+S  +   F V KLI FC+ S     M+YA  +F  +  PD  ++N+M RG+ R +
Subjt:  SSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS-DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRIS

Query:  KLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSC
           + F  +  +LE GI  DN+TF  LLK      ++  G+QLH   +KLGL+ +VYV  TLI+MY   +D+  AR +FD +  P +V +N +I  +   
Subjt:  KLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSC

Query:  GMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALAT
           NEAL LF +M    ++P+E TL+  +S+C+ LG+LD G+W+H + K +   K + V  +LIDM+AKCG+++DA  +F  +  K+   W+ MI+A A 
Subjt:  GMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALAT

Query:  HGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLA
        HG AE ++ +F +M +E ++ PD++TFLG+L AC+H G+VEEGR+YF  M  +F I P++KHYGSMVD+L RAG +E+AY+ I  +P+    ++WR LLA
Subjt:  HGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLA

Query:  ACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQ-RKGVQKPEPGNSYLEIN
        AC  H N+ L E+V   + EL+  H  DYV+L+N+YA + +W  +  +RK M+ RK V+   PG S +E+N
Subjt:  ACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQ-RKGVQKPEPGNSYLEIN

Q9C6T2 Pentatricopeptide repeat-containing protein At1g319201.3e-9539.51Show/hide
Query:  KEQRILSLFKQCSTVKDLNQVHARVIQ-SGFDQNLFVVGKLIEFCSVSD-RGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGI
        KEQ  L L K+C  + +  QVHAR I+ S F  + F    ++  C+ S     MNYA  +F GI++P  F +NTMIRG+  +   ++A  FY  M+++G 
Subjt:  KEQRILSLFKQCSTVKDLNQVHARVIQ-SGFDQNLFVVGKLIEFCSVSD-RGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGI

Query:  AADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQML-QC
          DNFT+  LLK   +L SI  GKQ+H  + KLGLE+ V+V+N+LI+MYGR  +++++  +F+++      +W++++      GM++E L LF  M  + 
Subjt:  AADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQML-QC

Query:  GVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLA
         ++ +E+ +V  + AC+  GAL+ G  +H  +  N     I V  SL+DMY KCG ++ A  +F  +  +N +T++ MI  LA HG+ E AL +FSKM+ 
Subjt:  GVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLA

Query:  EKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRS
        E LE PD V ++ VL AC+H G V+EGRR F  M K+  ++PT +HYG +VD+LGRAG +EEA + I+S+P+E N +IWRT L+ CRV  N++LG+    
Subjt:  EKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRS

Query:  HVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSYLEI
         +L+L   +  DY+L++N+Y+    W+++ + R  +  KG+ K  PG S +E+
Subjt:  HVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSYLEI

Q9LXF2 Pentatricopeptide repeat-containing protein At5g153009.2e-9436.05Show/hide
Query:  RPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKR
        R   +D    +R   L++ C  ++ L Q+HA ++ +G   NL VVG+LI   S+S  G + YA  +F+ I  PD  + N ++RG  +  K +K    Y  
Subjt:  RPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKR

Query:  MLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKI-------------------------------ARNLFD
        M ++G++ D +TF+F+LK   +L     G   H  +++ G   + YV+N LI  +    D+ I                               A  LFD
Subjt:  MLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKI-------------------------------ARNLFD

Query:  EMP-------------------------------RPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVK
        EMP                                 D+V WN +I  +V+CG   EALG+F +M   G  PD  T++  +SAC+ LG L+ G+ +H ++ 
Subjt:  EMP-------------------------------RPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVK

Query:  GNDRGKT-----IAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGR
              +       ++N+LIDMYAKCG+++ A E+F  + D+++ TWNT+I+ LA H  AE ++ +F +M   K+  P++VTF+GV+ AC+H G+V+EGR
Subjt:  GNDRGKT-----IAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGR

Query:  RYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNE
        +YF LM   +NI+P +KHYG MVD+LGRAG +EEA+  + SM +E NAI+WRTLL AC+++GNV+LG+     +L +  D S DYVLL+N+YAS+GQW+ 
Subjt:  RYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNE

Query:  MMKIRKSMQRKGVQKP
        + K+RK      V+KP
Subjt:  MMKIRKSMQRKGVQKP

Arabidopsis top hitse value%identityAlignment
AT1G31920.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.2e-9739.51Show/hide
Query:  KEQRILSLFKQCSTVKDLNQVHARVIQ-SGFDQNLFVVGKLIEFCSVSD-RGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGI
        KEQ  L L K+C  + +  QVHAR I+ S F  + F    ++  C+ S     MNYA  +F GI++P  F +NTMIRG+  +   ++A  FY  M+++G 
Subjt:  KEQRILSLFKQCSTVKDLNQVHARVIQ-SGFDQNLFVVGKLIEFCSVSD-RGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGI

Query:  AADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQML-QC
          DNFT+  LLK   +L SI  GKQ+H  + KLGLE+ V+V+N+LI+MYGR  +++++  +F+++      +W++++      GM++E L LF  M  + 
Subjt:  AADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQML-QC

Query:  GVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLA
         ++ +E+ +V  + AC+  GAL+ G  +H  +  N     I V  SL+DMY KCG ++ A  +F  +  +N +T++ MI  LA HG+ E AL +FSKM+ 
Subjt:  GVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLA

Query:  EKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRS
        E LE PD V ++ VL AC+H G V+EGRR F  M K+  ++PT +HYG +VD+LGRAG +EEA + I+S+P+E N +IWRT L+ CRV  N++LG+    
Subjt:  EKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRS

Query:  HVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSYLEI
         +L+L   +  DY+L++N+Y+    W+++ + R  +  KG+ K  PG S +E+
Subjt:  HVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSYLEI

AT1G59720.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.8e-10140.77Show/hide
Query:  QRILSLFKQCSTVKDLNQVHARVIQSGFDQ---NLFVVGKLIEFCSVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGR-ISKLQKAFEFYKRMLEKGI
        QRI SL + CS +  L Q+HA  +++ + +    LF+ GK+++    S   D+NYA  VF+ IEN   F+WNT+IR     +S+ ++AF  Y++MLE+G 
Subjt:  QRILSLFKQCSTVKDLNQVHARVIQSGFDQ---NLFVVGKLIEFCSVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGR-ISKLQKAFEFYKRMLEKGI

Query:  AA-DNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQC
        ++ D  TF F+LK    +     GKQ+H  I+K G    VYV N LIH+YG    + +AR +FDEMP   LV+WN++ID  V  G Y+ AL LF +M Q 
Subjt:  AA-DNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQC

Query:  GVQPDEATLVVTVSACSALGALDFGRWVHSHVKGN---DRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSK
          +PD  T+   +SAC+ LG+L  G W H+ +      D    + V NSLI+MY KCG++  A ++F  +  +++ +WN MI+  ATHG AE+A+  F +
Subjt:  GVQPDEATLVVTVSACSALGALDFGRWVHSHVKGN---DRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSK

Query:  MLAEKLET-PDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLL-AACRVHGNVKLG
        M+ ++    P+ VTF+G+L ACNH G V +GR+YFD+M + + I+P L+HYG +VD++ RAG++ EA  ++ SMPM+ +A+IWR+LL A C+   +V+L 
Subjt:  MLAEKLET-PDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLL-AACRVHGNVKLG

Query:  ERVRSHVLELEPDHSSD-------YVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSYLEIN
        E +  +++  + D+ S        YVLL+ +YAS+ +WN++  +RK M   G++K EPG S +EIN
Subjt:  ERVRSHVLELEPDHSSD-------YVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSYLEIN

AT2G02980.1 Pentatricopeptide repeat (PPR) superfamily protein2.1e-10140.13Show/hide
Query:  SSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS-DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRIS
        SS     T T+ +  D    Q  + L  +C+++++L Q+ A  I+S  +   F V KLI FC+ S     M+YA  +F  +  PD  ++N+M RG+ R +
Subjt:  SSDRKTATGTRPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS-DRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRIS

Query:  KLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSC
           + F  +  +LE GI  DN+TF  LLK      ++  G+QLH   +KLGL+ +VYV  TLI+MY   +D+  AR +FD +  P +V +N +I  +   
Subjt:  KLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSC

Query:  GMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALAT
           NEAL LF +M    ++P+E TL+  +S+C+ LG+LD G+W+H + K +   K + V  +LIDM+AKCG+++DA  +F  +  K+   W+ MI+A A 
Subjt:  GMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALAT

Query:  HGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLA
        HG AE ++ +F +M +E ++ PD++TFLG+L AC+H G+VEEGR+YF  M  +F I P++KHYGSMVD+L RAG +E+AY+ I  +P+    ++WR LLA
Subjt:  HGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLA

Query:  ACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQ-RKGVQKPEPGNSYLEIN
        AC  H N+ L E+V   + EL+  H  DYV+L+N+YA + +W  +  +RK M+ RK V+   PG S +E+N
Subjt:  ACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQ-RKGVQKPEPGNSYLEIN

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.2e-10142.08Show/hide
Query:  STVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS--DRGDMNYAVLVFNGIENP-DGFLWNTMIRGFGRISKLQKAFEFYKRMLEKG-IAADNFTFSFL
        S++  L Q+HA  I+ G   +   +GK + F  VS      M+YA  VF+ IE P + F+WNT+IRG+  I     AF  Y+ M   G +  D  T+ FL
Subjt:  STVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVS--DRGDMNYAVLVFNGIENP-DGFLWNTMIRGFGRISKLQKAFEFYKRMLEKG-IAADNFTFSFL

Query:  LKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVV
        +K    +  + LG+ +H  +++ G  S +YV+N+L+H+Y    D+  A  +FD+MP  DLVAWN+VI+     G   EAL L+ +M   G++PD  T+V 
Subjt:  LKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVV

Query:  TVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTF
         +SAC+ +GAL  G+ VH ++      + +   N L+D+YA+CG VE+A+ +F+ + DKN V+W ++I+ LA +G  ++A+ LF  M + +   P ++TF
Subjt:  TVSACSALGALDFGRWVHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTF

Query:  LGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSS
        +G+L AC+H G V+EG  YF  M +++ I+P ++H+G MVD+L RAG V++AY+ I+SMPM+ N +IWRTLL AC VHG+  L E  R  +L+LEP+HS 
Subjt:  LGVLCACNHGGKVEEGRRYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSS

Query:  DYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSYLEI
        DYVLL+NMYAS  +W+++ KIRK M R GV+K  PG+S +E+
Subjt:  DYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPGNSYLEI

AT5G15300.1 Pentatricopeptide repeat (PPR) superfamily protein6.6e-9536.05Show/hide
Query:  RPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKR
        R   +D    +R   L++ C  ++ L Q+HA ++ +G   NL VVG+LI   S+S  G + YA  +F+ I  PD  + N ++RG  +  K +K    Y  
Subjt:  RPNGSDFAKEQRILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKR

Query:  MLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKI-------------------------------ARNLFD
        M ++G++ D +TF+F+LK   +L     G   H  +++ G   + YV+N LI  +    D+ I                               A  LFD
Subjt:  MLEKGIAADNFTFSFLLKISGQLGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKI-------------------------------ARNLFD

Query:  EMP-------------------------------RPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVK
        EMP                                 D+V WN +I  +V+CG   EALG+F +M   G  PD  T++  +SAC+ LG L+ G+ +H ++ 
Subjt:  EMP-------------------------------RPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRWVHSHVK

Query:  GNDRGKT-----IAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGR
              +       ++N+LIDMYAKCG+++ A E+F  + D+++ TWNT+I+ LA H  AE ++ +F +M   K+  P++VTF+GV+ AC+H G+V+EGR
Subjt:  GNDRGKT-----IAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGR

Query:  RYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNE
        +YF LM   +NI+P +KHYG MVD+LGRAG +EEA+  + SM +E NAI+WRTLL AC+++GNV+LG+     +L +  D S DYVLL+N+YAS+GQW+ 
Subjt:  RYFDLMTKQFNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNE

Query:  MMKIRKSMQRKGVQKP
        + K+RK      V+KP
Subjt:  MMKIRKSMQRKGVQKP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAATCAATCACCCACTTAATCTGGGAGTGCCGAACTGGAATCCAAAGGATTGGTGGGGCTGGATGAAGGAAAACTTGGATGAAGAAGAAATTGCTAAAAGTGCTGT
CATTATGTGGAAAATATGGAATTATTGCAACAGACCAGCCAACACTCATATTCGTGGAGTAGAAGCCCTCTATCAAAGCATAGAATTCAGCATAAATGAAATTGAAGATT
ATTACCTCAAGTCCCGATCCTCCGATAGAATTGGGAGCCTCTCGAATCAAGCTCCGATGAAGAATCCGACCCCAAATTGCTGGCTTCTGAAATCCGACGCCTCCTGGAAC
GAGAATCTGGGCAATGGTGGCGTGGGTTGGGTGGTTCGTGACTCGGAGGGATCACTTGTCTGCGTAGGGAGGAAGATAATTCGGAAAAATTGGTCGATCAAGACGCTGGA
AGCTAAAGCTTTGTTAGAAGGGCTTCGTCAAATTGCTGACACCTGCAAACGACGCTCCATTGCCATAGAAATTCAGACAGATGCTCTGGAGATCATCAACATCCTCAATG
GAAAAATAGAGGATCTATCGGAAGTGAAATCTCTCATCGACCAAATTAAAGTCATCGCCTCAGATGTGCAAATCGTGGGTTTTAGTCATTATAGTAGGGTTTTGAACACA
GACGCGCACTGTGTTGTGAGATCGACCATGGATCTCCTGATTGTTGAAGGTGATCTCCATGGAAGTGACTCTTCGCGGGAAGAGGGGCTAGTTTTTTGGGCTCCCTTTTG
GCCTTCGTGGTTAGATCCCCTCATTAAGGAGGATTTGAACAGAATTGTTATGATCGGCATCAACAGCAGAAACTCGAACTTGCCACTCTCTCTGTGTTCACAAATTAAGT
TCAAATTTCCACATTTTCAATTGCTTTTCCCTAAGTTCATTTCGAGTTCCGATCGAAAAACAGCAACTGGTACAAGACCCAATGGCAGCGATTTTGCCAAGGAGCAAAGA
ATTCTGTCTCTTTTCAAGCAATGCTCAACCGTGAAAGATTTGAATCAAGTCCATGCCCGTGTTATCCAGTCGGGTTTCGATCAGAATCTCTTCGTTGTTGGCAAACTCAT
TGAGTTTTGTTCGGTTTCGGACCGTGGCGATATGAATTATGCTGTTCTTGTTTTCAACGGTATCGAGAACCCAGATGGGTTTCTTTGGAATACAATGATCAGGGGATTTG
GAAGAATTAGTAAGCTGCAAAAGGCGTTTGAGTTCTACAAGAGAATGCTAGAGAAGGGGATAGCTGCAGATAATTTCACTTTTTCTTTCTTACTAAAGATTTCTGGGCAG
TTGGGTTCGATTATGTTGGGCAAGCAGTTACATGTTGGTATTCTGAAACTTGGCCTTGAATCCCATGTGTATGTGAGGAACACGCTTATTCATATGTATGGCAGGTTAAA
AGACATCAAAATAGCACGCAACCTGTTTGATGAAATGCCTAGACCAGATTTAGTGGCTTGGAATACAGTCATTGACTGTCATGTCTCTTGTGGGATGTACAATGAAGCAC
TTGGCCTGTTTCTTCAAATGTTGCAGTGTGGCGTACAGCCTGATGAAGCCACACTGGTTGTGACAGTCTCAGCATGCTCTGCGTTGGGTGCACTGGACTTTGGGAGGTGG
GTTCATTCCCATGTGAAGGGTAACGATAGAGGGAAGACTATAGCTGTTTTCAATTCTTTGATCGACATGTACGCCAAGTGTGGAGCAGTCGAAGACGCACGAGAGATGTT
TAATGCAATAGGTGACAAGAACATAGTAACATGGAACACAATGATCATGGCGTTAGCAACACATGGTGATGCAGAGGATGCACTGACACTATTCTCAAAGATGTTAGCGG
AGAAGCTTGAGACTCCTGATGATGTAACTTTCTTGGGAGTACTGTGTGCTTGTAACCATGGAGGAAAGGTGGAAGAAGGGAGGAGGTATTTTGATCTCATGACCAAACAG
TTCAATATCCAACCCACACTGAAGCACTATGGATCCATGGTGGATATTCTGGGACGAGCTGGGTTTGTAGAAGAAGCTTATCAGCTGATAAGGAGCATGCCAATGGAGTG
CAATGCCATTATATGGAGAACATTACTCGCTGCCTGTCGGGTGCATGGAAATGTTAAACTCGGGGAGAGAGTGAGGAGCCATGTTCTGGAGCTAGAGCCAGATCATAGTA
GTGATTATGTTCTTCTTGCAAATATGTATGCAAGCTCTGGTCAATGGAATGAAATGATGAAGATCAGAAAATCAATGCAACGAAAAGGAGTGCAGAAACCAGAGCCTGGT
AATAGTTATTTGGAAATCAATCCGCAAAGGAAGTTGGAGTTGGAAACTGTTGGGAATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGAATCAATCACCCACTTAATCTGGGAGTGCCGAACTGGAATCCAAAGGATTGGTGGGGCTGGATGAAGGAAAACTTGGATGAAGAAGAAATTGCTAAAAGTGCTGT
CATTATGTGGAAAATATGGAATTATTGCAACAGACCAGCCAACACTCATATTCGTGGAGTAGAAGCCCTCTATCAAAGCATAGAATTCAGCATAAATGAAATTGAAGATT
ATTACCTCAAGTCCCGATCCTCCGATAGAATTGGGAGCCTCTCGAATCAAGCTCCGATGAAGAATCCGACCCCAAATTGCTGGCTTCTGAAATCCGACGCCTCCTGGAAC
GAGAATCTGGGCAATGGTGGCGTGGGTTGGGTGGTTCGTGACTCGGAGGGATCACTTGTCTGCGTAGGGAGGAAGATAATTCGGAAAAATTGGTCGATCAAGACGCTGGA
AGCTAAAGCTTTGTTAGAAGGGCTTCGTCAAATTGCTGACACCTGCAAACGACGCTCCATTGCCATAGAAATTCAGACAGATGCTCTGGAGATCATCAACATCCTCAATG
GAAAAATAGAGGATCTATCGGAAGTGAAATCTCTCATCGACCAAATTAAAGTCATCGCCTCAGATGTGCAAATCGTGGGTTTTAGTCATTATAGTAGGGTTTTGAACACA
GACGCGCACTGTGTTGTGAGATCGACCATGGATCTCCTGATTGTTGAAGGTGATCTCCATGGAAGTGACTCTTCGCGGGAAGAGGGGCTAGTTTTTTGGGCTCCCTTTTG
GCCTTCGTGGTTAGATCCCCTCATTAAGGAGGATTTGAACAGAATTGTTATGATCGGCATCAACAGCAGAAACTCGAACTTGCCACTCTCTCTGTGTTCACAAATTAAGT
TCAAATTTCCACATTTTCAATTGCTTTTCCCTAAGTTCATTTCGAGTTCCGATCGAAAAACAGCAACTGGTACAAGACCCAATGGCAGCGATTTTGCCAAGGAGCAAAGA
ATTCTGTCTCTTTTCAAGCAATGCTCAACCGTGAAAGATTTGAATCAAGTCCATGCCCGTGTTATCCAGTCGGGTTTCGATCAGAATCTCTTCGTTGTTGGCAAACTCAT
TGAGTTTTGTTCGGTTTCGGACCGTGGCGATATGAATTATGCTGTTCTTGTTTTCAACGGTATCGAGAACCCAGATGGGTTTCTTTGGAATACAATGATCAGGGGATTTG
GAAGAATTAGTAAGCTGCAAAAGGCGTTTGAGTTCTACAAGAGAATGCTAGAGAAGGGGATAGCTGCAGATAATTTCACTTTTTCTTTCTTACTAAAGATTTCTGGGCAG
TTGGGTTCGATTATGTTGGGCAAGCAGTTACATGTTGGTATTCTGAAACTTGGCCTTGAATCCCATGTGTATGTGAGGAACACGCTTATTCATATGTATGGCAGGTTAAA
AGACATCAAAATAGCACGCAACCTGTTTGATGAAATGCCTAGACCAGATTTAGTGGCTTGGAATACAGTCATTGACTGTCATGTCTCTTGTGGGATGTACAATGAAGCAC
TTGGCCTGTTTCTTCAAATGTTGCAGTGTGGCGTACAGCCTGATGAAGCCACACTGGTTGTGACAGTCTCAGCATGCTCTGCGTTGGGTGCACTGGACTTTGGGAGGTGG
GTTCATTCCCATGTGAAGGGTAACGATAGAGGGAAGACTATAGCTGTTTTCAATTCTTTGATCGACATGTACGCCAAGTGTGGAGCAGTCGAAGACGCACGAGAGATGTT
TAATGCAATAGGTGACAAGAACATAGTAACATGGAACACAATGATCATGGCGTTAGCAACACATGGTGATGCAGAGGATGCACTGACACTATTCTCAAAGATGTTAGCGG
AGAAGCTTGAGACTCCTGATGATGTAACTTTCTTGGGAGTACTGTGTGCTTGTAACCATGGAGGAAAGGTGGAAGAAGGGAGGAGGTATTTTGATCTCATGACCAAACAG
TTCAATATCCAACCCACACTGAAGCACTATGGATCCATGGTGGATATTCTGGGACGAGCTGGGTTTGTAGAAGAAGCTTATCAGCTGATAAGGAGCATGCCAATGGAGTG
CAATGCCATTATATGGAGAACATTACTCGCTGCCTGTCGGGTGCATGGAAATGTTAAACTCGGGGAGAGAGTGAGGAGCCATGTTCTGGAGCTAGAGCCAGATCATAGTA
GTGATTATGTTCTTCTTGCAAATATGTATGCAAGCTCTGGTCAATGGAATGAAATGATGAAGATCAGAAAATCAATGCAACGAAAAGGAGTGCAGAAACCAGAGCCTGGT
AATAGTTATTTGGAAATCAATCCGCAAAGGAAGTTGGAGTTGGAAACTGTTGGGAATTGA
Protein sequenceShow/hide protein sequence
MGINHPLNLGVPNWNPKDWWGWMKENLDEEEIAKSAVIMWKIWNYCNRPANTHIRGVEALYQSIEFSINEIEDYYLKSRSSDRIGSLSNQAPMKNPTPNCWLLKSDASWN
ENLGNGGVGWVVRDSEGSLVCVGRKIIRKNWSIKTLEAKALLEGLRQIADTCKRRSIAIEIQTDALEIINILNGKIEDLSEVKSLIDQIKVIASDVQIVGFSHYSRVLNT
DAHCVVRSTMDLLIVEGDLHGSDSSREEGLVFWAPFWPSWLDPLIKEDLNRIVMIGINSRNSNLPLSLCSQIKFKFPHFQLLFPKFISSSDRKTATGTRPNGSDFAKEQR
ILSLFKQCSTVKDLNQVHARVIQSGFDQNLFVVGKLIEFCSVSDRGDMNYAVLVFNGIENPDGFLWNTMIRGFGRISKLQKAFEFYKRMLEKGIAADNFTFSFLLKISGQ
LGSIMLGKQLHVGILKLGLESHVYVRNTLIHMYGRLKDIKIARNLFDEMPRPDLVAWNTVIDCHVSCGMYNEALGLFLQMLQCGVQPDEATLVVTVSACSALGALDFGRW
VHSHVKGNDRGKTIAVFNSLIDMYAKCGAVEDAREMFNAIGDKNIVTWNTMIMALATHGDAEDALTLFSKMLAEKLETPDDVTFLGVLCACNHGGKVEEGRRYFDLMTKQ
FNIQPTLKHYGSMVDILGRAGFVEEAYQLIRSMPMECNAIIWRTLLAACRVHGNVKLGERVRSHVLELEPDHSSDYVLLANMYASSGQWNEMMKIRKSMQRKGVQKPEPG
NSYLEINPQRKLELETVGN