CuGenDBv2

IVF0020161 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0020161
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr02:10644796..10646578
RNA-Seq Expression: IVF0020161
Synteny: IVF0020161
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits | e value | % identity | Alignment
KAA0038403.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | 0.0 | 91.75
Query:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
        MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
Subjt:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG

Query:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM
        HEIK                                      ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM
Subjt:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM

Query:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
        SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
Subjt:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL

Query:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
        EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
Subjt:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS

Query:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---
        AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF   
Subjt:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---

Query:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
                ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
Subjt:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV

XP_004142592.1 pentatricopeptide repeat-containing protein At1g08610 [Cucumis sativus] | 0.0 | 83.78
Query:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
        MAY LTVQNYMVTVNG HECSKQEYASTGIGQCLLEKEK SSLHL  LCK+S   SYSCHWSTTLG GRKQRVLHFKGLQRSVCIDRVD+TYEDEL LNG
Subjt:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG

Query:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM
        HEIK                                      ILQKFC KGKLMEASR+VDIMASRNQIPDF CC+N+IRGFV  DR+DKAVQVLKIMVM
Subjt:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM

Query:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
        SGGVPD+ITYNMVIG LCKQGHLESAIELL+DMSLSGCPPDVITYNAVIRHMFDNGCFDQA+EFWKEQLRKGTPPYLITYTILIELVWKHRGTV A+EVL
Subjt:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL

Query:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
        EEMANEGCYPDLVTYNSLINLTCKQGKFED ALVIDNLLFHGMVPDAVTYNTLLHSLSRRG WDEVDEILKIMSISLQPPTVVT NVLINGLCKNGLLDS
Subjt:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS

Query:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFI--
        AINFLNQMFS NCLPDIITYNTLLGAL KEGMVDEAFQLLHLL  +ACSPGLISYNTVLDGLS+KGYMDKAMSLYSQMMENGI+PDD THRSIIWGF   
Subjt:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFI--

Query:  ---------LKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQ
                 LKGILKGG++VNSS YR LVHELC+NKKVDLAIQVLEMMLSS CK NETIYSTIINSIAS+GLKEQADELRQKLIE KVLGKQ
Subjt:  ---------LKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQ

XP_008443760.1 PREDICTED: pentatricopeptide repeat-containing protein At1g08610 [Cucumis melo] | 0.0 | 91.41
Query:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
        MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
Subjt:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG

Query:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM
        HEIK                                      ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQ LKIMVM
Subjt:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM

Query:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
        SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
Subjt:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL

Query:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
        EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
Subjt:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS

Query:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---
        AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF   
Subjt:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---

Query:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
                ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGK+AV
Subjt:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV

XP_022158526.1 pentatricopeptide repeat-containing protein At1g08610 [Momordica charantia] | 0.0 | 76.43
Query:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
        MAYTLT QNY VT +G HECSKQEY ST I QC +EK   S+LHLNCLCKSSC SSYSCHWS  LG  RK RVLH +G+QRSVCIDRVD+ Y+DEL LNG
Subjt:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG

Query:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM
        HE+K                                      ILQKFCNKGKLMEASRLVDIMA RNQIP+FHCC+NLIRGFVKIDR+DKAVQVLKIMVM
Subjt:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM

Query:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
        SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIR M DNG FDQA+EFWKEQLRKGTPPYLITYTILIELV KHRGT+ AIEVL
Subjt:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL

Query:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
        EEMA EGCYPDLVTYNSLINL CKQGKFE   LVI++LL HGM P+AVTYNTLLHSL+RRGRWDEVDEIL IMS S QPPTVVTYN+LINGLCKNGLLD 
Subjt:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS

Query:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---
        AINFLNQMFS +CLPDIITYNTLLGALSKEG+VDEAFQLLHLL G++CSPGLISYN V+DGLSKKG M+KAM LYSQM+ENGI+PD+ITHR++IWG+   
Subjt:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---

Query:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
                ILKG +K G K++SSSY FLVHELCIN+KVDL+IQVLE+MLS++CKPNETIYSTII+SIAS+G K+QADELR+KLIE KVLGK+AV
Subjt:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV

XP_038880487.1 pentatricopeptide repeat-containing protein At1g08610 [Benincasa hispida] | 0.0 | 83.16
Query:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
        MAYTLTVQNYMVTVNG HECSKQEYA+TGIGQC LEKEK SSLHLNC CKSSC SSYSCH STTLG GRKQ VLH KGLQRSVCIDRVD+ YEDEL LNG
Subjt:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG

Query:  HEI--------------------------------------KILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM
        HE                                       +ILQKFCNKGKLMEASRLVDIMA RNQIP+FHCCVNLIRGFVKIDR+DKAVQVLKIMVM
Subjt:  HEI--------------------------------------KILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM

Query:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
        SGG PD+ITYNM+IGGLCKQGHL+SAIELLD+MS SGCPPDVITYNAVIR MFDNG FDQA+EFWKEQ+RKGTPPYLITYTILIEL+ KH GT RAIEVL
Subjt:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL

Query:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
        EEMANEGCYPDLVTYNSLINLTCKQGKFED ALVIDNLLFHGMVP+AVTYNTLLHSLSRRGRWDEVDEIL IMSISLQPPTVVTYNVLINGLCKNGLLD 
Subjt:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS

Query:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---
        AINFLNQMFS NCLPDIITYNTLLGALSKEGMVDEAFQLLHLL G+ CSPGLISYNTVLDGLSKKGYMDKAMSLY QM ENGI+PDDITHRSIIWG    
Subjt:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---

Query:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
                ILKG L+ GHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSS+ KPNETIYSTIINSIAS+GLKEQADELRQKLIE KVLGKQAV
Subjt:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LY76 Uncharacterized protein | 7.9e-281 | 83.78
Query:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
        MAY LTVQNYMVTVNG HECSKQEYASTGIGQCLLEKEK SSLHL  LCK+S   SYSCHWSTTLG GRKQRVLHFKGLQRSVCIDRVD+TYEDEL LNG
Subjt:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG

Query:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM
        HEIK                                      ILQKFC KGKLMEASR+VDIMASRNQIPDF CC+N+IRGFV  DR+DKAVQVLKIMVM
Subjt:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM

Query:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
        SGGVPD+ITYNMVIG LCKQGHLESAIELL+DMSLSGCPPDVITYNAVIRHMFDNGCFDQA+EFWKEQLRKGTPPYLITYTILIELVWKHRGTV A+EVL
Subjt:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL

Query:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
        EEMANEGCYPDLVTYNSLINLTCKQGKFED ALVIDNLLFHGMVPDAVTYNTLLHSLSRRG WDEVDEILKIMSISLQPPTVVT NVLINGLCKNGLLDS
Subjt:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS

Query:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---
        AINFLNQMFS NCLPDIITYNTLLGAL KEGMVDEAFQLLHLL  +ACSPGLISYNTVLDGLS+KGYMDKAMSLYSQMMENGI+PDD THRSIIWGF   
Subjt:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---

Query:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQ
                 LKGILKGG++VNSS YR LVHELC+NKKVDLAIQVLEMMLSS CK NETIYSTIINSIAS+GLKEQADELRQKLIE KVLGKQ
Subjt:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQ

A0A1S3B8B2 pentatricopeptide repeat-containing protein At1g08610 | 4.1e-310 | 91.41
Query:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
        MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
Subjt:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG

Query:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM
        HEIK                                      ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQ LKIMVM
Subjt:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM

Query:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
        SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
Subjt:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL

Query:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
        EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
Subjt:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS

Query:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---
        AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF   
Subjt:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---

Query:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
                ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGK+AV
Subjt:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV

A0A5A7T640 Pentatricopeptide repeat-containing protein | 0.0e+00 | 91.75
Query:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
        MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
Subjt:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG

Query:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM
        HEIK                                      ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM
Subjt:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM

Query:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
        SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
Subjt:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL

Query:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
        EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
Subjt:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS

Query:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---
        AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF   
Subjt:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---

Query:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
                ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
Subjt:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV

A0A6J1E155 pentatricopeptide repeat-containing protein At1g08610 | 5.5e-258 | 76.43
Query:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
        MAYTLT QNY VT +G HECSKQEY ST I QC +EK   S+LHLNCLCKSSC SSYSCHWS  LG  RK RVLH +G+QRSVCIDRVD+ Y+DEL LNG
Subjt:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG

Query:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM
        HE+K                                      ILQKFCNKGKLMEASRLVDIMA RNQIP+FHCC+NLIRGFVKIDR+DKAVQVLKIMVM
Subjt:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM

Query:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
        SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIR M DNG FDQA+EFWKEQLRKGTPPYLITYTILIELV KHRGT+ AIEVL
Subjt:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL

Query:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
        EEMA EGCYPDLVTYNSLINL CKQGKFE   LVI++LL HGM P+AVTYNTLLHSL+RRGRWDEVDEIL IMS S QPPTVVTYN+LINGLCKNGLLD 
Subjt:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS

Query:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---
        AINFLNQMFS +CLPDIITYNTLLGALSKEG+VDEAFQLLHLL G++CSPGLISYN V+DGLSKKG M+KAM LYSQM+ENGI+PD+ITHR++IWG+   
Subjt:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---

Query:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
                ILKG +K G K++SSSY FLVHELCIN+KVDL+IQVLE+MLS++CKPNETIYSTII+SIAS+G K+QADELR+KLIE KVLGK+AV
Subjt:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV

A0A6J1HC71 pentatricopeptide repeat-containing protein At1g08610 | 1.6e-257 | 76.43
Query:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG
        MAYTLTVQNYMVT+NG HEC KQEY STG GQCL+EKEK  S+HL+CLCKSSC SSYS HWSTT GFGRKQRVL  KG Q SVCIDRVD+ Y+DEL LNG
Subjt:  MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNG

Query:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM
        HEI+                                      ILQKFC  GKLME+SRLVDIMA RNQIPDFHCCV LIRG VKID +DKAVQVLKIMVM
Subjt:  HEIK--------------------------------------ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVM

Query:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL
        SGG+PDVITYNMVIGGLCK+G+LESA+ELLDDMSLSGCPPDV+TYNAVIRHMFDNGCFDQA+EFWKEQ+RKGTPPYLITYTILIELV KH GTVRAIEVL
Subjt:  SGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVL

Query:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS
        EEMAN GCYPD+VTYNSLIN+TCKQGK+ED  LVIDNLLFHGMVPD VTYNTLLHSLSRRG WDEV E+L IM+ +L PPTVVTYNVLINGLCKNG LD 
Subjt:  EEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDS

Query:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---
        AINFLNQMF+ N LPDIITYNTLL ALSKE MVDEAFQLLHLLIG+ CSP LISYNTVL+GLSKKGY+D+A SLY+QM++NGI+PDD T RS+I GF   
Subjt:  AINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF---

Query:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
                ++KG LK G++VNS SYR+LVHELCINKKVDLAIQVLEMMLSS+CKPNE+IY TIINSIAS+GLKEQADELRQKLIE KVLGK+ V
Subjt:  --------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV

SwissProt top hits | e value | % identity | Alignment
A3KPF8 Pentatricopeptide repeat-containing protein At1g79080, chloroplastic | 1.0e-59 | 30.67
Query:  PDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQL
        P+      L+    K +R+ KA++V+++MV SG +PD   Y  ++  LCK+G++  A++L++ M   G P + +TYNA++R +   G  +Q+++F +  +
Subjt:  PDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQL

Query:  RKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEI
        +KG  P   TY+ L+E  +K RGT  A+++L+E+  +G  P+LV+YN L+   CK+G+ +D   +   L   G   + V+YN LL  L   GRW+E + +
Subjt:  RKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEI

Query:  LKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNN--CLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYN-----------
        L  M    + P+VVTYN+LIN L  +G  + A+  L +M   N        +YN ++  L KEG VD   + L  +I   C P   +YN           
Subjt:  LKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNN--CLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYN-----------

Query:  ------------------------TVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFILKGILKGGHKVNS------------SSYRFLVHE
                                +V+  L +KG    A  L  +M   G  PD  T+ ++I G  L+G+  G  +V S             ++  ++  
Subjt:  ------------------------TVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFILKGILKGGHKVNS------------SSYRFLVHE

Query:  LCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
        LC  ++ DLA++V EMM+  +  PNET Y+ ++  IA     E A E+  +L   KV+G+ AV
Subjt:  LCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV

Q3EDF8 Pentatricopeptide repeat-containing protein At1g09900 | 2.3e-88 | 34.69
Query:  HFKGLQRSVCIDRVDNTYEDELVLNGHEIKILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNM
        H+  +  S  ++ V++        N H    L++    G+L E  + ++ M     +PD   C  LIRGF ++ +  KA ++L+I+  SG VPDVITYN+
Subjt:  HFKGLQRSVCIDRVDNTYEDELVLNGHEIKILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNM

Query:  VIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDL
        +I G CK G + +A+ +LD MS+S   PDV+TYN ++R + D+G   QA+E     L++   P +ITYTILIE   +  G   A+++L+EM + GC PD+
Subjt:  VIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDL

Query:  VTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLL--------------
        VTYN L+N  CK+G+ ++    ++++   G  P+ +T+N +L S+   GRW + +++L  M      P+VVT+N+LIN LC+ GLL              
Subjt:  VTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLL--------------

Query:  ---------------------DSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQ
                             D AI +L +M S  C PDI+TYNT+L AL K+G V++A ++L+ L    CSP LI+YNTV+DGL+K G   KA+ L  +
Subjt:  ---------------------DSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQ

Query:  MMENGIMPDDITHRSIIWGFILKGIL-----------KGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQAD
        M    + PD IT+ S++ G   +G +           + G + N+ ++  ++  LC +++ D AI  L  M++  CKPNET Y+ +I  +A  G+ ++A 
Subjt:  MMENGIMPDDITHRSIIWGFILKGIL-----------KGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQAD

Query:  ELRQKLIELKVLGKQA
        EL  +L    ++ K +
Subjt:  ELRQKLIELKVLGKQA

Q9CAN0 Pentatricopeptide repeat-containing protein At1g63130, mitochondrial | 9.0e-56 | 27.09
Query:  ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDV
        ++  FC + +L  A  ++  M      PD     +L+ GF   +RI  AV ++  MV  G  PD  T+N +I GL +      A+ L+D M + GC PD+
Subjt:  ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDV

Query:  ITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHG
        +TY  V+  +   G  D A+   K+  +    P ++ Y  +I+ +  ++    A+ +  EM N+G  P++VTYNSLI   C  G++ D + ++ +++   
Subjt:  ITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHG

Query:  MVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHL
        + P+ VT++ L+ +  + G+  E +++   M      P + TY+ LING C +  LD A +    M S +C P+++TYNTL+    K   VDE  +L   
Subjt:  MVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHL

Query:  LIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFILKG-----------ILKGGHKVNSSSYRFLVHELCINKKVDLAI
        +         ++Y T++ G  +    D A  ++ QM+ +G++PD +T+  ++ G    G           + +   + +  +Y  ++  +C   KV+   
Subjt:  LIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFILKG-----------ILKGGHKVNSSSYRFLVHELCINKKVDLAI

Query:  QVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIE
         +   +     KPN   Y+T+++     GLKE+AD L +++ E
Subjt:  QVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIE

Q9FRS4 Pentatricopeptide repeat-containing protein At1g08610 | 2.9e-147 | 55.36
Query:  KILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPD
        +IL   C+ GKL +A +LV++MA  NQ+P F  C NL+RG  +ID++DKA+ +L++MVMSGGVPD ITYNM+IG LCK+GH+ +A+ LL+DMSLSG PPD
Subjt:  KILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPD

Query:  VITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFH
        VITYN VIR MFD G  +QAI FWK+QL+ G PP++ITYT+L+ELV ++ G+ RAIEVLE+MA EGCYPD+VTYNSL+N  C++G  E+ A VI ++L H
Subjt:  VITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFH

Query:  GMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLH
        G+  + VTYNTLLHSL     WDEV+EIL IM  +   PTV+TYN+LINGLCK  LL  AI+F  QM    CLPDI+TYNT+LGA+SKEGMVD+A +LL 
Subjt:  GMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLH

Query:  LLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF-----------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLA
        LL  + C PGLI+YN+V+DGL+KKG M KA+ LY QM++ GI PDDIT RS+I+GF           +LK     G+ +  S+YR ++  LC  K++++A
Subjt:  LLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF-----------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLA

Query:  IQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVL
        I+V+E+ML+  CKP+ETIY+ I+  +   G+  +A +L++KL + K+L
Subjt:  IQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVL

Q9SR00 Pentatricopeptide repeat-containing protein At3g04760, chloroplastic | 1.9e-74 | 32.3
Query:  IKILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPP
        +KI  + C  G  +E+  L++ M  +   PD   C  LI+GF  +  I KAV+V++I+    G PDV  YN +I G CK   ++ A  +LD M      P
Subjt:  IKILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPP

Query:  DVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLF
        D +TYN +I  +   G  D A++   + L     P +ITYTILIE      G   A+++++EM + G  PD+ TYN++I   CK+G  +    ++ NL  
Subjt:  DVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLF

Query:  HGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKN-----------------------------------GLLDSAINFL
         G  PD ++YN LL +L  +G+W+E ++++  M      P VVTY++LI  LC++                                   G LD AI FL
Subjt:  HGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKN-----------------------------------GLLDSAINFL

Query:  NQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFILKGILKG
          M S+ CLPDI+ YNT+L  L K G  D+A ++   L    CSP   SYNT+   L   G   +A+ +  +MM NGI PD+IT+ S+I     +G++  
Subjt:  NQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFILKGILKG

Query:  GHKV-----------NSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGK
          ++           +  +Y  ++   C   +++ AI VLE M+ + C+PNET Y+ +I  I  +G + +A EL   L+ +  + +
Subjt:  GHKV-----------NSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGK

Arabidopsis top hits | e value | % identity | Alignment
AT1G08610.1 Pentatricopeptide repeat (PPR) superfamily protein | 2.1e-148 | 55.36
Query:  KILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPD
        +IL   C+ GKL +A +LV++MA  NQ+P F  C NL+RG  +ID++DKA+ +L++MVMSGGVPD ITYNM+IG LCK+GH+ +A+ LL+DMSLSG PPD
Subjt:  KILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPD

Query:  VITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFH
        VITYN VIR MFD G  +QAI FWK+QL+ G PP++ITYT+L+ELV ++ G+ RAIEVLE+MA EGCYPD+VTYNSL+N  C++G  E+ A VI ++L H
Subjt:  VITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFH

Query:  GMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLH
        G+  + VTYNTLLHSL     WDEV+EIL IM  +   PTV+TYN+LINGLCK  LL  AI+F  QM    CLPDI+TYNT+LGA+SKEGMVD+A +LL 
Subjt:  GMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLH

Query:  LLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF-----------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLA
        LL  + C PGLI+YN+V+DGL+KKG M KA+ LY QM++ GI PDDIT RS+I+GF           +LK     G+ +  S+YR ++  LC  K++++A
Subjt:  LLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGF-----------ILKGILKGGHKVNSSSYRFLVHELCINKKVDLA

Query:  IQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVL
        I+V+E+ML+  CKP+ETIY+ I+  +   G+  +A +L++KL + K+L
Subjt:  IQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVL

AT1G09900.1 Pentatricopeptide repeat (PPR-like) superfamily protein | 1.7e-89 | 34.69
Query:  HFKGLQRSVCIDRVDNTYEDELVLNGHEIKILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNM
        H+  +  S  ++ V++        N H    L++    G+L E  + ++ M     +PD   C  LIRGF ++ +  KA ++L+I+  SG VPDVITYN+
Subjt:  HFKGLQRSVCIDRVDNTYEDELVLNGHEIKILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNM

Query:  VIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDL
        +I G CK G + +A+ +LD MS+S   PDV+TYN ++R + D+G   QA+E     L++   P +ITYTILIE   +  G   A+++L+EM + GC PD+
Subjt:  VIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDL

Query:  VTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLL--------------
        VTYN L+N  CK+G+ ++    ++++   G  P+ +T+N +L S+   GRW + +++L  M      P+VVT+N+LIN LC+ GLL              
Subjt:  VTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLL--------------

Query:  ---------------------DSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQ
                             D AI +L +M S  C PDI+TYNT+L AL K+G V++A ++L+ L    CSP LI+YNTV+DGL+K G   KA+ L  +
Subjt:  ---------------------DSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQ

Query:  MMENGIMPDDITHRSIIWGFILKGIL-----------KGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQAD
        M    + PD IT+ S++ G   +G +           + G + N+ ++  ++  LC +++ D AI  L  M++  CKPNET Y+ +I  +A  G+ ++A 
Subjt:  MMENGIMPDDITHRSIIWGFILKGIL-----------KGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQAD

Query:  ELRQKLIELKVLGKQA
        EL  +L    ++ K +
Subjt:  ELRQKLIELKVLGKQA

AT1G63130.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 6.4e-57; % identity: 27.09)
Query:  ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDV
        ++  FC + +L  A  ++  M      PD     +L+ GF   +RI  AV ++  MV  G  PD  T+N +I GL +      A+ L+D M + GC PD+
Subjt:  ILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDV

Query:  ITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHG
        +TY  V+  +   G  D A+   K+  +    P ++ Y  +I+ +  ++    A+ +  EM N+G  P++VTYNSLI   C  G++ D + ++ +++   
Subjt:  ITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHG

Query:  MVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHL
        + P+ VT++ L+ +  + G+  E +++   M      P + TY+ LING C +  LD A +    M S +C P+++TYNTL+    K   VDE  +L   
Subjt:  MVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHL

Query:  LIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFILKG-----------ILKGGHKVNSSSYRFLVHELCINKKVDLAI
        +         ++Y T++ G  +    D A  ++ QM+ +G++PD +T+  ++ G    G           + +   + +  +Y  ++  +C   KV+   
Subjt:  LIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFILKG-----------ILKGGHKVNSSSYRFLVHELCINKKVDLAI

Query:  QVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIE
         +   +     KPN   Y+T+++     GLKE+AD L +++ E
Subjt:  QVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIE

AT1G79080.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 7.3e-61; % identity: 30.67)
Query:  PDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQL
        P+      L+    K +R+ KA++V+++MV SG +PD   Y  ++  LCK+G++  A++L++ M   G P + +TYNA++R +   G  +Q+++F +  +
Subjt:  PDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCFDQAIEFWKEQL

Query:  RKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEI
        +KG  P   TY+ L+E  +K RGT  A+++L+E+  +G  P+LV+YN L+   CK+G+ +D   +   L   G   + V+YN LL  L   GRW+E + +
Subjt:  RKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDEI

Query:  LKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNN--CLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYN-----------
        L  M    + P+VVTYN+LIN L  +G  + A+  L +M   N        +YN ++  L KEG VD   + L  +I   C P   +YN           
Subjt:  LKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNN--CLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYN-----------

Query:  ------------------------TVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFILKGILKGGHKVNS------------SSYRFLVHE
                                +V+  L +KG    A  L  +M   G  PD  T+ ++I G  L+G+  G  +V S             ++  ++  
Subjt:  ------------------------TVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFILKGILKGGHKVNS------------SSYRFLVHE

Query:  LCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
        LC  ++ DLA++V EMM+  +  PNET Y+ ++  IA     E A E+  +L   KV+G+ AV
Subjt:  LCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV

AT3G04760.1 Pentatricopeptide repeat (PPR-like) superfamily protein (e value: 1.4e-75; % identity: 32.3)
Query:  IKILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPP
        +KI  + C  G  +E+  L++ M  +   PD   C  LI+GF  +  I KAV+V++I+    G PDV  YN +I G CK   ++ A  +LD M      P
Subjt:  IKILQKFCNKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPP

Query:  DVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLF
        D +TYN +I  +   G  D A++   + L     P +ITYTILIE      G   A+++++EM + G  PD+ TYN++I   CK+G  +    ++ NL  
Subjt:  DVITYNAVIRHMFDNGCFDQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLF

Query:  HGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKN-----------------------------------GLLDSAINFL
         G  PD ++YN LL +L  +G+W+E ++++  M      P VVTY++LI  LC++                                   G LD AI FL
Subjt:  HGMVPDAVTYNTLLHSLSRRGRWDEVDEILKIMSISLQPPTVVTYNVLINGLCKN-----------------------------------GLLDSAINFL

Query:  NQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFILKGILKG
          M S+ CLPDI+ YNT+L  L K G  D+A ++   L    CSP   SYNT+   L   G   +A+ +  +MM NGI PD+IT+ S+I     +G++  
Subjt:  NQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQMMENGIMPDDITHRSIIWGFILKGILKG

Query:  GHKV-----------NSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGK
          ++           +  +Y  ++   C   +++ AI VLE M+ + C+PNET Y+ +I  I  +G + +A EL   L+ +  + +
Subjt:  GHKV-----------NSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGK
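
In the alignments above, the unlabeled middle line is the match line: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch (not the database's own method), identity and positive counts for a single alignment block can be recovered from that line alone. The function name `midline_stats` is hypothetical, and the example strings are the final block of the AT1G09900.1 alignment, so the percentage applies to that fragment only, not to the whole hit.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives from a BLAST-style match line."""
    # Trailing spaces of the match line are often lost in plain-text copies,
    # so pad it to the length of the query row before counting.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())  # letters = identical residues
    positives = identities + midline.count("+")            # '+' = conservative substitution
    aligned = len(query)                                    # aligned columns, gaps included
    return {
        "identities": identities,
        "positives": positives,
        "aligned": aligned,
        "pct_identity": round(100.0 * identities / aligned, 2),
    }


# Last block of the AT1G09900.1 alignment shown above.
query = "ELRQKLIELKVLGKQA"
midline = "EL  +L    ++ K +"
stats = midline_stats(query, midline)
print(stats)
```

Summing the counts over every block of a hit and dividing once at the end gives a figure comparable to the listed % identity; averaging per-block percentages would weight short blocks too heavily.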


Sequences
CDS sequence
ATGGCGTATACATTGACCGTTCAAAATTATATGGTAACAGTTAATGGTCAGCATGAATGTTCCAAACAAGAGTATGCTAGTACTGGTATCGGCCAATGCTTGCTGGAAAA
AGAAAAACTGTCTTCTTTACACTTAAATTGTTTGTGCAAGTCTAGTTGCATTAGTTCATATAGCTGTCATTGGAGTACAACGCTTGGTTTTGGAAGAAAACAACGGGTTT
TACATTTTAAAGGATTGCAGAGGAGTGTATGTATTGATAGAGTTGATAATACTTACGAAGATGAATTGGTGTTGAATGGCCATGAGATAAAGATTCTACAAAAGTTCTGC
AACAAGGGGAAGTTGATGGAAGCATCTAGGTTAGTTGATATTATGGCTAGTCGAAACCAGATTCCAGATTTCCATTGTTGCGTAAACTTGATTCGTGGGTTTGTAAAGAT
CGACCGAATAGATAAAGCTGTACAAGTCCTGAAAATCATGGTGATGTCTGGTGGTGTTCCAGATGTTATTACGTACAACATGGTGATTGGTGGTTTATGCAAGCAAGGAC
ATTTGGAATCTGCCATTGAGCTCTTGGACGACATGAGTTTGAGTGGTTGCCCCCCAGATGTTATTACATATAATGCAGTAATCCGCCACATGTTTGACAATGGATGTTTT
GATCAGGCTATTGAATTTTGGAAGGAACAGCTCAGAAAAGGAACTCCTCCTTATTTAATTACTTATACAATCCTCATTGAGCTAGTCTGGAAGCACCGTGGAACAGTTCG
TGCTATTGAAGTATTGGAAGAAATGGCTAATGAGGGTTGTTATCCTGATCTTGTCACATACAATTCCTTGATCAACTTAACCTGCAAACAGGGAAAATTTGAAGATACAG
CTTTAGTTATTGATAATCTTCTTTTCCATGGAATGGTACCCGATGCTGTGACTTACAACACCCTTCTCCATTCACTTTCAAGGCGTGGGCGTTGGGATGAAGTTGATGAA
ATCTTAAAAATCATGAGTATTAGTTTGCAGCCTCCAACAGTTGTCACGTACAATGTCTTGATTAATGGTCTATGTAAAAATGGACTCTTAGATAGTGCCATAAACTTTCT
CAATCAAATGTTTTCCAACAATTGTTTGCCTGACATTATAACGTACAACACTCTACTTGGTGCTCTTAGTAAGGAAGGTATGGTAGATGAGGCTTTTCAATTACTTCACC
TTTTAATTGGCTCAGCCTGCTCTCCTGGCTTAATTTCTTACAATACTGTGCTTGATGGGTTATCTAAAAAGGGGTACATGGATAAAGCAATGAGTTTATACAGTCAAATG
ATGGAAAATGGGATCATGCCAGATGATATCACCCATCGTTCTATAATTTGGGGTTTTATATTGAAGGGGATTCTCAAGGGAGGACACAAAGTGAATAGTAGTTCTTACAG
ATTTCTAGTTCATGAACTATGCATAAATAAGAAGGTGGATCTTGCAATACAAGTTCTGGAAATGATGTTATCAAGTCAATGTAAACCCAATGAGACAATTTATTCTACTA
TAATCAACAGCATAGCATCTTCCGGTTTAAAGGAACAGGCTGATGAGTTACGCCAGAAGTTGATAGAATTGAAGGTTTTAGGTAAGCAAGCAGTTTAG
mRNA sequence
ATGGCGTATACATTGACCGTTCAAAATTATATGGTAACAGTTAATGGTCAGCATGAATGTTCCAAACAAGAGTATGCTAGTACTGGTATCGGCCAATGCTTGCTGGAAAA
AGAAAAACTGTCTTCTTTACACTTAAATTGTTTGTGCAAGTCTAGTTGCATTAGTTCATATAGCTGTCATTGGAGTACAACGCTTGGTTTTGGAAGAAAACAACGGGTTT
TACATTTTAAAGGATTGCAGAGGAGTGTATGTATTGATAGAGTTGATAATACTTACGAAGATGAATTGGTGTTGAATGGCCATGAGATAAAGATTCTACAAAAGTTCTGC
AACAAGGGGAAGTTGATGGAAGCATCTAGGTTAGTTGATATTATGGCTAGTCGAAACCAGATTCCAGATTTCCATTGTTGCGTAAACTTGATTCGTGGGTTTGTAAAGAT
CGACCGAATAGATAAAGCTGTACAAGTCCTGAAAATCATGGTGATGTCTGGTGGTGTTCCAGATGTTATTACGTACAACATGGTGATTGGTGGTTTATGCAAGCAAGGAC
ATTTGGAATCTGCCATTGAGCTCTTGGACGACATGAGTTTGAGTGGTTGCCCCCCAGATGTTATTACATATAATGCAGTAATCCGCCACATGTTTGACAATGGATGTTTT
GATCAGGCTATTGAATTTTGGAAGGAACAGCTCAGAAAAGGAACTCCTCCTTATTTAATTACTTATACAATCCTCATTGAGCTAGTCTGGAAGCACCGTGGAACAGTTCG
TGCTATTGAAGTATTGGAAGAAATGGCTAATGAGGGTTGTTATCCTGATCTTGTCACATACAATTCCTTGATCAACTTAACCTGCAAACAGGGAAAATTTGAAGATACAG
CTTTAGTTATTGATAATCTTCTTTTCCATGGAATGGTACCCGATGCTGTGACTTACAACACCCTTCTCCATTCACTTTCAAGGCGTGGGCGTTGGGATGAAGTTGATGAA
ATCTTAAAAATCATGAGTATTAGTTTGCAGCCTCCAACAGTTGTCACGTACAATGTCTTGATTAATGGTCTATGTAAAAATGGACTCTTAGATAGTGCCATAAACTTTCT
CAATCAAATGTTTTCCAACAATTGTTTGCCTGACATTATAACGTACAACACTCTACTTGGTGCTCTTAGTAAGGAAGGTATGGTAGATGAGGCTTTTCAATTACTTCACC
TTTTAATTGGCTCAGCCTGCTCTCCTGGCTTAATTTCTTACAATACTGTGCTTGATGGGTTATCTAAAAAGGGGTACATGGATAAAGCAATGAGTTTATACAGTCAAATG
ATGGAAAATGGGATCATGCCAGATGATATCACCCATCGTTCTATAATTTGGGGTTTTATATTGAAGGGGATTCTCAAGGGAGGACACAAAGTGAATAGTAGTTCTTACAG
ATTTCTAGTTCATGAACTATGCATAAATAAGAAGGTGGATCTTGCAATACAAGTTCTGGAAATGATGTTATCAAGTCAATGTAAACCCAATGAGACAATTTATTCTACTA
TAATCAACAGCATAGCATCTTCCGGTTTAAAGGAACAGGCTGATGAGTTACGCCAGAAGTTGATAGAATTGAAGGTTTTAGGTAAGCAAGCAGTTTAG
Protein sequence
MAYTLTVQNYMVTVNGQHECSKQEYASTGIGQCLLEKEKLSSLHLNCLCKSSCISSYSCHWSTTLGFGRKQRVLHFKGLQRSVCIDRVDNTYEDELVLNGHEIKILQKFC
NKGKLMEASRLVDIMASRNQIPDFHCCVNLIRGFVKIDRIDKAVQVLKIMVMSGGVPDVITYNMVIGGLCKQGHLESAIELLDDMSLSGCPPDVITYNAVIRHMFDNGCF
DQAIEFWKEQLRKGTPPYLITYTILIELVWKHRGTVRAIEVLEEMANEGCYPDLVTYNSLINLTCKQGKFEDTALVIDNLLFHGMVPDAVTYNTLLHSLSRRGRWDEVDE
ILKIMSISLQPPTVVTYNVLINGLCKNGLLDSAINFLNQMFSNNCLPDIITYNTLLGALSKEGMVDEAFQLLHLLIGSACSPGLISYNTVLDGLSKKGYMDKAMSLYSQM
MENGIMPDDITHRSIIWGFILKGILKGGHKVNSSSYRFLVHELCINKKVDLAIQVLEMMLSSQCKPNETIYSTIINSIASSGLKEQADELRQKLIELKVLGKQAV
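
The protein sequence above should correspond to a translation of the CDS. The following is a minimal sketch for checking that, assuming the standard nuclear genetic code (translation table 1); the names `CODON_TABLE` and `translate` are illustrative and not part of any database tooling. Only the first 30 and last 18 nucleotides of the CDS are used here; the full sequence can be passed in the same way after joining its lines.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any incomplete trailing codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGGCGTATACATTGACCGTTCAAAATTAT"  # first 30 nt of the CDS above
cds_end = "GGTAAGCAAGCAGTTTAG"                # last 18 nt, ending in the stop codon
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to the first ten residues (MAYTLTVQNY) and the last five residues (GKQAV) of the listed protein, which is consistent with the CDS and protein records describing the same gene model.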