; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G25910 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G25910
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationChr5:25007260..25010048
RNA-Seq ExpressionCSPI05G25910
SyntenyCSPI05G25910
Gene Ontology termsGO:0080156 - mitochondrial mRNA modification (biological process)
GO:0005739 - mitochondrion (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8648806.1 hypothetical protein Csa_008294 [Cucumis sativus]0.0e+0099.01Show/hide
Query:  MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG
        MHNFKL PHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG
Subjt:  MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG

Query:  ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCT
        ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFND+QFVDEPKQNKQPAERNAVTETPFWLTYTSYNN+DHARIVATLLMNCT
Subjt:  ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCT

Query:  NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL
        NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYI MLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL
Subjt:  NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL

Query:  ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML
        ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGG AKEAVNMFIKLRQSGL+PDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML
Subjt:  ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML

Query:  NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP
        NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP
Subjt:  NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP

Query:  HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP
        HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP
Subjt:  HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP

Query:  GYSLATRLD
        GYSLATRLD
Subjt:  GYSLATRLD

XP_004135378.1 pentatricopeptide repeat-containing protein At1g77170, mitochondrial [Cucumis sativus]0.0e+0099.01Show/hide
Query:  MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG
        MHNFKL PHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG
Subjt:  MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG

Query:  ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCT
        ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFND+QFVDEPKQNKQPAERNAVTETPFWLTYTSYNN+DHARIVATLLMNCT
Subjt:  ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCT

Query:  NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL
        NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYI MLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL
Subjt:  NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL

Query:  ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML
        ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGG AKEAVNMFIKLRQSGL+PDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML
Subjt:  ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML

Query:  NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP
        NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP
Subjt:  NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP

Query:  HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP
        HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP
Subjt:  HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP

Query:  GYSLATRLD
        GYSLATRLD
Subjt:  GYSLATRLD

XP_008446707.1 PREDICTED: pentatricopeptide repeat-containing protein At1g77170 isoform X1 [Cucumis melo]3.1e-29684.89Show/hide
Query:  MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG
        MHNFKLLPHFLLLS KN+DCNSSSLHKLLLPSLSSAASRARRGFY YAP PLSG+EY CYN NNKFELQN GPDIHKEGKCLVLPVQD +SD K EER  
Subjt:  MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG

Query:  ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCT
        I DEKKSKFTNVGSSPESS+HTFL TAGHCY                                                      VD ARIVA LLMNCT
Subjt:  ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCT

Query:  NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL
        N+LELYQIHAHVLRTNMLENHPSSFYWNII+RSYTRLEVPR ALFVYIAMLRAGILPD YTLPIVFKALSLAYAFDLGLQLHSVAIRLG EFDQYSESGL
Subjt:  NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL

Query:  ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML
        ISLYSK+GDLECACKVFEQN NRKLGSWNAIIAGLSQGG AKEAVNMFIKLRQSGL+PDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKV GKSNILML
Subjt:  ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML

Query:  NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP
        NSLIDMYGKCGRMDLAMKVFSNM HR+VSSWTSLIVGYAMHGQVKQALENF+FMREAGVPPNQVTFVGVLSACVHGG+INEGK+YFDMMKNVYGFKPQLP
Subjt:  NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP

Query:  HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP
        HYGCMVDLL KAGLLEEARRMIEEMPMKANS+IWGCLIGGCEKHGNVEIGEWA KHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP
Subjt:  HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP

Query:  GYSLATRLD
        GYSLAT LD
Subjt:  GYSLATRLD

XP_023541139.1 pentatricopeptide repeat-containing protein At1g77170, mitochondrial isoform X1 [Cucurbita pepo subsp. pepo]1.3e-27077.6Show/hide
Query:  MHNFKLLPHFLLLSPKNVD----CNS---SSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDY
        MHNFK L +FLL+S KN+D    CNS   +  + L LPSLS   S ARRGF  YA     GNEY C   NNK E+Q  G DIH+E K  VLPVQDVVS  
Subjt:  MHNFKLLPHFLLLSPKNVD----CNS---SSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDY

Query:  KCEERFGICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVA
        KCE +F   D  KSKFTNVGSSP+SS  T L   GH  P Y +KHL T++ R LVEFNDV+F+D+ K NKQ AER+A  + P   TYTS NNVD   IVA
Subjt:  KCEERFGICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVA

Query:  TLLMNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFD
        TLLMNCT++LEL QIHAH+LRTNMLEN+PSSF WN IIRSYTR E PR AL VYIAMLRAG+ PD YTLPIVFKALSL YAF LGLQLHSVAIRLG EFD
Subjt:  TLLMNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFD

Query:  QYSESGLISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTG
        QYSESGLISLYSK+GDLECACKVFEQNH+RKLGSWNAIIAGLSQGG AKEAV+MFIKLR+SGL+PDD TIVSVTSACGSLG+LELSLQMHKFVFQVK TG
Subjt:  QYSESGLISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTG

Query:  KSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVY
        KS+ILMLNSLIDMYGKCGRMDLA+KVFSNM HRNVSSWTSLIVGYAMHGQVKQALENF+ MREAGVPPN VTF+GVLSACVHGGM+ EG+HYF+MMK+VY
Subjt:  KSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVY

Query:  GFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQ
        G+KPQLPHYGCMVDLL KAGLLEEARRMI+EMPMKANSIIWGCLIG CEKHGNVE+GEWA KHLQELEPWNDGVYVVLSNIYA+NG+WKE QK+R+ MKQ
Subjt:  GFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQ

Query:  RQLAKVPGYSLATRLD
        RQLAKVPGYSLAT+LD
Subjt:  RQLAKVPGYSLATRLD

XP_038893424.1 pentatricopeptide repeat-containing protein At1g77170, mitochondrial-like [Benincasa hispida]2.3e-29984.67Show/hide
Query:  MHNFKLLPHFLLLSPKNVDCNSSS----LHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCE
        MHNFKLL +FLL+S KN+DCNSSS    LHKLLLPSLS AAS  RRGFYHYA   LSG+E VC N N+KF L+N G DIHK+GK LVLPVQ  V   KCE
Subjt:  MHNFKLLPHFLLLSPKNVDCNSSS----LHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCE

Query:  ERFGICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLL
        ERF I DE KSKFTNVGSSPESSR+  L TAGH YP Y TK LETNELR LVEFNDVQFVD+ KQNKQ AER+A  E P WLTYTS NNVD ARIVA LL
Subjt:  ERFGICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLL

Query:  MNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYS
         NCTN+LELYQIHAHVLRTN+LEN+PSSF+WN IIRSYT+LE PR AL VYIAMLRAG+LPD YTLPIVFKALSLAYAF LG+QLHS+AIRLG EF QYS
Subjt:  MNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYS

Query:  ESGLISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSN
        ESGLISLYSK+GDLECA KVFEQNH+RKLGSWNAIIAGLSQGG AKEAV+MFIKLRQSGL+PDD TIVSVTSACGSLG+LELSLQMHKFVFQVKV GKS+
Subjt:  ESGLISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSN

Query:  ILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFK
        ILMLNSLIDMYGKCGRMDLAMKVFSNM HRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPN VTFVGVLSACVHGGMINEGKHYFDMMKNVYGFK
Subjt:  ILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFK

Query:  PQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQL
        PQLPHYGCMVDLL KAGLLEEAR+MIEEMPMKANS+IWGCLIGGCEKHGNVE+GEWA KHLQELEPWNDGVYVVLSNIYA+NG+WKE QK+R+VMKQRQL
Subjt:  PQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQL

Query:  AKVPGYSLATRLD
        AKVPGYSL TRLD
Subjt:  AKVPGYSLATRLD

TrEMBL top hitse value%identityAlignment
A0A0A0KRB8 Uncharacterized protein0.0e+0099.01Show/hide
Query:  MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG
        MHNFKL PHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG
Subjt:  MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG

Query:  ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCT
        ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFND+QFVDEPKQNKQPAERNAVTETPFWLTYTSYNN+DHARIVATLLMNCT
Subjt:  ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCT

Query:  NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL
        NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYI MLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL
Subjt:  NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL

Query:  ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML
        ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGG AKEAVNMFIKLRQSGL+PDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML
Subjt:  ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML

Query:  NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP
        NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP
Subjt:  NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP

Query:  HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP
        HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP
Subjt:  HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP

Query:  GYSLATRLD
        GYSLATRLD
Subjt:  GYSLATRLD

A0A1S3BF82 pentatricopeptide repeat-containing protein At1g77170 isoform X11.5e-29684.89Show/hide
Query:  MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG
        MHNFKLLPHFLLLS KN+DCNSSSLHKLLLPSLSSAASRARRGFY YAP PLSG+EY CYN NNKFELQN GPDIHKEGKCLVLPVQD +SD K EER  
Subjt:  MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG

Query:  ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCT
        I DEKKSKFTNVGSSPESS+HTFL TAGHCY                                                      VD ARIVA LLMNCT
Subjt:  ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCT

Query:  NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL
        N+LELYQIHAHVLRTNMLENHPSSFYWNII+RSYTRLEVPR ALFVYIAMLRAGILPD YTLPIVFKALSLAYAFDLGLQLHSVAIRLG EFDQYSESGL
Subjt:  NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL

Query:  ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML
        ISLYSK+GDLECACKVFEQN NRKLGSWNAIIAGLSQGG AKEAVNMFIKLRQSGL+PDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKV GKSNILML
Subjt:  ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML

Query:  NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP
        NSLIDMYGKCGRMDLAMKVFSNM HR+VSSWTSLIVGYAMHGQVKQALENF+FMREAGVPPNQVTFVGVLSACVHGG+INEGK+YFDMMKNVYGFKPQLP
Subjt:  NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP

Query:  HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP
        HYGCMVDLL KAGLLEEARRMIEEMPMKANS+IWGCLIGGCEKHGNVEIGEWA KHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP
Subjt:  HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP

Query:  GYSLATRLD
        GYSLAT LD
Subjt:  GYSLATRLD

A0A5A7SZV4 Pentatricopeptide repeat-containing protein1.5e-29684.89Show/hide
Query:  MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG
        MHNFKLLPHFLLLS KN+DCNSSSLHKLLLPSLSSAASRARRGFY YAP PLSG+EY CYN NNKFELQN GPDIHKEGKCLVLPVQD +SD K EER  
Subjt:  MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFG

Query:  ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCT
        I DEKKSKFTNVGSSPESS+HTFL TAGHCY                                                      VD ARIVA LLMNCT
Subjt:  ICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCT

Query:  NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL
        N+LELYQIHAHVLRTNMLENHPSSFYWNII+RSYTRLEVPR ALFVYIAMLRAGILPD YTLPIVFKALSLAYAFDLGLQLHSVAIRLG EFDQYSESGL
Subjt:  NVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGL

Query:  ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML
        ISLYSK+GDLECACKVFEQN NRKLGSWNAIIAGLSQGG AKEAVNMFIKLRQSGL+PDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKV GKSNILML
Subjt:  ISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML

Query:  NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP
        NSLIDMYGKCGRMDLAMKVFSNM HR+VSSWTSLIVGYAMHGQVKQALENF+FMREAGVPPNQVTFVGVLSACVHGG+INEGK+YFDMMKNVYGFKPQLP
Subjt:  NSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLP

Query:  HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP
        HYGCMVDLL KAGLLEEARRMIEEMPMKANS+IWGCLIGGCEKHGNVEIGEWA KHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP
Subjt:  HYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVP

Query:  GYSLATRLD
        GYSLAT LD
Subjt:  GYSLATRLD

A0A6J1G132 pentatricopeptide repeat-containing protein At1g77170, mitochondrial-like isoform X13.3e-26777.44Show/hide
Query:  MHNFKLLPHFLLLSPKNVD----CNS---SSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDY
        MHNFK L +FLL+S KN+D    CNS   +  + L LPS S   S ARRGFY YA     GNEY C   NNK  LQN G DIH+E K  VLPVQDVVS  
Subjt:  MHNFKLLPHFLLLSPKNVD----CNS---SSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDY

Query:  KCEERFGICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVA
        KC+ RF   D  KSKFTNVGSS +    T L   GH  P Y TKHL T++ R LVEFNDV+F+D+ K NKQ AER+A  + P   TYTS NNVD   IVA
Subjt:  KCEERFGICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVA

Query:  TLLMNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFD
        TLLMNCT++LEL QIHAH+LRTNMLEN+PSSF WN IIR YTRLE PR AL VYIAMLRAG+ PD YTLPIVFKALSL YAF LGLQLHSVAIRLG EFD
Subjt:  TLLMNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFD

Query:  QYSESGLISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTG
        QYSESGLISLYSK+GDLECA KVFEQNH+RKLGSWNAIIAGLSQGG AKEAV+MFIKLR+SGL+PDD TIVSVTSACGSLG+LELSLQMHKFVFQVK TG
Subjt:  QYSESGLISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTG

Query:  KSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVY
        KS+ILMLNSLIDMYGKCGRMDLA+KVFSNM HRNVSSWTSLIVGYAMHGQVKQALENF+ MREAGVPPN VTF+GVLSACVHGGM+ EG+HYF+MMK+VY
Subjt:  KSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVY

Query:  GFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQ
        G+KPQLPHYGCMVDLL KAGLLEEARRMI+EMPMKANSIIWGCLIG CEKHGNVE+GEWA KHLQELEPWNDGVYVVLSNIYATNG+WKE QK+R+ MKQ
Subjt:  GFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQ

Query:  RQLAKVPGYSLATRLD
        RQLAKVPGYSLAT+LD
Subjt:  RQLAKVPGYSLATRLD

A0A6J1HYV7 pentatricopeptide repeat-containing protein At1g77170, mitochondrial isoform X12.1e-26977.27Show/hide
Query:  MHNFKLLPHFLLLSPKNVD----CNS---SSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDY
        MHNFK L +FLL+S KN+D    CNS   +  + L LPS S   S ARRGF  YA     GNEY C   NNK  LQN G DIH+E K  VLPVQDVVS  
Subjt:  MHNFKLLPHFLLLSPKNVD----CNS---SSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDY

Query:  KCEERFGICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVA
        KCE RF   D  KSKFTN GSS +SS  T L   GH  P Y TKHL T++ R LVEFNDV+F+D+ K NKQ AER+A  +TP W TYTS NNVD   IVA
Subjt:  KCEERFGICDEKKSKFTNVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVA

Query:  TLLMNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFD
        TL+MNCT++LEL QIHAH+LRTNMLEN+PSSF WN IIRSYTRLE PR AL VYIAMLRAG+ PD YTLPIVFKALSL YAF LGLQLHSVAIRLG EFD
Subjt:  TLLMNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFD

Query:  QYSESGLISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTG
        QYSESGLISLYSK+GDLECA KVFE+NH+RKLGSWNAIIAGLSQGG AKEAV+MFIKLR+SGL+PDD TIVSVTSACGSLG+LELSLQMHKFVFQVKV G
Subjt:  QYSESGLISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTG

Query:  KSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVY
        KS+ILMLNSLIDMYGKCGRMDLA+KVFSNM HRNVSSWTSLIVGYAMHGQVKQALENF+ MREAGVPPN VTF+GVLSACVHGGM+ EG+HYF+MMK+VY
Subjt:  KSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVY

Query:  GFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQ
        G+KPQLPHYGCMVDLL KAGLLEEARRMI+EMPM+ANSIIWGCLIG CEKHGNVE+GEWA KHLQELEPWNDGVYVVLSNIYA+NG+WKE  K+R+ MKQ
Subjt:  GFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQ

Query:  RQLAKVPGYSLATRLD
        RQLAKVPGYSLAT+LD
Subjt:  RQLAKVPGYSLATRLD

SwissProt top hitse value%identityAlignment
A8MQA3 Pentatricopeptide repeat-containing protein At4g210651.4e-8139.85Show/hide
Query:  LENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGIL-PDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKV
        +E   + F WN +IR Y  +     A  +Y  M  +G++ PD +T P + KA++      LG  +HSV IR GF    Y ++ L+ LY+  GD+  A KV
Subjt:  LENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGIL-PDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKV

Query:  FEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILMLNSLIDMYGKCGRMDLA
        F++   + L +WN++I G ++ G  +EA+ ++ ++   G+ PD FTIVS+ SAC  +G L L  ++H  V+ +KV    N+   N L+D+Y +CGR++ A
Subjt:  FEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILMLNSLIDMYGKCGRMDLA

Query:  MKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREA-GVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLL
          +F  M  +N  SWTSLIVG A++G  K+A+E F++M    G+ P ++TFVG+L AC H GM+ EG  YF  M+  Y  +P++ H+GCMVDLL++AG +
Subjt:  MKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREA-GVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLL

Query:  EEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYSL
        ++A   I+ MPM+ N +IW  L+G C  HG+ ++ E+A   + +LEP + G YV+LSN+YA+   W + QK+R  M +  + KVPG+SL
Subjt:  EEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYSL

Q3ECB8 Pentatricopeptide repeat-containing protein At1g77170, mitochondrial1.4e-14557.14Show/hide
Query:  DHARIVATLLMNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAI
        D  +++ATLL NCT++  + +IH  + R+ +L+ +P +F WN I+RSY R E P  A+ VY+ M+R+ +LPD Y+LPIV KA    + F LG +LHSVA+
Subjt:  DHARIVATLLMNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAI

Query:  RLGFEFDQYSESGLISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFV
        RLGF  D++ ESG I+LY K G+ E A KVF++N  RKLGSWNAII GL+  G A EAV MF+ +++SGL+PDDFT+VSVT++CG LG+L L+ Q+HK V
Subjt:  RLGFEFDQYSESGLISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFV

Query:  FQVKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYF
         Q K   KS+I+MLNSLIDMYGKCGRMDLA  +F  M  RNV SW+S+IVGYA +G   +ALE F+ MRE GV PN++TFVGVLSACVHGG++ EGK YF
Subjt:  FQVKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYF

Query:  DMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQK
         MMK+ +  +P L HYGC+VDLLS+ G L+EA++++EEMPMK N ++WGCL+GGCEK G+VE+ EW   ++ ELEPWNDGVYVVL+N+YA  GMWK+ ++
Subjt:  DMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQK

Query:  MRDVMKQRQLAKVPGYSLAT
        +R +MK +++AK+P YS A+
Subjt:  MRDVMKQRQLAKVPGYSLAT

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665201.3e-7933.62Show/hide
Query:  LMNCTNVLELYQIHAHVLRTNMLE-------------------------------NHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPI
        L  C+   EL QIHA +L+T +++                               + P +F WN++IR ++  + P  +L +Y  ML +    + YT P 
Subjt:  LMNCTNVLELYQIHAHVLRTNMLE-------------------------------NHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPI

Query:  VFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYS-------------------------------KIGDLECACKVFEQNHNRKLGSWNAIIA
        + KA S   AF+   Q+H+   +LG+E D Y+ + LI+ Y+                               K G ++ A  +F +   +   SW  +I+
Subjt:  VFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYS-------------------------------KIGDLECACKVFEQNHNRKLGSWNAIIA

Query:  GLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTS
        G  Q    KEA+ +F +++ S ++PD+ ++ +  SAC  LG LE    +H ++ + ++  + + ++   LIDMY KCG M+ A++VF N+  ++V +WT+
Subjt:  GLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTS

Query:  LIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSII
        LI GYA HG  ++A+  F  M++ G+ PN +TF  VL+AC + G++ EGK  F  M+  Y  KP + HYGC+VDLL +AGLL+EA+R I+EMP+K N++I
Subjt:  LIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSII

Query:  WGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYS
        WG L+  C  H N+E+GE  G+ L  ++P++ G YV  +NI+A +  W +A + R +MK++ +AKVPG S
Subjt:  WGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYS

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.1e-7839.42Show/hide
Query:  WNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKVFEQNHNRKLG
        WN +I  Y      + AL ++  M++  + PD  T+  V  A + + + +LG Q+H      GF  +    + LI LYSK G+LE AC +FE+   + + 
Subjt:  WNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKVFEQNHNRKLG

Query:  SWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML-NSLIDMYGKCGRMDLAMKVFSNMGH
        SWN +I G +     KEA+ +F ++ +SG  P+D T++S+  AC  LG +++   +H ++   ++ G +N   L  SLIDMY KCG ++ A +VF+++ H
Subjt:  SWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML-NSLIDMYGKCGRMDLAMKVFSNMGH

Query:  RNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEM
        +++SSW ++I G+AMHG+   + + F  MR+ G+ P+ +TFVG+LSAC H GM++ G+H F  M   Y   P+L HYGCM+DLL  +GL +EA  MI  M
Subjt:  RNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEM

Query:  PMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYS
         M+ + +IW  L+  C+ HGNVE+GE   ++L ++EP N G YV+LSNIYA+ G W E  K R ++  + + KVPG S
Subjt:  PMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYS

Q9SIL5 Pentatricopeptide repeat-containing protein At2g205407.9e-7736.9Show/hide
Query:  NHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGI-LPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKVFE
        ++P+ F +N IIR+YT   +    + +Y  +LR    LPD +T P +FK+ +   +  LG Q+H    + G  F   +E+ LI +Y K  DL  A KVF+
Subjt:  NHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGI-LPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKVFE

Query:  QNHNRKLGSWN-------------------------------AIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVF
        + + R + SWN                               A+I+G +  GC  EA++ F +++ +G++PD+ +++SV  +C  LG+LEL   +H +  
Subjt:  QNHNRKLGSWN-------------------------------AIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVF

Query:  Q---VKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKH
        +   +K TG     + N+LI+MY KCG +  A+++F  M  ++V SW+++I GYA HG    A+E F  M+ A V PN +TF+G+LSAC H GM  EG  
Subjt:  Q---VKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKH

Query:  YFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEA
        YFDMM+  Y  +P++ HYGC++D+L++AG LE A  + + MPMK +S IWG L+  C   GN+++   A  HL ELEP + G YV+L+NIYA  G W++ 
Subjt:  YFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEA

Query:  QKMRDVMKQRQLAKVPGYSL
         ++R +++   + K PG SL
Subjt:  QKMRDVMKQRQLAKVPGYSL

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.9e-8039.42Show/hide
Query:  WNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKVFEQNHNRKLG
        WN +I  Y      + AL ++  M++  + PD  T+  V  A + + + +LG Q+H      GF  +    + LI LYSK G+LE AC +FE+   + + 
Subjt:  WNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKVFEQNHNRKLG

Query:  SWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML-NSLIDMYGKCGRMDLAMKVFSNMGH
        SWN +I G +     KEA+ +F ++ +SG  P+D T++S+  AC  LG +++   +H ++   ++ G +N   L  SLIDMY KCG ++ A +VF+++ H
Subjt:  SWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILML-NSLIDMYGKCGRMDLAMKVFSNMGH

Query:  RNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEM
        +++SSW ++I G+AMHG+   + + F  MR+ G+ P+ +TFVG+LSAC H GM++ G+H F  M   Y   P+L HYGCM+DLL  +GL +EA  MI  M
Subjt:  RNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEM

Query:  PMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYS
         M+ + +IW  L+  C+ HGNVE+GE   ++L ++EP N G YV+LSNIYA+ G W E  K R ++  + + KVPG S
Subjt:  PMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYS

AT1G77170.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.7e-14757.14Show/hide
Query:  DHARIVATLLMNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAI
        D  +++ATLL NCT++  + +IH  + R+ +L+ +P +F WN I+RSY R E P  A+ VY+ M+R+ +LPD Y+LPIV KA    + F LG +LHSVA+
Subjt:  DHARIVATLLMNCTNVLELYQIHAHVLRTNMLENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAI

Query:  RLGFEFDQYSESGLISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFV
        RLGF  D++ ESG I+LY K G+ E A KVF++N  RKLGSWNAII GL+  G A EAV MF+ +++SGL+PDDFT+VSVT++CG LG+L L+ Q+HK V
Subjt:  RLGFEFDQYSESGLISLYSKIGDLECACKVFEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFV

Query:  FQVKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYF
         Q K   KS+I+MLNSLIDMYGKCGRMDLA  +F  M  RNV SW+S+IVGYA +G   +ALE F+ MRE GV PN++TFVGVLSACVHGG++ EGK YF
Subjt:  FQVKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYF

Query:  DMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQK
         MMK+ +  +P L HYGC+VDLLS+ G L+EA++++EEMPMK N ++WGCL+GGCEK G+VE+ EW   ++ ELEPWNDGVYVVL+N+YA  GMWK+ ++
Subjt:  DMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQK

Query:  MRDVMKQRQLAKVPGYSLAT
        +R +MK +++AK+P YS A+
Subjt:  MRDVMKQRQLAKVPGYSLAT

AT2G20540.1 mitochondrial editing factor 215.6e-7836.9Show/hide
Query:  NHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGI-LPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKVFE
        ++P+ F +N IIR+YT   +    + +Y  +LR    LPD +T P +FK+ +   +  LG Q+H    + G  F   +E+ LI +Y K  DL  A KVF+
Subjt:  NHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGI-LPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKVFE

Query:  QNHNRKLGSWN-------------------------------AIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVF
        + + R + SWN                               A+I+G +  GC  EA++ F +++ +G++PD+ +++SV  +C  LG+LEL   +H +  
Subjt:  QNHNRKLGSWN-------------------------------AIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVF

Query:  Q---VKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKH
        +   +K TG     + N+LI+MY KCG +  A+++F  M  ++V SW+++I GYA HG    A+E F  M+ A V PN +TF+G+LSAC H GM  EG  
Subjt:  Q---VKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKH

Query:  YFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEA
        YFDMM+  Y  +P++ HYGC++D+L++AG LE A  + + MPMK +S IWG L+  C   GN+++   A  HL ELEP + G YV+L+NIYA  G W++ 
Subjt:  YFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEA

Query:  QKMRDVMKQRQLAKVPGYSL
         ++R +++   + K PG SL
Subjt:  QKMRDVMKQRQLAKVPGYSL

AT4G21065.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.9e-8339.85Show/hide
Query:  LENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGIL-PDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKV
        +E   + F WN +IR Y  +     A  +Y  M  +G++ PD +T P + KA++      LG  +HSV IR GF    Y ++ L+ LY+  GD+  A KV
Subjt:  LENHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGIL-PDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKV

Query:  FEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILMLNSLIDMYGKCGRMDLA
        F++   + L +WN++I G ++ G  +EA+ ++ ++   G+ PD FTIVS+ SAC  +G L L  ++H  V+ +KV    N+   N L+D+Y +CGR++ A
Subjt:  FEQNHNRKLGSWNAIIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILMLNSLIDMYGKCGRMDLA

Query:  MKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREA-GVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLL
          +F  M  +N  SWTSLIVG A++G  K+A+E F++M    G+ P ++TFVG+L AC H GM+ EG  YF  M+  Y  +P++ H+GCMVDLL++AG +
Subjt:  MKVFSNMGHRNVSSWTSLIVGYAMHGQVKQALENFQFMREA-GVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLL

Query:  EEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYSL
        ++A   I+ MPM+ N +IW  L+G C  HG+ ++ E+A   + +LEP + G YV+LSN+YA+   W + QK+R  M +  + KVPG+SL
Subjt:  EEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYSL

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.3e-8133.62Show/hide
Query:  LMNCTNVLELYQIHAHVLRTNMLE-------------------------------NHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPI
        L  C+   EL QIHA +L+T +++                               + P +F WN++IR ++  + P  +L +Y  ML +    + YT P 
Subjt:  LMNCTNVLELYQIHAHVLRTNMLE-------------------------------NHPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPI

Query:  VFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYS-------------------------------KIGDLECACKVFEQNHNRKLGSWNAIIA
        + KA S   AF+   Q+H+   +LG+E D Y+ + LI+ Y+                               K G ++ A  +F +   +   SW  +I+
Subjt:  VFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYS-------------------------------KIGDLECACKVFEQNHNRKLGSWNAIIA

Query:  GLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTS
        G  Q    KEA+ +F +++ S ++PD+ ++ +  SAC  LG LE    +H ++ + ++  + + ++   LIDMY KCG M+ A++VF N+  ++V +WT+
Subjt:  GLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTS

Query:  LIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSII
        LI GYA HG  ++A+  F  M++ G+ PN +TF  VL+AC + G++ EGK  F  M+  Y  KP + HYGC+VDLL +AGLL+EA+R I+EMP+K N++I
Subjt:  LIVGYAMHGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSII

Query:  WGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYS
        WG L+  C  H N+E+GE  G+ L  ++P++ G YV  +NI+A +  W +A + R +MK++ +AKVPG S
Subjt:  WGCLIGGCEKHGNVEIGEWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAATTTCAAGCTACTCCCACACTTTCTCCTACTATCACCTAAGAACGTCGATTGCAACAGTTCCAGCCTTCATAAACTGCTTCTACCGTCTCTTTCCTCTGCTGC
TTCCCGTGCTAGGCGAGGATTCTATCATTACGCTCCGGAGCCATTATCAGGGAATGAGTACGTGTGCTATAATGGGAACAATAAGTTTGAGCTTCAAAACGGAGGCCCTG
ACATTCACAAGGAAGGCAAGTGCCTTGTGCTTCCTGTTCAAGATGTCGTGTCAGATTACAAATGCGAAGAGAGATTTGGTATTTGTGATGAAAAAAAGAGCAAATTCACA
AACGTTGGCAGCTCACCAGAGAGCTCACGACATACCTTCCTAACGACGGCAGGGCATTGCTATCCAAATTACAAGACCAAGCACCTTGAGACAAACGAGTTAAGGCGACT
GGTAGAATTTAACGATGTGCAATTTGTGGATGAGCCTAAGCAAAATAAGCAGCCTGCAGAGAGGAATGCTGTGACCGAGACGCCTTTCTGGTTGACATACACCTCTTACA
ATAACGTGGACCATGCGAGAATTGTCGCTACCCTATTGATGAACTGTACTAATGTATTAGAATTGTACCAAATTCATGCCCATGTTTTAAGAACCAACATGTTGGAAAAC
CATCCTTCTTCATTTTACTGGAACATCATTATTAGATCATACACTAGGCTTGAGGTCCCAAGAATAGCCCTCTTTGTATACATCGCCATGTTGCGAGCAGGCATTTTGCC
TGATTGTTACACACTTCCAATTGTATTTAAGGCTTTATCTCTAGCTTATGCTTTTGATTTAGGATTACAATTACATTCAGTTGCTATACGGCTTGGTTTTGAGTTCGACC
AATATAGCGAAAGTGGCCTCATCAGTCTGTACTCAAAGATAGGAGATCTTGAGTGTGCATGTAAGGTGTTTGAACAAAACCATAACAGGAAGTTGGGTTCTTGGAATGCT
ATTATAGCTGGTTTATCTCAAGGTGGGTGTGCTAAAGAGGCAGTCAATATGTTCATTAAGTTAAGACAAAGTGGGCTCGACCCAGATGATTTTACAATTGTTAGCGTAAC
ATCAGCTTGTGGTAGTCTAGGCAACCTAGAACTGTCTCTCCAAATGCACAAATTTGTGTTTCAAGTGAAAGTCACAGGGAAATCGAACATTTTGATGCTAAATTCCCTCA
TCGACATGTATGGAAAATGTGGTAGGATGGACTTAGCAATGAAAGTATTTTCGAACATGGGTCATAGAAATGTATCATCCTGGACATCGCTGATTGTAGGTTATGCAATG
CATGGACAAGTGAAACAAGCGCTCGAGAACTTCCAATTCATGAGAGAAGCAGGAGTTCCTCCTAATCAAGTTACGTTTGTCGGGGTACTAAGCGCTTGTGTACACGGTGG
GATGATCAACGAAGGGAAACATTACTTCGATATGATGAAAAATGTTTACGGTTTCAAGCCTCAACTGCCACATTATGGATGCATGGTCGATTTGCTCAGTAAAGCAGGGT
TGCTTGAAGAGGCGAGAAGGATGATTGAAGAGATGCCGATGAAGGCGAATTCGATAATATGGGGATGTTTGATTGGAGGTTGTGAGAAACATGGGAATGTGGAGATTGGG
GAATGGGCAGGTAAACATTTACAAGAATTGGAGCCTTGGAATGATGGTGTTTATGTGGTTTTGTCCAATATTTATGCTACCAATGGCATGTGGAAAGAGGCTCAAAAGAT
GAGAGATGTTATGAAACAAAGGCAACTTGCTAAGGTTCCTGGGTATAGCTTGGCTACAAGATTAGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCACAATTTCAAGCTACTCCCACACTTTCTCCTACTATCACCTAAGAACGTCGATTGCAACAGTTCCAGCCTTCATAAACTGCTTCTACCGTCTCTTTCCTCTGCTGC
TTCCCGTGCTAGGCGAGGATTCTATCATTACGCTCCGGAGCCATTATCAGGGAATGAGTACGTGTGCTATAATGGGAACAATAAGTTTGAGCTTCAAAACGGAGGCCCTG
ACATTCACAAGGAAGGCAAGTGCCTTGTGCTTCCTGTTCAAGATGTCGTGTCAGATTACAAATGCGAAGAGAGATTTGGTATTTGTGATGAAAAAAAGAGCAAATTCACA
AACGTTGGCAGCTCACCAGAGAGCTCACGACATACCTTCCTAACGACGGCAGGGCATTGCTATCCAAATTACAAGACCAAGCACCTTGAGACAAACGAGTTAAGGCGACT
GGTAGAATTTAACGATGTGCAATTTGTGGATGAGCCTAAGCAAAATAAGCAGCCTGCAGAGAGGAATGCTGTGACCGAGACGCCTTTCTGGTTGACATACACCTCTTACA
ATAACGTGGACCATGCGAGAATTGTCGCTACCCTATTGATGAACTGTACTAATGTATTAGAATTGTACCAAATTCATGCCCATGTTTTAAGAACCAACATGTTGGAAAAC
CATCCTTCTTCATTTTACTGGAACATCATTATTAGATCATACACTAGGCTTGAGGTCCCAAGAATAGCCCTCTTTGTATACATCGCCATGTTGCGAGCAGGCATTTTGCC
TGATTGTTACACACTTCCAATTGTATTTAAGGCTTTATCTCTAGCTTATGCTTTTGATTTAGGATTACAATTACATTCAGTTGCTATACGGCTTGGTTTTGAGTTCGACC
AATATAGCGAAAGTGGCCTCATCAGTCTGTACTCAAAGATAGGAGATCTTGAGTGTGCATGTAAGGTGTTTGAACAAAACCATAACAGGAAGTTGGGTTCTTGGAATGCT
ATTATAGCTGGTTTATCTCAAGGTGGGTGTGCTAAAGAGGCAGTCAATATGTTCATTAAGTTAAGACAAAGTGGGCTCGACCCAGATGATTTTACAATTGTTAGCGTAAC
ATCAGCTTGTGGTAGTCTAGGCAACCTAGAACTGTCTCTCCAAATGCACAAATTTGTGTTTCAAGTGAAAGTCACAGGGAAATCGAACATTTTGATGCTAAATTCCCTCA
TCGACATGTATGGAAAATGTGGTAGGATGGACTTAGCAATGAAAGTATTTTCGAACATGGGTCATAGAAATGTATCATCCTGGACATCGCTGATTGTAGGTTATGCAATG
CATGGACAAGTGAAACAAGCGCTCGAGAACTTCCAATTCATGAGAGAAGCAGGAGTTCCTCCTAATCAAGTTACGTTTGTCGGGGTACTAAGCGCTTGTGTACACGGTGG
GATGATCAACGAAGGGAAACATTACTTCGATATGATGAAAAATGTTTACGGTTTCAAGCCTCAACTGCCACATTATGGATGCATGGTCGATTTGCTCAGTAAAGCAGGGT
TGCTTGAAGAGGCGAGAAGGATGATTGAAGAGATGCCGATGAAGGCGAATTCGATAATATGGGGATGTTTGATTGGAGGTTGTGAGAAACATGGGAATGTGGAGATTGGG
GAATGGGCAGGTAAACATTTACAAGAATTGGAGCCTTGGAATGATGGTGTTTATGTGGTTTTGTCCAATATTTATGCTACCAATGGCATGTGGAAAGAGGCTCAAAAGAT
GAGAGATGTTATGAAACAAAGGCAACTTGCTAAGGTTCCTGGGTATAGCTTGGCTACAAGATTAGATTGA
Protein sequenceShow/hide protein sequence
MHNFKLLPHFLLLSPKNVDCNSSSLHKLLLPSLSSAASRARRGFYHYAPEPLSGNEYVCYNGNNKFELQNGGPDIHKEGKCLVLPVQDVVSDYKCEERFGICDEKKSKFT
NVGSSPESSRHTFLTTAGHCYPNYKTKHLETNELRRLVEFNDVQFVDEPKQNKQPAERNAVTETPFWLTYTSYNNVDHARIVATLLMNCTNVLELYQIHAHVLRTNMLEN
HPSSFYWNIIIRSYTRLEVPRIALFVYIAMLRAGILPDCYTLPIVFKALSLAYAFDLGLQLHSVAIRLGFEFDQYSESGLISLYSKIGDLECACKVFEQNHNRKLGSWNA
IIAGLSQGGCAKEAVNMFIKLRQSGLDPDDFTIVSVTSACGSLGNLELSLQMHKFVFQVKVTGKSNILMLNSLIDMYGKCGRMDLAMKVFSNMGHRNVSSWTSLIVGYAM
HGQVKQALENFQFMREAGVPPNQVTFVGVLSACVHGGMINEGKHYFDMMKNVYGFKPQLPHYGCMVDLLSKAGLLEEARRMIEEMPMKANSIIWGCLIGGCEKHGNVEIG
EWAGKHLQELEPWNDGVYVVLSNIYATNGMWKEAQKMRDVMKQRQLAKVPGYSLATRLD