CuGenDBv2

CSPI01G33460 (gene) of Cucumber (PI 183967) v1 genome

Gene ID: CSPI01G33460
Organism: Cucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: Chr1:28384031..28390328
RNA-Seq Expression: CSPI01G33460
Synteny: CSPI01G33460
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily
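
The genome location field uses a `chromosome:start..end` layout. As a minimal sketch, it could be split into its parts as below; the function names are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re


def parse_location(text):
    """Split a 'Chr:start..end' string into (chromosome, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive at both ends.
    return end - start + 1


chrom, start, end = parse_location("Chr1:28384031..28390328")
gene_span = span_length(start, end)
print(chrom, start, end, gene_span)
```

Under that assumption, the gene model spans 6298 bp on Chr1.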


Homology
GenBank top hits | e value | % identity
XP_011660274.1 pentatricopeptide repeat-containing protein At5g08510 [Cucumis sativus] | 5.3e-305 | 99.61
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ
        VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVH 
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ

Query:  DQNEDEELLYSS
        D NEDEELLYSS
Subjt:  DQNEDEELLYSS
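
In alignment blocks of this kind, the unlabeled middle line repeats the residue where the two sequences are identical, shows `+` for a conservative substitution, and is blank for a mismatch or gap. The following is a minimal sketch of how a percent-identity figure could be derived from one block; `percent_identity` is a hypothetical helper, not part of the database, and it handles a single block only.

```python
def percent_identity(query, midline):
    """Percent identity for one alignment block.

    A column counts as identical when the midline carries a letter;
    '+' (conservative substitution) and ' ' (mismatch or gap) do not count.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for m in midline if m.isalpha())
    return 100.0 * identical / len(query)


# Final block of the first GenBank hit: 11 of 12 columns are identical.
block_identity = round(percent_identity("DQNEDEELLYSS", "D NEDEELLYSS"), 2)
print(block_identity)
```

For a whole hit, the identical columns and the alignment lengths would be summed over all blocks before dividing, which is presumably how the reported % identity is obtained.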

XP_022924284.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita moschata] | 7.0e-265 | 86.55
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLI+KLLQ+P+LPYACTLFD IPKPSV+LYNKFIQTFSSIGH HRCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHF KSGFASD+FA+TALLDMY KLG+L+SARQLFDEMPVRDIPTWNS+IAGYARSG+M AALELF+KMP RNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+ LGALDIGKRIEAYAR NGFFKN YVSNA+LE+HARCGNIEEA++VFDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLI++ RPDDVTFVGLLLACTHGGMVA+GRQLFESME KFQ+APKLEHYGCLVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR RKMMKGGH+ KRAG SYIEVGDGIHEF+VEDRSH KS EIYALLH +Y IIKLH     
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ

Query:  DQNE-DEELLYSS
        +QNE +EELLYSS
Subjt:  DQNE-DEELLYSS

XP_022979295.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita maxima] | 3.0e-268 | 87.52
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLFD IPKPSV+LYNKFIQTFSSIGHPHRCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHFCKSGFASD+FA+TALLDMY KLG+L+SARQLFDEMPVRDIPTWNS+IAGYARSG+M AALELF+KMP+RNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+QLGALDIGKRIE YAR NGFFKN YVSNA+LE+HARCGNIEEA++VFDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLI++ RPDDVTFVGLLLACTHGGMVA+GRQ+FESME KFQ+APKLEHYGCLVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVEDRSH KS EIYALLH IY IIKLH     
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ

Query:  DQNE-DEELLYSS
         QNE +EELLYSS
Subjt:  DQNE-DEELLYSS

XP_023527365.1 pentatricopeptide repeat-containing protein At5g08510 [Cucurbita pepo subsp. pepo] | 4.8e-266 | 86.74
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLFD IPKPSV+LYNKFIQTFSSIGH HRCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHFCKSGFASD+FA+TALLDMY KLG+L+SARQLFDE PVRDIPTWNS+IAGYARSG+M AAL+LF+KMP RNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+ LGALDIGKRIEAYAR NGFFKN YVSNA+LE+HARCGNIEEA++VFDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQML+++ RPDDVTFVGLLLACTHGGMVA+GRQLFESME KFQ+APKLEHYGCLVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVEDRSH KS EIYALLH +Y IIKLH     
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ

Query:  DQNE-DEELLYSS
        +QNE +EELLYSS
Subjt:  DQNE-DEELLYSS

XP_038877076.1 pentatricopeptide repeat-containing protein At5g08510-like [Benincasa hispida] | 1.4e-273 | 89.47
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        M QLKQIHAYSLRNGLD+TKFLIEKLLQ P+LPYACTLFD IP+PSV+LYNKFIQTFSSIG PHRCWLLY QMC QGCSPNQ+SFTFLF ACASL N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHFCKSGFASD+FA TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNS+IAGYARSG MEAA +LF+KMPVR+V+SWT LISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFK-NAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCI
        +F+ LENEKG KPNEV+IASVLPAC+QLGALDIGKRIEAYARNNGFFK N+YVSNA+LE+HARCGNI EA++VFDEIGSKRNLCSWNTMIMGLAVHGRC 
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFK-NAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCI

Query:  DALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHG
        DALQLYDQMLIR+MRPDDVTFVGLLLACTHGGMVAEGRQ+FESMES FQ+APKLEHYGCLVDLLGRAGELQEAY LIQNMPMAPDSVIWG LLGACSFHG
Subjt:  DALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHG

Query:  NVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVH
        NVELGEVAAESLF LEPWNPGNYVILSNIYA AGDWSGVARLRKMMKGG ITKRAGYSYIEVGDGIHEFIVEDRSHL+SGEIYALLH IYDIIKLHK+ H
Subjt:  NVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVH

Query:  QDQNEDEELLYSS
         +QNED ELLYSS
Subjt:  QDQNEDEELLYSS

TrEMBL top hits | e value | % identity
A0A0A0LY28 Uncharacterized protein | 2.5e-305 | 99.61
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ
        VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVH 
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ

Query:  DQNEDEELLYSS
        D NEDEELLYSS
Subjt:  DQNEDEELLYSS

A0A6J1D160 pentatricopeptide repeat-containing protein At5g08510 isoform X2 | 8.6e-261 | 85.32
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAY LR+G+D+TKFLIEKLLQ+P+LPYAC LFD IPKPSV+LYNKFIQ++SS G  HRCW LY QMC QGCSPNQ+SFTFLF ACASL NV+P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLH+HFCKSGFASD+FA+TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNS++AGY+RSG M AALELF++MPVRNV+SWTALISGY+QNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENE+G KPNEV++ASVLPAC+QLGALDIG+RIEAYARNNGFFKN YVSNA+LE+HARCGNIEEA+QVFDEIGSKRNLCSWNTMIMGLAVHGRC  
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        A++LYDQML +++RPDDVTF+GLLLACTHGGMVA+GRQLFESMESKFQ+APKLEHYGCLVDLLGRAGEL+EAYNLIQ MPM PDSVIWG LLGACSFHG+
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ
        VEL EVAAESLFKLEPWNPGNYVILSNIYA AGDW GVARLRK MKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKS EIYALLH IY IIKL K    
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ

Query:  DQNE
         QNE
Subjt:  DQNE

A0A6J1D2G9 pentatricopeptide repeat-containing protein At5g08510 isoform X1 | 1.4e-263 | 85.13
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAY LR+G+D+TKFLIEKLLQ+P+LPYAC LFD IPKPSV+LYNKFIQ++SS G  HRCW LY QMC QGCSPNQ+SFTFLF ACASL NV+P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLH+HFCKSGFASD+FA+TALLDMYAKLGMLRSARQLFDEMPVRDIPTWNS++AGY+RSG M AALELF++MPVRNV+SWTALISGY+QNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENE+G KPNEV++ASVLPAC+QLGALDIG+RIEAYARNNGFFKN YVSNA+LE+HARCGNIEEA+QVFDEIGSKRNLCSWNTMIMGLAVHGRC  
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        A++LYDQML +++RPDDVTF+GLLLACTHGGMVA+GRQLFESMESKFQ+APKLEHYGCLVDLLGRAGEL+EAYNLIQ MPM PDSVIWG LLGACSFHG+
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ
        VEL EVAAESLFKLEPWNPGNYVILSNIYA AGDW GVARLRK MKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKS EIYALLH IY IIKL K    
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ

Query:  DQNEDEELLYS
         QNE EELL+S
Subjt:  DQNEDEELLYS

A0A6J1E8Q2 pentatricopeptide repeat-containing protein At5g08510 isoform X1 | 3.4e-265 | 86.55
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLI+KLLQ+P+LPYACTLFD IPKPSV+LYNKFIQTFSSIGH HRCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHF KSGFASD+FA+TALLDMY KLG+L+SARQLFDEMPVRDIPTWNS+IAGYARSG+M AALELF+KMP RNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+ LGALDIGKRIEAYAR NGFFKN YVSNA+LE+HARCGNIEEA++VFDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLI++ RPDDVTFVGLLLACTHGGMVA+GRQLFESME KFQ+APKLEHYGCLVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR RKMMKGGH+ KRAG SYIEVGDGIHEF+VEDRSH KS EIYALLH +Y IIKLH     
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ

Query:  DQNE-DEELLYSS
        +QNE +EELLYSS
Subjt:  DQNE-DEELLYSS

A0A6J1ISU4 pentatricopeptide repeat-containing protein At5g08510 isoform X1 | 1.5e-268 | 87.52
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLFD IPKPSV+LYNKFIQTFSSIGHPHRCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
        GQMLHSHFCKSGFASD+FA+TALLDMY KLG+L+SARQLFDEMPVRDIPTWNS+IAGYARSG+M AALELF+KMP+RNVISWTALISGYAQNGKYAKALE
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+QLGALDIGKRIE YAR NGFFKN YVSNA+LE+HARCGNIEEA++VFDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLI++ RPDDVTFVGLLLACTHGGMVA+GRQ+FESME KFQ+APKLEHYGCLVDLLGRAGE++EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR+RKMMKGGHI KRAG SYIEVGDGIHEFIVEDRSH KS EIYALLH IY IIKLH     
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQ

Query:  DQNE-DEELLYSS
         QNE +EELLYSS
Subjt:  DQNE-DEELLYSS

SwissProt top hits | e value | % identity
Q9C501 Pentatricopeptide repeat-containing protein At1g33350 | 3.6e-107 | 38.6
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKL-----LQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI--GHPHRCWLLYCQMCSQGC-SPNQYSFTFLFPAC
        +N LKQ+ ++ + +GL H+ FL  KL     L+L +L YA  +FD+   P+ +LY   +  +SS    H    +  +  M ++    PN + +  +  + 
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKL-----LQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI--GHPHRCWLLYCQMCSQGC-SPNQYSFTFLFPAC

Query:  ASLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYA-KLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQ
          L + +   ++H+H  KSGF   +   TALL  YA  +  +  ARQLFDEM  R++ +W ++++GYARSG +  A+ LF  MP R+V SW A+++   Q
Subjt:  ASLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYA-KLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQ

Query:  NGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG
        NG + +A+ +F  + NE   +PNEV++  VL AC+Q G L + K I A+A       + +VSN++++L+ +CGN+EEA  VF ++ SK++L +WN+MI  
Subjt:  NGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG

Query:  LAVHGRCIDALQLYDQML---IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIW
         A+HGR  +A+ ++++M+   I  ++PD +TF+GLL ACTHGG+V++GR  F+ M ++F + P++EHYGCL+DLLGRAG   EA  ++  M M  D  IW
Subjt:  LAVHGRCIDALQLYDQML---IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIW

Query:  GTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKI
        G+LL AC  HG+++L EVA ++L  L P N G   +++N+Y   G+W    R RKM+K  +  K  G+S IE+ + +H+F   D+SH ++ EIY +L  +
Subjt:  GTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKI

Q9FNN7 Pentatricopeptide repeat-containing protein At5g08510 | 2.4e-167 | 54.42
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MN +KQ+HA+ LR G+D TK L+++LL +P+L YA  LFD       +LYNK IQ +     PH   +LY  +   G  P+ ++F F+F A AS  +  P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
         ++LHS F +SGF SD F  T L+  YAKLG L  AR++FDEM  RD+P WN++I GY R G M+AA+ELF+ MP +NV SWT +ISG++QNG Y++AL+
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ +E +K  KPN +++ SVLPAC+ LG L+IG+R+E YAR NGFF N YV NA +E++++CG I+ A+++F+E+G++RNLCSWN+MI  LA HG+  +
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        AL L+ QML    +PD VTFVGLLLAC HGGMV +G++LF+SME   +++PKLEHYGC++DLLGR G+LQEAY+LI+ MPM PD+V+WGTLLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVH
        VE+ E+A+E+LFKLEP NPGN VI+SNIYA    W GV R+RK+MK   +TK AGYSY +EVG  +H+F VED+SH +S EIY +L +I+  +KL K   
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVH

Query:  QDQNEDEEL
            + E+L
Subjt:  QDQNEDEEL

Q9LS72 Pentatricopeptide repeat-containing protein At3g29230 | 5.4e-111 | 37.57
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDL----PYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLF
        +NQ+KQ+HA  +R  L     +  KL+    L      A  +F+Q+ +P+V+L N  I+  +    P++ + ++ +M   G   + +++ FL  AC+   
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDL----PYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLF

Query:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYA---------------------------------KLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGH
         +   +M+H+H  K G +SD++   AL+D Y+                                 K G LR AR+LFDEMP RD+ +WN+++ GYAR   
Subjt:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYA---------------------------------KLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGH

Query:  MEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMF----------------------IGLENE----------KGTKPNEVSIASVLPACSQLGAL
        M  A ELF KMP RN +SW+ ++ GY++ G    A  MF                       GL  E           G K +  ++ S+L AC++ G L
Subjt:  MEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMF----------------------IGLENE----------KGTKPNEVSIASVLPACSQLGAL

Query:  DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGG
         +G RI +  + +    NAYV NA+L+++A+CGN+++A  VF++I  K++L SWNTM+ GL VHG   +A++L+ +M    +RPD VTF+ +L +C H G
Subjt:  DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGG

Query:  MVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAL
        ++ EG   F SME  + + P++EHYGCLVDLLGR G L+EA  ++Q MPM P+ VIWG LLGAC  H  V++ +   ++L KL+P +PGNY +LSNIYA 
Subjt:  MVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAL

Query:  AGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYD
        A DW GVA +R  MK   + K +G S +E+ DGIHEF V D+SH KS +IY +L  + +
Subjt:  AGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYD

Q9SIL5 Pentatricopeptide repeat-containing protein At2g20540 | 2.2e-112 | 40.63
Query:  NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLF
        N+ K+I+A  + +GL  + F++ K++    ++ D+ YA  LF+Q+  P+V+LYN  I+ ++          +Y Q+  +    P++++F F+F +CASL 
Subjt:  NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLF

Query:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYA
        + Y G+ +H H CK G    +    AL+DMY K   L  A ++FDEM  RD+ +WNSL++GYAR G M+ A  LF+ M  + ++SWTA+ISGY   G Y 
Subjt:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYA

Query:  KALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHG
        +A++ F  ++   G +P+E+S+ SVLP+C+QLG+L++GK I  YA   GF K   V NA++E++++CG I +A Q+F ++  K ++ SW+TMI G A HG
Subjt:  KALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHG

Query:  RCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACS
            A++ +++M   K++P+ +TF+GLL AC+H GM  EG + F+ M   +Q+ PK+EHYGCL+D+L RAG+L+ A  + + MPM PDS IWG+LL +C 
Subjt:  RCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACS

Query:  FHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHK
          GN+++  VA + L +LEP + GNYV+L+NIYA  G W  V+RLRKM++  ++ K  G S IEV + + EF+  D S     EI  +L           
Subjt:  FHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHK

Query:  HVHQDQN
          HQDQ+
Subjt:  HVHQDQN

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic | 1.1e-108 | 40.51
Query:  MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACA
        ++++ QIHA  LR N L H ++ +  L           + ++  LF Q   P ++L+   I T S  G   + +LLY Q+ S   +PN+++F+ L  +C+
Subjt:  MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACA

Query:  SLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNG
        +      G+++H+H  K G   D +  T L+D+YAK G + SA+++FD MP R + +  ++I  YA+ G++EAA  LF+ M  R+++SW  +I GYAQ+G
Subjt:  SLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNG

Query:  KYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLA
            AL +F  L  E   KP+E+++ + L ACSQ+GAL+ G+ I  + +++    N  V   +++++++CG++EEA  VF++   ++++ +WN MI G A
Subjt:  KYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLA

Query:  VHGRCIDALQLYDQML-IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLL
        +HG   DAL+L+++M  I  ++P D+TF+G L AC H G+V EG ++FESM  ++ + PK+EHYGCLV LLGRAG+L+ AY  I+NM M  DSV+W ++L
Subjt:  VHGRCIDALQLYDQML-IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLL

Query:  GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDII
        G+C  HG+  LG+  AE L  L   N G YV+LSNIYA  GD+ GVA++R +MK   I K  G S IE+ + +HEF   DR H KS EIY +L KI + I
Subjt:  GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDII

Query:  KLHKHV
        K H +V
Subjt:  KLHKHV

Arabidopsis top hits | e value | % identity
AT1G33350.1 Pentatricopeptide repeat (PPR) superfamily protein | 2.6e-108 | 38.6
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKL-----LQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI--GHPHRCWLLYCQMCSQGC-SPNQYSFTFLFPAC
        +N LKQ+ ++ + +GL H+ FL  KL     L+L +L YA  +FD+   P+ +LY   +  +SS    H    +  +  M ++    PN + +  +  + 
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKL-----LQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSI--GHPHRCWLLYCQMCSQGC-SPNQYSFTFLFPAC

Query:  ASLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYA-KLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQ
          L + +   ++H+H  KSGF   +   TALL  YA  +  +  ARQLFDEM  R++ +W ++++GYARSG +  A+ LF  MP R+V SW A+++   Q
Subjt:  ASLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYA-KLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQ

Query:  NGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG
        NG + +A+ +F  + NE   +PNEV++  VL AC+Q G L + K I A+A       + +VSN++++L+ +CGN+EEA  VF ++ SK++L +WN+MI  
Subjt:  NGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMG

Query:  LAVHGRCIDALQLYDQML---IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIW
         A+HGR  +A+ ++++M+   I  ++PD +TF+GLL ACTHGG+V++GR  F+ M ++F + P++EHYGCL+DLLGRAG   EA  ++  M M  D  IW
Subjt:  LAVHGRCIDALQLYDQML---IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIW

Query:  GTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKI
        G+LL AC  HG+++L EVA ++L  L P N G   +++N+Y   G+W    R RKM+K  +  K  G+S IE+ + +H+F   D+SH ++ EIY +L  +
Subjt:  GTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKI

AT2G20540.1 mitochondrial editing factor 21 | 1.6e-113 | 40.63
Query:  NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLF
        N+ K+I+A  + +GL  + F++ K++    ++ D+ YA  LF+Q+  P+V+LYN  I+ ++          +Y Q+  +    P++++F F+F +CASL 
Subjt:  NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCS-PNQYSFTFLFPACASLF

Query:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYA
        + Y G+ +H H CK G    +    AL+DMY K   L  A ++FDEM  RD+ +WNSL++GYAR G M+ A  LF+ M  + ++SWTA+ISGY   G Y 
Subjt:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYA

Query:  KALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHG
        +A++ F  ++   G +P+E+S+ SVLP+C+QLG+L++GK I  YA   GF K   V NA++E++++CG I +A Q+F ++  K ++ SW+TMI G A HG
Subjt:  KALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHG

Query:  RCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACS
            A++ +++M   K++P+ +TF+GLL AC+H GM  EG + F+ M   +Q+ PK+EHYGCL+D+L RAG+L+ A  + + MPM PDS IWG+LL +C 
Subjt:  RCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACS

Query:  FHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHK
          GN+++  VA + L +LEP + GNYV+L+NIYA  G W  V+RLRKM++  ++ K  G S IEV + + EF+  D S     EI  +L           
Subjt:  FHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHK

Query:  HVHQDQN
          HQDQ+
Subjt:  HVHQDQN

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein | 3.8e-112 | 37.57
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDL----PYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLF
        +NQ+KQ+HA  +R  L     +  KL+    L      A  +F+Q+ +P+V+L N  I+  +    P++ + ++ +M   G   + +++ FL  AC+   
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDL----PYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLF

Query:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYA---------------------------------KLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGH
         +   +M+H+H  K G +SD++   AL+D Y+                                 K G LR AR+LFDEMP RD+ +WN+++ GYAR   
Subjt:  NVYPGQMLHSHFCKSGFASDMFAMTALLDMYA---------------------------------KLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGH

Query:  MEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMF----------------------IGLENE----------KGTKPNEVSIASVLPACSQLGAL
        M  A ELF KMP RN +SW+ ++ GY++ G    A  MF                       GL  E           G K +  ++ S+L AC++ G L
Subjt:  MEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMF----------------------IGLENE----------KGTKPNEVSIASVLPACSQLGAL

Query:  DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGG
         +G RI +  + +    NAYV NA+L+++A+CGN+++A  VF++I  K++L SWNTM+ GL VHG   +A++L+ +M    +RPD VTF+ +L +C H G
Subjt:  DIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHGG

Query:  MVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAL
        ++ EG   F SME  + + P++EHYGCLVDLLGR G L+EA  ++Q MPM P+ VIWG LLGAC  H  V++ +   ++L KL+P +PGNY +LSNIYA 
Subjt:  MVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAL

Query:  AGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYD
        A DW GVA +R  MK   + K +G S +E+ DGIHEF V D+SH KS +IY +L  + +
Subjt:  AGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYD

AT4G37380.1 | Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 8.0e-110 | %identity: 40.51
Query:  MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACA
        ++++ QIHA  LR N L H ++ +  L           + ++  LF Q   P ++L+   I T S  G   + +LLY Q+ S   +PN+++F+ L  +C+
Subjt:  MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACA

Query:  SLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNG
        +      G+++H+H  K G   D +  T L+D+YAK G + SA+++FD MP R + +  ++I  YA+ G++EAA  LF+ M  R+++SW  +I GYAQ+G
Subjt:  SLFNVYPGQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNG

Query:  KYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLA
            AL +F  L  E   KP+E+++ + L ACSQ+GAL+ G+ I  + +++    N  V   +++++++CG++EEA  VF++   ++++ +WN MI G A
Subjt:  KYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLA

Query:  VHGRCIDALQLYDQML-IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLL
        +HG   DAL+L+++M  I  ++P D+TF+G L AC H G+V EG ++FESM  ++ + PK+EHYGCLV LLGRAG+L+ AY  I+NM M  DSV+W ++L
Subjt:  VHGRCIDALQLYDQML-IRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLL

Query:  GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDII
        G+C  HG+  LG+  AE L  L   N G YV+LSNIYA  GD+ GVA++R +MK   I K  G S IE+ + +HEF   DR H KS EIY +L KI + I
Subjt:  GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDII

Query:  KLHKHV
        K H +V
Subjt:  KLHKHV

AT5G08510.1 | Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.7e-168 | %identity: 54.42
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP
        MN +KQ+HA+ LR G+D TK L+++LL +P+L YA  LFD       +LYNK IQ +     PH   +LY  +   G  P+ ++F F+F A AS  +  P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE
         ++LHS F +SGF SD F  T L+  YAKLG L  AR++FDEM  RD+P WN++I GY R G M+AA+ELF+ MP +NV SWT +ISG++QNG Y++AL+
Subjt:  GQMLHSHFCKSGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ +E +K  KPN +++ SVLPAC+ LG L+IG+R+E YAR NGFF N YV NA +E++++CG I+ A+++F+E+G++RNLCSWN+MI  LA HG+  +
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        AL L+ QML    +PD VTFVGLLLAC HGGMV +G++LF+SME   +++PKLEHYGC++DLLGR G+LQEAY+LI+ MPM PD+V+WGTLLGACSFHGN
Subjt:  ALQLYDQMLIRKMRPDDVTFVGLLLACTHGGMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVH
        VE+ E+A+E+LFKLEP NPGN VI+SNIYA    W GV R+RK+MK   +TK AGYSY +EVG  +H+F VED+SH +S EIY +L +I+  +KL K   
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVARLRKMMKGGHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVH

Query:  QDQNEDEEL
            + E+L
Subjt:  QDQNEDEEL
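
The middle row of each alignment block marks identical residues with the residue letter and conservative substitutions with `+`. The following is a minimal Python sketch of how a per-segment identity figure could be tallied from such a row. The helper name `midline_stats` is hypothetical and not part of the database. It assumes the 8-character row prefix has already been stripped and that the row is padded to the same width as the Query row. The %identity values reported above cover each full alignment, so a single segment will generally give a different number.

```python
def midline_stats(midline: str) -> dict:
    """Summarise one BLAST-style midline (the row between Query and Subjt).

    Assumption: letters mark identical residues, '+' marks conservative
    substitutions, and spaces mark mismatches or gap columns.
    """
    length = len(midline)
    identical = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return {
        "length": length,
        "identical": identical,
        "positives": identical + conservative,
        "pct_identity": round(100.0 * identical / length, 2) if length else 0.0,
    }


# Midline of the final AT5G08510.1 segment (Query: QDQNEDEEL), prefix removed.
stats = midline_stats("    + E+L")
print(stats)
```

Summing `identical` and `length` over all segments of one hit before dividing would give a whole-alignment figure comparable to the reported %identity.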


Sequences
CDS sequence
ATGAACCAATTGAAGCAAATTCATGCTTATAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCAC
CCTGTTTGACCAAATTCCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCACCGATGCTGGTTGCTTTACTGTCAAATGT
GTTCCCAAGGTTGCTCTCCGAATCAGTATTCATTCACCTTTCTCTTTCCCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTCATTCTCATTTCTGTAAG
TCAGGATTTGCTTCTGATATGTTTGCTATGACGGCATTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCACGCCAACTGTTTGATGAAATGCCTGTTCGAGA
TATACCCACCTGGAATTCGTTGATTGCGGGTTATGCAAGGTCCGGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAG
CTTTGATATCTGGGTATGCACAAAATGGGAAGTATGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGT
GTTCTTCCTGCCTGTTCTCAGCTTGGGGCATTGGATATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGCATATGTGAGCAATGCGGTACT
GGAATTGCATGCTAGGTGTGGGAACATCGAGGAAGCGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTG
TGCATGGAAGATGCATTGATGCTCTTCAGCTTTATGATCAAATGTTGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGA
GGCATGGTTGCAGAAGGCCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGG
AGAGCTGCAGGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGCAATGTTGAATTGG
GTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGA
TTAAGGAAGATGATGAAAGGAGGACATATTACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAA
GAGTGGTGAAATATATGCTTTACTTCATAAAATTTATGACATTATTAAACTTCATAAGCATGTACATCAGGATCAAAACGAAGATGAAGAACTACTCTATTCTTCGTAA
mRNA sequence
GTTTTATTTATTTTCTAAATTGTTGTATTTGGTGGAATTTTACCGTAGTTAATGCTAAACCATATCTGGATATCTACAAAACACCATTGTTGCACGCCGAAGCTGATTCC
GGCGTCCCGCTTGTTGTCACAACCTTCTCTGTCGCATTACTTTCACTCTAGCATGTCTTTATTTTTGGGTTCTGCTCCCTCTCGCCACTTTTCTCTTTGGTTCTCTTCGT
ACAGATTACTACCACTCTTTTTAAGTGTTCTCCTCACACCGGTGGGCGCTTGTGTGACTGGCGTGTAGATTAGAGCATGACCCTCAACTCCATTGACTCTAAATCGTATT
ACCCATCGCTTGTAGAGTTCAAATATGTTGGCAAATGGCTTTTAAGTTGGGTCAAACTAGATAATAAGAATTGTTATGAACCAATTGAAGCAAATTCATGCTTATAGCCT
CAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAGTTACCAGATCTTCCGTATGCTTGCACCCTGTTTGACCAAATTCCTAAGCCATCTGTTTATC
TCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCACCCCCACCGATGCTGGTTGCTTTACTGTCAAATGTGTTCCCAAGGTTGCTCTCCGAATCAGTATTCATTC
ACCTTTCTCTTTCCCGCGTGTGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTCATTCTCATTTCTGTAAGTCAGGATTTGCTTCTGATATGTTTGCTATGACGGC
ATTGTTGGACATGTATGCGAAATTGGGAATGTTGAGGTCTGCACGCCAACTGTTTGATGAAATGCCTGTTCGAGATATACCCACCTGGAATTCGTTGATTGCGGGTTATG
CAAGGTCCGGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTGAGAAATGTGATTTCCTGGACAGCTTTGATATCTGGGTATGCACAAAATGGGAAGTAT
GCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGTGTTCTTCCTGCCTGTTCTCAGCTTGGGGCATTGGA
TATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACGCATATGTGAGCAATGCGGTACTGGAATTGCATGCTAGGTGTGGGAACATCGAGGAAG
CGCAGCAAGTTTTTGATGAGATTGGAAGCAAAAGAAATTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGATGCATTGATGCTCTTCAGCTTTAT
GATCAAATGTTGATACGGAAAATGAGACCGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGAGGCATGGTTGCAGAAGGCCGACAACTCTTTGAATC
AATGGAGAGTAAGTTTCAAGTTGCTCCCAAATTAGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGGAGAGCTGCAGGAAGCTTACAATCTCATTCAAAACA
TGCCAATGGCTCCTGACTCTGTTATATGGGGAACGCTTTTGGGAGCTTGTAGCTTCCATGGCAATGTTGAATTGGGTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAG
CCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCGTTGGCAGGTGATTGGTCTGGAGTTGCAAGATTAAGGAAGATGATGAAAGGAGGACATATTACAAA
GAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAAGAGTGGTGAAATATATGCTTTACTTCATAAAATTT
ATGACATTATTAAACTTCATAAGCATGTACATCAGGATCAAAACGAAGATGAAGAACTACTCTATTCTTCGTAA
Protein sequence
MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFDQIPKPSVYLYNKFIQTFSSIGHPHRCWLLYCQMCSQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCK
SGFASDMFAMTALLDMYAKLGMLRSARQLFDEMPVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQNGKYAKALEMFIGLENEKGTKPNEVSIAS
VLPACSQLGALDIGKRIEAYARNNGFFKNAYVSNAVLELHARCGNIEEAQQVFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIRKMRPDDVTFVGLLLACTHG
GMVAEGRQLFESMESKFQVAPKLEHYGCLVDLLGRAGELQEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYALAGDWSGVAR
LRKMMKGGHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHKIYDIIKLHKHVHQDQNEDEELLYSS