CuGenDBv2

Pay0008525 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0008525
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr02:19357526..19370525
RNA-Seq Expression: Pay0008525
Synteny: Pay0008525
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits    e value    % identity
XP_011660274.1 pentatricopeptide repeat-containing protein At5g08510 [Cucumis sativus]    8.1e-290    95.11
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLF QIPKPSVYLYNKFIQTFSSIG P RCWLLYCQMC QGCSPNQYSFTFLFPACASLFNVYP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE
        GQMLHSHFCKSGFASD+FAMTALLDMYAKLGMLR ARQLFDEM VRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQ+GKYAKALE
Subjt:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKN YVSNA+LELHARCGNIEEA ++FDEIGSKRNLCSWNTMIMGLAVHGRCID
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLI+KMR DDVTFVGLLLACTHGGMV EGRQLFESMESKFQ+APKLEHYGCLVDLLGRAGEL EAYNLIQNMPMAPDSVIWGTLLGACSFHGN
Subjt:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVARLRKMMKG HITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLH+IY IIKLHKH HH
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH

Query:  DQNEDEELLNS
        D NEDEELL S
Subjt:  DQNEDEELLNS
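
The % identity column can be cross-checked against the alignment midline. The sketch below is an illustration in Python and is not part of CuGenDB. It assumes the usual BLAST midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that the midline is padded to the full alignment length. The function name `percent_identity` is hypothetical.

```python
def percent_identity(midline: str) -> float:
    """Percentage of alignment columns whose midline shows an identical residue.

    Assumption: letters mark identities; '+' and spaces mark non-identical columns.
    """
    if not midline:
        raise ValueError("empty midline")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / len(midline), 2)


# Final row of the alignment above (query row and its midline).
query = "DQNEDEELLNS"
midline = "D NEDEELL S"
assert len(query) == len(midline)  # midline must cover every alignment column

print(percent_identity(midline))  # 9 identical columns out of 11
```

Concatenating all six midline rows of the alignment above gives 486 identical positions out of 511, which the same calculation rounds to 95.11, in line with the table value. Trailing spaces on a midline row may be lost in plain-text copies, so each row should be padded to the length of its query row first.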

XP_022147487.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Momordica charantia]    1.7e-258    84.34
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAY LR+G+D+TKFLIEKLLQ+P+LPYAC LF  IPKPSV+LYNKFIQ++SS GQ  RCW LY QMC QGCSPNQ+SFTFLF ACASL NV+P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE
        GQMLH+HFCKSGFASD+FA+TALLDMYAKLGMLR ARQLFDEM VRDIPTWNS++AGY+RSG M AALELF++MPVRNV+SWTALISGY+Q+GKYAKALE
Subjt:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENE+G KPNEV++ASVLPAC+QLGALDIG+RIEAYARNNGFFKN+YVSNAILE+HARCGNIEEA ++FDEIGSKRNLCSWNTMIMGLAVHGRC  
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        A++LYDQML Q++R DDVTF+GLLLACTHGGMV +GRQLFESMESKFQIAPKLEHYGCLVDLLGRAGEL EAYNLIQ MPM PDSVIWG LLGACSFHG+
Subjt:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH
        VEL EVAAESLFKLEPWNPGNYVILSNIYA AGDW GVARLRK MKG HITKRAGYSYIEVGDGIHEFIVEDRSHLKS EIYALLH IY+IIKL K A H
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH

Query:  DQNEDEELLNS
         QNE EELL+S
Subjt:  DQNEDEELLNS

XP_022979295.1 pentatricopeptide repeat-containing protein At5g08510 isoform X1 [Cucurbita maxima]    1.6e-261    86.44
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLF  IPKPSV+LYNKFIQTFSSIG P RCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE
        GQMLHSHFCKSGFASD+FA+TALLDMY KLG+L+ ARQLFDEM VRDIPTWNS+IAGYARSG+M AALELF+KMP+RNVISWTALISGYAQ+GKYAKALE
Subjt:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+QLGALDIGKRIE YAR NGFFKN+YVSNAILE+HARCGNIEEA R+FDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLIQ+ R DDVTFVGLLLACTHGGMV +GRQ+FESME KFQIAPKLEHYGCLVDLLGRAGE+ EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR+RKMMKG HI KRAG SYIEVGDGIHEFIVEDRSH KS EIYALLH IYAIIKL     H
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH

Query:  DQNEDEELL
         QNE EE L
Subjt:  DQNEDEELL

XP_023527365.1 pentatricopeptide repeat-containing protein At5g08510 [Cucurbita pepo subsp. pepo]    2.5e-259    85.66
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLF  IPKPSV+LYNKFIQTFSSIG   RCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE
        GQMLHSHFCKSGFASD+FA+TALLDMY KLG+L+ ARQLFDE  VRDIPTWNS+IAGYARSG+M AAL+LF+KMP RNVISWTALISGYAQ+GKYAKALE
Subjt:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+ LGALDIGKRIEAYAR NGFFKN+YVSNAILE+HARCGNIEEA R+FDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQML+Q+ R DDVTFVGLLLACTHGGMV +GRQLFESME KFQIAPKLEHYGCLVDLLGRAGE+ EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR+RKMMKG HI KRAG SYIEVGDGIHEFIVEDRSH KS EIYALLH +YAIIKL     H
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH

Query:  DQNEDEELL
        +QNE EE L
Subjt:  DQNEDEELL

XP_038877076.1 pentatricopeptide repeat-containing protein At5g08510-like [Benincasa hispida]    2.8e-266    87.7
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP
        M QLKQIHAYSLRNGLD+TKFLIEKLLQ P+LPYACTLF  IP+PSV+LYNKFIQTFSSIGQP RCWLLY QMC QGCSPNQ+SFTFLF ACASL N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE
        GQMLHSHFCKSGFASD+FA TALLDMYAKLGMLR ARQLFDEM VRDIPTWNS+IAGYARSG MEAA +LF+KMPVR+V+SWT LISGYAQ+GKYAKALE
Subjt:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFK-NMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCI
        +F+ LENEKG KPNEV+IASVLPAC+QLGALDIGKRIEAYARNNGFFK N YVSNAILE+HARCGNI EA R+FDEIGSKRNLCSWNTMIMGLAVHGRC 
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFK-NMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCI

Query:  DALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHG
        DALQLYDQMLI++MR DDVTFVGLLLACTHGGMV EGRQ+FESMES FQIAPKLEHYGCLVDLLGRAGEL EAY LIQNMPMAPDSVIWG LLGACSFHG
Subjt:  DALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHG

Query:  NVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAH
        NVELGEVAAESLF LEPWNPGNYVILSNIYA AGDWSGVARLRKMMKG  ITKRAGYSYIEVGDGIHEFIVEDRSHL+SGEIYALLH IY IIKLHK+ H
Subjt:  NVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAH

Query:  HDQNEDEELLNS
        H+QNEDE L +S
Subjt:  HDQNEDEELLNS

TrEMBL top hits    e value    % identity
A0A0A0LY28 Uncharacterized protein    3.9e-290    95.11
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLF QIPKPSVYLYNKFIQTFSSIG P RCWLLYCQMC QGCSPNQYSFTFLFPACASLFNVYP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE
        GQMLHSHFCKSGFASD+FAMTALLDMYAKLGMLR ARQLFDEM VRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQ+GKYAKALE
Subjt:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKN YVSNA+LELHARCGNIEEA ++FDEIGSKRNLCSWNTMIMGLAVHGRCID
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLI+KMR DDVTFVGLLLACTHGGMV EGRQLFESMESKFQ+APKLEHYGCLVDLLGRAGEL EAYNLIQNMPMAPDSVIWGTLLGACSFHGN
Subjt:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVARLRKMMKG HITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLH+IY IIKLHKH HH
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH

Query:  DQNEDEELLNS
        D NEDEELL S
Subjt:  DQNEDEELLNS

A0A6J1D160 pentatricopeptide repeat-containing protein At5g08510 isoform X2    2.2e-256    84.52
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAY LR+G+D+TKFLIEKLLQ+P+LPYAC LF  IPKPSV+LYNKFIQ++SS GQ  RCW LY QMC QGCSPNQ+SFTFLF ACASL NV+P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE
        GQMLH+HFCKSGFASD+FA+TALLDMYAKLGMLR ARQLFDEM VRDIPTWNS++AGY+RSG M AALELF++MPVRNV+SWTALISGY+Q+GKYAKALE
Subjt:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENE+G KPNEV++ASVLPAC+QLGALDIG+RIEAYARNNGFFKN+YVSNAILE+HARCGNIEEA ++FDEIGSKRNLCSWNTMIMGLAVHGRC  
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        A++LYDQML Q++R DDVTF+GLLLACTHGGMV +GRQLFESMESKFQIAPKLEHYGCLVDLLGRAGEL EAYNLIQ MPM PDSVIWG LLGACSFHG+
Subjt:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH
        VEL EVAAESLFKLEPWNPGNYVILSNIYA AGDW GVARLRK MKG HITKRAGYSYIEVGDGIHEFIVEDRSHLKS EIYALLH IY+IIKL K A H
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH

Query:  DQNE
         QNE
Subjt:  DQNE

A0A6J1D2G9 pentatricopeptide repeat-containing protein At5g08510 isoform X1    8.0e-259    84.34
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAY LR+G+D+TKFLIEKLLQ+P+LPYAC LF  IPKPSV+LYNKFIQ++SS GQ  RCW LY QMC QGCSPNQ+SFTFLF ACASL NV+P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE
        GQMLH+HFCKSGFASD+FA+TALLDMYAKLGMLR ARQLFDEM VRDIPTWNS++AGY+RSG M AALELF++MPVRNV+SWTALISGY+Q+GKYAKALE
Subjt:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENE+G KPNEV++ASVLPAC+QLGALDIG+RIEAYARNNGFFKN+YVSNAILE+HARCGNIEEA ++FDEIGSKRNLCSWNTMIMGLAVHGRC  
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        A++LYDQML Q++R DDVTF+GLLLACTHGGMV +GRQLFESMESKFQIAPKLEHYGCLVDLLGRAGEL EAYNLIQ MPM PDSVIWG LLGACSFHG+
Subjt:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH
        VEL EVAAESLFKLEPWNPGNYVILSNIYA AGDW GVARLRK MKG HITKRAGYSYIEVGDGIHEFIVEDRSHLKS EIYALLH IY+IIKL K A H
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH

Query:  DQNEDEELLNS
         QNE EELL+S
Subjt:  DQNEDEELLNS

A0A6J1E8Q2 pentatricopeptide repeat-containing protein At5g08510 isoform X1    1.8e-258    85.46
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLI+KLLQ+P+LPYACTLF  IPKPSV+LYNKFIQTFSSIG   RCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE
        GQMLHSHF KSGFASD+FA+TALLDMY KLG+L+ ARQLFDEM VRDIPTWNS+IAGYARSG+M AALELF+KMP RNVISWTALISGYAQ+GKYAKALE
Subjt:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+ LGALDIGKRIEAYAR NGFFKN+YVSNAILE+HARCGNIEEA R+FDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLIQ+ R DDVTFVGLLLACTHGGMV +GRQLFESME KFQIAPKLEHYGCLVDLLGRAGE+ EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR RKMMKG H+ KRAG SYIEVGDGIHEF+VEDRSH KS EIYALLH +YAIIKL     H
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH

Query:  DQNEDEELL
        +QNE EE L
Subjt:  DQNEDEELL

A0A6J1ISU4 pentatricopeptide repeat-containing protein At5g08510 isoform X1    7.7e-262    86.44
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP
        MNQLKQIHAYSLRNG+D+TKFLIEKLLQ+P+LPYACTLF  IPKPSV+LYNKFIQTFSSIG P RCWLLY QMC QGCSPN +SFTFLFPACAS  N YP
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE
        GQMLHSHFCKSGFASD+FA+TALLDMY KLG+L+ ARQLFDEM VRDIPTWNS+IAGYARSG+M AALELF+KMP+RNVISWTALISGYAQ+GKYAKALE
Subjt:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ LENEKGTKPNEV+IASVLPAC+QLGALDIGKRIE YAR NGFFKN+YVSNAILE+HARCGNIEEA R+FDEIGSKRNLCSWNTMIMGLAVHGRC D
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        ALQLYDQMLIQ+ R DDVTFVGLLLACTHGGMV +GRQ+FESME KFQIAPKLEHYGCLVDLLGRAGE+ EAY+LIQ+MPM PDSVIWG LLGACSFHGN
Subjt:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH
        VELGEVAAESLFKLEPWNPGNYVILSNIYA AGDWSGVAR+RKMMKG HI KRAG SYIEVGDGIHEFIVEDRSH KS EIYALLH IYAIIKL     H
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHH

Query:  DQNEDEELL
         QNE EE L
Subjt:  DQNEDEELL

SwissProt top hits    e value    % identity
Q9C501 Pentatricopeptide repeat-containing protein At1g33350    1.7e-104    38.23
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKL-----LQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSI--GQPQRCWLLYCQMCFQGC-SPNQYSFTFLFPAC
        +N LKQ+ ++ + +GL H+ FL  KL     L+L +L YA  +F +   P+ +LY   +  +SS         +  +  M  +    PN + +  +  + 
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKL-----LQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSI--GQPQRCWLLYCQMCFQGC-SPNQYSFTFLFPAC

Query:  ASLFNVYPGQMLHSHFCKSGFASDIFAMTALLDMYA-KLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQ
          L + +   ++H+H  KSGF   +   TALL  YA  +  +  ARQLFDEMS R++ +W ++++GYARSG +  A+ LF  MP R+V SW A+++   Q
Subjt:  ASLFNVYPGQMLHSHFCKSGFASDIFAMTALLDMYA-KLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQ

Query:  HGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMG
        +G + +A+ +F  + NE   +PNEV++  VL AC+Q G L + K I A+A       +++VSN++++L+ +CGN+EEA  +F ++ SK++L +WN+MI  
Subjt:  HGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMG

Query:  LAVHGRCIDALQLYDQML---IQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIW
         A+HGR  +A+ ++++M+   I  ++ D +TF+GLL ACTHGG+V++GR  F+ M ++F I P++EHYGCL+DLLGRAG   EA  ++  M M  D  IW
Subjt:  LAVHGRCIDALQLYDQML---IQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIW

Query:  GTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALL
        G+LL AC  HG+++L EVA ++L  L P N G   +++N+Y + G+W    R RKM+K  +  K  G+S IE+ + +H+F   D+SH ++ EIY +L
Subjt:  GTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALL

Q9FNN7 Pentatricopeptide repeat-containing protein At5g08510    9.0e-167    54.62
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP
        MN +KQ+HA+ LR G+D TK L+++LL +P+L YA  LF        +LYNK IQ +    QP    +LY  + F G  P+ ++F F+F A AS  +  P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE
         ++LHS F +SGF SD F  T L+  YAKLG L CAR++FDEMS RD+P WN++I GY R G M+AA+ELF+ MP +NV SWT +ISG++Q+G Y++AL+
Subjt:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ +E +K  KPN +++ SVLPAC+ LG L+IG+R+E YAR NGFF N+YV NA +E++++CG I+ A R+F+E+G++RNLCSWN+MI  LA HG+  +
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        AL L+ QML +  + D VTFVGLLLAC HGGMV +G++LF+SME   +I+PKLEHYGC++DLLGR G+L EAY+LI+ MPM PD+V+WGTLLGACSFHGN
Subjt:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAH
        VE+ E+A+E+LFKLEP NPGN VI+SNIYA    W GV R+RK+MK   +TK AGYSY +EVG  +H+F VED+SH +S EIY +L  I+  +KL K   
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAH

Query:  HDQNEDEEL
            + E+L
Subjt:  HDQNEDEEL

Q9LS72 Pentatricopeptide repeat-containing protein At3g29230    4.3e-108    37.55
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDL----PYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLF
        +NQ+KQ+HA  +R  L     +  KL+    L      A  +F Q+ +P+V+L N  I+  +   QP + + ++ +M   G   + +++ FL  AC+   
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDL----PYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLF

Query:  NVYPGQMLHSHFCKSGFASDIFAMTALLDMYA---------------------------------KLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGH
         +   +M+H+H  K G +SDI+   AL+D Y+                                 K G LR AR+LFDEM  RD+ +WN+++ GYAR   
Subjt:  NVYPGQMLHSHFCKSGFASDIFAMTALLDMYA---------------------------------KLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGH

Query:  MEAALELFNKMPVRNVISWTALISGYAQHGKYAKALEMF----------------------IGLENE----------KGTKPNEVSIASVLPACSQLGAL
        M  A ELF KMP RN +SW+ ++ GY++ G    A  MF                       GL  E           G K +  ++ S+L AC++ G L
Subjt:  MEAALELFNKMPVRNVISWTALISGYAQHGKYAKALEMF----------------------IGLENE----------KGTKPNEVSIASVLPACSQLGAL

Query:  DIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIQKMRLDDVTFVGLLLACTHGG
         +G RI +  + +    N YV NA+L+++A+CGN+++A  +F++I  K++L SWNTM+ GL VHG   +A++L+ +M  + +R D VTF+ +L +C H G
Subjt:  DIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIQKMRLDDVTFVGLLLACTHGG

Query:  MVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAK
        ++ EG   F SME  + + P++EHYGCLVDLLGR G L EA  ++Q MPM P+ VIWG LLGAC  H  V++ +   ++L KL+P +PGNY +LSNIYA 
Subjt:  MVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAK

Query:  AGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALL
        A DW GVA +R  MK   + K +G S +E+ DGIHEF V D+SH KS +IY +L
Subjt:  AGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALL

Q9SIL5 Pentatricopeptide repeat-containing protein At2g20540    1.6e-110    41.42
Query:  NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCS-PNQYSFTFLFPACASLF
        N+ K+I+A  + +GL  + F++ K++    ++ D+ YA  LF Q+  P+V+LYN  I+ ++          +Y Q+  +    P++++F F+F +CASL 
Subjt:  NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCS-PNQYSFTFLFPACASLF

Query:  NVYPGQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYA
        + Y G+ +H H CK G    +    AL+DMY K   L  A ++FDEM  RD+ +WNSL++GYAR G M+ A  LF+ M  + ++SWTA+ISGY   G Y 
Subjt:  NVYPGQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYA

Query:  KALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHG
        +A++ F  ++   G +P+E+S+ SVLP+C+QLG+L++GK I  YA   GF K   V NA++E++++CG I +A ++F ++  K ++ SW+TMI G A HG
Subjt:  KALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHG

Query:  RCIDALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACS
            A++ +++M   K++ + +TF+GLL AC+H GM  EG + F+ M   +QI PK+EHYGCL+D+L RAG+L  A  + + MPM PDS IWG+LL +C 
Subjt:  RCIDALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACS

Query:  FHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRS
          GN+++  VA + L +LEP + GNYV+L+NIYA  G W  V+RLRKM++  ++ K  G S IEV + + EF+  D S
Subjt:  FHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRS

Q9SZT8 Pentatricopeptide repeat-containing protein ELI1, chloroplastic    2.6e-105    39.76
Query:  MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACA
        ++++ QIHA  LR N L H ++ +  L           + ++  LF Q   P ++L+   I T S  G   + +LLY Q+     +PN+++F+ L  +C+
Subjt:  MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACA

Query:  SLFNVYPGQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHG
        +      G+++H+H  K G   D +  T L+D+YAK G +  A+++FD M  R + +  ++I  YA+ G++EAA  LF+ M  R+++SW  +I GYAQHG
Subjt:  SLFNVYPGQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHG

Query:  KYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLA
            AL +F  L  E   KP+E+++ + L ACSQ+GAL+ G+ I  + +++    N+ V   +++++++CG++EEA  +F++   ++++ +WN MI G A
Subjt:  KYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLA

Query:  VHGRCIDALQLYDQML-IQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLL
        +HG   DAL+L+++M  I  ++  D+TF+G L AC H G+V EG ++FESM  ++ I PK+EHYGCLV LLGRAG+L  AY  I+NM M  DSV+W ++L
Subjt:  VHGRCIDALQLYDQML-IQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLL

Query:  GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAII
        G+C  HG+  LG+  AE L  L   N G YV+LSNIYA  GD+ GVA++R +MK   I K  G S IE+ + +HEF   DR H KS EIY +L +I   I
Subjt:  GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAII

Query:  KLH
        K H
Subjt:  KLH

Arabidopsis top hits    e value    % identity
AT1G33350.1 Pentatricopeptide repeat (PPR) superfamily protein    1.2e-105    38.23
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKL-----LQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSI--GQPQRCWLLYCQMCFQGC-SPNQYSFTFLFPAC
        +N LKQ+ ++ + +GL H+ FL  KL     L+L +L YA  +F +   P+ +LY   +  +SS         +  +  M  +    PN + +  +  + 
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKL-----LQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSI--GQPQRCWLLYCQMCFQGC-SPNQYSFTFLFPAC

Query:  ASLFNVYPGQMLHSHFCKSGFASDIFAMTALLDMYA-KLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQ
          L + +   ++H+H  KSGF   +   TALL  YA  +  +  ARQLFDEMS R++ +W ++++GYARSG +  A+ LF  MP R+V SW A+++   Q
Subjt:  ASLFNVYPGQMLHSHFCKSGFASDIFAMTALLDMYA-KLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQ

Query:  HGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMG
        +G + +A+ +F  + NE   +PNEV++  VL AC+Q G L + K I A+A       +++VSN++++L+ +CGN+EEA  +F ++ SK++L +WN+MI  
Subjt:  HGKYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMG

Query:  LAVHGRCIDALQLYDQML---IQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIW
         A+HGR  +A+ ++++M+   I  ++ D +TF+GLL ACTHGG+V++GR  F+ M ++F I P++EHYGCL+DLLGRAG   EA  ++  M M  D  IW
Subjt:  LAVHGRCIDALQLYDQML---IQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIW

Query:  GTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALL
        G+LL AC  HG+++L EVA ++L  L P N G   +++N+Y + G+W    R RKM+K  +  K  G+S IE+ + +H+F   D+SH ++ EIY +L
Subjt:  GTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALL

AT2G20540.1 mitochondrial editing factor 21    1.1e-111    41.42
Query:  NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCS-PNQYSFTFLFPACASLF
        N+ K+I+A  + +GL  + F++ K++    ++ D+ YA  LF Q+  P+V+LYN  I+ ++          +Y Q+  +    P++++F F+F +CASL 
Subjt:  NQLKQIHAYSLRNGLDHTKFLIEKLL----QLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCS-PNQYSFTFLFPACASLF

Query:  NVYPGQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYA
        + Y G+ +H H CK G    +    AL+DMY K   L  A ++FDEM  RD+ +WNSL++GYAR G M+ A  LF+ M  + ++SWTA+ISGY   G Y 
Subjt:  NVYPGQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYA

Query:  KALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHG
        +A++ F  ++   G +P+E+S+ SVLP+C+QLG+L++GK I  YA   GF K   V NA++E++++CG I +A ++F ++  K ++ SW+TMI G A HG
Subjt:  KALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHG

Query:  RCIDALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACS
            A++ +++M   K++ + +TF+GLL AC+H GM  EG + F+ M   +QI PK+EHYGCL+D+L RAG+L  A  + + MPM PDS IWG+LL +C 
Subjt:  RCIDALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACS

Query:  FHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRS
          GN+++  VA + L +LEP + GNYV+L+NIYA  G W  V+RLRKM++  ++ K  G S IEV + + EF+  D S
Subjt:  FHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRS

AT3G29230.1 Tetratricopeptide repeat (TPR)-like superfamily protein    3.0e-109    37.55
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDL----PYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLF
        +NQ+KQ+HA  +R  L     +  KL+    L      A  +F Q+ +P+V+L N  I+  +   QP + + ++ +M   G   + +++ FL  AC+   
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDL----PYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLF

Query:  NVYPGQMLHSHFCKSGFASDIFAMTALLDMYA---------------------------------KLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGH
         +   +M+H+H  K G +SDI+   AL+D Y+                                 K G LR AR+LFDEM  RD+ +WN+++ GYAR   
Subjt:  NVYPGQMLHSHFCKSGFASDIFAMTALLDMYA---------------------------------KLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGH

Query:  MEAALELFNKMPVRNVISWTALISGYAQHGKYAKALEMF----------------------IGLENE----------KGTKPNEVSIASVLPACSQLGAL
        M  A ELF KMP RN +SW+ ++ GY++ G    A  MF                       GL  E           G K +  ++ S+L AC++ G L
Subjt:  MEAALELFNKMPVRNVISWTALISGYAQHGKYAKALEMF----------------------IGLENE----------KGTKPNEVSIASVLPACSQLGAL

Query:  DIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIQKMRLDDVTFVGLLLACTHGG
         +G RI +  + +    N YV NA+L+++A+CGN+++A  +F++I  K++L SWNTM+ GL VHG   +A++L+ +M  + +R D VTF+ +L +C H G
Subjt:  DIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIQKMRLDDVTFVGLLLACTHGG

Query:  MVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAK
        ++ EG   F SME  + + P++EHYGCLVDLLGR G L EA  ++Q MPM P+ VIWG LLGAC  H  V++ +   ++L KL+P +PGNY +LSNIYA 
Subjt:  MVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAK

Query:  AGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALL
        A DW GVA +R  MK   + K +G S +E+ DGIHEF V D+SH KS +IY +L
Subjt:  AGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALL

AT4G37380.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 1.8e-106; % identity: 39.76)
Query:  MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACA
        ++++ QIHA  LR N L H ++ +  L           + ++  LF Q   P ++L+   I T S  G   + +LLY Q+     +PN+++F+ L  +C+
Subjt:  MNQLKQIHAYSLR-NGLDHTKFLIEKL------LQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACA

Query:  SLFNVYPGQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHG
        +      G+++H+H  K G   D +  T L+D+YAK G +  A+++FD M  R + +  ++I  YA+ G++EAA  LF+ M  R+++SW  +I GYAQHG
Subjt:  SLFNVYPGQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHG

Query:  KYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLA
            AL +F  L  E   KP+E+++ + L ACSQ+GAL+ G+ I  + +++    N+ V   +++++++CG++EEA  +F++   ++++ +WN MI G A
Subjt:  KYAKALEMFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLA

Query:  VHGRCIDALQLYDQML-IQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLL
        +HG   DAL+L+++M  I  ++  D+TF+G L AC H G+V EG ++FESM  ++ I PK+EHYGCLV LLGRAG+L  AY  I+NM M  DSV+W ++L
Subjt:  VHGRCIDALQLYDQML-IQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLL

Query:  GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAII
        G+C  HG+  LG+  AE L  L   N G YV+LSNIYA  GD+ GVA++R +MK   I K  G S IE+ + +HEF   DR H KS EIY +L +I   I
Subjt:  GACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAII

Query:  KLH
        K H
Subjt:  KLH

AT5G08510.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 6.4e-168; % identity: 54.62)
Query:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP
        MN +KQ+HA+ LR G+D TK L+++LL +P+L YA  LF        +LYNK IQ +    QP    +LY  + F G  P+ ++F F+F A AS  +  P
Subjt:  MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYP

Query:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE
         ++LHS F +SGF SD F  T L+  YAKLG L CAR++FDEMS RD+P WN++I GY R G M+AA+ELF+ MP +NV SWT +ISG++Q+G Y++AL+
Subjt:  GQMLHSHFCKSGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALE

Query:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID
        MF+ +E +K  KPN +++ SVLPAC+ LG L+IG+R+E YAR NGFF N+YV NA +E++++CG I+ A R+F+E+G++RNLCSWN+MI  LA HG+  +
Subjt:  MFIGLENEKGTKPNEVSIASVLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCID

Query:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN
        AL L+ QML +  + D VTFVGLLLAC HGGMV +G++LF+SME   +I+PKLEHYGC++DLLGR G+L EAY+LI+ MPM PD+V+WGTLLGACSFHGN
Subjt:  ALQLYDQMLIQKMRLDDVTFVGLLLACTHGGMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGN

Query:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAH
        VE+ E+A+E+LFKLEP NPGN VI+SNIYA    W GV R+RK+MK   +TK AGYSY +EVG  +H+F VED+SH +S EIY +L  I+  +KL K   
Subjt:  VELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVARLRKMMKGAHITKRAGYSY-IEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAH

Query:  HDQNEDEEL
            + E+L
Subjt:  HDQNEDEEL
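
The % identity values listed for each hit can be related to the alignment blocks through the middle line, where a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal sketch of that counting, assuming the Query and middle lines have already been stripped of their "Query:" prefix and leading padding; the function name `alignment_stats` and the returned keys are illustrative and not part of the database.

```python
def alignment_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST alignment segment.

    query   -- the aligned query residues (gaps included as '-')
    midline -- the match line under it: letter = identical residue,
               '+' = conservative substitution, ' ' = mismatch or gap
    """
    length = len(query)
    # Trailing spaces on the match line are often lost in plain-text
    # copies, so pad it back to the length of the query line.
    midline = midline.ljust(length)[:length]
    identities = sum(ch.isalpha() for ch in midline)
    conservative = midline.count("+")
    pct_identity = 100.0 * identities / length if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": identities + conservative,
        "pct_identity": pct_identity,
    }


# First ten columns of the AT5G08510.1 alignment shown above.
stats = alignment_stats("MNQLKQIHAY", "MN +KQ+HA+")
print(stats["identities"], stats["positives"], stats["pct_identity"])
```

The reported % identity is computed over the whole alignment, so a single segment such as the one above will not reproduce the listed value; the counts from all segments of a hit would need to be summed first.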


Sequences
CDS sequence
ATGAACCAATTGAAGCAAATTCATGCTTACAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAATTACCAGATCTTCCGTATGCTTGCAC
CCTGTTTGTCCAAATTCCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCAACCCCAACGATGCTGGTTGCTTTACTGTCAAATGT
GTTTCCAAGGTTGCTCTCCAAATCAGTATTCATTCACCTTTCTCTTTCCCGCCTGCGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTCATTCTCATTTCTGTAAG
TCAGGATTTGCTTCTGATATATTCGCTATGACGGCATTGTTGGACATGTATGCAAAATTGGGAATGTTGAGGTGTGCGCGCCAACTGTTTGATGAAATGTCTGTTCGAGA
TATACCCACCTGGAATTCGTTGATTGCGGGTTATGCAAGGTCCGGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTAAGAAATGTGATTTCCTGGACAG
CGTTGATATCTGGGTATGCACAACATGGGAAGTACGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGT
GTTCTTCCTGCCTGTTCTCAGCTTGGGGCTTTGGATATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACATGTATGTGAGCAATGCGATACT
GGAATTGCATGCTAGGTGCGGGAACATCGAGGAGGCGGGGCGAATTTTTGATGAGATTGGAAGCAAACGAAACTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTG
TGCATGGAAGATGCATTGATGCTCTTCAGCTTTATGATCAAATGTTGATACAGAAAATGAGACTGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGA
GGCATGGTTACAGAAGGTCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAATTGCTCCCAAATTGGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGG
AGAGCTACATGAAGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACTCTTTTGGGAGCTTGTAGCTTCCATGGAAATGTTGAATTGG
GTGAAGTAGCAGCTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCCAAGGCTGGTGATTGGTCTGGAGTTGCAAGG
TTAAGGAAGATGATGAAAGGAGCACATATTACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAA
GAGTGGTGAAATATATGCTTTACTTCATAGGATTTATGCCATTATTAAACTTCATAAGCATGCACATCACGATCAAAACGAAGATGAAGAACTACTCAATTCTTAG
mRNA sequence
TTCGGTTTCTGCTCCCTCCCGCCACTTTTCTCTTTGGTTCTCTTCGCACAGATTACTACCACTCGTTTTAAGTGTTCTCCTCACACCGGTGGACGCTTATGTGACTGGCG
TGCAGATTAGAGCATGACCCACAGCTCCATTGACTCGTATTACCCAGAGCTTGTAGAGTTCAAATATGTAAGTTCCAACCTATTTGAACTCGAAACATCTACAAATTCAT
TTTTCTTTTTCAGCAGAAATAAACTCATTTATTCCCTCTCGAACTCGATTTTATTTGTTGGTTAGACTCTAACCATCTTCAAATTTATTCTTCCTTTTGTGATTTCGAGC
TAATTTTACACAGATGACGAAACCCAAGAATGGGTGAAAAGCCCCCTAGGTTGGCAAATGGCTTTTAAGTTGGGTCAAACTAGATAATAAGAACTGTTATGAACCAATTG
AAGCAAATTCATGCTTACAGCCTCAGAAACGGCCTAGATCACACAAAGTTCCTCATTGAAAAGCTTCTGCAATTACCAGATCTTCCGTATGCTTGCACCCTGTTTGTCCA
AATTCCTAAGCCATCTGTTTATCTCTACAACAAGTTCATTCAAACATTTTCTTCAATTGGTCAACCCCAACGATGCTGGTTGCTTTACTGTCAAATGTGTTTCCAAGGTT
GCTCTCCAAATCAGTATTCATTCACCTTTCTCTTTCCCGCCTGCGCTTCCCTTTTTAATGTTTACCCAGGTCAGATGCTTCATTCTCATTTCTGTAAGTCAGGATTTGCT
TCTGATATATTCGCTATGACGGCATTGTTGGACATGTATGCAAAATTGGGAATGTTGAGGTGTGCGCGCCAACTGTTTGATGAAATGTCTGTTCGAGATATACCCACCTG
GAATTCGTTGATTGCGGGTTATGCAAGGTCCGGGCATATGGAGGCTGCGTTAGAATTGTTCAACAAAATGCCGGTAAGAAATGTGATTTCCTGGACAGCGTTGATATCTG
GGTATGCACAACATGGGAAGTACGCGAAGGCCTTGGAGATGTTTATAGGATTGGAAAATGAGAAAGGCACTAAGCCAAATGAGGTGTCCATAGCAAGTGTTCTTCCTGCC
TGTTCTCAGCTTGGGGCTTTGGATATTGGGAAGAGGATTGAAGCATATGCAAGAAATAATGGATTTTTCAAAAACATGTATGTGAGCAATGCGATACTGGAATTGCATGC
TAGGTGCGGGAACATCGAGGAGGCGGGGCGAATTTTTGATGAGATTGGAAGCAAACGAAACTTGTGCTCGTGGAATACCATGATAATGGGATTGGCTGTGCATGGAAGAT
GCATTGATGCTCTTCAGCTTTATGATCAAATGTTGATACAGAAAATGAGACTGGACGACGTGACATTTGTAGGGCTTCTCTTGGCTTGCACACACGGAGGCATGGTTACA
GAAGGTCGACAACTCTTTGAATCAATGGAGAGTAAGTTTCAAATTGCTCCCAAATTGGAGCACTATGGCTGCTTGGTAGATTTATTAGGGAGAGCTGGAGAGCTACATGA
AGCTTACAATCTCATTCAAAACATGCCAATGGCTCCTGACTCTGTTATATGGGGAACTCTTTTGGGAGCTTGTAGCTTCCATGGAAATGTTGAATTGGGTGAAGTAGCAG
CTGAGTCCCTCTTCAAGCTTGAGCCATGGAACCCTGGAAATTATGTCATTCTTTCTAACATTTACGCCAAGGCTGGTGATTGGTCTGGAGTTGCAAGGTTAAGGAAGATG
ATGAAAGGAGCACATATTACAAAGAGAGCAGGATATAGTTATATTGAAGTGGGAGATGGGATTCATGAGTTCATTGTAGAAGATAGATCACATTTGAAGAGTGGTGAAAT
ATATGCTTTACTTCATAGGATTTATGCCATTATTAAACTTCATAAGCATGCACATCACGATCAAAACGAAGATGAAGAACTACTCAATTCTTAGTAAGTTTTTGGCGGTT
TGATTGAAATGTTTATATTGCTTCATGGTTATGGTATTAGATATGATTATGGTTTATAAGGACAATCGTCAAAAATGGTAAAATTGGTAAAATATTTCTACTTCATGGCA
AAATCGTAAGTAGTCGAATTTTGTACTTGGTTTGTG
Protein sequence
MNQLKQIHAYSLRNGLDHTKFLIEKLLQLPDLPYACTLFVQIPKPSVYLYNKFIQTFSSIGQPQRCWLLYCQMCFQGCSPNQYSFTFLFPACASLFNVYPGQMLHSHFCK
SGFASDIFAMTALLDMYAKLGMLRCARQLFDEMSVRDIPTWNSLIAGYARSGHMEAALELFNKMPVRNVISWTALISGYAQHGKYAKALEMFIGLENEKGTKPNEVSIAS
VLPACSQLGALDIGKRIEAYARNNGFFKNMYVSNAILELHARCGNIEEAGRIFDEIGSKRNLCSWNTMIMGLAVHGRCIDALQLYDQMLIQKMRLDDVTFVGLLLACTHG
GMVTEGRQLFESMESKFQIAPKLEHYGCLVDLLGRAGELHEAYNLIQNMPMAPDSVIWGTLLGACSFHGNVELGEVAAESLFKLEPWNPGNYVILSNIYAKAGDWSGVAR
LRKMMKGAHITKRAGYSYIEVGDGIHEFIVEDRSHLKSGEIYALLHRIYAIIKLHKHAHHDQNEDEELLNS
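
The CDS and protein sequences above can be checked against each other by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; the names `translate_cds` and `CODON_TABLE` are illustrative, and unknown codons are rendered as "X".

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...), one amino-acid letter per codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_cds(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and spaces left over from the wrapped listing.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the reading frame
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate_cds("ATGAACCAATTGAAGCAAATTCATGCTTAC"))  # MNQLKQIHAY
```

Passing the full CDS listing (with its line breaks) to `translate_cds` and comparing the result with the protein listing, after joining its lines, gives a quick consistency check between the two records.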