; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh12G006630 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh12G006630
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationCmo_Chr12:4346379..4347902
RNA-Seq ExpressionCmoCh12G006630
SyntenyCmoCh12G006630
Gene Ontology termsGO:1900865 - chloroplast RNA modification (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585842.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]4.4e-29699.01Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
        MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPY+LKACAR
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR

Query:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
        IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
Subjt:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC

Query:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
        FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
Subjt:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG

Query:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
        DGEGAL+LFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
Subjt:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL

Query:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS
        SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALG HGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTH+QI DIHLVLEELNKQIS
Subjt:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS

Query:  QPDLLYF
        QPDLLYF
Subjt:  QPDLLYF

XP_022951910.1 pentatricopeptide repeat-containing protein At2g20540-like [Cucurbita moschata]3.6e-298100Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
        MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR

Query:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
        IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
Subjt:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC

Query:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
        FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
Subjt:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG

Query:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
        DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
Subjt:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL

Query:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS
        SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS
Subjt:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS

Query:  QPDLLYF
        QPDLLYF
Subjt:  QPDLLYF

XP_022973287.1 pentatricopeptide repeat-containing protein At2g20540-like [Cucurbita maxima]2.3e-28996.65Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
        MKQLKQAHAQVLTSGLGSNSFALSR+LDFCAEPRHGSLSHALKL++HIQHPTICICNTMIKAFLLRGEFLNAIV+YSAMRRN IAPDSYTHPYVLKACAR
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR

Query:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
        IKKFHLGESVHACAVKLGFV+DVFVGNSLLVMYCAFGNM+DAGQ+FDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
Subjt:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC

Query:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
        FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALD+GIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTR+LFDEMPQRDTICWNVMISGMAMHG
Subjt:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG

Query:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
        DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKM TIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
Subjt:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL

Query:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS
        SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIK+NGVVTEFIAGDKTH+QI DIHLVLE L+KQIS
Subjt:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS

Query:  QPDLLYF
        QPD LYF
Subjt:  QPDLLYF

XP_023537919.1 pentatricopeptide repeat-containing protein ELI1, chloroplastic-like [Cucurbita pepo subsp. pepo]2.0e-24084.22Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
        MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKL++HIQHPTICICNTMIKAFLL                                   
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR

Query:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
                                               RDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
Subjt:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC

Query:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
        FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
Subjt:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG

Query:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
        DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
Subjt:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL

Query:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS
        SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIK+NGVVTEFIAGDKTH+QI DIHLVLEELNKQIS
Subjt:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS

Query:  QPDLLYF
         PDLLYF
Subjt:  QPDLLYF

XP_038890502.1 pentatricopeptide repeat-containing protein At2g20540-like [Benincasa hispida]2.6e-24883.53Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
        MKQLKQAHAQVLTSGL +++F LSRLL+FCA+P +GSLSHA KL++HIQHP+ICICN MIKA LLRGEFL+A+V++SAM RN I PD+YT PYVLKA AR
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR

Query:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
        +  FHLGESVHAC++KLGFVLD FVGNSLLVMYCAFGNMR+AGQVFDEMPE+S+VSWTVMIYGYAKMGDV+TARGLFD A  KDRGIWGAMISGYVQNNC
Subjt:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC

Query:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
        FKEGLHMFRLMQ TE+EPDEAIIVTIL ACAHMGALD GIWIHRYL  L LPLTLRVSTGL+DMYAKCGHL++ ++LFDEM QRDTICWNVMISGMAMHG
Subjt:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG

Query:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
        DGEGALKLF EMEKAR+KPDDITFIA+LAACSNSGM +EGL+I NKM TIHKIEPK+EHYGC+IDLLSR GRF+EAEGVIQRLP TA+ SEEAVAWRAFL
Subjt:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL

Query:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS
        SACCKHG  QQAE AAERLF+LERHSGAYVLLSNMYAALGKHG+AKRVR+MMKLKGVEKVPGCSSIK+NGVV EFIAG+KTH QI +IHLVLE L+KQI 
Subjt:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS

Query:  QPDL
        Q DL
Subjt:  QPDL

TrEMBL top hitse value%identityAlignment
A0A0A0KNK8 Uncharacterized protein3.8e-23780.36Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
        M QLKQAHAQVL SGL +++F LSRLL+FCAE R+GSLSHA KL++HIQHPTICI NTMIKA LLRGEFLNAI ++SA+ RN I PD+YT PYVLKA AR
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR

Query:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
        +   HLGES+HAC +KLG  ++ FVGNSLLVMY +F NMR A QVFDEMPELSAVSWTVMIYGYA MGDV+TAR LFD A +KD GIWGAMISGYVQNNC
Subjt:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC

Query:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
        FKEGLHMFRLMQLTEVEPDEAIIVTIL ACAHMGALD GIWIHRYL  L LPLTLRVSTGL+DMYAKCGHLD+ ++LF+EM QRD +CWN MISGMAM G
Subjt:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG

Query:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
        DGEGA+KLF EMEKA IKPD+ITFIA+LAACSNSGM +EG++I N+M T+HKIEPK+EHYGC+IDLLSR GRFEEAEGVIQRLP TA+ SEEAVAWRAFL
Subjt:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL

Query:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS
        SACCKHG+ QQAE AAERLF+LERHSGAYVLLSNMYAALGKH DAKRVR MMKLKGVEKVPGCSSIK+NGVV EFIAG+KTH  ID+IHLVLEELNKQI 
Subjt:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS

Query:  QPDL
        + DL
Subjt:  QPDL

A0A1S4E5C9 pentatricopeptide repeat-containing protein At2g20540-like2.7e-23579.96Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
        M  LKQAHAQVLTSGL +++F LS+LL+FCAE R+GSLSHA KL++HIQHPTICICNTMIKA LLRGEFLNAIV++SAM RN I PD+YT PYVLKA AR
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR

Query:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
        +   HLGES+HAC +KLGF +D FVGNSLLVMY +F  M  A QVFDEMPELSAVSWTVMIYGYA MGDV+TAR LFD A +KD GIWGAMISGYVQNNC
Subjt:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC

Query:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
        FKEGLHMFRLMQLTEVEPDEAIIVTIL ACAHMGALD GIWIHRYL  L L LTLRV TGL+DMYAKCGHLD+ ++LF+EM QRD +CWN MISGMAM G
Subjt:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG

Query:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
        DGEGA+KLF EMEKA IKPD+ITFIA+LAACSNSGM +EG++I N+M TIHKIEPK+EHYGC+IDLLSR GRFEEAE VI RLP TA  SEEAVAWRAFL
Subjt:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL

Query:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS
        SACCKHG+ QQAE AAERLF+LERHSGAYVLLSNMYAALGKH DAKRVR MMKLKGVEKVPGCSSIK+NGVV EFIAG+KTH  I +IHLVLEELNKQI 
Subjt:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS

Query:  QPDL
        + DL
Subjt:  QPDL

A0A5D3DJ94 Pentatricopeptide repeat-containing protein8.0e-23579.76Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
        M  LKQAHAQVLTSGL +++F LS+LL+FCAE R+GSLSHA KL++HIQHPTICICNTMIKA LLRGEFLNAIV++SAM RN I PD+YT PYVLKA AR
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR

Query:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
        +   HLGES+HAC +KLGF +D FVGNSLLVMY +F  M  A QVFDEMPELSAVSWTVMIYGYA MGDV+TAR LFD A +KD GIWGAMISGYVQNNC
Subjt:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC

Query:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
        FKEGLHMFRLMQLTEVEPDEAIIVTIL ACAHMGA D GIWIHRYL  L L LTLRV TGL+DMYAKCGHLD+ ++LF+EM QRD +CWN MISGMAM G
Subjt:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG

Query:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
        DGEGA+KLF EMEKA IKPD+ITFIA+LAACSNSGM +EG++I N+M TIHKIEPK+EHYGC+IDLLSR GRFEEAE VI RLP TA  SEEAVAWRAFL
Subjt:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL

Query:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS
        SACCKHG+ QQAE AAERLF+LERHSGAYVLLSNMYAALGKH DAKRVR MMKLKGVEKVPGCSSIK+NGVV EFIAG+KTH  I +IHLVLEELNKQI 
Subjt:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS

Query:  QPDL
        + DL
Subjt:  QPDL

A0A6J1GK82 pentatricopeptide repeat-containing protein At2g20540-like1.7e-298100Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
        MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR

Query:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
        IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
Subjt:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC

Query:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
        FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
Subjt:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG

Query:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
        DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
Subjt:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL

Query:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS
        SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS
Subjt:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS

Query:  QPDLLYF
        QPDLLYF
Subjt:  QPDLLYF

A0A6J1I746 pentatricopeptide repeat-containing protein At2g20540-like1.1e-28996.65Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
        MKQLKQAHAQVLTSGLGSNSFALSR+LDFCAEPRHGSLSHALKL++HIQHPTICICNTMIKAFLLRGEFLNAIV+YSAMRRN IAPDSYTHPYVLKACAR
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR

Query:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
        IKKFHLGESVHACAVKLGFV+DVFVGNSLLVMYCAFGNM+DAGQ+FDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
Subjt:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC

Query:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
        FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALD+GIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTR+LFDEMPQRDTICWNVMISGMAMHG
Subjt:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG

Query:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
        DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKM TIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
Subjt:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL

Query:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS
        SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIK+NGVVTEFIAGDKTH+QI DIHLVLE L+KQIS
Subjt:  SACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQIS

Query:  QPDLLYF
        QPD LYF
Subjt:  QPDLLYF

SwissProt top hitse value%identityAlignment
Q9FJY7 Pentatricopeptide repeat-containing protein At5g665205.4e-9538.65Show/hide
Query:  KQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGS-LSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
        ++LKQ HA++L +GL  +S+A+++ L FC        L +A  ++     P   + N MI+ F    E   ++++Y  M  +    ++YT P +LKAC+ 
Subjt:  KQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGS-LSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR

Query:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
        +  F     +HA   KLG+  DV+  NSL+  Y   GN + A  +FD +PE   VSW  +I GY K G ++ A  LF +   K+   W  MISGYVQ + 
Subjt:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC

Query:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
         KE L +F  MQ ++VEPD   +   L ACA +GAL+ G WIH YL+   + +   +   L+DMYAKCG ++    +F  + ++    W  +ISG A HG
Subjt:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG

Query:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
         G  A+  F EM+K  IKP+ ITF A+L ACS +G+  EG  I   M   + ++P  EHYGCI+DLL RAG  +EA+  IQ +P        AV W A L
Subjt:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL

Query:  SACCKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDI
         AC  H  ++  E   E L  ++  H G YV  +N++A   K   A   R++MK +GV KVPGCS+I L G   EF+AGD++H +I+ I
Subjt:  SACCKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDI

Q9FMA1 Pentatricopeptide repeat-containing protein At5g563102.5e-9235.64Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGE---FLNAIVIYSAMRRNEIAPDSYTHPYVLKA
        +K LKQ+H  ++ +GL  ++  +++ ++ C+   H  L +A  ++ H   P   + NTMI+A  L  E      AI +Y  +      PD++T P+VLK 
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGE---FLNAIVIYSAMRRNEIAPDSYTHPYVLKA

Query:  CARIKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAP--IKDRGIWGAMISGY
          R+     G  +H   V  GF   V V   L+ MY + G + DA ++FDEM       W  ++ GY K+G+++ AR L +  P  +++   W  +ISGY
Subjt:  CARIKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAP--IKDRGIWGAMISGY

Query:  VQNNCFKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISG
         ++    E + +F+ M +  VEPDE  ++ +L ACA +G+L++G  I  Y+ H  +   + ++  ++DMYAK G++     +F+ + +R+ + W  +I+G
Subjt:  VQNNCFKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISG

Query:  MAMHGDGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVA
        +A HG G  AL +F  M KA ++P+D+TFIAIL+ACS+ G  + G ++ N M + + I P  EHYGC+IDLL RAG+  EA+ VI+ +P  A A+     
Subjt:  MAMHGDGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVA

Query:  WRAFLSACCKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEE
        W + L+A   H +++  E A   L +LE  +SG Y+LL+N+Y+ LG+  +++ +R MMK  GV+K+ G SSI++   V +FI+GD TH Q++ IH +L+E
Subjt:  WRAFLSACCKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEE

Query:  LNKQI
        ++ QI
Subjt:  LNKQI

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic2.7e-10235.5Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFC-AEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACA
        ++ L+  HAQ++  GL + ++ALS+L++FC   P    L +A+ +++ IQ P + I NTM +   L  + ++A+ +Y  M    + P+SYT P+VLK+CA
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFC-AEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACA

Query:  RIKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNN
        + K F  G+ +H   +KLG  LD++V  SL+ MY   G + DA +VFD+ P    VS+T +I GYA  G +  A+ LFD  P+KD   W AMISGY +  
Subjt:  RIKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNN

Query:  CFKEGLHMFRLMQLTEVEPDEAIIVT--------------------------------------------------------------------------
         +KE L +F+ M  T V PDE+ +VT                                                                          
Subjt:  CFKEGLHMFRLMQLTEVEPDEAIIVT--------------------------------------------------------------------------

Query:  ---------------------------ILCACAHMGALDIGIWIHRYLSHLELPLTLRVS--TGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGM
                                   IL ACAH+GA+DIG WIH Y+      +T   S  T L+DMYAKCG ++    +F+ +  +    WN MI G 
Subjt:  ---------------------------ILCACAHMGALDIGIWIHRYLSHLELPLTLRVS--TGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGM

Query:  AMHGDGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAW
        AMHG  + +  LF  M K  I+PDDITF+ +L+ACS+SGM + G  I   M   +K+ PK EHYGC+IDLL  +G F+EAE +I    N      + V W
Subjt:  AMHGDGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAW

Query:  RAFLSACCKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEEL
         + L AC  HG ++  E  AE L ++E  + G+YVLLSN+YA+ G+  +  + R ++  KG++KVPGCSSI+++ VV EFI GDK H +  +I+ +LEE+
Subjt:  RAFLSACCKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEEL

Q9LSB8 Putative pentatricopeptide repeat-containing protein At3g159304.3e-9236.14Show/hide
Query:  KQAHAQVLTSGLGSNSF---ALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACARI
        K+ H  V+  GLGSN +   AL ++   C     G +  A  ++       +   N MI  +    E+  +I +   M RN ++P S T   VL AC+++
Subjt:  KQAHAQVLTSGLGSNSF---ALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACARI

Query:  KKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNCF
        K   L + VH    +      + + N+L+  Y A G M  A ++F  M     +SWT ++ GY + G++  AR  FD+ P++DR  W  MI GY++  CF
Subjt:  KKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNCF

Query:  KEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHGD
         E L +FR MQ   + PDE  +V++L ACAH+G+L+IG WI  Y+   ++   + V   L+DMY KCG  +  + +F +M QRD   W  M+ G+A +G 
Subjt:  KEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHGD

Query:  GEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFLS
        G+ A+K+F +M+   I+PDDIT++ +L+AC++SGM ++  K   KM + H+IEP   HYGC++D+L RAG  +EA  +++++P     +  ++ W A L 
Subjt:  GEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFLS

Query:  ACCKHGEMQQAEGAAERLFELERHSGA-YVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQ
        A   H +   AE AA+++ ELE  +GA Y LL N+YA   +  D + VR+ +    ++K PG S I++NG   EF+AGDK+H+Q ++I++ LEEL ++
Subjt:  ACCKHGEMQQAEGAAERLFELERHSGA-YVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQ

Q9SIL5 Pentatricopeptide repeat-containing protein At2g205404.2e-10037.68Show/hide
Query:  KQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEI-APDSYTHPYVLKACARIKK
        K+ +A ++  GL  +SF +++++DFC   +   + +A +L+  + +P + + N++I+A+     + + I IY  + R     PD +T P++ K+CA +  
Subjt:  KQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEI-APDSYTHPYVLKACARIKK

Query:  FHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNCFKE
         +LG+ VH    K G    V   N+L+ MY  F ++ DA +VFDEM E   +SW  ++ GYA++G +  A+GLF     K    W AMISGY    C+ E
Subjt:  FHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNCFKE

Query:  GLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHGDGE
         +  FR MQL  +EPDE  ++++L +CA +G+L++G WIH Y           V   L++MY+KCG +     LF +M  +D I W+ MISG A HG+  
Subjt:  GLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHGDGE

Query:  GALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFLSAC
        GA++ F EM++A++KP+ ITF+ +L+ACS+ GM  EGL+  + M   ++IEPK EHYGC+ID+L+RAG+ E A  + + +P       ++  W + LS+C
Subjt:  GALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFLSAC

Query:  CKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLE
           G +  A  A + L ELE    G YVLL+N+YA LGK  D  R+RKM++ + ++K PG S I++N +V EF++GD +     +I +VL+
Subjt:  CKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLE

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.9e-10335.5Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFC-AEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACA
        ++ L+  HAQ++  GL + ++ALS+L++FC   P    L +A+ +++ IQ P + I NTM +   L  + ++A+ +Y  M    + P+SYT P+VLK+CA
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFC-AEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACA

Query:  RIKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNN
        + K F  G+ +H   +KLG  LD++V  SL+ MY   G + DA +VFD+ P    VS+T +I GYA  G +  A+ LFD  P+KD   W AMISGY +  
Subjt:  RIKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNN

Query:  CFKEGLHMFRLMQLTEVEPDEAIIVT--------------------------------------------------------------------------
         +KE L +F+ M  T V PDE+ +VT                                                                          
Subjt:  CFKEGLHMFRLMQLTEVEPDEAIIVT--------------------------------------------------------------------------

Query:  ---------------------------ILCACAHMGALDIGIWIHRYLSHLELPLTLRVS--TGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGM
                                   IL ACAH+GA+DIG WIH Y+      +T   S  T L+DMYAKCG ++    +F+ +  +    WN MI G 
Subjt:  ---------------------------ILCACAHMGALDIGIWIHRYLSHLELPLTLRVS--TGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGM

Query:  AMHGDGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAW
        AMHG  + +  LF  M K  I+PDDITF+ +L+ACS+SGM + G  I   M   +K+ PK EHYGC+IDLL  +G F+EAE +I    N      + V W
Subjt:  AMHGDGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAW

Query:  RAFLSACCKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEEL
         + L AC  HG ++  E  AE L ++E  + G+YVLLSN+YA+ G+  +  + R ++  KG++KVPGCSSI+++ VV EFI GDK H +  +I+ +LEE+
Subjt:  RAFLSACCKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEEL

AT2G20540.1 mitochondrial editing factor 213.0e-10137.68Show/hide
Query:  KQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEI-APDSYTHPYVLKACARIKK
        K+ +A ++  GL  +SF +++++DFC   +   + +A +L+  + +P + + N++I+A+     + + I IY  + R     PD +T P++ K+CA +  
Subjt:  KQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEI-APDSYTHPYVLKACARIKK

Query:  FHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNCFKE
         +LG+ VH    K G    V   N+L+ MY  F ++ DA +VFDEM E   +SW  ++ GYA++G +  A+GLF     K    W AMISGY    C+ E
Subjt:  FHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNCFKE

Query:  GLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHGDGE
         +  FR MQL  +EPDE  ++++L +CA +G+L++G WIH Y           V   L++MY+KCG +     LF +M  +D I W+ MISG A HG+  
Subjt:  GLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHGDGE

Query:  GALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFLSAC
        GA++ F EM++A++KP+ ITF+ +L+ACS+ GM  EGL+  + M   ++IEPK EHYGC+ID+L+RAG+ E A  + + +P       ++  W + LS+C
Subjt:  GALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFLSAC

Query:  CKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLE
           G +  A  A + L ELE    G YVLL+N+YA LGK  D  R+RKM++ + ++K PG S I++N +V EF++GD +     +I +VL+
Subjt:  CKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLE

AT3G15930.1 Pentatricopeptide repeat (PPR) superfamily protein3.0e-9336.14Show/hide
Query:  KQAHAQVLTSGLGSNSF---ALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACARI
        K+ H  V+  GLGSN +   AL ++   C     G +  A  ++       +   N MI  +    E+  +I +   M RN ++P S T   VL AC+++
Subjt:  KQAHAQVLTSGLGSNSF---ALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACARI

Query:  KKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNCF
        K   L + VH    +      + + N+L+  Y A G M  A ++F  M     +SWT ++ GY + G++  AR  FD+ P++DR  W  MI GY++  CF
Subjt:  KKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNCF

Query:  KEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHGD
         E L +FR MQ   + PDE  +V++L ACAH+G+L+IG WI  Y+   ++   + V   L+DMY KCG  +  + +F +M QRD   W  M+ G+A +G 
Subjt:  KEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHGD

Query:  GEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFLS
        G+ A+K+F +M+   I+PDDIT++ +L+AC++SGM ++  K   KM + H+IEP   HYGC++D+L RAG  +EA  +++++P     +  ++ W A L 
Subjt:  GEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFLS

Query:  ACCKHGEMQQAEGAAERLFELERHSGA-YVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQ
        A   H +   AE AA+++ ELE  +GA Y LL N+YA   +  D + VR+ +    ++K PG S I++NG   EF+AGDK+H+Q ++I++ LEEL ++
Subjt:  ACCKHGEMQQAEGAAERLFELERHSGA-YVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQ

AT5G56310.1 Pentatricopeptide repeat (PPR) superfamily protein1.8e-9335.64Show/hide
Query:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGE---FLNAIVIYSAMRRNEIAPDSYTHPYVLKA
        +K LKQ+H  ++ +GL  ++  +++ ++ C+   H  L +A  ++ H   P   + NTMI+A  L  E      AI +Y  +      PD++T P+VLK 
Subjt:  MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGE---FLNAIVIYSAMRRNEIAPDSYTHPYVLKA

Query:  CARIKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAP--IKDRGIWGAMISGY
          R+     G  +H   V  GF   V V   L+ MY + G + DA ++FDEM       W  ++ GY K+G+++ AR L +  P  +++   W  +ISGY
Subjt:  CARIKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAP--IKDRGIWGAMISGY

Query:  VQNNCFKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISG
         ++    E + +F+ M +  VEPDE  ++ +L ACA +G+L++G  I  Y+ H  +   + ++  ++DMYAK G++     +F+ + +R+ + W  +I+G
Subjt:  VQNNCFKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISG

Query:  MAMHGDGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVA
        +A HG G  AL +F  M KA ++P+D+TFIAIL+ACS+ G  + G ++ N M + + I P  EHYGC+IDLL RAG+  EA+ VI+ +P  A A+     
Subjt:  MAMHGDGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVA

Query:  WRAFLSACCKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEE
        W + L+A   H +++  E A   L +LE  +SG Y+LL+N+Y+ LG+  +++ +R MMK  GV+K+ G SSI++   V +FI+GD TH Q++ IH +L+E
Subjt:  WRAFLSACCKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEE

Query:  LNKQI
        ++ QI
Subjt:  LNKQI

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.8e-9638.65Show/hide
Query:  KQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGS-LSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR
        ++LKQ HA++L +GL  +S+A+++ L FC        L +A  ++     P   + N MI+ F    E   ++++Y  M  +    ++YT P +LKAC+ 
Subjt:  KQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGS-LSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACAR

Query:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC
        +  F     +HA   KLG+  DV+  NSL+  Y   GN + A  +FD +PE   VSW  +I GY K G ++ A  LF +   K+   W  MISGYVQ + 
Subjt:  IKKFHLGESVHACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNC

Query:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG
         KE L +F  MQ ++VEPD   +   L ACA +GAL+ G WIH YL+   + +   +   L+DMYAKCG ++    +F  + ++    W  +ISG A HG
Subjt:  FKEGLHMFRLMQLTEVEPDEAIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHG

Query:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL
         G  A+  F EM+K  IKP+ ITF A+L ACS +G+  EG  I   M   + ++P  EHYGCI+DLL RAG  +EA+  IQ +P        AV W A L
Subjt:  DGEGALKLFKEMEKARIKPDDITFIAILAACSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFL

Query:  SACCKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDI
         AC  H  ++  E   E L  ++  H G YV  +N++A   K   A   R++MK +GV KVPGCS+I L G   EF+AGD++H +I+ I
Subjt:  SACCKHGEMQQAEGAAERLFELE-RHSGAYVLLSNMYAALGKHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGCAGCTAAAGCAAGCTCATGCGCAGGTTCTTACATCTGGACTCGGCAGCAACAGCTTCGCGTTAAGCAGATTGTTGGATTTTTGTGCAGAACCACGCCATGGCAG
CCTTTCTCATGCATTGAAACTCTACCGACACATTCAACACCCAACCATTTGCATCTGCAACACCATGATCAAAGCCTTTCTTCTACGAGGTGAATTTCTCAACGCCATTG
TTATATACTCTGCAATGCGAAGAAATGAGATCGCCCCTGATAGTTATACTCATCCTTACGTTCTAAAAGCTTGCGCGAGGATTAAAAAATTTCATCTCGGAGAGTCGGTT
CATGCTTGTGCTGTGAAATTGGGTTTTGTATTGGATGTATTTGTGGGTAATTCTTTGCTTGTTATGTATTGCGCGTTCGGTAACATGAGAGATGCAGGACAGGTGTTCGA
CGAAATGCCTGAGCTAAGTGCTGTGTCGTGGACGGTTATGATTTACGGGTATGCAAAGATGGGTGATGTGAACACCGCAAGAGGATTGTTTGACCGAGCTCCAATCAAAG
ATAGGGGAATATGGGGCGCCATGATATCTGGGTATGTGCAAAACAATTGCTTCAAGGAGGGGCTGCACATGTTTCGTCTGATGCAACTGACTGAAGTAGAGCCTGATGAA
GCCATAATTGTGACAATCCTTTGTGCTTGCGCTCACATGGGAGCTCTTGACATTGGGATTTGGATTCACAGGTATTTGAGTCATCTCGAATTGCCATTAACTCTGCGAGT
GAGCACTGGGTTGATGGATATGTATGCAAAATGTGGGCATCTAGACATGACCAGGCACCTGTTCGATGAAATGCCGCAGAGAGATACCATTTGTTGGAACGTAATGATTT
CAGGAATGGCAATGCATGGAGATGGAGAAGGCGCACTGAAGCTCTTCAAGGAGATGGAGAAGGCAAGAATCAAGCCGGATGACATAACATTCATAGCCATTTTGGCAGCT
TGTAGCAACTCAGGCATGGCCAATGAAGGCTTGAAGATACTGAACAAAATGTGTACAATACACAAAATTGAGCCAAAAGCTGAACACTACGGGTGTATAATCGACCTGCT
GAGTCGAGCTGGGCGGTTCGAGGAAGCGGAGGGAGTGATACAGAGACTTCCAAACACAGCCACTGCTTCAGAGGAAGCAGTGGCTTGGAGGGCTTTTCTGAGTGCTTGCT
GTAAGCATGGGGAAATGCAGCAGGCTGAGGGTGCTGCTGAGAGGCTGTTTGAATTGGAAAGACATAGTGGGGCTTACGTTTTACTGTCGAATATGTATGCTGCTCTGGGG
AAGCATGGGGATGCTAAAAGAGTGAGGAAGATGATGAAGTTGAAAGGGGTTGAGAAGGTACCTGGTTGCAGCTCCATTAAACTCAATGGAGTTGTCACTGAGTTCATTGC
AGGGGATAAAACCCACATCCAAATTGATGATATTCACTTGGTTTTGGAAGAGTTGAATAAACAAATTTCTCAGCCTGATCTTTTATATTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGCAGCTAAAGCAAGCTCATGCGCAGGTTCTTACATCTGGACTCGGCAGCAACAGCTTCGCGTTAAGCAGATTGTTGGATTTTTGTGCAGAACCACGCCATGGCAG
CCTTTCTCATGCATTGAAACTCTACCGACACATTCAACACCCAACCATTTGCATCTGCAACACCATGATCAAAGCCTTTCTTCTACGAGGTGAATTTCTCAACGCCATTG
TTATATACTCTGCAATGCGAAGAAATGAGATCGCCCCTGATAGTTATACTCATCCTTACGTTCTAAAAGCTTGCGCGAGGATTAAAAAATTTCATCTCGGAGAGTCGGTT
CATGCTTGTGCTGTGAAATTGGGTTTTGTATTGGATGTATTTGTGGGTAATTCTTTGCTTGTTATGTATTGCGCGTTCGGTAACATGAGAGATGCAGGACAGGTGTTCGA
CGAAATGCCTGAGCTAAGTGCTGTGTCGTGGACGGTTATGATTTACGGGTATGCAAAGATGGGTGATGTGAACACCGCAAGAGGATTGTTTGACCGAGCTCCAATCAAAG
ATAGGGGAATATGGGGCGCCATGATATCTGGGTATGTGCAAAACAATTGCTTCAAGGAGGGGCTGCACATGTTTCGTCTGATGCAACTGACTGAAGTAGAGCCTGATGAA
GCCATAATTGTGACAATCCTTTGTGCTTGCGCTCACATGGGAGCTCTTGACATTGGGATTTGGATTCACAGGTATTTGAGTCATCTCGAATTGCCATTAACTCTGCGAGT
GAGCACTGGGTTGATGGATATGTATGCAAAATGTGGGCATCTAGACATGACCAGGCACCTGTTCGATGAAATGCCGCAGAGAGATACCATTTGTTGGAACGTAATGATTT
CAGGAATGGCAATGCATGGAGATGGAGAAGGCGCACTGAAGCTCTTCAAGGAGATGGAGAAGGCAAGAATCAAGCCGGATGACATAACATTCATAGCCATTTTGGCAGCT
TGTAGCAACTCAGGCATGGCCAATGAAGGCTTGAAGATACTGAACAAAATGTGTACAATACACAAAATTGAGCCAAAAGCTGAACACTACGGGTGTATAATCGACCTGCT
GAGTCGAGCTGGGCGGTTCGAGGAAGCGGAGGGAGTGATACAGAGACTTCCAAACACAGCCACTGCTTCAGAGGAAGCAGTGGCTTGGAGGGCTTTTCTGAGTGCTTGCT
GTAAGCATGGGGAAATGCAGCAGGCTGAGGGTGCTGCTGAGAGGCTGTTTGAATTGGAAAGACATAGTGGGGCTTACGTTTTACTGTCGAATATGTATGCTGCTCTGGGG
AAGCATGGGGATGCTAAAAGAGTGAGGAAGATGATGAAGTTGAAAGGGGTTGAGAAGGTACCTGGTTGCAGCTCCATTAAACTCAATGGAGTTGTCACTGAGTTCATTGC
AGGGGATAAAACCCACATCCAAATTGATGATATTCACTTGGTTTTGGAAGAGTTGAATAAACAAATTTCTCAGCCTGATCTTTTATATTTCTAA
Protein sequenceShow/hide protein sequence
MKQLKQAHAQVLTSGLGSNSFALSRLLDFCAEPRHGSLSHALKLYRHIQHPTICICNTMIKAFLLRGEFLNAIVIYSAMRRNEIAPDSYTHPYVLKACARIKKFHLGESV
HACAVKLGFVLDVFVGNSLLVMYCAFGNMRDAGQVFDEMPELSAVSWTVMIYGYAKMGDVNTARGLFDRAPIKDRGIWGAMISGYVQNNCFKEGLHMFRLMQLTEVEPDE
AIIVTILCACAHMGALDIGIWIHRYLSHLELPLTLRVSTGLMDMYAKCGHLDMTRHLFDEMPQRDTICWNVMISGMAMHGDGEGALKLFKEMEKARIKPDDITFIAILAA
CSNSGMANEGLKILNKMCTIHKIEPKAEHYGCIIDLLSRAGRFEEAEGVIQRLPNTATASEEAVAWRAFLSACCKHGEMQQAEGAAERLFELERHSGAYVLLSNMYAALG
KHGDAKRVRKMMKLKGVEKVPGCSSIKLNGVVTEFIAGDKTHIQIDDIHLVLEELNKQISQPDLLYF