CuGenDBv2

Carg12440 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg12440
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Pentatricopeptide repeat-containing protein
Genome location: Carg_Chr14:73289..78946
RNA-Seq Expression: Carg12440
Synteny: Carg12440
Gene Ontology terms:
GO:0006468 - protein phosphorylation (biological process)
GO:0016020 - membrane (cellular component)
GO:0004674 - protein serine/threonine kinase activity (molecular function)
GO:0005515 - protein binding (molecular function)
GO:0005524 - ATP binding (molecular function)
InterPro domains:
IPR000719 - Protein kinase domain
IPR001245 - Serine-threonine/tyrosine-protein kinase, catalytic domain
IPR002885 - Pentatricopeptide repeat
IPR008271 - Serine/threonine-protein kinase, active site
IPR011009 - Protein kinase-like domain superfamily
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR017441 - Protein kinase, ATP binding site
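
The genome location in this record follows a `seqid:start..end` pattern. As a minimal sketch, the string could be split into its parts and the span length computed as below. The coordinates are assumed to be 1-based and inclusive, which this page does not state, and `GenomeLocation` and `parse_location` are hypothetical names used only for illustration, not part of CuGenDB.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
        return self.end - self.start + 1


# seqid is everything before the colon; start and end are separated by '..'.
_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'Carg_Chr14:73289..78946'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    # Normalise the order so that start <= end.
    if start > end:
        start, end = end, start
    return GenomeLocation(match["seqid"], start, end)


loc = parse_location("Carg_Chr14:73289..78946")
print(loc.seqid, loc.start, loc.end, loc.length)  # Carg_Chr14 73289 78946 5658
```

Under the inclusive-coordinate assumption, the gene spans 5658 bp on Carg_Chr14.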


Homology
GenBank top hits | e value | % identity | Alignment
KAG6580431.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | 4.4e-269 | 88.53
Query:  MVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR
        MVLKGALSYILPKLLRLSSMKELEL HAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR
Subjt:  MVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR

Query:  MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV
        MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV
Subjt:  MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV

Query:  RDYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSL
        RDYKSADSLLQLMPQTN                                                            SGIRATEVTFISILGACAETGSL
Subjt:  RDYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSL

Query:  EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHK
        EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHK
Subjt:  EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHK

Query:  GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA
        GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA
Subjt:  GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA

Query:  EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ
        EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ
Subjt:  EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ

KAG7017188.1 Pentatricopeptide repeat-containing protein [Cucurbita argyrosperma subsp. argyrosperma] | 0.0e+00 | 100
Query:  MNCNEENRGGDDRETEAPVLRNSVEAEGKPVIQIGSITDQHLTIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRF
        MNCNEENRGGDDRETEAPVLRNSVEAEGKPVIQIGSITDQHLTIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRF
Subjt:  MNCNEENRGGDDRETEAPVLRNSVEAEGKPVIQIGSITDQHLTIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRF

Query:  AREVNMMSRVKHENLVKFIGACKEPLMVIVTELLPGMSLRKYLMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFG
        AREVNMMSRVKHENLVKFIGACKEPLMVIVTELLPGMSLRKYLMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFG
Subjt:  AREVNMMSRVKHENLVKFIGACKEPLMVIVTELLPGMSLRKYLMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFG

Query:  LAREESVTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWV
        LAREESVTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWV
Subjt:  LAREESVTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWV

Query:  EDSKMRPSFSQIIRMLNAYLFTLPPPSSPSSPSSPKSDTTETPTSSNGRWNMVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFS
        EDSKMRPSFSQIIRMLNAYLFTLPPPSSPSSPSSPKSDTTETPTSSNGRWNMVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFS
Subjt:  EDSKMRPSFSQIIRMLNAYLFTLPPPSSPSSPSSPKSDTTETPTSSNGRWNMVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFS

Query:  SLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRMDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLK
        SLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRMDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLK
Subjt:  SLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRMDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLK

Query:  LGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVRDYKSADSLLQLMPQTNSGIRATEVTFISILGACAETGSLEMGKKIHES
        LGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVRDYKSADSLLQLMPQTNSGIRATEVTFISILGACAETGSLEMGKKIHES
Subjt:  LGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVRDYKSADSLLQLMPQTNSGIRATEVTFISILGACAETGSLEMGKKIHES

Query:  LKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHF
        LKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHF
Subjt:  LKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHF

Query:  LSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAEEERWDDVG
        LSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAEEERWDDVG
Subjt:  LSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAEEERWDDVG

Query:  RLRNEMIEYGVCKKAGSSHVKIQ
        RLRNEMIEYGVCKKAGSSHVKIQ
Subjt:  RLRNEMIEYGVCKKAGSSHVKIQ

TQD73287.1 hypothetical protein C1H46_041180 [Malus baccata] | 4.8e-308 | 60.34
Query:  MNCNEENRGGDDRETEAPVLRNSVEAEGKPVIQIGSITDQHLTIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRF
        M+C+  + G ++RE E  VLR SV+ E     Q G I  Q LTID+NLLVDPKLLFIG+KIGEGAHGKVYEGRY ++IVA+KVLHRGST EERAALE+RF
Subjt:  MNCNEENRGGDDRETEAPVLRNSVEAEGKPVIQIGSITDQHLTIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRF

Query:  AREVNMMSRVKHENLVKFIGACKEPLMVIVTELLPGMSLRKYLMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFG
        AREVNMMSRVKH+NLVKFIGACK+PLMVIVTELLPGMSLRKYLM+ R   L+  +AI F+LDIA AM+CLHANGIIHRDLKPDNLLLTANQ+ VKLADFG
Subjt:  AREVNMMSRVKHENLVKFIGACKEPLMVIVTELLPGMSLRKYLMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFG

Query:  LAREESVTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWV
        LAREE+VTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERP +P DI PDLAFI+QSCWV
Subjt:  LAREESVTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWV

Query:  EDSKMRPSFSQIIRMLNAYLFTLPPPSSPSSPSSPKSDTTETPTSSN-------------------------------------------GRWNMV-LKG
        ED  +RP+FSQIIRMLN++LF L PP    SP  P +DT E   S+                                              W M+ +  
Subjt:  EDSKMRPSFSQIIRMLNAYLFTLPPPSSPSSPSSPKSDTTETPTSSN-------------------------------------------GRWNMV-LKG

Query:  ALSYILPKLLRL---SSMKELELTHAFIVKAGL-CNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRM
        A+S  L KLL+L   SSM ++E   AF+ KAGL  +H P++ KL+AF+SLSP G L HAHALF++ ++DD F+CNTMIRAYSNSVFP++A+ IYN MQ M
Subjt:  ALSYILPKLLRL---SSMKELELTHAFIVKAGL-CNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRM

Query:  DVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVR
         V SDHFT+NF LKACAR +K  E   +  G   + RKG EIH RVLKLG D+D +VQNSLL +Y  CG V  AR +F+EMT RS  SWNIM+SAY+R+ 
Subjt:  DVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVR

Query:  DYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSLE
        D+++ADSL + MP+ N                                                            + + ATEVT ISILGACAETG+LE
Subjt:  DYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSLE

Query:  MGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKG
        +G+KIHESLK  H++I GYLG A+VDMY+KCG+++ A EVF+E++MKPV CWNAMI+GLAVHGYC+ AL +F +ME  + + +PNR+TF+ +LIACSHKG
Subjt:  MGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKG

Query:  LVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAE
        LV EGR + + MI +Y IMPD KHYGCMVDLLSR G L+EAYEMIK  P  S  ++WRTLLG CRVH ++EL E++F +L +LE  +D DYVLLSN+YAE
Subjt:  LVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAE

Query:  EERWDDVGRLRNEMIEYGVCKKAGSSHV
         ERWDDV RLRNEMI  GV K  G SHV
Subjt:  EERWDDVGRLRNEMIEYGVCKKAGSSHV

XP_022935119.1 pentatricopeptide repeat-containing protein At5g15300-like [Cucurbita moschata] | 2.8e-268 | 88.16
Query:  MVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR
        MVLKGALSYILPKLLRLSSMKELEL HAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR
Subjt:  MVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR

Query:  MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV
        MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV
Subjt:  MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV

Query:  RDYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSL
        RDYKSADSLLQLMPQTN                                                            SGIRATEVTFISILGACAETGSL
Subjt:  RDYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSL

Query:  EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHK
        EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSME GNDDHKPNRVTFVAILIACSHK
Subjt:  EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHK

Query:  GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA
        GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRH+ELGEEAFCKLGELEARKDGDYVLLSNIYA
Subjt:  GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA

Query:  EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ
        EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ
Subjt:  EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ

XP_023526674.1 pentatricopeptide repeat-containing protein At5g15300-like [Cucurbita pepo subsp. pepo] | 7.0e-267 | 87.59
Query:  MVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR
        MVLKGALSYILPKLLRLSSMKELEL HAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR
Subjt:  MVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR

Query:  MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV
        MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIH+RVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV
Subjt:  MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV

Query:  RDYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSL
        RDYKSADSLLQLMPQ N                                                            SGIRATEVTFISILGACAETGSL
Subjt:  RDYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSL

Query:  EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHK
        EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEM+PVSCWNAMIMGLAVHGYCERAL+MFDSMEAGNDDHKPNRVTFVAILIACSHK
Subjt:  EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHK

Query:  GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA
        GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRH+ELGEEAFCKLGELEARKDGDYVLLSNIYA
Subjt:  GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA

Query:  EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ
        EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ
Subjt:  EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ
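
The middle line of each three-line alignment block appears to follow the usual BLAST convention: a residue letter marks an identical position, `+` marks a conservative substitution, and a space marks a mismatch or gap column. As a rough sketch under that assumption, the identity within a single block could be counted as below. `block_identity` is a hypothetical helper, and the figure it returns for one block need not match the overall % identity reported for a hit, which may be computed over a different length.

```python
from typing import Tuple


def block_identity(query: str, midline: str) -> Tuple[int, int, float]:
    """Return (identical, columns, percent identity) for one alignment block."""
    columns = len(query)
    # Trailing spaces of the midline are often lost in scraped text, so pad it.
    padded = midline.ljust(columns)[:columns]
    # Letters mark identical residues; '+' and ' ' are not counted as identical.
    identical = sum(1 for ch in padded if ch.isalpha())
    percent = round(100.0 * identical / columns, 2) if columns else 0.0
    return identical, columns, percent


# Synthetic four-column block (not taken from the page): three identities, one '+'.
print(block_identity("MKLV", "MK+V"))  # (3, 4, 75.0)
```

Gap characters (`-`) in the query line count as alignment columns here, so long gapped stretches lower the per-block figure.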

TrEMBL top hits | e value | % identity | Alignment
A0A540KGD7 Protein kinase domain-containing protein | 2.3e-308 | 60.34
Query:  MNCNEENRGGDDRETEAPVLRNSVEAEGKPVIQIGSITDQHLTIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRF
        M+C+  + G ++RE E  VLR SV+ E     Q G I  Q LTID+NLLVDPKLLFIG+KIGEGAHGKVYEGRY ++IVA+KVLHRGST EERAALE+RF
Subjt:  MNCNEENRGGDDRETEAPVLRNSVEAEGKPVIQIGSITDQHLTIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRF

Query:  AREVNMMSRVKHENLVKFIGACKEPLMVIVTELLPGMSLRKYLMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFG
        AREVNMMSRVKH+NLVKFIGACK+PLMVIVTELLPGMSLRKYLM+ R   L+  +AI F+LDIA AM+CLHANGIIHRDLKPDNLLLTANQ+ VKLADFG
Subjt:  AREVNMMSRVKHENLVKFIGACKEPLMVIVTELLPGMSLRKYLMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFG

Query:  LAREESVTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWV
        LAREE+VTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERP +P DI PDLAFI+QSCWV
Subjt:  LAREESVTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWV

Query:  EDSKMRPSFSQIIRMLNAYLFTLPPPSSPSSPSSPKSDTTETPTSSN-------------------------------------------GRWNMV-LKG
        ED  +RP+FSQIIRMLN++LF L PP    SP  P +DT E   S+                                              W M+ +  
Subjt:  EDSKMRPSFSQIIRMLNAYLFTLPPPSSPSSPSSPKSDTTETPTSSN-------------------------------------------GRWNMV-LKG

Query:  ALSYILPKLLRL---SSMKELELTHAFIVKAGL-CNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRM
        A+S  L KLL+L   SSM ++E   AF+ KAGL  +H P++ KL+AF+SLSP G L HAHALF++ ++DD F+CNTMIRAYSNSVFP++A+ IYN MQ M
Subjt:  ALSYILPKLLRL---SSMKELELTHAFIVKAGL-CNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRM

Query:  DVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVR
         V SDHFT+NF LKACAR +K  E   +  G   + RKG EIH RVLKLG D+D +VQNSLL +Y  CG V  AR +F+EMT RS  SWNIM+SAY+R+ 
Subjt:  DVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVR

Query:  DYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSLE
        D+++ADSL + MP+ N                                                            + + ATEVT ISILGACAETG+LE
Subjt:  DYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSLE

Query:  MGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKG
        +G+KIHESLK  H++I GYLG A+VDMY+KCG+++ A EVF+E++MKPV CWNAMI+GLAVHGYC+ AL +F +ME  + + +PNR+TF+ +LIACSHKG
Subjt:  MGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKG

Query:  LVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAE
        LV EGR + + MI +Y IMPD KHYGCMVDLLSR G L+EAYEMIK  P  S  ++WRTLLG CRVH ++EL E++F +L +LE  +D DYVLLSN+YAE
Subjt:  LVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAE

Query:  EERWDDVGRLRNEMIEYGVCKKAGSSHV
         ERWDDV RLRNEMI  GV K  G SHV
Subjt:  EERWDDVGRLRNEMIEYGVCKKAGSSHV

A0A5A7TL32 Pentatricopeptide repeat-containing protein | 5.1e-223 | 74.81
Query:  ILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRMDVHSDHFT
        ++PKL RLSS+KELE   AFIVKAG  NHIP+MTKLIAFSSLSPSGSLP A+ALFQ+ SMDDSFICNTMIRAYSNSVFP+KALLIYN MQRMDV SDHFT
Subjt:  ILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRMDVHSDHFT

Query:  YNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVRDYKSADSL
        YNFVLKACA AIKCTE DDQCFGHDIISRKGAEIH+R+LKLG DQDHHVQNSLLL+YSG GLV FAR++F EMTVR+AVSWNIMMSAYNRV DYKSAD L
Subjt:  YNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVRDYKSADSL

Query:  LQLMPQTNS------------------------------------------------------------GIRATEVTFISILGACAETGSLEMGKKIHES
        L+ MPQTN+                                                             IRATEVTFISILGACAE G+LE GKKIHES
Subjt:  LQLMPQTNS------------------------------------------------------------GIRATEVTFISILGACAETGSLEMGKKIHES

Query:  LKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHF
        LK +HY+IEGYLGNAIVDMYAKCGEL+LALEVFNEMEMKPVSCWNAMIMGLAVHGYCE+AL+MFDSM+A + DHKPNRVTF+A+LIACSHKGLVAEGRHF
Subjt:  LKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHF

Query:  LSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAEEERWDDVG
         SLM+ KYKIMPDLKHYGCM+DLLSRWGFL+EAY +IK CPFSSC+V+WRTLLGGCRVHR +ELGEE+F +L ELE  KDGDYVLLSNIYAEEERWDDV 
Subjt:  LSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAEEERWDDVG

Query:  RLRNEMIEYGVCKKAGSSHV
        RLR EMI YGVCKKAGSSHV
Subjt:  RLRNEMIEYGVCKKAGSSHV

A0A6J1DSP5 pentatricopeptide repeat-containing protein At5g15300-like | 5.5e-225 | 74.06
Query:  MVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR
        M+LKG ++ I+ KL RLSSM+ELE  HAFIVKAGLCNHIP+M KLIAFSSLSPSGSL HAHALFQ+ SMDDSFICNTMIRAYS SVFP+K+LLIYNHMQR
Subjt:  MVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR

Query:  MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV
        MDVHSDHFTYNFVLKACARAIKCTEKDD+CFG ++ISRKGAEIH+RV KLGLDQDHHVQNSL+LMYS CGLVV AR++F+E+TVRSAVSWNIMMSAY+RV
Subjt:  MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV

Query:  RDYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSL
         DYKSA  LL LMPQ N                                                            S I ATEVTFISILGACAETG+L
Subjt:  RDYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSL

Query:  EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAG-NDDHKPNRVTFVAILIACSH
        E GKKIHESLK +H+RIEGYLGNAIVDMYAKCGEL LALEVFNE+EMKPVSCWNAMIMGLAVHG+CERAL+MFDSMEA  +DD KPNRVTF+A+LIACSH
Subjt:  EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAG-NDDHKPNRVTFVAILIACSH

Query:  KGLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIY
        +GLVAEGRH+ SLM+ KYKI+PDLKHYGCM+DLLSRWGFL+EAYEMIK CPFSSC V+WRTLLGGCR+H  +EL EE+F KLGELEARKDGDYVLLSN+Y
Subjt:  KGLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIY

Query:  AEEERWDDVGRLRNEMIEYGVCKKAGSSHVKI
        AEEERWDDV RLRN MI+YGVCKKAG SHVKI
Subjt:  AEEERWDDVGRLRNEMIEYGVCKKAGSSHVKI

A0A6J1F4H4 pentatricopeptide repeat-containing protein At5g15300-like | 1.4e-268 | 88.16
Query:  MVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR
        MVLKGALSYILPKLLRLSSMKELEL HAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR
Subjt:  MVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR

Query:  MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV
        MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV
Subjt:  MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV

Query:  RDYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSL
        RDYKSADSLLQLMPQTN                                                            SGIRATEVTFISILGACAETGSL
Subjt:  RDYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSL

Query:  EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHK
        EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSME GNDDHKPNRVTFVAILIACSHK
Subjt:  EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHK

Query:  GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA
        GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRH+ELGEEAFCKLGELEARKDGDYVLLSNIYA
Subjt:  GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA

Query:  EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ
        EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ
Subjt:  EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ

A0A6J1IX07 pentatricopeptide repeat-containing protein At5g15300-like | 2.6e-259 | 84.96
Query:  MVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR
        MVLKG LSYILPKLLRLSSMKELEL HAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQD SMDDSFICNTMIRAYSN+VFPLKALL+YNHMQR
Subjt:  MVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQR

Query:  MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV
        MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIH+RVLKLGL QDHHVQNSLLLMYSGCGLVVFAR LFEEMTVRSAVSWNIMMSAYNRV
Subjt:  MDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRV

Query:  RDYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSL
         DYKSAD+LLQLMPQTN                                                            SGIR+TEVTFISILGACAE GSL
Subjt:  RDYKSADSLLQLMPQTN------------------------------------------------------------SGIRATEVTFISILGACAETGSL

Query:  EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHK
        EMGKKIH SLKAEHYRIEGYLGNA+VDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHG+CERAL+MFDSMEAGNDDHKPNRVTFVAILIACSHK
Subjt:  EMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHK

Query:  GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA
        GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIK CPFSSCAVVWRTLLGGCRVHRH+ELGEEAFCKLGELEARKDGDYVLLSNIYA
Subjt:  GLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA

Query:  EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ
        +EERWDDVGRLRNEMIE GVCKKAGSSHVKIQ
Subjt:  EEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ

SwissProt top hits | e value | % identity | Alignment
Q8LK93 Pentatricopeptide repeat-containing protein At2g02980, chloroplastic | 5.3e-76 | 35.67
Query:  RLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPS-GSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRMDVHSDHFTYNFVL
        + +S++EL    A+ +K+ +   +  + KLI F + SP+  S+ +A  LF+ +S  D  I N+M R YS    PL+   ++  +    +  D++T+  +L
Subjt:  RLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPS-GSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRMDVHSDHFTYNFVL

Query:  KACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVRDYKSADSLLQLMP
        KACA A K  E             +G ++H   +KLGLD + +V  +L+ MY+ C  V  AR +F+ +     V +N M++ Y R      A SL + M 
Subjt:  KACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVRDYKSADSLLQLMP

Query:  QTNSGIRATEVTFISILGACAETGSLEMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFD
             ++  E+T +S+L +CA  GSL++GK IH+  K   +     +  A++DM+AKCG L  A+ +F +M  K    W+AMI+  A HG  E+++ MF+
Subjt:  QTNSGIRATEVTFISILGACAETGSLEMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFD

Query:  SMEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELG
         M + N   +P+ +TF+ +L ACSH G V EGR + S M++K+ I+P +KHYG MVDLLSR G L++AYE I   P S   ++WR LL  C  H +L+L 
Subjt:  SMEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELG

Query:  EEAFCKLGELEARKDGDYVLLSNIYAEEERWDDVGRLRNEMIEYGVCKKAGSSHVKI
        E+   ++ EL+    GDYV+LSN+YA  ++W+ V  LR  M +    K  G S +++
Subjt:  EEAFCKLGELEARKDGDYVLLSNIYAEEERWDDVGRLRNEMIEYGVCKKAGSSHVKI

Q9FMA1 Pentatricopeptide repeat-containing protein At5g56310 | 6.9e-76 | 35.18
Query:  LKGALSYILPKL-LRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLK---ALLIYNHM
        L   L++ +  L +  +++K L+ +H +++  GL      + K I   + S +G L +A+++F      ++++ NTMIRA S    P     A+ +Y  +
Subjt:  LKGALSYILPKL-LRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLK---ALLIYNHM

Query:  QRMDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYN
          +    D FT+ FVLK   R        D  FG         +IH +V+  G D   HV   L+ MY  CG +  AR +F+EM V+    WN +++ Y 
Subjt:  QRMDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYN

Query:  RVRDYKSADSLLQLMP---------------QTNSG----------------IRATEVTFISILGACAETGSLEMGKKIHESLKAEHYRIEGYLGNAIVD
        +V +   A SLL++MP                  SG                +   EVT +++L ACA+ GSLE+G++I   +          L NA++D
Subjt:  RVRDYKSADSLLQLMP---------------QTNSG----------------IRATEVTFISILGACAETGSLEMGKKIHESLKAEHYRIEGYLGNAIVD

Query:  MYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSM-EAGNDDHKPNRVTFVAILIACSHKGLVAEGRHFLSLMINKYKIMPDLKHY
        MYAK G ++ AL+VF  +  + V  W  +I GLA HG+   AL MF+ M +AG    +PN VTF+AIL ACSH G V  G+   + M +KY I P+++HY
Subjt:  MYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSM-EAGNDDHKPNRVTFVAILIACSHKGLVAEGRHFLSLMINKYKIMPDLKHY

Query:  GCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAEEERWDDVGRLRNEMIEYGVCKKAGS
        GCM+DLL R G L+EA E+IK  PF + A +W +LL    VH  LELGE A  +L +LE    G+Y+LL+N+Y+   RWD+   +RN M   GV K AG 
Subjt:  GCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAEEERWDDVGRLRNEMIEYGVCKKAGS

Query:  SHVKIQ
        S ++++
Subjt:  SHVKIQ

Q9LS72 Pentatricopeptide repeat-containing protein At3g29230 | 4.1e-76 | 31.24
Query:  LPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRMDVHSDHFTY
        LPK   L+ +K+L   HA I++  L   + +  KLI+  SL    +L  A  +F  +   +  +CN++IRA++ +  P +A  +++ MQR  + +D+FTY
Subjt:  LPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRMDVHSDHFTY

Query:  NFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCG-------LVVF--------------------------ARM
         F+LKAC+             G   +      +H+ + KLGL  D +V N+L+  YS CG       + +F                          AR 
Subjt:  NFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCG-------LVVF--------------------------ARM

Query:  LFEEMTVRSAVSWNIMMSAYNRVRDYKSADSLLQLMPQTN------------------------------------------------------------
        LF+EM  R  +SWN M+  Y R R+   A  L + MP+ N                                                            
Subjt:  LFEEMTVRSAVSWNIMMSAYNRVRDYKSADSLLQLMPQTN------------------------------------------------------------

Query:  --SGIRATEVTFISILGACAETGSLEMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDS
          SG++      ISIL AC E+G L +G +IH  LK  +     Y+ NA++DMYAKCG L  A +VFN++  K +  WN M+ GL VHG+ + A+++F  
Subjt:  --SGIRATEVTFISILGACAETGSLEMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDS

Query:  MEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGE
        M    +  +P++VTF+A+L +C+H GL+ EG  +   M   Y ++P ++HYGC+VDLL R G L+EA ++++  P     V+W  LLG CR+H  +++ +
Subjt:  MEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGE

Query:  EAFCKLGELEARKDGDYVLLSNIYAEEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ
        E    L +L+    G+Y LLSNIYA  E W+ V  +R++M   GV K +G+S V+++
Subjt:  EAFCKLGELEARKDGDYVLLSNIYAEEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ

Q9LXF2 Pentatricopeptide repeat-containing protein At5g15300 | 2.4e-76 | 32.95
Query:  PKLLR-LSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRMDVHSDHFTY
        PKL +   +++ L+  HA +V  GL +++ V+ +LI  +SLS  G+L +AH LF +I   D  ICN ++R  + S+ P K + +Y  M++  V  D +T+
Subjt:  PKLLR-LSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPLKALLIYNHMQRMDVHSDHFTY

Query:  NFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGC-------------------------------GLVVFARMLF
         FVLKAC++                    G   H +V++ G   + +V+N+L+L ++ C                               G +  A  LF
Subjt:  NFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGC-------------------------------GLVVFARMLF

Query:  EEMTVRSAVSWNIMMSAYNRVRDYKSADSLLQLMPQ-----------------------------TNSGIRATEVTFISILGACAETGSLEMGKKIH-ES
        +EM  +  V+WN+M++   + ++  SA  L     +                              ++G     VT +S+L ACA  G LE GK++H   
Subjt:  EEMTVRSAVSWNIMMSAYNRVRDYKSADSLLQLMPQ-----------------------------TNSGIRATEVTFISILGACAETGSLEMGKKIH-ES

Query:  LKAEHYRIEGYLG----NAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKGLVAE
        L+        Y+G    NA++DMYAKCG +  A+EVF  ++ + +S WN +I+GLA+H + E +++MF+ M+       PN VTF+ +++ACSH G V E
Subjt:  LKAEHYRIEGYLG----NAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKGLVAE

Query:  GRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAEEERW
        GR + SLM + Y I P++KHYGCMVD+L R G L+EA+  ++       A+VWRTLLG C+++ ++ELG+ A  KL  +   + GDYVLLSNIYA   +W
Subjt:  GRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYAEEERW

Query:  DDVGRLRNEMIEYGVCKKAGSSHVK
        D V ++R    +  V K  G S ++
Subjt:  DDVGRLRNEMIEYGVCKKAGSSHVK

Q9LXY5 Pentatricopeptide repeat-containing protein At3g56550 | 7.9e-80 | 40.23
Query:  ILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFI-CNTMIRAYSNSVFPLKALLIYNHMQRMDV-HSDH
        I+  L   +SMK+L   H+ ++  GL +H  +   L+ F ++S +GSL HA  LF     D S    N +IR +SNS  PL ++L YN M    V   D 
Subjt:  ILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFI-CNTMIRAYSNSVFPLKALLIYNHMQRMDV-HSDH

Query:  FTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVRDYKSAD
        FT+NF LK+C R IK   K   C           EIH  V++ G   D  V  SL+  YS  G V  A  +F+EM VR  VSWN+M+  ++ V  +  A 
Subjt:  FTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVRDYKSAD

Query:  SLLQLMPQTNSGIRATEVTFISILGACAETGSLEMGKKIHESLKAEHYRIEG--YLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGY
        S+ + M   N G+     T +++L +CA   +L MG  +H    A   R E   ++ NA++DMYAKCG L  A+ VFN M  + V  WN+MI+G  VHG+
Subjt:  SLLQLMPQTNSGIRATEVTFISILGACAETGSLEMGKKIHESLKAEHYRIEG--YLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGY

Query:  CERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGC
           A+  F  M A     +PN +TF+ +L+ CSH+GLV EG     +M +++ + P++KHYGCMVDL  R G L+ + EMI         V+WRTLLG C
Subjt:  CERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGC

Query:  RVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA
        ++HR+LELGE A  KL +LEA   GDYVL+++IY+
Subjt:  RVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA

Arabidopsis top hits (columns: hit, e value, % identity, alignment)
AT3G27560.1 Protein kinase superfamily protein; e value 1.0e-114; % identity 66.25
Query:  GSITDQHLTIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRFAREVNMMSRVKHENLVKFIGACKEPLMVIVTELL
        G  +++   +D   LVDP+ LF+G KIGEGAH KVYEG+YRNQ VAIK++ RG + EE A  +NRFARE+ M+S+V+H+NLVKFIGACKEP+MVIVTELL
Subjt:  GSITDQHLTIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRFAREVNMMSRVKHENLVKFIGACKEPLMVIVTELL

Query:  PGMSLRKYLMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFGLAREESVTEMMTAETGTYRWMAPELYSTVTLRQG
         G +LRKYL++ R ++LD RLA+ FALDIARAM+CLH++GIIHRDLKP+NL+L+A+ ++VKLADFGLAREES+TEMMTAETGTYRWMAPELYSTVTLRQG
Subjt:  PGMSLRKYLMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFGLAREESVTEMMTAETGTYRWMAPELYSTVTLRQG

Query:  EKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWVEDSKMRPSFSQIIRMLNAYLFT------LPPP--
        EKKHYN+KVD YSF IVLWEL+ N++PFEGMSNLQAAYAAAFK  RPS   D+P DL  IV SCW ED   RP+F++II+ML  YL T      +PPP  
Subjt:  EKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWVEDSKMRPSFSQIIRMLNAYLFT------LPPP--

Query:  ---SSPSSPSSPKSDTT
           SS +   SP+S  T
Subjt:  ---SSPSSPSSPKSDTT

AT3G56550.1 Pentatricopeptide repeat (PPR) superfamily protein; e value 5.6e-81; % identity 40.23
Query:  ILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFI-CNTMIRAYSNSVFPLKALLIYNHMQRMDV-HSDH
        I+  L   +SMK+L   H+ ++  GL +H  +   L+ F ++S +GSL HA  LF     D S    N +IR +SNS  PL ++L YN M    V   D 
Subjt:  ILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFI-CNTMIRAYSNSVFPLKALLIYNHMQRMDV-HSDH

Query:  FTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVRDYKSAD
        FT+NF LK+C R IK   K   C           EIH  V++ G   D  V  SL+  YS  G V  A  +F+EM VR  VSWN+M+  ++ V  +  A 
Subjt:  FTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNRVRDYKSAD

Query:  SLLQLMPQTNSGIRATEVTFISILGACAETGSLEMGKKIHESLKAEHYRIEG--YLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGY
        S+ + M   N G+     T +++L +CA   +L MG  +H    A   R E   ++ NA++DMYAKCG L  A+ VFN M  + V  WN+MI+G  VHG+
Subjt:  SLLQLMPQTNSGIRATEVTFISILGACAETGSLEMGKKIHESLKAEHYRIEG--YLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGY

Query:  CERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGC
           A+  F  M A     +PN +TF+ +L+ CSH+GLV EG     +M +++ + P++KHYGCMVDL  R G L+ + EMI         V+WRTLLG C
Subjt:  CERALDMFDSMEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGC

Query:  RVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA
        ++HR+LELGE A  KL +LEA   GDYVL+++IY+
Subjt:  RVHRHLELGEEAFCKLGELEARKDGDYVLLSNIYA

AT5G01850.1 Protein kinase superfamily protein; e value 4.4e-134; % identity 76.49
Query:  TIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRFAREVNMMSRVKHENLVKFIGACKEPLMVIVTELLPGMSLRKY
        TI+++LLVDPKLLFIGSKIGEGAHGKVY+GRY  QIVAIKV++RGS  +++++LE+RF REVNMMSRV+H NLVKFIGACK+PLMVIVTELLPGMSLRKY
Subjt:  TIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRFAREVNMMSRVKHENLVKFIGACKEPLMVIVTELLPGMSLRKY

Query:  LMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFGLAREESVTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNK
        L + R Q L   LA++FALDIARA+ CLHANGIIHRDLKPDNLLLT N +SVKLADFGLAREESVTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNK
Subjt:  LMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFGLAREESVTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNK

Query:  VDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWVEDSKMRPSFSQIIRMLNAYLFTLPPPSSPSSPSSPKSDTTET
        VDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERP +P  I P LAFIVQSCWVED  MRPSFSQIIR+LN +L TL PP  P  P  P++ T  T
Subjt:  VDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWVEDSKMRPSFSQIIRMLNAYLFTLPPPSSPSSPSSPKSDTTET

Query:  PTSSNGRWNMVLKGALSYI
           +   +++  KG  ++I
Subjt:  PTSSNGRWNMVLKGALSYI

AT5G40540.1 Protein kinase superfamily protein; e value 1.5e-110; % identity 60.59
Query:  GSITDQHLTIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRFAREVNMMSRVKHENLVKFIGACKEPLMVIVTELL
        G  +++   +D   +VDP+ LF+G KIGEGAH K+YEG+Y+N+ VAIK++ RG + EE A  E+RFAREV+M+SRV+H+NLVKFIGACKEP+MVIVTELL
Subjt:  GSITDQHLTIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRFAREVNMMSRVKHENLVKFIGACKEPLMVIVTELL

Query:  PGMSLRKYLMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFGLAREESVTEMMTAETGTYRWMAPELYSTVTLRQG
         G +LRKYL++ R   LD R+A+ +ALDIARAM+CLH++G+IHRDLKP++L+LTA+ ++VKLADFGLAREES+TEMMTAETGTYRWMAPELYSTVTLR G
Subjt:  PGMSLRKYLMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFGLAREESVTEMMTAETGTYRWMAPELYSTVTLRQG

Query:  EKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWVEDSKMRPSFSQIIRMLNAYLFTLP----------
        EKKHYN+KVD YSF IVLWEL+ N++PFEGMSNLQAAYAAAFK  RPS   D+P DLA IV SCW ED   RP+F++II+ML   L T+           
Subjt:  EKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWVEDSKMRPSFSQIIRMLNAYLFTLP----------

Query:  ----------PPSSPSSPS-SPKSDTTETPTSSNGRWNMV
                  PP SP + S     D  + PT +N   N V
Subjt:  ----------PPSSPSSPS-SPKSDTTETPTSSNGRWNMV

AT5G50180.1 Protein kinase superfamily protein; e value 5.2e-111; % identity 66.67
Query:  VDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRFAREVNMMSRVKHENLVKFIGACKEPLMVIVTELLPGMSLRKYLMNNRKQ
        +DP+LLF+G KIGEGAH KVYEG+Y+NQ VAIK++HRG T EE A  ++RF REV M+SRV+H+NLVKFIGACKEP+MVIVTELL G +LRKYL+N R  
Subjt:  VDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRFAREVNMMSRVKHENLVKFIGACKEPLMVIVTELLPGMSLRKYLMNNRKQ

Query:  QLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFGLAREESVTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNKVDVYSFG
         L+ R+AI FALDIAR M+CLH++GIIHRDLKP+NLLLTA+ ++VKLADFGLAREES+TEMMTAETGTYRWMAPELYSTVTLR GEKKHYN+KVD YSF 
Subjt:  QLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFGLAREESVTEMMTAETGTYRWMAPELYSTVTLRQGEKKHYNNKVDVYSFG

Query:  IVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWVEDSKMRPSFSQIIRMLNAYLFTLPPPSSPSSPSSPKSDTTETPTSSNGR
        IVLWELL N++PFEGMSNLQAAYAAAFK  RPS    +P +L  IV SCW ED   RP+F+ II +L  YL  +  P S        S  T  P  S G 
Subjt:  IVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWVEDSKMRPSFSQIIRMLNAYLFTLPPPSSPSSPSSPKSDTTETPTSSNGR

Query:  WNMVLK
         +++ K
Subjt:  WNMVLK


Sequences
CDS sequence
ATGAATTGTAACGAGGAGAATAGAGGGGGAGATGATAGGGAAACTGAGGCGCCAGTTTTGAGGAACTCTGTTGAGGCAGAAGGGAAACCAGTCATCCAGATTGGATCCAT
AACAGACCAACACTTGACCATCGATGATAATCTTCTCGTTGATCCAAAATTACTATTTATTGGATCCAAGATTGGCGAGGGTGCTCATGGGAAGGTTTATGAAGGCAGGT
ACCGGAACCAAATTGTGGCCATTAAAGTTCTTCATCGAGGGAGTACTGTAGAAGAAAGGGCAGCACTTGAAAATCGTTTTGCCCGTGAAGTAAATATGATGTCCCGAGTA
AAACATGAAAATCTTGTTAAGTTTATTGGAGCTTGTAAAGAACCTCTAATGGTGATAGTTACAGAGCTATTACCAGGAATGTCACTCAGGAAGTATCTGATGAATAATCG
TAAGCAACAGCTGGACCCTCGGCTGGCCATTAACTTTGCTTTGGATATTGCTCGTGCTATGGATTGTCTACATGCAAATGGGATTATACATAGAGATCTCAAACCTGATA
ATTTGTTGCTTACTGCAAACCAAAGGTCTGTGAAGCTTGCAGACTTTGGACTTGCTAGAGAAGAATCTGTGACCGAGATGATGACTGCAGAAACAGGGACTTACCGCTGG
ATGGCTCCTGAGCTATATAGCACTGTGACATTGCGGCAGGGAGAGAAGAAGCATTACAACAACAAGGTGGACGTTTACAGCTTTGGAATTGTGTTATGGGAACTGTTGAC
CAACCGAATGCCATTTGAAGGGATGTCCAATTTGCAAGCTGCATATGCTGCTGCTTTCAAGCAAGAGAGACCGAGTATTCCAGGCGACATACCTCCAGATCTAGCATTTA
TCGTACAGTCGTGTTGGGTGGAAGATTCTAAGATGAGGCCCAGCTTCAGCCAGATCATCCGCATGCTTAATGCATATCTATTTACACTCCCACCTCCTTCTTCGCCGTCT
TCACCGTCTTCACCAAAATCTGACACAACGGAGACACCAACGTCTAGCAATGGAAGATGGAACATGGTGCTGAAAGGTGCCCTTTCTTACATACTTCCCAAGCTTCTCCG
TCTCTCCTCCATGAAAGAACTGGAACTGACCCACGCATTCATCGTCAAAGCTGGTCTCTGCAATCACATTCCTGTAATGACGAAGCTAATTGCGTTCTCATCCCTTTCTC
CATCTGGAAGTCTCCCTCATGCTCATGCTTTGTTCCAAGACATTTCCATGGACGATTCTTTCATTTGTAACACCATGATTCGTGCCTACTCTAACAGCGTTTTCCCCCTT
AAAGCTTTGCTTATTTACAACCATATGCAACGTATGGATGTTCATTCTGATCATTTCACCTACAATTTTGTGCTCAAGGCTTGTGCAAGAGCTATCAAATGCACTGAAAA
GGACGACCAATGTTTTGGCCACGATATCATTTCACGCAAGGGCGCTGAAATTCATAGCCGCGTCCTGAAATTGGGGCTCGATCAAGATCATCACGTCCAGAATTCATTGC
TTCTAATGTACTCCGGGTGTGGCTTGGTAGTTTTTGCTCGTATGCTTTTCGAAGAAATGACTGTGAGAAGTGCTGTTTCATGGAACATTATGATGTCGGCTTACAATCGA
GTTCGTGACTATAAGTCAGCGGATTCTCTTCTTCAATTGATGCCTCAGACAAATTCGGGCATTAGAGCAACTGAAGTAACGTTTATATCTATTTTGGGCGCCTGTGCTGA
AACCGGTTCATTGGAGATGGGGAAGAAAATCCACGAGTCCTTGAAAGCGGAGCATTACAGAATTGAAGGATATCTAGGTAATGCCATAGTTGATATGTACGCTAAATGTG
GGGAGCTGAGCTTAGCTTTGGAGGTATTCAACGAAATGGAGATGAAGCCTGTAAGTTGCTGGAACGCGATGATTATGGGTTTGGCAGTTCATGGCTACTGTGAGAGAGCT
TTGGACATGTTTGATTCCATGGAGGCAGGGAATGATGATCACAAACCCAATCGGGTGACTTTCGTGGCTATTCTGATTGCCTGTAGTCACAAGGGTCTGGTTGCAGAAGG
ACGTCATTTTTTAAGTCTGATGATTAACAAATACAAGATAATGCCGGACTTAAAGCACTACGGTTGCATGGTTGACCTTCTCAGCAGATGGGGTTTTTTGCAGGAAGCAT
ACGAGATGATCAAAGGTTGCCCTTTCAGTTCATGCGCTGTTGTGTGGAGAACGTTGTTGGGTGGTTGTAGAGTACACAGACATTTAGAATTGGGCGAGGAAGCGTTCTGC
AAACTGGGAGAGTTGGAGGCAAGGAAGGATGGAGATTATGTTCTATTGTCAAACATCTACGCCGAAGAAGAGCGGTGGGATGATGTGGGGCGACTGAGAAACGAGATGAT
TGAGTATGGAGTTTGCAAGAAAGCTGGATCCAGCCACGTTAAAATTCAATAG
mRNA sequence
AAAGTTATCCAAAGTTGTCGTCGGAAGAGAGGGTTGGAATTATCTCGCAGCGGACTGCTTCTTTCGTCGCAGTGGGTTGAGAGGGAGAGAGAGTGTGCGCGCGCGCGTGT
GTGCCGTTTCCATTTCCATCTCTAGTGAGAGAAACTGAGAGCTTTGCCTTGTTTGTTTATTATGTGTTTAGATGCTCATTCAACTACGGAAACACGAATCAAGCTCCACA
CTCCAACTTAGATAGCGACCCCATCCATTTTTCTTATTGCTTTCCTTTCTCCTCTATACTTTCCTCCCAAATTTTGGCTTCAACTGTAAGTTATTTATTTTATTTATTTT
TCTAGGGTTTGTTTGCTTAGGTTACGCCGAGGAGGACGCTTTGAAATCAAGCAAAGCTATGTGCACAGTTTTAAGCTTGCGATTCTCCCGATGTTTAAACCGTAAAAGGG
CGTCCGGGATAGCCGAGCTACATTTTTTTGTTGGCAAAGGGGGATAAGTTTTATTCCAAATCGATCGCTTATTCTTCTTCTTATTATTATATAAACTCTGATTTTCTTTT
CGTTTCTGTCAAAATCGTACAACAGGTGTTCAAGTTTTTGGTGCTGTGAATGGGTTCATATACGATAGTTGAGATTTAGTCGGTGGTTTGAAGATGTTCTACTGATTGGC
CATCGAGAAATGTGAATGATTGAAGTAATGTTAGCTTTGGTTTAAGTTTGTCCACAACGTATCAGTATGAGGAAGTATACAATAATTTGAGGGTGCACTATGAATTGTAA
CGAGGAGAATAGAGGGGGAGATGATAGGGAAACTGAGGCGCCAGTTTTGAGGAACTCTGTTGAGGCAGAAGGGAAACCAGTCATCCAGATTGGATCCATAACAGACCAAC
ACTTGACCATCGATGATAATCTTCTCGTTGATCCAAAATTACTATTTATTGGATCCAAGATTGGCGAGGGTGCTCATGGGAAGGTTTATGAAGGCAGGTACCGGAACCAA
ATTGTGGCCATTAAAGTTCTTCATCGAGGGAGTACTGTAGAAGAAAGGGCAGCACTTGAAAATCGTTTTGCCCGTGAAGTAAATATGATGTCCCGAGTAAAACATGAAAA
TCTTGTTAAGTTTATTGGAGCTTGTAAAGAACCTCTAATGGTGATAGTTACAGAGCTATTACCAGGAATGTCACTCAGGAAGTATCTGATGAATAATCGTAAGCAACAGC
TGGACCCTCGGCTGGCCATTAACTTTGCTTTGGATATTGCTCGTGCTATGGATTGTCTACATGCAAATGGGATTATACATAGAGATCTCAAACCTGATAATTTGTTGCTT
ACTGCAAACCAAAGGTCTGTGAAGCTTGCAGACTTTGGACTTGCTAGAGAAGAATCTGTGACCGAGATGATGACTGCAGAAACAGGGACTTACCGCTGGATGGCTCCTGA
GCTATATAGCACTGTGACATTGCGGCAGGGAGAGAAGAAGCATTACAACAACAAGGTGGACGTTTACAGCTTTGGAATTGTGTTATGGGAACTGTTGACCAACCGAATGC
CATTTGAAGGGATGTCCAATTTGCAAGCTGCATATGCTGCTGCTTTCAAGCAAGAGAGACCGAGTATTCCAGGCGACATACCTCCAGATCTAGCATTTATCGTACAGTCG
TGTTGGGTGGAAGATTCTAAGATGAGGCCCAGCTTCAGCCAGATCATCCGCATGCTTAATGCATATCTATTTACACTCCCACCTCCTTCTTCGCCGTCTTCACCGTCTTC
ACCAAAATCTGACACAACGGAGACACCAACGTCTAGCAATGGAAGATGGAACATGGTGCTGAAAGGTGCCCTTTCTTACATACTTCCCAAGCTTCTCCGTCTCTCCTCCA
TGAAAGAACTGGAACTGACCCACGCATTCATCGTCAAAGCTGGTCTCTGCAATCACATTCCTGTAATGACGAAGCTAATTGCGTTCTCATCCCTTTCTCCATCTGGAAGT
CTCCCTCATGCTCATGCTTTGTTCCAAGACATTTCCATGGACGATTCTTTCATTTGTAACACCATGATTCGTGCCTACTCTAACAGCGTTTTCCCCCTTAAAGCTTTGCT
TATTTACAACCATATGCAACGTATGGATGTTCATTCTGATCATTTCACCTACAATTTTGTGCTCAAGGCTTGTGCAAGAGCTATCAAATGCACTGAAAAGGACGACCAAT
GTTTTGGCCACGATATCATTTCACGCAAGGGCGCTGAAATTCATAGCCGCGTCCTGAAATTGGGGCTCGATCAAGATCATCACGTCCAGAATTCATTGCTTCTAATGTAC
TCCGGGTGTGGCTTGGTAGTTTTTGCTCGTATGCTTTTCGAAGAAATGACTGTGAGAAGTGCTGTTTCATGGAACATTATGATGTCGGCTTACAATCGAGTTCGTGACTA
TAAGTCAGCGGATTCTCTTCTTCAATTGATGCCTCAGACAAATTCGGGCATTAGAGCAACTGAAGTAACGTTTATATCTATTTTGGGCGCCTGTGCTGAAACCGGTTCAT
TGGAGATGGGGAAGAAAATCCACGAGTCCTTGAAAGCGGAGCATTACAGAATTGAAGGATATCTAGGTAATGCCATAGTTGATATGTACGCTAAATGTGGGGAGCTGAGC
TTAGCTTTGGAGGTATTCAACGAAATGGAGATGAAGCCTGTAAGTTGCTGGAACGCGATGATTATGGGTTTGGCAGTTCATGGCTACTGTGAGAGAGCTTTGGACATGTT
TGATTCCATGGAGGCAGGGAATGATGATCACAAACCCAATCGGGTGACTTTCGTGGCTATTCTGATTGCCTGTAGTCACAAGGGTCTGGTTGCAGAAGGACGTCATTTTT
TAAGTCTGATGATTAACAAATACAAGATAATGCCGGACTTAAAGCACTACGGTTGCATGGTTGACCTTCTCAGCAGATGGGGTTTTTTGCAGGAAGCATACGAGATGATC
AAAGGTTGCCCTTTCAGTTCATGCGCTGTTGTGTGGAGAACGTTGTTGGGTGGTTGTAGAGTACACAGACATTTAGAATTGGGCGAGGAAGCGTTCTGCAAACTGGGAGA
GTTGGAGGCAAGGAAGGATGGAGATTATGTTCTATTGTCAAACATCTACGCCGAAGAAGAGCGGTGGGATGATGTGGGGCGACTGAGAAACGAGATGATTGAGTATGGAG
TTTGCAAGAAAGCTGGATCCAGCCACGTTAAAATTCAATAG
Protein sequence
MNCNEENRGGDDRETEAPVLRNSVEAEGKPVIQIGSITDQHLTIDDNLLVDPKLLFIGSKIGEGAHGKVYEGRYRNQIVAIKVLHRGSTVEERAALENRFAREVNMMSRV
KHENLVKFIGACKEPLMVIVTELLPGMSLRKYLMNNRKQQLDPRLAINFALDIARAMDCLHANGIIHRDLKPDNLLLTANQRSVKLADFGLAREESVTEMMTAETGTYRW
MAPELYSTVTLRQGEKKHYNNKVDVYSFGIVLWELLTNRMPFEGMSNLQAAYAAAFKQERPSIPGDIPPDLAFIVQSCWVEDSKMRPSFSQIIRMLNAYLFTLPPPSSPS
SPSSPKSDTTETPTSSNGRWNMVLKGALSYILPKLLRLSSMKELELTHAFIVKAGLCNHIPVMTKLIAFSSLSPSGSLPHAHALFQDISMDDSFICNTMIRAYSNSVFPL
KALLIYNHMQRMDVHSDHFTYNFVLKACARAIKCTEKDDQCFGHDIISRKGAEIHSRVLKLGLDQDHHVQNSLLLMYSGCGLVVFARMLFEEMTVRSAVSWNIMMSAYNR
VRDYKSADSLLQLMPQTNSGIRATEVTFISILGACAETGSLEMGKKIHESLKAEHYRIEGYLGNAIVDMYAKCGELSLALEVFNEMEMKPVSCWNAMIMGLAVHGYCERA
LDMFDSMEAGNDDHKPNRVTFVAILIACSHKGLVAEGRHFLSLMINKYKIMPDLKHYGCMVDLLSRWGFLQEAYEMIKGCPFSSCAVVWRTLLGGCRVHRHLELGEEAFC
KLGELEARKDGDYVLLSNIYAEEERWDDVGRLRNEMIEYGVCKKAGSSHVKIQ
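
The CDS and protein sequences listed above can be cross-checked against each other by translating the CDS. The following is a minimal Python sketch, assuming the standard nuclear genetic code (NCBI translation table 1). The names `CODON_TABLE` and `translate` are illustrative helpers and are not part of the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored, so the wrapped sequence block
    can be pasted as-is. Codons not in the table (e.g. containing N) are
    reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last four codons of the CDS listed above.
print(translate("ATGAATTGTAACGAGGAGAAT"))
print(translate("AAAATTCAATAG"))
```

Passing the full CDS block to `translate` and comparing the result with the protein block is a quick way to confirm that the two records describe the same gene model.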