; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0025713 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0025713
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr09:21340192..21343906
RNA-Seq ExpressionPI0025713
SyntenyPI0025713
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003723 - RNA binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7025166.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0087.5Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ
        MS S  + IL+GLSIYKL TFIPK WKN+PVSN  +FMI SIF SLK+FASHGQLSK FEAFSL+QLR+SY+DSFDLI+QSISILLVSCT CSSLP GKQ
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ

Query:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL
        LHG II SGL EDS LVPKLVTFYSSFKLL EAHTLVENSNLFHPC WN+LI SYVRNELHE+AILAYKQMLSKGVRPDNFTFPSILKACGET+NL  GL
Subjt:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL

Query:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA
        EVHK IN+WS +WSLFV NALISMYGRCGE+DTARNLFDNMLDRDAVSWNSMISCYAS+GMW+EAFELFD MQSKCLEIN+VTWNIIAGGCLR+G FT+A
Subjt:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRH YHK STVQN+L+TMYARCKDIT AY+LFR+NDDKSIITWNSMLSGL+H+DRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKR DF+DYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
         KA+RLFEEMK   IKPDH+TMVAVLSACSHSGL+ QGE+LFAEMQ+VHGLSPRLEHYACMADLFGRVGLL++AKEIITRMPYRPTSA+WATLIGACCIH
Subjt:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH

Query:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI
         NTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAK PGCSWV+VGSEFVSF VGDTSN QALESK LLD L DVMKHG+L+
Subjt:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI

Query:  TTDNYDICDNIF
         TD+YDI D+IF
Subjt:  TTDNYDICDNIF

XP_004141540.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucumis sativus]0.0e+0091.29Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ
        MSFSPSQCILKGLSI KL+TFIPKSWKNLPVSNSS+FMI SIFSSLK+FASHGQLSK+FEAFSLIQLRTSY+DSFDLILQSISILLVSCT  SSLP GKQ
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ

Query:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL
        LHGHIISSGLVEDSFLV KLV FYSS + LPEAHTLVE SNLF PCSWNILI SYV+++L+EAAILAYKQM+SKGVRPDNFTFPSILKACGET+NL+ GL
Subjt:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL

Query:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA
        EVHKSIN+WST WSLFVHNALISMYGRCGEVDTARNLFDNML+RDAVSWNSMISCY+SRGMWREAFELF+SMQSKCLEINVVTWNIIAGGCLRVGNFTQA
Subjt:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRHY+H LSTVQN+LVTMYARCKDI HAYMLFRLNDDKS ITWNSMLSGLTHL RVE+
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        ALCLFRELLLFGVEP+YVTFASILPLCARVADLQHGREFHCYITK  DFRD+LLLWNALVDMYAR+GKV EAKR+F SLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
        GKAVRLFEEMKRFQIKPDH+TM+AVLSACSHSGL+NQ ELLFAEMQ+VHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
Subjt:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH

Query:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI
        GN DIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLA+IRTLMRDSGVAK+PGCSWVDVGSEF+SFSVGDTS+ QALESKLLLDSLYDVMKHGSLI
Subjt:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI

Query:  TTDNYDICDNIF
         TD+YD  DNIF
Subjt:  TTDNYDICDNIF

XP_008459581.1 PREDICTED: pentatricopeptide repeat-containing protein At1g71490 [Cucumis melo]0.0e+0091.99Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ
        MS SPS+CILKGLSI KL+TFIPK+WK LPVSNSS+FMI SIFSSLK+FASHGQLSKTFEAFSLIQLRTSY+DSFDLILQSISILLVSCT CSSLP GKQ
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ

Query:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL
        LHGHIISSGLVEDSFLV KLV FYSS + LPEAHTLVE SNLF PCSWNIL+ SYVRN+L+EAAILAYKQMLSKGVRPDNFTFPSILKACGET+NLE GL
Subjt:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL

Query:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA
        EVHKSINA ST WSLFVHNALISMYGRCGEVDTAR LFD ML+RD VSWNSMISCY+SRGMWREAFELF+SMQSK LEINVVTWNIIAGGCLRVGNFT+A
Subjt:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        L LLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRHY+H LSTVQN+LVTMYARCKDI HAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        ALCLFRELLLFGVEPNYVTFASILPLCARVA+LQHGREFHCYITKRHDFRD+LLLWNALVDMYARSGKV EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
        GKA+RLFEEMKRFQIKPDH+TMVAVLSACSHSGLLNQGELLFAEMQ+VHGLSPRLEHY+CMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
Subjt:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH

Query:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI
        GNTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLA+IRT MRDSGVAKVPGCSWVDVGSEFVSFSVGDTS+ QALESKLLLDSLYDV+KH SLI
Subjt:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI

Query:  TTDNYDICDNIF
        TTDNYD  DNIF
Subjt:  TTDNYDICDNIF

XP_022925519.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita moschata]0.0e+0087.5Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ
        MS S  + IL+GLSIYKL TFIPK WKN+PVSN  +FMI SIF SLK+FASHGQLSK FEAFSL+QLR SY+DSFDLI+QSISILLVSCT CSSLP GKQ
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ

Query:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL
        LHG IISSGL EDS LVPKLVTFYSSFKLL EAHTLVENSNLFHPC WN+LI SYVRNELHE+AILAYKQMLSKGVRPDNFTFPSILKACGET+NL  GL
Subjt:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL

Query:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA
        EVHK IN+WS +WSLFV NALISMYGRCGE+DTARNLFDNMLDRDAVSWNSMISCYAS GMW+EAFELFD MQSKCLEIN+VTWNIIAGGCLR+G FTQA
Subjt:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRH YHK STVQN+L+TMYARCKDI  AY+LFRLNDDKSIITWNSMLSGL+H+DRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL+GVEPNYVTFASILPLCARVADLQHGREFHCYITKR DF+DYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
         KA+RLFEEMK   IKPDH+TMVAVLSACSHSGL+ QGE+LFAEMQ+VHGLSPRLEHYACMADLFGRVGLL++AKEIITRMPYRPTSA+WATLIGACCIH
Subjt:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH

Query:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI
         NTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAK PGCSWV+VGSEFVSF VGDTSN QALESK LLD L DVMKHG+L+
Subjt:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI

Query:  TTDNYDICDNIF
          D+YDI D+IF
Subjt:  TTDNYDICDNIF

XP_038890628.1 pentatricopeptide repeat-containing protein At1g71490 [Benincasa hispida]0.0e+0092.09Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ
        MS+S S CILKGLSIYKLQTFIPK W+N+PVSN SKFMIDSIFSSLKNFAS+GQLSKTFEAFSLI+LR SY+DSFDLILQSISILLVSCT+CSSLP GKQ
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ

Query:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL
        LHGHII+SGL EDSFLVPKLVTFYSSFKLLPEAHTLVE SNLFHPC+WN+LI SYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGET+NLE GL
Subjt:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL

Query:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA
        EVHKSINAWSTKWSLFV NAL+SMYGRCGEVDTARNLFDNML+ DAVSWNSMISCYAS+GMW+EAFELFD MQSKC+ INVVTWNIIAGGCLRVGNFT+A
Subjt:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRN GI+LDNVAM+IGLGACSHIGAIRLGKEIHGFTIRHYYHKLST+QN+LVTMYARCKDI HAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKR DFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI+GYGMQGEG
Subjt:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
         KA+RLFEEMKRF+IKPDH+TMVAVLSACSHSGLL QGELLFAEMQ+VHGL P LEHYACMADLFGRVGLLNKAKEIITRMPYRPTSA+WATLIGACCIH
Subjt:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH

Query:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMK
        GNTDIGEWAAEKLLEM PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGS FVSF VGDTSN QALESKL+LDSL DVMK
Subjt:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMK

TrEMBL top hitse value%identityAlignment
A0A1S3CB12 pentatricopeptide repeat-containing protein At1g714900.0e+0091.99Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ
        MS SPS+CILKGLSI KL+TFIPK+WK LPVSNSS+FMI SIFSSLK+FASHGQLSKTFEAFSLIQLRTSY+DSFDLILQSISILLVSCT CSSLP GKQ
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ

Query:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL
        LHGHIISSGLVEDSFLV KLV FYSS + LPEAHTLVE SNLF PCSWNIL+ SYVRN+L+EAAILAYKQMLSKGVRPDNFTFPSILKACGET+NLE GL
Subjt:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL

Query:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA
        EVHKSINA ST WSLFVHNALISMYGRCGEVDTAR LFD ML+RD VSWNSMISCY+SRGMWREAFELF+SMQSK LEINVVTWNIIAGGCLRVGNFT+A
Subjt:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        L LLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRHY+H LSTVQN+LVTMYARCKDI HAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        ALCLFRELLLFGVEPNYVTFASILPLCARVA+LQHGREFHCYITKRHDFRD+LLLWNALVDMYARSGKV EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
        GKA+RLFEEMKRFQIKPDH+TMVAVLSACSHSGLLNQGELLFAEMQ+VHGLSPRLEHY+CMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
Subjt:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH

Query:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI
        GNTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLA+IRT MRDSGVAKVPGCSWVDVGSEFVSFSVGDTS+ QALESKLLLDSLYDV+KH SLI
Subjt:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI

Query:  TTDNYDICDNIF
        TTDNYD  DNIF
Subjt:  TTDNYDICDNIF

A0A5D3BN10 Pentatricopeptide repeat-containing protein0.0e+0091.99Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ
        MS SPS+CILKGLSI KL+TFIPK+WK LPVSNSS+FMI SIFSSLK+FASHGQLSKTFEAFSLIQLRTSY+DSFDLILQSISILLVSCT CSSLP GKQ
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ

Query:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL
        LHGHIISSGLVEDSFLV KLV FYSS + LPEAHTLVE SNLF PCSWNIL+ SYVRN+L+EAAILAYKQMLSKGVRPDNFTFPSILKACGET+NLE GL
Subjt:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL

Query:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA
        EVHKSINA ST WSLFVHNALISMYGRCGEVDTAR LFD ML+RD VSWNSMISCY+SRGMWREAFELF+SMQSK LEINVVTWNIIAGGCLRVGNFT+A
Subjt:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        L LLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRHY+H LSTVQN+LVTMYARCKDI HAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        ALCLFRELLLFGVEPNYVTFASILPLCARVA+LQHGREFHCYITKRHDFRD+LLLWNALVDMYARSGKV EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
        GKA+RLFEEMKRFQIKPDH+TMVAVLSACSHSGLLNQGELLFAEMQ+VHGLSPRLEHY+CMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
Subjt:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH

Query:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI
        GNTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLA+IRT MRDSGVAKVPGCSWVDVGSEFVSFSVGDTS+ QALESKLLLDSLYDV+KH SLI
Subjt:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI

Query:  TTDNYDICDNIF
        TTDNYD  DNIF
Subjt:  TTDNYDICDNIF

A0A6J1CJU8 pentatricopeptide repeat-containing protein At1g71490-like0.0e+0086.52Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ
        MS S SQ I +GLS+YKLQTFIPK WKN PVSN S+FMI  +FSSLK+FA HGQLSK FEAFSLIQLRT Y+DSFDLILQS SILLVSCT+ SSLP G+Q
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ

Query:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL
        LHG II SGL +DS LVPKLVTFYSSFKLL EAHTLVENSN+FHPC WN+LI SYVRN LHEAAIL YKQMLS+G+RPDNFTFPSILKACGET+NL  GL
Subjt:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL

Query:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA
        EVHK INAWST+WSLFV NALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYAS+GMW+EAFELFD+MQSKC+EIN+VTWNIIAGGCLRVGNF  A
Subjt:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRH YH+LS VQN+LVTMYARCKDI +AY+LFRLN DKSIITWNSMLSG THLDRVE+
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL GVEPNYVT ASILPLCARVADLQHGREFHCYITKR D  DYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
         KA+RLF+EMKRFQIKPDH+TMVAVLSACSHSGLL QGELLFAEMQ+VHGLSP LEHYACMADLFGRVGLLNKAK IITRMPYRPTSA+WATLIGACCIH
Subjt:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH

Query:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI
        GNT+IGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAK PGCSWVDVGS FVSF VGDTSN QALE+ LLLD+L +VMKHGSL+
Subjt:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI

Query:  TTDNYDICDNIF
        T D++DI ++ F
Subjt:  TTDNYDICDNIF

A0A6J1EI84 pentatricopeptide repeat-containing protein At1g71490 isoform X10.0e+0087.5Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ
        MS S  + IL+GLSIYKL TFIPK WKN+PVSN  +FMI SIF SLK+FASHGQLSK FEAFSL+QLR SY+DSFDLI+QSISILLVSCT CSSLP GKQ
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ

Query:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL
        LHG IISSGL EDS LVPKLVTFYSSFKLL EAHTLVENSNLFHPC WN+LI SYVRNELHE+AILAYKQMLSKGVRPDNFTFPSILKACGET+NL  GL
Subjt:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL

Query:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA
        EVHK IN+WS +WSLFV NALISMYGRCGE+DTARNLFDNMLDRDAVSWNSMISCYAS GMW+EAFELFD MQSKCLEIN+VTWNIIAGGCLR+G FTQA
Subjt:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRH YHK STVQN+L+TMYARCKDI  AY+LFRLNDDKSIITWNSMLSGL+H+DRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELLL+GVEPNYVTFASILPLCARVADLQHGREFHCYITKR DF+DYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
         KA+RLFEEMK   IKPDH+TMVAVLSACSHSGL+ QGE+LFAEMQ+VHGLSPRLEHYACMADLFGRVGLL++AKEIITRMPYRPTSA+WATLIGACCIH
Subjt:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH

Query:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI
         NTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAK PGCSWV+VGSEFVSF VGDTSN QALESK LLD L DVMKHG+L+
Subjt:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI

Query:  TTDNYDICDNIF
          D+YDI D+IF
Subjt:  TTDNYDICDNIF

A0A6J1I8V4 pentatricopeptide repeat-containing protein At1g71490 isoform X10.0e+0086.94Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ
        MS S  + IL+GLSIYKL TFIPK WKN+PVSN  +FMI+SIF SLK+FASHGQLSK FEAFSL+QLR SY+DSFDLI+QSISILLVSCT CSSLP GKQ
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQ

Query:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL
        LHG IISSGL EDS LVPKLVTFYSSFKLL EAHTLVENSNLFHPC WN+LI SYVRNELHE+AILAYKQMLSKGVRPDNFTFPSILKACGET+NL  GL
Subjt:  LHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGL

Query:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA
        EVHK IN+WS +WSLFV NALISMYGRCGE+DTARNLFDNMLDRDAVSWNSMISCYAS+GMW+EAFELFD MQSKCLEIN+VTWNIIAGGCLR+G FT+A
Subjt:  EVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRH YHK STVQN+L+TMYARCKDI  AY+LFRLNDDKSIITWNSMLSGL+H+DRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRE LLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKR DF+DYLLLWNALVDMYARSGKV+EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH
         KA+RLFEEMK   IKPDH+TMVAVLSACSHSGL+ QGE+LFAEMQ+VHGLSP LEHYACMADLFGRVGLL++AKEIITRMPYRPTSA+WATLIGACCIH
Subjt:  GKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIH

Query:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI
         NTDIGEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAK PGCSWV+VGSEFVSF VGDTSN QALESK LLD L DVMKHG+L+
Subjt:  GNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI

Query:  TTDNYDICDNIF
         TD+YDI +++F
Subjt:  TTDNYDICDNIF

SwissProt top hitse value%identityAlignment
Q4V389 Pentatricopeptide repeat-containing protein At1g228305.4e-20251.78Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNL--PVSNSSKFMIDS-----IFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCS
        M  SPS+ IL+GL++ ++  FIP+SWK L  P+S +SK   D      +F+S ++  SHGQL + F  FSL++ +   S S + +L S + LL +C   +
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNL--PVSNSSKFMIDS-----IFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCS

Query:  SLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGET
            G+QLH H ISSGL  DS LVPKLVTFYS+F LL EA T+ ENS + HP  WN+LI SY+RN+  + ++  YK+M+SKG+R D FT+PS++KAC   
Subjt:  SLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGET

Query:  KNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLR
         +   G  VH SI   S + +L+V NALISMY R G+VD AR LFD M +RDAVSWN++I+CY S     EAF+L D M    +E ++VTWN IAGGCL 
Subjt:  KNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLR

Query:  VGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIR--HYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSG
         GN+  AL  +  MRN  + + +VAMI GL ACSHIGA++ GK  H   IR   + H +  V+NSL+TMY+RC D+ HA+++F+  +  S+ TWNS++SG
Subjt:  VGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIR--HYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSG

Query:  LTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI
          + +R E+   L +E+LL G  PN++T ASILPL ARV +LQHG+EFHCYI +R  ++D L+LWN+LVDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI
Subjt:  LTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI

Query:  AGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWA
         GYG  G+G  A+  F++M R  IKPDHVTMVAVLSACSHS L+ +G  LF +M+ V G+  RLEHY+CM DL+ R G L+KA++I   +PY P+SA+ A
Subjt:  AGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWA

Query:  TLIGACCIHGNTDIGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSE
        TL+ AC IHGNT+IGEWAA+K LLE +PEH G+Y+L+A+MYA  GSWSKL  ++TL+ D GV K    + ++  SE
Subjt:  TLIGACCIHGNTDIGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSE

Q9C9I6 Pentatricopeptide repeat-containing protein At1g714903.7e-23559.91Show/hide
Query:  DSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVEN
        +S+F SL + ASHG L   F+ FSL++L++S + S DL+L S + LL +C D  +   G Q+H H ISSG+   S LVPKLVTFYS+F L  EA +++EN
Subjt:  DSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVEN

Query:  SNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFD
        S++ HP  WN+LIASY +NEL E  I AYK+M+SKG+RPD FT+PS+LKACGET ++  G  VH SI   S K SL+V NALISMY R   +  AR LFD
Subjt:  SNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFD

Query:  NMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIH
         M +RDAVSWN++I+CYAS GMW EAFELFD M    +E++V+TWNII+GGCL+ GN+  AL L+S+MRNF   LD VAMIIGL ACS IGAIRLGKEIH
Subjt:  NMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIH

Query:  GFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREF
        G  I   Y  +  V+N+L+TMY++CKD+ HA ++FR  ++ S+ TWNS++SG   L++ E+A  L RE+L+ G +PN +T ASILPLCAR+A+LQHG+EF
Subjt:  GFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREF

Query:  HCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGE
        HCYI +R  F+DY +LWN+LVD+YA+SGK++ AK+V D +SK+DEVTYTSLI GYG QGEGG A+ LF+EM R  IKPDHVT+VAVLSACSHS L+++GE
Subjt:  HCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGE

Query:  LLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSK
         LF +MQ  +G+ P L+H++CM DL+GR G L KAK+II  MPY+P+ A WATL+ AC IHGNT IG+WAAEKLLEM+PE+ GYYVLIANMYAAAGSWSK
Subjt:  LLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSK

Query:  LAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMK
        LA++RT+MRD GV K PGC+W+D  S F  FSVGDTS+ +A  +  LLD L  +MK
Subjt:  LAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMK

Q9LFL5 Pentatricopeptide repeat-containing protein At5g168604.4e-11934.63Show/hide
Query:  SSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLF
        S ++++  +G  +K    F L+   +   D++     +   +  +C + SS+  G+  H   + +G + + F+   LV  YS  + L +A  + +  +++
Subjt:  SSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLF

Query:  HPCSWNILIASYVRNELHEAAILAYKQMLSK-GVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNML
           SWN +I SY +    + A+  + +M ++ G RPDN T  ++L  C       LG ++H          ++FV N L+ MY +CG +D A  +F NM 
Subjt:  HPCSWNILIASYVRNELHEAAILAYKQMLSK-GVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNML

Query:  DRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFT
         +D VSWN+M++ Y+  G + +A  LF+ MQ + ++++VVTW+    G  + G   +AL +  QM + GI  + V +I  L  C+ +GA+  GKEIH + 
Subjt:  DRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFT

Query:  IRH-------YYHKLSTVQNSLVTMYARCKDITHAYMLF--RLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVE--PNYVTFASILPLCARVA
        I++        +   + V N L+ MYA+CK +  A  +F      ++ ++TW  M+ G +       AL L  E+     +  PN  T +  L  CA +A
Subjt:  IRH-------YYHKLSTVQNSLVTMYARCKDITHAYMLF--RLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVE--PNYVTFASILPLCARVA

Query:  DLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSH
         L+ G++ H Y  +       L + N L+DMYA+ G + +A+ VFD++  K+EVT+TSL+ GYGM G G +A+ +F+EM+R   K D VT++ VL ACSH
Subjt:  DLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSH

Query:  SGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMY
        SG+++QG   F  M+TV G+SP  EHYAC+ DL GR G LN A  +I  MP  P   +W   +  C IHG  ++GE+AAEK+ E+   H G Y L++N+Y
Subjt:  SGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMY

Query:  AAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALE-SKLLLDSLYDVMKHG
        A AG W  + +IR+LMR  GV K PGCSWV+      +F VGD ++  A E  ++LLD +  +   G
Subjt:  AAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALE-SKLLLDSLYDVMKHG

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic2.9e-11534.17Show/hide
Query:  LRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGLVEDSFLVPKLVTF---YSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEA
        L +S    +D I    S+ L+   +C +L   + +H  +I  GL   ++ + KL+ F      F+ LP A ++ +     +   WN +   +  +    +
Subjt:  LRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGLVEDSFLVPKLVTF---YSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEA

Query:  AILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWR
        A+  Y  M+S G+ P+++TFP +LK+C ++K  + G ++H  +        L+VH +LISMY + G ++ A  +FD    RD VS+ ++I  YASRG   
Subjt:  AILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWR

Query:  EAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYAR
         A +LFD +  K    +VV+WN +  G    GN+ +AL+L   M    +  D   M+  + AC+  G+I LG+++H +   H +     + N+L+ +Y++
Subjt:  EAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYAR

Query:  CKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKR-HDFRDYLLLWNALVDM
        C ++  A  LF     K +I+WN+++ G TH++  ++AL LF+E+L  G  PN VT  SILP CA +  +  GR  H YI KR     +   L  +L+DM
Subjt:  CKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKR-HDFRDYLLLWNALVDM

Query:  YARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMA
        YA+ G +  A +VF+S+  K   ++ ++I G+ M G    +  LF  M++  I+PD +T V +LSACSHSG+L+ G  +F  M   + ++P+LEHY CM 
Subjt:  YARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMA

Query:  DLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVD
        DL G  GL  +A+E+I  M   P   IW +L+ AC +HGN ++GE  AE L+++ PE+ G YVL++N+YA+AG W+++AK R L+ D G+ KVPGCS ++
Subjt:  DLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVD

Query:  VGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI
        + S    F +GD  + +  E   +L+ +  +++    +
Subjt:  VGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI

Q9LNU6 Pentatricopeptide repeat-containing protein At1g202309.8e-10330.72Show/hide
Query:  SSLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGE
        SSL +  Q H  I+ SG   D ++  KL+  YS++    +A  ++++       S++ LI +  + +L   +I  + +M S G+ PD+   P++ K C E
Subjt:  SSLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGE

Query:  TKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCL
            ++G ++H            FV  ++  MY RCG +  AR +FD M D+D V+ ++++  YA +G   E   +   M+S  +E N+V+WN I  G  
Subjt:  TKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCL

Query:  RVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARC-----------------KDITHAYM---
        R G   +A+ +  ++ + G   D V +   L +      + +G+ IHG+ I+    K   V ++++ MY +                    + +AY+   
Subjt:  RVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARC-----------------KDITHAYM---

Query:  -----------LFRLNDDK----SIITWNSMLSGLTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWN
                   +F L  ++    ++++W S+++G     +  +AL LFRE+ + GV+PN+VT  S+LP C  +A L HGR  H +  + H   D + + +
Subjt:  -----------LFRLNDDK----SIITWNSMLSGLTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWN

Query:  ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEH
        AL+DMYA+ G++  ++ VF+ +  K+ V + SL+ G+ M G+  + + +FE + R ++KPD ++  ++LSAC   GL ++G   F  M   +G+ PRLEH
Subjt:  ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEH

Query:  YACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPG
        Y+CM +L GR G L +A ++I  MP+ P S +W  L+ +C +  N D+ E AAEKL  + PE+ G YVL++N+YAA G W+++  IR  M   G+ K PG
Subjt:  YACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPG

Query:  CSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMK
        CSW+ V +   +   GD S+ Q  +    +D +   M+
Subjt:  CSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMK

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.1e-11634.17Show/hide
Query:  LRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGLVEDSFLVPKLVTF---YSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEA
        L +S    +D I    S+ L+   +C +L   + +H  +I  GL   ++ + KL+ F      F+ LP A ++ +     +   WN +   +  +    +
Subjt:  LRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGLVEDSFLVPKLVTF---YSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEA

Query:  AILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWR
        A+  Y  M+S G+ P+++TFP +LK+C ++K  + G ++H  +        L+VH +LISMY + G ++ A  +FD    RD VS+ ++I  YASRG   
Subjt:  AILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWR

Query:  EAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYAR
         A +LFD +  K    +VV+WN +  G    GN+ +AL+L   M    +  D   M+  + AC+  G+I LG+++H +   H +     + N+L+ +Y++
Subjt:  EAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYAR

Query:  CKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKR-HDFRDYLLLWNALVDM
        C ++  A  LF     K +I+WN+++ G TH++  ++AL LF+E+L  G  PN VT  SILP CA +  +  GR  H YI KR     +   L  +L+DM
Subjt:  CKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKR-HDFRDYLLLWNALVDM

Query:  YARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMA
        YA+ G +  A +VF+S+  K   ++ ++I G+ M G    +  LF  M++  I+PD +T V +LSACSHSG+L+ G  +F  M   + ++P+LEHY CM 
Subjt:  YARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMA

Query:  DLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVD
        DL G  GL  +A+E+I  M   P   IW +L+ AC +HGN ++GE  AE L+++ PE+ G YVL++N+YA+AG W+++AK R L+ D G+ KVPGCS ++
Subjt:  DLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVD

Query:  VGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI
        + S    F +GD  + +  E   +L+ +  +++    +
Subjt:  VGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLI

AT1G22830.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.8e-20351.78Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNL--PVSNSSKFMIDS-----IFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCS
        M  SPS+ IL+GL++ ++  FIP+SWK L  P+S +SK   D      +F+S ++  SHGQL + F  FSL++ +   S S + +L S + LL +C   +
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNL--PVSNSSKFMIDS-----IFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCS

Query:  SLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGET
            G+QLH H ISSGL  DS LVPKLVTFYS+F LL EA T+ ENS + HP  WN+LI SY+RN+  + ++  YK+M+SKG+R D FT+PS++KAC   
Subjt:  SLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGET

Query:  KNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLR
         +   G  VH SI   S + +L+V NALISMY R G+VD AR LFD M +RDAVSWN++I+CY S     EAF+L D M    +E ++VTWN IAGGCL 
Subjt:  KNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLR

Query:  VGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIR--HYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSG
         GN+  AL  +  MRN  + + +VAMI GL ACSHIGA++ GK  H   IR   + H +  V+NSL+TMY+RC D+ HA+++F+  +  S+ TWNS++SG
Subjt:  VGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIR--HYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSG

Query:  LTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI
          + +R E+   L +E+LL G  PN++T ASILPL ARV +LQHG+EFHCYI +R  ++D L+LWN+LVDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI
Subjt:  LTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI

Query:  AGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWA
         GYG  G+G  A+  F++M R  IKPDHVTMVAVLSACSHS L+ +G  LF +M+ V G+  RLEHY+CM DL+ R G L+KA++I   +PY P+SA+ A
Subjt:  AGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWA

Query:  TLIGACCIHGNTDIGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSE
        TL+ AC IHGNT+IGEWAA+K LLE +PEH G+Y+L+A+MYA  GSWSKL  ++TL+ D GV K    + ++  SE
Subjt:  TLIGACCIHGNTDIGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSE

AT1G22830.2 Tetratricopeptide repeat (TPR)-like superfamily protein3.8e-20351.78Show/hide
Query:  MSFSPSQCILKGLSIYKLQTFIPKSWKNL--PVSNSSKFMIDS-----IFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCS
        M  SPS+ IL+GL++ ++  FIP+SWK L  P+S +SK   D      +F+S ++  SHGQL + F  FSL++ +   S S + +L S + LL +C   +
Subjt:  MSFSPSQCILKGLSIYKLQTFIPKSWKNL--PVSNSSKFMIDS-----IFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCS

Query:  SLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGET
            G+QLH H ISSGL  DS LVPKLVTFYS+F LL EA T+ ENS + HP  WN+LI SY+RN+  + ++  YK+M+SKG+R D FT+PS++KAC   
Subjt:  SLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGET

Query:  KNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLR
         +   G  VH SI   S + +L+V NALISMY R G+VD AR LFD M +RDAVSWN++I+CY S     EAF+L D M    +E ++VTWN IAGGCL 
Subjt:  KNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLR

Query:  VGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIR--HYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSG
         GN+  AL  +  MRN  + + +VAMI GL ACSHIGA++ GK  H   IR   + H +  V+NSL+TMY+RC D+ HA+++F+  +  S+ TWNS++SG
Subjt:  VGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIR--HYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSG

Query:  LTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI
          + +R E+   L +E+LL G  PN++T ASILPL ARV +LQHG+EFHCYI +R  ++D L+LWN+LVDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI
Subjt:  LTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI

Query:  AGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWA
         GYG  G+G  A+  F++M R  IKPDHVTMVAVLSACSHS L+ +G  LF +M+ V G+  RLEHY+CM DL+ R G L+KA++I   +PY P+SA+ A
Subjt:  AGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWA

Query:  TLIGACCIHGNTDIGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSE
        TL+ AC IHGNT+IGEWAA+K LLE +PEH G+Y+L+A+MYA  GSWSKL  ++TL+ D GV K    + ++  SE
Subjt:  TLIGACCIHGNTDIGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSE

AT1G71490.1 Tetratricopeptide repeat (TPR)-like superfamily protein2.6e-23659.91Show/hide
Query:  DSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVEN
        +S+F SL + ASHG L   F+ FSL++L++S + S DL+L S + LL +C D  +   G Q+H H ISSG+   S LVPKLVTFYS+F L  EA +++EN
Subjt:  DSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVEN

Query:  SNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFD
        S++ HP  WN+LIASY +NEL E  I AYK+M+SKG+RPD FT+PS+LKACGET ++  G  VH SI   S K SL+V NALISMY R   +  AR LFD
Subjt:  SNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFD

Query:  NMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIH
         M +RDAVSWN++I+CYAS GMW EAFELFD M    +E++V+TWNII+GGCL+ GN+  AL L+S+MRNF   LD VAMIIGL ACS IGAIRLGKEIH
Subjt:  NMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIH

Query:  GFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREF
        G  I   Y  +  V+N+L+TMY++CKD+ HA ++FR  ++ S+ TWNS++SG   L++ E+A  L RE+L+ G +PN +T ASILPLCAR+A+LQHG+EF
Subjt:  GFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREF

Query:  HCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGE
        HCYI +R  F+DY +LWN+LVD+YA+SGK++ AK+V D +SK+DEVTYTSLI GYG QGEGG A+ LF+EM R  IKPDHVT+VAVLSACSHS L+++GE
Subjt:  HCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGE

Query:  LLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSK
         LF +MQ  +G+ P L+H++CM DL+GR G L KAK+II  MPY+P+ A WATL+ AC IHGNT IG+WAAEKLLEM+PE+ GYYVLIANMYAAAGSWSK
Subjt:  LLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSK

Query:  LAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMK
        LA++RT+MRD GV K PGC+W+D  S F  FSVGDTS+ +A  +  LLD L  +MK
Subjt:  LAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMK

AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.1e-12034.63Show/hide
Query:  SSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLF
        S ++++  +G  +K    F L+   +   D++     +   +  +C + SS+  G+  H   + +G + + F+   LV  YS  + L +A  + +  +++
Subjt:  SSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGLVEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLF

Query:  HPCSWNILIASYVRNELHEAAILAYKQMLSK-GVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNML
           SWN +I SY +    + A+  + +M ++ G RPDN T  ++L  C       LG ++H          ++FV N L+ MY +CG +D A  +F NM 
Subjt:  HPCSWNILIASYVRNELHEAAILAYKQMLSK-GVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNALISMYGRCGEVDTARNLFDNML

Query:  DRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFT
         +D VSWN+M++ Y+  G + +A  LF+ MQ + ++++VVTW+    G  + G   +AL +  QM + GI  + V +I  L  C+ +GA+  GKEIH + 
Subjt:  DRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFT

Query:  IRH-------YYHKLSTVQNSLVTMYARCKDITHAYMLF--RLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVE--PNYVTFASILPLCARVA
        I++        +   + V N L+ MYA+CK +  A  +F      ++ ++TW  M+ G +       AL L  E+     +  PN  T +  L  CA +A
Subjt:  IRH-------YYHKLSTVQNSLVTMYARCKDITHAYMLF--RLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVE--PNYVTFASILPLCARVA

Query:  DLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSH
         L+ G++ H Y  +       L + N L+DMYA+ G + +A+ VFD++  K+EVT+TSL+ GYGM G G +A+ +F+EM+R   K D VT++ VL ACSH
Subjt:  DLQHGREFHCYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSH

Query:  SGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMY
        SG+++QG   F  M+TV G+SP  EHYAC+ DL GR G LN A  +I  MP  P   +W   +  C IHG  ++GE+AAEK+ E+   H G Y L++N+Y
Subjt:  SGLLNQGELLFAEMQTVHGLSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMY

Query:  AAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALE-SKLLLDSLYDVMKHG
        A AG W  + +IR+LMR  GV K PGCSWV+      +F VGD ++  A E  ++LLD +  +   G
Subjt:  AAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNSQALE-SKLLLDSLYDVMKHG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTTTCTCCTTCTCAATGCATCCTCAAGGGTCTTTCTATATATAAGCTCCAAACGTTCATACCTAAATCATGGAAAAATTTACCTGTGAGCAACAGTAGTAAATT
TATGATTGATTCTATTTTTTCTTCCCTTAAGAACTTTGCCTCTCATGGACAATTGTCGAAAACATTTGAAGCCTTCTCCCTCATTCAATTGCGCACAAGTTATAGTGATT
CATTTGACCTCATCTTGCAATCCATCTCGATTCTTCTTGTATCATGCACCGATTGTAGCTCACTCCCACAAGGTAAGCAACTTCATGGTCACATTATCTCGTCGGGTCTT
GTGGAAGACTCTTTTTTGGTCCCCAAGCTTGTCACGTTTTACTCAAGCTTTAAACTTTTGCCTGAGGCTCATACCCTTGTTGAGAATTCTAATTTATTTCACCCCTGTTC
TTGGAATATACTTATCGCATCATACGTTAGAAATGAACTTCATGAGGCAGCCATTTTAGCTTATAAACAGATGCTGAGTAAAGGGGTCAGACCAGATAATTTCACTTTTC
CCTCCATTTTGAAGGCTTGTGGTGAAACAAAAAATTTGGAACTTGGTTTAGAGGTTCACAAGTCTATTAATGCTTGGTCAACTAAATGGAGTCTGTTTGTTCACAATGCT
CTGATATCTATGTATGGAAGATGTGGAGAGGTGGACACTGCACGTAACTTGTTTGACAATATGCTTGACCGGGATGCAGTATCTTGGAATTCAATGATCTCTTGTTATGC
CTCCAGGGGTATGTGGAGGGAGGCATTTGAACTATTTGACAGCATGCAGAGTAAGTGTCTTGAAATTAATGTTGTAACTTGGAATATTATAGCTGGAGGTTGCTTACGGG
TTGGTAATTTTACTCAAGCACTGAAGTTACTGTCTCAAATGAGAAATTTTGGCATTCATTTGGACAACGTAGCAATGATAATTGGTTTAGGTGCTTGTTCCCACATTGGT
GCCATTAGATTGGGAAAGGAAATCCATGGCTTTACTATCAGACATTATTATCATAAGTTATCCACTGTTCAAAATTCTTTAGTTACCATGTATGCTCGTTGTAAAGACAT
TACGCATGCATATATGTTGTTTCGATTAAATGACGACAAAAGTATAATCACGTGGAATTCCATGCTTTCTGGTCTCACACACTTGGACCGGGTTGAGGATGCCTTGTGTC
TGTTTAGAGAATTGTTACTGTTTGGTGTTGAACCAAACTATGTGACATTTGCTAGCATTCTTCCTCTTTGTGCTCGAGTTGCAGATTTACAACATGGGAGAGAATTTCAT
TGCTATATTACTAAACGTCATGATTTTAGGGATTATTTGTTATTGTGGAATGCATTGGTGGACATGTATGCAAGGTCGGGCAAGGTTTTAGAAGCAAAAAGAGTTTTTGA
TTCGTTAAGCAAGAAGGATGAAGTGACCTATACTTCTCTGATTGCAGGCTATGGTATGCAAGGTGAGGGGGGGAAAGCTGTAAGACTGTTCGAAGAGATGAAAAGGTTTC
AAATCAAACCAGATCATGTAACTATGGTTGCTGTCCTATCAGCTTGTAGTCATTCAGGTCTCCTGAATCAAGGTGAACTTTTATTTGCAGAGATGCAAACTGTGCATGGT
CTAAGCCCCCGTTTGGAACACTATGCTTGCATGGCAGACCTGTTTGGGAGGGTTGGTCTGTTGAACAAAGCAAAGGAAATTATCACAAGAATGCCTTACAGACCCACGTC
TGCTATTTGGGCCACTCTTATTGGAGCATGTTGCATCCATGGAAACACAGATATTGGGGAATGGGCAGCAGAGAAACTTCTGGAAATGCGGCCTGAACATTCTGGTTACT
ATGTCTTGATTGCTAACATGTACGCTGCTGCCGGTTCTTGGAGTAAGTTGGCAAAAATAAGGACTCTTATGAGGGATTCTGGTGTAGCAAAAGTTCCTGGTTGTTCTTGG
GTTGACGTTGGGTCTGAATTCGTCTCATTCTCAGTTGGTGACACATCTAATTCTCAAGCCCTTGAGTCGAAGCTCTTGTTAGACAGTTTGTATGATGTAATGAAACACGG
TAGTCTAATAACGACAGATAATTACGATATTTGTGACAACATTTTTTGA
mRNA sequenceShow/hide mRNA sequence
TTGGCATTCGATGGGTTCATTGGTCGGACCGACATAGTCGCCGCTCGATGAAACCATCTGGTGTGGTGTGGGACTTTTTAAAAGGGTATCTTAGACTTTTCCTGTTTCTC
TTTGTCCAAAAACCCAACTTTTAAAAACTCATCCCAAAATCATTACCTCTCAAAAAAAAAAAAAAAGACAAATCCTATTTTTGGTGATTTCCCAACATGAAGCTGTGAAT
ATTAGGGATTAGGAGCTATGACGGTTTAACTTTGTAAAGCCGCCACAAGTTCTCTGCATTGGACATCACCGCCGTCTTAACCTTCGCTCCGTCAGTTTGTCGTTGAATGC
CGATGGGACAACTTATTTTTGTGGGTTTTGGGATCTCCATCGTGAGGCATGTAATCTAGCTACCGAGATAAACCATGTTATTGGTTCTTGCTTACCAGCTCTAAACTACA
ACTAAGGGATAAACTATGTTATTGTTTCTTGCTGACCATGACCTTGCAAATAAAGCACATGACTTTTGTGTATGTCATTTTCTCCTTCTCAATGCATCCTCAAGGGTCTT
TCTATATATAAGCTCCAAACGTTCATACCTAAATCATGGAAAAATTTACCTGTGAGCAACAGTAGTAAATTTATGATTGATTCTATTTTTTCTTCCCTTAAGAACTTTGC
CTCTCATGGACAATTGTCGAAAACATTTGAAGCCTTCTCCCTCATTCAATTGCGCACAAGTTATAGTGATTCATTTGACCTCATCTTGCAATCCATCTCGATTCTTCTTG
TATCATGCACCGATTGTAGCTCACTCCCACAAGGTAAGCAACTTCATGGTCACATTATCTCGTCGGGTCTTGTGGAAGACTCTTTTTTGGTCCCCAAGCTTGTCACGTTT
TACTCAAGCTTTAAACTTTTGCCTGAGGCTCATACCCTTGTTGAGAATTCTAATTTATTTCACCCCTGTTCTTGGAATATACTTATCGCATCATACGTTAGAAATGAACT
TCATGAGGCAGCCATTTTAGCTTATAAACAGATGCTGAGTAAAGGGGTCAGACCAGATAATTTCACTTTTCCCTCCATTTTGAAGGCTTGTGGTGAAACAAAAAATTTGG
AACTTGGTTTAGAGGTTCACAAGTCTATTAATGCTTGGTCAACTAAATGGAGTCTGTTTGTTCACAATGCTCTGATATCTATGTATGGAAGATGTGGAGAGGTGGACACT
GCACGTAACTTGTTTGACAATATGCTTGACCGGGATGCAGTATCTTGGAATTCAATGATCTCTTGTTATGCCTCCAGGGGTATGTGGAGGGAGGCATTTGAACTATTTGA
CAGCATGCAGAGTAAGTGTCTTGAAATTAATGTTGTAACTTGGAATATTATAGCTGGAGGTTGCTTACGGGTTGGTAATTTTACTCAAGCACTGAAGTTACTGTCTCAAA
TGAGAAATTTTGGCATTCATTTGGACAACGTAGCAATGATAATTGGTTTAGGTGCTTGTTCCCACATTGGTGCCATTAGATTGGGAAAGGAAATCCATGGCTTTACTATC
AGACATTATTATCATAAGTTATCCACTGTTCAAAATTCTTTAGTTACCATGTATGCTCGTTGTAAAGACATTACGCATGCATATATGTTGTTTCGATTAAATGACGACAA
AAGTATAATCACGTGGAATTCCATGCTTTCTGGTCTCACACACTTGGACCGGGTTGAGGATGCCTTGTGTCTGTTTAGAGAATTGTTACTGTTTGGTGTTGAACCAAACT
ATGTGACATTTGCTAGCATTCTTCCTCTTTGTGCTCGAGTTGCAGATTTACAACATGGGAGAGAATTTCATTGCTATATTACTAAACGTCATGATTTTAGGGATTATTTG
TTATTGTGGAATGCATTGGTGGACATGTATGCAAGGTCGGGCAAGGTTTTAGAAGCAAAAAGAGTTTTTGATTCGTTAAGCAAGAAGGATGAAGTGACCTATACTTCTCT
GATTGCAGGCTATGGTATGCAAGGTGAGGGGGGGAAAGCTGTAAGACTGTTCGAAGAGATGAAAAGGTTTCAAATCAAACCAGATCATGTAACTATGGTTGCTGTCCTAT
CAGCTTGTAGTCATTCAGGTCTCCTGAATCAAGGTGAACTTTTATTTGCAGAGATGCAAACTGTGCATGGTCTAAGCCCCCGTTTGGAACACTATGCTTGCATGGCAGAC
CTGTTTGGGAGGGTTGGTCTGTTGAACAAAGCAAAGGAAATTATCACAAGAATGCCTTACAGACCCACGTCTGCTATTTGGGCCACTCTTATTGGAGCATGTTGCATCCA
TGGAAACACAGATATTGGGGAATGGGCAGCAGAGAAACTTCTGGAAATGCGGCCTGAACATTCTGGTTACTATGTCTTGATTGCTAACATGTACGCTGCTGCCGGTTCTT
GGAGTAAGTTGGCAAAAATAAGGACTCTTATGAGGGATTCTGGTGTAGCAAAAGTTCCTGGTTGTTCTTGGGTTGACGTTGGGTCTGAATTCGTCTCATTCTCAGTTGGT
GACACATCTAATTCTCAAGCCCTTGAGTCGAAGCTCTTGTTAGACAGTTTGTATGATGTAATGAAACACGGTAGTCTAATAACGACAGATAATTACGATATTTGTGACAA
CATTTTTTGAGGAAACATTAAATTGATTATTGCTGAAACTAGCAGTATACGTGTTGGTTTTAATCCTTGGTGTACATCAAAATACATAAACAAATAAAGCTTGTAACCTT
TCATATGTAAAATTACTATTTAACAAAGTTTGTCCACATGCATTTGCATTTTTATTTTGTCCTTTCCAATGAAGCTCGGTTTACACGTATAGTTTGGCTCGTCTTAAAAG
TGGATCATGATGCTATTTTTAGTGTGTTATGCCAGCTGACCAGGTGTGTTGGTTTATCCTACAAGCTCATTGTATGTTCCATTTTTTGGATGATGTGAGTTGCTGCATCT
AGCCTTAATAGTAATGACAGTTCTAATGGTTTTTCAGGCCAGCAGCGAGTTGAGCATCTGACTTGCTTCTGCACAACAGGATAAAACTTGATCCAAATTGTAAGTATCCT
TTGCATGTACATTGAGTGGTCGGTTTGGGACTTTGGAGATTTTTACATTTTTTTTTTAAGAACATAGCTGCTCTTAGATAAATTTATGAAACTGTAGACGTTGAAGGTTA
AAGCTTGTTGTGACTTGTGAAAAAGCTAAAACACTTGGGGCGTTTGGATGTTCGCAATGTAATGTAATAAAAAACTCATATTTTGATTTGAGCGTTT
Protein sequenceShow/hide protein sequence
MSFSPSQCILKGLSIYKLQTFIPKSWKNLPVSNSSKFMIDSIFSSLKNFASHGQLSKTFEAFSLIQLRTSYSDSFDLILQSISILLVSCTDCSSLPQGKQLHGHIISSGL
VEDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCSWNILIASYVRNELHEAAILAYKQMLSKGVRPDNFTFPSILKACGETKNLELGLEVHKSINAWSTKWSLFVHNA
LISMYGRCGEVDTARNLFDNMLDRDAVSWNSMISCYASRGMWREAFELFDSMQSKCLEINVVTWNIIAGGCLRVGNFTQALKLLSQMRNFGIHLDNVAMIIGLGACSHIG
AIRLGKEIHGFTIRHYYHKLSTVQNSLVTMYARCKDITHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALCLFRELLLFGVEPNYVTFASILPLCARVADLQHGREFH
CYITKRHDFRDYLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKAVRLFEEMKRFQIKPDHVTMVAVLSACSHSGLLNQGELLFAEMQTVHG
LSPRLEHYACMADLFGRVGLLNKAKEIITRMPYRPTSAIWATLIGACCIHGNTDIGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSW
VDVGSEFVSFSVGDTSNSQALESKLLLDSLYDVMKHGSLITTDNYDICDNIF