; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G01350 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G01350
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPentatricopeptide repeat-containing protein
Genome locationClcChr08:2306473..2309042
RNA-Seq ExpressionClc08G01350
SyntenyClc08G01350
Gene Ontology termsGO:0009451 - RNA modification (biological process)
GO:0000786 - nucleosome (cellular component)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003723 - RNA binding (molecular function)
GO:0046982 - protein heterodimerization activity (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7025166.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0089.16Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ
        MSSSS + IL+GLSIYKL TFIPK W+NVPVSNG E MI SIF SLK+FASHGQLSK+FEAFSL+QLR+SYNDSFDLI+QSISILLVSCT CSSLP GKQ
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ

Query:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL
        LHG II SGLE+DS LVPKLVTFYSSFKLL EAHTLVENSNLFHPCPWNLLITSY+RNELH++AILAYKQMLSKGVRPDNFTFPSILKACGE+QNL FGL
Subjt:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL

Query:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNML+RDAVSWNSM+SCYASKGMWKEAFELFD MQSKC+EIN+VTWNI+AGGCLR+G FTRA
Subjt:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRH YHK STVQNAL+TMYARCKDI  AY+LFR+NDDKSIITWNSMLSGL+H+DRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELL FGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDF D LLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH
         KALRLFEEMK   IKPDHITMVAVL+ACSHSGL+KQGE+LFAEMQSVHGLSP LEHYACMADLFGRVGLL +AKE+ITRMPYRPTSAMWATLIGACCIH
Subjt:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH

Query:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND
         NTD+GEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAK PGCSWV+VGSEFVSF VGDTSNPQALESK LLD LND
Subjt:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND

XP_022141804.1 pentatricopeptide repeat-containing protein At1g71490-like [Momordica charantia]0.0e+0088.73Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ
        MSSSSSQ I +GLS+YKLQTFIPK W+N PVSNGSE MI  +FSSLK+FA HGQLSK+FEAFSLIQLR  YNDSFDLILQS SILLVSCTN SSLPPG+Q
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ

Query:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL
        LHG II SGLE DS LVPKLVTFYSSFKLL EAHTLVENSN+FHPCPWNLLITSY+RN LH+AAIL YKQMLS+G+RPDNFTFPSILKACGE+QNL FGL
Subjt:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL

Query:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA
        EVHK INAWS +WSLFVQNALISMYGRCGEVDTARNLFDNML+RDAVSWNSM+SCYASKGMWKEAFELFD+MQSKC+EIN+VTWNI+AGGCLRVGNF  A
Subjt:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRH YH+LS VQNALVTMYARCKDIM+AY+LFRLN DKSIITWNSMLSG THLDRVE+
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELL  GVEPNYVT ASILPLCARVADLQHGREFHCYITKR+D  D LLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH
         KALRLF+EMKRFQIKPDHITMVAVL+ACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLL KAK +ITRMPYRPTSAMWATLIGACCIH
Subjt:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH

Query:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND
        GNT++GEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAK PGCSWVDVGS FVSF VGDTSNPQALE+ LLLD+LN+
Subjt:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND

XP_022925519.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita moschata]0.0e+0089.16Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ
        MSSSS + IL+GLSIYKL TFIPK W+NVPVSNG E MI SIF SLK+FASHGQLSK+FEAFSL+QLR SYNDSFDLI+QSISILLVSCT CSSLP GKQ
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ

Query:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL
        LHG IISSGLE+DS LVPKLVTFYSSFKLL EAHTLVENSNLFHPCPWNLLITSY+RNELH++AILAYKQMLSKGVRPDNFTFPSILKACGE+QNL FGL
Subjt:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL

Query:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNML+RDAVSWNSM+SCYAS GMWKEAFELFD MQSKC+EIN+VTWNI+AGGCLR+G FT+A
Subjt:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRH YHK STVQNAL+TMYARCKDIM AY+LFRLNDDKSIITWNSMLSGL+H+DRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELL +GVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDF D LLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH
         KALRLFEEMK   IKPDHITMVAVL+ACSHSGL+KQGE+LFAEMQSVHGLSP LEHYACMADLFGRVGLL +AKE+ITRMPYRPTSAMWATLIGACCIH
Subjt:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH

Query:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND
         NTD+GEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAK PGCSWV+VGSEFVSF VGDTSNPQALESK LLD LND
Subjt:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND

XP_022973516.1 pentatricopeptide repeat-containing protein At1g71490 isoform X1 [Cucurbita maxima]0.0e+0089.45Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ
        MSSSS + IL+GLSIYKL TFIPK W+NVPVSNG E MI+SIF SLK+FASHGQLSK+FEAFSL+QLR SYNDSFDLI+QSISILLVSCT CSSLP GKQ
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ

Query:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL
        LHG IISSGLE+DS LVPKLVTFYSSFKLL EAHTLVENSNLFHPCPWNLLITSY+RNELH++AILAYKQMLSKGVRPDNFTFPSILKACGE+QNL FGL
Subjt:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL

Query:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNML+RDAVSWNSM+SCYASKGMWKEAFELFD MQSKC+EIN+VTWNI+AGGCLR+G FTRA
Subjt:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRH YHK STVQNAL+TMYARCKDIM AY+LFRLNDDKSIITWNSMLSGL+H+DRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRE L FGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDF D LLLWNALVDMYARSGKV+EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH
         KALRLFEEMK   IKPDHITMVAVL+ACSHSGL+KQGE+LFAEMQSVHGLSPHLEHYACMADLFGRVGLL +AKE+ITRMPYRPTSAMWATLIGACCIH
Subjt:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH

Query:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND
         NTD+GEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAK PGCSWV+VGSEFVSF VGDTSNPQALESK LLD LND
Subjt:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND

XP_038890628.1 pentatricopeptide repeat-containing protein At1g71490 [Benincasa hispida]0.0e+0092.51Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ
        MS SSS CILKGLSIYKLQTFIPKPWRNVPVSNGS+ MIDSIFSSLKNFAS+GQLSK+FEAFSLI+LRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ

Query:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL
        LHGHII+SGLE+DSFLVPKLVTFYSSFKLLPEAHTLVE SNLFHPC WNLLI SY+RNELH+AAILAYKQMLSKGVRPDNFTFPSILKACGE++NLEFGL
Subjt:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL

Query:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA
        EVHKSINAWS KWSLFVQNAL+SMYGRCGEVDTARNLFDNMLE DAVSWNSM+SCYASKGMWKEAFELFD MQSKCV INVVTWNI+AGGCLRVGNFTRA
Subjt:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRN GI+LDNVAM+IGLGACSHIGAIRLGKEIHGFTIRHYYHKLST+QNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELL FGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDF D LLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI+GYGMQGEG
Subjt:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH
         KALRLFEEMKRF+IKPDHITMVAVL+ACSHSGLL+QGELLFAEMQSVHGL PHLEHYACMADLFGRVGLL KAKE+ITRMPYRPTSAMWATLIGACCIH
Subjt:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH

Query:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND-------A
        GNTD+GEWAAEKLLEM PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGS FVSF VGDTSNPQALESKL+LDSLND       A
Subjt:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND-------A

Query:  SSELSFDLLLQDRIKLAITKL
        S ELS DLLLQ+RIKLAITKL
Subjt:  SSELSFDLLLQDRIKLAITKL

TrEMBL top hitse value%identityAlignment
A0A1S3CB12 pentatricopeptide repeat-containing protein At1g714900.0e+0089.45Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ
        MS S S+CILKGLSI KL+TFIPK W+ +PVSN SE MI SIFSSLK+FASHGQLSK+FEAFSLIQLR SYNDSFDLILQSISILLVSCT CSSLPPGKQ
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ

Query:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL
        LHGHIISSGL +DSFLV KLV FYSS + LPEAHTLVE SNLF PC WN+L+TSY+RN+L++AAILAYKQMLSKGVRPDNFTFPSILKACGE+QNLEFGL
Subjt:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL

Query:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA
        EVHKSINA S  WSLFV NALISMYGRCGEVDTAR LFD MLERD VSWNSM+SCY+S+GMW+EAFELF+SMQSK +EINVVTWNI+AGGCLRVGNFTRA
Subjt:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        L LLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRHY+H LSTVQNALVTMYARCKDI HAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELL FGVEPNYVTFASILPLCARVA+LQHGREFHCYITKR DF D LLLWNALVDMYARSGKV EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH
        GKALRLFEEMKRFQIKPDHITMVAVL+ACSHSGLL QGELLFAEMQSVHGLSP LEHY+CMADLFGRVGLL KAKE+ITRMPYRPTSA+WATLIGACCIH
Subjt:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH

Query:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND
        GNTD+GEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLA+IRT MRDSGVAKVPGCSWVDVGSEFVSFSVGDTS+PQALESKLLLDSL D
Subjt:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND

A0A5D3BN10 Pentatricopeptide repeat-containing protein0.0e+0089.45Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ
        MS S S+CILKGLSI KL+TFIPK W+ +PVSN SE MI SIFSSLK+FASHGQLSK+FEAFSLIQLR SYNDSFDLILQSISILLVSCT CSSLPPGKQ
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ

Query:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL
        LHGHIISSGL +DSFLV KLV FYSS + LPEAHTLVE SNLF PC WN+L+TSY+RN+L++AAILAYKQMLSKGVRPDNFTFPSILKACGE+QNLEFGL
Subjt:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL

Query:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA
        EVHKSINA S  WSLFV NALISMYGRCGEVDTAR LFD MLERD VSWNSM+SCY+S+GMW+EAFELF+SMQSK +EINVVTWNI+AGGCLRVGNFTRA
Subjt:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        L LLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRHY+H LSTVQNALVTMYARCKDI HAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELL FGVEPNYVTFASILPLCARVA+LQHGREFHCYITKR DF D LLLWNALVDMYARSGKV EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH
        GKALRLFEEMKRFQIKPDHITMVAVL+ACSHSGLL QGELLFAEMQSVHGLSP LEHY+CMADLFGRVGLL KAKE+ITRMPYRPTSA+WATLIGACCIH
Subjt:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH

Query:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND
        GNTD+GEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLA+IRT MRDSGVAKVPGCSWVDVGSEFVSFSVGDTS+PQALESKLLLDSL D
Subjt:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND

A0A6J1CJU8 pentatricopeptide repeat-containing protein At1g71490-like0.0e+0088.73Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ
        MSSSSSQ I +GLS+YKLQTFIPK W+N PVSNGSE MI  +FSSLK+FA HGQLSK+FEAFSLIQLR  YNDSFDLILQS SILLVSCTN SSLPPG+Q
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ

Query:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL
        LHG II SGLE DS LVPKLVTFYSSFKLL EAHTLVENSN+FHPCPWNLLITSY+RN LH+AAIL YKQMLS+G+RPDNFTFPSILKACGE+QNL FGL
Subjt:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL

Query:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA
        EVHK INAWS +WSLFVQNALISMYGRCGEVDTARNLFDNML+RDAVSWNSM+SCYASKGMWKEAFELFD+MQSKC+EIN+VTWNI+AGGCLRVGNF  A
Subjt:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFG HLD VAMIIGLGACSHIGAIRLGKEIHGFTIRH YH+LS VQNALVTMYARCKDIM+AY+LFRLN DKSIITWNSMLSG THLDRVE+
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELL  GVEPNYVT ASILPLCARVADLQHGREFHCYITKR+D  D LLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH
         KALRLF+EMKRFQIKPDHITMVAVL+ACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLL KAK +ITRMPYRPTSAMWATLIGACCIH
Subjt:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH

Query:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND
        GNT++GEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAK PGCSWVDVGS FVSF VGDTSNPQALE+ LLLD+LN+
Subjt:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND

A0A6J1EI84 pentatricopeptide repeat-containing protein At1g71490 isoform X10.0e+0089.16Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ
        MSSSS + IL+GLSIYKL TFIPK W+NVPVSNG E MI SIF SLK+FASHGQLSK+FEAFSL+QLR SYNDSFDLI+QSISILLVSCT CSSLP GKQ
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ

Query:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL
        LHG IISSGLE+DS LVPKLVTFYSSFKLL EAHTLVENSNLFHPCPWNLLITSY+RNELH++AILAYKQMLSKGVRPDNFTFPSILKACGE+QNL FGL
Subjt:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL

Query:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNML+RDAVSWNSM+SCYAS GMWKEAFELFD MQSKC+EIN+VTWNI+AGGCLR+G FT+A
Subjt:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRH YHK STVQNAL+TMYARCKDIM AY+LFRLNDDKSIITWNSMLSGL+H+DRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRELL +GVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDF D LLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH
         KALRLFEEMK   IKPDHITMVAVL+ACSHSGL+KQGE+LFAEMQSVHGLSP LEHYACMADLFGRVGLL +AKE+ITRMPYRPTSAMWATLIGACCIH
Subjt:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH

Query:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND
         NTD+GEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAK PGCSWV+VGSEFVSF VGDTSNPQALESK LLD LND
Subjt:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND

A0A6J1I8V4 pentatricopeptide repeat-containing protein At1g71490 isoform X10.0e+0089.45Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ
        MSSSS + IL+GLSIYKL TFIPK W+NVPVSNG E MI+SIF SLK+FASHGQLSK+FEAFSL+QLR SYNDSFDLI+QSISILLVSCT CSSLP GKQ
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQ

Query:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL
        LHG IISSGLE+DS LVPKLVTFYSSFKLL EAHTLVENSNLFHPCPWNLLITSY+RNELH++AILAYKQMLSKGVRPDNFTFPSILKACGE+QNL FGL
Subjt:  LHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGL

Query:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA
        EVHK IN+WS +WSLFVQNALISMYGRCGE+DTARNLFDNML+RDAVSWNSM+SCYASKGMWKEAFELFD MQSKC+EIN+VTWNI+AGGCLR+G FTRA
Subjt:  EVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRA

Query:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED
        LKLLSQMRNFGIHLD+VAMIIGLGACSHIGAIRLGKEIHGFTIRH YHK STVQNAL+TMYARCKDIM AY+LFRLNDDKSIITWNSMLSGL+H+DRVED
Subjt:  LKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVED

Query:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
        AL LFRE L FGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDF D LLLWNALVDMYARSGKV+EAKRVFDSLSKKDEVTYTSLIAGYGMQGEG
Subjt:  ALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEG

Query:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH
         KALRLFEEMK   IKPDHITMVAVL+ACSHSGL+KQGE+LFAEMQSVHGLSPHLEHYACMADLFGRVGLL +AKE+ITRMPYRPTSAMWATLIGACCIH
Subjt:  GKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIH

Query:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND
         NTD+GEWAAEKLLEM+PEHSGYYVLIANMYAAAGSWSKLAKIRTLMRD GVAK PGCSWV+VGSEFVSF VGDTSNPQALESK LLD LND
Subjt:  GNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLND

SwissProt top hitse value%identityAlignment
Q4V389 Pentatricopeptide repeat-containing protein At1g228304.9e-20351.18Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNV--PVSNGSELMIDS-----IFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCS
        M SS S+ IL+GL++ ++  FIP+ W+ +  P+S  S+   D      +F+S ++  SHGQL ++F  FSL++ ++    S + +L S + LL +C   +
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNV--PVSNGSELMIDS-----IFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCS

Query:  SLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGES
           PG+QLH H ISSGLE DS LVPKLVTFYS+F LL EA T+ ENS + HP PWN+LI SYIRN+    ++  YK+M+SKG+R D FT+PS++KAC   
Subjt:  SLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGES

Query:  QNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLR
         +  +G  VH SI   S + +L+V NALISMY R G+VD AR LFD M ERDAVSWN++++CY S+    EAF+L D M    VE ++VTWN +AGGCL 
Subjt:  QNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLR

Query:  VGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIR--HYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSG
         GN+  AL  +  MRN  + + +VAMI GL ACSHIGA++ GK  H   IR   + H +  V+N+L+TMY+RC D+ HA+++F+  +  S+ TWNS++SG
Subjt:  VGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIR--HYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSG

Query:  LTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI
          + +R E+   L +E+L  G  PN++T ASILPL ARV +LQHG+EFHCYI +RQ + DCL+LWN+LVDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI
Subjt:  LTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI

Query:  AGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWA
         GYG  G+G  AL  F++M R  IKPDH+TMVAVL+ACSHS L+++G  LF +M+ V G+   LEHY+CM DL+ R G L KA+++   +PY P+SAM A
Subjt:  AGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWA

Query:  TLIGACCIHGNTDMGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSE
        TL+ AC IHGNT++GEWAA+K LLE +PEH G+Y+L+A+MYA  GSWSKL  ++TL+ D GV K    + ++  SE
Subjt:  TLIGACCIHGNTDMGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSE

Q9C9I6 Pentatricopeptide repeat-containing protein At1g714901.4e-23457.96Show/hide
Query:  LMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTL
        ++ +S+F SL + ASHG L  +F+ FSL++L++S   S DL+L S + LL +C +  +   G Q+H H ISSG+E  S LVPKLVTFYS+F L  EA ++
Subjt:  LMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTL

Query:  VENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARN
        +ENS++ HP PWN+LI SY +NEL +  I AYK+M+SKG+RPD FT+PS+LKACGE+ ++ FG  VH SI   S K SL+V NALISMY R   +  AR 
Subjt:  VENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARN

Query:  LFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGK
        LFD M ERDAVSWN++++CYAS+GMW EAFELFD M    VE++V+TWNI++GGCL+ GN+  AL L+S+MRNF   LD VAMIIGL ACS IGAIRLGK
Subjt:  LFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGK

Query:  EIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHG
        EIHG  I   Y  +  V+N L+TMY++CKD+ HA ++FR  ++ S+ TWNS++SG   L++ E+A  L RE+L  G +PN +T ASILPLCAR+A+LQHG
Subjt:  EIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHG

Query:  REFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLK
        +EFHCYI +R+ F D  +LWN+LVD+YA+SGK++ AK+V D +SK+DEVTYTSLI GYG QGEGG AL LF+EM R  IKPDH+T+VAVL+ACSHS L+ 
Subjt:  REFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLK

Query:  QGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGS
        +GE LF +MQ  +G+ P L+H++CM DL+GR G L KAK++I  MPY+P+ A WATL+ AC IHGNT +G+WAAEKLLEM+PE+ GYYVLIANMYAAAGS
Subjt:  QGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGS

Query:  WSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLNDASSELSFDLLLQDRIKLAITKL
        WSKLA++RT+MRD GV K PGC+W+D  S F  FSVGDTS+P+A  +  LLD LN          L++D    AI K+
Subjt:  WSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLNDASSELSFDLLLQDRIKLAITKL

Q9LFL5 Pentatricopeptide repeat-containing protein At5g168604.9e-11834.15Show/hide
Query:  SSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLF
        S ++++  +G  +K    F L+   +   D++     +   +  +C   SS+  G+  H   + +G   + F+   LV  YS  + L +A  + +  +++
Subjt:  SSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLF

Query:  HPCPWNLLITSYIRNELHDAAILAYKQMLSK-GVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNML
            WN +I SY +      A+  + +M ++ G RPDN T  ++L  C        G ++H       M  ++FV N L+ MY +CG +D A  +F NM 
Subjt:  HPCPWNLLITSYIRNELHDAAILAYKQMLSK-GVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNML

Query:  ERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFT
         +D VSWN+M++ Y+  G +++A  LF+ MQ + ++++VVTW+    G  + G    AL +  QM + GI  + V +I  L  C+ +GA+  GKEIH + 
Subjt:  ERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFT

Query:  IRH-------YYHKLSTVQNALVTMYARCKDIMHAYMLF--RLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVE--PNYVTFASILPLCARVA
        I++        +   + V N L+ MYA+CK +  A  +F      ++ ++TW  M+ G +       AL L  E+ +   +  PN  T +  L  CA +A
Subjt:  IRH-------YYHKLSTVQNALVTMYARCKDIMHAYMLF--RLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVE--PNYVTFASILPLCARVA

Query:  DLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSH
         L+ G++ H Y  + Q     L + N L+DMYA+ G + +A+ VFD++  K+EVT+TSL+ GYGM G G +AL +F+EM+R   K D +T++ VL ACSH
Subjt:  DLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSH

Query:  SGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMY
        SG++ QG   F  M++V G+SP  EHYAC+ DL GR G L  A  +I  MP  P   +W   +  C IHG  ++GE+AAEK+ E+   H G Y L++N+Y
Subjt:  SGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMY

Query:  AAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALE-SKLLLD----------------SLNDASSELSFDLLLQDRIKLAI
        A AG W  + +IR+LMR  GV K PGCSWV+      +F VGD ++P A E  ++LLD                +L+D   E   DLL +   KLA+
Subjt:  AAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALE-SKLLLD----------------SLNDASSELSFDLLLQDRIKLAI

Q9LN01 Pentatricopeptide repeat-containing protein At1g08070, chloroplastic1.3e-11534.71Show/hide
Query:  LRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGLEDDSFLVPKLVTF---YSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDA
        L +S +  +D I    S+ L+   NC +L   + +H  +I  GL + ++ + KL+ F      F+ LP A ++ +     +   WN +   +  +    +
Subjt:  LRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGLEDDSFLVPKLVTF---YSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDA

Query:  AILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWK
        A+  Y  M+S G+ P+++TFP +LK+C +S+  + G ++H  +        L+V  +LISMY + G ++ A  +FD    RD VS+ +++  YAS+G  +
Subjt:  AILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWK

Query:  EAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYAR
         A +LFD +  K    +VV+WN +  G    GN+  AL+L   M    +  D   M+  + AC+  G+I LG+++H +   H +     + NAL+ +Y++
Subjt:  EAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYAR

Query:  CKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKR-QDFMDCLLLWNALVDM
        C ++  A  LF     K +I+WN+++ G TH++  ++AL LF+E+L+ G  PN VT  SILP CA +  +  GR  H YI KR +   +   L  +L+DM
Subjt:  CKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKR-QDFMDCLLLWNALVDM

Query:  YARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMA
        YA+ G +  A +VF+S+  K   ++ ++I G+ M G    +  LF  M++  I+PD IT V +L+ACSHSG+L  G  +F  M   + ++P LEHY CM 
Subjt:  YARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMA

Query:  DLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVD
        DL G  GL K+A+E+I  M   P   +W +L+ AC +HGN ++GE  AE L+++ PE+ G YVL++N+YA+AG W+++AK R L+ D G+ KVPGCS ++
Subjt:  DLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVD

Query:  VGSEFVSFSVGDTSNPQALESKLLLDSL
        + S    F +GD  +P+  E   +L+ +
Subjt:  VGSEFVSFSVGDTSNPQALESKLLLDSL

Q9LNU6 Pentatricopeptide repeat-containing protein At1g202305.8e-10330.75Show/hide
Query:  SSLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGE
        SSL    Q H  I+ SG ++D ++  KL+  YS++    +A  ++++        ++ LI +  + +L   +I  + +M S G+ PD+   P++ K C E
Subjt:  SSLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGE

Query:  SQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCL
            + G ++H       +    FVQ ++  MY RCG +  AR +FD M ++D V+ +++L  YA KG  +E   +   M+S  +E N+V+WN +  G  
Subjt:  SQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCL

Query:  RVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARC-----------------KDIMHAYM---
        R G    A+ +  ++ + G   D V +   L +      + +G+ IHG+ I+    K   V +A++ MY +                    + +AY+   
Subjt:  RVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARC-----------------KDIMHAYM---

Query:  -----------LFRLNDDK----SIITWNSMLSGLTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWN
                   +F L  ++    ++++W S+++G     +  +AL LFRE+   GV+PN+VT  S+LP C  +A L HGR  H +   R   +D + + +
Subjt:  -----------LFRLNDDK----SIITWNSMLSGLTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWN

Query:  ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEH
        AL+DMYA+ G++  ++ VF+ +  K+ V + SL+ G+ M G+  + + +FE + R ++KPD I+  ++L+AC   GL  +G   F  M   +G+ P LEH
Subjt:  ALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEH

Query:  YACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPG
        Y+CM +L GR G L++A ++I  MP+ P S +W  L+ +C +  N D+ E AAEKL  + PE+ G YVL++N+YAA G W+++  IR  M   G+ K PG
Subjt:  YACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPG

Query:  CSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLNDASSELSFDL
        CSW+ V +   +   GD S+PQ       +D + +   E+S ++
Subjt:  CSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLNDASSELSFDL

Arabidopsis top hitse value%identityAlignment
AT1G08070.1 Tetratricopeptide repeat (TPR)-like superfamily protein9.4e-11734.71Show/hide
Query:  LRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGLEDDSFLVPKLVTF---YSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDA
        L +S +  +D I    S+ L+   NC +L   + +H  +I  GL + ++ + KL+ F      F+ LP A ++ +     +   WN +   +  +    +
Subjt:  LRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGLEDDSFLVPKLVTF---YSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDA

Query:  AILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWK
        A+  Y  M+S G+ P+++TFP +LK+C +S+  + G ++H  +        L+V  +LISMY + G ++ A  +FD    RD VS+ +++  YAS+G  +
Subjt:  AILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWK

Query:  EAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYAR
         A +LFD +  K    +VV+WN +  G    GN+  AL+L   M    +  D   M+  + AC+  G+I LG+++H +   H +     + NAL+ +Y++
Subjt:  EAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIRHYYHKLSTVQNALVTMYAR

Query:  CKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKR-QDFMDCLLLWNALVDM
        C ++  A  LF     K +I+WN+++ G TH++  ++AL LF+E+L+ G  PN VT  SILP CA +  +  GR  H YI KR +   +   L  +L+DM
Subjt:  CKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKR-QDFMDCLLLWNALVDM

Query:  YARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMA
        YA+ G +  A +VF+S+  K   ++ ++I G+ M G    +  LF  M++  I+PD IT V +L+ACSHSG+L  G  +F  M   + ++P LEHY CM 
Subjt:  YARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMA

Query:  DLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVD
        DL G  GL K+A+E+I  M   P   +W +L+ AC +HGN ++GE  AE L+++ PE+ G YVL++N+YA+AG W+++AK R L+ D G+ KVPGCS ++
Subjt:  DLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVD

Query:  VGSEFVSFSVGDTSNPQALESKLLLDSL
        + S    F +GD  +P+  E   +L+ +
Subjt:  VGSEFVSFSVGDTSNPQALESKLLLDSL

AT1G22830.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.5e-20451.18Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNV--PVSNGSELMIDS-----IFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCS
        M SS S+ IL+GL++ ++  FIP+ W+ +  P+S  S+   D      +F+S ++  SHGQL ++F  FSL++ ++    S + +L S + LL +C   +
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNV--PVSNGSELMIDS-----IFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCS

Query:  SLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGES
           PG+QLH H ISSGLE DS LVPKLVTFYS+F LL EA T+ ENS + HP PWN+LI SYIRN+    ++  YK+M+SKG+R D FT+PS++KAC   
Subjt:  SLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGES

Query:  QNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLR
         +  +G  VH SI   S + +L+V NALISMY R G+VD AR LFD M ERDAVSWN++++CY S+    EAF+L D M    VE ++VTWN +AGGCL 
Subjt:  QNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLR

Query:  VGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIR--HYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSG
         GN+  AL  +  MRN  + + +VAMI GL ACSHIGA++ GK  H   IR   + H +  V+N+L+TMY+RC D+ HA+++F+  +  S+ TWNS++SG
Subjt:  VGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIR--HYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSG

Query:  LTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI
          + +R E+   L +E+L  G  PN++T ASILPL ARV +LQHG+EFHCYI +RQ + DCL+LWN+LVDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI
Subjt:  LTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI

Query:  AGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWA
         GYG  G+G  AL  F++M R  IKPDH+TMVAVL+ACSHS L+++G  LF +M+ V G+   LEHY+CM DL+ R G L KA+++   +PY P+SAM A
Subjt:  AGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWA

Query:  TLIGACCIHGNTDMGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSE
        TL+ AC IHGNT++GEWAA+K LLE +PEH G+Y+L+A+MYA  GSWSKL  ++TL+ D GV K    + ++  SE
Subjt:  TLIGACCIHGNTDMGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSE

AT1G22830.2 Tetratricopeptide repeat (TPR)-like superfamily protein3.5e-20451.18Show/hide
Query:  MSSSSSQCILKGLSIYKLQTFIPKPWRNV--PVSNGSELMIDS-----IFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCS
        M SS S+ IL+GL++ ++  FIP+ W+ +  P+S  S+   D      +F+S ++  SHGQL ++F  FSL++ ++    S + +L S + LL +C   +
Subjt:  MSSSSSQCILKGLSIYKLQTFIPKPWRNV--PVSNGSELMIDS-----IFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCS

Query:  SLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGES
           PG+QLH H ISSGLE DS LVPKLVTFYS+F LL EA T+ ENS + HP PWN+LI SYIRN+    ++  YK+M+SKG+R D FT+PS++KAC   
Subjt:  SLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGES

Query:  QNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLR
         +  +G  VH SI   S + +L+V NALISMY R G+VD AR LFD M ERDAVSWN++++CY S+    EAF+L D M    VE ++VTWN +AGGCL 
Subjt:  QNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLR

Query:  VGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIR--HYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSG
         GN+  AL  +  MRN  + + +VAMI GL ACSHIGA++ GK  H   IR   + H +  V+N+L+TMY+RC D+ HA+++F+  +  S+ TWNS++SG
Subjt:  VGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFTIR--HYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSG

Query:  LTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI
          + +R E+   L +E+L  G  PN++T ASILPL ARV +LQHG+EFHCYI +RQ + DCL+LWN+LVDMYA+SG+++ AKRVFDS+ K+D+VTYTSLI
Subjt:  LTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLI

Query:  AGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWA
         GYG  G+G  AL  F++M R  IKPDH+TMVAVL+ACSHS L+++G  LF +M+ V G+   LEHY+CM DL+ R G L KA+++   +PY P+SAM A
Subjt:  AGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWA

Query:  TLIGACCIHGNTDMGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSE
        TL+ AC IHGNT++GEWAA+K LLE +PEH G+Y+L+A+MYA  GSWSKL  ++TL+ D GV K    + ++  SE
Subjt:  TLIGACCIHGNTDMGEWAAEK-LLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSE

AT1G71490.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.0e-23557.96Show/hide
Query:  LMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTL
        ++ +S+F SL + ASHG L  +F+ FSL++L++S   S DL+L S + LL +C +  +   G Q+H H ISSG+E  S LVPKLVTFYS+F L  EA ++
Subjt:  LMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTL

Query:  VENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARN
        +ENS++ HP PWN+LI SY +NEL +  I AYK+M+SKG+RPD FT+PS+LKACGE+ ++ FG  VH SI   S K SL+V NALISMY R   +  AR 
Subjt:  VENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARN

Query:  LFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGK
        LFD M ERDAVSWN++++CYAS+GMW EAFELFD M    VE++V+TWNI++GGCL+ GN+  AL L+S+MRNF   LD VAMIIGL ACS IGAIRLGK
Subjt:  LFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGK

Query:  EIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHG
        EIHG  I   Y  +  V+N L+TMY++CKD+ HA ++FR  ++ S+ TWNS++SG   L++ E+A  L RE+L  G +PN +T ASILPLCAR+A+LQHG
Subjt:  EIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHG

Query:  REFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLK
        +EFHCYI +R+ F D  +LWN+LVD+YA+SGK++ AK+V D +SK+DEVTYTSLI GYG QGEGG AL LF+EM R  IKPDH+T+VAVL+ACSHS L+ 
Subjt:  REFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLK

Query:  QGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGS
        +GE LF +MQ  +G+ P L+H++CM DL+GR G L KAK++I  MPY+P+ A WATL+ AC IHGNT +G+WAAEKLLEM+PE+ GYYVLIANMYAAAGS
Subjt:  QGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGS

Query:  WSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLNDASSELSFDLLLQDRIKLAITKL
        WSKLA++RT+MRD GV K PGC+W+D  S F  FSVGDTS+P+A  +  LLD LN          L++D    AI K+
Subjt:  WSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALESKLLLDSLNDASSELSFDLLLQDRIKLAITKL

AT5G16860.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.5e-11934.15Show/hide
Query:  SSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLF
        S ++++  +G  +K    F L+   +   D++     +   +  +C   SS+  G+  H   + +G   + F+   LV  YS  + L +A  + +  +++
Subjt:  SSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGLEDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLF

Query:  HPCPWNLLITSYIRNELHDAAILAYKQMLSK-GVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNML
            WN +I SY +      A+  + +M ++ G RPDN T  ++L  C        G ++H       M  ++FV N L+ MY +CG +D A  +F NM 
Subjt:  HPCPWNLLITSYIRNELHDAAILAYKQMLSK-GVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNALISMYGRCGEVDTARNLFDNML

Query:  ERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFT
         +D VSWN+M++ Y+  G +++A  LF+ MQ + ++++VVTW+    G  + G    AL +  QM + GI  + V +I  L  C+ +GA+  GKEIH + 
Subjt:  ERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIGAIRLGKEIHGFT

Query:  IRH-------YYHKLSTVQNALVTMYARCKDIMHAYMLF--RLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVE--PNYVTFASILPLCARVA
        I++        +   + V N L+ MYA+CK +  A  +F      ++ ++TW  M+ G +       AL L  E+ +   +  PN  T +  L  CA +A
Subjt:  IRH-------YYHKLSTVQNALVTMYARCKDIMHAYMLF--RLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVE--PNYVTFASILPLCARVA

Query:  DLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSH
         L+ G++ H Y  + Q     L + N L+DMYA+ G + +A+ VFD++  K+EVT+TSL+ GYGM G G +AL +F+EM+R   K D +T++ VL ACSH
Subjt:  DLQHGREFHCYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSH

Query:  SGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMY
        SG++ QG   F  M++V G+SP  EHYAC+ DL GR G L  A  +I  MP  P   +W   +  C IHG  ++GE+AAEK+ E+   H G Y L++N+Y
Subjt:  SGLLKQGELLFAEMQSVHGLSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMY

Query:  AAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALE-SKLLLD----------------SLNDASSELSFDLLLQDRIKLAI
        A AG W  + +IR+LMR  GV K PGCSWV+      +F VGD ++P A E  ++LLD                +L+D   E   DLL +   KLA+
Subjt:  AAAGSWSKLAKIRTLMRDSGVAKVPGCSWVDVGSEFVSFSVGDTSNPQALE-SKLLLD----------------SLNDASSELSFDLLLQDRIKLAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATCTTCCTCTTCTCAATGCATCCTCAAAGGTCTTTCTATATATAAGCTCCAAACGTTCATACCTAAACCATGGAGAAATGTACCTGTGAGCAATGGTAGTGAACT
TATGATTGATTCTATTTTTTCTTCCCTTAAAAACTTTGCCTCTCATGGTCAATTGTCTAAGTCATTTGAAGCCTTCTCCCTCATTCAATTGCGCGCTAGTTATAATGATT
CATTTGACCTCATCTTGCAATCCATCTCCATTCTTCTTGTGTCATGCACCAATTGTAGCTCACTCCCACCCGGTAAGCAACTTCATGGTCACATCATCTCATCAGGTCTT
GAGGACGACTCCTTTTTGGTCCCGAAGCTTGTCACATTCTACTCAAGCTTTAAACTTCTGCCCGAGGCTCATACCCTTGTTGAGAATTCTAATTTATTTCACCCCTGTCC
TTGGAATCTACTCATCACATCATATATTAGAAATGAACTTCATGATGCAGCCATTTTAGCCTATAAACAGATGCTGAGTAAAGGGGTCAGACCAGATAATTTCACTTTTC
CCTCCATTTTAAAGGCTTGTGGTGAATCACAGAATTTGGAATTTGGTTTAGAGGTTCACAAGTCTATTAATGCTTGGTCAATGAAGTGGAGTTTGTTTGTTCAGAACGCG
CTGATATCTATGTATGGAAGATGTGGAGAGGTGGACACTGCACGTAACTTGTTTGACAATATGCTTGAACGGGATGCAGTATCTTGGAATTCAATGCTATCTTGTTATGC
CTCCAAGGGTATGTGGAAGGAAGCATTTGAACTATTTGACAGCATGCAGAGTAAGTGTGTCGAAATTAATGTTGTAACTTGGAATATTGTAGCCGGAGGTTGCTTGCGGG
TTGGTAATTTTACTCGAGCACTTAAGTTACTGTCGCAAATGAGAAATTTTGGTATTCATTTGGACAATGTAGCAATGATAATTGGTTTAGGTGCTTGTTCACACATTGGT
GCCATTAGATTGGGAAAGGAAATCCATGGCTTTACTATCAGACATTATTACCATAAGTTATCCACTGTTCAAAATGCTTTAGTTACCATGTATGCTCGTTGTAAAGACAT
TATGCATGCATATATGTTGTTTCGATTAAATGACGACAAAAGTATAATCACGTGGAATTCCATGCTTTCTGGTCTTACACACTTGGACCGGGTTGAGGATGCATTGGGTC
TGTTTAGAGAATTGTTACAGTTTGGTGTAGAACCGAACTATGTGACATTTGCTAGCATTCTTCCTCTTTGTGCTCGAGTTGCAGATTTACAACATGGGAGAGAATTTCAT
TGCTACATTACTAAACGTCAAGATTTTATGGATTGTTTGTTATTGTGGAATGCTTTGGTGGATATGTATGCAAGGTCGGGCAAGGTTTTAGAAGCAAAAAGAGTTTTTGA
TTCGTTAAGCAAGAAGGATGAAGTGACGTATACTTCCCTGATTGCAGGTTACGGTATGCAAGGAGAGGGGGGCAAAGCCCTAAGACTATTTGAAGAGATGAAAAGGTTCC
AGATCAAACCAGATCATATAACTATGGTTGCTGTCCTAACAGCTTGCAGCCATTCAGGTCTCCTGAAACAAGGTGAACTTTTATTTGCAGAGATGCAAAGTGTGCATGGT
CTAAGCCCCCATTTGGAACACTATGCTTGCATGGCAGACCTGTTTGGGAGGGTTGGTTTGTTGAAGAAAGCAAAGGAAGTTATCACGAGAATGCCTTACAGACCAACGTC
TGCCATGTGGGCCACTCTTATCGGAGCATGTTGCATCCATGGAAACACAGATATGGGGGAATGGGCAGCAGAGAAACTTCTGGAAATGCGGCCCGAACATTCTGGTTACT
ATGTCTTGATTGCTAACATGTATGCTGCTGCAGGTTCCTGGAGTAAGTTGGCAAAAATAAGGACTCTTATGAGAGATTCTGGTGTTGCAAAAGTTCCTGGTTGTTCTTGG
GTTGACGTTGGCTCTGAATTCGTCTCATTCTCGGTTGGGGACACATCTAATCCTCAAGCCCTTGAATCTAAGCTCTTGTTAGACAGTTTGAACGATGCCAGCAGCGAGTT
GAGCTTCGACTTGCTTCTGCAGGACAGGATAAAACTTGCCATCACCAAATTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCATCTTCCTCTTCTCAATGCATCCTCAAAGGTCTTTCTATATATAAGCTCCAAACGTTCATACCTAAACCATGGAGAAATGTACCTGTGAGCAATGGTAGTGAACT
TATGATTGATTCTATTTTTTCTTCCCTTAAAAACTTTGCCTCTCATGGTCAATTGTCTAAGTCATTTGAAGCCTTCTCCCTCATTCAATTGCGCGCTAGTTATAATGATT
CATTTGACCTCATCTTGCAATCCATCTCCATTCTTCTTGTGTCATGCACCAATTGTAGCTCACTCCCACCCGGTAAGCAACTTCATGGTCACATCATCTCATCAGGTCTT
GAGGACGACTCCTTTTTGGTCCCGAAGCTTGTCACATTCTACTCAAGCTTTAAACTTCTGCCCGAGGCTCATACCCTTGTTGAGAATTCTAATTTATTTCACCCCTGTCC
TTGGAATCTACTCATCACATCATATATTAGAAATGAACTTCATGATGCAGCCATTTTAGCCTATAAACAGATGCTGAGTAAAGGGGTCAGACCAGATAATTTCACTTTTC
CCTCCATTTTAAAGGCTTGTGGTGAATCACAGAATTTGGAATTTGGTTTAGAGGTTCACAAGTCTATTAATGCTTGGTCAATGAAGTGGAGTTTGTTTGTTCAGAACGCG
CTGATATCTATGTATGGAAGATGTGGAGAGGTGGACACTGCACGTAACTTGTTTGACAATATGCTTGAACGGGATGCAGTATCTTGGAATTCAATGCTATCTTGTTATGC
CTCCAAGGGTATGTGGAAGGAAGCATTTGAACTATTTGACAGCATGCAGAGTAAGTGTGTCGAAATTAATGTTGTAACTTGGAATATTGTAGCCGGAGGTTGCTTGCGGG
TTGGTAATTTTACTCGAGCACTTAAGTTACTGTCGCAAATGAGAAATTTTGGTATTCATTTGGACAATGTAGCAATGATAATTGGTTTAGGTGCTTGTTCACACATTGGT
GCCATTAGATTGGGAAAGGAAATCCATGGCTTTACTATCAGACATTATTACCATAAGTTATCCACTGTTCAAAATGCTTTAGTTACCATGTATGCTCGTTGTAAAGACAT
TATGCATGCATATATGTTGTTTCGATTAAATGACGACAAAAGTATAATCACGTGGAATTCCATGCTTTCTGGTCTTACACACTTGGACCGGGTTGAGGATGCATTGGGTC
TGTTTAGAGAATTGTTACAGTTTGGTGTAGAACCGAACTATGTGACATTTGCTAGCATTCTTCCTCTTTGTGCTCGAGTTGCAGATTTACAACATGGGAGAGAATTTCAT
TGCTACATTACTAAACGTCAAGATTTTATGGATTGTTTGTTATTGTGGAATGCTTTGGTGGATATGTATGCAAGGTCGGGCAAGGTTTTAGAAGCAAAAAGAGTTTTTGA
TTCGTTAAGCAAGAAGGATGAAGTGACGTATACTTCCCTGATTGCAGGTTACGGTATGCAAGGAGAGGGGGGCAAAGCCCTAAGACTATTTGAAGAGATGAAAAGGTTCC
AGATCAAACCAGATCATATAACTATGGTTGCTGTCCTAACAGCTTGCAGCCATTCAGGTCTCCTGAAACAAGGTGAACTTTTATTTGCAGAGATGCAAAGTGTGCATGGT
CTAAGCCCCCATTTGGAACACTATGCTTGCATGGCAGACCTGTTTGGGAGGGTTGGTTTGTTGAAGAAAGCAAAGGAAGTTATCACGAGAATGCCTTACAGACCAACGTC
TGCCATGTGGGCCACTCTTATCGGAGCATGTTGCATCCATGGAAACACAGATATGGGGGAATGGGCAGCAGAGAAACTTCTGGAAATGCGGCCCGAACATTCTGGTTACT
ATGTCTTGATTGCTAACATGTATGCTGCTGCAGGTTCCTGGAGTAAGTTGGCAAAAATAAGGACTCTTATGAGAGATTCTGGTGTTGCAAAAGTTCCTGGTTGTTCTTGG
GTTGACGTTGGCTCTGAATTCGTCTCATTCTCGGTTGGGGACACATCTAATCCTCAAGCCCTTGAATCTAAGCTCTTGTTAGACAGTTTGAACGATGCCAGCAGCGAGTT
GAGCTTCGACTTGCTTCTGCAGGACAGGATAAAACTTGCCATCACCAAATTGTAA
Protein sequenceShow/hide protein sequence
MSSSSSQCILKGLSIYKLQTFIPKPWRNVPVSNGSELMIDSIFSSLKNFASHGQLSKSFEAFSLIQLRASYNDSFDLILQSISILLVSCTNCSSLPPGKQLHGHIISSGL
EDDSFLVPKLVTFYSSFKLLPEAHTLVENSNLFHPCPWNLLITSYIRNELHDAAILAYKQMLSKGVRPDNFTFPSILKACGESQNLEFGLEVHKSINAWSMKWSLFVQNA
LISMYGRCGEVDTARNLFDNMLERDAVSWNSMLSCYASKGMWKEAFELFDSMQSKCVEINVVTWNIVAGGCLRVGNFTRALKLLSQMRNFGIHLDNVAMIIGLGACSHIG
AIRLGKEIHGFTIRHYYHKLSTVQNALVTMYARCKDIMHAYMLFRLNDDKSIITWNSMLSGLTHLDRVEDALGLFRELLQFGVEPNYVTFASILPLCARVADLQHGREFH
CYITKRQDFMDCLLLWNALVDMYARSGKVLEAKRVFDSLSKKDEVTYTSLIAGYGMQGEGGKALRLFEEMKRFQIKPDHITMVAVLTACSHSGLLKQGELLFAEMQSVHG
LSPHLEHYACMADLFGRVGLLKKAKEVITRMPYRPTSAMWATLIGACCIHGNTDMGEWAAEKLLEMRPEHSGYYVLIANMYAAAGSWSKLAKIRTLMRDSGVAKVPGCSW
VDVGSEFVSFSVGDTSNPQALESKLLLDSLNDASSELSFDLLLQDRIKLAITKL