CuGenDBv2

Lsi02G014780 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G014780
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: chr02:20531174..20533297
RNA-Seq Expression: Lsi02G014780
Synteny: Lsi02G014780
Gene Ontology terms:
GO:0005515 - protein binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR032867 - DYW domain


Homology

GenBank top hits (e-value, % identity, alignment)
KAA0039490.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa] | e-value: 0.0e+00 | identity: 93.25%
Query:  KRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALI
        K S LSNRRKCRNS CW+KCSSFEQGLRPRPQPKPSK+D  VRKE PLKET +RKSSVGICSQIEKLVLCK+YRDALEMFEIFELE GF VGNST+DALI
Subjt:  KRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALI

Query:  NACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATM
        NACIGLKSIRGVKRL NYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFD+MPERNAVSW+TIISGYVDSGNYVEAFRLFILMWEE Y CGPRT ATM
Subjt:  NACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATM

Query:  IRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSI
        IRASAGLE+IF GRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLY+EM  SGVKMDHFTFSI
Subjt:  IRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSI

Query:  IIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYL
        IIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS RNVISWNALIAGYGNHGRGEEAI+MFEKML EGMMPNHVT+L
Subjt:  IIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYL

Query:  AVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSN
        AVLSACSISGLFERGWEIFQSMTRDHK++PRAMH+ACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV GNLELGKFAAEKLYGMEPEKLSN
Subjt:  AVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSN

Query:  YIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHS
        YIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAF SGD HH Q+EKVVGKVDELMLKISKLGYVP EQNFMLPDVDEHEEKI+MYHS
Subjt:  YIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHS

Query:  EKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW
        EKLAIAYG+LNTLE+TPLQIVQSHRIC DCHSVIKLIAMITKREIV+RDASRFHHFRDG+CSCGDYW
Subjt:  EKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW

XP_004148701.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Cucumis sativus] | e-value: 0.0e+00 | identity: 91.97%
Query:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGL--RPRPQPKPSKVDPDVRKETPLKETRIRKSS
        M MELPLSRYQNYVYDRLQCN    STS+FSLR+SDS+LF K S LSN RK RNS CWIKCSSFEQGL  RPRPQPKPSK+D   RKETPLKET ++KSS
Subjt:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGL--RPRPQPKPSKVDPDVRKETPLKETRIRKSS

Query:  VGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPER
        VGICSQIEKLVLCKKYRDALEMFEIFELE GF VG ST+DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFD+MP R
Subjt:  VGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPER

Query:  NAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPD
        NAVSW TIISGYVDSGNYVEAFRLFILM EE+YDCGPRTFATMIRASAGLE+IFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPD
Subjt:  NAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPD

Query:  KTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS
        KTIVGWNSIIAGYALHGYSEEALDLY+EMRDSGVKMDHFTFSIIIRICSRLASVARAKQ HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS
Subjt:  KTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS

Query:  FRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRK
         RN+ISWNALIAGYGNHG GEEAI+MFEKML EGMMPNHVT+LAVLSACSISGLFERGWEIFQSMTRDHK+KPRAMH+ACMIELLGREGLLDEAYALIRK
Subjt:  FRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRK

Query:  APFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQI
        APFQPTANMWAALLRACRV GNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADV QTLKRKGLRMLPACSWIEVNNQPHAF SGD HH QI
Subjt:  APFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQI

Query:  EKVVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFR
        EKVVGKVDELML ISKLGYVP EQNFMLPDVDE+EEKI+MYHSEKLAIAYG+LNTLE+TPLQIVQSHRIC DCHSVIKLIAMITKREIV+RDASRFHHFR
Subjt:  EKVVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFR

Query:  DGSCSCGDYW
        DGSCSCGDYW
Subjt:  DGSCSCGDYW

XP_008459324.1 PREDICTED: pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Cucumis melo] | e-value: 0.0e+00 | identity: 92.09%
Query:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVG
        M MELPLSRYQNYVYDRLQC     ST YFSLR+SDS LF K S LSNRRKCRNS CW+KCSSFEQGLRPRPQPKPSK+D  VRKE PLKET +RKSSVG
Subjt:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVG

Query:  ICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNA
        ICSQIEKLVLCK+YRDALEMFEIFELE GF VGNST+DALINACIGLKSIRGVKRL NYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFD+MPERNA
Subjt:  ICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNA

Query:  VSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT
        VSW+TIISGYVDSGNYVEAFRLFILMWEE Y CGPRT ATMIRASAGLE+IF GRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT
Subjt:  VSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT

Query:  IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFR
        IVGWNSIIAGYALHGYSEEALDLY+EM  SGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS R
Subjt:  IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFR

Query:  NVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAP
        NVISWNALIAGYGNHGRGEEAI+MFEKML EGMMPNHVT+LAVLSACSISGLFERGWEIFQSMTRDHK++PRAMH+ACMIELLGREGLLDEAYALIRKAP
Subjt:  NVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAP

Query:  FQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEK
        FQPTANMWAALLRACRV GNLELGKFAAEKLYGMEPEKLSNYIVLLNIYN+SGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAF SGD HH Q+EK
Subjt:  FQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEK

Query:  VVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDG
        VVGKVDELMLKISKLGYVP EQNFMLPDVDEHEEKI+MYHSEKLAIAYG+LNTLE+TPLQIVQSHRIC DCHSVIKLIAMITKREIV+RDASRFHHFRDG
Subjt:  VVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDG

Query:  SCSCGDYW
        +CSCGDYW
Subjt:  SCSCGDYW

XP_022133879.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic [Momordica charantia] | e-value: 0.0e+00 | identity: 87.82%
Query:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLL------SNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKE-TR
        MTME+PL RYQNYVYDRLQC+STSSS+SY  +RF+DS+LFRKRSLL      SNRRK RNS CWIKCSS EQGLRPRP+P+PSK+D DVRK T   E TR
Subjt:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLL------SNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKE-TR

Query:  IRKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFD
        IRKS VGICSQIEKLVLCKKYRDALEMFEIFELEGG+ +GNST+DALINACIGLKSIRGVKRLCNYM+DNGFEPDQYM+NRILLMHVKCGMMIDACRLFD
Subjt:  IRKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFD

Query:  KMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVF
        +MPERNAVSW+TIISGYVDSGNY+EAFRLFI+MWEE  D GPRTFA MIRASAGLE+IFPGRQLHSCAIKAG+GQDIFVSCALIDMYSKCGSLEDAHCVF
Subjt:  KMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVF

Query:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHV
        DEMPDKTIVGWNSIIAGYALHGYSEEALDL YEMRDSG+KMDHFTFSIIIRICSRLASVARAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+
Subjt:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHV

Query:  FDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAY
        FDRMS +N+ISWNALIAGYGNHGRGEEAI+MFE+ML EGM PNHVT+LAVLSACSISGLFERGWEIFQS+T DHKIKPRAMH+ACMIELLGREGLLDEAY
Subjt:  FDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAY

Query:  ALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDN
        ALIR APF+PTANMWAALLRACRV  NLELGK AAE LYGMEP+KLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRM+PACSWIEV NQPH+F SGD 
Subjt:  ALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDN

Query:  HHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRF
        HHA+IEKVV KVDE+MLKISKLGYV EQNF+LPDVDE EEKI MYHSEKLAIAYG+L+TL++TPLQIVQSHRICGDCHS IKLIA+IT+REIVVRDASRF
Subjt:  HHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRF

Query:  HHFRDGSCSCGDYW
        HHFRDGSCSCGDYW
Subjt:  HHFRDGSCSCGDYW

XP_038890388.1 pentatricopeptide repeat-containing protein At5g50390, chloroplastic isoform X1 [Benincasa hispida] | e-value: 0.0e+00 | identity: 91.95%
Query:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVG
        M ME+PLS YQNY+YDR+QCN    STSY SLRFS  +LFR+R  L NRRKCRNSL WIKCSSFEQGLRPRPQPKPSK+DP V K TPLKET + +SSVG
Subjt:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVG

Query:  ICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNA
        ICSQIEKLVLCKKYRDALEMFEIFELEGGF  GN+T DALINAC+ LKSIRGVK+LCNYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFD+MPERNA
Subjt:  ICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNA

Query:  VSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT
        VSWNTIISG+VDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLE+IFPGRQLHSCAIKA LGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT
Subjt:  VSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT

Query:  IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFR
        IVGWNSIIAGYALHGYSEEALDLYYEMRDSG+KMDHFTFSIIIRICSRLASVA AKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS R
Subjt:  IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFR

Query:  NVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAP
        N+ISWNALIAGYGNHGRG EAI+MFEKML EG +PNHVT+LAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAP
Subjt:  NVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAP

Query:  FQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEK
        FQPTANMWAALLRACRV GNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAF SGD HH QIEK
Subjt:  FQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEK

Query:  VVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDG
        VVGKVDELMLKISKLGYVP EQNFMLPDVDEHEEKIQMYHSEKLAIAYG+LNTLEQTPLQIVQSHRIC DCH VIKLIAMITKREIV+RDASRFHHFRDG
Subjt:  VVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDG

Query:  SCSCGDYW
        SCSCGDYW
Subjt:  SCSCGDYW

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KXD9 DYW_deaminase domain-containing protein | e-value: 0.0e+00 | identity: 91.97%
Query:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGL--RPRPQPKPSKVDPDVRKETPLKETRIRKSS
        M MELPLSRYQNYVYDRLQCN    STS+FSLR+SDS+LF K S LSN RK RNS CWIKCSSFEQGL  RPRPQPKPSK+D   RKETPLKET ++KSS
Subjt:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGL--RPRPQPKPSKVDPDVRKETPLKETRIRKSS

Query:  VGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPER
        VGICSQIEKLVLCKKYRDALEMFEIFELE GF VG ST+DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFD+MP R
Subjt:  VGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPER

Query:  NAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPD
        NAVSW TIISGYVDSGNYVEAFRLFILM EE+YDCGPRTFATMIRASAGLE+IFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPD
Subjt:  NAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPD

Query:  KTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS
        KTIVGWNSIIAGYALHGYSEEALDLY+EMRDSGVKMDHFTFSIIIRICSRLASVARAKQ HASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS
Subjt:  KTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS

Query:  FRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRK
         RN+ISWNALIAGYGNHG GEEAI+MFEKML EGMMPNHVT+LAVLSACSISGLFERGWEIFQSMTRDHK+KPRAMH+ACMIELLGREGLLDEAYALIRK
Subjt:  FRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRK

Query:  APFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQI
        APFQPTANMWAALLRACRV GNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADV QTLKRKGLRMLPACSWIEVNNQPHAF SGD HH QI
Subjt:  APFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQI

Query:  EKVVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFR
        EKVVGKVDELML ISKLGYVP EQNFMLPDVDE+EEKI+MYHSEKLAIAYG+LNTLE+TPLQIVQSHRIC DCHSVIKLIAMITKREIV+RDASRFHHFR
Subjt:  EKVVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFR

Query:  DGSCSCGDYW
        DGSCSCGDYW
Subjt:  DGSCSCGDYW

A0A1S3C9W7 pentatricopeptide repeat-containing protein At5g50390, chloroplastic | e-value: 0.0e+00 | identity: 92.09%
Query:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVG
        M MELPLSRYQNYVYDRLQC     ST YFSLR+SDS LF K S LSNRRKCRNS CW+KCSSFEQGLRPRPQPKPSK+D  VRKE PLKET +RKSSVG
Subjt:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVG

Query:  ICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNA
        ICSQIEKLVLCK+YRDALEMFEIFELE GF VGNST+DALINACIGLKSIRGVKRL NYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFD+MPERNA
Subjt:  ICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNA

Query:  VSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT
        VSW+TIISGYVDSGNYVEAFRLFILMWEE Y CGPRT ATMIRASAGLE+IF GRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT
Subjt:  VSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT

Query:  IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFR
        IVGWNSIIAGYALHGYSEEALDLY+EM  SGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS R
Subjt:  IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFR

Query:  NVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAP
        NVISWNALIAGYGNHGRGEEAI+MFEKML EGMMPNHVT+LAVLSACSISGLFERGWEIFQSMTRDHK++PRAMH+ACMIELLGREGLLDEAYALIRKAP
Subjt:  NVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAP

Query:  FQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEK
        FQPTANMWAALLRACRV GNLELGKFAAEKLYGMEPEKLSNYIVLLNIYN+SGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAF SGD HH Q+EK
Subjt:  FQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEK

Query:  VVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDG
        VVGKVDELMLKISKLGYVP EQNFMLPDVDEHEEKI+MYHSEKLAIAYG+LNTLE+TPLQIVQSHRIC DCHSVIKLIAMITKREIV+RDASRFHHFRDG
Subjt:  VVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDG

Query:  SCSCGDYW
        +CSCGDYW
Subjt:  SCSCGDYW

A0A5A7T8C6 Pentatricopeptide repeat-containing protein | e-value: 0.0e+00 | identity: 93.25%
Query:  KRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALI
        K S LSNRRKCRNS CW+KCSSFEQGLRPRPQPKPSK+D  VRKE PLKET +RKSSVGICSQIEKLVLCK+YRDALEMFEIFELE GF VGNST+DALI
Subjt:  KRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALI

Query:  NACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATM
        NACIGLKSIRGVKRL NYMVDNGFEPDQYMRNR+LLMHVKCGMMIDACRLFD+MPERNAVSW+TIISGYVDSGNYVEAFRLFILMWEE Y CGPRT ATM
Subjt:  NACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATM

Query:  IRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSI
        IRASAGLE+IF GRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLY+EM  SGVKMDHFTFSI
Subjt:  IRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSI

Query:  IIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYL
        IIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMS RNVISWNALIAGYGNHGRGEEAI+MFEKML EGMMPNHVT+L
Subjt:  IIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYL

Query:  AVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSN
        AVLSACSISGLFERGWEIFQSMTRDHK++PRAMH+ACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRV GNLELGKFAAEKLYGMEPEKLSN
Subjt:  AVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSN

Query:  YIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHS
        YIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAF SGD HH Q+EKVVGKVDELMLKISKLGYVP EQNFMLPDVDEHEEKI+MYHS
Subjt:  YIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVP-EQNFMLPDVDEHEEKIQMYHS

Query:  EKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW
        EKLAIAYG+LNTLE+TPLQIVQSHRIC DCHSVIKLIAMITKREIV+RDASRFHHFRDG+CSCGDYW
Subjt:  EKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW

A0A6J1BWH3 pentatricopeptide repeat-containing protein At5g50390, chloroplastic | e-value: 0.0e+00 | identity: 87.82%
Query:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLL------SNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKE-TR
        MTME+PL RYQNYVYDRLQC+STSSS+SY  +RF+DS+LFRKRSLL      SNRRK RNS CWIKCSS EQGLRPRP+P+PSK+D DVRK T   E TR
Subjt:  MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLL------SNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKE-TR

Query:  IRKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFD
        IRKS VGICSQIEKLVLCKKYRDALEMFEIFELEGG+ +GNST+DALINACIGLKSIRGVKRLCNYM+DNGFEPDQYM+NRILLMHVKCGMMIDACRLFD
Subjt:  IRKSSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFD

Query:  KMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVF
        +MPERNAVSW+TIISGYVDSGNY+EAFRLFI+MWEE  D GPRTFA MIRASAGLE+IFPGRQLHSCAIKAG+GQDIFVSCALIDMYSKCGSLEDAHCVF
Subjt:  KMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVF

Query:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHV
        DEMPDKTIVGWNSIIAGYALHGYSEEALDL YEMRDSG+KMDHFTFSIIIRICSRLASVARAKQ HA LVRNGFGLDVVANTALVDFYSKWGK+DDARH+
Subjt:  DEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHV

Query:  FDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAY
        FDRMS +N+ISWNALIAGYGNHGRGEEAI+MFE+ML EGM PNHVT+LAVLSACSISGLFERGWEIFQS+T DHKIKPRAMH+ACMIELLGREGLLDEAY
Subjt:  FDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAY

Query:  ALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDN
        ALIR APF+PTANMWAALLRACRV  NLELGK AAE LYGMEP+KLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRM+PACSWIEV NQPH+F SGD 
Subjt:  ALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDN

Query:  HHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRF
        HHA+IEKVV KVDE+MLKISKLGYV EQNF+LPDVDE EEKI MYHSEKLAIAYG+L+TL++TPLQIVQSHRICGDCHS IKLIA+IT+REIVVRDASRF
Subjt:  HHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRF

Query:  HHFRDGSCSCGDYW
        HHFRDGSCSCGDYW
Subjt:  HHFRDGSCSCGDYW

A0A6J1JGW0 pentatricopeptide repeat-containing protein At5g50390, chloroplastic | e-value: 0.0e+00 | identity: 85.79%
Query:  MELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLL------SNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRK
        ME+PL  YQNYV+D L+  S SSSTSYFS  FS SELFR RSLL      SNRRK RNS CW+KCSS EQGLRPR +PKPSKVD DVRK TP KETRI K
Subjt:  MELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLL------SNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRK

Query:  SSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMP
        SSV IC  IEKLVLC K+RDALEMFEI ELEGG+ VGNSTFDALI ACIGLKSIRG KRLC YM+DNG EPDQY+ NRILLMHV+CGMMIDA +LFD+MP
Subjt:  SSVGICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMP

Query:  ERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEM
        ERNAVSWNTIISGYVDSGNY EAFRLFI+MWEEY  C PRTFAT+IRASAGLE+IFPG+QLHSCA+KAG+GQDIFVSCALIDMYSKCG LEDAHCVFDEM
Subjt:  ERNAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEM

Query:  PDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR
        PDKTIVGWNSIIAGYALHG+SEEAL+LY++MRDSGVK+DHFTFSIIIRICSRLASV RAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARH+FDR
Subjt:  PDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDR

Query:  MSFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALI
        MS +N+ISWNALIAGYGNHGRGEEAIE+FE+ML EGM+PNHVT+LAVLSACSISGLFERGWEIFQSMTRDHKIK RAMHY CMIELLGREGLLDEAYALI
Subjt:  MSFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALI

Query:  RKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHA
        RKAPFQPTANMWAALLRACRV  NLELGK+AAEKLYGMEPEKL NYIVLLNIY SSGKLKEAADVV+TLKRKGL MLPACSWIEV +QPHAF SGD HH 
Subjt:  RKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHA

Query:  QIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHF
        +IEKVV KVDELML+ISKLGYVPEQN +LPDVD HEEKIQ+YHSEKLAIAYG++NTL+QTPLQIVQ HR+CGDCHSVIKLIAMITKREIVVRDASRFHHF
Subjt:  QIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHF

Query:  RDGSCSCGDYW
        RDG CSCGDYW
Subjt:  RDGSCSCGDYW

SwissProt top hits (e-value, % identity, alignment)
Q9FK33 Pentatricopeptide repeat-containing protein At5g50390, chloroplastic | e-value: 1.4e-255 | identity: 59.97%
Query:  MELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGLRPRP--QPKPSKVDPDVRKETPLKETRIRKSSVG
        ME+PLSRYQ+   D ++ +S++     F  +FS              R+ +N    + CSS  QGL+P+P  +P+P +++    K+  L +T+I KS V 
Subjt:  MELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGLRPRP--QPKPSKVDPDVRKETPLKETRIRKSSVG

Query:  ICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNA
        ICSQIEKLVLC ++R+A E+FEI E+   F+VG ST+DAL+ ACI LKSIR VKR+  +M+ NGFEP+QYM NRILLMHVKCGM+IDA RLFD++PERN 
Subjt:  ICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNA

Query:  VSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT
         S+ +IISG+V+ GNYVEAF LF +MWEE  DC   TFA M+RASAGL  I+ G+QLH CA+K G+  + FVSC LIDMYSKCG +EDA C F+ MP+KT
Subjt:  VSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT

Query:  IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFR
         V WN++IAGYALHGYSEEAL L Y+MRDSGV +D FT SI+IRI ++LA +   KQAHASL+RNGF  ++VANTALVDFYSKWG+VD AR+VFD++  +
Subjt:  IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFR

Query:  NVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAP
        N+ISWNAL+ GY NHGRG +A+++FEKM+   + PNHVT+LAVLSAC+ SGL E+GWEIF SM+  H IKPRAMHYACMIELLGR+GLLDEA A IR+AP
Subjt:  NVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAP

Query:  FQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIE-
         + T NMWAALL ACR+  NLELG+  AEKLYGM PEKL NY+V+ N+YNS GK  EAA V++TL+ KGL M+PAC+W+EV +Q H+F SGD   +  E 
Subjt:  FQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIE-

Query:  ---KVVGKVDELMLKISKLGYVPEQNFMLPDVDE-HEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHH
           ++  KVDELM +IS+ GY  E+  +LPDVDE  EE++  YHSEKLAIAYG++NT E  PLQI Q+HRIC +CH V++ I+++T RE+VVRDASRFHH
Subjt:  ---KVVGKVDELMLKISKLGYVPEQNFMLPDVDE-HEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHH

Query:  FRDGSCSCGDYW
        F++G CSCG YW
Subjt:  FRDGSCSCGDYW

Q9LIQ7 Pentatricopeptide repeat-containing protein At3g24000, mitochondrial | e-value: 9.7e-135 | identity: 38.63%
Query:  ELEGGFQVGNSTF-DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLF
        +LEG +   +  F + L+  C   K +   + +  +++ + F  D  M N +L M+ KCG + +A ++F+KMP+R+ V+W T+ISGY       +A   F
Subjt:  ELEGGFQVGNSTF-DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLF

Query:  ILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDL
          M    Y     T +++I+A+A       G QLH   +K G   ++ V  AL+D+Y++ G ++DA  VFD +  +  V WN++IAG+A    +E+AL+L
Subjt:  ILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDL

Query:  YYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIE
        +  M   G +  HF+++ +   CS    + + K  HA ++++G  L   A   L+D Y+K G + DAR +FDR++ R+V+SWN+L+  Y  HG G+EA+ 
Subjt:  YYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIE

Query:  MFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLEL
         FE+M   G+ PN +++L+VL+ACS SGL + GW  ++ M +D  I P A HY  +++LLGR G L+ A   I + P +PTA +W ALL ACR+  N EL
Subjt:  MFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLEL

Query:  GKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVPEQNF
        G +AAE ++ ++P+    +++L NIY S G+  +AA V + +K  G++  PACSW+E+ N  H F + D  H Q E++  K +E++ KI +LGYVP+ + 
Subjt:  GKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVPEQNF

Query:  MLPDVDEHEEKIQM-YHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW
        ++  VD+ E ++ + YHSEK+A+A+ +LNT   + + I ++ R+CGDCH+ IKL + +  REI+VRD +RFHHF+DG+CSC DYW
Subjt:  MLPDVDEHEEKIQM-YHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW

Q9LW63 Putative pentatricopeptide repeat-containing protein At3g23330 | e-value: 5.7e-127 | identity: 36.82%
Query:  NSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVK---CGMMIDACRLFDKMPER--------------------------------
        ++ F +++ +C  +  +R  + +  ++V  G + D Y  N ++ M+ K    G  I    +FD+MP+R                                
Subjt:  NSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVK---CGMMIDACRLFDKMPER--------------------------------

Query:  -NAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP
         + VS+NTII+GY  SG Y +A R+   M          T ++++   +    +  G+++H   I+ G+  D+++  +L+DMY+K   +ED+  VF  + 
Subjt:  -NAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP

Query:  DKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM
         +  + WNS++AGY  +G   EAL L+ +M  + VK     FS +I  C+ LA++   KQ H  ++R GFG ++   +ALVD YSK G +  AR +FDRM
Subjt:  DKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM

Query:  SFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIR
        +  + +SW A+I G+  HG G EA+ +FE+M  +G+ PN V ++AVL+ACS  GL +  W  F SMT+ + +     HYA + +LLGR G L+EAY  I 
Subjt:  SFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIR

Query:  KAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQ
        K   +PT ++W+ LL +C V  NLEL +  AEK++ ++ E +  Y+++ N+Y S+G+ KE A +   +++KGLR  PACSWIE+ N+ H F SGD  H  
Subjt:  KAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQ

Query:  IEKVVGKVDELMLKISKLGYVPEQNFMLPDVD-EHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHF
        ++K+   +  +M ++ K GYV + + +L DVD EH+ ++   HSE+LA+A+G++NT   T +++ ++ RIC DCH  IK I+ IT+REI+VRD SRFHHF
Subjt:  IEKVVGKVDELMLKISKLGYVPEQNFMLPDVD-EHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHF

Query:  RDGSCSCGDYW
          G+CSCGDYW
Subjt:  RDGSCSCGDYW

Q9S7F4 Putative pentatricopeptide repeat-containing protein At2g01510 | e-value: 1.6e-129 | identity: 39.03%
Query:  YRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDS
        Y +++ +F +   + G Q  + TF  ++ A +GL      ++L    V  GF  D  + N+IL  + K   +++   LFD+MPE + VS+N +IS Y  +
Subjt:  YRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDS

Query:  GNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYAL
          Y  +   F  M    +D     FATM+  +A L  +  GRQLH  A+ A     + V  +L+DMY+KC   E+A  +F  +P +T V W ++I+GY  
Subjt:  GNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYAL

Query:  HGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYG
         G     L L+ +MR S ++ D  TF+ +++  +  AS+   KQ HA ++R+G   +V + + LVD Y+K G + DA  VF+ M  RN +SWNALI+ + 
Subjt:  HGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYG

Query:  NHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLR
        ++G GE AI  F KM+  G+ P+ V+ L VL+ACS  G  E+G E FQ+M+  + I P+  HYACM++LLGR G   EA  L+ + PF+P   MW+++L 
Subjt:  NHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLR

Query:  ACRVDGNLELGKFAAEKLYGMEP-EKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKI
        ACR+  N  L + AAEKL+ ME     + Y+ + NIY ++G+ ++  DV + ++ +G++ +PA SW+EVN++ H F S D  H   +++V K++EL  +I
Subjt:  ACRVDGNLELGKFAAEKLYGMEP-EKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKI

Query:  SKLGYVPEQNFMLPDVDEHEEKIQ--MYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW
         + GY P+ + ++ DVDE + KI+   YHSE+LA+A+ +++T E  P+ ++++ R C DCH+ IKLI+ I KREI VRD SRFHHF +G CSCGDYW
Subjt:  SKLGYVPEQNFMLPDVDEHEEKIQ--MYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW

Q9SI53 Pentatricopeptide repeat-containing protein At2g03880, mitochondrial    3.1e-133    40
Query:  GFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWE
        G    ++T+  LI  CI  +++     +C ++  NG  P  ++ N ++ M+VK  ++ DA +LFD+MP+RN +SW T+IS Y     + +A  L +LM  
Subjt:  GFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWE

Query:  EYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMR
        +       T+++++R+  G+  +   R LH   IK GL  D+FV  ALID+++K G  EDA  VFDEM     + WNSII G+A +  S+ AL+L+  M+
Subjt:  EYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMR

Query:  DSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKM
         +G   +  T + ++R C+ LA +    QAH  +V+  +  D++ N ALVD Y K G ++DA  VF++M  R+VI+W+ +I+G   +G  +EA+++FE+M
Subjt:  DSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKM

Query:  LGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAA
           G  PN++T + VL ACS +GL E GW  F+SM + + I P   HY CMI+LLG+ G LD+A  L+ +   +P A  W  LL ACRV  N+ L ++AA
Subjt:  LGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAA

Query:  EKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDV
        +K+  ++PE    Y +L NIY +S K     ++   ++ +G++  P CSWIEVN Q HAF  GDN H QI +V  K+++L+ +++ +GYVPE NF+L D+
Subjt:  EKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDV

Query:  D-EHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW
        + E  E    +HSEKLA+A+G++    +  ++I ++ RICGDCH   KL + +  R IV+RD  R+HHF+DG CSCGDYW
Subjt:  D-EHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW

Arabidopsis top hits    e value    %identity    Alignment
AT2G03880.1 Pentatricopeptide repeat (PPR) superfamily protein    2.2e-134    40
Query:  GFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWE
        G    ++T+  LI  CI  +++     +C ++  NG  P  ++ N ++ M+VK  ++ DA +LFD+MP+RN +SW T+IS Y     + +A  L +LM  
Subjt:  GFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLFILMWE

Query:  EYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMR
        +       T+++++R+  G+  +   R LH   IK GL  D+FV  ALID+++K G  EDA  VFDEM     + WNSII G+A +  S+ AL+L+  M+
Subjt:  EYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMR

Query:  DSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKM
         +G   +  T + ++R C+ LA +    QAH  +V+  +  D++ N ALVD Y K G ++DA  VF++M  R+VI+W+ +I+G   +G  +EA+++FE+M
Subjt:  DSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKM

Query:  LGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAA
           G  PN++T + VL ACS +GL E GW  F+SM + + I P   HY CMI+LLG+ G LD+A  L+ +   +P A  W  LL ACRV  N+ L ++AA
Subjt:  LGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAA

Query:  EKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDV
        +K+  ++PE    Y +L NIY +S K     ++   ++ +G++  P CSWIEVN Q HAF  GDN H QI +V  K+++L+ +++ +GYVPE NF+L D+
Subjt:  EKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDV

Query:  D-EHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW
        + E  E    +HSEKLA+A+G++    +  ++I ++ RICGDCH   KL + +  R IV+RD  R+HHF+DG CSCGDYW
Subjt:  D-EHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW

AT3G02010.1 Pentatricopeptide repeat (PPR) superfamily protein    1.1e-130    39.03
Query:  YRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDS
        Y +++ +F +   + G Q  + TF  ++ A +GL      ++L    V  GF  D  + N+IL  + K   +++   LFD+MPE + VS+N +IS Y  +
Subjt:  YRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDS

Query:  GNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYAL
          Y  +   F  M    +D     FATM+  +A L  +  GRQLH  A+ A     + V  +L+DMY+KC   E+A  +F  +P +T V W ++I+GY  
Subjt:  GNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYAL

Query:  HGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYG
         G     L L+ +MR S ++ D  TF+ +++  +  AS+   KQ HA ++R+G   +V + + LVD Y+K G + DA  VF+ M  RN +SWNALI+ + 
Subjt:  HGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYG

Query:  NHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLR
        ++G GE AI  F KM+  G+ P+ V+ L VL+ACS  G  E+G E FQ+M+  + I P+  HYACM++LLGR G   EA  L+ + PF+P   MW+++L 
Subjt:  NHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLR

Query:  ACRVDGNLELGKFAAEKLYGMEP-EKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKI
        ACR+  N  L + AAEKL+ ME     + Y+ + NIY ++G+ ++  DV + ++ +G++ +PA SW+EVN++ H F S D  H   +++V K++EL  +I
Subjt:  ACRVDGNLELGKFAAEKLYGMEP-EKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKI

Query:  SKLGYVPEQNFMLPDVDEHEEKIQ--MYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW
         + GY P+ + ++ DVDE + KI+   YHSE+LA+A+ +++T E  P+ ++++ R C DCH+ IKLI+ I KREI VRD SRFHHF +G CSCGDYW
Subjt:  SKLGYVPEQNFMLPDVDEHEEKIQ--MYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW

AT3G23330.1 Tetratricopeptide repeat (TPR)-like superfamily protein    4.0e-128    36.82
Query:  NSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVK---CGMMIDACRLFDKMPER--------------------------------
        ++ F +++ +C  +  +R  + +  ++V  G + D Y  N ++ M+ K    G  I    +FD+MP+R                                
Subjt:  NSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVK---CGMMIDACRLFDKMPER--------------------------------

Query:  -NAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP
         + VS+NTII+GY  SG Y +A R+   M          T ++++   +    +  G+++H   I+ G+  D+++  +L+DMY+K   +ED+  VF  + 
Subjt:  -NAVSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMP

Query:  DKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM
         +  + WNS++AGY  +G   EAL L+ +M  + VK     FS +I  C+ LA++   KQ H  ++R GFG ++   +ALVD YSK G +  AR +FDRM
Subjt:  DKTIVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRM

Query:  SFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIR
        +  + +SW A+I G+  HG G EA+ +FE+M  +G+ PN V ++AVL+ACS  GL +  W  F SMT+ + +     HYA + +LLGR G L+EAY  I 
Subjt:  SFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIR

Query:  KAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQ
        K   +PT ++W+ LL +C V  NLEL +  AEK++ ++ E +  Y+++ N+Y S+G+ KE A +   +++KGLR  PACSWIE+ N+ H F SGD  H  
Subjt:  KAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQ

Query:  IEKVVGKVDELMLKISKLGYVPEQNFMLPDVD-EHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHF
        ++K+   +  +M ++ K GYV + + +L DVD EH+ ++   HSE+LA+A+G++NT   T +++ ++ RIC DCH  IK I+ IT+REI+VRD SRFHHF
Subjt:  IEKVVGKVDELMLKISKLGYVPEQNFMLPDVD-EHEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHF

Query:  RDGSCSCGDYW
          G+CSCGDYW
Subjt:  RDGSCSCGDYW

AT3G24000.1 Tetratricopeptide repeat (TPR)-like superfamily protein    1.9e-130    38.06
Query:  ELEGGFQVGNSTF-DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLF
        +LEG +   +  F + L+  C   K +   + +  +++ + F  D  M N +L M+ KCG + +A ++F+KMP+R+ V+W T+ISGY       +A   F
Subjt:  ELEGGFQVGNSTF-DALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAFRLF

Query:  ILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDL
          M    Y     T +++I+A+A       G QLH   +K G   ++ V  AL+D+Y++ G ++DA  VFD +  +  V WN++IAG+A    +E+AL+L
Subjt:  ILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDL

Query:  YYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIE
        +  M   G +  HF+++ +   CS    + + K  HA ++++G  L   A   L+D Y+K G + DAR +FDR++ R+V+SWN+L+  Y  HG G+EA+ 
Subjt:  YYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIE

Query:  MFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLEL
         FE+M   G+ PN +++L+VL+ACS SGL + GW  ++ M +D  I P A HY  +++LLGR G L+ A   I + P +PTA +W ALL ACR+  N EL
Subjt:  MFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLEL

Query:  GKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVPEQNF
        G +AAE ++ ++P+    +++L NIY S G+  +AA V + +K  G++  PACSW+E+ N  H F + D  H Q E++  K +E++ KI +LGYVP+ + 
Subjt:  GKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVPEQNF

Query:  MLPDVDEHEEKIQM-YHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGS
        ++  VD+ E ++ + YHSEK+A+A+ +LNT   + + I ++ R+CGDCH+ IKL + +  REI+VRD +RFHHF+D S
Subjt:  MLPDVDEHEEKIQM-YHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGS

AT5G50390.1 Pentatricopeptide repeat (PPR-like) superfamily protein    1.0e-256    59.97
Query:  MELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGLRPRP--QPKPSKVDPDVRKETPLKETRIRKSSVG
        ME+PLSRYQ+   D ++ +S++     F  +FS              R+ +N    + CSS  QGL+P+P  +P+P +++    K+  L +T+I KS V 
Subjt:  MELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGLRPRP--QPKPSKVDPDVRKETPLKETRIRKSSVG

Query:  ICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNA
        ICSQIEKLVLC ++R+A E+FEI E+   F+VG ST+DAL+ ACI LKSIR VKR+  +M+ NGFEP+QYM NRILLMHVKCGM+IDA RLFD++PERN 
Subjt:  ICSQIEKLVLCKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNA

Query:  VSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT
         S+ +IISG+V+ GNYVEAF LF +MWEE  DC   TFA M+RASAGL  I+ G+QLH CA+K G+  + FVSC LIDMYSKCG +EDA C F+ MP+KT
Subjt:  VSWNTIISGYVDSGNYVEAFRLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKT

Query:  IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFR
         V WN++IAGYALHGYSEEAL L Y+MRDSGV +D FT SI+IRI ++LA +   KQAHASL+RNGF  ++VANTALVDFYSKWG+VD AR+VFD++  +
Subjt:  IVGWNSIIAGYALHGYSEEALDLYYEMRDSGVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFR

Query:  NVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAP
        N+ISWNAL+ GY NHGRG +A+++FEKM+   + PNHVT+LAVLSAC+ SGL E+GWEIF SM+  H IKPRAMHYACMIELLGR+GLLDEA A IR+AP
Subjt:  NVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTYLAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAP

Query:  FQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIE-
         + T NMWAALL ACR+  NLELG+  AEKLYGM PEKL NY+V+ N+YNS GK  EAA V++TL+ KGL M+PAC+W+EV +Q H+F SGD   +  E 
Subjt:  FQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYNSSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIE-

Query:  ---KVVGKVDELMLKISKLGYVPEQNFMLPDVDE-HEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHH
           ++  KVDELM +IS+ GY  E+  +LPDVDE  EE++  YHSEKLAIAYG++NT E  PLQI Q+HRIC +CH V++ I+++T RE+VVRDASRFHH
Subjt:  ---KVVGKVDELMLKISKLGYVPEQNFMLPDVDE-HEEKIQMYHSEKLAIAYGVLNTLEQTPLQIVQSHRICGDCHSVIKLIAMITKREIVVRDASRFHH

Query:  FRDGSCSCGDYW
        F++G CSCG YW
Subjt:  FRDGSCSCGDYW


Sequences
CDS sequence
ATGACCATGGAACTCCCTCTCTCCCGATATCAAAACTATGTTTATGATCGCCTTCAATGTAACTCCACTTCTAGCTCTACTTCCTACTTCTCCCTGCGTTTCTCAGATTC
CGAGCTTTTTAGGAAGAGATCTTTGCTTTCTAATAGAAGAAAATGCCGTAATTCACTTTGTTGGATCAAGTGCTCTTCGTTTGAACAAGGGCTACGCCCACGGCCCCAAC
CTAAACCTTCCAAAGTTGATCCGGACGTTCGTAAAGAAACCCCTTTGAAGGAGACCCGTATAAGGAAATCCAGTGTAGGGATATGTAGCCAGATAGAGAAGCTGGTTTTG
TGTAAAAAGTATCGAGATGCACTTGAGATGTTTGAAATTTTTGAACTGGAAGGCGGTTTTCAGGTTGGTAACAGCACGTTTGATGCGCTGATTAATGCGTGTATTGGGTT
GAAGTCTATAAGAGGGGTGAAGAGGTTGTGTAATTACATGGTTGATAATGGATTTGAACCCGATCAATACATGAGGAACAGGATTCTACTTATGCATGTGAAATGTGGGA
TGATGATTGATGCTTGTAGATTGTTCGATAAAATGCCTGAAAGGAATGCGGTTTCGTGGAATACTATAATTTCTGGGTATGTAGACTCTGGAAATTATGTTGAAGCGTTT
AGATTGTTCATTTTGATGTGGGAAGAGTATTATGATTGTGGGCCTCGCACCTTTGCCACAATGATTCGGGCATCGGCTGGTTTGGAAGTTATTTTTCCTGGTAGGCAATT
GCATTCATGTGCGATAAAGGCAGGTCTGGGACAGGACATTTTTGTTTCCTGTGCGCTGATTGACATGTACAGCAAGTGTGGAAGCCTTGAAGATGCTCATTGTGTTTTTG
ATGAGATGCCCGATAAGACAATAGTTGGATGGAACTCAATTATAGCTGGTTACGCACTCCATGGCTACAGTGAAGAAGCTCTGGATCTATACTATGAGATGCGTGACTCC
GGAGTTAAAATGGACCATTTCACCTTTTCTATAATTATAAGAATATGCTCGAGATTGGCCTCGGTAGCTCGTGCTAAGCAAGCGCATGCGAGTTTAGTTCGTAATGGCTT
TGGGTTAGATGTAGTAGCTAATACAGCCCTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGGCATGTTTTTGACAGGATGTCCTTTAGAAACGTAATAT
CATGGAATGCTTTGATTGCTGGATATGGGAATCATGGTCGTGGGGAGGAGGCCATTGAGATGTTTGAGAAGATGCTTGGGGAAGGCATGATGCCCAACCATGTGACATAT
CTTGCAGTTTTATCTGCTTGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATTTTTCAATCGATGACTAGAGATCACAAGATTAAACCGCGCGCTATGCATTATGC
GTGCATGATTGAATTGCTAGGTCGAGAAGGGCTCCTAGATGAAGCCTATGCCCTTATAAGGAAAGCTCCATTTCAACCTACAGCAAATATGTGGGCTGCCTTGCTTAGAG
CTTGTAGAGTTGATGGAAATCTAGAACTTGGGAAGTTTGCTGCTGAGAAACTTTATGGGATGGAACCTGAGAAGCTTAGTAATTATATTGTGCTTTTAAACATATACAAC
AGTTCTGGTAAGTTAAAGGAAGCAGCTGATGTTGTTCAGACATTGAAAAGAAAGGGCTTAAGAATGCTTCCAGCATGCAGTTGGATTGAAGTTAATAACCAGCCCCATGC
ATTCCGGTCTGGGGATAACCACCATGCTCAAATAGAAAAAGTAGTGGGAAAAGTGGATGAATTAATGTTGAAGATCTCAAAGCTTGGTTATGTGCCTGAACAGAACTTCA
TGCTTCCAGATGTTGATGAACATGAAGAGAAGATACAGATGTACCACAGTGAGAAGTTGGCAATAGCTTATGGAGTATTAAATACTTTAGAACAAACGCCATTGCAGATT
GTGCAGAGCCATCGCATTTGTGGTGACTGCCATTCTGTGATTAAGCTGATTGCTATGATAACCAAACGTGAAATTGTGGTCAGAGATGCTAGCAGATTCCATCATTTCAG
AGATGGGAGTTGCTCTTGTGGAGACTATTGGTGA
mRNA sequence
ATGACCATGGAACTCCCTCTCTCCCGATATCAAAACTATGTTTATGATCGCCTTCAATGTAACTCCACTTCTAGCTCTACTTCCTACTTCTCCCTGCGTTTCTCAGATTC
CGAGCTTTTTAGGAAGAGATCTTTGCTTTCTAATAGAAGAAAATGCCGTAATTCACTTTGTTGGATCAAGTGCTCTTCGTTTGAACAAGGGCTACGCCCACGGCCCCAAC
CTAAACCTTCCAAAGTTGATCCGGACGTTCGTAAAGAAACCCCTTTGAAGGAGACCCGTATAAGGAAATCCAGTGTAGGGATATGTAGCCAGATAGAGAAGCTGGTTTTG
TGTAAAAAGTATCGAGATGCACTTGAGATGTTTGAAATTTTTGAACTGGAAGGCGGTTTTCAGGTTGGTAACAGCACGTTTGATGCGCTGATTAATGCGTGTATTGGGTT
GAAGTCTATAAGAGGGGTGAAGAGGTTGTGTAATTACATGGTTGATAATGGATTTGAACCCGATCAATACATGAGGAACAGGATTCTACTTATGCATGTGAAATGTGGGA
TGATGATTGATGCTTGTAGATTGTTCGATAAAATGCCTGAAAGGAATGCGGTTTCGTGGAATACTATAATTTCTGGGTATGTAGACTCTGGAAATTATGTTGAAGCGTTT
AGATTGTTCATTTTGATGTGGGAAGAGTATTATGATTGTGGGCCTCGCACCTTTGCCACAATGATTCGGGCATCGGCTGGTTTGGAAGTTATTTTTCCTGGTAGGCAATT
GCATTCATGTGCGATAAAGGCAGGTCTGGGACAGGACATTTTTGTTTCCTGTGCGCTGATTGACATGTACAGCAAGTGTGGAAGCCTTGAAGATGCTCATTGTGTTTTTG
ATGAGATGCCCGATAAGACAATAGTTGGATGGAACTCAATTATAGCTGGTTACGCACTCCATGGCTACAGTGAAGAAGCTCTGGATCTATACTATGAGATGCGTGACTCC
GGAGTTAAAATGGACCATTTCACCTTTTCTATAATTATAAGAATATGCTCGAGATTGGCCTCGGTAGCTCGTGCTAAGCAAGCGCATGCGAGTTTAGTTCGTAATGGCTT
TGGGTTAGATGTAGTAGCTAATACAGCCCTTGTGGATTTCTATAGCAAATGGGGAAAAGTAGATGATGCTAGGCATGTTTTTGACAGGATGTCCTTTAGAAACGTAATAT
CATGGAATGCTTTGATTGCTGGATATGGGAATCATGGTCGTGGGGAGGAGGCCATTGAGATGTTTGAGAAGATGCTTGGGGAAGGCATGATGCCCAACCATGTGACATAT
CTTGCAGTTTTATCTGCTTGTAGTATTTCAGGTTTGTTTGAACGTGGATGGGAAATTTTTCAATCGATGACTAGAGATCACAAGATTAAACCGCGCGCTATGCATTATGC
GTGCATGATTGAATTGCTAGGTCGAGAAGGGCTCCTAGATGAAGCCTATGCCCTTATAAGGAAAGCTCCATTTCAACCTACAGCAAATATGTGGGCTGCCTTGCTTAGAG
CTTGTAGAGTTGATGGAAATCTAGAACTTGGGAAGTTTGCTGCTGAGAAACTTTATGGGATGGAACCTGAGAAGCTTAGTAATTATATTGTGCTTTTAAACATATACAAC
AGTTCTGGTAAGTTAAAGGAAGCAGCTGATGTTGTTCAGACATTGAAAAGAAAGGGCTTAAGAATGCTTCCAGCATGCAGTTGGATTGAAGTTAATAACCAGCCCCATGC
ATTCCGGTCTGGGGATAACCACCATGCTCAAATAGAAAAAGTAGTGGGAAAAGTGGATGAATTAATGTTGAAGATCTCAAAGCTTGGTTATGTGCCTGAACAGAACTTCA
TGCTTCCAGATGTTGATGAACATGAAGAGAAGATACAGATGTACCACAGTGAGAAGTTGGCAATAGCTTATGGAGTATTAAATACTTTAGAACAAACGCCATTGCAGATT
GTGCAGAGCCATCGCATTTGTGGTGACTGCCATTCTGTGATTAAGCTGATTGCTATGATAACCAAACGTGAAATTGTGGTCAGAGATGCTAGCAGATTCCATCATTTCAG
AGATGGGAGTTGCTCTTGTGGAGACTATTGGTGA
Protein sequence
MTMELPLSRYQNYVYDRLQCNSTSSSTSYFSLRFSDSELFRKRSLLSNRRKCRNSLCWIKCSSFEQGLRPRPQPKPSKVDPDVRKETPLKETRIRKSSVGICSQIEKLVL
CKKYRDALEMFEIFELEGGFQVGNSTFDALINACIGLKSIRGVKRLCNYMVDNGFEPDQYMRNRILLMHVKCGMMIDACRLFDKMPERNAVSWNTIISGYVDSGNYVEAF
RLFILMWEEYYDCGPRTFATMIRASAGLEVIFPGRQLHSCAIKAGLGQDIFVSCALIDMYSKCGSLEDAHCVFDEMPDKTIVGWNSIIAGYALHGYSEEALDLYYEMRDS
GVKMDHFTFSIIIRICSRLASVARAKQAHASLVRNGFGLDVVANTALVDFYSKWGKVDDARHVFDRMSFRNVISWNALIAGYGNHGRGEEAIEMFEKMLGEGMMPNHVTY
LAVLSACSISGLFERGWEIFQSMTRDHKIKPRAMHYACMIELLGREGLLDEAYALIRKAPFQPTANMWAALLRACRVDGNLELGKFAAEKLYGMEPEKLSNYIVLLNIYN
SSGKLKEAADVVQTLKRKGLRMLPACSWIEVNNQPHAFRSGDNHHAQIEKVVGKVDELMLKISKLGYVPEQNFMLPDVDEHEEKIQMYHSEKLAIAYGVLNTLEQTPLQI
VQSHRICGDCHSVIKLIAMITKREIVVRDASRFHHFRDGSCSCGDYW