CuGenDBv2

Tan0014158 (gene) of Snake gourd v1 genome

Gene ID: Tan0014158
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: LG09:62346771..62348419
RNA-Seq Expression: Tan0014158
Synteny: Tan0014158
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
                  IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAG6590068.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.4e-244 | % identity: 80.84
Query:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG
        MRWS+S L KSS S ++ TIF    + TSLLSNCRDP HIYQIHGFMLHRALDQDNL LSRFIDACSSLGL LYAFSVFSNK HP + +  T     S  
Subjt:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG

Query:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK
           ++A+SLY+RI IEGLRPDSY+IP VLKAVVQLSAVEVGRQIHTQTVSS LDTDVNV TSLIQMYSSCGCVSDARKLFD V F+DVALWNAMVAGYVK
Subjt:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK

Query:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA
        VAELNSARKVFD+MPQRNVISWTALI  +           LF KMQLE+VEPDEIAMLAVLSACADLGALELGEWIHNYIEKH LCRIVPLYNALIDMYA
Subjt:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA

Query:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL
        KSGNI RALEVFE MKHK+VITWSTM+AA+ALHGLGG+A DMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYF+RMQS+YKI+P+IEHYGCMIDL
Subjt:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL

Query:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN
        LARAGYLQEAQKL QDMP+EANAAIWGSLLAASN HRDA+LAEQALRHLAKLEPENSGNYTLLSNTY              +MRNAGVKKAPGGSFIEIN
Subjt:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN

Query:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE
        NRV+EFLAGD+S SQ+ GIY VLC IIVQLKMAG   EEWSKFL+YDE
Subjt:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE

KAG7023735.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.6e-248 | % identity: 82.12
Query:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG
        MRWS S LCKSS S ++ TIF  P A TSLLSNCRDP HIYQIHGFMLHRALDQDNL LSRFI ACSSLGL LYAFSVFSNK HP + +  T     S  
Subjt:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG

Query:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK
           ++AISLY+RI IEGLRPDSY+IPFVLKAVVQLSAVEVGRQIHTQTVSS LDTDVNV TSLIQMYSSCGCVSDARKLFD V F+DVALWNAMVAGYVK
Subjt:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK

Query:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA
        VAELNSARKVFD+MPQRNVISWTALI  +           LF KMQLE+VEPDEIAMLAVLSACADLGALELGEWIHNYIEKH LCRIVPLYNALIDMYA
Subjt:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA

Query:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL
        KSGNI RALEVFE MKHKSVITWSTM+AA+ALHGLGG+AIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYF+RMQS+YKI+P+IEHYGCMIDL
Subjt:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL

Query:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN
        LARAGYLQEAQKL QDMP+EANAAIWGSLLAASN HRDA+LAEQALRHLAKLEPENSGNYTLLSNTY              +MRNAGVKKAPGGSFIEIN
Subjt:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN

Query:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE
        NRV+EFLAGD+S SQL GIY VLC IIVQLKMAG   EEWSKFL+YDE
Subjt:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE

XP_022960640.1 pentatricopeptide repeat-containing protein At5g56310 [Cucurbita moschata] | e value: 1.4e-244 | % identity: 81.2
Query:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG
        MRWS+  LCKSS SF++      P A TSLLSNCRDP HIYQIHGFMLHRALDQDNL LSRFIDACSSLGL LYA SVFSNK HP + +  T     S  
Subjt:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG

Query:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK
           ++AISLY+RI IEGLRPDSY+IPFVLKA+VQLSAVEVGRQIHTQTVSS LDTDVNVVTSLIQMYSSCGCVSDARKLFD V F+DVALWNAMVAGYVK
Subjt:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK

Query:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA
        VAELNSARKVFD+MPQRNVISWTALI  +           LF KMQLE+VEPDEIAMLAVLSACADLGALELGEWIHNYIEKH LCRIVPLYNALIDMYA
Subjt:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA

Query:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL
        KSGNI RALEVFE MKHK+VITWSTM+AA+ALHGLGG+A DMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYF+RMQS+YKI+P+IEHYGCMIDL
Subjt:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL

Query:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN
        LARAGYLQEAQKL QDMP+EANAAIWGSLLAASN HRDA+LAEQALRHLAKLEPENSGNYTLLSNTY              +MRNAGVKKAPGGSFIEIN
Subjt:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN

Query:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE
        NRV+EFLAGD+S SQL GIY VLC IIVQLKMAG   EEWSKFL+YDE
Subjt:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE

XP_022987264.1 pentatricopeptide repeat-containing protein At5g56310 [Cucurbita maxima] | e value: 1.9e-249 | % identity: 82.66
Query:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG
        MRWS+S LCKSS SF++ TIFT P A TSLLSNCRD  HIYQIHGFMLHRALDQDNL LSRFIDACSSLGL LYAFSVFSNKTHP + +  T     S  
Subjt:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG

Query:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK
           ++AISLY+RI IEGLRPDSY+IPFVLKAVVQLS VEVGRQIHTQTVSS LDTDVNVVTSLIQMYSSCGCVSDARKLFD V ++DVALWNAMVAGYVK
Subjt:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK

Query:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA
        VAELNSARKVFD+MPQRNVISWTALI  +           LF KMQLE+VEPDEIAMLAVLSACADLGALELGEWIHNYIEKH LCRIVPLYNALIDMYA
Subjt:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA

Query:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL
        KSGNI RALEVFE MKHKSVITWSTM+AA+ALHGLGG+AIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYF+RMQS+YKI+P+IEHYGCMIDL
Subjt:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL

Query:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN
        LARAGYLQEAQKL QDMPFEANAAIWGSLLAASN HRDA+LAEQALRHLAKLEPENSGNYTLLSNTY              +MRNAGVKKAPGGSFIEIN
Subjt:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN

Query:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE
        NRVYEFLAGD+S SQL GIY VLC IIVQLKMAG   EE SKFL+YDE
Subjt:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE

XP_023515616.1 pentatricopeptide repeat-containing protein At5g56310 [Cucurbita pepo subsp. pepo] | e value: 1.4e-249 | % identity: 82.48
Query:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG
        MRWS+S LCKSS SF++ TIFT P A TSLLSNCRD  HIYQIHGFMLHRALDQDNL LSRFIDACSSLGL LYAFSVFSNKTHP + +  T     S  
Subjt:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG

Query:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK
           ++AISLY+RI IEGLRPDSY+IPFVLKAVVQLSAVEVGRQIHTQTVSS LDTDVNV TSLIQMYS CGCVSDARKLFD V F+DVALWNAMVAGYVK
Subjt:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK

Query:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA
        VAELNSARKVFD+MPQRNVISWTALI  +           LF KMQLE+VEPDEIAMLAVLSACADLGALELGEWIHNYIEKH LCRIVPLYNALIDMYA
Subjt:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA

Query:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL
        KSGNI RALEVFE MKHKSVITWSTM+AA+ALHGLGG+AIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYF+RMQS+YKI+P+IEHYGCMIDL
Subjt:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL

Query:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN
        LARAGYLQEAQ+L QDMP+EANAAIWGSLLAASN HRDAELAEQALRHLAKLEPENSGNYTLLSNTY              +MRNAGVKKAPGGSFIEIN
Subjt:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN

Query:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE
        NRV+EFLAGD+S SQL GIY VLC IIVQLKMAG   EEWSKFL+YDE
Subjt:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BR86 pentatricopeptide repeat-containing protein At5g56310 | e value: 7.2e-239 | % identity: 79.2
Query:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHPIFVSTTPPLKP-SVG
        M WS SLL KSS SF +P+IFT+PL FTSLLSNCR   H+YQ+HGFMLHRALDQDNLFLS+FIDAC+SLGLS YAFS+FSNK HP        +K  S  
Subjt:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHPIFVSTTPPLKP-SVG

Query:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK
           +DAI LYTRI+I+GLRPDSYSIPFVLKAVV+LSAVEVGRQIH QTVSSALD DVNV TSLIQMYSSCG VSDARK FDFV F+DVALWNAMVAGYVK
Subjt:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK

Query:  VAELNSARKVFDEMPQRNVISWTALI-------RLH---MLFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA
        + EL +ARKVF+EMPQRNVISWT LI       R H    LF KMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA
Subjt:  VAELNSARKVFDEMPQRNVISWTALI-------RLH---MLFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA

Query:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL
        KSGNIRRALEVFE MK KSVITWST++AALALHGLG EAIDMFLRMEK +VRPNEVTF AILSACSHVGMVDVGRYYFD+MQS+Y+IEP+IEHYGCMIDL
Subjt:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL

Query:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN
        LARAGYLQEA KLL DMPFEANA IWGSLLAASN H+DA LA+QAL+HLAKLEPENSGNY LLSNTY              LMRNAGVKKAPGGS IEIN
Subjt:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN

Query:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE
        NRVYEFLAGDKS S +  +Y VLCKII+QLKMAG  QEEW KFL+YDE
Subjt:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE

A0A5A7UQ55 Pentatricopeptide repeat-containing protein | e value: 7.2e-239 | % identity: 79.2
Query:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHPIFVSTTPPLKP-SVG
        M WS SLL KSS SF +P+IFT+PL FTSLLSNCR   H+YQ+HGFMLHRALDQDNLFLS+FIDAC+SLGLS YAFS+FSNK HP        +K  S  
Subjt:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHPIFVSTTPPLKP-SVG

Query:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK
           +DAI LYTRI+I+GLRPDSYSIPFVLKAVV+LSAVEVGRQIH QTVSSALD DVNV TSLIQMYSSCG VSDARK FDFV F+DVALWNAMVAGYVK
Subjt:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK

Query:  VAELNSARKVFDEMPQRNVISWTALI-------RLH---MLFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA
        + EL +ARKVF+EMPQRNVISWT LI       R H    LF KMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA
Subjt:  VAELNSARKVFDEMPQRNVISWTALI-------RLH---MLFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA

Query:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL
        KSGNIRRALEVFE MK KSVITWST++AALALHGLG EAIDMFLRMEK +VRPNEVTF AILSACSHVGMVDVGRYYFD+MQS+Y+IEP+IEHYGCMIDL
Subjt:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL

Query:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN
        LARAGYLQEA KLL DMPFEANA IWGSLLAASN H+DA LA+QAL+HLAKLEPENSGNY LLSNTY              LMRNAGVKKAPGGS IEIN
Subjt:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN

Query:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE
        NRVYEFLAGDKS S +  +Y VLCKII+QLKMAG  QEEW KFL+YDE
Subjt:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE

A0A6J1D0Q9 pentatricopeptide repeat-containing protein At5g56310 | e value: 1.2e-241 | % identity: 79.42
Query:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP--IFVSTTPPLKPSV
        MRWS+S+LCKSSP FI+ TIFT P AF SLLS+C DP H+ QIHGFM+ RALDQDNL LSRFIDACSSL L  YA+SVFS+KT+P     +TT       
Subjt:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP--IFVSTTPPLKPSV

Query:  GHPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYV
          P + AISL++RIR EGL+PDSYSIPFVLKAVV++S ++VGRQIHTQTV SALDTDVNVVTSLIQMYSSCGCVSDARKLFDFV +RDVALWN+MVAGYV
Subjt:  GHPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYV

Query:  KVAELNSARKVFDEMPQRNVISWTALIRLH----------MLFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMY
        KVA+LNSARK+FDEMPQRNVI+WTALI  +           LF KMQL+EVEPDEIAMLAVLSACADLGALELGEWIHNYI KHGLCRIVPLYNALIDMY
Subjt:  KVAELNSARKVFDEMPQRNVISWTALIRLH----------MLFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMY

Query:  AKSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMID
        +KSGNIRRALE+FE MK KSVITWSTM+AALALHG GGEAID+FLRMEKARVRPNE+TFIAILSACSHVGMVD+GRYYFDRMQS YKI+P+IEHYGCMID
Subjt:  AKSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMID

Query:  LLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEI
        LLARAG+LQEAQK+L+DMPFEANAAIWGSLLAASNIHRDAELAEQALRHLA+LEPENSGNYTLLSNTY              +MRNAGVKKAPGGSFIEI
Subjt:  LLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEI

Query:  NNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE
        NN+VYEFLAGDKS SQL  I+ VLCK IVQLKMAGLLQ+EWSKFLD+DE
Subjt:  NNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE

A0A6J1H9N0 pentatricopeptide repeat-containing protein At5g56310 | e value: 6.7e-245 | % identity: 81.2
Query:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG
        MRWS+  LCKSS SF++      P A TSLLSNCRDP HIYQIHGFMLHRALDQDNL LSRFIDACSSLGL LYA SVFSNK HP + +  T     S  
Subjt:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG

Query:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK
           ++AISLY+RI IEGLRPDSY+IPFVLKA+VQLSAVEVGRQIHTQTVSS LDTDVNVVTSLIQMYSSCGCVSDARKLFD V F+DVALWNAMVAGYVK
Subjt:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK

Query:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA
        VAELNSARKVFD+MPQRNVISWTALI  +           LF KMQLE+VEPDEIAMLAVLSACADLGALELGEWIHNYIEKH LCRIVPLYNALIDMYA
Subjt:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA

Query:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL
        KSGNI RALEVFE MKHK+VITWSTM+AA+ALHGLGG+A DMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYF+RMQS+YKI+P+IEHYGCMIDL
Subjt:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL

Query:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN
        LARAGYLQEAQKL QDMP+EANAAIWGSLLAASN HRDA+LAEQALRHLAKLEPENSGNYTLLSNTY              +MRNAGVKKAPGGSFIEIN
Subjt:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN

Query:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE
        NRV+EFLAGD+S SQL GIY VLC IIVQLKMAG   EEWSKFL+YDE
Subjt:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE

A0A6J1JID8 pentatricopeptide repeat-containing protein At5g56310 | e value: 9.1e-250 | % identity: 82.66
Query:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG
        MRWS+S LCKSS SF++ TIFT P A TSLLSNCRD  HIYQIHGFMLHRALDQDNL LSRFIDACSSLGL LYAFSVFSNKTHP + +  T     S  
Subjt:  MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVG

Query:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK
           ++AISLY+RI IEGLRPDSY+IPFVLKAVVQLS VEVGRQIHTQTVSS LDTDVNVVTSLIQMYSSCGCVSDARKLFD V ++DVALWNAMVAGYVK
Subjt:  HPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVK

Query:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA
        VAELNSARKVFD+MPQRNVISWTALI  +           LF KMQLE+VEPDEIAMLAVLSACADLGALELGEWIHNYIEKH LCRIVPLYNALIDMYA
Subjt:  VAELNSARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYA

Query:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL
        KSGNI RALEVFE MKHKSVITWSTM+AA+ALHGLGG+AIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYF+RMQS+YKI+P+IEHYGCMIDL
Subjt:  KSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDL

Query:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN
        LARAGYLQEAQKL QDMPFEANAAIWGSLLAASN HRDA+LAEQALRHLAKLEPENSGNYTLLSNTY              +MRNAGVKKAPGGSFIEIN
Subjt:  LARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEIN

Query:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE
        NRVYEFLAGD+S SQL GIY VLC IIVQLKMAG   EE SKFL+YDE
Subjt:  NRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE

SwissProt top hits (e value, % identity, alignment)
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic | e value: 6.6e-88 | % identity: 34.89
Query:  IHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHPIFVSTTPPLKPSVGHPLLD-AISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGR
        +HG  +  A+  D    +  I    S G    A  VF+       VS    +   V     D A+ L+ ++  E ++    ++  VL A  ++  +E GR
Subjt:  IHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHPIFVSTTPPLKPSVGHPLLD-AISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGR

Query:  QIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVISWTALIRLH----------MLF
        Q+ +    + ++ ++ +  +++ MY+ CG + DA++LFD +  +D   W  M+ GY    +  +AR+V + MPQ+++++W ALI  +          ++F
Subjt:  QIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVISWTALIRLH----------MLF

Query:  VKMQLEE-VEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAID
         ++QL++ ++ ++I +++ LSACA +GALELG WIH+YI+KHG+     + +ALI MY+K G++ ++ EVF  ++ + V  WS M+  LA+HG G EA+D
Subjt:  VKMQLEE-VEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAID

Query:  MFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAEL
        MF +M++A V+PN VTF  +  ACSH G+VD     F +M+S Y I P+ +HY C++D+L R+GYL++A K ++ MP   + ++WG+LL A  IH +  L
Subjt:  MFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAEL

Query:  AEQALRHLAKLEPENSGNYTLLSNTYL--------------MRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAG
        AE A   L +LEP N G + LLSN Y               MR  G+KK PG S IEI+  ++EFL+GD +    + +Y  L +++ +LK  G
Subjt:  AEQALRHLAKLEPENSGNYTLLSNTYL--------------MRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAG

Q683I9 Pentatricopeptide repeat-containing protein At3g62890 | e value: 5.0e-88 | % identity: 39.87
Query:  ISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNS
        IS+Y R+R   + PD ++ PF+L +      + +G++ H Q +   LD D  V TSL+ MYSSCG +  A+++FD    +D+  WN++V  Y K   ++ 
Subjt:  ISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNS

Query:  ARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEE-----VEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAK
        ARK+FDEMP+RNVISW+ LI  ++          LF +MQL +     V P+E  M  VLSAC  LGALE G+W+H YI+K+ +   + L  ALIDMYAK
Subjt:  ARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEE-----VEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAK

Query:  SGNIRRALEVFEIM-KHKSVITWSTMVAALALHGLGGEAIDMFLRMEKA-RVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMID
         G++ RA  VF  +   K V  +S M+  LA++GL  E   +F  M  +  + PN VTF+ IL AC H G+++ G+ YF  M   + I P I+HYGCM+D
Subjt:  SGNIRRALEVFEIM-KHKSVITWSTMVAALALHGLGGEAIDMFLRMEKA-RVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMID

Query:  LLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTYL--------------MRNAGVKKAPGGSFIEI
        L  R+G ++EA+  +  MP E +  IWGSLL+ S +  D +  E AL+ L +L+P NSG Y LLSN Y               M   G+ K PG S++E+
Subjt:  LLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTYL--------------MRNAGVKKAPGGSFIEI

Query:  NNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE
           V+EF+ GD+S  + + IY +L +I+ +L+ AG + +     LD +E
Subjt:  NNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE

Q9FJY7 Pentatricopeptide repeat-containing protein At5g66520 | e value: 8.6e-88 | % identity: 37.85
Query:  SLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDAC---SSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVGHPLLDAISLYTRIRIEGLRPDSYSI
        S L  C     + QIH  ML   L QD+  +++F+  C   +S     YA  VF     P  F+        S       ++ LY R+       ++Y+ 
Subjt:  SLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDAC---SSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVGHPLLDAISLYTRIRIEGLRPDSYSI

Query:  PFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVISWTAL
        P +LKA   LSA E   QIH Q      + DV  V SLI  Y+  G    A  LFD +   D   WN+++ GYVK  +++ A  +F +M ++N ISWT +
Subjt:  PFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVISWTAL

Query:  IRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWST
        I  ++          LF +MQ  +VEPD +++   LSACA LGALE G+WIH+Y+ K  +     L   LIDMYAK G +  ALEVF+ +K KSV  W+ 
Subjt:  IRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWST

Query:  MVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAI
        +++  A HG G EAI  F+ M+K  ++PN +TF A+L+ACS+ G+V+ G+  F  M+  Y ++P IEHYGC++DLL RAG L EA++ +Q+MP + NA I
Subjt:  MVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAI

Query:  WGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGI
        WG+LL A  IH++ EL E+    L  ++P + G Y   +N +              LM+  GV K PG S I +    +EFLAGD+S  +++ I
Subjt:  WGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGI

Q9FMA1 Pentatricopeptide repeat-containing protein At5g56310 | e value: 8.2e-131 | % identity: 50.1
Query:  QIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNK------THPIFVSTTPPLKPSVGHPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLS
        Q H +M+   L++DNL +++FI+ACS+ G   YA+SVF+++       H   +     L     H +  AI++Y ++     +PD+++ PFVLK  V++S
Subjt:  QIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNK------THPIFVSTTPPLKPSVGHPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLS

Query:  AVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMP--QRNVISWTALIRLHM----
         V  GRQIH Q V    D+ V+VVT LIQMY SCG + DARK+FD +  +DV +WNA++AGY KV E++ AR + + MP   RN +SWT +I  +     
Subjt:  AVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMP--QRNVISWTALIRLHM----

Query:  ------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHG
              +F +M +E VEPDE+ +LAVLSACADLG+LELGE I +Y++  G+ R V L NA+IDMYAKSGNI +AL+VFE +  ++V+TW+T++A LA HG
Subjt:  ------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHG

Query:  LGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASN
         G EA+ MF RM KA VRPN+VTFIAILSACSHVG VD+G+  F+ M+S Y I P IEHYGCMIDLL RAG L+EA ++++ MPF+ANAAIWGSLLAASN
Subjt:  LGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASN

Query:  IHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLK
        +H D EL E+AL  L KLEP NSGNY LL+N Y              +M+  GVKK  G S IE+ NRVY+F++GD +  Q++ I+E+L ++ +Q++
Subjt:  IHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLK

Q9SIL5 Pentatricopeptide repeat-containing protein At2g20540 | e value: 1.5e-92 | % identity: 37.92
Query:  QIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVGHPLLDAISLYTRIRIEGLR-PDSYSIPFVLKAVVQLSAVEV
        +I+  ++   L Q +  +++ +D C  +    YA  +F+  ++P +F+  +     +      D I +Y ++  +    PD ++ PF+ K+   L +  +
Subjt:  QIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVGHPLLDAISLYTRIRIEGLR-PDSYSIPFVLKAVVQLSAVEV

Query:  GRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVISWTALIRLHM----------
        G+Q+H           V    +LI MY     + DA K+FD +  RDV  WN++++GY ++ ++  A+ +F  M  + ++SWTA+I  +           
Subjt:  GRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVISWTALIRLHM----------

Query:  LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAI
         F +MQL  +EPDEI++++VL +CA LG+LELG+WIH Y E+ G  +   + NALI+MY+K G I +A+++F  M+ K VI+WSTM++  A HG    AI
Subjt:  LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAI

Query:  DMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAE
        + F  M++A+V+PN +TF+ +LSACSHVGM   G  YFD M+  Y+IEPKIEHYGC+ID+LARAG L+ A ++ + MP + ++ IWGSLL++     + +
Subjt:  DMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAE

Query:  LAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEINNRVYEFLAGDKS
        +A  A+ HL +LEPE+ GNY LL+N Y              ++RN  +KK PGGS IE+NN V EF++GD S
Subjt:  LAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEINNRVYEFLAGDKS

Arabidopsis top hits (e value, % identity, alignment)
AT2G20540.1 mitochondrial editing factor 21 | e value: 1.1e-93 | % identity: 37.92
Query:  QIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVGHPLLDAISLYTRIRIEGLR-PDSYSIPFVLKAVVQLSAVEV
        +I+  ++   L Q +  +++ +D C  +    YA  +F+  ++P +F+  +     +      D I +Y ++  +    PD ++ PF+ K+   L +  +
Subjt:  QIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVGHPLLDAISLYTRIRIEGLR-PDSYSIPFVLKAVVQLSAVEV

Query:  GRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVISWTALIRLHM----------
        G+Q+H           V    +LI MY     + DA K+FD +  RDV  WN++++GY ++ ++  A+ +F  M  + ++SWTA+I  +           
Subjt:  GRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVISWTALIRLHM----------

Query:  LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAI
         F +MQL  +EPDEI++++VL +CA LG+LELG+WIH Y E+ G  +   + NALI+MY+K G I +A+++F  M+ K VI+WSTM++  A HG    AI
Subjt:  LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAI

Query:  DMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAE
        + F  M++A+V+PN +TF+ +LSACSHVGM   G  YFD M+  Y+IEPKIEHYGC+ID+LARAG L+ A ++ + MP + ++ IWGSLL++     + +
Subjt:  DMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAE

Query:  LAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEINNRVYEFLAGDKS
        +A  A+ HL +LEPE+ GNY LL+N Y              ++RN  +KK PGGS IE+NN V EF++GD S
Subjt:  LAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEINNRVYEFLAGDKS

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 4.7e-89 | % identity: 34.89
Query:  IHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHPIFVSTTPPLKPSVGHPLLD-AISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGR
        +HG  +  A+  D    +  I    S G    A  VF+       VS    +   V     D A+ L+ ++  E ++    ++  VL A  ++  +E GR
Subjt:  IHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHPIFVSTTPPLKPSVGHPLLD-AISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGR

Query:  QIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVISWTALIRLH----------MLF
        Q+ +    + ++ ++ +  +++ MY+ CG + DA++LFD +  +D   W  M+ GY    +  +AR+V + MPQ+++++W ALI  +          ++F
Subjt:  QIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVISWTALIRLH----------MLF

Query:  VKMQLEE-VEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAID
         ++QL++ ++ ++I +++ LSACA +GALELG WIH+YI+KHG+     + +ALI MY+K G++ ++ EVF  ++ + V  WS M+  LA+HG G EA+D
Subjt:  VKMQLEE-VEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAID

Query:  MFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAEL
        MF +M++A V+PN VTF  +  ACSH G+VD     F +M+S Y I P+ +HY C++D+L R+GYL++A K ++ MP   + ++WG+LL A  IH +  L
Subjt:  MFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAEL

Query:  AEQALRHLAKLEPENSGNYTLLSNTYL--------------MRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAG
        AE A   L +LEP N G + LLSN Y               MR  G+KK PG S IEI+  ++EFL+GD +    + +Y  L +++ +LK  G
Subjt:  AEQALRHLAKLEPENSGNYTLLSNTYL--------------MRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAG

AT3G62890.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 3.6e-89 | % identity: 39.87
Query:  ISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNS
        IS+Y R+R   + PD ++ PF+L +      + +G++ H Q +   LD D  V TSL+ MYSSCG +  A+++FD    +D+  WN++V  Y K   ++ 
Subjt:  ISLYTRIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNS

Query:  ARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEE-----VEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAK
        ARK+FDEMP+RNVISW+ LI  ++          LF +MQL +     V P+E  M  VLSAC  LGALE G+W+H YI+K+ +   + L  ALIDMYAK
Subjt:  ARKVFDEMPQRNVISWTALIRLHM----------LFVKMQLEE-----VEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAK

Query:  SGNIRRALEVFEIM-KHKSVITWSTMVAALALHGLGGEAIDMFLRMEKA-RVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMID
         G++ RA  VF  +   K V  +S M+  LA++GL  E   +F  M  +  + PN VTF+ IL AC H G+++ G+ YF  M   + I P I+HYGCM+D
Subjt:  SGNIRRALEVFEIM-KHKSVITWSTMVAALALHGLGGEAIDMFLRMEKA-RVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMID

Query:  LLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTYL--------------MRNAGVKKAPGGSFIEI
        L  R+G ++EA+  +  MP E +  IWGSLL+ S +  D +  E AL+ L +L+P NSG Y LLSN Y               M   G+ K PG S++E+
Subjt:  LLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTYL--------------MRNAGVKKAPGGSFIEI

Query:  NNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE
           V+EF+ GD+S  + + IY +L +I+ +L+ AG + +     LD +E
Subjt:  NNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE

AT5G56310.1 Pentatricopeptide repeat (PPR) superfamily protein (e value: 5.8e-132; % identity: 50.1)
Query:  QIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNK------THPIFVSTTPPLKPSVGHPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLS
        Q H +M+   L++DNL +++FI+ACS+ G   YA+SVF+++       H   +     L     H +  AI++Y ++     +PD+++ PFVLK  V++S
Subjt:  QIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNK------THPIFVSTTPPLKPSVGHPLLDAISLYTRIRIEGLRPDSYSIPFVLKAVVQLS

Query:  AVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMP--QRNVISWTALIRLHM----
         V  GRQIH Q V    D+ V+VVT LIQMY SCG + DARK+FD +  +DV +WNA++AGY KV E++ AR + + MP   RN +SWT +I  +     
Subjt:  AVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMP--QRNVISWTALIRLHM----

Query:  ------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHG
              +F +M +E VEPDE+ +LAVLSACADLG+LELGE I +Y++  G+ R V L NA+IDMYAKSGNI +AL+VFE +  ++V+TW+T++A LA HG
Subjt:  ------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHG

Query:  LGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASN
         G EA+ MF RM KA VRPN+VTFIAILSACSHVG VD+G+  F+ M+S Y I P IEHYGCMIDLL RAG L+EA ++++ MPF+ANAAIWGSLLAASN
Subjt:  LGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASN

Query:  IHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLK
        +H D EL E+AL  L KLEP NSGNY LL+N Y              +M+  GVKK  G S IE+ NRVY+F++GD +  Q++ I+E+L ++ +Q++
Subjt:  IHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLK

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein (e value: 6.1e-89; % identity: 37.85)
Query:  SLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDAC---SSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVGHPLLDAISLYTRIRIEGLRPDSYSI
        S L  C     + QIH  ML   L QD+  +++F+  C   +S     YA  VF     P  F+        S       ++ LY R+       ++Y+ 
Subjt:  SLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDAC---SSLGLSLYAFSVFSNKTHP-IFVSTTPPLKPSVGHPLLDAISLYTRIRIEGLRPDSYSI

Query:  PFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVISWTAL
        P +LKA   LSA E   QIH Q      + DV  V SLI  Y+  G    A  LFD +   D   WN+++ GYVK  +++ A  +F +M ++N ISWT +
Subjt:  PFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVISWTAL

Query:  IRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWST
        I  ++          LF +MQ  +VEPD +++   LSACA LGALE G+WIH+Y+ K  +     L   LIDMYAK G +  ALEVF+ +K KSV  W+ 
Subjt:  IRLHM----------LFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWST

Query:  MVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAI
        +++  A HG G EAI  F+ M+K  ++PN +TF A+L+ACS+ G+V+ G+  F  M+  Y ++P IEHYGC++DLL RAG L EA++ +Q+MP + NA I
Subjt:  MVAALALHGLGGEAIDMFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAI

Query:  WGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGI
        WG+LL A  IH++ EL E+    L  ++P + G Y   +N +              LM+  GV K PG S I +    +EFLAGD+S  +++ I
Subjt:  WGSLLAASNIHRDAELAEQALRHLAKLEPENSGNYTLLSNTY--------------LMRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGI


Sequences
CDS sequence
ATGCGCTGGTCCATTTCCCTCCTCTGTAAATCGTCTCCTTCCTTCATCAAACCCACCATTTTCACAACTCCCCTCGCCTTTACTTCCCTTCTAAGCAACTGCCGCGACCC
TTGTCACATTTATCAAATCCATGGCTTCATGTTGCACAGAGCTCTCGATCAAGACAACCTCTTCCTCAGCAGATTCATCGACGCTTGTTCTTCTCTCGGCCTCTCTTTAT
ACGCCTTTTCCGTCTTCTCAAACAAAACCCACCCGATCTTCGTCTCTACAACACCGCCATTAAAGCCCTCTGTCGGACATCCTCTGTTGGACGCCATTTCGCTTTACACC
AGGATTCGAATTGAGGGGTTGCGGCCGGATTCCTACTCTATTCCCTTTGTTTTGAAGGCCGTCGTTCAGTTATCCGCCGTTGAAGTGGGGCGGCAGATTCATACCCAGAC
GGTTTCTTCGGCTTTGGATACGGACGTGAATGTTGTCACTTCGTTGATTCAAATGTATTCTTCTTGTGGGTGTGTTTCTGATGCTCGTAAGCTGTTTGATTTTGTTGCTT
TTAGGGATGTTGCTTTGTGGAATGCCATGGTTGCTGGGTATGTTAAAGTTGCAGAACTTAATAGTGCGCGTAAGGTGTTCGACGAAATGCCTCAAAGGAATGTGATCTCT
TGGACTGCTTTGATTCGATTGCATATGCTGTTCGTGAAGATGCAGCTTGAAGAAGTGGAGCCTGATGAAATTGCAATGTTGGCTGTGCTCTCTGCTTGTGCTGATCTGGG
GGCTCTTGAGCTTGGCGAGTGGATCCATAACTATATTGAAAAGCATGGTTTGTGCAGGATTGTTCCATTGTACAATGCCCTTATAGATATGTATGCAAAATCAGGCAACA
TAAGAAGAGCACTGGAAGTTTTTGAGATCATGAAGCATAAAAGTGTCATAACTTGGTCCACCATGGTTGCTGCCTTGGCTCTTCACGGGCTTGGAGGAGAAGCCATTGAC
ATGTTCCTTCGAATGGAGAAGGCAAGAGTTAGGCCAAATGAAGTAACTTTCATAGCAATCCTATCTGCTTGCAGTCATGTTGGAATGGTGGATGTGGGTCGTTATTATTT
TGATCGAATGCAATCAATCTACAAAATTGAGCCAAAAATTGAGCACTATGGCTGCATGATTGATCTGCTGGCTCGTGCTGGTTATCTTCAAGAGGCACAAAAACTGCTTC
AGGACATGCCATTTGAAGCAAATGCAGCGATATGGGGGTCTCTTCTTGCTGCTTCCAATATCCATAGGGATGCTGAGCTTGCAGAGCAGGCTTTGAGGCATCTTGCAAAG
CTGGAGCCTGAAAATAGTGGGAATTATACACTCTTATCCAACACATATCTGATGAGAAATGCAGGTGTGAAGAAGGCTCCGGGTGGAAGCTTTATTGAAATTAATAACAG
AGTATATGAATTTCTTGCTGGAGATAAGTCAGGTTCTCAGTTACAAGGGATCTATGAAGTCTTGTGCAAGATAATTGTGCAGTTGAAAATGGCCGGATTGTTACAGGAGG
AATGGAGTAAGTTTCTCGACTACGATGAGTGA
mRNA sequence
ATGCGCTGGTCCATTTCCCTCCTCTGTAAATCGTCTCCTTCCTTCATCAAACCCACCATTTTCACAACTCCCCTCGCCTTTACTTCCCTTCTAAGCAACTGCCGCGACCC
TTGTCACATTTATCAAATCCATGGCTTCATGTTGCACAGAGCTCTCGATCAAGACAACCTCTTCCTCAGCAGATTCATCGACGCTTGTTCTTCTCTCGGCCTCTCTTTAT
ACGCCTTTTCCGTCTTCTCAAACAAAACCCACCCGATCTTCGTCTCTACAACACCGCCATTAAAGCCCTCTGTCGGACATCCTCTGTTGGACGCCATTTCGCTTTACACC
AGGATTCGAATTGAGGGGTTGCGGCCGGATTCCTACTCTATTCCCTTTGTTTTGAAGGCCGTCGTTCAGTTATCCGCCGTTGAAGTGGGGCGGCAGATTCATACCCAGAC
GGTTTCTTCGGCTTTGGATACGGACGTGAATGTTGTCACTTCGTTGATTCAAATGTATTCTTCTTGTGGGTGTGTTTCTGATGCTCGTAAGCTGTTTGATTTTGTTGCTT
TTAGGGATGTTGCTTTGTGGAATGCCATGGTTGCTGGGTATGTTAAAGTTGCAGAACTTAATAGTGCGCGTAAGGTGTTCGACGAAATGCCTCAAAGGAATGTGATCTCT
TGGACTGCTTTGATTCGATTGCATATGCTGTTCGTGAAGATGCAGCTTGAAGAAGTGGAGCCTGATGAAATTGCAATGTTGGCTGTGCTCTCTGCTTGTGCTGATCTGGG
GGCTCTTGAGCTTGGCGAGTGGATCCATAACTATATTGAAAAGCATGGTTTGTGCAGGATTGTTCCATTGTACAATGCCCTTATAGATATGTATGCAAAATCAGGCAACA
TAAGAAGAGCACTGGAAGTTTTTGAGATCATGAAGCATAAAAGTGTCATAACTTGGTCCACCATGGTTGCTGCCTTGGCTCTTCACGGGCTTGGAGGAGAAGCCATTGAC
ATGTTCCTTCGAATGGAGAAGGCAAGAGTTAGGCCAAATGAAGTAACTTTCATAGCAATCCTATCTGCTTGCAGTCATGTTGGAATGGTGGATGTGGGTCGTTATTATTT
TGATCGAATGCAATCAATCTACAAAATTGAGCCAAAAATTGAGCACTATGGCTGCATGATTGATCTGCTGGCTCGTGCTGGTTATCTTCAAGAGGCACAAAAACTGCTTC
AGGACATGCCATTTGAAGCAAATGCAGCGATATGGGGGTCTCTTCTTGCTGCTTCCAATATCCATAGGGATGCTGAGCTTGCAGAGCAGGCTTTGAGGCATCTTGCAAAG
CTGGAGCCTGAAAATAGTGGGAATTATACACTCTTATCCAACACATATCTGATGAGAAATGCAGGTGTGAAGAAGGCTCCGGGTGGAAGCTTTATTGAAATTAATAACAG
AGTATATGAATTTCTTGCTGGAGATAAGTCAGGTTCTCAGTTACAAGGGATCTATGAAGTCTTGTGCAAGATAATTGTGCAGTTGAAAATGGCCGGATTGTTACAGGAGG
AATGGAGTAAGTTTCTCGACTACGATGAGTGA
Protein sequence
MRWSISLLCKSSPSFIKPTIFTTPLAFTSLLSNCRDPCHIYQIHGFMLHRALDQDNLFLSRFIDACSSLGLSLYAFSVFSNKTHPIFVSTTPPLKPSVGHPLLDAISLYT
RIRIEGLRPDSYSIPFVLKAVVQLSAVEVGRQIHTQTVSSALDTDVNVVTSLIQMYSSCGCVSDARKLFDFVAFRDVALWNAMVAGYVKVAELNSARKVFDEMPQRNVIS
WTALIRLHMLFVKMQLEEVEPDEIAMLAVLSACADLGALELGEWIHNYIEKHGLCRIVPLYNALIDMYAKSGNIRRALEVFEIMKHKSVITWSTMVAALALHGLGGEAID
MFLRMEKARVRPNEVTFIAILSACSHVGMVDVGRYYFDRMQSIYKIEPKIEHYGCMIDLLARAGYLQEAQKLLQDMPFEANAAIWGSLLAASNIHRDAELAEQALRHLAK
LEPENSGNYTLLSNTYLMRNAGVKKAPGGSFIEINNRVYEFLAGDKSGSQLQGIYEVLCKIIVQLKMAGLLQEEWSKFLDYDE