; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh20G003610 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh20G003610
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionPentatricopeptide repeat-containing protein, chloroplastic
Genome locationCmo_Chr20:1772111..1778127
RNA-Seq ExpressionCmoCh20G003610
SyntenyCmoCh20G003610
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6570645.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]2.1e-18585.93Show/hide
Query:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
        MGDGFFEAIEELERM RDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNV DVVDLLVD
Subjt:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD

Query:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
        MDCV                      E+ I FV+  L                         + VDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
Subjt:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK

Query:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
        ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Subjt:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI

Query:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
        VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
Subjt:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL

Query:  IGPSL
        IGPSL
Subjt:  IGPSL

KAG7010495.1 Pentatricopeptide repeat-containing protein, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]5.6e-18686.17Show/hide
Query:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
        MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNV DVVDLLVD
Subjt:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD

Query:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
        MDCV                      E+ I FV+  L                         + VDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
Subjt:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK

Query:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
        ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Subjt:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI

Query:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
        VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
Subjt:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL

Query:  IGPSL
        IGPSL
Subjt:  IGPSL

XP_022944005.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita moschata]8.6e-18786.42Show/hide
Query:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
        MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
Subjt:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD

Query:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
        MDCV                      E+ I FV+  L                         + VDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
Subjt:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK

Query:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
        ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Subjt:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI

Query:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
        VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
Subjt:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL

Query:  IGPSL
        IGPSL
Subjt:  IGPSL

XP_022986849.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita maxima]2.4e-18485.43Show/hide
Query:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
        MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNV DVVDLLVD
Subjt:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD

Query:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
        MDCV                      E+ I FV+  L                         + VDGDYRGAVKMVLNLRESGLKPEVYC+LIAMTAVVK
Subjt:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK

Query:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
        ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEG SSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Subjt:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI

Query:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
        VLAICASQKETRAMNRLL+RIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
Subjt:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL

Query:  IGPSL
        IGPSL
Subjt:  IGPSL

XP_023512972.1 pentatricopeptide repeat-containing protein At2g30100, chloroplastic [Cucurbita pepo subsp. pepo]4.7e-18585.93Show/hide
Query:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
        MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNV DVVDLLVD
Subjt:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD

Query:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
        MDCV                      E+ I FV+  L                         + VDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
Subjt:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK

Query:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
        ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSH VVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Subjt:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI

Query:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
        VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
Subjt:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL

Query:  IGPSL
        IGPSL
Subjt:  IGPSL

TrEMBL top hitse value%identityAlignment
A0A0A0KC35 Uncharacterized protein3.2e-17179.01Show/hide
Query:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
        MGDGFFEAIEELERMTR+PSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEG+HNV DVVDLLVD
Subjt:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD

Query:  MDCV-----------------------ERQIFVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
        MDCV                       +   FV+  L                         + VDGDYRGAVKMVL+LRESGL+PEVY YLIAMTAVVK
Subjt:  MDCV-----------------------ERQIFVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK

Query:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
        ELNEFAKALRKLK YARDG VAELDK+NVELV +YQ+ELLADGV+LSNWVL+EG SS  GVVHERLLAMYICAGQG+EAERQLWEMKLVGKEADADLYDI
Subjt:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI

Query:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
        VLAICASQKET+AM RLLTRIEITSP +KKKSLTWLLRGYIKGGHFRDAA TLVKM+NLGFLPEYLDRVAVLQGLRK IREPE+V TYLDLCKCLSDANL
Subjt:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL

Query:  IGPSL
        IGPSL
Subjt:  IGPSL

A0A1S3CNE0 pentatricopeptide repeat-containing protein At2g30100, chloroplastic2.1e-17079.26Show/hide
Query:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
        MGDGFFEAIEELERMTR+PSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWI KLVEG+HNV DVVDLLVD
Subjt:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD

Query:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
        MDCV                      E+ I FV+  L                         + VDGDYRGAVKMVL+LRESGL+PEVY YLIAMTAVVK
Subjt:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK

Query:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
        ELNEFAKALRKLKSYARDG VAELDK+NVELV +YQ+ELLADGVRLSNWVL+EG SS HGVVHERLLAMYICAGQG+EAERQLWEMKL+GKEADADLYDI
Subjt:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI

Query:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
        VLAICASQKE +AM RLLTRIEITSP +KKKSLTWLLRGYIKGGHFRDAA T+VKM+NLGFLPEYLDRVAVLQGLRK IREPE V TYLDLCKCLSDANL
Subjt:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL

Query:  IGPSL
        IGPSL
Subjt:  IGPSL

A0A5A7VNN8 Pentatricopeptide repeat-containing protein2.1e-17079.26Show/hide
Query:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
        MGDGFFEAIEELERMTR+PSDVLEEMNDRLSARE QLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWI KLVEG+HNV DVVDLLVD
Subjt:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD

Query:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
        MDCV                      E+ I FV+  L                         + VDGDYRGAVKMVL+LRESGL+PEVY YLIAMTAVVK
Subjt:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK

Query:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
        ELNEFAKALRKLKSYARDG VAELDK+NVELV +YQ+ELLADGVRLSNWVL+EG SS HGVVHERLLAMYICAGQG+EAERQLWEMKL+GKEADADLYDI
Subjt:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI

Query:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
        VLAICASQKE +AM RLLTRIEITSP +KKKSLTWLLRGYIKGGHFRDAA T+VKM+NLGFLPEYLDRVAVLQGLRK IREPE V TYLDLCKCLSDANL
Subjt:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL

Query:  IGPSL
        IGPSL
Subjt:  IGPSL

A0A6J1FYE9 pentatricopeptide repeat-containing protein At2g30100, chloroplastic4.2e-18786.42Show/hide
Query:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
        MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
Subjt:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD

Query:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
        MDCV                      E+ I FV+  L                         + VDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
Subjt:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK

Query:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
        ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Subjt:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI

Query:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
        VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
Subjt:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL

Query:  IGPSL
        IGPSL
Subjt:  IGPSL

A0A6J1JH85 pentatricopeptide repeat-containing protein At2g30100, chloroplastic1.1e-18485.43Show/hide
Query:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
        MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNV DVVDLLVD
Subjt:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD

Query:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK
        MDCV                      E+ I FV+  L                         + VDGDYRGAVKMVLNLRESGLKPEVYC+LIAMTAVVK
Subjt:  MDCV----------------------ERQI-FVEVTL-------------------------VKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVK

Query:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
        ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEG SSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI
Subjt:  ELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDI

Query:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
        VLAICASQKETRAMNRLL+RIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL
Subjt:  VLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCKCLSDANL

Query:  IGPSL
        IGPSL
Subjt:  IGPSL

SwissProt top hitse value%identityAlignment
P0C8A0 Pentatricopeptide repeat-containing protein At3g497308.6e-0418.88Show/hide
Query:  GDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERL
        G    A  ++ ++R+ G +P V CY + + A+ +      +A+R      R G  A++      +    +  ++  G  + + +  +G   S  V + ++
Subjt:  GDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERL

Query:  LAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFL--PE
        +  +    Q  E    + +MK  G   D  +Y++V+ +     E +   RL   +E         +   ++ G+   G   +A     +MV+ G    P+
Subjt:  LAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFL--PE

Query:  YLDRVAVLQGLRKRIREPENVETYLDLCKCLSD
        Y      L+ L   +   + +E   D+  C+S+
Subjt:  YLDRVAVLQGLRKRIREPENVETYLDLCKCLSD

Q0WNN7 Pentatricopeptide repeat-containing protein At2g30100, chloroplastic2.3e-12656.55Show/hide
Query:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
        +G+GFFEAIEELERMTR+PSD+LEEMN RLS+RE QL+LVYF+QEGRDSWC LEVFEWL+KENRVD+E MELMVSIMC W+KKL+E + N   V DLL++
Subjt:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD

Query:  MDCV-----------------------ERQIFVEVTLVK------------------------------VDGDYRGAVKMVLNLRESGLKPEVYCYLIAM
        MDCV                          +FV+  L +                              VDGDYR AV MV+ LR SGLKPE Y YLIAM
Subjt:  MDCV-----------------------ERQIFVEVTLVK------------------------------VDGDYRGAVKMVLNLRESGLKPEVYCYLIAM

Query:  TAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEG--GSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEA
        TA+VKELN   K LR+LK +AR G VAE+D  +  L+++YQSE L+ G++L+ W ++EG    S  GVVHERLLAMYICAG+G EAE+QLW+MKL G+E 
Subjt:  TAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEG--GSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEA

Query:  DADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCK
        +ADL+DIV+AICASQKE  A++RLLTR+E    + KKK+L+WLLRGY+KGGHF +AAETLV M++ G  PEY+DRVAV+QG+ ++I+ P +VE Y+ LCK
Subjt:  DADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCK

Query:  CLSDANLIGPSL
         L DA L+GP L
Subjt:  CLSDANLIGPSL

Q0WVV0 Pentatricopeptide repeat-containing protein At1g10910, chloroplastic8.0e-1023.4Show/hide
Query:  AIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCAL-EVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCVER
        AI E++R + D    L+ +   L  ++  ++L  F   GR  W  L ++FEW+Q+  ++        VS   S IK +  G  NV   +++   +     
Subjt:  AIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCAL-EVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCVER

Query:  QIFVEV------TLVKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSN
        +I V +       LVK +G     +K+   ++  GLKP+V  Y   +   +K  N + KA                    +EL+     EL  +G+++ +
Subjt:  QIFVEV------TLVKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSN

Query:  WVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRD
                    V++  +LA+    G+  EAE  + +MK+ G   +   Y  +L   + + + +  + L+T ++       K  +T LL+ YIKGG F  
Subjt:  WVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRD

Query:  AAETLVKMVNLGFLPEYLDRVAVLQGLRK
        + E L ++ + G+    +    ++ GL K
Subjt:  AAETLVKMVNLGFLPEYLDRVAVLQGLRK

Arabidopsis top hitse value%identityAlignment
AT1G10910.1 Pentatricopeptide repeat (PPR) superfamily protein5.7e-1123.4Show/hide
Query:  AIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCAL-EVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCVER
        AI E++R + D    L+ +   L  ++  ++L  F   GR  W  L ++FEW+Q+  ++        VS   S IK +  G  NV   +++   +     
Subjt:  AIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCAL-EVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCVER

Query:  QIFVEV------TLVKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSN
        +I V +       LVK +G     +K+   ++  GLKP+V  Y   +   +K  N + KA                    +EL+     EL  +G+++ +
Subjt:  QIFVEV------TLVKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSN

Query:  WVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRD
                    V++  +LA+    G+  EAE  + +MK+ G   +   Y  +L   + + + +  + L+T ++       K  +T LL+ YIKGG F  
Subjt:  WVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRD

Query:  AAETLVKMVNLGFLPEYLDRVAVLQGLRK
        + E L ++ + G+    +    ++ GL K
Subjt:  AAETLVKMVNLGFLPEYLDRVAVLQGLRK

AT1G74580.1 Pentatricopeptide repeat (PPR) superfamily protein6.7e-0423.38Show/hide
Query:  VRDVVDLLVDMDC--VERQIFVEVTLVKV---DGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVK
        V++ V++   MD    E  +F    ++ V    G +  A K+ + +R+ G+ P+VY + I M +  K     A ALR L + +  G    +      +  
Subjt:  VRDVVDLLVDMDC--VERQIFVEVTLVKV---DGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVK

Query:  RYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSL
         Y+    A+G  L   +L  G S      + +LL +    G   E E+ L ++   G   +   Y++ +     + E     R++  +    P+    + 
Subjt:  RYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSL

Query:  TWLLRGYIKGGHFRDAAETLVKMVNLGFLPE
          L+ G  K   F++A   L KMVN G  P+
Subjt:  TWLLRGYIKGGHFRDAAETLVKMVNLGFLPE

AT2G30100.1 pentatricopeptide (PPR) repeat-containing protein1.7e-12756.55Show/hide
Query:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD
        +G+GFFEAIEELERMTR+PSD+LEEMN RLS+RE QL+LVYF+QEGRDSWC LEVFEWL+KENRVD+E MELMVSIMC W+KKL+E + N   V DLL++
Subjt:  MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVD

Query:  MDCV-----------------------ERQIFVEVTLVK------------------------------VDGDYRGAVKMVLNLRESGLKPEVYCYLIAM
        MDCV                          +FV+  L +                              VDGDYR AV MV+ LR SGLKPE Y YLIAM
Subjt:  MDCV-----------------------ERQIFVEVTLVK------------------------------VDGDYRGAVKMVLNLRESGLKPEVYCYLIAM

Query:  TAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEG--GSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEA
        TA+VKELN   K LR+LK +AR G VAE+D  +  L+++YQSE L+ G++L+ W ++EG    S  GVVHERLLAMYICAG+G EAE+QLW+MKL G+E 
Subjt:  TAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEG--GSSSHGVVHERLLAMYICAGQGLEAERQLWEMKLVGKEA

Query:  DADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCK
        +ADL+DIV+AICASQKE  A++RLLTR+E    + KKK+L+WLLRGY+KGGHF +AAETLV M++ G  PEY+DRVAV+QG+ ++I+ P +VE Y+ LCK
Subjt:  DADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKRIREPENVETYLDLCK

Query:  CLSDANLIGPSL
         L DA L+GP L
Subjt:  CLSDANLIGPSL

AT3G49730.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.1e-0518.88Show/hide
Query:  GDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERL
        G    A  ++ ++R+ G +P V CY + + A+ +      +A+R      R G  A++      +    +  ++  G  + + +  +G   S  V + ++
Subjt:  GDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERL

Query:  LAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFL--PE
        +  +    Q  E    + +MK  G   D  +Y++V+ +     E +   RL   +E         +   ++ G+   G   +A     +MV+ G    P+
Subjt:  LAMYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFL--PE

Query:  YLDRVAVLQGLRKRIREPENVETYLDLCKCLSD
        Y      L+ L   +   + +E   D+  C+S+
Subjt:  YLDRVAVLQGLRKRIREPENVETYLDLCKCLSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAACGAATGACGAGGGATCCATCGGATGTTCTTGAAGAAATGAACGACCGCCTATCGGCCAGGGAATTTCAGCT
AGTGCTGGTGTACTTCTCTCAAGAAGGGAGGGATTCGTGGTGTGCTCTTGAGGTTTTTGAGTGGCTCCAAAAGGAAAATCGGGTCGACAAGGAGACCATGGAGCTGATGG
TGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTCGAGGGACAACATAACGTCAGAGATGTGGTTGACCTTCTCGTGGATATGGATTGTGTAGAGCGTCAGATTTTCGTT
GAGGTGACTCTGGTTAAGGTTGATGGTGACTATAGGGGTGCAGTGAAAATGGTGCTGAATCTCAGAGAATCTGGATTAAAGCCAGAGGTTTACTGCTATCTTATTGCCAT
GACTGCCGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTTCGCAAACTGAAAAGTTACGCAAGAGACGGGATGGTGGCTGAACTCGATAAAGACAATGTCGAACTTG
TCAAGAGGTATCAGTCAGAGCTTCTAGCTGATGGAGTACGGTTATCCAACTGGGTGCTTGACGAGGGAGGCTCTTCGAGTCACGGGGTGGTTCATGAGAGACTCCTTGCA
ATGTACATTTGTGCTGGGCAAGGACTAGAGGCAGAGCGGCAGCTTTGGGAAATGAAGCTTGTAGGTAAGGAGGCTGATGCCGATCTCTACGATATCGTGCTAGCCATATG
TGCTTCACAGAAGGAGACAAGAGCAATGAACCGGTTGCTTACCAGGATTGAGATTACGAGTCCCCGGCTTAAGAAGAAGAGTTTAACATGGCTACTAAGGGGTTACATAA
AAGGAGGTCATTTCCGTGATGCTGCAGAAACATTAGTAAAAATGGTCAATTTGGGTTTTCTCCCTGAGTACTTGGACAGAGTAGCTGTGCTGCAAGGGCTTAGAAAACGG
ATTCGGGAACCTGAAAACGTCGAGACTTACCTCGATCTCTGCAAGTGTCTCTCTGATGCTAATCTTATTGGACCCAGTCTTGAGCTCGACTGTCGCGGATCCCAACTTTG
CTGCTGTTTCTTCACTCCAGTAGCTGAAATCAGACAACTTGGCAGCATAGTCTTCGGTCAGACAGTGATGTCCGAAAAGGATCCGATTATGTTGCTGAAGTCTTCACAAG
CTGCTTCGAGCTCGGATCGCCTGAGCTTCGGGACACCTGACATGACAACGGAGTTAATGGTGGATCTGAAGGCAATAATGGAGACGAAGAAGCAAACGAAGATGGCGGTA
ATGAAACGAACGAATGTCGCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGATGGGTTTTTTGAGGCAATTGAGGAATTGGAACGAATGACGAGGGATCCATCGGATGTTCTTGAAGAAATGAACGACCGCCTATCGGCCAGGGAATTTCAGCT
AGTGCTGGTGTACTTCTCTCAAGAAGGGAGGGATTCGTGGTGTGCTCTTGAGGTTTTTGAGTGGCTCCAAAAGGAAAATCGGGTCGACAAGGAGACCATGGAGCTGATGG
TGTCTATAATGTGTAGTTGGATCAAGAAGTTGGTCGAGGGACAACATAACGTCAGAGATGTGGTTGACCTTCTCGTGGATATGGATTGTGTAGAGCGTCAGATTTTCGTT
GAGGTGACTCTGGTTAAGGTTGATGGTGACTATAGGGGTGCAGTGAAAATGGTGCTGAATCTCAGAGAATCTGGATTAAAGCCAGAGGTTTACTGCTATCTTATTGCCAT
GACTGCCGTGGTTAAAGAGCTGAATGAATTTGCAAAAGCTCTTCGCAAACTGAAAAGTTACGCAAGAGACGGGATGGTGGCTGAACTCGATAAAGACAATGTCGAACTTG
TCAAGAGGTATCAGTCAGAGCTTCTAGCTGATGGAGTACGGTTATCCAACTGGGTGCTTGACGAGGGAGGCTCTTCGAGTCACGGGGTGGTTCATGAGAGACTCCTTGCA
ATGTACATTTGTGCTGGGCAAGGACTAGAGGCAGAGCGGCAGCTTTGGGAAATGAAGCTTGTAGGTAAGGAGGCTGATGCCGATCTCTACGATATCGTGCTAGCCATATG
TGCTTCACAGAAGGAGACAAGAGCAATGAACCGGTTGCTTACCAGGATTGAGATTACGAGTCCCCGGCTTAAGAAGAAGAGTTTAACATGGCTACTAAGGGGTTACATAA
AAGGAGGTCATTTCCGTGATGCTGCAGAAACATTAGTAAAAATGGTCAATTTGGGTTTTCTCCCTGAGTACTTGGACAGAGTAGCTGTGCTGCAAGGGCTTAGAAAACGG
ATTCGGGAACCTGAAAACGTCGAGACTTACCTCGATCTCTGCAAGTGTCTCTCTGATGCTAATCTTATTGGACCCAGTCTTGAGCTCGACTGTCGCGGATCCCAACTTTG
CTGCTGTTTCTTCACTCCAGTAGCTGAAATCAGACAACTTGGCAGCATAGTCTTCGGTCAGACAGTGATGTCCGAAAAGGATCCGATTATGTTGCTGAAGTCTTCACAAG
CTGCTTCGAGCTCGGATCGCCTGAGCTTCGGGACACCTGACATGACAACGGAGTTAATGGTGGATCTGAAGGCAATAATGGAGACGAAGAAGCAAACGAAGATGGCGGTA
ATGAAACGAACGAATGTCGCATAG
Protein sequenceShow/hide protein sequence
MGDGFFEAIEELERMTRDPSDVLEEMNDRLSAREFQLVLVYFSQEGRDSWCALEVFEWLQKENRVDKETMELMVSIMCSWIKKLVEGQHNVRDVVDLLVDMDCVERQIFV
EVTLVKVDGDYRGAVKMVLNLRESGLKPEVYCYLIAMTAVVKELNEFAKALRKLKSYARDGMVAELDKDNVELVKRYQSELLADGVRLSNWVLDEGGSSSHGVVHERLLA
MYICAGQGLEAERQLWEMKLVGKEADADLYDIVLAICASQKETRAMNRLLTRIEITSPRLKKKSLTWLLRGYIKGGHFRDAAETLVKMVNLGFLPEYLDRVAVLQGLRKR
IREPENVETYLDLCKCLSDANLIGPSLELDCRGSQLCCCFFTPVAEIRQLGSIVFGQTVMSEKDPIMLLKSSQAASSSDRLSFGTPDMTTELMVDLKAIMETKKQTKMAV
MKRTNVA