CuGenDBv2

Sgr008138 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr008138
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Pentatricopeptide repeat-containing protein
Genome location: tig00006406:72723..74749
RNA-Seq Expression: Sgr008138
Synteny: Sgr008138
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR044657 - Pentatricopeptide repeat-containing protein NFD5-like
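
The genome location field follows a `contig:start..end` pattern. A minimal sketch of reading it, assuming the coordinates are 1-based and inclusive (a common convention that this record does not state); the helper names `parse_location` and `span_length` are hypothetical:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    contig, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return contig, start, end


def span_length(start: int, end: int) -> int:
    """Length of the span, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


contig, start, end = parse_location("tig00006406:72723..74749")
print(contig, start, end, span_length(start, end))
```

Under that assumption the gene spans 2027 bp on the contig, which is longer than the CDS listed under Sequences.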


Homology
GenBank top hits (e value, % identity, alignment)
KAG6582566.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]; e value 7.7e-167; identity 86.78%
Query:  TQKSKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEK
        T +S  EKP FRWVEVGSDITE QKQAISQLPPKMTKRCKA+M+QIICFSPQ GNLSD+LAAWVRIMKPKRADWLSVLKHLR+L+HPLYIEVA+AAL+E+
Subjt:  TQKSKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEK

Query:  TFEASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILK
        TFEASTRDYTKIIHYYGKRNQL+DAE++LL MKER FACDQITLTTMIHIYSKAD+L LAKQTFE++KLLE+ LD+RSY AMIMA+IRAGMP+EGE ILK
Subjt:  TFEASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILK

Query:  EMDAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTA
        EMD K+IYAGSEVYKALLRAYSM  NAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQS++A+IAFDNMR+AGLEPSDKCIAL+L+AYEKENRLN A
Subjt:  EMDAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTA

Query:  LELLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS
        LELLIDLEKE LMVGKEASE LAAW KRLGVVEEVELVLREYA KEAS
Subjt:  LELLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS

TYK17533.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]; e value 1.4e-165; identity 85.55%
Query:  KSKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTF
        +S+ EKP FRWVEVG +ITETQKQAISQLPPKMTK+CKA+M+QIICFSPQKG LSD+LAAWVRIMKP+RADWLSVLKHLR+LNHPLYI+VA+AAL+E TF
Subjt:  KSKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTF

Query:  EASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEM
        EA+TRDYTKIIH+YGK+NQL+DAEKVLL+M+ER FACDQITLTTMIHIYSKADKL LAKQTFEELKLLEQSLDKRSYGAMIMAY+RAG+P+EGE ILKEM
Subjt:  EASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEM

Query:  DAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALE
        DAK+IYAGSEVYKALLRAYSM G+AEGAQRVFDAIQLAAIPPD+KLCGLL+NAYLMAGQSR+A+IAFDNMR+AG+EPSDKCIAL L+AYEKENRLN ALE
Subjt:  DAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALE

Query:  LLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS
        LLIDLEK+N+MVGKEAS+ LAAW KRLGVVEE+E+VLREY AKE S
Subjt:  LLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS

XP_022147089.1 pentatricopeptide repeat-containing protein At1g01970 [Momordica charantia]; e value 4.3e-170; identity 89.57%
Query:  SKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFE
        S+IEK  FRWVEVGSDITETQKQAIS+LPPKM KRCKALMRQIICFSPQKGNLSDLL AWVRIMKPKRADWL VLKHLRL NHP YIEVA+AALLEKTFE
Subjt:  SKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFE

Query:  ASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMD
        ASTRD+TKIIHYYGK+N+L+DAEK+LLSMKE+ FACDQITLTTM+HIYSKADKL LAKQTFEELKLLEQ LDKRSYGAMIMAYIRAGMP EGE IL+EMD
Subjt:  ASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMD

Query:  AKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALEL
        AKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPD+KLCGLLINAYLMAGQ+++ RI+FDNMRKAGLEP DKCIAL+LAAYEKENRLN ALEL
Subjt:  AKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALEL

Query:  LIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS
        LIDLEKENLMVGKEASE LAAWFKRLGV+EEVELVLREYA K AS
Subjt:  LIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS

XP_022924339.1 pentatricopeptide repeat-containing protein At1g01970 [Cucurbita moschata]; e value 1.1e-165; identity 87.43%
Query:  EKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEAST
        EKP FRW+EVGSDITE QKQAISQLPPKMTKRCKA+M+QIICFSPQ GNLSD+LAAWVRIMKPKRADWLSVLKHLR+L+HPLYIEVA+AAL+E+TFEAST
Subjt:  EKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEAST

Query:  RDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKE
        RDYTKIIHYYGKRNQL+DAE++LLSM+ER FACDQITLTTMIHIYSKAD+L LAKQTFEELKLLE+ LD+RSY AMIMA+IRAGMP+EGE ILKEMD K+
Subjt:  RDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKE

Query:  IYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLID
        IYAGSEVYKALLRAYSM  NAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMA QS++A+IAFDNMR+AGLEPSDKCIAL+L+AYEKENRLN ALELLID
Subjt:  IYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLID

Query:  LEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS
        LEKE L+VGKEASE LAAW KRLGVVEEVELVLREYA KEAS
Subjt:  LEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS

XP_023526280.1 pentatricopeptide repeat-containing protein At1g01970 isoform X1 [Cucurbita pepo subsp. pepo]; e value 2.6e-167; identity 87.07%
Query:  TQKSKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEK
        T +S  EKP FRWVEVGSDITE QKQAISQLPPKMTKRCKA+M+QIICFSPQ GNLSD+LAAWVRIMKPKRADWLSVLKHLR+L+HPLYIEVA+AAL+E+
Subjt:  TQKSKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEK

Query:  TFEASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILK
        TFEASTRDYTKIIHYYGKRNQL+DAE++LLSM+ER FACDQITLTTMIHIYSKAD+L LAKQTFEELKLLE+ LD+RSY AMIMA+IRAGMP+EGE ILK
Subjt:  TFEASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILK

Query:  EMDAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTA
        EMD K+IYAGSEVYKALLRAYSM  NA+GAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQS++A+IAFDNMR+AGLEPSDKCIAL+L+AYEKENRLN A
Subjt:  EMDAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTA

Query:  LELLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS
        LELLIDLEKE LMVGKEASE LAAW KRLGVVEEVELVLREYA KEAS
Subjt:  LELLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS
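
The alignment blocks above use the usual BLAST pairwise layout, in which the unlabelled middle line repeats a letter where query and subject residues are identical, shows `+` for a conservative substitution, and is blank for a mismatch. A minimal sketch of how a percent-identity figure can be recomputed from that middle line, assuming identity is defined as identical columns divided by alignment length; the function name `percent_identity` is hypothetical:

```python
def percent_identity(midline: str, aligned_length: int) -> float:
    """Percent of aligned columns whose midline character is a letter.

    ASSUMPTION: letters mark identical residues, '+' marks conservative
    substitutions, and spaces mark mismatches or gaps.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy five-column alignment: three identities, one '+' and one mismatch.
print(percent_identity("AC +E", 5))  # 60.0
```

For a hit whose alignment wraps over several blocks, the midlines would be concatenated first and the total number of aligned columns passed as `aligned_length`.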

TrEMBL top hits (e value, % identity, alignment)
A0A1S3AWA7 pentatricopeptide repeat-containing protein At1g01970; e value 1.6e-165; identity 85.26%
Query:  KSKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTF
        +S+ EKP FRWVEVG +ITETQKQAISQLPPKMTK+CKA+M+QIICFSPQKG LSD+LAAWVRIMKP+RADWLSVLKHLR+LNHPLYI+VA+AAL+E TF
Subjt:  KSKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTF

Query:  EASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEM
        EA+TRDYTKIIH+YGK+NQL+DAEKVLL+M+ER FACDQITLTTMIHIYSKADKL LAKQTFEELKLLEQSLDKRSYGAMIMAY+RAG+P+EGE ILKEM
Subjt:  EASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEM

Query:  DAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALE
        DAK+IYAGSEVYKALLRAYSM G+AEGAQRVFDAIQLAAIPPD+KLCGLL+NAYLMAGQSR+A+IAFDNMR+AG+EPSDKCIAL L+AYEKENRLN ALE
Subjt:  DAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALE

Query:  LLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS
        LLIDLEK+N+MVGKEAS+ LAAW KRLGVVEE+E+VLREY AKE +
Subjt:  LLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS

A0A5D3D032 Pentatricopeptide repeat-containing protein; e value 7.0e-166; identity 85.55%
Query:  KSKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTF
        +S+ EKP FRWVEVG +ITETQKQAISQLPPKMTK+CKA+M+QIICFSPQKG LSD+LAAWVRIMKP+RADWLSVLKHLR+LNHPLYI+VA+AAL+E TF
Subjt:  KSKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTF

Query:  EASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEM
        EA+TRDYTKIIH+YGK+NQL+DAEKVLL+M+ER FACDQITLTTMIHIYSKADKL LAKQTFEELKLLEQSLDKRSYGAMIMAY+RAG+P+EGE ILKEM
Subjt:  EASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEM

Query:  DAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALE
        DAK+IYAGSEVYKALLRAYSM G+AEGAQRVFDAIQLAAIPPD+KLCGLL+NAYLMAGQSR+A+IAFDNMR+AG+EPSDKCIAL L+AYEKENRLN ALE
Subjt:  DAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALE

Query:  LLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS
        LLIDLEK+N+MVGKEAS+ LAAW KRLGVVEE+E+VLREY AKE S
Subjt:  LLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS

A0A6J1D001 pentatricopeptide repeat-containing protein At1g01970; e value 2.1e-170; identity 89.57%
Query:  SKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFE
        S+IEK  FRWVEVGSDITETQKQAIS+LPPKM KRCKALMRQIICFSPQKGNLSDLL AWVRIMKPKRADWL VLKHLRL NHP YIEVA+AALLEKTFE
Subjt:  SKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFE

Query:  ASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMD
        ASTRD+TKIIHYYGK+N+L+DAEK+LLSMKE+ FACDQITLTTM+HIYSKADKL LAKQTFEELKLLEQ LDKRSYGAMIMAYIRAGMP EGE IL+EMD
Subjt:  ASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMD

Query:  AKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALEL
        AKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPD+KLCGLLINAYLMAGQ+++ RI+FDNMRKAGLEP DKCIAL+LAAYEKENRLN ALEL
Subjt:  AKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALEL

Query:  LIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS
        LIDLEKENLMVGKEASE LAAWFKRLGV+EEVELVLREYA K AS
Subjt:  LIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS

A0A6J1EER0 pentatricopeptide repeat-containing protein At1g01970; e value 5.4e-166; identity 87.43%
Query:  EKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEAST
        EKP FRW+EVGSDITE QKQAISQLPPKMTKRCKA+M+QIICFSPQ GNLSD+LAAWVRIMKPKRADWLSVLKHLR+L+HPLYIEVA+AAL+E+TFEAST
Subjt:  EKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEAST

Query:  RDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKE
        RDYTKIIHYYGKRNQL+DAE++LLSM+ER FACDQITLTTMIHIYSKAD+L LAKQTFEELKLLE+ LD+RSY AMIMA+IRAGMP+EGE ILKEMD K+
Subjt:  RDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKE

Query:  IYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLID
        IYAGSEVYKALLRAYSM  NAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMA QS++A+IAFDNMR+AGLEPSDKCIAL+L+AYEKENRLN ALELLID
Subjt:  IYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLID

Query:  LEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS
        LEKE L+VGKEASE LAAW KRLGVVEEVELVLREYA KEAS
Subjt:  LEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS

A0A6J1IR15 pentatricopeptide repeat-containing protein At1g01970; e value 9.1e-166; identity 87.13%
Query:  EKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEAST
        EKP FRWVEVGSDITE QKQAISQLPPKMTKRCKA+M+QIICF PQ G+LSD+LAAWVRIMKPKRADWLSVLKHLR+L+HPLYIEVA+AAL+E+TFEAST
Subjt:  EKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEAST

Query:  RDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKE
        RDYTKIIHYYGK+NQL+DAE++LLSM+ER FACDQITLTTMIHIYSKAD+L LAKQTFEELKLLE+ LD+RSY AMIMA++RAGMP+EGE ILKEMD K+
Subjt:  RDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKE

Query:  IYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLID
        IYAGSEVYKALLRAYSM  NAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQS++A+IAFDNMR+AGLEPSDKCIAL+L+AYEKENRLN ALELLID
Subjt:  IYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLID

Query:  LEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS
        LEKE LMVGKEASE LAAW KRLGVVEEVELVLREYA KEAS
Subjt:  LEKENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEAS

SwissProt top hits (e value, % identity, alignment)
O64624 Pentatricopeptide repeat-containing protein At2g18940, chloroplastic; e value 4.6e-13; identity 25.64%
Query:  YTKIIHYYGKRNQ-LKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEI
        Y  I+  +GK  +  +    VL  M+ +    D+ T +T++   ++   L  AK+ F ELK         +Y A++  + +AG+  E   +LKEM+    
Subjt:  YTKIIHYYGKRNQ-LKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEI

Query:  YAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDL
         A S  Y  L+ AY   G ++ A  V + +    + P+      +I+AY  AG+  EA   F +M++AG  P+      +L+   K++R N  +++L D+
Subjt:  YAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDL

Query:  EKENLMVGKEASERLAAWFKRLGVVEEVELVLRE
        +       +     + A     G+ + V  V RE
Subjt:  EKENLMVGKEASERLAAWFKRLGVVEEVELVLRE

O82178 Pentatricopeptide repeat-containing protein At2g35130; e value 3.7e-15; identity 24.11%
Query:  KRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYAGSEVYKAL
        ++   ++A  V   MK         T   MI++Y KA K Y++ + + E++  +   +  +Y A++ A+ R G+ ++ E I +++    +     VY AL
Subjt:  KRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYAGSEVYKAL

Query:  LRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEKENLMVGKE
        + +YS  G   GA  +F  +Q     PD     ++++AY  AG   +A   F+ M++ G+ P+ K   L+L+AY K   +     ++ ++ +  +     
Subjt:  LRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEKENLMVGKE

Query:  ASERLAAWFKRLGVVEEVELVLRE
            +   + RLG   ++E +L E
Subjt:  ASERLAAWFKRLGVVEEVELVLRE

Q940Z1 Pentatricopeptide repeat-containing protein At1g19525; e value 1.7e-36; identity 41.63%
Query:  MKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYAGSEVYKALLRAYSMTGNAEGAQ
        M +     D +T T ++H+YSK+     A + FE LK      D++ Y AMI+ Y+ AG P  GE ++KEM AKE+ A  EVY ALLRAY+  G+A GA 
Subjt:  MKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYAGSEVYKALLRAYSMTGNAEGAQ

Query:  RVFDAIQLAAIPP-DDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEKENLMVGKEASERLAAWFKRLG
         +  ++Q A+  P   +   L + AY  AGQ  +A+  FD MRK G +P DKCIA ++ AY+ EN L+ AL LL+ LEK+ + +G      L  W   LG
Subjt:  RVFDAIQLAAIPP-DDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEKENLMVGKEASERLAAWFKRLG

Query:  VVEEVELVL
        ++EE E +L
Subjt:  VVEEVELVL

Q9LFC5 Pentatricopeptide repeat-containing protein At5g01110; e value 6.0e-13; identity 25.96%
Query:  YTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIY
        Y  I+H   KR  L +A+K+   M ER    D  TLT +I  + K   L  A + F+++K     LD  +Y  ++  + + G  D  + I  +M +KEI 
Subjt:  YTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIY

Query:  AGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLE
             Y  L+ A    G+   A RV+D +    I P   +C  +I  Y  +G + +     + M   G  P       ++  + +E  ++ A  L+  +E
Subjt:  AGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLE

Query:  KE--NLMVGKEASERLAAWFKRLGVVEEVELVLRE
        +E   L+        +   F R   ++E E+VLR+
Subjt:  KE--NLMVGKEASERLAAWFKRLGVVEEVELVLRE

Q9LPC4 Pentatricopeptide repeat-containing protein At1g01970; e value 6.4e-124; identity 63.91%
Query:  TFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEASTRDY
        +F W +VG ++TE Q +AI+++P KM+KRC+ALMRQIICFSP+KG+  DLL AW+R M P RADWLS+LK L+ L+ P YI+VA+ +LL+ +FEA+ RDY
Subjt:  TFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEASTRDY

Query:  TKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYA
        TKIIHYYGK NQ++DAE+ LLSMK R F  DQ+TLT M+ +YSKA    LA++TF E+KLL + LD RSYG+MIMAYIRAG+P++GE +L+EMD++EI A
Subjt:  TKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYA

Query:  GSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEK
        G EVYKALLR YSM G+AEGA+RVFDA+Q+A I PD KLCGLLINAY ++GQS+ AR+AF+NMRKAG++ +DKC+AL+LAAYEKE +LN AL  L++LEK
Subjt:  GSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEK

Query:  ENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEA
        +++M+GKEAS  LA WFK+LGVVEEVEL+LRE+++ ++
Subjt:  ENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEA

Arabidopsis top hits (e value, % identity, alignment)
AT1G01970.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 4.5e-125; identity 63.91%
Query:  TFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEASTRDY
        +F W +VG ++TE Q +AI+++P KM+KRC+ALMRQIICFSP+KG+  DLL AW+R M P RADWLS+LK L+ L+ P YI+VA+ +LL+ +FEA+ RDY
Subjt:  TFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEASTRDY

Query:  TKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYA
        TKIIHYYGK NQ++DAE+ LLSMK R F  DQ+TLT M+ +YSKA    LA++TF E+KLL + LD RSYG+MIMAYIRAG+P++GE +L+EMD++EI A
Subjt:  TKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYA

Query:  GSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEK
        G EVYKALLR YSM G+AEGA+RVFDA+Q+A I PD KLCGLLINAY ++GQS+ AR+AF+NMRKAG++ +DKC+AL+LAAYEKE +LN AL  L++LEK
Subjt:  GSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEK

Query:  ENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEA
        +++M+GKEAS  LA WFK+LGVVEEVEL+LRE+++ ++
Subjt:  ENLMVGKEASERLAAWFKRLGVVEEVELVLREYAAKEA

AT1G19520.1 pentatricopeptide (PPR) repeat-containing protein; e value 9.9e-64; identity 37.34%
Query:  YGKLRLQHSLSAS---SKTATGQWKCKNFV---------LPLLGRLNWAKTQKSKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFS
        Y  L  Q  L  S   S+T  G+ + + FV         + L  R    K +   +E    +WVE+   I E +++A  + P  +T +CK +M ++   S
Subjt:  YGKLRLQHSLSAS---SKTATGQWKCKNFV---------LPLLGRLNWAKTQKSKIEKPTFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFS

Query:  PQKG-NLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIH
         Q+G + S LLA W  +++P R DW++++  LR  N   Y++VA+  L EK+F AS  DY+K+IH + K N ++D E++L  M +     D +T T ++H
Subjt:  PQKG-NLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEASTRDYTKIIHYYGKRNQLKDAEKVLLSMKERDFACDQITLTTMIH

Query:  IYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPP-DDKL
        +YSK+     A + FE LK      D++ Y AMI+ Y+ AG P  GE ++KEM AKE+ A  EVY ALLRAY+  G+A GA  +  ++Q A+  P   + 
Subjt:  IYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPP-DDKL

Query:  CGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVL
          L + AY  AGQ  +A+  FD MRK G +P DKCIA ++ AY+ EN L+ AL LL+ LEK+ + +G      L  W   LG++EE E +L
Subjt:  CGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVL

AT2G18940.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 3.2e-14; identity 25.64%
Query:  YTKIIHYYGKRNQ-LKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEI
        Y  I+  +GK  +  +    VL  M+ +    D+ T +T++   ++   L  AK+ F ELK         +Y A++  + +AG+  E   +LKEM+    
Subjt:  YTKIIHYYGKRNQ-LKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEI

Query:  YAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDL
         A S  Y  L+ AY   G ++ A  V + +    + P+      +I+AY  AG+  EA   F +M++AG  P+      +L+   K++R N  +++L D+
Subjt:  YAGSEVYKALLRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDL

Query:  EKENLMVGKEASERLAAWFKRLGVVEEVELVLRE
        +       +     + A     G+ + V  V RE
Subjt:  EKENLMVGKEASERLAAWFKRLGVVEEVELVLRE

AT2G35130.1 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 2.7e-16; identity 24.11%
Query:  KRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYAGSEVYKAL
        ++   ++A  V   MK         T   MI++Y KA K Y++ + + E++  +   +  +Y A++ A+ R G+ ++ E I +++    +     VY AL
Subjt:  KRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYAGSEVYKAL

Query:  LRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEKENLMVGKE
        + +YS  G   GA  +F  +Q     PD     ++++AY  AG   +A   F+ M++ G+ P+ K   L+L+AY K   +     ++ ++ +  +     
Subjt:  LRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEKENLMVGKE

Query:  ASERLAAWFKRLGVVEEVELVLRE
            +   + RLG   ++E +L E
Subjt:  ASERLAAWFKRLGVVEEVELVLRE

AT2G35130.2 Tetratricopeptide repeat (TPR)-like superfamily protein; e value 2.7e-16; identity 24.11%
Query:  KRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYAGSEVYKAL
        ++   ++A  V   MK         T   MI++Y KA K Y++ + + E++  +   +  +Y A++ A+ R G+ ++ E I +++    +     VY AL
Subjt:  KRNQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYAGSEVYKAL

Query:  LRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEKENLMVGKE
        + +YS  G   GA  +F  +Q     PD     ++++AY  AG   +A   F+ M++ G+ P+ K   L+L+AY K   +     ++ ++ +  +     
Subjt:  LRAYSMTGNAEGAQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEKENLMVGKE

Query:  ASERLAAWFKRLGVVEEVELVLRE
            +   + RLG   ++E +L E
Subjt:  ASERLAAWFKRLGVVEEVELVLRE


Sequences
CDS sequence
ATGTCTGTTTTCTCTTCTTCCTTACATTTTTCGTCAGAGATTTTGGATTCTCATTTCGCCGTACTTGCAGATTGCAATATTCCCGCATTCTGCGATTCTTTTTGCCGCTT
TATCGAATTCGCGGAATTGGCTCACACTGGAGAACAACATTGTTATCTTTTTTGTACGCTCTTGAGTTTTCTAGTTGATTATGGGAAGCTACGCCTGCAACATTCTCTAT
CAGCATCATCCAAAACAGCCACTGGTCAATGGAAGTGTAAGAACTTCGTATTACCATTACTTGGGAGGCTCAATTGGGCAAAGACTCAGAAGAGTAAAATAGAGAAACCG
ACGTTTCGGTGGGTCGAGGTGGGCTCTGATATTACCGAAACGCAGAAGCAGGCCATATCTCAGCTTCCTCCGAAGATGACTAAAAGATGTAAGGCGCTAATGAGGCAAAT
TATCTGTTTCTCGCCTCAAAAAGGTAATTTATCAGATTTGTTGGCGGCTTGGGTGAGGATTATGAAGCCTAAAAGAGCTGATTGGCTTTCAGTTCTTAAGCATTTGAGGC
TTTTGAATCATCCACTTTACATCGAGGTGGCAGATGCTGCTCTTCTAGAGAAAACATTTGAAGCCAGTACTCGTGACTACACGAAGATTATTCATTACTATGGGAAGCGA
AACCAACTCAAAGATGCTGAAAAAGTTCTCTTAAGCATGAAAGAAAGGGATTTTGCTTGCGATCAAATAACTTTAACGACAATGATCCACATATATAGCAAGGCCGACAA
ACTTTATCTGGCCAAACAAACATTTGAAGAGCTCAAACTGCTTGAGCAATCGTTGGATAAAAGATCGTATGGTGCGATGATTATGGCGTATATCAGGGCTGGGATGCCTG
ATGAAGGAGAATGCATTCTCAAAGAAATGGATGCAAAAGAAATTTATGCAGGAAGTGAAGTTTACAAGGCCTTGTTAAGAGCATACTCCATGACTGGCAATGCTGAAGGA
GCCCAAAGGGTATTTGATGCAATTCAACTGGCTGCTATTCCTCCTGATGATAAGTTATGTGGCCTCCTTATCAATGCCTATCTGATGGCGGGTCAAAGCCGAGAGGCGCG
AATTGCATTTGACAATATGAGGAAGGCAGGTCTTGAACCTAGTGACAAATGCATAGCTTTGATATTAGCTGCATATGAAAAGGAGAATAGGCTAAACACAGCACTGGAAC
TTCTAATAGACTTGGAGAAGGAGAATCTCATGGTTGGGAAGGAAGCTTCAGAAAGATTGGCAGCCTGGTTTAAAAGACTAGGGGTTGTAGAAGAGGTAGAACTTGTCTTG
AGAGAATATGCTGCGAAAGAAGCCAGCAGCTAA
mRNA sequence
ATGTCTGTTTTCTCTTCTTCCTTACATTTTTCGTCAGAGATTTTGGATTCTCATTTCGCCGTACTTGCAGATTGCAATATTCCCGCATTCTGCGATTCTTTTTGCCGCTT
TATCGAATTCGCGGAATTGGCTCACACTGGAGAACAACATTGTTATCTTTTTTGTACGCTCTTGAGTTTTCTAGTTGATTATGGGAAGCTACGCCTGCAACATTCTCTAT
CAGCATCATCCAAAACAGCCACTGGTCAATGGAAGTGTAAGAACTTCGTATTACCATTACTTGGGAGGCTCAATTGGGCAAAGACTCAGAAGAGTAAAATAGAGAAACCG
ACGTTTCGGTGGGTCGAGGTGGGCTCTGATATTACCGAAACGCAGAAGCAGGCCATATCTCAGCTTCCTCCGAAGATGACTAAAAGATGTAAGGCGCTAATGAGGCAAAT
TATCTGTTTCTCGCCTCAAAAAGGTAATTTATCAGATTTGTTGGCGGCTTGGGTGAGGATTATGAAGCCTAAAAGAGCTGATTGGCTTTCAGTTCTTAAGCATTTGAGGC
TTTTGAATCATCCACTTTACATCGAGGTGGCAGATGCTGCTCTTCTAGAGAAAACATTTGAAGCCAGTACTCGTGACTACACGAAGATTATTCATTACTATGGGAAGCGA
AACCAACTCAAAGATGCTGAAAAAGTTCTCTTAAGCATGAAAGAAAGGGATTTTGCTTGCGATCAAATAACTTTAACGACAATGATCCACATATATAGCAAGGCCGACAA
ACTTTATCTGGCCAAACAAACATTTGAAGAGCTCAAACTGCTTGAGCAATCGTTGGATAAAAGATCGTATGGTGCGATGATTATGGCGTATATCAGGGCTGGGATGCCTG
ATGAAGGAGAATGCATTCTCAAAGAAATGGATGCAAAAGAAATTTATGCAGGAAGTGAAGTTTACAAGGCCTTGTTAAGAGCATACTCCATGACTGGCAATGCTGAAGGA
GCCCAAAGGGTATTTGATGCAATTCAACTGGCTGCTATTCCTCCTGATGATAAGTTATGTGGCCTCCTTATCAATGCCTATCTGATGGCGGGTCAAAGCCGAGAGGCGCG
AATTGCATTTGACAATATGAGGAAGGCAGGTCTTGAACCTAGTGACAAATGCATAGCTTTGATATTAGCTGCATATGAAAAGGAGAATAGGCTAAACACAGCACTGGAAC
TTCTAATAGACTTGGAGAAGGAGAATCTCATGGTTGGGAAGGAAGCTTCAGAAAGATTGGCAGCCTGGTTTAAAAGACTAGGGGTTGTAGAAGAGGTAGAACTTGTCTTG
AGAGAATATGCTGCGAAAGAAGCCAGCAGCTAA
Protein sequence
MSVFSSSLHFSSEILDSHFAVLADCNIPAFCDSFCRFIEFAELAHTGEQHCYLFCTLLSFLVDYGKLRLQHSLSASSKTATGQWKCKNFVLPLLGRLNWAKTQKSKIEKP
TFRWVEVGSDITETQKQAISQLPPKMTKRCKALMRQIICFSPQKGNLSDLLAAWVRIMKPKRADWLSVLKHLRLLNHPLYIEVADAALLEKTFEASTRDYTKIIHYYGKR
NQLKDAEKVLLSMKERDFACDQITLTTMIHIYSKADKLYLAKQTFEELKLLEQSLDKRSYGAMIMAYIRAGMPDEGECILKEMDAKEIYAGSEVYKALLRAYSMTGNAEG
AQRVFDAIQLAAIPPDDKLCGLLINAYLMAGQSREARIAFDNMRKAGLEPSDKCIALILAAYEKENRLNTALELLIDLEKENLMVGKEASERLAAWFKRLGVVEEVELVL
REYAAKEASS
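
The protein sequence corresponds to the CDS read codon by codon. A minimal sketch of that translation, assuming the standard nuclear genetic code (this record does not say which table was used); `translate` and the table constants are hypothetical names, not part of any database tooling:

```python
# Standard genetic code, laid out in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    first + second + third: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, first in enumerate(BASES)
    for j, second in enumerate(BASES)
    for k, third in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; any codon containing a
    character other than A, C, G or T raises KeyError.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First ten codons of the CDS listed above.
print(translate("ATGTCTGTTTTCTCTTCTTCCTTACATTTT"))  # MSVFSSSLHF
```

The CDS lines can be pasted in with their line breaks, since the function strips whitespace before reading codons.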