; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr026137 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr026137
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationtig00153031:2082346..2084504
RNA-Seq ExpressionSgr026137
SyntenySgr026137
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154319.1 pentatricopeptide repeat-containing protein At4g39952, mitochondrial isoform X1 [Momordica charantia]2.7e-17651.77Show/hide
Query:  MLRLRPRQCQI-------LTTL-------TVTTIFLSKHATISSSISS--PDYITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH---
        MLRLRP Q  I        ++L       +   IFLSK      S+       ITTGNS +VFFATKL+SFYASHRQP  S   FH     D   W+   
Subjt:  MLRLRPRQCQI-------LTTL-------TVTTIFLSKHATISSSISS--PDYITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH---

Query:  --------------------------------------------EHSW----VGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTA
                                                     H      + LKLG F  NSAVGSSLVYMYSKCGHTESAS LFDEIT+KDVVAWTA
Subjt:  --------------------------------------------EHSW----VGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTA

Query:  LIVGYVQNNESQK----------------------------------EGRCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWI
        LIVG+VQNNES+K                                  EGRCLHG+ LK+GI+C EV+KS LLS+YSRC SPEEAYRCFC LN+KDL+SW 
Subjt:  LIVGYVQNNESQK----------------------------------EGRCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWI

Query:  SIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCAVSRITQSPCTDRAIK----------------------
        SIIAVHAKFGLMT+CL+LFWEM+A EIIP+EIAISC ++G GNSDR SEGKAFHGWILRQC AVS IT +       K                      
Subjt:  SIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCAVSRITQSPCTDRAIK----------------------

Query:  ----------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ---------
                  GQKEKCID FR MQLLGI PDLISL  VISSC QVGAVNIGR IHCYAIKNS+I+NVSIAN+LLDMYGKSGNLTTAWRIF          
Subjt:  ----------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ---------

Query:  ------------------------------------------------------------------------------------DTAKGYYYLMEYIDFI
                                                                                            +T++  +  +E  D I
Subjt:  ------------------------------------------------------------------------------------DTAKGYYYLMEYIDFI

Query:  LH----ANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPD
        L     ++YGMHGHAESAIEIF+LMEES+VK N+LTF SL+SACNH GLV EGRR FDRMQKYGVK SLKH A MVDLLGRSGSLQEAETLVLSMPISPD
Subjt:  LH----ANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPD

Query:  GTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
        GTVWGSLLSACKLHNE+EMGVRIAR+AIESDPENDG
Subjt:  GTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

XP_022154323.1 pentatricopeptide repeat-containing protein At4g39952, mitochondrial isoform X3 [Momordica charantia]2.9e-17057.88Show/hide
Query:  VGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCL
        + LKLG F  NSAVGSSLVYMYSKCGHTESAS LFDEIT+KDVVAWTALIVG+VQNNES+K                                  EGRCL
Subjt:  VGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCL

Query:  HGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKA
        HG+ LK+GI+C EV+KS LLS+YSRC SPEEAYRCFC LN+KDL+SW SIIAVHAKFGLMT+CL+LFWEM+A EIIP+EIAISC ++G GNSDR SEGKA
Subjt:  HGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKA

Query:  FHGWILRQCCAVSRITQSPCTDRAIK--------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGR
        FHGWILRQC AVS IT +       K                                GQKEKCID FR MQLLGI PDLISL  VISSC QVGAVNIGR
Subjt:  FHGWILRQCCAVSRITQSPCTDRAIK--------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGR

Query:  FIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ-------------------------------------------------------------
         IHCYAIKNS+I+NVSIAN+LLDMYGKSGNLTTAWRIF                                                              
Subjt:  FIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ-------------------------------------------------------------

Query:  --------------------------------DTAKGYYYLMEYIDFILH----ANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAE
                                        +T++  +  +E  D IL     ++YGMHGHAESAIEIF+LMEES+VK N+LTF SL+SACNH GLV E
Subjt:  --------------------------------DTAKGYYYLMEYIDFILH----ANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAE

Query:  GRRHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
        GRR FDRMQKYGVK SLKH A MVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNE+EMGVRIAR+AIESDPENDG
Subjt:  GRRHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

XP_022986468.1 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mitochondrial-like [Cucurbita maxima]7.3e-16649.85Show/hide
Query:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH-----------------------------------------------EHSW----V
        ITTGNSN+ FFATKL++FYA H QP  S   F      D   W+                                                H      +
Subjt:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH-----------------------------------------------EHSW----V

Query:  GLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCLH
         LKLG F  NSAVGSSL+YMYSKCG+ ESAS +F++ITVKDVVAWTALI+GYVQNNES+K                                  EGRCLH
Subjt:  GLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCLH

Query:  GMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAF
        G+ LKSG +C EVVKSS+LS+YSRC SPEEAYRCF  L +KDL+SW SI+AVH+K GLM++CL+LFWEM+AS IIP++I ISC L+G GN DRISEGKAF
Subjt:  GMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAF

Query:  HGWILRQCCAVSRITQSPCTDRAIK-------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFI
        H WIL+QCCAVS IT +       K                               G+KEKCIDFFR+M LLGIEPDL SLV VISSCL VGAVNIGR +
Subjt:  HGWILRQCCAVSRITQSPCTDRAIK-------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFI

Query:  HCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ---------------------------------------------------------------
        HCYAIKNS+IDNVSIAN+LLDMYGKSGNLT AWRIF                                                                
Subjt:  HCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ---------------------------------------------------------------

Query:  ------------------------------DTAKGYYYLMEYIDFIL----HANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGR
                                      +T++  +  ME  D IL     +NYGMHGH ESAIEIFQLME+S++K NALTF SL+SACNH G + EGR
Subjt:  ------------------------------DTAKGYYYLMEYIDFIL----HANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGR

Query:  RHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
        R FD M KYG+KPSLKH A MVDLLGRSGSL+EAE LVLSMPI+PDGTVWGSLLSACKLHNE+EMG+RI R+AIESDP+NDG
Subjt:  RHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

XP_023513090.1 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mitochondrial-like [Cucurbita pepo subsp. pepo]4.3e-16650Show/hide
Query:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH-----------------------------------------------EHSW----V
        ITTGNSN+ FFATKL++FYA H QP  S   F      D   W+                                                H      +
Subjt:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH-----------------------------------------------EHSW----V

Query:  GLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCLH
         LKLG F  NSAVGSSL+YMYSKCG+ ESAS +F+EITVKDVVAWTALI+GYVQNNES+K                                  EGRCLH
Subjt:  GLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCLH

Query:  GMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAF
        G+ LKSG +C EVVKSS+LS+YSRC SPEEAYRCF  L +KDL+SW SIIAVH+K GLM++CL+LFWEM+AS IIP++I ISC L+G GN DRISEGKA 
Subjt:  GMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAF

Query:  HGWILRQCCAVSRITQSPCTDRAIK-------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFI
        H WIL+QCCA+S IT +       K                               G+KEKCIDFFR+M LLGIEPDL SLV VISSCL VGAVNIGR +
Subjt:  HGWILRQCCAVSRITQSPCTDRAIK-------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFI

Query:  HCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ---------------------------------------------------------------
        HCYAIKNS+IDNVSIAN+LLDMYGKSGNLT AWRIF                                                                
Subjt:  HCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ---------------------------------------------------------------

Query:  ------------------------------DTAKGYYYLMEYIDFIL----HANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGR
                                      +T++  +  ME  D IL     +NYGMHGH ESAIEIFQLME+S++K NALTF SL+SACNH G + EGR
Subjt:  ------------------------------DTAKGYYYLMEYIDFIL----HANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGR

Query:  RHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
        R FD M KYG+KPSLKH A MVDLLGRSGSL+EAE LVLSMPI+PDGTVWGSLLSACKLHNE+EMG+RIAR+AIESDP+NDG
Subjt:  RHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

XP_038901304.1 pentatricopeptide repeat-containing protein At4g39952, mitochondrial [Benincasa hispida]7.3e-16649.56Show/hide
Query:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH-----------------------------------------------EHSW----V
        ITTGNSN+VFFATKL++FYASHRQP  S   F L  + D   W+                                                H      +
Subjt:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH-----------------------------------------------EHSW----V

Query:  GLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCLH
         LKLG F  NSA+GSS +YMYSKCGH ESAS +F+EITVKDVVAWTALIVGYVQNNES +                                  EG+CLH
Subjt:  GLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCLH

Query:  GMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAF
        G+ LK G +C EV+KS++LS+YSRC SPEEAYRCF  +++KDL+SW SIIAVH+KFGLM+ CL+LFWEM+A EIIP+EI ISC LMG GNSDRI EGKAF
Subjt:  GMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAF

Query:  HGWILRQCCAVSRITQSPCTDRAIK-------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFI
        H WIL+QCCA+S ITQ+       K                               GQ EKCID  R+M +LG EPD  SLV VISSCLQVGA+NIGR +
Subjt:  HGWILRQCCAVSRITQSPCTDRAIK-------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFI

Query:  HCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ---------------------------------------------------------------
        HCYAIKNS+I+NVSIAN+L+DMYGKSGNLT  WRIF                                                                
Subjt:  HCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ---------------------------------------------------------------

Query:  ------------------------------DTAKGYYYLMEYIDFILH----ANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGR
                                      +T++  +  ME  D IL     +NYGMHG  ESA+EIFQLMEES++K NA TF SL+SACNHTG V EGR
Subjt:  ------------------------------DTAKGYYYLMEYIDFILH----ANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGR

Query:  RHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
          FDRMQKYGVKPSLKH A MVDLLGRSGSL+ AE LVLSMPI+PDGTVWGSLLSACK+HNE+EMGVRIARYAIESDP+NDG
Subjt:  RHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

TrEMBL top hitse value%identityAlignment
A0A1S3C9K9 pentatricopeptide repeat-containing protein At4g39952, mitochondrial isoform X16.9e-16248.68Show/hide
Query:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPW-------------------------------------------------HEHSWVGL
        ITTGNS++VFFATKL++FYASHRQP  S   F L  + D   W                                                 H  +  GL
Subjt:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPW-------------------------------------------------HEHSWVGL

Query:  --KLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCLH
          KLG F  NSA+GSS +YMYSKCGH ESAS +F EITVKDVVAWTALIVGYVQNNES +                                  EG+CLH
Subjt:  --KLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCLH

Query:  GMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAF
        G+ LK+G +C +VVKS++LS+YSRC SPEEAYRCFC L++KDL+SW SIIAVH+KFGLM++CL+LFWEM+ SEIIP+EI ISC LMG GNS RI EGKAF
Subjt:  GMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAF

Query:  HGWILRQCCAVSRITQSPCTDRAIK-------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFI
        H WIL+QCCA++ IT +       K                               GQKE CI F R+M LLG EPDL SLV VISSC QVGA+NIGR I
Subjt:  HGWILRQCCAVSRITQSPCTDRAIK-------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFI

Query:  HCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQDT-----------------------------------------------------------AK
        HCYAIKNS+I+NVSIAN+L+DMYGKSG++T  WRIF  T                                                            K
Subjt:  HCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQDT-----------------------------------------------------------AK

Query:  GYYYLME------------YIDF--------------------------ILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGR
         + Y+ E             ID                           ++ +NYGMHGH ESA+EIFQLMEES++K NA TF SL+SACNHTG V EGR
Subjt:  GYYYLME------------YIDF--------------------------ILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGR

Query:  RHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
          FDRM KYG++PSLKH A ++DLLGRSGSL+ AE LVLSMPI+PDGTVWGSLLSACK+HNE+EMGVR+ARYAIESDP+NDG
Subjt:  RHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

A0A6J1DK10 pentatricopeptide repeat-containing protein At4g39952, mitochondrial isoform X31.4e-17057.88Show/hide
Query:  VGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCL
        + LKLG F  NSAVGSSLVYMYSKCGHTESAS LFDEIT+KDVVAWTALIVG+VQNNES+K                                  EGRCL
Subjt:  VGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCL

Query:  HGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKA
        HG+ LK+GI+C EV+KS LLS+YSRC SPEEAYRCFC LN+KDL+SW SIIAVHAKFGLMT+CL+LFWEM+A EIIP+EIAISC ++G GNSDR SEGKA
Subjt:  HGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKA

Query:  FHGWILRQCCAVSRITQSPCTDRAIK--------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGR
        FHGWILRQC AVS IT +       K                                GQKEKCID FR MQLLGI PDLISL  VISSC QVGAVNIGR
Subjt:  FHGWILRQCCAVSRITQSPCTDRAIK--------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGR

Query:  FIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ-------------------------------------------------------------
         IHCYAIKNS+I+NVSIAN+LLDMYGKSGNLTTAWRIF                                                              
Subjt:  FIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ-------------------------------------------------------------

Query:  --------------------------------DTAKGYYYLMEYIDFILH----ANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAE
                                        +T++  +  +E  D IL     ++YGMHGHAESAIEIF+LMEES+VK N+LTF SL+SACNH GLV E
Subjt:  --------------------------------DTAKGYYYLMEYIDFILH----ANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAE

Query:  GRRHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
        GRR FDRMQKYGVK SLKH A MVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNE+EMGVRIAR+AIESDPENDG
Subjt:  GRRHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

A0A6J1DLQ3 pentatricopeptide repeat-containing protein At4g39952, mitochondrial isoform X11.3e-17651.77Show/hide
Query:  MLRLRPRQCQI-------LTTL-------TVTTIFLSKHATISSSISS--PDYITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH---
        MLRLRP Q  I        ++L       +   IFLSK      S+       ITTGNS +VFFATKL+SFYASHRQP  S   FH     D   W+   
Subjt:  MLRLRPRQCQI-------LTTL-------TVTTIFLSKHATISSSISS--PDYITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH---

Query:  --------------------------------------------EHSW----VGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTA
                                                     H      + LKLG F  NSAVGSSLVYMYSKCGHTESAS LFDEIT+KDVVAWTA
Subjt:  --------------------------------------------EHSW----VGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTA

Query:  LIVGYVQNNESQK----------------------------------EGRCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWI
        LIVG+VQNNES+K                                  EGRCLHG+ LK+GI+C EV+KS LLS+YSRC SPEEAYRCFC LN+KDL+SW 
Subjt:  LIVGYVQNNESQK----------------------------------EGRCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWI

Query:  SIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCAVSRITQSPCTDRAIK----------------------
        SIIAVHAKFGLMT+CL+LFWEM+A EIIP+EIAISC ++G GNSDR SEGKAFHGWILRQC AVS IT +       K                      
Subjt:  SIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCAVSRITQSPCTDRAIK----------------------

Query:  ----------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ---------
                  GQKEKCID FR MQLLGI PDLISL  VISSC QVGAVNIGR IHCYAIKNS+I+NVSIAN+LLDMYGKSGNLTTAWRIF          
Subjt:  ----------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ---------

Query:  ------------------------------------------------------------------------------------DTAKGYYYLMEYIDFI
                                                                                            +T++  +  +E  D I
Subjt:  ------------------------------------------------------------------------------------DTAKGYYYLMEYIDFI

Query:  LH----ANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPD
        L     ++YGMHGHAESAIEIF+LMEES+VK N+LTF SL+SACNH GLV EGRR FDRMQKYGVK SLKH A MVDLLGRSGSLQEAETLVLSMPISPD
Subjt:  LH----ANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPD

Query:  GTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
        GTVWGSLLSACKLHNE+EMGVRIAR+AIESDPENDG
Subjt:  GTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

A0A6J1FVR1 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mitochondrial-like1.9e-16449.85Show/hide
Query:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH-----------------------------------------------EHSW----V
        ITTGNSN+ FFATKL++FYA H QP  S   F      D   W+                                                H      +
Subjt:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH-----------------------------------------------EHSW----V

Query:  GLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCLH
         LKLG F  NSAVGSSL+YMYSKCG+ ESAS +F+EITVKDVVAWTALI+GYVQNNES+K                                  EGRCLH
Subjt:  GLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCLH

Query:  GMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAF
        G+ LKSG +C EVVKSS+LS+YSRC SPEEAYRCF  L +KDL+SW SIIAVH+K GLM++CL+LFWEM+AS IIP++I ISC L+G GN DRISEG AF
Subjt:  GMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAF

Query:  HGWILRQCCAVSRITQSPCTDRAIK-------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFI
        H WIL+QC AVS IT +       K                               G+KEKCIDFFR+M LLGIEPDL SLV VISSCL V AVNIGR +
Subjt:  HGWILRQCCAVSRITQSPCTDRAIK-------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFI

Query:  HCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQDT-----------------------------------------------------------AK
        HCYAIKNS+IDNVSIAN+LLDMYGKSGNLT AWRIF  T                                                            K
Subjt:  HCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQDT-----------------------------------------------------------AK

Query:  GYYYLME------------YIDF--------------------------ILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGR
         + Y+ E             ID                           ++ +NYGMHGH ESAIEIFQLME+S++K NALTF SL+SACNH G + EGR
Subjt:  GYYYLME------------YIDF--------------------------ILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGR

Query:  RHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
        R FD M KYG+KPSLKH A MVDLLGRSGSL+EAE LVLSMPI+PDGTVWGSLLSACKLHNE+EMG+RIAR+AIESDP+NDG
Subjt:  RHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

A0A6J1JGK6 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At4g39952, mitochondrial-like3.5e-16649.85Show/hide
Query:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH-----------------------------------------------EHSW----V
        ITTGNSN+ FFATKL++FYA H QP  S   F      D   W+                                                H      +
Subjt:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH-----------------------------------------------EHSW----V

Query:  GLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCLH
         LKLG F  NSAVGSSL+YMYSKCG+ ESAS +F++ITVKDVVAWTALI+GYVQNNES+K                                  EGRCLH
Subjt:  GLKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------------------------------EGRCLH

Query:  GMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAF
        G+ LKSG +C EVVKSS+LS+YSRC SPEEAYRCF  L +KDL+SW SI+AVH+K GLM++CL+LFWEM+AS IIP++I ISC L+G GN DRISEGKAF
Subjt:  GMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAF

Query:  HGWILRQCCAVSRITQSPCTDRAIK-------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFI
        H WIL+QCCAVS IT +       K                               G+KEKCIDFFR+M LLGIEPDL SLV VISSCL VGAVNIGR +
Subjt:  HGWILRQCCAVSRITQSPCTDRAIK-------------------------------GQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFI

Query:  HCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ---------------------------------------------------------------
        HCYAIKNS+IDNVSIAN+LLDMYGKSGNLT AWRIF                                                                
Subjt:  HCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ---------------------------------------------------------------

Query:  ------------------------------DTAKGYYYLMEYIDFIL----HANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGR
                                      +T++  +  ME  D IL     +NYGMHGH ESAIEIFQLME+S++K NALTF SL+SACNH G + EGR
Subjt:  ------------------------------DTAKGYYYLMEYIDFIL----HANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGR

Query:  RHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
        R FD M KYG+KPSLKH A MVDLLGRSGSL+EAE LVLSMPI+PDGTVWGSLLSACKLHNE+EMG+RI R+AIESDP+NDG
Subjt:  RHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

SwissProt top hitse value%identityAlignment
Q3E9N1 Pentatricopeptide repeat-containing protein At4g39952, mitochondrial1.6e-9933.82Show/hide
Query:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH------------------------------------------EHSW--VG------
        IT G S ++F A+KL+S YAS+ +P  S   FHL    D   W+                                          E  W  VG      
Subjt:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH------------------------------------------EHSW--VG------

Query:  -LKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQ-------------------------------------KEGR
         LK G F  N+AVG+S VY YSKCG  + A  +FDE+  +DVVAWTA+I G+VQN ES+                                     KEGR
Subjt:  -LKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQ-------------------------------------KEGR

Query:  CLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEG
        CLHG  +K+G+   + V+SS+ S YS+  +P EAY  F  L  +D+ SW SIIA  A+ G M +  ++FWEM+   + P+ + ISC +  +G    + +G
Subjt:  CLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEG

Query:  KAFHGWILRQC----------------------------CAVSRITQSPCTDRAIKGQKE-----KCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVN
        KAFHG+++R C                            C +S        +  +KG  +     KCI+ FRK+Q LGIE D  S   VISSC  +GAV 
Subjt:  KAFHGWILRQC----------------------------CAVSRITQSPCTDRAIKGQKE-----KCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVN

Query:  IGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIF--------------------QDTAKG---------------------------------
        +G+ +HCY +K S+   +S+ N+L+D+YGK G+LT AWR+F                    + + K                                  
Subjt:  IGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIF--------------------QDTAKG---------------------------------

Query:  -----YYYLME------------YIDF--------------------------ILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLV
             + Y+ E             ID                           ++ + YGMHG  ESAI +F  MEES VK    TF +L+SAC H GLV
Subjt:  -----YYYLME------------YIDF--------------------------ILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLV

Query:  AEGRRHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
         +G++ F +M +Y VKP+LKH +C+VDLL RSG+L+EAE+ V+SMP SPDG +WG+LLS+C  H E+EMG+R+A  A+ SDP+NDG
Subjt:  AEGRRHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

Q9CA56 Pentatricopeptide repeat-containing protein At1g74600, chloroplastic2.7e-5430.46Show/hide
Query:  HSWVGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLF---DEITVKDVVAWTALIVGYVQNNESQKE-------------------------------G
        H+WV  K G F  +S+V ++L+ MYSK G  + +  +F   D+I  +++V    +I  + Q+ +  K                                G
Subjt:  HSWVGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLF---DEITVKDVVAWTALIVGYVQNNESQKE-------------------------------G

Query:  RCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISE
        + +HG  LKSG+V    V SSL ++YS+C S EE+Y+ F  +  KD   W S+I+   ++G + + + LF EM      P+E  ++  L    +   +  
Subjt:  RCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISE

Query:  GKAFHGWILR------------------QCCAVSRITQ--------------SPCTDRAIKGQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAV-
        GK  HG+ LR                  +C ++    Q              S  +  +  G  +     FR M + G   D     F ISS L+  A+ 
Subjt:  GKAFHGWILR------------------QCCAVSRITQ--------------SPCTDRAIKGQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAV-

Query:  ---NIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQDTAKGYYYLMEYIDFI----LHANYGMHGHAESAIEIFQLMEESHVKLNALTFP
           ++G  +H Y  K  +    S+ ++LL MY K G++    + F          +   D I    L A+Y  HG A  A++++ LM+E   K + +TF 
Subjt:  ---NIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQDTAKGYYYLMEYIDFI----LHANYGMHGHAESAIEIFQLMEESHVKLNALTFP

Query:  SLISACNHTGLVAEGRRHFDRMQK-YGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
         ++SAC+H GLV E   H + M K YG++P  +H  CMVD LGRSG L+EAE+ + +M I PD  VWG+LL+ACK+H E E+G   A+ AIE +P + G
Subjt:  SLISACNHTGLVAEGRRHFDRMQK-YGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

Q9M9E2 Pentatricopeptide repeat-containing protein At1g15510, chloroplastic2.2e-5627.85Show/hide
Query:  VGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNN----------------------------------ESQKEGRCLHGMGLKSGIVCIE
        V ++L+ MY KCG  +SA  LFD +  +D+++W A+I GY +N                                     ++ GR +H   + +G     
Subjt:  VGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNN----------------------------------ESQKEGRCLHGMGLKSGIVCIE

Query:  VVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFH---------GW
         V +SL  +Y    S  EA + F  + RKD++SW ++I+ +    L    ++ +  M    + P+EI ++  L        +  G   H          +
Subjt:  VVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFH---------GW

Query:  ILRQCCAVSRITQSPCTDRA------------------IKGQK--EKCID---FFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVID
        ++     ++  ++  C D+A                  I G +   +C +   F R+M++  ++P+ I+L   +++C ++GA+  G+ IH + ++  V  
Subjt:  ILRQCCAVSRITQSPCTDRA------------------IKGQK--EKCID---FFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVID

Query:  NVSIANALLDMYGKSGNLTTAWRIFQDTAKGYYYLMEYIDFILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQK
        +  + NALLDMY + G + TAW  F    K           IL   Y   G     +E+F  M +S V+ + +TF SL+  C+ + +V +G  +F +M+ 
Subjt:  NVSIANALLDMYGKSGNLTTAWRIFQDTAKGYYYLMEYIDFILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQK

Query:  YGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
        YGV P+LKH AC+VDLLGR+G LQEA   +  MP++PD  VWG+LL+AC++H++ ++G   A++  E D ++ G
Subjt:  YGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

Q9SJZ3 Pentatricopeptide repeat-containing protein At2g22410, mitochondrial3.9e-5329.75Show/hide
Query:  SAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------EG-------------RCLHGMGLKSGIVCIEVVK------
        S V ++ ++M++ CG  E+A  +FDE  V+D+V+W  LI GY +  E++K          EG              C     L  G    E VK      
Subjt:  SAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------EG-------------RCLHGMGLKSGIVCIEVVK------

Query:  -----SSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCA
             ++L+ ++S+C    EA R F NL ++ ++SW ++I+ +A+ GL+     LF +M   +++            IG S +   G             
Subjt:  -----SSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCA

Query:  VSRITQSPCTDRAIKGQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ--DT
                          +  +  F++MQ    +PD I+++  +S+C Q+GA+++G +IH Y  K S+  NV++  +L+DMY K GN++ A  +F    T
Subjt:  VSRITQSPCTDRAIKGQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ--DT

Query:  AKGYYYLMEYIDFILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQ-KYGVKPSLKHCACMVDLLGRSGSLQEAE
             Y        +     +HG A +AI  F  M ++ +  + +TF  L+SAC H G++  GR +F +M+ ++ + P LKH + MVDLLGR+G L+EA+
Subjt:  AKGYYYLMEYIDFILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQ-KYGVKPSLKHCACMVDLLGRSGSLQEAE

Query:  TLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
         L+ SMP+  D  VWG+LL  C++H   E+G + A+  +E DP + G
Subjt:  TLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic6.9e-5828.11Show/hide
Query:  LSKHATISSSIS-SPDYITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWHEHSWVGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLF
        L+K    S SI      +++G   D +  + +   ++S R   S HGG  L           H ++ LK G F   ++VG+SLV  Y K    +SA  +F
Subjt:  LSKHATISSSIS-SPDYITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWHEHSWVGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLF

Query:  DEITVKDVVAWTALIVGYVQNNESQKE----------------------------------GRCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRC
        DE+T +DV++W ++I GYV N  ++K                                   GR +H +G+K+     +   ++LL +YS+C   + A   
Subjt:  DEITVKDVVAWTALIVGYVQNNESQKE----------------------------------GRCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRC

Query:  FCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCAVSRITQSPCTDRAIK-GQKEKCID
        F  ++ + ++S+ S+IA +A+ GL  + + LF EM    I P+   ++  L        + EGK  H WI            +   D   K G  ++   
Subjt:  FCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCAVSRITQSPCTDRAIK-GQKEKCID

Query:  FFRKMQLLGI--------------------------------EPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTA
         F +M++  I                                 PD  ++  V+ +C  + A + GR IH Y ++N    +  +AN+L+DMY K G L  A
Subjt:  FFRKMQLLGI--------------------------------EPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTA

Query:  WRIFQDTAKGYYYLMEYIDF-ILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQ-KYGVKPSLKHCACMVDLLGR
          +F D A       + + + ++ A YGMHG  + AI +F  M ++ ++ + ++F SL+ AC+H+GLV EG R F+ M+ +  ++P+++H AC+VD+L R
Subjt:  WRIFQDTAKGYYYLMEYIDF-ILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQ-KYGVKPSLKHCACMVDLLGR

Query:  SGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
        +G L +A   + +MPI PD T+WG+LL  C++H++ ++  ++A    E +PEN G
Subjt:  SGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

Arabidopsis top hitse value%identityAlignment
AT1G15510.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.6e-5727.85Show/hide
Query:  VGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNN----------------------------------ESQKEGRCLHGMGLKSGIVCIE
        V ++L+ MY KCG  +SA  LFD +  +D+++W A+I GY +N                                     ++ GR +H   + +G     
Subjt:  VGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNN----------------------------------ESQKEGRCLHGMGLKSGIVCIE

Query:  VVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFH---------GW
         V +SL  +Y    S  EA + F  + RKD++SW ++I+ +    L    ++ +  M    + P+EI ++  L        +  G   H          +
Subjt:  VVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFH---------GW

Query:  ILRQCCAVSRITQSPCTDRA------------------IKGQK--EKCID---FFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVID
        ++     ++  ++  C D+A                  I G +   +C +   F R+M++  ++P+ I+L   +++C ++GA+  G+ IH + ++  V  
Subjt:  ILRQCCAVSRITQSPCTDRA------------------IKGQK--EKCID---FFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVID

Query:  NVSIANALLDMYGKSGNLTTAWRIFQDTAKGYYYLMEYIDFILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQK
        +  + NALLDMY + G + TAW  F    K           IL   Y   G     +E+F  M +S V+ + +TF SL+  C+ + +V +G  +F +M+ 
Subjt:  NVSIANALLDMYGKSGNLTTAWRIFQDTAKGYYYLMEYIDFILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQK

Query:  YGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
        YGV P+LKH AC+VDLLGR+G LQEA   +  MP++PD  VWG+LL+AC++H++ ++G   A++  E D ++ G
Subjt:  YGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

AT1G74600.1 pentatricopeptide (PPR) repeat-containing protein1.9e-5530.46Show/hide
Query:  HSWVGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLF---DEITVKDVVAWTALIVGYVQNNESQKE-------------------------------G
        H+WV  K G F  +S+V ++L+ MYSK G  + +  +F   D+I  +++V    +I  + Q+ +  K                                G
Subjt:  HSWVGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLF---DEITVKDVVAWTALIVGYVQNNESQKE-------------------------------G

Query:  RCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISE
        + +HG  LKSG+V    V SSL ++YS+C S EE+Y+ F  +  KD   W S+I+   ++G + + + LF EM      P+E  ++  L    +   +  
Subjt:  RCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISE

Query:  GKAFHGWILR------------------QCCAVSRITQ--------------SPCTDRAIKGQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAV-
        GK  HG+ LR                  +C ++    Q              S  +  +  G  +     FR M + G   D     F ISS L+  A+ 
Subjt:  GKAFHGWILR------------------QCCAVSRITQ--------------SPCTDRAIKGQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAV-

Query:  ---NIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQDTAKGYYYLMEYIDFI----LHANYGMHGHAESAIEIFQLMEESHVKLNALTFP
           ++G  +H Y  K  +    S+ ++LL MY K G++    + F          +   D I    L A+Y  HG A  A++++ LM+E   K + +TF 
Subjt:  ---NIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQDTAKGYYYLMEYIDFI----LHANYGMHGHAESAIEIFQLMEESHVKLNALTFP

Query:  SLISACNHTGLVAEGRRHFDRMQK-YGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
         ++SAC+H GLV E   H + M K YG++P  +H  CMVD LGRSG L+EAE+ + +M I PD  VWG+LL+ACK+H E E+G   A+ AIE +P + G
Subjt:  SLISACNHTGLVAEGRRHFDRMQK-YGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

AT2G22410.1 SLOW GROWTH 12.8e-5429.75Show/hide
Query:  SAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------EG-------------RCLHGMGLKSGIVCIEVVK------
        S V ++ ++M++ CG  E+A  +FDE  V+D+V+W  LI GY +  E++K          EG              C     L  G    E VK      
Subjt:  SAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQK----------EG-------------RCLHGMGLKSGIVCIEVVK------

Query:  -----SSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCA
             ++L+ ++S+C    EA R F NL ++ ++SW ++I+ +A+ GL+     LF +M   +++            IG S +   G             
Subjt:  -----SSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCA

Query:  VSRITQSPCTDRAIKGQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ--DT
                          +  +  F++MQ    +PD I+++  +S+C Q+GA+++G +IH Y  K S+  NV++  +L+DMY K GN++ A  +F    T
Subjt:  VSRITQSPCTDRAIKGQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIFQ--DT

Query:  AKGYYYLMEYIDFILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQ-KYGVKPSLKHCACMVDLLGRSGSLQEAE
             Y        +     +HG A +AI  F  M ++ +  + +TF  L+SAC H G++  GR +F +M+ ++ + P LKH + MVDLLGR+G L+EA+
Subjt:  AKGYYYLMEYIDFILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQ-KYGVKPSLKHCACMVDLLGRSGSLQEAE

Query:  TLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
         L+ SMP+  D  VWG+LL  C++H   E+G + A+  +E DP + G
Subjt:  TLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein4.9e-5928.11Show/hide
Query:  LSKHATISSSIS-SPDYITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWHEHSWVGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLF
        L+K    S SI      +++G   D +  + +   ++S R   S HGG  L           H ++ LK G F   ++VG+SLV  Y K    +SA  +F
Subjt:  LSKHATISSSIS-SPDYITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWHEHSWVGLKLGHFACNSAVGSSLVYMYSKCGHTESASFLF

Query:  DEITVKDVVAWTALIVGYVQNNESQKE----------------------------------GRCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRC
        DE+T +DV++W ++I GYV N  ++K                                   GR +H +G+K+     +   ++LL +YS+C   + A   
Subjt:  DEITVKDVVAWTALIVGYVQNNESQKE----------------------------------GRCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRC

Query:  FCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCAVSRITQSPCTDRAIK-GQKEKCID
        F  ++ + ++S+ S+IA +A+ GL  + + LF EM    I P+   ++  L        + EGK  H WI            +   D   K G  ++   
Subjt:  FCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCAVSRITQSPCTDRAIK-GQKEKCID

Query:  FFRKMQLLGI--------------------------------EPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTA
         F +M++  I                                 PD  ++  V+ +C  + A + GR IH Y ++N    +  +AN+L+DMY K G L  A
Subjt:  FFRKMQLLGI--------------------------------EPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTA

Query:  WRIFQDTAKGYYYLMEYIDF-ILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQ-KYGVKPSLKHCACMVDLLGR
          +F D A       + + + ++ A YGMHG  + AI +F  M ++ ++ + ++F SL+ AC+H+GLV EG R F+ M+ +  ++P+++H AC+VD+L R
Subjt:  WRIFQDTAKGYYYLMEYIDF-ILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQ-KYGVKPSLKHCACMVDLLGR

Query:  SGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
        +G L +A   + +MPI PD T+WG+LL  C++H++ ++  ++A    E +PEN G
Subjt:  SGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG

AT4G39952.1 Pentatricopeptide repeat (PPR) superfamily protein1.1e-10033.82Show/hide
Query:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH------------------------------------------EHSW--VG------
        IT G S ++F A+KL+S YAS+ +P  S   FHL    D   W+                                          E  W  VG      
Subjt:  ITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWH------------------------------------------EHSW--VG------

Query:  -LKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQ-------------------------------------KEGR
         LK G F  N+AVG+S VY YSKCG  + A  +FDE+  +DVVAWTA+I G+VQN ES+                                     KEGR
Subjt:  -LKLGHFACNSAVGSSLVYMYSKCGHTESASFLFDEITVKDVVAWTALIVGYVQNNESQ-------------------------------------KEGR

Query:  CLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEG
        CLHG  +K+G+   + V+SS+ S YS+  +P EAY  F  L  +D+ SW SIIA  A+ G M +  ++FWEM+   + P+ + ISC +  +G    + +G
Subjt:  CLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWEMRASEIIPEEIAISCTLMGIGNSDRISEG

Query:  KAFHGWILRQC----------------------------CAVSRITQSPCTDRAIKGQKE-----KCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVN
        KAFHG+++R C                            C +S        +  +KG  +     KCI+ FRK+Q LGIE D  S   VISSC  +GAV 
Subjt:  KAFHGWILRQC----------------------------CAVSRITQSPCTDRAIKGQKE-----KCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVN

Query:  IGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIF--------------------QDTAKG---------------------------------
        +G+ +HCY +K S+   +S+ N+L+D+YGK G+LT AWR+F                    + + K                                  
Subjt:  IGRFIHCYAIKNSVIDNVSIANALLDMYGKSGNLTTAWRIF--------------------QDTAKG---------------------------------

Query:  -----YYYLME------------YIDF--------------------------ILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLV
             + Y+ E             ID                           ++ + YGMHG  ESAI +F  MEES VK    TF +L+SAC H GLV
Subjt:  -----YYYLME------------YIDF--------------------------ILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLV

Query:  AEGRRHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG
         +G++ F +M +Y VKP+LKH +C+VDLL RSG+L+EAE+ V+SMP SPDG +WG+LLS+C  H E+EMG+R+A  A+ SDP+NDG
Subjt:  AEGRRHFDRMQKYGVKPSLKHCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTAGACTCCGGCCGCGCCAATGTCAGATCCTAACTACCTTAACAGTTACTACAATTTTTCTTTCAAAACACGCTACAATCTCTTCTTCAATTTCATCCCCTGATTA
TATCACCACTGGCAACTCCAACGATGTCTTCTTTGCTACGAAGCTCGTGTCCTTTTATGCCTCTCATAGGCAACCCACATCTTCCCATGGTGGTTTCCACTTGCGCCGAA
CTAATGATGCTCAACCATGGCATGAACATTCATGGGTTGGCTTGAAACTTGGGCACTTTGCTTGTAATTCTGCTGTTGGTTCTTCTTTGGTATACATGTATTCCAAATGC
GGTCATACAGAAAGTGCATCTTTTTTGTTCGATGAAATTACGGTTAAAGATGTAGTTGCTTGGACTGCGCTTATAGTTGGGTACGTCCAGAATAATGAGAGTCAGAAGGA
GGGCAGATGCTTACATGGTATGGGTCTAAAAAGTGGGATTGTGTGTATTGAAGTTGTCAAGTCTTCTCTTCTCTCTGTGTACTCGAGGTGCTGGTCACCTGAAGAAGCTT
ACAGATGCTTTTGTAACTTAAACAGAAAAGATCTTCTCTCCTGGATATCAATTATTGCAGTTCATGCTAAATTTGGGTTAATGACTGATTGTCTAAATTTATTTTGGGAA
ATGCGAGCTAGCGAAATAATCCCAGAAGAAATAGCGATCAGTTGCACGCTCATGGGTATTGGTAATTCTGATAGAATCTCTGAAGGAAAAGCCTTTCATGGTTGGATTCT
GAGACAATGTTGTGCGGTAAGTCGAATAACTCAAAGTCCATGTACTGATAGAGCAATAAAGGGGCAAAAAGAAAAGTGTATAGACTTTTTCAGGAAGATGCAACTTCTAG
GCATAGAACCTGATTTGATCAGTTTAGTTTTTGTAATTTCTTCATGTTTACAAGTTGGAGCAGTCAATATTGGCCGGTTTATTCACTGCTATGCAATTAAGAACTCTGTA
ATTGACAACGTATCTATAGCTAACGCACTCCTGGACATGTACGGGAAAAGTGGTAATTTAACCACCGCATGGAGGATATTTCAGGACACGGCAAAGGGATACTACTATCT
CATGGAATACATTGATTTCATCCTACACGCAAATTATGGGATGCATGGGCATGCGGAATCTGCAATTGAGATCTTCCAACTAATGGAGGAGTCACATGTAAAACTGAATG
CGCTTACCTTTCCTTCTCTTATCTCAGCTTGTAATCATACAGGGCTTGTGGCAGAAGGAAGGCGTCACTTTGACAGAATGCAGAAATATGGTGTCAAACCTAGTCTGAAG
CACTGTGCTTGTATGGTAGATCTTCTTGGCAGGTCAGGTAGCCTTCAAGAGGCGGAGACTCTGGTTTTATCAATGCCCATCTCTCCTGATGGCACTGTGTGGGGCTCCTT
GCTAAGTGCTTGTAAACTTCACAATGAATATGAAATGGGGGTAAGGATTGCTAGGTATGCAATTGAGTCTGATCCAGAAAATGATGGGACTACATAA
mRNA sequenceShow/hide mRNA sequence
ATGCTTAGACTCCGGCCGCGCCAATGTCAGATCCTAACTACCTTAACAGTTACTACAATTTTTCTTTCAAAACACGCTACAATCTCTTCTTCAATTTCATCCCCTGATTA
TATCACCACTGGCAACTCCAACGATGTCTTCTTTGCTACGAAGCTCGTGTCCTTTTATGCCTCTCATAGGCAACCCACATCTTCCCATGGTGGTTTCCACTTGCGCCGAA
CTAATGATGCTCAACCATGGCATGAACATTCATGGGTTGGCTTGAAACTTGGGCACTTTGCTTGTAATTCTGCTGTTGGTTCTTCTTTGGTATACATGTATTCCAAATGC
GGTCATACAGAAAGTGCATCTTTTTTGTTCGATGAAATTACGGTTAAAGATGTAGTTGCTTGGACTGCGCTTATAGTTGGGTACGTCCAGAATAATGAGAGTCAGAAGGA
GGGCAGATGCTTACATGGTATGGGTCTAAAAAGTGGGATTGTGTGTATTGAAGTTGTCAAGTCTTCTCTTCTCTCTGTGTACTCGAGGTGCTGGTCACCTGAAGAAGCTT
ACAGATGCTTTTGTAACTTAAACAGAAAAGATCTTCTCTCCTGGATATCAATTATTGCAGTTCATGCTAAATTTGGGTTAATGACTGATTGTCTAAATTTATTTTGGGAA
ATGCGAGCTAGCGAAATAATCCCAGAAGAAATAGCGATCAGTTGCACGCTCATGGGTATTGGTAATTCTGATAGAATCTCTGAAGGAAAAGCCTTTCATGGTTGGATTCT
GAGACAATGTTGTGCGGTAAGTCGAATAACTCAAAGTCCATGTACTGATAGAGCAATAAAGGGGCAAAAAGAAAAGTGTATAGACTTTTTCAGGAAGATGCAACTTCTAG
GCATAGAACCTGATTTGATCAGTTTAGTTTTTGTAATTTCTTCATGTTTACAAGTTGGAGCAGTCAATATTGGCCGGTTTATTCACTGCTATGCAATTAAGAACTCTGTA
ATTGACAACGTATCTATAGCTAACGCACTCCTGGACATGTACGGGAAAAGTGGTAATTTAACCACCGCATGGAGGATATTTCAGGACACGGCAAAGGGATACTACTATCT
CATGGAATACATTGATTTCATCCTACACGCAAATTATGGGATGCATGGGCATGCGGAATCTGCAATTGAGATCTTCCAACTAATGGAGGAGTCACATGTAAAACTGAATG
CGCTTACCTTTCCTTCTCTTATCTCAGCTTGTAATCATACAGGGCTTGTGGCAGAAGGAAGGCGTCACTTTGACAGAATGCAGAAATATGGTGTCAAACCTAGTCTGAAG
CACTGTGCTTGTATGGTAGATCTTCTTGGCAGGTCAGGTAGCCTTCAAGAGGCGGAGACTCTGGTTTTATCAATGCCCATCTCTCCTGATGGCACTGTGTGGGGCTCCTT
GCTAAGTGCTTGTAAACTTCACAATGAATATGAAATGGGGGTAAGGATTGCTAGGTATGCAATTGAGTCTGATCCAGAAAATGATGGGACTACATAA
Protein sequenceShow/hide protein sequence
MLRLRPRQCQILTTLTVTTIFLSKHATISSSISSPDYITTGNSNDVFFATKLVSFYASHRQPTSSHGGFHLRRTNDAQPWHEHSWVGLKLGHFACNSAVGSSLVYMYSKC
GHTESASFLFDEITVKDVVAWTALIVGYVQNNESQKEGRCLHGMGLKSGIVCIEVVKSSLLSVYSRCWSPEEAYRCFCNLNRKDLLSWISIIAVHAKFGLMTDCLNLFWE
MRASEIIPEEIAISCTLMGIGNSDRISEGKAFHGWILRQCCAVSRITQSPCTDRAIKGQKEKCIDFFRKMQLLGIEPDLISLVFVISSCLQVGAVNIGRFIHCYAIKNSV
IDNVSIANALLDMYGKSGNLTTAWRIFQDTAKGYYYLMEYIDFILHANYGMHGHAESAIEIFQLMEESHVKLNALTFPSLISACNHTGLVAEGRRHFDRMQKYGVKPSLK
HCACMVDLLGRSGSLQEAETLVLSMPISPDGTVWGSLLSACKLHNEYEMGVRIARYAIESDPENDGTT