CuGenDBv2

Tan0018295 (gene) of Snake gourd v1 genome

Gene ID: Tan0018295
Organism: Trichosanthes anguina (Snake gourd v1)
Description: pentatricopeptide repeat-containing protein At1g30610, chloroplastic
Genome location: LG05:79608907..79620069
RNA-Seq Expression: Tan0018295
Synteny: Tan0018295
Gene Ontology terms:
  GO:0009658 - chloroplast organization (biological process)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR002885 - Pentatricopeptide repeat
  IPR011990 - Tetratricopeptide-like helical domain superfamily
  IPR044645 - Pentatricopeptide repeat-containing protein DG1/EMB2279-like


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG7019446.1 Pentatricopeptide repeat-containing protein, chloroplastic, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 3.7e-253; identity: 56.68%)
Query:  MVGVIITNANLCIPCC-GMGF--------QHYII-------------------------HRIPIICSVFQFSGVGDINVEQLRPRQRGNLI-FDFQM---
        MVGVI+ NANLCIPCC G GF         HY++                         HR    C   + S  G+ +++       GNL+  DFQ    
Subjt:  MVGVIITNANLCIPCC-GMGF--------QHYII-------------------------HRIPIICSVFQFSGVGDINVEQLRPRQRGNLI-FDFQM---

Query:  ---------GISSKRIFNLSHPSMNMERLLCKEKTSTKSAESK-----VTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKD
                  + S+R    S     M     KE  S KSAES      VT+VQGN+DVKN +  V  +DLF+N E+I  K DLSGNKFD+KRKGVTRSKD
Subjt:  ---------GISSKRIFNLSHPSMNMERLLCKEKTSTKSAESK-----VTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKD

Query:  EVKGKVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPG
        E+KGKVTPFDSQVNDK+ EE+R GNWSNYIEPK  R +++  LHFKANTLDVK E H V  GSS+K+S+K W DDDTKP KD+LKVGK+GVQL  NYIPG
Subjt:  EVKGKVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPG

Query:  DKV----------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVI
        DKV          GLSKSGK F E TEESSLEVEH AFN+FDA DIMDKPRVSKMEMEERIQMLSK                               RVI
Subjt:  DKV----------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVI

Query:  QVLGKLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKK
        QVLGKL NW+RVLQVI+       FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYP+LVAYHSIA TLGQAGYMRELFDVID+M+SPPKKK
Subjt:  QVLGKLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKK

Query:  FKTGALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-------------------------------------------------------------
        FKTGA EKWDPRLQPD+VIYNAVLNACVKRKNWEGAFWV                                                             
Subjt:  FKTGALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-------------------------------------------------------------

Query:  ---------------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCF
                             +L Y F             +  +EKICKVANKPLVVTYTGLIQACLDSKNL SAVYIFNHMKAFCSPNLVT NI    +
Subjt:  ---------------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCF

Query:  LEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNEL
        L+HGMF+EA+ELFQN+SE+GRNI  VSDYRD V          LPDIYTFNTMLD SFAEKRWDDF +FYNQML YGYHFNPKRHL       R   +EL
Subjt:  LEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNEL

Query:  LKTTWKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKE
        L+TTWKHLAQADRT PPPLI+E                      SSD HHFS+SAWLNLLKEKRFP+D+VI+LIHKVS+ L RN+SPNPV QNLLLS KE
Subjt:  LKTTWKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKE

Query:  FCRTRISVGDHKLEEIVCTSK
        FCR+RI+V D +LEE+VCT++
Subjt:  FCRTRISVGDHKLEEIVCTSK

XP_022927392.1 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita moschata] (e value: 2.1e-251; identity: 56.82%)
Query:  MVGVIITNANLCIPCC-GMGF--------QHYIIHRIPIICSVFQFSGV--GDINVEQLRPR-------------------QRGNLI-FDFQM-------
        MVGVI+ NANLCIPCC G GF         HY++    +  S    SG+  G      LR R                     GNL+  DFQ        
Subjt:  MVGVIITNANLCIPCC-GMGF--------QHYIIHRIPIICSVFQFSGV--GDINVEQLRPR-------------------QRGNLI-FDFQM-------

Query:  -----GISSKRIFNLSHPSMNMERLLCKEKTSTKSAESK-----VTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKG
              + S+R    S     M     KE  S KSAES      VT+VQGN+DVKN +  V  +DLF+N E+I  K DLSGNKFD+KRKGVTRSKDE+KG
Subjt:  -----GISSKRIFNLSHPSMNMERLLCKEKTSTKSAESK-----VTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKG

Query:  KVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPGDKV-
        KVTPF+SQVNDK+ EE+R GNWSNYIEPK  R +++  LHFKANTLDVK E H V  GSS+K+S+K W DDD+KP KD+LKVGK+GVQL  NYIPGDKV 
Subjt:  KVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPGDKV-

Query:  ---------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLG
                 GLSKSGK F E TEESSLEVEH AFN+ DA DIMDKPRVSKMEMEERIQMLS                                RVIQVLG
Subjt:  ---------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLG

Query:  KLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTG
        KL NW+RVLQVI+       FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYP+LVAYHSIA TLGQAGYMRELFDVID+M+SPPKKKFKTG
Subjt:  KLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTG

Query:  ALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-----------------------------------------------------------------
        A EKWDPRLQPD+VIYNAVLNACVKRKNWEGAFWV                                                                 
Subjt:  ALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-----------------------------------------------------------------

Query:  -----------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHG
                         +L Y F             +  +EKICKVANKPLVVTYTGLIQACLDSKNL SAVYIFNHMKAFCSPNLVT NI    +L+HG
Subjt:  -----------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHG

Query:  MFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTT
        MF+EA+ELFQN+SE+GRNI  VSDYRD V          LPDIYTFNTMLD SFAEKRWDDF +FYNQML YGYHFNPKRHL       R   +ELL+TT
Subjt:  MFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTT

Query:  WKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRT
        WKHLAQADRT PPPLI+E                      SSD HHFS+SAWLNLLKEKRFP+D+VI+LIHKVS+ L RN+SPNPV QNLLLS KEFCR+
Subjt:  WKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRT

Query:  RISVGDHKLEEIVCTSK
        RISV D +LEE+VCT++
Subjt:  RISVGDHKLEEIVCTSK

XP_023000737.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita maxima] (e value: 6.4e-253; identity: 57.25%)
Query:  MVGVIITNANLCIPCC-GMGF--------QHYIIHRIPIICSVFQFSGV--GDINVEQLRPR-------------------QRGNLI-FDFQM-------
        MVGVI+ NANLCIPCC G GF         HY++  +    S    SG+  G      LR R                     GNL+  DFQ        
Subjt:  MVGVIITNANLCIPCC-GMGF--------QHYIIHRIPIICSVFQFSGV--GDINVEQLRPR-------------------QRGNLI-FDFQM-------

Query:  -----GISSKRIFNLSHPSMNMERLLCKEKTSTKSAESK-----VTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKG
              + S+R    S     M     KE  S KSAES      VT+VQGN+DVKN +  V  +DLF+N ERI  K DLSGNKFD+KRKGVTRSKDE+KG
Subjt:  -----GISSKRIFNLSHPSMNMERLLCKEKTSTKSAESK-----VTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKG

Query:  KVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPGDKV-
        KVTPFDSQ+NDK+ EE+R GNWSNYIEPKV R +++  LHFKANTLDVK E H V  GSS+K+SEK W DDD KP KD+LKVGK+GVQL  NYIPGDKV 
Subjt:  KVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPGDKV-

Query:  ---------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLG
                 GLSKSGK F E TEESSLEVEH AFN+ DA DIMDKPRVSKMEMEERIQMLSK                               RVIQVLG
Subjt:  ---------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLG

Query:  KLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTG
        KL NW+RVLQVI+       FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYP+LVAYHSIA TLGQAGYMRELFDVID+M+SPPKKKFKTG
Subjt:  KLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTG

Query:  ALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-----------------------------------------------------------------
        A EKWDPRLQPD+VIYNAVLNACVKRKNWEGAFWV                                                                 
Subjt:  ALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-----------------------------------------------------------------

Query:  -----------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHG
                         +L Y F             +  +EKICKVANKPLVVTYTGLIQACLDSKNL SAVYIFNHMKAFCSPNLVT NI    +L+HG
Subjt:  -----------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHG

Query:  MFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTT
        MF EA+ELFQN+SE+GRNI  VSDYRD V          LPDIYTFNTMLD SFAEKRWDDF +FYNQML YGYHFNPKRHL       R   +ELL+TT
Subjt:  MFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTT

Query:  WKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRT
        WKHLAQADR  PPPLI+E                      SSD HHFS+SAWLNLLKEKRFP+D+VIQLIHKVS+ L RN+SPNPV QNLLLS KEFCR+
Subjt:  WKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRT

Query:  RISVGDHKLEEIVCTSK
        RISV D +LEE+VCT++
Subjt:  RISVGDHKLEEIVCTSK

XP_023519692.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic [Cucurbita pepo subsp. pepo] (e value: 9.2e-252; identity: 56.82%)
Query:  MVGVIITNANLCIPCC-GMGF--------QHYIIHRIPIICSVFQFSGVGDINVEQLRPRQRGN----------------------LIFDFQM-------
        MVGVI+ NANLCIPCC G GF         HY++       S    SG+   + +    R RG+                      L  DFQ        
Subjt:  MVGVIITNANLCIPCC-GMGF--------QHYIIHRIPIICSVFQFSGVGDINVEQLRPRQRGN----------------------LIFDFQM-------

Query:  -----GISSKRIFNLSHPSMNMERLLCKEKTSTKSAESK-----VTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKG
              + S+R    S     M     KE  S KSAES      VT+VQGN+DVK  +  V ++DLF+N ERI  K DLSGNKFD+KRKGVTRSKDE+KG
Subjt:  -----GISSKRIFNLSHPSMNMERLLCKEKTSTKSAESK-----VTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKG

Query:  KVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPGDKV-
        KVTPFDSQVNDK+  E+R GNWSNYIEPKV R +++  LHFKANTLDVK E H V  GSS+K+SEK W DDDTK  KD+LKVGK+GVQL  NYIPGDKV 
Subjt:  KVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPGDKV-

Query:  ---------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLG
                 GLSKSGK F E TEESSLEVEH AFN+ DA DIMDKPRVSKMEMEERIQMLSK                               RVIQVLG
Subjt:  ---------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLG

Query:  KLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTG
        KL NW+RVLQVI+       FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYP+LVAYHSIA TLGQAGYMRELFDVID+M+SPPKKKFKTG
Subjt:  KLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTG

Query:  ALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-----------------------------------------------------------------
        ALEKWDPRLQPD+VIYNAVLNACVKRKNWEGAFWV                                                                 
Subjt:  ALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-----------------------------------------------------------------

Query:  -----------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHG
                         +L Y F             +  +EKICKVANKPLVVTYTGLIQACLDSKNL SAVYIFNHMKAFCSPNLVT NI    +L+HG
Subjt:  -----------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHG

Query:  MFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTT
        MF+EA+ELFQN+SE+GRNI  VSDYRD V          LPDIYTFNTMLD SFAEKRWDDF +FYNQML YGYHFNPKRHL       R   +ELL+TT
Subjt:  MFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTT

Query:  WKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRT
        WKHLAQADRT PPPLI+E                      SSD HHFS+SAWLNLLKEKRFP+D+VI+LIHKVS+ L RN+SPNPV QNLLLS KEFCR+
Subjt:  WKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRT

Query:  RISVGDHKLEEIVCTSK
        RISV D +LEE+VCT++
Subjt:  RISVGDHKLEEIVCTSK

XP_038894404.1 pentatricopeptide repeat-containing protein At1g30610, chloroplastic isoform X1 [Benincasa hispida] (e value: 6.0e-251; identity: 61.38%)
Query:  RLLCKEKTSTKSAE-----------SKVTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKGKVTPFDSQVNDKKQEER
        +L  KE  S KSAE           +KVT+VQGNVDVKNM KRV RKDLFNN ERI  ++DLSGNK D+KRKG++RS DEVKGKVTPFDSQVNDK+ EE+
Subjt:  RLLCKEKTSTKSAE-----------SKVTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKGKVTPFDSQVNDKKQEER

Query:  RKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPGDKVG----------LSKSGKP
        R  N SNY EPKV RL NE  ++FKANTLD+KRE HR  +GSS+++S K W +DDTKPAKDIL   K+ VQL RNYI GDKVG           SKSGK 
Subjt:  RKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPGDKVG----------LSKSGKP

Query:  FLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLGKLENWRRVLQVID----
        FLE TE+SSLEVEH AFNNFDALDIMDKPRVSKMEMEERIQML K                               RVIQVLGKL NWRRVLQVI+    
Subjt:  FLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLGKLENWRRVLQVID----

Query:  ---FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYN
           FKSHKLRFIYTTALDVLGKARRPVEALN+FHAMQQHF+SYP+LVAYHSIA TLGQAGYM+ELFDVID+M+SPPKKKFKTG LEKWDPRL+PD+VIYN
Subjt:  ---FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYN

Query:  AVLNACVKRKNWEGAFWV----------------------------------------------------------------------------------
        AVLNACVKRKN EGAFWV                                                                                  
Subjt:  AVLNACVKRKNWEGAFWV----------------------------------------------------------------------------------

Query:  SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYN-IFSCFLEHGMFEEARELFQNLSEHGR
        +L Y F             +  +EKICKVA KPLVVTYTGLIQACLDSK++ SAVYIFNHMK FCSPNLVTYN +   +LEHGMFEEARELFQNLSEHGR
Subjt:  SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYN-IFSCFLEHGMFEEARELFQNLSEHGR

Query:  NIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTTWKHLAQADRTPPPPLIR
        NI TVSDYRD V          LPDIY FNTMLD SFAEKRWDDFGYFY+QML YGYHFNPKRHL       R   +ELL+TTWKHLAQADRTPPPPL++
Subjt:  NIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTTWKHLAQADRTPPPPLIR

Query:  E----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRTRISVGDHKLEEIVCTSK
        E                      SSD HHFSES WLNLLKEKRFP+DTVIQLI+KVS+ LTRN+ PNPVF+NLLLSCKEFCRTRISV DH+LEE VCT++
Subjt:  E----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRTRISVGDHKLEEIVCTSK

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LVN7 Uncharacterized protein (e value: 3.4e-244; identity: 60.78%)
Query:  RLLCKEKTSTKSAES-----------KVTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKGKVTPFDSQVNDKKQEER
        +L  KE  S KSAES           KVT+VQ NVDVKNM KRV +KDLFNN ERI  +KDLSGNKFD +RK VTRS D+VKGK+TPF S VNDK+ EE+
Subjt:  RLLCKEKTSTKSAES-----------KVTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKGKVTPFDSQVNDKKQEER

Query:  RKGNWSNYIEPKVRRL-SNETLHFKANTLDVKREKHRVCDGSSVKMSEK--TWDDDDTKPAKDILKVGKFGVQLARNYIPGDKVGLSK----------SG
        R  NWS+YIEP+V R  S + +HFKANTL+VK+E  RV DG+S+K SEK   W DDD KPAK +LK GK+G+QL R+Y PGDKVG  K          SG
Subjt:  RKGNWSNYIEPKVRRL-SNETLHFKANTLDVKREKHRVCDGSSVKMSEK--TWDDDDTKPAKDILKVGKFGVQLARNYIPGDKVGLSK----------SG

Query:  KPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLGKLENWRRVLQVID--
        K FLE  E++SLEVEH AFNNFDA DIMDKPRVSKMEMEERIQMLSK                               RVIQVLGKL NWRRVLQ+I+  
Subjt:  KPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLGKLENWRRVLQVID--

Query:  -----FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVI
             FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYP+LVAYHSIA TLGQAGYMRELFDVID+M+SPPKKKFKTG LEKWDPRLQPD+VI
Subjt:  -----FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVI

Query:  YNAVLNACVKRKNWEGAFWV--------------------------------------------------------------------------------
        YNAVLNACVKRKN EGAFWV                                                                                
Subjt:  YNAVLNACVKRKNWEGAFWV--------------------------------------------------------------------------------

Query:  --SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHGMFEEARELFQNLSEH
          +L Y F             +  +EKICKVANKPLVVTYTGLIQACLDSK+L SAVYIFNHMKAFCSPNLVTYNI    +LEHGMFEEARELFQNLSE 
Subjt:  --SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHGMFEEARELFQNLSEH

Query:  GRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTTWKHLAQADRTPPPPL
         RNI TVSDYRD V          LPDIY FNTMLD SFAEKRWDDF YFYNQM  YGYHFNPKRHL       R   +ELL+TTWKHLAQADRTPPPPL
Subjt:  GRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTTWKHLAQADRTPPPPL

Query:  IRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRTRISVGDHKLEEIV
        ++E                      S DAHHFSESAWLNLLKEKRFP DTVI+LIHKV + LTRN SPNPVF+NLLLSCKEFCRTRIS+ DH+LEE V
Subjt:  IRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRTRISVGDHKLEEIV

A0A1S3C8Z0 pentatricopeptide repeat-containing protein At1g30610, chloroplastic (e value: 1.0e-240; identity: 59.78%)
Query:  RLLCKEKTSTKSAES-----------KVTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKGKVTPFDSQVNDKKQEER
        +L  KE  S KSAES           KVT+VQ NV+VKNM KRV +KDLFNN ERI  +K LSGNKFD + KGVTRS D+VKGK+TPF S VNDK+ EE+
Subjt:  RLLCKEKTSTKSAES-----------KVTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKGKVTPFDSQVNDKKQEER

Query:  RKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEK--TWDDDDTKPAKDILKVGKFGVQLARNYIPGDKVGLSK----------SG
        + GNWS+YIEPKV R + E  +HFKAN L+ K+E  RV  G+S+K SEK   W +DD KPAKD+LK GK+G+QL R+Y PGDKVG  K          SG
Subjt:  RKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEK--TWDDDDTKPAKDILKVGKFGVQLARNYIPGDKVGLSK----------SG

Query:  KPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLGKLENWRRVLQVID--
        K FLE TEE+SLEVEH AFNNFDALDIMDKPRVSKMEMEERIQMLSK                               RVIQVLGKL NWRRVLQVI+  
Subjt:  KPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLGKLENWRRVLQVID--

Query:  -----FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVI
             FKSHK RFIYTTALDVLGKARRPVEALNVFHAMQ+HFSSYP+LVAYHSIA TLGQAGYMRELFDVID+M+SPPKKKFKTGALEKWDPRLQPD+VI
Subjt:  -----FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVI

Query:  YNAVLNACVKRKNWEGAFWV--------------------------------------------------------------------------------
        YNAVLNACVKRKN EGAFWV                                                                                
Subjt:  YNAVLNACVKRKNWEGAFWV--------------------------------------------------------------------------------

Query:  --SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHGMFEEARELFQNLSEH
          +L Y F             +  +EKICKVANKPLVVTYTGLIQACLDSK+L SAVY+FN MKAFCSPNLVTYNI    +LEHGMFEEAREL QNLSE 
Subjt:  --SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHGMFEEARELFQNLSEH

Query:  GRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTTWKHLAQADRTPPPPL
         +NI TVSDYRD V          LPDIY FNTMLD SFAEKRWDDF YFYNQM  YGYHFNPKRHL       R   +ELL+TTWKHLAQADRTPPPPL
Subjt:  GRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTTWKHLAQADRTPPPPL

Query:  IRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRTRISVGDHKLEEIVCT
        ++E                      S DAHHFSESAWLNLLKEKRFP+DTVI+LIHKV +    N SPNPVF+NLLLSCKEFCRTRISV DH+LEE V T
Subjt:  IRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRTRISVGDHKLEEIVCT

Query:  SKL
        +++
Subjt:  SKL

A0A6J1CLQ9 pentatricopeptide repeat-containing protein At1g30610, chloroplastic (e value: 1.9e-250; identity: 57.19%)
Query:  MVGVIITNANLCIPCCGM-GFQHYIIHRIPIICSVFQF-------SGVG-DINVEQLRP-RQRGN---LIFDFQMGISSKRIFN--------LSHPSMN-
        MVGVI+ NAN+CIPCC   GF+   +H      ++F F       SG+G ++  E+ R  R RGN    I     G S  R+ N        L  PS + 
Subjt:  MVGVIITNANLCIPCCGM-GFQHYIIHRIPIICSVFQF-------SGVG-DINVEQLRP-RQRGN---LIFDFQMGISSKRIFN--------LSHPSMN-

Query:  ----ME---------------RLLCKEKTSTKSAES-----------KVTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKD
            ME               +L  KE  S KSAES           KVT+VQGNVDVKNM KRV +K LFNN ER+  KKDL  NKFDNKRKG+TR+KD
Subjt:  ----ME---------------RLLCKEKTSTKSAES-----------KVTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKD

Query:  EVKGKVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNETL-HFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPG
        E +GKVT FDSQVNDK+ EE+RK N  + IEPKVRRL+NE L   KANTLD+KR++ RVCD SS+K  E+ W D DTK AK  L+VGK GVQLARNY+PG
Subjt:  EVKGKVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNETL-HFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPG

Query:  DKV----------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVI
        +KV          GLSKSGKPF+E TEESSLEVE  A NNFDALDIMDKPRVSKMEMEERIQMLSK                               RVI
Subjt:  DKV----------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVI

Query:  QVLGKLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKK
        QVLGKL NW+RVLQVI+       FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYP+LVAYHSIA TLGQAGYMRELFDVID+M+SPPKKK
Subjt:  QVLGKLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKK

Query:  FKTGALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-------------------------------------------------------------
        FKTGALEKWDPRLQPD+VIYNAVLNACVKRKNWEGAFWV                                                             
Subjt:  FKTGALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-------------------------------------------------------------

Query:  ---------------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCF
                             +L Y F             +  +EKICKVANKPLVVTYTGLIQACLDSKNL SAVYIFNHMKAFCSPNLVTYNI    +
Subjt:  ---------------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCF

Query:  LEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNEL
        L+HGMFEEARELFQNLSE G++I T+SDY+D V          LPDIYTFN MLD  FA KRWDDFGYFYNQM  YGYHFNPKRHL      GR   +E+
Subjt:  LEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNEL

Query:  LKTTWKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKE
        L+TTWKHLAQ DRT PPPL++E                      SSDAHHFSESAWLNLLKEK FP+DTVI LIHKVS+ LT N+ PNPVFQNLL SCKE
Subjt:  LKTTWKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKE

Query:  FCRTRISVGDHKLEEIVC
        FCRTRI+V D KLE+IVC
Subjt:  FCRTRISVGDHKLEEIVC

A0A6J1EH18 LOW QUALITY PROTEIN: pentatricopeptide repeat-containing protein At1g30610, chloroplastic (e value: 1.0e-251; identity: 56.82%)
Query:  MVGVIITNANLCIPCC-GMGF--------QHYIIHRIPIICSVFQFSGV--GDINVEQLRPR-------------------QRGNLI-FDFQM-------
        MVGVI+ NANLCIPCC G GF         HY++    +  S    SG+  G      LR R                     GNL+  DFQ        
Subjt:  MVGVIITNANLCIPCC-GMGF--------QHYIIHRIPIICSVFQFSGV--GDINVEQLRPR-------------------QRGNLI-FDFQM-------

Query:  -----GISSKRIFNLSHPSMNMERLLCKEKTSTKSAESK-----VTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKG
              + S+R    S     M     KE  S KSAES      VT+VQGN+DVKN +  V  +DLF+N E+I  K DLSGNKFD+KRKGVTRSKDE+KG
Subjt:  -----GISSKRIFNLSHPSMNMERLLCKEKTSTKSAESK-----VTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKG

Query:  KVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPGDKV-
        KVTPF+SQVNDK+ EE+R GNWSNYIEPK  R +++  LHFKANTLDVK E H V  GSS+K+S+K W DDD+KP KD+LKVGK+GVQL  NYIPGDKV 
Subjt:  KVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPGDKV-

Query:  ---------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLG
                 GLSKSGK F E TEESSLEVEH AFN+ DA DIMDKPRVSKMEMEERIQMLS                                RVIQVLG
Subjt:  ---------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLG

Query:  KLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTG
        KL NW+RVLQVI+       FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYP+LVAYHSIA TLGQAGYMRELFDVID+M+SPPKKKFKTG
Subjt:  KLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTG

Query:  ALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-----------------------------------------------------------------
        A EKWDPRLQPD+VIYNAVLNACVKRKNWEGAFWV                                                                 
Subjt:  ALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-----------------------------------------------------------------

Query:  -----------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHG
                         +L Y F             +  +EKICKVANKPLVVTYTGLIQACLDSKNL SAVYIFNHMKAFCSPNLVT NI    +L+HG
Subjt:  -----------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHG

Query:  MFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTT
        MF+EA+ELFQN+SE+GRNI  VSDYRD V          LPDIYTFNTMLD SFAEKRWDDF +FYNQML YGYHFNPKRHL       R   +ELL+TT
Subjt:  MFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTT

Query:  WKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRT
        WKHLAQADRT PPPLI+E                      SSD HHFS+SAWLNLLKEKRFP+D+VI+LIHKVS+ L RN+SPNPV QNLLLS KEFCR+
Subjt:  WKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRT

Query:  RISVGDHKLEEIVCTSK
        RISV D +LEE+VCT++
Subjt:  RISVGDHKLEEIVCTSK

A0A6J1KEH7 pentatricopeptide repeat-containing protein At1g30610, chloroplastic (e value: 3.1e-253; identity: 57.25%)
Query:  MVGVIITNANLCIPCC-GMGF--------QHYIIHRIPIICSVFQFSGV--GDINVEQLRPR-------------------QRGNLI-FDFQM-------
        MVGVI+ NANLCIPCC G GF         HY++  +    S    SG+  G      LR R                     GNL+  DFQ        
Subjt:  MVGVIITNANLCIPCC-GMGF--------QHYIIHRIPIICSVFQFSGV--GDINVEQLRPR-------------------QRGNLI-FDFQM-------

Query:  -----GISSKRIFNLSHPSMNMERLLCKEKTSTKSAESK-----VTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKG
              + S+R    S     M     KE  S KSAES      VT+VQGN+DVKN +  V  +DLF+N ERI  K DLSGNKFD+KRKGVTRSKDE+KG
Subjt:  -----GISSKRIFNLSHPSMNMERLLCKEKTSTKSAESK-----VTNVQGNVDVKNMLKRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKG

Query:  KVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPGDKV-
        KVTPFDSQ+NDK+ EE+R GNWSNYIEPKV R +++  LHFKANTLDVK E H V  GSS+K+SEK W DDD KP KD+LKVGK+GVQL  NYIPGDKV 
Subjt:  KVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNE-TLHFKANTLDVKREKHRVCDGSSVKMSEKTWDDDDTKPAKDILKVGKFGVQLARNYIPGDKV-

Query:  ---------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLG
                 GLSKSGK F E TEESSLEVEH AFN+ DA DIMDKPRVSKMEMEERIQMLSK                               RVIQVLG
Subjt:  ---------GLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLG

Query:  KLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTG
        KL NW+RVLQVI+       FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYP+LVAYHSIA TLGQAGYMRELFDVID+M+SPPKKKFKTG
Subjt:  KLENWRRVLQVID-------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTG

Query:  ALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-----------------------------------------------------------------
        A EKWDPRLQPD+VIYNAVLNACVKRKNWEGAFWV                                                                 
Subjt:  ALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWV-----------------------------------------------------------------

Query:  -----------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHG
                         +L Y F             +  +EKICKVANKPLVVTYTGLIQACLDSKNL SAVYIFNHMKAFCSPNLVT NI    +L+HG
Subjt:  -----------------SLAYLF-------------VSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHG

Query:  MFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTT
        MF EA+ELFQN+SE+GRNI  VSDYRD V          LPDIYTFNTMLD SFAEKRWDDF +FYNQML YGYHFNPKRHL       R   +ELL+TT
Subjt:  MFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTT

Query:  WKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRT
        WKHLAQADR  PPPLI+E                      SSD HHFS+SAWLNLLKEKRFP+D+VIQLIHKVS+ L RN+SPNPV QNLLLS KEFCR+
Subjt:  WKHLAQADRTPPPPLIRE----------------------SSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRNNSPNPVFQNLLLSCKEFCRT

Query:  RISVGDHKLEEIVCTSK
        RISV D +LEE+VCT++
Subjt:  RISVGDHKLEEIVCTSK

SwissProt top hits (columns: e value, % identity, alignment)
Q5G1S8 Pentatricopeptide repeat-containing protein At3g18110, chloroplastic (e value: 6.9e-08; identity: 25.11%)
Query:  VIQVLGKLENWRRVLQVID------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKK
        V++ +G+ E+W+R L+V +      + S   R +    L VLG+  +   A+ +F   +        +  Y+++     ++G   +  +++D M+     
Subjt:  VIQVLGKLENWRRVLQVID------FKSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKK

Query:  KFKTGALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWVSLAYLFVSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAF-CSPNLVTY
          + G +        PD++ +N ++NA +K     G    +LA   V  ++ +     +P  +TY  L+ AC    NL  AV +F  M+A  C P+L TY
Subjt:  KFKTGALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWVSLAYLFVSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAF-CSPNLVTY

Query:  N-IFSCFLEHGMFEEARELFQNLSEHG
        N + S +   G+  EA  LF  L   G
Subjt:  N-IFSCFLEHGMFEEARELFQNLSEHG

Q9FJW6 Pentatricopeptide repeat-containing protein At5g67570, chloroplastic (e value: 1.9e-34; identity: 28.1%)
Query:  QMLSKRVIQVLGKLENWRRVLQVIDF-------KSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDT
        QML  +++  LG+ ++W++   V+ +       K  + RF+YT  L VLG ARRP EAL +F+ M      YP++ AYH IA TLGQAG ++EL  VI+ 
Subjt:  QMLSKRVIQVLGKLENWRRVLQVIDF-------KSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDT

Query:  MQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWVSL---------------------------------------------------
        M+  P K  K    + WDP L+PD+V+YNA+LNACV    W+   WV +                                                   
Subjt:  MQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWVSL---------------------------------------------------

Query:  -------------AYLFVSDVEK-------------ICKVAN--------------------KPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNL
                     A   V D+E+              C + N                    +PL +T+TGLI A L+  ++   + IF +MK  C PN+
Subjt:  -------------AYLFVSDVEK-------------ICKVAN--------------------KPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNL

Query:  VTYN-IFSCFLEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRH
         T N +   +  + MF EA+ELF+ +         VS     ++PN           YT++ ML+ S    +W+ F + Y  M+  GY  +  +H
Subjt:  VTYN-IFSCFLEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRH

Q9LQ15 Pentatricopeptide repeat-containing protein At1g62914, mitochondrial    4.8e-09    25.93
Query:  IYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKN
        IY+T +D L K R   +ALN+F  M+      PN++ Y S+   L   G   +   ++  M            +E+   ++ P++V ++A+++A VK+  
Subjt:  IYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKN

Query:  WEGAFWVSLAYLFVSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHM-KAFCSPNLVTYN-IFSCFLEHGMFEEARELFQNLSEHGRNIGTV
              V    L+    E++ K +  P + TY+ LI        L  A  +   M +  C PN+VTYN + + F +    ++  ELF+ +S+ G    TV
Subjt:  WEGAFWVSLAYLFVSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHM-KAFCSPNLVTYN-IFSCFLEHGMFEEARELFQNLSEHGRNIGTV

Query:  S--------------DYRDLVLPNRVVIGIFLPDIYTFNTMLD
        +              D   +V    V +G+  P+I T+N +LD
Subjt:  S--------------DYRDLVLPNRVVIGIFLPDIYTFNTMLD

Q9SA76 Pentatricopeptide repeat-containing protein At1g30610, chloroplastic    7.7e-108    40.78
Query:  EESSLEVEHTAFNNFD-ALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLGKLENWRRVLQVID-------F
        ++S   +E  AF   D + DI+DKP  S++EME+RI+ L+K                               R+I  LGKL NWRRVLQVI+       +
Subjt:  EESSLEVEHTAFNNFD-ALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLGKLENWRRVLQVID-------F

Query:  KSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLN
        KS+K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYP++VAY SIA TLGQAG+++ELF VIDTM+SPPKKKFK   LEKWDPRL+PDVV+YNAVLN
Subjt:  KSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLN

Query:  ACVKRKNWEGAFWV----------------------------------------------SLAYL-----------------------------------
        ACV+RK WEGAFWV                                              +LAY                                    
Subjt:  ACVKRKNWEGAFWV----------------------------------------------SLAYL-----------------------------------

Query:  --------------------FVSDV----------------------EKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-F
                            FV+ V                      +KIC+VANKPLVVTYTGLIQAC+DS N+ +A YIF+ MK  CSPNLVT NI  
Subjt:  --------------------FVSDV----------------------EKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-F

Query:  SCFLEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKH
          +L+ G+FEEARELFQ +SE G +I   SD+   V          LPD YTFNTMLDT   +++WDDFGY Y +ML +GYHFN KRHL       R   
Subjt:  SCFLEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKH

Query:  NELLKTTWKHLAQADRTPPPPLIR--------------------------ESSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFL-TRNNSPNPVFQ
         E+++ TW+H+ +++R PP PLI+                          E ++   FS SAW  +L   RF +D+V++L+  V+  L +R+ S + V  
Subjt:  NELLKTTWKHLAQADRTPPPPLIR--------------------------ESSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFL-TRNNSPNPVFQ

Query:  NLLLSCKEFCRTR
        NLL SCK++ +TR
Subjt:  NLLLSCKEFCRTR

Q9SH26 Pentatricopeptide repeat-containing protein At1g63400    4.3e-10    27.98
Query:  IYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKN
        IY+T +D L K R   +ALN+F  M+      PN++ Y S+   L       +   ++  M            +E+   ++ P+VV +NA+++A VK   
Subjt:  IYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKN

Query:  WEGAFWVSLAYLFVSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHM-KAFCSPNLVTYN-IFSCFLEHGMFEEARELFQNLSEHGRNIGTV
         EG   V    L+    +++ K +  P + TY+ LI        L  A ++F  M    C PN+VTYN + + F +    +E  ELF+ +S+ G    TV
Subjt:  WEGAFWVSLAYLFVSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHM-KAFCSPNLVTYN-IFSCFLEHGMFEEARELFQNLSEHGRNIGTV

Query:  S--------------DYRDLVLPNRVVIGIFLPDIYTFNTMLD
        +              D   +V    V  G+  P+I T+NT+LD
Subjt:  S--------------DYRDLVLPNRVVIGIFLPDIYTFNTMLD

Arabidopsis top hits    e value    % identity    Alignment
AT1G30610.1 pentatricopeptide (PPR) repeat-containing protein    5.4e-109    40.78
Query:  EESSLEVEHTAFNNFD-ALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLGKLENWRRVLQVID-------F
        ++S   +E  AF   D + DI+DKP  S++EME+RI+ L+K                               R+I  LGKL NWRRVLQVI+       +
Subjt:  EESSLEVEHTAFNNFD-ALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLGKLENWRRVLQVID-------F

Query:  KSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLN
        KS+K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYP++VAY SIA TLGQAG+++ELF VIDTM+SPPKKKFK   LEKWDPRL+PDVV+YNAVLN
Subjt:  KSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLN

Query:  ACVKRKNWEGAFWV----------------------------------------------SLAYL-----------------------------------
        ACV+RK WEGAFWV                                              +LAY                                    
Subjt:  ACVKRKNWEGAFWV----------------------------------------------SLAYL-----------------------------------

Query:  --------------------FVSDV----------------------EKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-F
                            FV+ V                      +KIC+VANKPLVVTYTGLIQAC+DS N+ +A YIF+ MK  CSPNLVT NI  
Subjt:  --------------------FVSDV----------------------EKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-F

Query:  SCFLEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKH
          +L+ G+FEEARELFQ +SE G +I   SD+   V          LPD YTFNTMLDT   +++WDDFGY Y +ML +GYHFN KRHL       R   
Subjt:  SCFLEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKH

Query:  NELLKTTWKHLAQADRTPPPPLIR--------------------------ESSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFL-TRNNSPNPVFQ
         E+++ TW+H+ +++R PP PLI+                          E ++   FS SAW  +L   RF +D+V++L+  V+  L +R+ S + V  
Subjt:  NELLKTTWKHLAQADRTPPPPLIR--------------------------ESSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFL-TRNNSPNPVFQ

Query:  NLLLSCKEFCRTR
        NLL SCK++ +TR
Subjt:  NLLLSCKEFCRTR

AT1G30610.2 pentatricopeptide (PPR) repeat-containing protein    3.1e-112    42.74
Query:  EESSLEVEHTAFNNFD-ALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLGKLENWRRVLQVID-------F
        ++S   +E  AF   D + DI+DKP  S++EME+RI+ L+K                               R+I  LGKL NWRRVLQVI+       +
Subjt:  EESSLEVEHTAFNNFD-ALDIMDKPRVSKMEMEERIQMLSK-------------------------------RVIQVLGKLENWRRVLQVID-------F

Query:  KSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLN
        KS+K+R IYTTAL+VLGK+RRPVEALNVFHAM    SSYP++VAY SIA TLGQAG+++ELF VIDTM+SPPKKKFK   LEKWDPRL+PDVV+YNAVLN
Subjt:  KSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLN

Query:  ACVKRKNWEGAFWV----------------------------------------------SLAYLF-----------------VSDVE------------
        ACV+RK WEGAFWV                                              +LAY                   V D+E            
Subjt:  ACVKRKNWEGAFWV----------------------------------------------SLAYLF-----------------VSDVE------------

Query:  --------------------KICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHGMFEEARELFQNLSEHGRNIGT
                            KIC+VANKPLVVTYTGLIQAC+DS N+ +A YIF+ MK  CSPNLVT NI    +L+ G+FEEARELFQ +SE G +I  
Subjt:  --------------------KICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNI-FSCFLEHGMFEEARELFQNLSEHGRNIGT

Query:  VSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTTWKHLAQADRTPPPPLIR----
         SD+   V          LPD YTFNTMLDT   +++WDDFGY Y +ML +GYHFN KRHL       R    E+++ TW+H+ +++R PP PLI+    
Subjt:  VSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHL-----GGRKKHNELLKTTWKHLAQADRTPPPPLIR----

Query:  ----------------------ESSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFL-TRNNSPNPVFQNLLLSCKEFCRTR
                              E ++   FS SAW  +L   RF +D+V++L+  V+  L +R+ S + V  NLL SCK++ +TR
Subjt:  ----------------------ESSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFL-TRNNSPNPVFQNLLLSCKEFCRTR

AT1G62914.1 pentatricopeptide (PPR) repeat-containing protein    3.4e-10    25.93
Query:  IYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKN
        IY+T +D L K R   +ALN+F  M+      PN++ Y S+   L   G   +   ++  M            +E+   ++ P++V ++A+++A VK+  
Subjt:  IYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKN

Query:  WEGAFWVSLAYLFVSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHM-KAFCSPNLVTYN-IFSCFLEHGMFEEARELFQNLSEHGRNIGTV
              V    L+    E++ K +  P + TY+ LI        L  A  +   M +  C PN+VTYN + + F +    ++  ELF+ +S+ G    TV
Subjt:  WEGAFWVSLAYLFVSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHM-KAFCSPNLVTYN-IFSCFLEHGMFEEARELFQNLSEHGRNIGTV

Query:  S--------------DYRDLVLPNRVVIGIFLPDIYTFNTMLD
        +              D   +V    V +G+  P+I T+N +LD
Subjt:  S--------------DYRDLVLPNRVVIGIFLPDIYTFNTMLD

AT1G63400.1 Pentatricopeptide repeat (PPR) superfamily protein    3.1e-11    27.98
Query:  IYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKN
        IY+T +D L K R   +ALN+F  M+      PN++ Y S+   L       +   ++  M            +E+   ++ P+VV +NA+++A VK   
Subjt:  IYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKN

Query:  WEGAFWVSLAYLFVSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHM-KAFCSPNLVTYN-IFSCFLEHGMFEEARELFQNLSEHGRNIGTV
         EG   V    L+    +++ K +  P + TY+ LI        L  A ++F  M    C PN+VTYN + + F +    +E  ELF+ +S+ G    TV
Subjt:  WEGAFWVSLAYLFVSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHM-KAFCSPNLVTYN-IFSCFLEHGMFEEARELFQNLSEHGRNIGTV

Query:  S--------------DYRDLVLPNRVVIGIFLPDIYTFNTMLD
        +              D   +V    V  G+  P+I T+NT+LD
Subjt:  S--------------DYRDLVLPNRVVIGIFLPDIYTFNTMLD

AT5G67570.1 Tetratricopeptide repeat (TPR)-like superfamily protein    1.4e-35    28.1
Query:  QMLSKRVIQVLGKLENWRRVLQVIDF-------KSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDT
        QML  +++  LG+ ++W++   V+ +       K  + RF+YT  L VLG ARRP EAL +F+ M      YP++ AYH IA TLGQAG ++EL  VI+ 
Subjt:  QMLSKRVIQVLGKLENWRRVLQVIDF-------KSHKLRFIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDT

Query:  MQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWVSL---------------------------------------------------
        M+  P K  K    + WDP L+PD+V+YNA+LNACV    W+   WV +                                                   
Subjt:  MQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWVSL---------------------------------------------------

Query:  -------------AYLFVSDVEK-------------ICKVAN--------------------KPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNL
                     A   V D+E+              C + N                    +PL +T+TGLI A L+  ++   + IF +MK  C PN+
Subjt:  -------------AYLFVSDVEK-------------ICKVAN--------------------KPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNL

Query:  VTYN-IFSCFLEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRH
         T N +   +  + MF EA+ELF+ +         VS     ++PN           YT++ ML+ S    +W+ F + Y  M+  GY  +  +H
Subjt:  VTYN-IFSCFLEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDIYTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRH


Sequences
CDS sequence
ATGGTGGGAGTGATAATTACGAATGCAAATTTGTGTATTCCTTGTTGTGGAATGGGTTTCCAGCACTACATTATACACAGAATTCCTATCATTTGTTCGGTTTTTCAGTT
TTCCGGTGTAGGGGACATAAATGTGGAGCAATTAAGGCCTCGACAAAGGGGTAATCTGATCTTCGATTTCCAAATGGGAATCTCCTCGAAAAGGATTTTCAATTTAAGCC
ATCCTTCGATGAATATGGAGAGGTTGCTTTGCAAGGAAAAAACGAGTACAAAGAGTGCTGAAAGCAAAGTGACTAATGTTCAAGGTAATGTGGATGTAAAGAACATGCTT
AAACGTGTTGTTCGGAAAGATTTGTTCAATAATCCAGAGAGAATTATGCTTAAAAAAGATCTTTCAGGAAATAAATTTGATAACAAAAGGAAAGGAGTGACAAGATCAAA
GGATGAGGTTAAAGGCAAGGTAACCCCTTTTGATTCGCAGGTTAATGATAAAAAACAGGAAGAGAGAAGGAAAGGAAACTGGTCGAATTACATTGAGCCAAAAGTACGAA
GGTTGAGCAATGAGACTCTACATTTTAAGGCCAATACTTTGGATGTCAAAAGAGAAAAGCACCGAGTATGTGATGGAAGTTCCGTGAAAATGTCGGAAAAGACTTGGGAT
GATGATGACACGAAACCAGCTAAGGATATTCTGAAGGTTGGGAAATTTGGTGTTCAGCTTGCGAGGAACTATATTCCAGGCGACAAGGTTGGGTTATCCAAAAGTGGTAA
GCCATTCCTTGAAATTACTGAAGAGAGTAGCTTGGAGGTAGAACATACAGCCTTCAACAATTTTGATGCATTAGACATCATGGATAAACCGAGAGTTTCCAAGATGGAAA
TGGAAGAGAGAATTCAGATGCTTTCAAAGAGGGTTATTCAAGTTTTGGGTAAGCTAGAAAATTGGAGGCGAGTGCTACAAGTCATCGATTTCAAGTCACACAAGCTAAGA
TTTATATACACCACTGCCCTTGATGTACTTGGGAAAGCGAGGAGACCCGTCGAGGCACTGAATGTATTCCATGCGATGCAGCAACACTTTTCCTCATATCCTAACTTAGT
AGCATATCATAGTATTGCTTTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGACGTGATTGATACCATGCAGTCTCCTCCAAAGAAGAAATTTAAAACAGGGG
CACTTGAAAAGTGGGACCCACGGCTGCAACCTGATGTAGTTATCTATAATGCGGTTTTAAATGCTTGTGTTAAGCGAAAGAATTGGGAAGGGGCATTTTGGGTCTCATTG
GCATATTTATTTGTTAGTGATGTTGAGAAGATATGTAAAGTTGCTAATAAGCCTCTTGTAGTGACTTACACTGGTTTGATTCAAGCTTGTTTGGACTCAAAAAACTTACA
TAGTGCAGTCTATATATTCAACCACATGAAGGCCTTCTGCTCTCCCAATCTCGTTACTTATAATATATTTAGCTGTTTCCTGGAACATGGGATGTTTGAAGAAGCTAGAG
AGCTGTTTCAGAATTTGTCAGAACACGGACGAAATATCGGTACTGTATCTGACTATAGGGATCTAGTATTACCAAATAGAGTTGTAATTGGTATTTTTCTACCAGATATC
TACACGTTCAACACCATGCTAGATACATCTTTTGCAGAAAAAAGATGGGATGATTTTGGCTATTTCTATAACCAAATGCTTGCTTATGGGTATCACTTCAACCCAAAACG
TCATCTTGGAGGCCGGAAGAAACATAATGAGCTTCTGAAAACGACATGGAAGCACCTAGCTCAGGCTGATCGGACTCCGCCACCTCCGCTCATTAGAGAAAGCAGCGATG
CACATCATTTCTCTGAGTCAGCTTGGCTAAATTTACTCAAAGAGAAGAGGTTTCCTGAGGATACTGTTATTCAATTAATTCATAAGGTTAGCATCTTTCTTACTAGAAAT
AACTCACCAAATCCAGTGTTTCAGAATCTGCTATTGAGTTGTAAAGAATTTTGCAGAACTAGAATTAGTGTAGGTGACCATAAACTTGAAGAAATTGTTTGTACATCGAA
ACTCAACCTGCTGTACTAA
mRNA sequence
ATGGTGGGAGTGATAATTACGAATGCAAATTTGTGTATTCCTTGTTGTGGAATGGGTTTCCAGCACTACATTATACACAGAATTCCTATCATTTGTTCGGTTTTTCAGTT
TTCCGGTGTAGGGGACATAAATGTGGAGCAATTAAGGCCTCGACAAAGGGGTAATCTGATCTTCGATTTCCAAATGGGAATCTCCTCGAAAAGGATTTTCAATTTAAGCC
ATCCTTCGATGAATATGGAGAGGTTGCTTTGCAAGGAAAAAACGAGTACAAAGAGTGCTGAAAGCAAAGTGACTAATGTTCAAGGTAATGTGGATGTAAAGAACATGCTT
AAACGTGTTGTTCGGAAAGATTTGTTCAATAATCCAGAGAGAATTATGCTTAAAAAAGATCTTTCAGGAAATAAATTTGATAACAAAAGGAAAGGAGTGACAAGATCAAA
GGATGAGGTTAAAGGCAAGGTAACCCCTTTTGATTCGCAGGTTAATGATAAAAAACAGGAAGAGAGAAGGAAAGGAAACTGGTCGAATTACATTGAGCCAAAAGTACGAA
GGTTGAGCAATGAGACTCTACATTTTAAGGCCAATACTTTGGATGTCAAAAGAGAAAAGCACCGAGTATGTGATGGAAGTTCCGTGAAAATGTCGGAAAAGACTTGGGAT
GATGATGACACGAAACCAGCTAAGGATATTCTGAAGGTTGGGAAATTTGGTGTTCAGCTTGCGAGGAACTATATTCCAGGCGACAAGGTTGGGTTATCCAAAAGTGGTAA
GCCATTCCTTGAAATTACTGAAGAGAGTAGCTTGGAGGTAGAACATACAGCCTTCAACAATTTTGATGCATTAGACATCATGGATAAACCGAGAGTTTCCAAGATGGAAA
TGGAAGAGAGAATTCAGATGCTTTCAAAGAGGGTTATTCAAGTTTTGGGTAAGCTAGAAAATTGGAGGCGAGTGCTACAAGTCATCGATTTCAAGTCACACAAGCTAAGA
TTTATATACACCACTGCCCTTGATGTACTTGGGAAAGCGAGGAGACCCGTCGAGGCACTGAATGTATTCCATGCGATGCAGCAACACTTTTCCTCATATCCTAACTTAGT
AGCATATCATAGTATTGCTTTCACTCTTGGACAAGCAGGATATATGAGGGAACTCTTTGACGTGATTGATACCATGCAGTCTCCTCCAAAGAAGAAATTTAAAACAGGGG
CACTTGAAAAGTGGGACCCACGGCTGCAACCTGATGTAGTTATCTATAATGCGGTTTTAAATGCTTGTGTTAAGCGAAAGAATTGGGAAGGGGCATTTTGGGTCTCATTG
GCATATTTATTTGTTAGTGATGTTGAGAAGATATGTAAAGTTGCTAATAAGCCTCTTGTAGTGACTTACACTGGTTTGATTCAAGCTTGTTTGGACTCAAAAAACTTACA
TAGTGCAGTCTATATATTCAACCACATGAAGGCCTTCTGCTCTCCCAATCTCGTTACTTATAATATATTTAGCTGTTTCCTGGAACATGGGATGTTTGAAGAAGCTAGAG
AGCTGTTTCAGAATTTGTCAGAACACGGACGAAATATCGGTACTGTATCTGACTATAGGGATCTAGTATTACCAAATAGAGTTGTAATTGGTATTTTTCTACCAGATATC
TACACGTTCAACACCATGCTAGATACATCTTTTGCAGAAAAAAGATGGGATGATTTTGGCTATTTCTATAACCAAATGCTTGCTTATGGGTATCACTTCAACCCAAAACG
TCATCTTGGAGGCCGGAAGAAACATAATGAGCTTCTGAAAACGACATGGAAGCACCTAGCTCAGGCTGATCGGACTCCGCCACCTCCGCTCATTAGAGAAAGCAGCGATG
CACATCATTTCTCTGAGTCAGCTTGGCTAAATTTACTCAAAGAGAAGAGGTTTCCTGAGGATACTGTTATTCAATTAATTCATAAGGTTAGCATCTTTCTTACTAGAAAT
AACTCACCAAATCCAGTGTTTCAGAATCTGCTATTGAGTTGTAAAGAATTTTGCAGAACTAGAATTAGTGTAGGTGACCATAAACTTGAAGAAATTGTTTGTACATCGAA
ACTCAACCTGCTGTACTAA
Protein sequence
MVGVIITNANLCIPCCGMGFQHYIIHRIPIICSVFQFSGVGDINVEQLRPRQRGNLIFDFQMGISSKRIFNLSHPSMNMERLLCKEKTSTKSAESKVTNVQGNVDVKNML
KRVVRKDLFNNPERIMLKKDLSGNKFDNKRKGVTRSKDEVKGKVTPFDSQVNDKKQEERRKGNWSNYIEPKVRRLSNETLHFKANTLDVKREKHRVCDGSSVKMSEKTWD
DDDTKPAKDILKVGKFGVQLARNYIPGDKVGLSKSGKPFLEITEESSLEVEHTAFNNFDALDIMDKPRVSKMEMEERIQMLSKRVIQVLGKLENWRRVLQVIDFKSHKLR
FIYTTALDVLGKARRPVEALNVFHAMQQHFSSYPNLVAYHSIAFTLGQAGYMRELFDVIDTMQSPPKKKFKTGALEKWDPRLQPDVVIYNAVLNACVKRKNWEGAFWVSL
AYLFVSDVEKICKVANKPLVVTYTGLIQACLDSKNLHSAVYIFNHMKAFCSPNLVTYNIFSCFLEHGMFEEARELFQNLSEHGRNIGTVSDYRDLVLPNRVVIGIFLPDI
YTFNTMLDTSFAEKRWDDFGYFYNQMLAYGYHFNPKRHLGGRKKHNELLKTTWKHLAQADRTPPPPLIRESSDAHHFSESAWLNLLKEKRFPEDTVIQLIHKVSIFLTRN
NSPNPVFQNLLLSCKEFCRTRISVGDHKLEEIVCTSKLNLLY