; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0030248 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0030248
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationchr8:45751221..45753263
RNA-Seq ExpressionLag0030248
SyntenyLag0030248
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016032.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma]9.0e-20759.46Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA
        KSKFSNHDSLVLLNHV+ AYSKCSDI AA RLFD+M QRNIFSW VII GLA+NGLF DGFE FCEMQ + IFPDQFAYSG+LQICIGLE IELG+MV A
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA

Query:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------
        QIV+RGF SHT VST+LLNMYAKLQ ++DSY+VF TMTEVN+VSWNAMISGFT NG Y +AFDHFLRMKGEGV PD QT I +AK               
Subjt:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------

Query:  ------------------------------------SYRACCR--------------------------------------------------------E
                                            S+   CR                                                        +
Subjt:  ------------------------------------SYRACCR--------------------------------------------------------E

Query:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD
        +     +K GLEVN+ISISNAV NAY KCGSL+D+RKV +Y MEERDLVSWTTLVTAYSQCSEWDKAIEIFSNM EEGF PNQFAFSSVL+SCASLCLL+
Subjt:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD

Query:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV
         GQQVHGF+ KVGLDM+KCI S LI+MYAKCG+LAEAKKVFD+IS+ DT+SWT + +GHAQ+ +VDDA +LFRRME                   HGG V
Subjt:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV

Query:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG
        EEGLQYFKLMKETYGLVP MEHYSCIVDLLSRVG LNDAMEFI                                    +F       Y      Y+ESG
Subjt:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG

Query:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANS
         +KDGL+LRHVMKEQG+KKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLK NS
Subjt:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANS

XP_022134356.1 pentatricopeptide repeat-containing protein At2g33680-like [Momordica charantia]1.9e-20459.67Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA
        KS+FSN DSLVLLNHV+HAYSKCSDIGAA RLFDQM QRNIFSW VIIVGLAENGLF DGFE FCEMQ +GIFPDQFAYSGI++ICIGLE IELGRMV A
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA

Query:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKS--------------
        QIV RGF SHT VST+LL+MY+KLQ +EDSYKVF TMTEVN+VSWNAMISGF  N  Y EAFDHFLRMKGEG +PD  T IGVAK+              
Subjt:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKS--------------

Query:  --------------------------------------------------------------------------------YRAC-------------CRE
                                                                                        Y  C               +
Subjt:  --------------------------------------------------------------------------------YRAC-------------CRE

Query:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD
        +     +K G EVNHISISNAV NAY KCGSLDDVRKV+ YRMEERDLVSWTTLVTAYSQ SEWDKAIEIFSNM EEGF PNQF FSSVL+SCASLCLL+
Subjt:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD

Query:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV
         GQQVHGFL KVGLDM+K I S LI+MYAKCG+LAEAKKVFDRISN DTVSWT + SGHAQ+ IVDDA  LFR+ME                   HGG V
Subjt:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV

Query:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG
        EEGLQYFKLMKETYGL PEMEHYSCIVDLLSRVG LNDA+EFI+                                   +F       Y      Y+ESG
Subjt:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG

Query:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL
         ++DGL+LRHVMK++G+KKEPGCSWISVNG LHKFYAGD QHPEKDKIYAKLEELRLK NSL
Subjt:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL

XP_022939229.1 pentatricopeptide repeat-containing protein At3g16610-like [Cucurbita moschata]9.0e-20759.46Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA
        KSKFSNHDSLVLLNHV+ AYSKCSDI AA RLFD+M QRNIFSW VII GLA+NGLF DGFE FCEMQ + IFPDQFAYSG+LQICIGLE IELG+MV A
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA

Query:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------
        QIV+RGF SHT VST+LLNMYAKLQ ++DSY+VF TMTEVN+VSWNAMISGFT NG Y +AFDHFLRMKGEGV PD QT I +AK               
Subjt:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------

Query:  ------------------------------------SYRACCR--------------------------------------------------------E
                                            S+   CR                                                        +
Subjt:  ------------------------------------SYRACCR--------------------------------------------------------E

Query:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD
        +     +K GLEVN+ISISNAV NAY KCGSL+D+RKV +Y MEERDLVSWTTLVTAYSQCSEWDKAIEIFSNM EEGF PNQFAFSSVL+SCASLCLL+
Subjt:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD

Query:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV
         GQQVHGF+ KVGLDM+KCI S LI+MYAKCG+LAEAKKVFD+IS+ DT+SWT + +GHAQ+ +VDDA +LFRRME                   HGG V
Subjt:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV

Query:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG
        EEGLQYFKLMKETYGLVP MEHYSCIVDLLSRVG LNDAMEFI                                    +F       Y      Y+ESG
Subjt:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG

Query:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANS
         +KDGL+LRHVMKEQG+KKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLK NS
Subjt:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANS

XP_022993736.1 pentatricopeptide repeat-containing protein At2g27610-like [Cucurbita maxima]1.8e-20759.52Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA
        KSK SNHDS+VLLNHV+ AYSKCSDI AA RLFD+M QRNIFSW VII GLA+NGLF DGFE FCEMQ + IFPDQFAYSG+LQICIGLE IELG+MV A
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA

Query:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------
        QIV+RGF SHT VST+LLNMYAKLQ ++DSY+VF TMTEVN+VSWNAMISGFT NG Y +AFDHFLRMKGEGV PD QT I +AK               
Subjt:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------

Query:  ------------------------------------SYRACCR--------------------------------------------------------E
                                            S+   CR                                                        +
Subjt:  ------------------------------------SYRACCR--------------------------------------------------------E

Query:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD
        +     +K GLEVN+ISISNAV NAY KCGSL+D+RKV +Y MEERDLVSWTTLVTAYSQCSEWDKAIEIFSNM EEGF PNQFAFSSVLVSCASLCLL+
Subjt:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD

Query:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV
         GQQVHGF+CKVGLDM+KCI S LI+MYAKCG+LAEAKK FD+IS+ DTVSWT + +GHAQ+ +VD+A +LFRRME                   HGG V
Subjt:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV

Query:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG
        EEGLQYFKLMKETYGLVP MEHYSCIVDLLSRVG LNDAMEFI                                    +F       Y      Y+ESG
Subjt:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG

Query:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL
        I+K+GL+LRHVMKEQG+KKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL
Subjt:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL

XP_023549692.1 pentatricopeptide repeat-containing protein At3g16610-like [Cucurbita pepo subsp. pepo]5.8e-20659.15Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA
        KSKFSNHDSLVLLNHV+ AYSKCSDI AA RLFD+M QRNIFSW VII GLA+NGLF DGFE FCEMQ + IFPDQFAYSG+LQICIGLE IELG+MV A
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA

Query:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------
        QIV+RGF SHT VST+LLNMYAKLQ ++DSY+VF TMTEVN+VSWNAMISGFT NG Y +AFDHFLRMKGEGV PD QT I +AK               
Subjt:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------

Query:  ------------------------------------SYRACCR--------------------------------------------------------E
                                            S+   CR                                                        +
Subjt:  ------------------------------------SYRACCR--------------------------------------------------------E

Query:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD
        +     +K GLEVN+ISISNAV NAY KCGSL+D+RKV +Y MEERDLVSWTTLVTAYSQCSEWDKAIEIFSNM EEGF PNQFAFSSVLVSCASLCLL+
Subjt:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD

Query:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV
         GQQVHG +CKVGLDM+KCI S LI+MYAKCG+LAEAKKVFD+IS+ DT+SWT + +GHAQ+ +VDDA +LFRRM+                   HGG V
Subjt:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV

Query:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG
        EEGLQYFKLMKETYGLVP MEHYSCIVDLLSRVG LNDAMEFI                                    +F       Y      Y+ESG
Subjt:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG

Query:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANS
         +KDGL+LR+VMKEQG+KKEPGCSWI+VNGTLHKFYAGDQQHPEKDKIYAKLEELRLK NS
Subjt:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANS

TrEMBL top hitse value%identityAlignment
A0A0A0KBQ4 Uncharacterized protein1.1e-20258.31Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA
        KSKFSNH SLVLLNHV+HAYSKCSDI AA RLFDQM QRN FSW V+I GLAENGLF DGFE FCEMQ +GIFPDQFAYSGILQICIGL+ IELG MV A
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA

Query:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------
        QIV+RGF SHT VST+LLNMYAKLQ +EDSYKVF TMTEVN+VSWNAMI+GFT N  YL+AFD FLRM GEGV PD QT IGVAK               
Subjt:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------

Query:  ------------------------------------SYRACCR--------------------------------------------------------E
                                            S+   CR                                                        +
Subjt:  ------------------------------------SYRACCR--------------------------------------------------------E

Query:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD
        +     +K GLEVN++SISNAV NAY KCGSL+DVRKV + RME+RDL+SWT+LVTAYSQCSEWDKAIEIFSNM  EG  PNQF FSSVLVSCA+LCLL+
Subjt:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD

Query:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV
         GQQVHG +CKVGLDM+KCI S L++MYAKCG L +AKKVF+RISN DTVSWT + +GHAQ+ IVDDA +LFRRM                    HGG V
Subjt:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV

Query:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG
        EEGLQYFKLMK+TYGLVPEMEHY+CIVDLLSRVGHLNDAMEFI                                    +F       Y      Y+ESG
Subjt:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG

Query:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL
         +KDGL+LRH+MKEQG+KKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEEL+LK  SL
Subjt:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL

A0A5D3C2B1 Pentatricopeptide repeat-containing protein1.6e-20158.31Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA
        KSKFSNHDSLVLLNHV+HAYSKCSDI AA R+FDQM QRNIFSW  II GLAENGLF DGFE FCEMQ +GIFPD FAYSGILQICIGL+ +ELG+MV A
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA

Query:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------
        QIV+RGF SHT VST+LLNMYAKLQ +EDS KVF TMTEVN+VSWNAMI+GFT NG YL+AFD FLRMKGEGV PD QT IGVAK               
Subjt:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------

Query:  ------------------------------------SYRACCR--------------------------------------------------------E
                                            S+   CR                                                        +
Subjt:  ------------------------------------SYRACCR--------------------------------------------------------E

Query:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD
        +     +K GLEVN +SISNAV NAY KCGSL+DVRKV + RME+RDL+SWT+LVTAYSQCSEWDKAIEIFSNM  EG+ PNQFAFSSVLVSCA+LCLL+
Subjt:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD

Query:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRME------------------HHGGRV
         GQQVHG +CKVGLDM+KCI S L++MYAKCG LA+AKKVF+RISN DTVSWT + +GHAQ+ IVDDA +LFRRM                    HGG V
Subjt:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRME------------------HHGGRV

Query:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG
        EEGLQYFKLMK+TYGLVPEMEHY+CIVDLLSRVG LNDAM FI                                    +F       Y      Y+ESG
Subjt:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG

Query:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL
         +KDGL+LRHVMKEQG+KKEPG SWISVNGTLHKFYAGDQQHPEKDKIYAKLEEL+LK  SL
Subjt:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL

A0A6J1BXL8 pentatricopeptide repeat-containing protein At2g33680-like9.1e-20559.67Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA
        KS+FSN DSLVLLNHV+HAYSKCSDIGAA RLFDQM QRNIFSW VIIVGLAENGLF DGFE FCEMQ +GIFPDQFAYSGI++ICIGLE IELGRMV A
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA

Query:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKS--------------
        QIV RGF SHT VST+LL+MY+KLQ +EDSYKVF TMTEVN+VSWNAMISGF  N  Y EAFDHFLRMKGEG +PD  T IGVAK+              
Subjt:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKS--------------

Query:  --------------------------------------------------------------------------------YRAC-------------CRE
                                                                                        Y  C               +
Subjt:  --------------------------------------------------------------------------------YRAC-------------CRE

Query:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD
        +     +K G EVNHISISNAV NAY KCGSLDDVRKV+ YRMEERDLVSWTTLVTAYSQ SEWDKAIEIFSNM EEGF PNQF FSSVL+SCASLCLL+
Subjt:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD

Query:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV
         GQQVHGFL KVGLDM+K I S LI+MYAKCG+LAEAKKVFDRISN DTVSWT + SGHAQ+ IVDDA  LFR+ME                   HGG V
Subjt:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV

Query:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG
        EEGLQYFKLMKETYGL PEMEHYSCIVDLLSRVG LNDA+EFI+                                   +F       Y      Y+ESG
Subjt:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG

Query:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL
         ++DGL+LRHVMK++G+KKEPGCSWISVNG LHKFYAGD QHPEKDKIYAKLEELRLK NSL
Subjt:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL

A0A6J1FGJ6 pentatricopeptide repeat-containing protein At3g16610-like4.3e-20759.46Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA
        KSKFSNHDSLVLLNHV+ AYSKCSDI AA RLFD+M QRNIFSW VII GLA+NGLF DGFE FCEMQ + IFPDQFAYSG+LQICIGLE IELG+MV A
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA

Query:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------
        QIV+RGF SHT VST+LLNMYAKLQ ++DSY+VF TMTEVN+VSWNAMISGFT NG Y +AFDHFLRMKGEGV PD QT I +AK               
Subjt:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------

Query:  ------------------------------------SYRACCR--------------------------------------------------------E
                                            S+   CR                                                        +
Subjt:  ------------------------------------SYRACCR--------------------------------------------------------E

Query:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD
        +     +K GLEVN+ISISNAV NAY KCGSL+D+RKV +Y MEERDLVSWTTLVTAYSQCSEWDKAIEIFSNM EEGF PNQFAFSSVL+SCASLCLL+
Subjt:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD

Query:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV
         GQQVHGF+ KVGLDM+KCI S LI+MYAKCG+LAEAKKVFD+IS+ DT+SWT + +GHAQ+ +VDDA +LFRRME                   HGG V
Subjt:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV

Query:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG
        EEGLQYFKLMKETYGLVP MEHYSCIVDLLSRVG LNDAMEFI                                    +F       Y      Y+ESG
Subjt:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG

Query:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANS
         +KDGL+LRHVMKEQG+KKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLK NS
Subjt:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANS

A0A6J1JX63 pentatricopeptide repeat-containing protein At2g27610-like8.8e-20859.52Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA
        KSK SNHDS+VLLNHV+ AYSKCSDI AA RLFD+M QRNIFSW VII GLA+NGLF DGFE FCEMQ + IFPDQFAYSG+LQICIGLE IELG+MV A
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA

Query:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------
        QIV+RGF SHT VST+LLNMYAKLQ ++DSY+VF TMTEVN+VSWNAMISGFT NG Y +AFDHFLRMKGEGV PD QT I +AK               
Subjt:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAK---------------

Query:  ------------------------------------SYRACCR--------------------------------------------------------E
                                            S+   CR                                                        +
Subjt:  ------------------------------------SYRACCR--------------------------------------------------------E

Query:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD
        +     +K GLEVN+ISISNAV NAY KCGSL+D+RKV +Y MEERDLVSWTTLVTAYSQCSEWDKAIEIFSNM EEGF PNQFAFSSVLVSCASLCLL+
Subjt:  RRFMPGLKLGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD

Query:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV
         GQQVHGF+CKVGLDM+KCI S LI+MYAKCG+LAEAKK FD+IS+ DTVSWT + +GHAQ+ +VD+A +LFRRME                   HGG V
Subjt:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV

Query:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG
        EEGLQYFKLMKETYGLVP MEHYSCIVDLLSRVG LNDAMEFI                                    +F       Y      Y+ESG
Subjt:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESG

Query:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL
        I+K+GL+LRHVMKEQG+KKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL
Subjt:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSL

SwissProt top hitse value%identityAlignment
Q9SIT7 Pentatricopeptide repeat-containing protein At2g136004.7e-7329.55Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELF-----------------------CE--------MQG
        KS FSN   + + N +  AYSKC  +    ++FD+M QRNI++W  ++ GL + G   +   LF                       CE        M  
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELF-----------------------CE--------MQG

Query:  RGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMK
         G   ++++++ +L  C GL  +  G  V + I    F S   + ++L++MY+K   V D+ +VF  M + N+VSWN++I+ F  NGP +EA D F  M 
Subjt:  RGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMK

Query:  GEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEV-----------NHISISNAVTNAYTKCGSLDDVR-----------------------------
           V PD  TL  V     AC      +  +K+G EV           N I +SNA  + Y KC  + + R                             
Subjt:  GEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEV-----------NHISISNAVTNAYTKCGSLDDVR-----------------------------

Query:  -KVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQV------HGFLCKVGLDMNKCIGSGLINMYA
         ++++ +M ER++VSW  L+  Y+Q  E ++A+ +F  +  E   P  ++F+++L +CA L  L  G Q       HGF  + G + +  +G+ LI+MY 
Subjt:  -KVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQV------HGFLCKVGLDMNKCIGSGLINMYA

Query:  KCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRM-------EH-----------HGGRVEEGLQYFKLMKETYGLVPEMEHYSCIVDL
        KCG + E   VF ++   D VSW  +  G AQ    ++A ELFR M       +H           H G VEEG  YF  M   +G+ P  +HY+C+VDL
Subjt:  KCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRM-------EH-----------HGGRVEEGLQYFKLMKETYGLVPEMEHYSCIVDL

Query:  LSRVGHLNDAMEFIKDAFFQ-------------------------SRKLC---------YPCSFIQYLESGIFKDGLNLRHVMKEQGIKKEPGCSWISVN
        L R G L +A   I++   Q                         + KL          Y      Y E G ++D +N+R  M+++G+ K+PGCSWI + 
Subjt:  LSRVGHLNDAMEFIKDAFFQ-------------------------SRKLC---------YPCSFIQYLESGIFKDGLNLRHVMKEQGIKKEPGCSWISVN

Query:  GTLHKFYAGDQQHPEKDKIYAKLEEL
        G  H F   D+ HP K +I++ L+ L
Subjt:  GTLHKFYAGDQQHPEKDKIYAKLEEL

Q9SN39 Pentatricopeptide repeat-containing protein DOT4, chloroplastic5.7e-7130.47Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA
        KS F   +S+   N +   Y K   + +A ++FD+M +R++ SW  II G   NGL   G  +F +M   GI  D      +   C     I LGR V +
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA

Query:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGV---AKSYRACCRERRFM
          V   F        +LL+MY+K   ++ +  VF+ M++ ++VS+ +MI+G+   G   EA   F  M+ EG+ PDV T+  V      YR     +R  
Subjt:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGV---AKSYRACCRERRFM

Query:  PGLK---LGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEE-GFTPNQFAFSSVLVSCASLCLLD
          +K   LG +   I +SNA+ + Y KCGS+ +  ++++  M  +D++SW T++  YS+    ++A+ +F+ + EE  F+P++   + VL +CASL   D
Subjt:  PGLK---LGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEE-GFTPNQFAFSSVLVSCASLCLLD

Query:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV
         G+++HG++ + G   ++ + + L++MYAKCG L  A  +FD I++ D VSWT + +G+  +    +A  LF +M                    H G V
Subjt:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV

Query:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIK------DAFFQSRKLC----------------------------YPCSFIQYLESG
        +EG ++F +M+    + P +EHY+CIVD+L+R G L  A  FI+      DA      LC                            Y      Y E+ 
Subjt:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIK------DAFFQSRKLC----------------------------YPCSFIQYLESG

Query:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLK
         ++    LR  + ++G++K PGCSWI + G ++ F AGD  +PE + I A L ++R +
Subjt:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLK

Q9STS9 Putative pentatricopeptide repeat-containing protein At3g478404.2e-7429.59Show/hide
Query:  YSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCVSTSLLN
        Y +   I  + R+F +M  RN  +W  II GL   G +++G   F EM       D + ++  L+ C GL  ++ G+ +   ++VRGF +  CV+ SL  
Subjt:  YSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCVSTSLLN

Query:  MYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPG-------LKLGLEVNHISIS
        MY +   ++D   +F+ M+E ++VSW ++I  +   G  ++A + F++M+   V P+ QT    A  + AC    R + G       L LGL  + +S+S
Subjt:  MYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPG-------LKLGLEVNHISIS

Query:  NAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKC
        N++   Y+ CG+L     V++  M  RD++SW+T++  Y Q    ++  + FS M + G  P  FA +S+L    ++ +++ G+QVH      GL+ N  
Subjt:  NAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKC

Query:  IGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRVEEGLQYFKLMKETYGLVPE
        + S LINMY+KCG++ EA  +F      D VS T + +G+A++    +A +LF +                     H G+++ G  YF +M+ETY + P 
Subjt:  IGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRVEEGLQYFKLMKETYGLVPE

Query:  MEHYSCIVDLLSRVGHLNDAMEFIKDAFFQSRKLCYPCSFIQ----------------------------------YLESGIFKDGLNLRHVMKEQGIKK
         EHY C+VDLL R G L+DA + I +  ++   + +    I                                   Y  +G  ++  N+R  MK +G+ K
Subjt:  MEHYSCIVDLLSRVGHLNDAMEFIKDAFFQSRKLCYPCSFIQ----------------------------------YLESGIFKDGLNLRHVMKEQGIKK

Query:  EPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLE
        EPG S I +   +  F +GD+ HP+ + IY  LE
Subjt:  EPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLE

Q9SVP7 Pentatricopeptide repeat-containing protein At4g136504.2e-7429.47Show/hide
Query:  YSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCVSTSLLN
        Y+KC+DI  A   F +    N+  W V++V        R+ F +F +MQ   I P+Q+ Y  IL+ CI L  +ELG  + +QI+   F+ +  V + L++
Subjt:  YSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCVSTSLLN

Query:  MYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEV------NHISISN
        MYAKL  ++ ++ +       ++VSW  MI+G+T      +A   F +M   G+  D    +G+  +  AC   +    G ++  +       + +   N
Subjt:  MYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEV------NHISISN

Query:  AVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLC-LLDGQQVHGFLCKVGLDMNKCI
        A+   Y++CG +++   + + + E  D ++W  LV+ + Q    ++A+ +F  M  EG   N F F S + + +    +  G+QVH  + K G D    +
Subjt:  AVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLC-LLDGQQVHGFLCKVGLDMNKCI

Query:  GSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRVEEGLQYFKLMKETYGLVPEM
         + LI+MYAKCG++++A+K F  +S  + VSW  + + ++++    +A + F +M H                  H G V++G+ YF+ M   YGL P+ 
Subjt:  GSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRVEEGLQYFKLMKETYGLVPEM

Query:  EHYSCIVDLLSRVGHLNDAMEFIKDAFFQS-----RKLCYPCSFIQYLESGIFK----------------------------DGLNL-RHVMKEQGIKKE
        EHY C+VD+L+R G L+ A EFI++   +      R L   C   + +E G F                             D  +L R  MKE+G+KKE
Subjt:  EHYSCIVDLLSRVGHLNDAMEFIKDAFFQS-----RKLCYPCSFIQYLESGIFK----------------------------DGLNL-RHVMKEQGIKKE

Query:  PGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSLG
        PG SWI V  ++H FY GDQ HP  D+I+   ++L  +A+ +G
Subjt:  PGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSLG

Q9ZUW3 Pentatricopeptide repeat-containing protein At2g276101.0e-7530.83Show/hide
Query:  NHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCV
        N + + Y KC ++  A  LFD+   +++ +W  +I G A NGL  +   +F  M+   +   + +++ ++++C  L+ +     +   +V  GF     +
Subjt:  NHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCV

Query:  STSLLNMYAKLQMVEDSYKVFKTMTEV-NIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEVNHISIS
         T+L+  Y+K   + D+ ++FK +  V N+VSW AMISGF  N    EA D F  MK +GV P+  T   +  +             +K   E    ++ 
Subjt:  STSLLNMYAKLQMVEDSYKVFKTMTEV-NIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEVNHISIS

Query:  NAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCA--SLCLLDGQQVHGFLCKVGLDMNK
         A+ +AY K G +++  KV +  ++++D+V+W+ ++  Y+Q  E + AI++F  + + G  PN+F FSS+L  CA  +  +  G+Q HGF  K  LD + 
Subjt:  NAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCA--SLCLLDGQQVHGFLCKVGLDMNK

Query:  CIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRVEEGLQYFKLMKETYGLVP
        C+ S L+ MYAK G +  A++VF R    D VSW ++ SG+AQ+     A ++F+ M+                   H G VEEG +YF +M     + P
Subjt:  CIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRVEEGLQYFKLMKETYGLVP

Query:  EMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESGIFKDGLNLRHVMKEQGIK
          EH SC+VDL SR G L  AM+ I++                                  A        Y      Y ESG +++   +R +M E+ +K
Subjt:  EMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESGIFKDGLNLRHVMKEQGIK

Query:  KEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSLG
        KEPG SWI V    + F AGD+ HP KD+IY KLE+L  +   LG
Subjt:  KEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSLG

Arabidopsis top hitse value%identityAlignment
AT2G13600.1 Pentatricopeptide repeat (PPR) superfamily protein3.3e-7429.55Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELF-----------------------CE--------MQG
        KS FSN   + + N +  AYSKC  +    ++FD+M QRNI++W  ++ GL + G   +   LF                       CE        M  
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELF-----------------------CE--------MQG

Query:  RGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMK
         G   ++++++ +L  C GL  +  G  V + I    F S   + ++L++MY+K   V D+ +VF  M + N+VSWN++I+ F  NGP +EA D F  M 
Subjt:  RGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMK

Query:  GEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEV-----------NHISISNAVTNAYTKCGSLDDVR-----------------------------
           V PD  TL  V     AC      +  +K+G EV           N I +SNA  + Y KC  + + R                             
Subjt:  GEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEV-----------NHISISNAVTNAYTKCGSLDDVR-----------------------------

Query:  -KVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQV------HGFLCKVGLDMNKCIGSGLINMYA
         ++++ +M ER++VSW  L+  Y+Q  E ++A+ +F  +  E   P  ++F+++L +CA L  L  G Q       HGF  + G + +  +G+ LI+MY 
Subjt:  -KVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQV------HGFLCKVGLDMNKCIGSGLINMYA

Query:  KCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRM-------EH-----------HGGRVEEGLQYFKLMKETYGLVPEMEHYSCIVDL
        KCG + E   VF ++   D VSW  +  G AQ    ++A ELFR M       +H           H G VEEG  YF  M   +G+ P  +HY+C+VDL
Subjt:  KCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRM-------EH-----------HGGRVEEGLQYFKLMKETYGLVPEMEHYSCIVDL

Query:  LSRVGHLNDAMEFIKDAFFQ-------------------------SRKLC---------YPCSFIQYLESGIFKDGLNLRHVMKEQGIKKEPGCSWISVN
        L R G L +A   I++   Q                         + KL          Y      Y E G ++D +N+R  M+++G+ K+PGCSWI + 
Subjt:  LSRVGHLNDAMEFIKDAFFQ-------------------------SRKLC---------YPCSFIQYLESGIFKDGLNLRHVMKEQGIKKEPGCSWISVN

Query:  GTLHKFYAGDQQHPEKDKIYAKLEEL
        G  H F   D+ HP K +I++ L+ L
Subjt:  GTLHKFYAGDQQHPEKDKIYAKLEEL

AT2G27610.1 Tetratricopeptide repeat (TPR)-like superfamily protein7.1e-7730.83Show/hide
Query:  NHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCV
        N + + Y KC ++  A  LFD+   +++ +W  +I G A NGL  +   +F  M+   +   + +++ ++++C  L+ +     +   +V  GF     +
Subjt:  NHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCV

Query:  STSLLNMYAKLQMVEDSYKVFKTMTEV-NIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEVNHISIS
         T+L+  Y+K   + D+ ++FK +  V N+VSW AMISGF  N    EA D F  MK +GV P+  T   +  +             +K   E    ++ 
Subjt:  STSLLNMYAKLQMVEDSYKVFKTMTEV-NIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEVNHISIS

Query:  NAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCA--SLCLLDGQQVHGFLCKVGLDMNK
         A+ +AY K G +++  KV +  ++++D+V+W+ ++  Y+Q  E + AI++F  + + G  PN+F FSS+L  CA  +  +  G+Q HGF  K  LD + 
Subjt:  NAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCA--SLCLLDGQQVHGFLCKVGLDMNK

Query:  CIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRVEEGLQYFKLMKETYGLVP
        C+ S L+ MYAK G +  A++VF R    D VSW ++ SG+AQ+     A ++F+ M+                   H G VEEG +YF +M     + P
Subjt:  CIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRVEEGLQYFKLMKETYGLVP

Query:  EMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESGIFKDGLNLRHVMKEQGIK
          EH SC+VDL SR G L  AM+ I++                                  A        Y      Y ESG +++   +R +M E+ +K
Subjt:  EMEHYSCIVDLLSRVGHLNDAMEFIKD----------------------------------AFFQSRKLCYPCSFIQYLESGIFKDGLNLRHVMKEQGIK

Query:  KEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSLG
        KEPG SWI V    + F AGD+ HP KD+IY KLE+L  +   LG
Subjt:  KEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSLG

AT3G47840.1 Tetratricopeptide repeat (TPR)-like superfamily protein3.0e-7529.59Show/hide
Query:  YSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCVSTSLLN
        Y +   I  + R+F +M  RN  +W  II GL   G +++G   F EM       D + ++  L+ C GL  ++ G+ +   ++VRGF +  CV+ SL  
Subjt:  YSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCVSTSLLN

Query:  MYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPG-------LKLGLEVNHISIS
        MY +   ++D   +F+ M+E ++VSW ++I  +   G  ++A + F++M+   V P+ QT    A  + AC    R + G       L LGL  + +S+S
Subjt:  MYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPG-------LKLGLEVNHISIS

Query:  NAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKC
        N++   Y+ CG+L     V++  M  RD++SW+T++  Y Q    ++  + FS M + G  P  FA +S+L    ++ +++ G+QVH      GL+ N  
Subjt:  NAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLD-GQQVHGFLCKVGLDMNKC

Query:  IGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRVEEGLQYFKLMKETYGLVPE
        + S LINMY+KCG++ EA  +F      D VS T + +G+A++    +A +LF +                     H G+++ G  YF +M+ETY + P 
Subjt:  IGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRVEEGLQYFKLMKETYGLVPE

Query:  MEHYSCIVDLLSRVGHLNDAMEFIKDAFFQSRKLCYPCSFIQ----------------------------------YLESGIFKDGLNLRHVMKEQGIKK
         EHY C+VDLL R G L+DA + I +  ++   + +    I                                   Y  +G  ++  N+R  MK +G+ K
Subjt:  MEHYSCIVDLLSRVGHLNDAMEFIKDAFFQSRKLCYPCSFIQ----------------------------------YLESGIFKDGLNLRHVMKEQGIKK

Query:  EPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLE
        EPG S I +   +  F +GD+ HP+ + IY  LE
Subjt:  EPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLE

AT4G13650.1 Pentatricopeptide repeat (PPR) superfamily protein3.0e-7529.47Show/hide
Query:  YSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCVSTSLLN
        Y+KC+DI  A   F +    N+  W V++V        R+ F +F +MQ   I P+Q+ Y  IL+ CI L  +ELG  + +QI+   F+ +  V + L++
Subjt:  YSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFESHTCVSTSLLN

Query:  MYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEV------NHISISN
        MYAKL  ++ ++ +       ++VSW  MI+G+T      +A   F +M   G+  D    +G+  +  AC   +    G ++  +       + +   N
Subjt:  MYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEV------NHISISN

Query:  AVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLC-LLDGQQVHGFLCKVGLDMNKCI
        A+   Y++CG +++   + + + E  D ++W  LV+ + Q    ++A+ +F  M  EG   N F F S + + +    +  G+QVH  + K G D    +
Subjt:  AVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLC-LLDGQQVHGFLCKVGLDMNKCI

Query:  GSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRVEEGLQYFKLMKETYGLVPEM
         + LI+MYAKCG++++A+K F  +S  + VSW  + + ++++    +A + F +M H                  H G V++G+ YF+ M   YGL P+ 
Subjt:  GSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRVEEGLQYFKLMKETYGLVPEM

Query:  EHYSCIVDLLSRVGHLNDAMEFIKDAFFQS-----RKLCYPCSFIQYLESGIFK----------------------------DGLNL-RHVMKEQGIKKE
        EHY C+VD+L+R G L+ A EFI++   +      R L   C   + +E G F                             D  +L R  MKE+G+KKE
Subjt:  EHYSCIVDLLSRVGHLNDAMEFIKDAFFQS-----RKLCYPCSFIQYLESGIFK----------------------------DGLNL-RHVMKEQGIKKE

Query:  PGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSLG
        PG SWI V  ++H FY GDQ HP  D+I+   ++L  +A+ +G
Subjt:  PGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSLG

AT4G18750.1 Pentatricopeptide repeat (PPR) superfamily protein4.0e-7230.47Show/hide
Query:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA
        KS F   +S+   N +   Y K   + +A ++FD+M +R++ SW  II G   NGL   G  +F +M   GI  D      +   C     I LGR V +
Subjt:  KSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRA

Query:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGV---AKSYRACCRERRFM
          V   F        +LL+MY+K   ++ +  VF+ M++ ++VS+ +MI+G+   G   EA   F  M+ EG+ PDV T+  V      YR     +R  
Subjt:  QIVVRGFESHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGV---AKSYRACCRERRFM

Query:  PGLK---LGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEE-GFTPNQFAFSSVLVSCASLCLLD
          +K   LG +   I +SNA+ + Y KCGS+ +  ++++  M  +D++SW T++  YS+    ++A+ +F+ + EE  F+P++   + VL +CASL   D
Subjt:  PGLK---LGLEVNHISISNAVTNAYTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEE-GFTPNQFAFSSVLVSCASLCLLD

Query:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV
         G+++HG++ + G   ++ + + L++MYAKCG L  A  +FD I++ D VSWT + +G+  +    +A  LF +M                    H G V
Subjt:  -GQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAEAKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEH------------------HGGRV

Query:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIK------DAFFQSRKLC----------------------------YPCSFIQYLESG
        +EG ++F +M+    + P +EHY+CIVD+L+R G L  A  FI+      DA      LC                            Y      Y E+ 
Subjt:  EEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIK------DAFFQSRKLC----------------------------YPCSFIQYLESG

Query:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLK
         ++    LR  + ++G++K PGCSWI + G ++ F AGD  +PE + I A L ++R +
Subjt:  IFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCAAAATCAAAATTTTCAAACCACGACTCCTTGGTACTGCTTAATCATGTTTCTCACGCTTATTCGAAATGCTCCGATATTGGTGCTGCCTATCGTCTGTTTGATCA
AATGTACCAGAGAAACATATTTTCGTGGGCTGTCATAATTGTTGGATTGGCTGAGAATGGTTTGTTCCGCGATGGATTTGAGTTATTCTGCGAAATGCAGGGTCGAGGAA
TTTTCCCAGATCAGTTTGCTTATTCTGGTATCTTGCAGATATGTATTGGTTTGGAGTTCATTGAGTTGGGCAGAATGGTTCGTGCCCAGATTGTTGTTAGAGGCTTTGAA
TCTCATACTTGTGTGTCTACTTCTCTTCTTAATATGTATGCAAAGTTACAAATGGTTGAGGATTCATACAAGGTGTTTAAGACCATGACTGAAGTTAATATAGTCTCATG
GAATGCTATGATCTCAGGGTTCACAGATAATGGTCCTTACTTAGAGGCTTTTGATCATTTTCTCAGAATGAAGGGAGAAGGAGTAATACCTGATGTACAAACGTTGATTG
GTGTTGCAAAAAGCTATCGTGCTTGTTGTCGGGAAAGAAGGTTCATGCCAGGGCTAAAATTAGGATTGGAAGTGAATCATATAAGTATCTCCAATGCAGTGACTAATGCG
TATACTAAATGTGGATCGCTAGATGATGTAAGGAAGGTCATTTACTACAGGATGGAAGAAAGAGATTTAGTATCTTGGACCACCCTAGTGACGGCTTATTCTCAATGTTC
TGAATGGGATAAAGCAATAGAGATCTTCTCAAATATGGGAGAAGAAGGTTTTACACCAAATCAATTTGCCTTTTCAAGTGTGCTCGTTTCATGTGCGAGCCTTTGCTTAC
TCGATGGTCAGCAAGTCCACGGGTTCCTCTGCAAGGTTGGCTTGGACATGAACAAATGCATAGGAAGTGGTCTAATTAACATGTATGCCAAATGTGGCACTCTGGCTGAG
GCAAAGAAGGTTTTCGATAGAATCTCTAATGTTGATACAGTTTCATGGACCACTGTAACATCAGGGCATGCTCAATACAGTATTGTGGATGACGCCTTTGAACTCTTTAG
AAGGATGGAGCACCATGGAGGTCGGGTAGAGGAAGGCCTACAGTACTTCAAGCTAATGAAGGAAACCTATGGTTTGGTGCCAGAGATGGAGCATTATTCCTGTATTGTTG
ATCTCTTAAGTCGTGTGGGACATCTAAACGATGCAATGGAGTTTATAAAAGATGCCTTCTTTCAAAGCAGAAAACTCTGCTACCCATGTTCTTTTATCCAATACCTCGAA
TCAGGGATTTTCAAAGATGGACTTAATTTGCGGCATGTGATGAAAGAGCAGGGCATAAAAAAGGAACCAGGGTGTAGTTGGATCTCTGTGAATGGTACATTGCACAAGTT
TTATGCAGGAGATCAACAACATCCAGAAAAAGATAAAATTTATGCAAAGCTAGAAGAGTTGAGGTTGAAGGCCAATTCTTTAGGGGGATGTACCAGATTTGAGTTACGAG
CTGTAAGTTGTGGACCTCGGATAAGTTATATGGATACGGATGGAAGCCCCACGTCTCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCAAAATCAAAATTTTCAAACCACGACTCCTTGGTACTGCTTAATCATGTTTCTCACGCTTATTCGAAATGCTCCGATATTGGTGCTGCCTATCGTCTGTTTGATCA
AATGTACCAGAGAAACATATTTTCGTGGGCTGTCATAATTGTTGGATTGGCTGAGAATGGTTTGTTCCGCGATGGATTTGAGTTATTCTGCGAAATGCAGGGTCGAGGAA
TTTTCCCAGATCAGTTTGCTTATTCTGGTATCTTGCAGATATGTATTGGTTTGGAGTTCATTGAGTTGGGCAGAATGGTTCGTGCCCAGATTGTTGTTAGAGGCTTTGAA
TCTCATACTTGTGTGTCTACTTCTCTTCTTAATATGTATGCAAAGTTACAAATGGTTGAGGATTCATACAAGGTGTTTAAGACCATGACTGAAGTTAATATAGTCTCATG
GAATGCTATGATCTCAGGGTTCACAGATAATGGTCCTTACTTAGAGGCTTTTGATCATTTTCTCAGAATGAAGGGAGAAGGAGTAATACCTGATGTACAAACGTTGATTG
GTGTTGCAAAAAGCTATCGTGCTTGTTGTCGGGAAAGAAGGTTCATGCCAGGGCTAAAATTAGGATTGGAAGTGAATCATATAAGTATCTCCAATGCAGTGACTAATGCG
TATACTAAATGTGGATCGCTAGATGATGTAAGGAAGGTCATTTACTACAGGATGGAAGAAAGAGATTTAGTATCTTGGACCACCCTAGTGACGGCTTATTCTCAATGTTC
TGAATGGGATAAAGCAATAGAGATCTTCTCAAATATGGGAGAAGAAGGTTTTACACCAAATCAATTTGCCTTTTCAAGTGTGCTCGTTTCATGTGCGAGCCTTTGCTTAC
TCGATGGTCAGCAAGTCCACGGGTTCCTCTGCAAGGTTGGCTTGGACATGAACAAATGCATAGGAAGTGGTCTAATTAACATGTATGCCAAATGTGGCACTCTGGCTGAG
GCAAAGAAGGTTTTCGATAGAATCTCTAATGTTGATACAGTTTCATGGACCACTGTAACATCAGGGCATGCTCAATACAGTATTGTGGATGACGCCTTTGAACTCTTTAG
AAGGATGGAGCACCATGGAGGTCGGGTAGAGGAAGGCCTACAGTACTTCAAGCTAATGAAGGAAACCTATGGTTTGGTGCCAGAGATGGAGCATTATTCCTGTATTGTTG
ATCTCTTAAGTCGTGTGGGACATCTAAACGATGCAATGGAGTTTATAAAAGATGCCTTCTTTCAAAGCAGAAAACTCTGCTACCCATGTTCTTTTATCCAATACCTCGAA
TCAGGGATTTTCAAAGATGGACTTAATTTGCGGCATGTGATGAAAGAGCAGGGCATAAAAAAGGAACCAGGGTGTAGTTGGATCTCTGTGAATGGTACATTGCACAAGTT
TTATGCAGGAGATCAACAACATCCAGAAAAAGATAAAATTTATGCAAAGCTAGAAGAGTTGAGGTTGAAGGCCAATTCTTTAGGGGGATGTACCAGATTTGAGTTACGAG
CTGTAAGTTGTGGACCTCGGATAAGTTATATGGATACGGATGGAAGCCCCACGTCTCTTTGA
Protein sequenceShow/hide protein sequence
MSKSKFSNHDSLVLLNHVSHAYSKCSDIGAAYRLFDQMYQRNIFSWAVIIVGLAENGLFRDGFELFCEMQGRGIFPDQFAYSGILQICIGLEFIELGRMVRAQIVVRGFE
SHTCVSTSLLNMYAKLQMVEDSYKVFKTMTEVNIVSWNAMISGFTDNGPYLEAFDHFLRMKGEGVIPDVQTLIGVAKSYRACCRERRFMPGLKLGLEVNHISISNAVTNA
YTKCGSLDDVRKVIYYRMEERDLVSWTTLVTAYSQCSEWDKAIEIFSNMGEEGFTPNQFAFSSVLVSCASLCLLDGQQVHGFLCKVGLDMNKCIGSGLINMYAKCGTLAE
AKKVFDRISNVDTVSWTTVTSGHAQYSIVDDAFELFRRMEHHGGRVEEGLQYFKLMKETYGLVPEMEHYSCIVDLLSRVGHLNDAMEFIKDAFFQSRKLCYPCSFIQYLE
SGIFKDGLNLRHVMKEQGIKKEPGCSWISVNGTLHKFYAGDQQHPEKDKIYAKLEELRLKANSLGGCTRFELRAVSCGPRISYMDTDGSPTSL