; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh09G000620 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh09G000620
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionAMP-dependent synthetase and ligase family protein
Genome locationCmo_Chr09:274892..284237
RNA-Seq ExpressionCmoCh09G000620
SyntenyCmoCh09G000620
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000873 - AMP-dependent synthetase/ligase
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR020845 - AMP-binding, conserved site
IPR025110 - AMP-binding enzyme, C-terminal domain
IPR042099 - ANL, N-terminal domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6591227.1 Pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0081.15Show/hide
Query:  ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQ---------------------------GIEYLGNEAESTKFMQRQIVDALRVGDRSSASNL
        ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQ                           GIEYLGNEAESTKFMQRQIVDALRVGDRSSASNL
Subjt:  ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQ---------------------------GIEYLGNEAESTKFMQRQIVDALRVGDRSSASNL

Query:  LMELGQEKHSLTADNFVGILSYCARSPDPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISF
        LMELGQEKHSLTADNFVGILSYCARSPDPL                           FVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISF
Subjt:  LMELGQEKHSLTADNFVGILSYCARSPDPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISF

Query:  LAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI----------VTSY-------------MYLLGDLKSAH
        L ESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMVGKNEATYSELLK + C K +          V +Y                LGDLKSAH
Subjt:  LAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI----------VTSY-------------MYLLGDLKSAH

Query:  TALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRTLPSNYRSNFVMKVLRWSFND
        TALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRTLPSNYRSNFVMKVLRWSFND
Subjt:  TALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRTLPSNYRSNFVMKVLRWSFND

Query:  VICACALTRNCGLAEQLMQQFSISCVLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI--------------------------------
        VICACALTRNCGLAEQLMQQ                MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI                                
Subjt:  VICACALTRNCGLAEQLMQQFSISCVLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI--------------------------------

Query:  --------------------------DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMM
                                  DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMM
Subjt:  --------------------------DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMM

Query:  NLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSG
        NLLKALGAEGMTKELLQYLNVAENLFYYNNT LGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSG
Subjt:  NLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSG

Query:  FCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQV
        FCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEF VEKMKR+KIQPDPSTCHSVFSAYVSLGYHSTAMEALQV
Subjt:  FCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQV

Query:  LSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA
        LSMRMLCKE DTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA
Subjt:  LSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA

XP_022937086.1 pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita moschata]0.0e+0084.28Show/hide
Query:  ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP
        ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP
Subjt:  ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP

Query:  DPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST
        DPL                           FVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST
Subjt:  DPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST

Query:  VHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI----------VTSY-------------MYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPV
        VHVSQCLDIMDRRMVGKNEATYSELLK + C K +          V +Y                LGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPV
Subjt:  VHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI----------VTSY-------------MYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPV

Query:  PLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRTLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISCVL
        PLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRTLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQ       
Subjt:  PLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRTLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISCVL

Query:  TTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI----------------------------------------------------------D
                 MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI                                                          D
Subjt:  TTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI----------------------------------------------------------D

Query:  QPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFY
        QPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFY
Subjt:  QPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFY

Query:  YNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNL
        YNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNL
Subjt:  YNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNL

Query:  LDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAE
        LDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAE
Subjt:  LDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAE

Query:  DSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA
        DSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA
Subjt:  DSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA

XP_022937087.1 pentatricopeptide repeat-containing protein At1g76280 isoform X2 [Cucurbita moschata]0.0e+0083.25Show/hide
Query:  MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLL
        MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPL                           FVMETWKIMEERGVFLDNTCTLL
Subjt:  MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLL

Query:  MIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI----------VTSY----
        MIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMVGKNEATYSELLK + C K +          V +Y    
Subjt:  MIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI----------VTSY----

Query:  ---------MYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRT
                    LGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRT
Subjt:  ---------MYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRT

Query:  LPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISCVLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI------------
        LPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQ                MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI            
Subjt:  LPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISCVLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI------------

Query:  ----------------------------------------------DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRI
                                                      DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRI
Subjt:  ----------------------------------------------DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRI

Query:  RMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCC
        RMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCC
Subjt:  RMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCC

Query:  SVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHS
        SVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHS
Subjt:  SVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHS

Query:  VFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSY
        VFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSY
Subjt:  VFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSY

Query:  DDGYTA
        DDGYTA
Subjt:  DDGYTA

XP_022976056.1 pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita maxima]0.0e+0082.46Show/hide
Query:  ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP
        ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP
Subjt:  ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP

Query:  DPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST
        DPL                           FVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST
Subjt:  DPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST

Query:  VHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI------------------------VTSYMYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIP
        VHVSQCLD+MDRRMVGKNEATYSELLK +   K +                        + SY   LGDLKSA+TALQKMV LVIGAAGQKL SLELDIP
Subjt:  VHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI------------------------VTSYMYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIP

Query:  VPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGR-TLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISC
        VPLRTEFYHDNFNFEENGPSTDE+YCKK+VPCEGDI QFSVNGMKCGEVESGR TLPSNYRSNFVMKVLRWSFNDVICACA TRNCGLAEQLMQQ     
Subjt:  VPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGR-TLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISC

Query:  VLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI---------------------------------------------------------
                   MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI                                                         
Subjt:  VLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI---------------------------------------------------------

Query:  -DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENL
         DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENL
Subjt:  -DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENL

Query:  FYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDAL
        FYYNNT LGTPIYNTALHFLVESKEIHMA ELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDAL
Subjt:  FYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDAL

Query:  NLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVL
        NLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKR+KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKE DTSPVVTEYVEDFVL
Subjt:  NLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVL

Query:  AEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA
        AEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSS NQSPWAMRLASSYDDGYTA
Subjt:  AEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA

XP_023536089.1 pentatricopeptide repeat-containing protein At1g76280 [Cucurbita pepo subsp. pepo]0.0e+0082.23Show/hide
Query:  ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP
        AR RLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP
Subjt:  ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP

Query:  DPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST
        DPL                           FVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST
Subjt:  DPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST

Query:  VHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI------------------------VTSYMYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIP
        VHVSQCLD+MDRRMVGKNEATYSELLK + C K +                        + SY   LGDLKSA+TALQKMVALVIGAAGQKLPSLELDIP
Subjt:  VHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI------------------------VTSYMYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIP

Query:  VPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGR-TLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISC
        VPLRTE YH+NFNFEENGPSTDE+YCKKMVPCEGDI QFSVNGMKCGEVESGR TLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQ     
Subjt:  VPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGR-TLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISC

Query:  VLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI---------------------------------------------------------
                   MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI                                                         
Subjt:  VLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI---------------------------------------------------------

Query:  -DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENL
         DQPERAMRML KMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENL
Subjt:  -DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENL

Query:  FYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDAL
        FYY+NT LGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMI+CCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDAL
Subjt:  FYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDAL

Query:  NLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVL
        NLLDQASSEGIELDVVIMNTIVQKACEK   DVIEFVVEKMKR+KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKE DTSPVVTEYVEDFVL
Subjt:  NLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVL

Query:  AEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA
        AEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA
Subjt:  AEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA

TrEMBL top hitse value%identityAlignment
A0A6J1F9C6 pentatricopeptide repeat-containing protein At1g76280 isoform X10.0e+0084.28Show/hide
Query:  ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP
        ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP
Subjt:  ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP

Query:  DPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST
        DPL                           FVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST
Subjt:  DPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST

Query:  VHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI----------VTSY-------------MYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPV
        VHVSQCLDIMDRRMVGKNEATYSELLK + C K +          V +Y                LGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPV
Subjt:  VHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI----------VTSY-------------MYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPV

Query:  PLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRTLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISCVL
        PLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRTLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQ       
Subjt:  PLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRTLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISCVL

Query:  TTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI----------------------------------------------------------D
                 MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI                                                          D
Subjt:  TTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI----------------------------------------------------------D

Query:  QPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFY
        QPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFY
Subjt:  QPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFY

Query:  YNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNL
        YNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNL
Subjt:  YNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNL

Query:  LDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAE
        LDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAE
Subjt:  LDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAE

Query:  DSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA
        DSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA
Subjt:  DSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA

A0A6J1FA55 pentatricopeptide repeat-containing protein At1g76280 isoform X20.0e+0083.25Show/hide
Query:  MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLL
        MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPL                           FVMETWKIMEERGVFLDNTCTLL
Subjt:  MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLL

Query:  MIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI----------VTSY----
        MIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMVGKNEATYSELLK + C K +          V +Y    
Subjt:  MIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI----------VTSY----

Query:  ---------MYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRT
                    LGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRT
Subjt:  ---------MYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGRT

Query:  LPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISCVLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI------------
        LPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQ                MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI            
Subjt:  LPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISCVLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI------------

Query:  ----------------------------------------------DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRI
                                                      DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRI
Subjt:  ----------------------------------------------DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRI

Query:  RMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCC
        RMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCC
Subjt:  RMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCC

Query:  SVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHS
        SVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHS
Subjt:  SVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHS

Query:  VFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSY
        VFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSY
Subjt:  VFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSY

Query:  DDGYTA
        DDGYTA
Subjt:  DDGYTA

A0A6J1FFJ8 probable acyl-activating enzyme 1, peroxisomal0.0e+00100Show/hide
Query:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA
        MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA
Subjt:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA

Query:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSGTTS
        AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSGTTS
Subjt:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSGTTS

Query:  RPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSEQKPL
        RPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSEQKPL
Subjt:  RPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSEQKPL

Query:  PGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEVMVRG
        PGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEVMVRG
Subjt:  PGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEVMVRG

Query:  NTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGCSASE
        NTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGCSASE
Subjt:  NTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGCSASE

Query:  EDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPKQTSKL
        EDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPKQTSKL
Subjt:  EDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPKQTSKL

A0A6J1IEQ4 pentatricopeptide repeat-containing protein At1g76280 isoform X20.0e+0081.31Show/hide
Query:  MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLL
        MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPL                           FVMETWKIMEERGVFLDNTCTLL
Subjt:  MQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLL

Query:  MIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI------------------
        MIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLD+MDRRMVGKNEATYSELLK +   K +                  
Subjt:  MIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI------------------

Query:  ------VTSYMYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGR
              + SY   LGDLKSA+TALQKMV LVIGAAGQKL SLELDIPVPLRTEFYHDNFNFEENGPSTDE+YCKK+VPCEGDI QFSVNGMKCGEVESGR
Subjt:  ------VTSYMYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGR

Query:  -TLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISCVLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI----------
         TLPSNYRSNFVMKVLRWSFNDVICACA TRNCGLAEQLMQQ                MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI          
Subjt:  -TLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISCVLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI----------

Query:  ------------------------------------------------DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAK
                                                        DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAK
Subjt:  ------------------------------------------------DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAK

Query:  RIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMID
        RIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNT LGTPIYNTALHFLVESKEIHMA ELFNNMKHSGLFPDAATFEMMID
Subjt:  RIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMID

Query:  CCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTC
        CCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKR+KIQPDPSTC
Subjt:  CCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTC

Query:  HSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLAS
        HSVFSAYVSLGYHSTAMEALQVLSMRMLCKE DTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSS NQSPWAMRLAS
Subjt:  HSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLAS

Query:  SYDDGYTA
        SYDDGYTA
Subjt:  SYDDGYTA

A0A6J1IFV2 pentatricopeptide repeat-containing protein At1g76280 isoform X10.0e+0082.46Show/hide
Query:  ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP
        ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP
Subjt:  ARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSP

Query:  DPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST
        DPL                           FVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST
Subjt:  DPLVVLAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQST

Query:  VHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI------------------------VTSYMYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIP
        VHVSQCLD+MDRRMVGKNEATYSELLK +   K +                        + SY   LGDLKSA+TALQKMV LVIGAAGQKL SLELDIP
Subjt:  VHVSQCLDIMDRRMVGKNEATYSELLKESFCYKGI------------------------VTSYMYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIP

Query:  VPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGR-TLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISC
        VPLRTEFYHDNFNFEENGPSTDE+YCKK+VPCEGDI QFSVNGMKCGEVESGR TLPSNYRSNFVMKVLRWSFNDVICACA TRNCGLAEQLMQQ     
Subjt:  VPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEVESGR-TLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISC

Query:  VLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI---------------------------------------------------------
                   MHELGLQPSSHTFDGFVRSVVSERGFSDGIKI                                                         
Subjt:  VLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKI---------------------------------------------------------

Query:  -DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENL
         DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENL
Subjt:  -DQPERAMRMLAKMKQMEVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENL

Query:  FYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDAL
        FYYNNT LGTPIYNTALHFLVESKEIHMA ELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDAL
Subjt:  FYYNNTWLGTPIYNTALHFLVESKEIHMAIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDAL

Query:  NLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVL
        NLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKR+KIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKE DTSPVVTEYVEDFVL
Subjt:  NLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVVEKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVL

Query:  AEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA
        AEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSS NQSPWAMRLASSYDDGYTA
Subjt:  AEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWAMRLASSYDDGYTA

SwissProt top hitse value%identityAlignment
F4HUK6 Butanoate--CoA ligase AAE12.6e-22666.24Show/hide
Query:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA
        M+G  +  ANY+PLTPI+FL+RSA VY DR+S+VYG V+YTWR+T  RC R+ASAL ++GI+ GDVV+ LAPN+PAM ELHF VPMAGA+LC+LN+RHD+
Subjt:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA

Query:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSGTTS
        ++V+ LL HS  K+I  D+QF  I  GA + +S + +K+P++V+I E       + R+  +   +E+E ++  GK DF++ RP DE D I++NYTSGTTS
Subjt:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSGTTS

Query:  RPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSEQKPL
         PKGV+YSHRGAYLN+L+AVLLN+M S P YLW  PMFHCNGW L WGV A  GTNIC RN +AK IFDNIS HKVTHMGGAPT+LNMIINAP SEQKPL
Subjt:  RPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSEQKPL

Query:  PGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEVMVRG
        PGKV+ +TG APPP+HV+FKM  +GF + H+YGLTETYGP T+C+WKPEWDSLP+++QAK+ +RQG+ H+GLEE+ +KDPVTM + PADG T+GEV+ RG
Subjt:  PGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEVMVRG

Query:  NTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGCSASE
        NTVM+GYLK+ +AT+EAF GGWF SGDLGV+HPDGYIELKDRSKDIIISGGENISSIEVES LF+HP VLEAAVV RPD++WGET CAFVKLKDG  AS 
Subjt:  NTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGCSASE

Query:  EDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPKQ
        E++I +CR+ LPHYMAPRS+VF+DLPKTSTGK QKF+L+ +AKA+ SL K+
Subjt:  EDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPKQ

M4IRL4 Isovalerate--CoA ligase CCL21.6e-19961.59Show/hide
Query:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASAL-VRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHD
        M+G+ +CSAN++PL+PITFLERS+  Y D  SLVYG V+YTW +T  RC +LASAL   +GI+ GDVVA  + N+P +YELHFAVPMAG +LC+LN R+D
Subjt:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASAL-VRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHD

Query:  AAMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSGTT
        +AMVSTLL+HSEAK+I V+ Q       A+  ++++  K P +V++ +            ++S Y  +  LL +G  DF+IRRPK+E DPI++NYTSGTT
Subjt:  AAMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSGTT

Query:  SRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSE-QK
        +RPK V+YSHRGAYLN+++ VLL+ M +  VYLW VPMFHCNGW   WG AAQ  TNIC R  S K IFDNI LHKVTH G APTVLNMI+N+P      
Subjt:  SRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSE-QK

Query:  PLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEVMV
        PLP KV +MTGG+PPP  V+ +M  MGF + H YGLTET GPA  C  KPEWD+L  +++  L +RQGL H+ +EE+D++DPVTMES  ADG TIGEVM 
Subjt:  PLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEVMV

Query:  RGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGC--
        RGNTVMSGY KDLKAT EAF GGWFRSGDLGV+H DGYI+LKDR KD++ISGGENIS++EVE+VL+SH +VLEAAVV RPD  WGETPCAFV LK+G   
Subjt:  RGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGC--

Query:  SASEEDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSL
          S + IIKFCR+ LPHYMAP++VVF++LPKTSTGK QK+ILKE+A AMGSL
Subjt:  SASEEDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSL

M4IS92 Probable CoA ligase CCL132.5e-20061.59Show/hide
Query:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASAL-VRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHD
        M+G+ +CSAN++PL+PITFLERS+  Y D  SLVYG V+YTW +T  RC +LASAL    GI+ GDVVA  + NIP +YELHFAVPMAG +LC+LN R+D
Subjt:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASAL-VRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHD

Query:  AAMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSGTT
        +AMVSTLL+HSEAK+I V+ Q       A+  ++++  K P +V++ +            ++S Y  +  LL +G  DF+IRRPK+E DPI++NYTSGTT
Subjt:  AAMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSGTT

Query:  SRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSE-QK
        +RPK V+YSHRGAYLN+++ VLL+ M +  VYLW VPMFHCNGW   WG AAQ  TNIC R  S K IFDNI LHKVTH G APTVLNMI+N+P      
Subjt:  SRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSE-QK

Query:  PLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEVMV
        PLP KV +MTGG+PPP  V+ +M  MGF + H YGLTET+GPAT C  KPEWD+L  +++  L +RQGL H+ +EE+D++DPV+MES  ADG TIGEVM 
Subjt:  PLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEVMV

Query:  RGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGC--
        RGNTVMSGY KDLKAT EAF GGWFR+GDLGV+H DGYI+LKDR KD++ISGGEN+S++EVE+VL+SH +VLEAAVV RPD  WGETPCAFV LK+G   
Subjt:  RGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGC--

Query:  SASEEDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSL
          S + IIKFCR+ LPHYMAP++VVF++LPKTSTGK QK+ILKE+AKAMGSL
Subjt:  SASEEDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSL

Q8VZF1 Acetate/butyrate--CoA ligase AAE7, peroxisomal7.8e-17052.42Show/hide
Query:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA
        +D + +  ANY  LTP+ FL+R+A V+  R S+++G  +YTWR+T  RC RLASAL    I  G  VA +APNIPAMYE HF VPM GAVL  +N+R +A
Subjt:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA

Query:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERRE---KIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSG
          V+ LLSHS++ +I+VD +F  +   +++ M E+     K PL+++I ++   P  ++R   + G +E+E  L +G  ++  + P DE   IAL YTSG
Subjt:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERRE---KIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSG

Query:  TTSRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSEQ
        TT+ PKGV+  HRGAY+ ALS  L+  M    VYLW +PMFHCNGW   W +A  SGT+IC R  +AKE++  I+ +KVTH   AP VLN I+NAP  + 
Subjt:  TTSRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSEQ

Query:  -KPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEV
          PLP  V +MT GA PP  VLF M   GF + H YGL+ETYGP+TVC+WKPEWDSLP + QAKL++RQG+++ G+E++D+ D  T +  PADGKT GE+
Subjt:  -KPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEV

Query:  MVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGC
        + RGN VM GYLK+ +A +E F GGWF SGD+ V+HPD YIE+KDRSKD+IISGGENISS+EVE+V++ HP+VLEA+VV RPD+ W E+PCAFV LK   
Subjt:  MVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGC

Query:  SASE-----EDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPK
           +     +DI+KFCRE LP Y  P+SVVF  LPKT+TGK QK IL+ +AK MG +P+
Subjt:  SASE-----EDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPK

Q9SEY5 Isovalerate--CoA ligase AAE21.8e-20662.77Show/hide
Query:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA
        ++G+ R  AN+ PL+PITFLERSA VY DR SLV+G V++TW +T  RC RLASAL  +GI+RGDVVAALAPN+PAM+ELHFAVPMAG +LC LN R D 
Subjt:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA

Query:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMS---ERREKIPLVVIINEYDQPPSHIDREGS-ASGY---LEFESLLESGKLDFKIRRPKDELDPIALN
        + +S LL+HSEAKI+ VD+Q   I  GA+  ++   + R+ + LV+I    D   S  D   + AS Y    E+E+LL+SG  +F+I +P+ E DPI++N
Subjt:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMS---ERREKIPLVVIINEYDQPPSHIDREGS-ASGY---LEFESLLESGKLDFKIRRPKDELDPIALN

Query:  YTSGTTSRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAP
        YTSGTTSRPKGV+YSHRGAYLN+L+ V L+ M   PVYLW VPMFHCNGW L WGVAAQ GTNIC R  S K IF NI++HKVTHMGGAPTVLNMI+N  
Subjt:  YTSGTTSRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAP

Query:  TSEQKPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTI
         +E KPLP +V +MTGG+PP   +L KM  +GF + H YGLTETYGP T C WKPEWDSL  +++ KL +RQG+QH+GLE +D+KDP+TME+ P DG T+
Subjt:  TSEQKPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTI

Query:  GEVMVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLK
        GEVM RGNTVMSGY KD++ATR+AF G WF SGDL V++PDGYIE+KDR KD+IISGGENISS+EVE VL SH +VLEAAVV RPD HWG+TPC FVKLK
Subjt:  GEVMVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLK

Query:  DGC-SASEEDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSL
        +G  +   E+II FCR+HLPHYMAP+++VF D+PKTSTGK QK++L+++A  MGSL
Subjt:  DGC-SASEEDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSL

Arabidopsis top hitse value%identityAlignment
AT1G20560.1 acyl activating enzyme 11.9e-22766.24Show/hide
Query:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA
        M+G  +  ANY+PLTPI+FL+RSA VY DR+S+VYG V+YTWR+T  RC R+ASAL ++GI+ GDVV+ LAPN+PAM ELHF VPMAGA+LC+LN+RHD+
Subjt:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA

Query:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSGTTS
        ++V+ LL HS  K+I  D+QF  I  GA + +S + +K+P++V+I E       + R+  +   +E+E ++  GK DF++ RP DE D I++NYTSGTTS
Subjt:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSGTTS

Query:  RPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSEQKPL
         PKGV+YSHRGAYLN+L+AVLLN+M S P YLW  PMFHCNGW L WGV A  GTNIC RN +AK IFDNIS HKVTHMGGAPT+LNMIINAP SEQKPL
Subjt:  RPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSEQKPL

Query:  PGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEVMVRG
        PGKV+ +TG APPP+HV+FKM  +GF + H+YGLTETYGP T+C+WKPEWDSLP+++QAK+ +RQG+ H+GLEE+ +KDPVTM + PADG T+GEV+ RG
Subjt:  PGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEVMVRG

Query:  NTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGCSASE
        NTVM+GYLK+ +AT+EAF GGWF SGDLGV+HPDGYIELKDRSKDIIISGGENISSIEVES LF+HP VLEAAVV RPD++WGET CAFVKLKDG  AS 
Subjt:  NTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGCSASE

Query:  EDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPKQ
        E++I +CR+ LPHYMAPRS+VF+DLPKTSTGK QKF+L+ +AKA+ SL K+
Subjt:  EDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPKQ

AT1G20560.2 acyl activating enzyme 11.7e-19666.74Show/hide
Query:  MYELHFAVPMAGAVLCSLNVRHDAAMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKL
        M ELHF VPMAGA+LC+LN+RHD+++V+ LL HS  K+I  D+QF  I  GA + +S + +K+P++V+I E       + R+  +   +E+E ++  GK 
Subjt:  MYELHFAVPMAGAVLCSLNVRHDAAMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKL

Query:  DFKIRRPKDELDPIALNYTSGTTSRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKV
        DF++ RP DE D I++NYTSGTTS PKGV+YSHRGAYLN+L+AVLLN+M S P YLW  PMFHCNGW L WGV A  GTNIC RN +AK IFDNIS HKV
Subjt:  DFKIRRPKDELDPIALNYTSGTTSRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKV

Query:  THMGGAPTVLNMIINAPTSEQKPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVD
        THMGGAPT+LNMIINAP SEQKPLPGKV+ +TG APPP+HV+FKM  +GF + H+YGLTETYGP T+C+WKPEWDSLP+++QAK+ +RQG+ H+GLEE+ 
Subjt:  THMGGAPTVLNMIINAPTSEQKPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVD

Query:  IKDPVTMESAPADGKTIGEVMVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVG
        +KDPVTM + PADG T+GEV+ RGNTVM+GYLK+ +AT+EAF GGWF SGDLGV+HPDGYIELKDRSKDIIISGGENISSIEVES LF+HP VLEAAVV 
Subjt:  IKDPVTMESAPADGKTIGEVMVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVG

Query:  RPDDHWGETPCAFVKLKDGCSASEEDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPKQ
        RPD++WGET CAFVKLKDG  AS E++I +CR+ LPHYMAPRS+VF+DLPKTSTGK QKF+L+ +AKA+ SL K+
Subjt:  RPDDHWGETPCAFVKLKDGCSASEEDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPKQ

AT1G66120.1 AMP-dependent synthetase and ligase family protein6.6e-16449.64Show/hide
Query:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA
        MD +  C AN +PLTPITFL+R++  Y +R S++YG+ ++TW +T  RC RLA++L+ + I R DVV+ LAPN+PAMYE+HF+VPM GAVL  +N R DA
Subjt:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA

Query:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAM-SERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGK----LDFKIRRPKDELDPIALNYT
          ++ +L H+E KI+ VDY+F  ++   ++ + + + +  P +++INE D       +E      L++E L+  G+        + R  +E DPI+LNYT
Subjt:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAM-SERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGK----LDFKIRRPKDELDPIALNYT

Query:  SGTTSRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTS
        SGTT+ PKGV+ SH+GAYL+ALS+++  +M   PVYLW +PMFHCNGW+ TW VAA+ GTN+C R+ +A EI+ NI LH VTHM   PTV   ++    +
Subjt:  SGTTSRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTS

Query:  EQKPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGE
        +Q P    V ++TGG+ PP+ ++ K+  +GF ++H YGLTE  GP   C W+ EW+ LP+ +Q +L  RQG++++ L +VD+K+  T+ES P DGKT+GE
Subjt:  EQKPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGE

Query:  VMVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLK--
        ++++G+++M GYLK+ KAT EAF  GW  +GD+GV HPDGY+E+KDRSKDIIISGGENISSIEVE VL+ +  VLEAAVV  P   WGETPCAFV LK  
Subjt:  VMVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLK--

Query:  -DGCSASEEDIIKFCREHLPHYMAPRSVV-FKDLPKTSTGKTQKFILKEEAKAM
         +G   SE D+IK+CRE++PH+M P+ VV F++LPK S GK  K  L++ AKA+
Subjt:  -DGCSASEEDIIKFCREHLPHYMAPRSVV-FKDLPKTSTGKTQKFILKEEAKAM

AT2G17650.1 AMP-dependent synthetase and ligase family protein1.3e-20762.77Show/hide
Query:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA
        ++G+ R  AN+ PL+PITFLERSA VY DR SLV+G V++TW +T  RC RLASAL  +GI+RGDVVAALAPN+PAM+ELHFAVPMAG +LC LN R D 
Subjt:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA

Query:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMS---ERREKIPLVVIINEYDQPPSHIDREGS-ASGY---LEFESLLESGKLDFKIRRPKDELDPIALN
        + +S LL+HSEAKI+ VD+Q   I  GA+  ++   + R+ + LV+I    D   S  D   + AS Y    E+E+LL+SG  +F+I +P+ E DPI++N
Subjt:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMS---ERREKIPLVVIINEYDQPPSHIDREGS-ASGY---LEFESLLESGKLDFKIRRPKDELDPIALN

Query:  YTSGTTSRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAP
        YTSGTTSRPKGV+YSHRGAYLN+L+ V L+ M   PVYLW VPMFHCNGW L WGVAAQ GTNIC R  S K IF NI++HKVTHMGGAPTVLNMI+N  
Subjt:  YTSGTTSRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAP

Query:  TSEQKPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTI
         +E KPLP +V +MTGG+PP   +L KM  +GF + H YGLTETYGP T C WKPEWDSL  +++ KL +RQG+QH+GLE +D+KDP+TME+ P DG T+
Subjt:  TSEQKPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTI

Query:  GEVMVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLK
        GEVM RGNTVMSGY KD++ATR+AF G WF SGDL V++PDGYIE+KDR KD+IISGGENISS+EVE VL SH +VLEAAVV RPD HWG+TPC FVKLK
Subjt:  GEVMVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLK

Query:  DGC-SASEEDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSL
        +G  +   E+II FCR+HLPHYMAP+++VF D+PKTSTGK QK++L+++A  MGSL
Subjt:  DGC-SASEEDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSL

AT3G16910.1 acyl-activating enzyme 75.6e-17152.42Show/hide
Query:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA
        +D + +  ANY  LTP+ FL+R+A V+  R S+++G  +YTWR+T  RC RLASAL    I  G  VA +APNIPAMYE HF VPM GAVL  +N+R +A
Subjt:  MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDA

Query:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERRE---KIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSG
          V+ LLSHS++ +I+VD +F  +   +++ M E+     K PL+++I ++   P  ++R   + G +E+E  L +G  ++  + P DE   IAL YTSG
Subjt:  AMVSTLLSHSEAKIIVVDYQFEHIVMGAIKAMSERRE---KIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSG

Query:  TTSRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSEQ
        TT+ PKGV+  HRGAY+ ALS  L+  M    VYLW +PMFHCNGW   W +A  SGT+IC R  +AKE++  I+ +KVTH   AP VLN I+NAP  + 
Subjt:  TTSRPKGVIYSHRGAYLNALSAVLLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSEQ

Query:  -KPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEV
          PLP  V +MT GA PP  VLF M   GF + H YGL+ETYGP+TVC+WKPEWDSLP + QAKL++RQG+++ G+E++D+ D  T +  PADGKT GE+
Subjt:  -KPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVHAYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEV

Query:  MVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGC
        + RGN VM GYLK+ +A +E F GGWF SGD+ V+HPD YIE+KDRSKD+IISGGENISS+EVE+V++ HP+VLEA+VV RPD+ W E+PCAFV LK   
Subjt:  MVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGC

Query:  SASE-----EDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPK
           +     +DI+KFCRE LP Y  P+SVVF  LPKT+TGK QK IL+ +AK MG +P+
Subjt:  SASE-----EDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGAATGGACCGCTGCTCTGCCAATTACCTTCCTCTCACTCCAATCACCTTCTTGGAGCGCTCCGCCGCGGTTTACGGCGACCGGATTTCTCTCGTCTATGGACG
TGTTCAATACACTTGGAGAGAGACGCTTTTGCGATGTACCAGGCTCGCTTCTGCTCTTGTTCGAATGGGAATCGCTCGTGGAGATGTGGTTGCTGCCTTGGCACCGAATA
TTCCAGCTATGTACGAGCTTCATTTTGCTGTACCGATGGCTGGTGCAGTTCTTTGCTCCCTCAACGTACGCCATGATGCGGCAATGGTTTCTACACTGTTGAGCCATTCG
GAAGCTAAAATCATTGTTGTAGATTACCAGTTTGAACATATTGTAATGGGAGCAATTAAAGCTATGTCCGAAAGGAGAGAAAAGATACCTCTTGTTGTCATTATTAATGA
ATATGATCAGCCACCTTCCCACATCGACAGAGAGGGCTCTGCTTCAGGATATCTAGAGTTTGAGAGCCTTTTAGAGTCTGGAAAACTCGATTTCAAGATCAGGCGGCCTA
AAGATGAATTGGATCCAATTGCTCTTAATTATACTTCTGGCACAACATCAAGGCCAAAAGGTGTTATTTACTCCCATAGAGGTGCATATCTCAATGCTCTTTCTGCAGTT
CTTCTGAATGATATGTGCTCACTCCCTGTGTATCTGTGGGTTGTTCCAATGTTCCACTGCAATGGATGGAGCTTAACTTGGGGCGTGGCTGCACAGAGCGGCACAAACAT
CTGCCAGAGAAACGCGAGTGCTAAAGAAATTTTTGACAATATATCTCTGCATAAGGTTACTCATATGGGCGGCGCACCTACTGTCTTAAACATGATTATCAATGCACCAA
CAAGCGAACAGAAGCCCCTTCCAGGGAAGGTAACCATGATGACTGGTGGCGCTCCGCCGCCTTCCCATGTTCTTTTTAAGATGAGAGCAATGGGATTCCTGATTGTCCAT
GCATATGGTTTGACCGAAACTTATGGGCCAGCAACAGTTTGCTCTTGGAAACCAGAATGGGATTCACTCCCTCAAGATAAACAGGCAAAGCTAAGTTCTCGCCAAGGATT
GCAGCATGTTGGGCTAGAGGAAGTGGACATAAAGGATCCTGTCACCATGGAGAGTGCTCCAGCCGATGGAAAAACCATTGGTGAAGTTATGGTCAGAGGCAACACTGTGA
TGAGTGGATACTTAAAAGATCTCAAAGCCACACGGGAAGCCTTCAATGGCGGATGGTTTCGAAGTGGGGACTTGGGGGTGAGGCACCCTGATGGTTACATAGAACTGAAG
GACCGTTCTAAGGACATAATCATTTCTGGGGGAGAGAATATTAGCTCGATTGAGGTGGAGTCTGTTCTTTTCAGCCATCCATCAGTTCTTGAAGCTGCTGTTGTGGGAAG
ACCTGATGACCACTGGGGTGAAACACCATGTGCATTTGTGAAGCTCAAAGATGGGTGCAGTGCGAGTGAAGAAGATATTATAAAATTCTGCAGAGAACACCTGCCTCACT
ACATGGCTCCGCGAAGTGTGGTGTTTAAAGATTTGCCAAAAACTTCGACAGGGAAAACTCAGAAGTTTATTCTCAAGGAGGAGGCCAAGGCCATGGGTAGCCTTCCAAAA
CAGACCAGCAAACTGGCAAGGCTTCGTTTGGGATCCATAGCTGACTCGCTATACAGATTCAGGCCACACGAACATGGGCGAAAACAGGATGCGAATAAGATGGTATTCCG
TCGAGCTCTTCTCATCTCCCAAGGTATTGAATATTTGGGAAATGAAGCCGAGTCAACTAAGTTCATGCAGAGACAGATTGTCGATGCACTTCGAGTGGGTGATAGAAGTA
GTGCTTCCAATCTGCTCATGGAACTTGGCCAGGAAAAACACTCCTTAACTGCAGATAATTTTGTTGGCATTTTGAGCTACTGTGCAAGATCACCTGATCCACTGGTAGTT
CTTGCGTTCTTTGTTATCCTCTTGAATTTTGATTTAGTCGTCACCTTTTATGGCTACAATTTTCACTCTCCTCGATTTGTCATGGAAACTTGGAAAATAATGGAAGAAAG
AGGAGTTTTCCTGGATAACACATGCACTTTACTTATGATAAAAGCACTCTGTAAGGGCGGTTACTTGGATGAGGCATTTGGTCTAATAAGTTTCCTGGCAGAAAGTCGTG
TCATGTTTCCTGTTCTGCCTGTGTACAATCTTTTCTTGAGAGCCTGTGGCAAAAGGCAAAGTACGGTTCATGTTAGTCAATGTTTGGATATAATGGATCGTAGAATGGTC
GGGAAGAATGAAGCTACATATTCTGAGCTACTCAAGGAAAGTTTCTGCTATAAGGGTATTGTTACATCATATATGTACCTGCTGGGAGACCTAAAATCTGCACATACTGC
ACTGCAAAAGATGGTGGCTTTGGTTATTGGAGCCGCAGGACAAAAGTTACCCTCTTTAGAATTGGACATTCCTGTACCCTTAAGAACTGAATTCTATCATGACAATTTTA
ATTTCGAGGAAAATGGACCTTCTACCGACGAGGTATACTGTAAGAAAATGGTCCCCTGCGAAGGTGACATAGAGCAATTTTCTGTTAATGGTATGAAGTGTGGAGAAGTT
GAAAGTGGTCGAACTTTGCCAAGCAATTACAGAAGCAATTTTGTTATGAAGGTTTTGAGGTGGTCTTTCAATGATGTGATATGTGCATGTGCGCTTACTAGGAACTGTGG
TCTTGCAGAGCAGTTAATGCAACAGTTCAGCATTTCATGTGTACTGACGACACTAGGGTTCGTTTCTTTGAAGATGCATGAACTGGGATTGCAACCTTCGTCCCACACAT
TTGATGGTTTTGTTAGATCAGTTGTTTCAGAGAGAGGTTTCAGTGATGGCATTAAAATAGATCAGCCAGAACGTGCCATGCGTATGTTGGCTAAAATGAAACAAATGGAG
GTGCTTCCAGATGTGAAGACCTATGAGCTTTTATATTCATTATTTGGTAACGTGAATGCTCCATACGAGGAGGGCAACAGATTGTCACAGGTGGATGCTGCAAAAAGGAT
ACGCATGATAGAGATGGATATGGAGAAACATGGGATCCAACACAGTCATTTCTCTATGATGAACTTGTTGAAAGCTCTTGGTGCGGAGGGGATGACGAAGGAGCTTCTTC
AGTATTTAAATGTGGCAGAGAACCTCTTCTATTACAATAACACTTGGCTGGGGACGCCTATTTACAACACAGCGTTGCATTTTTTAGTTGAATCCAAAGAGATCCATATG
GCAATAGAATTATTCAATAATATGAAGCATTCTGGTCTCTTTCCAGATGCTGCGACATTTGAGATGATGATCGACTGTTGTAGTGTTATCGGATGCTTGAAATCTGCGTT
TGCTCTTCTCTCCCTGATGATTCGCTCCGGGTTTTGTCCTCAGATATTAACTTATACCAGTCTAGTAAAGATTGTGCTGGGATTTGAGAGATTTGATGATGCCTTGAATC
TCTTAGATCAAGCCAGTTCAGAAGGGATTGAACTTGATGTAGTTATAATGAATACAATCGTGCAGAAAGCTTGTGAAAAGGGAAGGATTGATGTGATTGAGTTTGTCGTT
GAGAAGATGAAGCGCAAAAAGATCCAACCCGACCCTTCAACGTGCCATAGTGTCTTCTCGGCATATGTGAGCCTTGGCTATCACAGCACCGCCATGGAAGCGCTGCAAGT
ACTGAGCATGCGTATGCTATGCAAAGAACAGGACACTTCTCCAGTCGTTACAGAATATGTCGAAGACTTTGTGCTTGCAGAAGACTCCGAAGCGGAATCACGGATTTTGG
AATTCTTCAAATGCTCTGAAGAGAGCCTAAGTTTTGCCCTTCTCAACTTGAGATGGTCTGCCATGCTGGGATATTCCCTTTGTTCCTCCCCTAATCAGAGTCCATGGGCA
ATGAGACTTGCAAGTTCCTATGATGATGGCTACACAGCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGACGGAATGGACCGCTGCTCTGCCAATTACCTTCCTCTCACTCCAATCACCTTCTTGGAGCGCTCCGCCGCGGTTTACGGCGACCGGATTTCTCTCGTCTATGGACG
TGTTCAATACACTTGGAGAGAGACGCTTTTGCGATGTACCAGGCTCGCTTCTGCTCTTGTTCGAATGGGAATCGCTCGTGGAGATGTGGTTGCTGCCTTGGCACCGAATA
TTCCAGCTATGTACGAGCTTCATTTTGCTGTACCGATGGCTGGTGCAGTTCTTTGCTCCCTCAACGTACGCCATGATGCGGCAATGGTTTCTACACTGTTGAGCCATTCG
GAAGCTAAAATCATTGTTGTAGATTACCAGTTTGAACATATTGTAATGGGAGCAATTAAAGCTATGTCCGAAAGGAGAGAAAAGATACCTCTTGTTGTCATTATTAATGA
ATATGATCAGCCACCTTCCCACATCGACAGAGAGGGCTCTGCTTCAGGATATCTAGAGTTTGAGAGCCTTTTAGAGTCTGGAAAACTCGATTTCAAGATCAGGCGGCCTA
AAGATGAATTGGATCCAATTGCTCTTAATTATACTTCTGGCACAACATCAAGGCCAAAAGGTGTTATTTACTCCCATAGAGGTGCATATCTCAATGCTCTTTCTGCAGTT
CTTCTGAATGATATGTGCTCACTCCCTGTGTATCTGTGGGTTGTTCCAATGTTCCACTGCAATGGATGGAGCTTAACTTGGGGCGTGGCTGCACAGAGCGGCACAAACAT
CTGCCAGAGAAACGCGAGTGCTAAAGAAATTTTTGACAATATATCTCTGCATAAGGTTACTCATATGGGCGGCGCACCTACTGTCTTAAACATGATTATCAATGCACCAA
CAAGCGAACAGAAGCCCCTTCCAGGGAAGGTAACCATGATGACTGGTGGCGCTCCGCCGCCTTCCCATGTTCTTTTTAAGATGAGAGCAATGGGATTCCTGATTGTCCAT
GCATATGGTTTGACCGAAACTTATGGGCCAGCAACAGTTTGCTCTTGGAAACCAGAATGGGATTCACTCCCTCAAGATAAACAGGCAAAGCTAAGTTCTCGCCAAGGATT
GCAGCATGTTGGGCTAGAGGAAGTGGACATAAAGGATCCTGTCACCATGGAGAGTGCTCCAGCCGATGGAAAAACCATTGGTGAAGTTATGGTCAGAGGCAACACTGTGA
TGAGTGGATACTTAAAAGATCTCAAAGCCACACGGGAAGCCTTCAATGGCGGATGGTTTCGAAGTGGGGACTTGGGGGTGAGGCACCCTGATGGTTACATAGAACTGAAG
GACCGTTCTAAGGACATAATCATTTCTGGGGGAGAGAATATTAGCTCGATTGAGGTGGAGTCTGTTCTTTTCAGCCATCCATCAGTTCTTGAAGCTGCTGTTGTGGGAAG
ACCTGATGACCACTGGGGTGAAACACCATGTGCATTTGTGAAGCTCAAAGATGGGTGCAGTGCGAGTGAAGAAGATATTATAAAATTCTGCAGAGAACACCTGCCTCACT
ACATGGCTCCGCGAAGTGTGGTGTTTAAAGATTTGCCAAAAACTTCGACAGGGAAAACTCAGAAGTTTATTCTCAAGGAGGAGGCCAAGGCCATGGGTAGCCTTCCAAAA
CAGACCAGCAAACTGGCAAGGCTTCGTTTGGGATCCATAGCTGACTCGCTATACAGATTCAGGCCACACGAACATGGGCGAAAACAGGATGCGAATAAGATGGTATTCCG
TCGAGCTCTTCTCATCTCCCAAGGTATTGAATATTTGGGAAATGAAGCCGAGTCAACTAAGTTCATGCAGAGACAGATTGTCGATGCACTTCGAGTGGGTGATAGAAGTA
GTGCTTCCAATCTGCTCATGGAACTTGGCCAGGAAAAACACTCCTTAACTGCAGATAATTTTGTTGGCATTTTGAGCTACTGTGCAAGATCACCTGATCCACTGGTAGTT
CTTGCGTTCTTTGTTATCCTCTTGAATTTTGATTTAGTCGTCACCTTTTATGGCTACAATTTTCACTCTCCTCGATTTGTCATGGAAACTTGGAAAATAATGGAAGAAAG
AGGAGTTTTCCTGGATAACACATGCACTTTACTTATGATAAAAGCACTCTGTAAGGGCGGTTACTTGGATGAGGCATTTGGTCTAATAAGTTTCCTGGCAGAAAGTCGTG
TCATGTTTCCTGTTCTGCCTGTGTACAATCTTTTCTTGAGAGCCTGTGGCAAAAGGCAAAGTACGGTTCATGTTAGTCAATGTTTGGATATAATGGATCGTAGAATGGTC
GGGAAGAATGAAGCTACATATTCTGAGCTACTCAAGGAAAGTTTCTGCTATAAGGGTATTGTTACATCATATATGTACCTGCTGGGAGACCTAAAATCTGCACATACTGC
ACTGCAAAAGATGGTGGCTTTGGTTATTGGAGCCGCAGGACAAAAGTTACCCTCTTTAGAATTGGACATTCCTGTACCCTTAAGAACTGAATTCTATCATGACAATTTTA
ATTTCGAGGAAAATGGACCTTCTACCGACGAGGTATACTGTAAGAAAATGGTCCCCTGCGAAGGTGACATAGAGCAATTTTCTGTTAATGGTATGAAGTGTGGAGAAGTT
GAAAGTGGTCGAACTTTGCCAAGCAATTACAGAAGCAATTTTGTTATGAAGGTTTTGAGGTGGTCTTTCAATGATGTGATATGTGCATGTGCGCTTACTAGGAACTGTGG
TCTTGCAGAGCAGTTAATGCAACAGTTCAGCATTTCATGTGTACTGACGACACTAGGGTTCGTTTCTTTGAAGATGCATGAACTGGGATTGCAACCTTCGTCCCACACAT
TTGATGGTTTTGTTAGATCAGTTGTTTCAGAGAGAGGTTTCAGTGATGGCATTAAAATAGATCAGCCAGAACGTGCCATGCGTATGTTGGCTAAAATGAAACAAATGGAG
GTGCTTCCAGATGTGAAGACCTATGAGCTTTTATATTCATTATTTGGTAACGTGAATGCTCCATACGAGGAGGGCAACAGATTGTCACAGGTGGATGCTGCAAAAAGGAT
ACGCATGATAGAGATGGATATGGAGAAACATGGGATCCAACACAGTCATTTCTCTATGATGAACTTGTTGAAAGCTCTTGGTGCGGAGGGGATGACGAAGGAGCTTCTTC
AGTATTTAAATGTGGCAGAGAACCTCTTCTATTACAATAACACTTGGCTGGGGACGCCTATTTACAACACAGCGTTGCATTTTTTAGTTGAATCCAAAGAGATCCATATG
GCAATAGAATTATTCAATAATATGAAGCATTCTGGTCTCTTTCCAGATGCTGCGACATTTGAGATGATGATCGACTGTTGTAGTGTTATCGGATGCTTGAAATCTGCGTT
TGCTCTTCTCTCCCTGATGATTCGCTCCGGGTTTTGTCCTCAGATATTAACTTATACCAGTCTAGTAAAGATTGTGCTGGGATTTGAGAGATTTGATGATGCCTTGAATC
TCTTAGATCAAGCCAGTTCAGAAGGGATTGAACTTGATGTAGTTATAATGAATACAATCGTGCAGAAAGCTTGTGAAAAGGGAAGGATTGATGTGATTGAGTTTGTCGTT
GAGAAGATGAAGCGCAAAAAGATCCAACCCGACCCTTCAACGTGCCATAGTGTCTTCTCGGCATATGTGAGCCTTGGCTATCACAGCACCGCCATGGAAGCGCTGCAAGT
ACTGAGCATGCGTATGCTATGCAAAGAACAGGACACTTCTCCAGTCGTTACAGAATATGTCGAAGACTTTGTGCTTGCAGAAGACTCCGAAGCGGAATCACGGATTTTGG
AATTCTTCAAATGCTCTGAAGAGAGCCTAAGTTTTGCCCTTCTCAACTTGAGATGGTCTGCCATGCTGGGATATTCCCTTTGTTCCTCCCCTAATCAGAGTCCATGGGCA
ATGAGACTTGCAAGTTCCTATGATGATGGCTACACAGCCTAATAGATGAAAATGAAATCATTAAGAGTCTCTTTGTTCTACGCTAATTAAGTTGAATATTTCGTTTTTTC
AACATGGAATTAGAACAAGAGATCAAAACAATTGTTGCATTTTTACTTGGAGAGCGTCCAAATGCAAATAAAGCACTTATTAACAGTTAATTAGTAGCCGAATTGAAA
Protein sequenceShow/hide protein sequence
MDGMDRCSANYLPLTPITFLERSAAVYGDRISLVYGRVQYTWRETLLRCTRLASALVRMGIARGDVVAALAPNIPAMYELHFAVPMAGAVLCSLNVRHDAAMVSTLLSHS
EAKIIVVDYQFEHIVMGAIKAMSERREKIPLVVIINEYDQPPSHIDREGSASGYLEFESLLESGKLDFKIRRPKDELDPIALNYTSGTTSRPKGVIYSHRGAYLNALSAV
LLNDMCSLPVYLWVVPMFHCNGWSLTWGVAAQSGTNICQRNASAKEIFDNISLHKVTHMGGAPTVLNMIINAPTSEQKPLPGKVTMMTGGAPPPSHVLFKMRAMGFLIVH
AYGLTETYGPATVCSWKPEWDSLPQDKQAKLSSRQGLQHVGLEEVDIKDPVTMESAPADGKTIGEVMVRGNTVMSGYLKDLKATREAFNGGWFRSGDLGVRHPDGYIELK
DRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVKLKDGCSASEEDIIKFCREHLPHYMAPRSVVFKDLPKTSTGKTQKFILKEEAKAMGSLPK
QTSKLARLRLGSIADSLYRFRPHEHGRKQDANKMVFRRALLISQGIEYLGNEAESTKFMQRQIVDALRVGDRSSASNLLMELGQEKHSLTADNFVGILSYCARSPDPLVV
LAFFVILLNFDLVVTFYGYNFHSPRFVMETWKIMEERGVFLDNTCTLLMIKALCKGGYLDEAFGLISFLAESRVMFPVLPVYNLFLRACGKRQSTVHVSQCLDIMDRRMV
GKNEATYSELLKESFCYKGIVTSYMYLLGDLKSAHTALQKMVALVIGAAGQKLPSLELDIPVPLRTEFYHDNFNFEENGPSTDEVYCKKMVPCEGDIEQFSVNGMKCGEV
ESGRTLPSNYRSNFVMKVLRWSFNDVICACALTRNCGLAEQLMQQFSISCVLTTLGFVSLKMHELGLQPSSHTFDGFVRSVVSERGFSDGIKIDQPERAMRMLAKMKQME
VLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRMIEMDMEKHGIQHSHFSMMNLLKALGAEGMTKELLQYLNVAENLFYYNNTWLGTPIYNTALHFLVESKEIHM
AIELFNNMKHSGLFPDAATFEMMIDCCSVIGCLKSAFALLSLMIRSGFCPQILTYTSLVKIVLGFERFDDALNLLDQASSEGIELDVVIMNTIVQKACEKGRIDVIEFVV
EKMKRKKIQPDPSTCHSVFSAYVSLGYHSTAMEALQVLSMRMLCKEQDTSPVVTEYVEDFVLAEDSEAESRILEFFKCSEESLSFALLNLRWSAMLGYSLCSSPNQSPWA
MRLASSYDDGYTA