; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017738 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017738
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionAMP-dependent synthetase and ligase family protein
Genome locationtig00153055:450245..461605
RNA-Seq ExpressionSgr017738
SyntenySgr017738
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000873 - AMP-dependent synthetase/ligase
IPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily
IPR020845 - AMP-binding, conserved site
IPR025110 - AMP-binding enzyme, C-terminal domain
IPR042099 - ANL, N-terminal domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAE5964315.1 unnamed protein product [Arabidopsis arenosa]0.0e+0043.43Show/hide
Query:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI
        M+G  RCSANYVPLTPI+FLER+A V+  R S+VYG ++YTWR T +RC RLASAL+ +G++R DV                                  
Subjt:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI

Query:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQ
                        VAALAPN+PA+ EL+F  PMAG+VLC LN   D+ M++  L  ++ K+  VD +FL + + ++K++S   E+ PL++ I E+  
Subjt:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQ

Query:  PSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVA
          S            +++  L++G P+F   RP DE DPI+LN+TSGTTS+PK V+YSHRGAYLN+ +  ++N++  +PVYLWTVPM+HC+GW   W VA
Subjt:  PSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVA

Query:  AQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEW
        A GG N+C R V  + IFD+I  HKVT+ GG+P +LNMI N P S +K  P  V VM+GG+ PP  V+ +++ LGF ++ +YG TE YG  T C W PEW
Subjt:  AQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEW

Query:  DSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISG
        ++LPE++ ++L +R GL H   E VD+ +P TM+S P DGKT+G ++ RGNTVM+GY KD +AT+ AF+GGW+ S D+GV   DG       S+D+I  G
Subjt:  DSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISG

Query:  GENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASEEEIIKFCKKHLPN---YMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGS
        GE + S E+E +L+SHP+V EAAVVGRPD+  GE+ CAFV+LK+G  A EEEII+FC++ + N    M P+ VVF  +PKT TGK +K +L+E AK MG 
Subjt:  GENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASEEEIIKFCKKHLPN---YMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGS

Query:  LPKRISKLLRLQ---------------IETRTHGFLHIASDIFCLLS-SRRCLFG------------------EVLRTMHR---------------QVKH
        +P +  + L L                  +R  G   I +   C  S S   L G                  + LR+  R                +  
Subjt:  LPKRISKLLRLQ---------------IETRTHGFLHIASDIFCLLS-SRRCLFG------------------EVLRTMHR---------------QVKH

Query:  SPGHGSSYKCPNLVSP--------------LHFSSGQGLVWNQSLTRYTDLSRTN-----MGEE------------------RLLANWYSSDSSHL--QR
        +  H   + C     P              +   S   L   QSL     L + +     +GE+                  +  + +++S    L  QR
Subjt:  SPGHGSSYKCPNLVSP--------------LHFSSGQGLVWNQSLTRYTDLSRTN-----MGEE------------------RLLANWYSSDSSHL--QR

Query:  MVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALA------IGAAGGKLPSLKLDIPIPSRTEFY
         VGKNE+TY+ LLKLAV Q+NLS+V++IW  +V +YS  +L LRKFIWS+TRLGDLKSAY  LQ MV LA      +    GKL S  LDIP+P++ E  
Subjt:  MVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALA------IGAAGGKLPSLKLDIPIPSRTEFY

Query:  HNNFDFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTF
             F  N H+ +                            S  + L   +      +VLRWSFNDVIHAC  ++N  LAEQLM QM+++GL PS HT+
Subjt:  HNNFDFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTF

Query:  DGFVRSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALV
        DGF+R+V    G+  GM +LKVMQQ+ LKPY STL  VS  CSKA ++DLAE LL+QIS C Y +PFN  L+A D++D P                    
Subjt:  DGFVRSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALV

Query:  CTFGSKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMT
                         ERA+R+L +MKQLK+ PD++TYELL+SLFGNVNAPYEEGN LSQVD  KRI  IE DM ++G +HS +S  N+L+ALGAEGM 
Subjt:  CTFGSKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMT

Query:  KELLQYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLV
         E++++L  AENL  ++N  LGTP YN VLH L+E+ E  + I +F  MK  G   D AT+ +MIDCCS++   KSA AL+S+M+R GF P+ +T+T+L+
Subjt:  KELLQYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLV

Query:  KIVLGFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDP
        KI+L    F++ALNLLDQA+ E I LDV+  NT+L+KA EKG IDVIE++VE+M+REK+ PDP+TCH VF+ YV  GYH+TA+EAL VLS+RML +ED  
Subjt:  KIVLGFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDP

Query:  SAD--LTEYVENFALAEDSAADSRILEFFK
        S      E  ENF ++ED  A+++I+E F+
Subjt:  SAD--LTEYVENFALAEDSAADSRILEFFK

XP_022132725.1 probable acyl-activating enzyme 1, peroxisomal [Momordica charantia]2.5e-29082.31Show/hide
Query:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI
        MDG  RCSAN+VPLTPITFLERSA VY +RISLVYGR+RYTW+DTL+RCTRLASAL  +GIA GDV                                  
Subjt:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI

Query:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQ
                        VAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDA MVSTLLTHSEAKII VDYQFLHIVKGAI+VMS+R+EKLPLVVIIQE DQ
Subjt:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQ

Query:  PSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVA
        PSSH DR  SASEDLEFE LLATGE DFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLS VLLN +CS P+YLWTVPMFHCNGWCFTW VA
Subjt:  PSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVA

Query:  AQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEW
        AQGGTN+CQRNVTAKEIFDNIS HKVTHM GAPT+LNMIIN P+S+QKPLP KVTVMTGGAPPPSHVLYKMRALGF +VHSYGLTETYGPATVC WKPEW
Subjt:  AQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEW

Query:  DSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISG
        DSLP+DKQAKLNSRQGLQHVGMEEVDIK+PVTMES PADGKTMGEVMFRGNTVMNGYLKDLKAT+EAF GGWFRSGDLGVRHLDGYIELKDRSKDIIISG
Subjt:  DSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISG

Query:  GENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGSLPK
        GENISSIEVESVLFSHP+VLEAAVVGRPDDHWGETPCAFV+LKDGCSA+EEEII+FC++HLP+YM+PRCVVFK LPKTSTGKTQKFILK+EAKAMGSL K
Subjt:  GENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGSLPK

Query:  RISKL
        RISKL
Subjt:  RISKL

XP_022937086.1 pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita moschata]2.7e-28481.41Show/hide
Query:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF
        +RMVGKNE TYSELLK+AVCQ+NLSSVHEIW DFVKNYSPSVLSLRKFIW YTRLGDLKSA+ ALQKMVAL IGAAG KLPSL+LDIP+P RTEFYH+NF
Subjt:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF

Query:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV
        +FEENG STDE++CKK+V C+GDIEQFSV+G+KCGEVESG  TL +NY S  VMKVLRWSFNDVI ACA  RNCGLAEQLMQQM +LGLQPS HTFDGFV
Subjt:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV

Query:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG
        RSVVSERGFSDG+KILK+MQQRKLKPY STL AVSISCSKALELDLAEALLEQISAC YPHPFNAFLSACD MD P                        
Subjt:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG

Query:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL
                     ERAMRML KMKQ++VLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIR+IE DM KHGI+HSH SMMNLLKALGAEGMTKELL
Subjt:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL

Query:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL
        QYLNVAENLFYYNNT LGTPIYNT LHFLVESKEI +AIELFNNMKHSG FPDAATFEMMIDCCSV+GCLKSAFALLS+M+R GFCPQILTYTSLVKIVL
Subjt:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL

Query:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL
        GFERFDDALNLLDQASSEGIELDVVI+NT++QKA EKGRIDVIEFVVE+M R+KI+PDPSTCHSVFSAYV+LGYHSTA+EALQVLSMRMLCKE D S  +
Subjt:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL

Query:  TEYVENFALAEDSAADSRILEFFK
        TEYVE+F LAEDS A+SRILEFFK
Subjt:  TEYVENFALAEDSAADSRILEFFK

XP_022937087.1 pentatricopeptide repeat-containing protein At1g76280 isoform X2 [Cucurbita moschata]2.7e-28481.41Show/hide
Query:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF
        +RMVGKNE TYSELLK+AVCQ+NLSSVHEIW DFVKNYSPSVLSLRKFIW YTRLGDLKSA+ ALQKMVAL IGAAG KLPSL+LDIP+P RTEFYH+NF
Subjt:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF

Query:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV
        +FEENG STDE++CKK+V C+GDIEQFSV+G+KCGEVESG  TL +NY S  VMKVLRWSFNDVI ACA  RNCGLAEQLMQQM +LGLQPS HTFDGFV
Subjt:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV

Query:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG
        RSVVSERGFSDG+KILK+MQQRKLKPY STL AVSISCSKALELDLAEALLEQISAC YPHPFNAFLSACD MD P                        
Subjt:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG

Query:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL
                     ERAMRML KMKQ++VLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIR+IE DM KHGI+HSH SMMNLLKALGAEGMTKELL
Subjt:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL

Query:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL
        QYLNVAENLFYYNNT LGTPIYNT LHFLVESKEI +AIELFNNMKHSG FPDAATFEMMIDCCSV+GCLKSAFALLS+M+R GFCPQILTYTSLVKIVL
Subjt:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL

Query:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL
        GFERFDDALNLLDQASSEGIELDVVI+NT++QKA EKGRIDVIEFVVE+M R+KI+PDPSTCHSVFSAYV+LGYHSTA+EALQVLSMRMLCKE D S  +
Subjt:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL

Query:  TEYVENFALAEDSAADSRILEFFK
        TEYVE+F LAEDS A+SRILEFFK
Subjt:  TEYVENFALAEDSAADSRILEFFK

XP_022976056.1 pentatricopeptide repeat-containing protein At1g76280 isoform X1 [Cucurbita maxima]3.5e-28481.41Show/hide
Query:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF
        +RMVGKNE TYSELLK+AV Q+NLSSVHEIW DFVKNYSPSVLSLRKFIWSYTRLGDLKSAY ALQKMV L IGAAG KL SL+LDIP+P RTEFYH+NF
Subjt:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF

Query:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV
        +FEENG STDEL+CKK+V C+GDI QFSV+G+KCGEVESG LTL +NY S  VMKVLRWSFNDVI ACA  RNCGLAEQLMQQM +LGLQPS HTFDGFV
Subjt:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV

Query:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG
        RSVVSERGFSDG+KILK+MQQRKLKPY STL AVSISCSKALELDLAEALLEQISAC YPHPFNAFLSACD MD P                        
Subjt:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG

Query:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL
                     ERAMRML KMKQ++VLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIR+IE DM KHGI+HSH SMMNLLKALGAEGMTKELL
Subjt:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL

Query:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL
        QYLNVAENLFYYNNT LGTPIYNT LHFLVESKEI +A ELFNNMKHSG FPDAATFEMMIDCCSV+GCLKSAFALLS+M+R GFCPQILTYTSLVKIVL
Subjt:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL

Query:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL
        GFERFDDALNLLDQASSEGIELDVVI+NT++QKA EKGRIDVIEFVVE+M REKI+PDPSTCHSVFSAYV+LGYHSTA+EALQVLSMRMLCKE D S  +
Subjt:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL

Query:  TEYVENFALAEDSAADSRILEFFK
        TEYVE+F LAEDS A+SRILEFFK
Subjt:  TEYVENFALAEDSAADSRILEFFK

TrEMBL top hitse value%identityAlignment
A0A6J1BTA0 probable acyl-activating enzyme 1, peroxisomal1.2e-29082.31Show/hide
Query:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI
        MDG  RCSAN+VPLTPITFLERSA VY +RISLVYGR+RYTW+DTL+RCTRLASAL  +GIA GDV                                  
Subjt:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI

Query:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQ
                        VAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDA MVSTLLTHSEAKII VDYQFLHIVKGAI+VMS+R+EKLPLVVIIQE DQ
Subjt:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQ

Query:  PSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVA
        PSSH DR  SASEDLEFE LLATGE DFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLS VLLN +CS P+YLWTVPMFHCNGWCFTW VA
Subjt:  PSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVA

Query:  AQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEW
        AQGGTN+CQRNVTAKEIFDNIS HKVTHM GAPT+LNMIIN P+S+QKPLP KVTVMTGGAPPPSHVLYKMRALGF +VHSYGLTETYGPATVC WKPEW
Subjt:  AQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEW

Query:  DSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISG
        DSLP+DKQAKLNSRQGLQHVGMEEVDIK+PVTMES PADGKTMGEVMFRGNTVMNGYLKDLKAT+EAF GGWFRSGDLGVRHLDGYIELKDRSKDIIISG
Subjt:  DSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISG

Query:  GENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGSLPK
        GENISSIEVESVLFSHP+VLEAAVVGRPDDHWGETPCAFV+LKDGCSA+EEEII+FC++HLP+YM+PRCVVFK LPKTSTGKTQKFILK+EAKAMGSL K
Subjt:  GENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGSLPK

Query:  RISKL
        RISKL
Subjt:  RISKL

A0A6J1F9C6 pentatricopeptide repeat-containing protein At1g76280 isoform X11.3e-28481.41Show/hide
Query:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF
        +RMVGKNE TYSELLK+AVCQ+NLSSVHEIW DFVKNYSPSVLSLRKFIW YTRLGDLKSA+ ALQKMVAL IGAAG KLPSL+LDIP+P RTEFYH+NF
Subjt:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF

Query:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV
        +FEENG STDE++CKK+V C+GDIEQFSV+G+KCGEVESG  TL +NY S  VMKVLRWSFNDVI ACA  RNCGLAEQLMQQM +LGLQPS HTFDGFV
Subjt:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV

Query:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG
        RSVVSERGFSDG+KILK+MQQRKLKPY STL AVSISCSKALELDLAEALLEQISAC YPHPFNAFLSACD MD P                        
Subjt:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG

Query:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL
                     ERAMRML KMKQ++VLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIR+IE DM KHGI+HSH SMMNLLKALGAEGMTKELL
Subjt:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL

Query:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL
        QYLNVAENLFYYNNT LGTPIYNT LHFLVESKEI +AIELFNNMKHSG FPDAATFEMMIDCCSV+GCLKSAFALLS+M+R GFCPQILTYTSLVKIVL
Subjt:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL

Query:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL
        GFERFDDALNLLDQASSEGIELDVVI+NT++QKA EKGRIDVIEFVVE+M R+KI+PDPSTCHSVFSAYV+LGYHSTA+EALQVLSMRMLCKE D S  +
Subjt:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL

Query:  TEYVENFALAEDSAADSRILEFFK
        TEYVE+F LAEDS A+SRILEFFK
Subjt:  TEYVENFALAEDSAADSRILEFFK

A0A6J1FA55 pentatricopeptide repeat-containing protein At1g76280 isoform X21.3e-28481.41Show/hide
Query:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF
        +RMVGKNE TYSELLK+AVCQ+NLSSVHEIW DFVKNYSPSVLSLRKFIW YTRLGDLKSA+ ALQKMVAL IGAAG KLPSL+LDIP+P RTEFYH+NF
Subjt:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF

Query:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV
        +FEENG STDE++CKK+V C+GDIEQFSV+G+KCGEVESG  TL +NY S  VMKVLRWSFNDVI ACA  RNCGLAEQLMQQM +LGLQPS HTFDGFV
Subjt:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV

Query:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG
        RSVVSERGFSDG+KILK+MQQRKLKPY STL AVSISCSKALELDLAEALLEQISAC YPHPFNAFLSACD MD P                        
Subjt:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG

Query:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL
                     ERAMRML KMKQ++VLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIR+IE DM KHGI+HSH SMMNLLKALGAEGMTKELL
Subjt:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL

Query:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL
        QYLNVAENLFYYNNT LGTPIYNT LHFLVESKEI +AIELFNNMKHSG FPDAATFEMMIDCCSV+GCLKSAFALLS+M+R GFCPQILTYTSLVKIVL
Subjt:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL

Query:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL
        GFERFDDALNLLDQASSEGIELDVVI+NT++QKA EKGRIDVIEFVVE+M R+KI+PDPSTCHSVFSAYV+LGYHSTA+EALQVLSMRMLCKE D S  +
Subjt:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL

Query:  TEYVENFALAEDSAADSRILEFFK
        TEYVE+F LAEDS A+SRILEFFK
Subjt:  TEYVENFALAEDSAADSRILEFFK

A0A6J1IEQ4 pentatricopeptide repeat-containing protein At1g76280 isoform X21.7e-28481.41Show/hide
Query:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF
        +RMVGKNE TYSELLK+AV Q+NLSSVHEIW DFVKNYSPSVLSLRKFIWSYTRLGDLKSAY ALQKMV L IGAAG KL SL+LDIP+P RTEFYH+NF
Subjt:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF

Query:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV
        +FEENG STDEL+CKK+V C+GDI QFSV+G+KCGEVESG LTL +NY S  VMKVLRWSFNDVI ACA  RNCGLAEQLMQQM +LGLQPS HTFDGFV
Subjt:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV

Query:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG
        RSVVSERGFSDG+KILK+MQQRKLKPY STL AVSISCSKALELDLAEALLEQISAC YPHPFNAFLSACD MD P                        
Subjt:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG

Query:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL
                     ERAMRML KMKQ++VLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIR+IE DM KHGI+HSH SMMNLLKALGAEGMTKELL
Subjt:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL

Query:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL
        QYLNVAENLFYYNNT LGTPIYNT LHFLVESKEI +A ELFNNMKHSG FPDAATFEMMIDCCSV+GCLKSAFALLS+M+R GFCPQILTYTSLVKIVL
Subjt:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL

Query:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL
        GFERFDDALNLLDQASSEGIELDVVI+NT++QKA EKGRIDVIEFVVE+M REKI+PDPSTCHSVFSAYV+LGYHSTA+EALQVLSMRMLCKE D S  +
Subjt:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL

Query:  TEYVENFALAEDSAADSRILEFFK
        TEYVE+F LAEDS A+SRILEFFK
Subjt:  TEYVENFALAEDSAADSRILEFFK

A0A6J1IFV2 pentatricopeptide repeat-containing protein At1g76280 isoform X11.7e-28481.41Show/hide
Query:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF
        +RMVGKNE TYSELLK+AV Q+NLSSVHEIW DFVKNYSPSVLSLRKFIWSYTRLGDLKSAY ALQKMV L IGAAG KL SL+LDIP+P RTEFYH+NF
Subjt:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNF

Query:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV
        +FEENG STDEL+CKK+V C+GDI QFSV+G+KCGEVESG LTL +NY S  VMKVLRWSFNDVI ACA  RNCGLAEQLMQQM +LGLQPS HTFDGFV
Subjt:  DFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCHTFDGFV

Query:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG
        RSVVSERGFSDG+KILK+MQQRKLKPY STL AVSISCSKALELDLAEALLEQISAC YPHPFNAFLSACD MD P                        
Subjt:  RSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVALVCTFG

Query:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL
                     ERAMRML KMKQ++VLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIR+IE DM KHGI+HSH SMMNLLKALGAEGMTKELL
Subjt:  SKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEGMTKELL

Query:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL
        QYLNVAENLFYYNNT LGTPIYNT LHFLVESKEI +A ELFNNMKHSG FPDAATFEMMIDCCSV+GCLKSAFALLS+M+R GFCPQILTYTSLVKIVL
Subjt:  QYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVL

Query:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL
        GFERFDDALNLLDQASSEGIELDVVI+NT++QKA EKGRIDVIEFVVE+M REKI+PDPSTCHSVFSAYV+LGYHSTA+EALQVLSMRMLCKE D S  +
Subjt:  GFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADL

Query:  TEYVENFALAEDSAADSRILEFFK
        TEYVE+F LAEDS A+SRILEFFK
Subjt:  TEYVENFALAEDSAADSRILEFFK

SwissProt top hitse value%identityAlignment
F4HUK6 Butanoate--CoA ligase AAE16.1e-22362.23Show/hide
Query:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI
        M+G  +  ANYVPLTPI+FL+RSA VY +R+S+VYG V+YTWR T +RC R+ASAL+ +GI+ GDV                                  
Subjt:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI

Query:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQ
                        V+ LAPN+PAM ELHF VPMAGA+LCTLNIRHD+ +V+ LL HS  K+I  D+QFL I +GA +++S + +K+P++V+I E   
Subjt:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQ

Query:  PSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVA
         S  + R   + E +E+E ++A G+ DFE+ RP DE D IS+NYTSGTTS PKGV+YSHRGAYLNSL+AVLLN++ S P YLWT PMFHCNGWC  WGV 
Subjt:  PSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVA

Query:  AQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEW
        A GGTNIC RNVTAK IFDNIS HKVTHMGGAPTILNMIIN P S+QKPLPGKV+ +TG APPP+HV++KM  LGF + HSYGLTETYGP T+CTWKPEW
Subjt:  AQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEW

Query:  DSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISG
        DSLP ++QAK+ +RQG+ H+G+EE+ +K+PVTM + PADG TMGEV+FRGNTVMNGYLK+ +AT+EAFKGGWF SGDLGV+H DGYIELKDRSKDIIISG
Subjt:  DSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISG

Query:  GENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGSLPK
        GENISSIEVES LF+HP VLEAAVV RPD++WGET CAFV+LKDG  AS EE+I +C+  LP+YMAPR +VF+ LPKTSTGK QKF+L+ +AKA+ SL K
Subjt:  GENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGSLPK

Query:  R
        +
Subjt:  R

M4IRL4 Isovalerate--CoA ligase CCL21.4e-19857.81Show/hide
Query:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASAL-AHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFL
        M+G+ +CSAN+VPL+PITFLERS+  Y +  SLVYG VRYTW  T  RC +LASAL  H+GI+ GDV                                 
Subjt:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASAL-AHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFL

Query:  ITFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESD
                         VA  + N+P +YELHFAVPMAG +LCTLN R+D+ MVSTLL HSEAK+I V+ Q L   + A+ +++++  K P +V++ +S+
Subjt:  ITFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESD

Query:  QPSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGV
                  S+S D  +  LLA G  DFEIRRPK+EWDPIS+NYTSGTT+RPK V+YSHRGAYLNS++ VLL+ + +  VYLW+VPMFHCNGWCF WG 
Subjt:  QPSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGV

Query:  AAQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSD-QKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKP
        AAQG TNIC R V+ K IFDNI LHKVTH G APT+LNMI+N+P  +   PLP KV VMTGG+PPP  V+ +M  +GF + H YGLTET GPA  C  KP
Subjt:  AAQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSD-QKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKP

Query:  EWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIII
        EWD+L  +++  L +RQGL H+ MEE+D+++PVTMES  ADG T+GEVMFRGNTVM+GY KDLKAT+EAF+GGWFRSGDLGV+H DGYI+LKDR KD++I
Subjt:  EWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIII

Query:  SGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGC--SASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMG
        SGGENIS++EVE+VL+SH +VLEAAVV RPD  WGETPCAFV LK+G     S ++IIKFC+  LP+YMAP+ VVF+ LPKTSTGK QK+ILKE+A AMG
Subjt:  SGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGC--SASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMG

Query:  SL
        SL
Subjt:  SL

M4IS88 Acetate--CoA ligase CCL36.9e-16648.77Show/hide
Query:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI
        +D + + +ANY  LTP+ FLER+ATV+  R S+++G   YTW  T  RC + ASAL +  I  G                                    
Subjt:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI

Query:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVM---SKRREKLPLVVIIQE
                        VA +APN+PA+YE HFAVPMAGAV+  +NIR +A  ++ LL HS A  + VD +F  + + A+K++   SK   K PL+V+I +
Subjt:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVM---SKRREKLPLVVIIQE

Query:  SDQPSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTW
               ++        +E+E  L  G+P+F+ + P+DEW  ISL YTSGTT+ PKGV+ SHRGAYL SLSA ++  I    +YLWT+PMFHCNGWC+TW
Subjt:  SDQPSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTW

Query:  GVAAQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQ-KPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTW
        G+AA  GTNIC R VTAK ++  I+ + VTH   AP +LN I+N P  +   PLP  V VMT GA PP  VL+ M   GF + H+YGL+ETYGP+T+C W
Subjt:  GVAAQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQ-KPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTW

Query:  KPEWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDI
        KPEWDSLP  KQA+LN+RQG++++ +E +D+ +  TM+  PADG TMGE++ RGN VM GYLK+ KA +E+F  GWF SGDL V+H DGYIE+KDRSKDI
Subjt:  KPEWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDI

Query:  IISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASE-----EEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEE
        IISGGENISS+EVE+ L+ HP+VLE +VV RPD+ WGE+PCAFV LK     S      E+IIKFCK  +P Y  P+ VVF  LPKT+TGK QK +L+ +
Subjt:  IISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASE-----EEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEE

Query:  AKAMGSLPK
        AK MG+L K
Subjt:  AKAMGSLPK

M4IS92 Probable CoA ligase CCL138.9e-19857.64Show/hide
Query:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASAL-AHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFL
        M+G+ +CSAN+VPL+PITFLERS+  Y +  SLVYG VRYTW  T  RC +LASAL  H GI+ GDV                                 
Subjt:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASAL-AHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFL

Query:  ITFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESD
                         VA  + NIP +YELHFAVPMAG +LCTLN R+D+ MVSTLL HSEAK+I V+ Q L   + A+ +++++  K P +V++ +S+
Subjt:  ITFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESD

Query:  QPSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGV
                  S+S D  +  LLA G  DFEIRRPK+E DPIS+NYTSGTT+RPK V+YSHRGAYLNS++ VLL+ + +  VYLW+VPMFHCNGWCF WG 
Subjt:  QPSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGV

Query:  AAQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSD-QKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKP
        AAQG TNIC R V+ K IFDNI LHKVTH G APT+LNMI+N+P  +   PLP KV VMTGG+PPP  V+ +M  +GF + H YGLTET+GPAT C  KP
Subjt:  AAQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSD-QKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKP

Query:  EWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIII
        EWD+L  +++  L +RQGL H+ MEE+D+++PV+MES  ADG T+GEVMFRGNTVM+GY KDLKAT+EAF+GGWFR+GDLGV+H DGYI+LKDR KD++I
Subjt:  EWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIII

Query:  SGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGC--SASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMG
        SGGEN+S++EVE+VL+SH +VLEAAVV RPD  WGETPCAFV LK+G     S ++IIKFC+  LP+YMAP+ VVF+ LPKTSTGK QK+ILKE+AKAMG
Subjt:  SGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGC--SASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMG

Query:  SL
        SL
Subjt:  SL

Q9SEY5 Isovalerate--CoA ligase AAE22.4e-20357.76Show/hide
Query:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI
        ++G+ R  AN+ PL+PITFLERSA VY +R SLV+G V++TW  T +RC RLASAL ++GI+RGDV                                  
Subjt:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI

Query:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMS---KRREKLPLVVIIQE
                        VAALAPN+PAM+ELHFAVPMAG +LC LN R D   +S LL HSEAKI+ VD+Q L I  GA+ +++   K R+ L LV+I Q 
Subjt:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMS---KRREKLPLVVIIQE

Query:  SDQPSSHIDRTCS----ASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGW
        +D   S  D + +     S D E+ETLL +G+ +FEI +P+ EWDPIS+NYTSGTTSRPKGV+YSHRGAYLNSL+ V L+ +   PVYLWTVPMFHCNGW
Subjt:  SDQPSSHIDRTCS----ASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGW

Query:  CFTWGVAAQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATV
        C  WGVAAQGGTNIC R V+ K IF NI++HKVTHMGGAPT+LNMI+N  V++ KPLP +V +MTGG+PP   +L KM  LGF + H YGLTETYGP T 
Subjt:  CFTWGVAAQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATV

Query:  CTWKPEWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRS
        C WKPEWDSL  +++ KL +RQG+QH+G+E +D+K+P+TME+ P DG TMGEVMFRGNTVM+GY KD++AT++AF+G WF SGDL V++ DGYIE+KDR 
Subjt:  CTWKPEWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRS

Query:  KDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGC-SASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEA
        KD+IISGGENISS+EVE VL SH +VLEAAVV RPD HWG+TPC FV+LK+G  +   EEII FC+ HLP+YMAP+ +VF  +PKTSTGK QK++L+++A
Subjt:  KDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGC-SASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEA

Query:  KAMGSL
          MGSL
Subjt:  KAMGSL

Arabidopsis top hitse value%identityAlignment
AT1G20560.1 acyl activating enzyme 14.3e-22462.23Show/hide
Query:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI
        M+G  +  ANYVPLTPI+FL+RSA VY +R+S+VYG V+YTWR T +RC R+ASAL+ +GI+ GDV                                  
Subjt:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI

Query:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQ
                        V+ LAPN+PAM ELHF VPMAGA+LCTLNIRHD+ +V+ LL HS  K+I  D+QFL I +GA +++S + +K+P++V+I E   
Subjt:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQ

Query:  PSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVA
         S  + R   + E +E+E ++A G+ DFE+ RP DE D IS+NYTSGTTS PKGV+YSHRGAYLNSL+AVLLN++ S P YLWT PMFHCNGWC  WGV 
Subjt:  PSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVA

Query:  AQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEW
        A GGTNIC RNVTAK IFDNIS HKVTHMGGAPTILNMIIN P S+QKPLPGKV+ +TG APPP+HV++KM  LGF + HSYGLTETYGP T+CTWKPEW
Subjt:  AQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEW

Query:  DSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISG
        DSLP ++QAK+ +RQG+ H+G+EE+ +K+PVTM + PADG TMGEV+FRGNTVMNGYLK+ +AT+EAFKGGWF SGDLGV+H DGYIELKDRSKDIIISG
Subjt:  DSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISG

Query:  GENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGSLPK
        GENISSIEVES LF+HP VLEAAVV RPD++WGET CAFV+LKDG  AS EE+I +C+  LP+YMAPR +VF+ LPKTSTGK QKF+L+ +AKA+ SL K
Subjt:  GENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGSLPK

Query:  R
        +
Subjt:  R

AT1G20560.2 acyl activating enzyme 12.8e-19968.63Show/hide
Query:  MYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQPSSHIDRTCSASEDLEFETLLATGEP
        M ELHF VPMAGA+LCTLNIRHD+ +V+ LL HS  K+I  D+QFL I +GA +++S + +K+P++V+I E    S  + R   + E +E+E ++A G+ 
Subjt:  MYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQPSSHIDRTCSASEDLEFETLLATGEP

Query:  DFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVAAQGGTNICQRNVTAKEIFDNISLHKV
        DFE+ RP DE D IS+NYTSGTTS PKGV+YSHRGAYLNSL+AVLLN++ S P YLWT PMFHCNGWC  WGV A GGTNIC RNVTAK IFDNIS HKV
Subjt:  DFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVAAQGGTNICQRNVTAKEIFDNISLHKV

Query:  THMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEWDSLPEDKQAKLNSRQGLQHVGMEEVD
        THMGGAPTILNMIIN P S+QKPLPGKV+ +TG APPP+HV++KM  LGF + HSYGLTETYGP T+CTWKPEWDSLP ++QAK+ +RQG+ H+G+EE+ 
Subjt:  THMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEWDSLPEDKQAKLNSRQGLQHVGMEEVD

Query:  IKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVG
        +K+PVTM + PADG TMGEV+FRGNTVMNGYLK+ +AT+EAFKGGWF SGDLGV+H DGYIELKDRSKDIIISGGENISSIEVES LF+HP VLEAAVV 
Subjt:  IKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVG

Query:  RPDDHWGETPCAFVQLKDGCSASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGSLPKR
        RPD++WGET CAFV+LKDG  AS EE+I +C+  LP+YMAPR +VF+ LPKTSTGK QKF+L+ +AKA+ SL K+
Subjt:  RPDDHWGETPCAFVQLKDGCSASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGSLPKR

AT1G76280.3 Tetratricopeptide repeat (TPR)-like superfamily protein3.0e-16450Show/hide
Query:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALA------IGAAGGKLPSLKLDIPIPSRTE
        QR VGKN +TY  LLKLAV Q+NLS+V++IW  +V +Y+  +LSLR+FIWS+TRLGDLKSAY  LQ MV LA      + +  GKL S +L IP+PS+ E
Subjt:  QRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYIALQKMVALA------IGAAGGKLPSLKLDIPIPSRTE

Query:  FYHNNFDFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCH
             F F      TD     ++V C+                 S  + L   +     ++VLRWSFNDVIHAC  ++N  LAEQLM QM++LGL PS H
Subjt:  FYHNNFDFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARNCGLAEQLMQQMRDLGLQPSCH

Query:  TFDGFVRSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVA
        T+DGF+R+V    G+  GM +LKVMQQ+ LKPY STL  V+  CSKAL++DLAE LL+QIS C Y +PFN  L+A D++D P                  
Subjt:  TFDGFVRSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDDIFVSVPYSFVA

Query:  LVCTFGSKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEG
                           ERA+R+L +MK+LK+ PD++TYELL+SLFGNVNAPYEEGN LSQVD  KRI  IE DM ++G +HS +S +N+L+ALGAEG
Subjt:  LVCTFGSKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAEG

Query:  MTKELLQYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTS
        M  E++++L  AENL  ++N  LGTP YN VLH L+E+ E  + I +F  MK  G   D AT+ +MIDCCS++   KSA AL+S+M+R GF P+ +T+T+
Subjt:  MTKELLQYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTS

Query:  LVKIVLGFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKED
        L+KI+L    F++ALNLLDQA+ E I LDV+  NT+L+KA EKG IDVIE++VE+M+REK+ PDP+TCH VFS YV  GYH+TA+EAL VLS+RML +ED
Subjt:  LVKIVLGFERFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKED

Query:  DPSAD--LTEYVENFALAEDSAADSRILEFFK
          S      E  ENF ++ED  A+++I+E F+
Subjt:  DPSAD--LTEYVENFALAEDSAADSRILEFFK

AT2G17650.1 AMP-dependent synthetase and ligase family protein1.7e-20457.76Show/hide
Query:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI
        ++G+ R  AN+ PL+PITFLERSA VY +R SLV+G V++TW  T +RC RLASAL ++GI+RGDV                                  
Subjt:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI

Query:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMS---KRREKLPLVVIIQE
                        VAALAPN+PAM+ELHFAVPMAG +LC LN R D   +S LL HSEAKI+ VD+Q L I  GA+ +++   K R+ L LV+I Q 
Subjt:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMS---KRREKLPLVVIIQE

Query:  SDQPSSHIDRTCS----ASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGW
        +D   S  D + +     S D E+ETLL +G+ +FEI +P+ EWDPIS+NYTSGTTSRPKGV+YSHRGAYLNSL+ V L+ +   PVYLWTVPMFHCNGW
Subjt:  SDQPSSHIDRTCS----ASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGW

Query:  CFTWGVAAQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATV
        C  WGVAAQGGTNIC R V+ K IF NI++HKVTHMGGAPT+LNMI+N  V++ KPLP +V +MTGG+PP   +L KM  LGF + H YGLTETYGP T 
Subjt:  CFTWGVAAQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATV

Query:  CTWKPEWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRS
        C WKPEWDSL  +++ KL +RQG+QH+G+E +D+K+P+TME+ P DG TMGEVMFRGNTVM+GY KD++AT++AF+G WF SGDL V++ DGYIE+KDR 
Subjt:  CTWKPEWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRS

Query:  KDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGC-SASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEA
        KD+IISGGENISS+EVE VL SH +VLEAAVV RPD HWG+TPC FV+LK+G  +   EEII FC+ HLP+YMAP+ +VF  +PKTSTGK QK++L+++A
Subjt:  KDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGC-SASEEEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEA

Query:  KAMGSL
          MGSL
Subjt:  KAMGSL

AT3G16910.1 acyl-activating enzyme 78.3e-16748.11Show/hide
Query:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI
        +D + +  ANY  LTP+ FL+R+A V+  R S+++G   YTWR T +RC RLASALA   I  G                                    
Subjt:  MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLI

Query:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRRE---KLPLVVIIQE
                        VA +APNIPAMYE HF VPM GAVL  +NIR +A  V+ LL+HS++ +I VD +F  + + ++++M ++     K PL+++I +
Subjt:  TFLNLNYATEYDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRRE---KLPLVVIIQE

Query:  SDQPSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTW
               ++R  S    +E+E  LATG+P++  + P DEW  I+L YTSGTT+ PKGV+  HRGAY+ +LS  L+  +    VYLWT+PMFHCNGWCF W
Subjt:  SDQPSSHIDRTCSASEDLEFETLLATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTW

Query:  GVAAQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQ-KPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTW
         +A   GT+IC R VTAKE++  I+ +KVTH   AP +LN I+N P  D   PLP  V VMT GA PP  VL+ M   GF + H+YGL+ETYGP+TVC W
Subjt:  GVAAQGGTNICQRNVTAKEIFDNISLHKVTHMGGAPTILNMIINTPVSDQ-KPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTW

Query:  KPEWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDI
        KPEWDSLP + QAKLN+RQG+++ GME++D+ +  T +  PADGKT GE++FRGN VM GYLK+ +A +E F GGWF SGD+ V+H D YIE+KDRSKD+
Subjt:  KPEWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADGKTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDI

Query:  IISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASE-----EEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEE
        IISGGENISS+EVE+V++ HP+VLEA+VV RPD+ W E+PCAFV LK      +     ++I+KFC++ LP Y  P+ VVF  LPKT+TGK QK IL+ +
Subjt:  IISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASE-----EEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEE

Query:  AKAMGSLPK
        AK MG +P+
Subjt:  AKAMGSLPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGAATGCACCGCTGCTCCGCCAATTACGTCCCTCTCACTCCAATTACCTTCTTGGAGCGCTCCGCCACGGTCTACGACGAAAGAATTTCTCTCGTGTATGGACG
TGTCCGGTACACTTGGAGAGACACGCTCGAGCGATGTACCAGGCTTGCTTCTGCGCTTGCCCACATGGGAATCGCTCGTGGAGATGTGGTCTATATTCGGAACTCTCCCA
TTTTCATTCGTTTGATGTTTTTGATTTTTCTGTTTGAAAGTCTTCAGAGGCATTTTGTTAGATTAAGCTCTTTCCTAATTACATTCCTTAATCTAAATTATGCAACTGAA
TACGATTCACTCGAGCAGGTTGCTGCTTTGGCACCAAATATTCCAGCTATGTACGAGCTGCATTTTGCTGTGCCAATGGCTGGTGCAGTTCTTTGCACCCTCAACATACG
CCATGATGCAGGAATGGTTTCAACACTGCTAACCCATTCGGAAGCTAAAATCATTGCTGTAGACTACCAGTTTCTACATATTGTGAAGGGAGCTATCAAGGTTATGTCCA
AGAGGAGAGAAAAGCTGCCTCTTGTGGTCATTATTCAAGAGTCTGATCAGCCATCCTCCCACATCGATAGAACCTGCTCCGCTTCAGAAGATCTAGAGTTTGAGACCCTT
TTAGCCACTGGGGAACCAGATTTTGAGATTAGACGGCCTAAAGACGAATGGGATCCGATTTCTCTTAACTATACTTCAGGCACAACATCAAGGCCAAAAGGTGTTATTTA
CTCCCATAGAGGTGCATATCTCAATTCTCTGTCTGCAGTCCTTCTGAATGATATCTGCTCACTCCCTGTGTATCTGTGGACTGTTCCAATGTTTCACTGCAATGGATGGT
GTTTCACTTGGGGTGTGGCTGCACAGGGTGGCACAAACATCTGCCAGAGGAATGTGACTGCCAAAGAAATCTTTGATAATATTTCTCTGCATAAGGTTACTCATATGGGC
GGTGCACCAACCATTTTGAACATGATTATCAATACACCAGTTAGCGATCAAAAGCCACTTCCCGGGAAGGTAACTGTGATGACTGGTGGCGCTCCACCGCCTTCTCATGT
ACTCTATAAGATGAGAGCATTGGGATTCCTTATTGTCCATTCATACGGTTTAACCGAAACATATGGTCCAGCAACAGTTTGCACTTGGAAACCTGAATGGGATTCCCTTC
CTGAAGATAAACAGGCAAAACTAAACTCTCGCCAAGGATTGCAGCATGTTGGGATGGAGGAAGTAGACATAAAGAATCCGGTCACCATGGAGAGTGCTCCAGCTGATGGA
AAAACCATGGGTGAAGTTATGTTCAGAGGCAACACTGTGATGAATGGATATTTGAAAGATCTGAAAGCCACACAGGAGGCGTTTAAAGGCGGCTGGTTTCGGAGTGGGGA
CTTGGGGGTCAGGCACCTCGACGGTTATATAGAACTGAAGGACCGTTCGAAGGACATTATAATTTCTGGGGGAGAAAACATTAGCTCAATTGAGGTGGAGTCTGTGCTTT
TCAGTCATCCATCAGTTCTTGAAGCTGCTGTTGTGGGAAGACCTGATGATCACTGGGGAGAAACACCATGTGCATTTGTACAGCTGAAAGATGGGTGCAGTGCCAGCGAA
GAAGAGATCATAAAGTTCTGCAAAAAGCACCTACCCAATTACATGGCTCCTCGATGCGTCGTGTTTAAAGTTTTGCCAAAAACTTCAACGGGGAAGACTCAGAAATTTAT
TCTCAAGGAGGAGGCCAAGGCCATGGGTAGTCTTCCAAAGAGGATCAGCAAACTTCTTCGGCTGCAGATCGAGACTCGAACTCATGGCTTTCTTCATATTGCATCCGATA
TCTTCTGTTTACTAAGCTCTCGTCGCTGTCTTTTCGGGGAAGTCCTGCGAACCATGCACAGGCAAGTAAAACACTCTCCGGGACATGGTAGCAGTTACAAATGCCCTAAC
CTAGTTTCTCCTCTGCATTTTTCCTCAGGGCAAGGTCTCGTTTGGAATCAATCGCTGACTCGTTATACAGATTTAAGCCGCACGAACATGGGCGAAGAGAGGCTGCTAGC
AAATTGGTATTCATCGGACTCTTCTCATCTCCAAAGAATGGTTGGGAAGAATGAAGTTACATATTCGGAGCTTCTCAAGCTTGCAGTGTGTCAGCAAAACTTGTCTTCAG
TGCATGAGATCTGGATGGACTTTGTTAAAAACTACAGTCCAAGTGTTTTGTCTCTGAGAAAGTTTATATGGTCTTATACAAGGCTGGGAGACCTAAAATCTGCATATATT
GCATTGCAAAAGATGGTGGCTTTGGCCATTGGAGCCGCAGGAGGAAAGTTACCCTCTTTAAAATTGGACATTCCTATACCTTCAAGAACTGAATTCTACCATAACAATTT
TGATTTTGAGGAGAATGGACATTCTACTGATGAGTTACATTGTAAGAAATTGGTCACCTGCGATGGTGACATAGAGCAATTTTCTGTTCATGGTGTGAAATGTGGAGAAG
TTGAAAGTGGTCCATTAACTTTGCAGAACAATTACATAAGCTGTTCTGTTATGAAGGTTTTGAGATGGTCTTTCAATGATGTGATACATGCATGTGCACATGCTAGGAAC
TGTGGTCTTGCAGAGCAGCTAATGCAACAGATGCGTGATCTCGGATTGCAACCTTCATGCCACACATTTGATGGTTTTGTTAGATCAGTTGTTTCAGAGAGGGGTTTCAG
TGATGGCATGAAAATATTAAAAGTAATGCAACAGAGGAAATTGAAGCCATATGGTTCAACTCTTGGTGCTGTTTCAATAAGTTGTAGCAAGGCACTAGAACTTGATTTGG
CTGAGGCTCTACTCGAACAAATTTCTGCTTGTCCTTACCCACACCCCTTCAATGCATTTCTTTCAGCATGTGACACGATGGACCTGCCAAATGAAAGCCTTTATGATGAT
ATATTTGTTTCTGTACCATATAGTTTTGTAGCACTTGTTTGCACTTTTGGGTCCAAATCAGATTGTGGGATCCTTATTGCAGGATCAGCCGAACGTGCCATGCGTATGTT
GGTTAAAATGAAACAATTGAAGGTGCTTCCAGATGTCAAGACATATGAGCTTTTGTATTCTTTATTTGGTAACGTGAATGCTCCGTATGAGGAGGGCAACAGATTGTCAC
AAGTGGATGCTGCTAAAAGGATACGCATTATAGAGACTGATATGGCAAAACATGGGATCCGACACAGTCATTTATCTATGATGAACTTGTTGAAAGCTCTAGGCGCAGAG
GGGATGACAAAGGAGCTGCTTCAGTATTTAAATGTTGCAGAGAACCTCTTCTATTACAATAACACTGATCTGGGAACGCCTATTTACAACACGGTGTTGCATTTTTTAGT
TGAATCCAAGGAAATTCAGTTGGCAATAGAATTATTCAATAATATGAAGCATTCTGGTTTCTTTCCAGATGCTGCGACATTTGAGATGATGATTGACTGTTGTAGTGTTA
TGGGATGCTTGAAATCTGCTTTTGCCCTTCTTTCCATAATGGTCCGCATGGGGTTTTGTCCACAGATATTAACTTATACAAGTCTAGTAAAGATTGTGTTGGGATTTGAG
AGATTTGATGATGCCTTGAATCTTTTGGATCAAGCCAGTTCAGAAGGGATTGAACTTGATGTAGTTATAATCAATACCGTCTTGCAGAAAGCTAGTGAAAAGGGAAGGAT
TGATGTGATTGAGTTTGTGGTTGAGAGGATGAATCGTGAAAAGATCGAACCCGACCCTTCAACATGCCATAGTGTCTTCTCTGCATATGTGAACCTTGGCTATCATAGCA
CTGCTGTGGAAGCACTGCAAGTACTGAGCATGCGGATGTTATGCAAAGAAGACGACCCCTCTGCAGACTTGACAGAATACGTAGAAAACTTTGCCCTTGCAGAGGACTCC
GCAGCTGATTCACGTATTTTGGAATTCTTCAAAGCTCTGAAGAGAACCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGGAATGCACCGCTGCTCCGCCAATTACGTCCCTCTCACTCCAATTACCTTCTTGGAGCGCTCCGCCACGGTCTACGACGAAAGAATTTCTCTCGTGTATGGACG
TGTCCGGTACACTTGGAGAGACACGCTCGAGCGATGTACCAGGCTTGCTTCTGCGCTTGCCCACATGGGAATCGCTCGTGGAGATGTGGTCTATATTCGGAACTCTCCCA
TTTTCATTCGTTTGATGTTTTTGATTTTTCTGTTTGAAAGTCTTCAGAGGCATTTTGTTAGATTAAGCTCTTTCCTAATTACATTCCTTAATCTAAATTATGCAACTGAA
TACGATTCACTCGAGCAGGTTGCTGCTTTGGCACCAAATATTCCAGCTATGTACGAGCTGCATTTTGCTGTGCCAATGGCTGGTGCAGTTCTTTGCACCCTCAACATACG
CCATGATGCAGGAATGGTTTCAACACTGCTAACCCATTCGGAAGCTAAAATCATTGCTGTAGACTACCAGTTTCTACATATTGTGAAGGGAGCTATCAAGGTTATGTCCA
AGAGGAGAGAAAAGCTGCCTCTTGTGGTCATTATTCAAGAGTCTGATCAGCCATCCTCCCACATCGATAGAACCTGCTCCGCTTCAGAAGATCTAGAGTTTGAGACCCTT
TTAGCCACTGGGGAACCAGATTTTGAGATTAGACGGCCTAAAGACGAATGGGATCCGATTTCTCTTAACTATACTTCAGGCACAACATCAAGGCCAAAAGGTGTTATTTA
CTCCCATAGAGGTGCATATCTCAATTCTCTGTCTGCAGTCCTTCTGAATGATATCTGCTCACTCCCTGTGTATCTGTGGACTGTTCCAATGTTTCACTGCAATGGATGGT
GTTTCACTTGGGGTGTGGCTGCACAGGGTGGCACAAACATCTGCCAGAGGAATGTGACTGCCAAAGAAATCTTTGATAATATTTCTCTGCATAAGGTTACTCATATGGGC
GGTGCACCAACCATTTTGAACATGATTATCAATACACCAGTTAGCGATCAAAAGCCACTTCCCGGGAAGGTAACTGTGATGACTGGTGGCGCTCCACCGCCTTCTCATGT
ACTCTATAAGATGAGAGCATTGGGATTCCTTATTGTCCATTCATACGGTTTAACCGAAACATATGGTCCAGCAACAGTTTGCACTTGGAAACCTGAATGGGATTCCCTTC
CTGAAGATAAACAGGCAAAACTAAACTCTCGCCAAGGATTGCAGCATGTTGGGATGGAGGAAGTAGACATAAAGAATCCGGTCACCATGGAGAGTGCTCCAGCTGATGGA
AAAACCATGGGTGAAGTTATGTTCAGAGGCAACACTGTGATGAATGGATATTTGAAAGATCTGAAAGCCACACAGGAGGCGTTTAAAGGCGGCTGGTTTCGGAGTGGGGA
CTTGGGGGTCAGGCACCTCGACGGTTATATAGAACTGAAGGACCGTTCGAAGGACATTATAATTTCTGGGGGAGAAAACATTAGCTCAATTGAGGTGGAGTCTGTGCTTT
TCAGTCATCCATCAGTTCTTGAAGCTGCTGTTGTGGGAAGACCTGATGATCACTGGGGAGAAACACCATGTGCATTTGTACAGCTGAAAGATGGGTGCAGTGCCAGCGAA
GAAGAGATCATAAAGTTCTGCAAAAAGCACCTACCCAATTACATGGCTCCTCGATGCGTCGTGTTTAAAGTTTTGCCAAAAACTTCAACGGGGAAGACTCAGAAATTTAT
TCTCAAGGAGGAGGCCAAGGCCATGGGTAGTCTTCCAAAGAGGATCAGCAAACTTCTTCGGCTGCAGATCGAGACTCGAACTCATGGCTTTCTTCATATTGCATCCGATA
TCTTCTGTTTACTAAGCTCTCGTCGCTGTCTTTTCGGGGAAGTCCTGCGAACCATGCACAGGCAAGTAAAACACTCTCCGGGACATGGTAGCAGTTACAAATGCCCTAAC
CTAGTTTCTCCTCTGCATTTTTCCTCAGGGCAAGGTCTCGTTTGGAATCAATCGCTGACTCGTTATACAGATTTAAGCCGCACGAACATGGGCGAAGAGAGGCTGCTAGC
AAATTGGTATTCATCGGACTCTTCTCATCTCCAAAGAATGGTTGGGAAGAATGAAGTTACATATTCGGAGCTTCTCAAGCTTGCAGTGTGTCAGCAAAACTTGTCTTCAG
TGCATGAGATCTGGATGGACTTTGTTAAAAACTACAGTCCAAGTGTTTTGTCTCTGAGAAAGTTTATATGGTCTTATACAAGGCTGGGAGACCTAAAATCTGCATATATT
GCATTGCAAAAGATGGTGGCTTTGGCCATTGGAGCCGCAGGAGGAAAGTTACCCTCTTTAAAATTGGACATTCCTATACCTTCAAGAACTGAATTCTACCATAACAATTT
TGATTTTGAGGAGAATGGACATTCTACTGATGAGTTACATTGTAAGAAATTGGTCACCTGCGATGGTGACATAGAGCAATTTTCTGTTCATGGTGTGAAATGTGGAGAAG
TTGAAAGTGGTCCATTAACTTTGCAGAACAATTACATAAGCTGTTCTGTTATGAAGGTTTTGAGATGGTCTTTCAATGATGTGATACATGCATGTGCACATGCTAGGAAC
TGTGGTCTTGCAGAGCAGCTAATGCAACAGATGCGTGATCTCGGATTGCAACCTTCATGCCACACATTTGATGGTTTTGTTAGATCAGTTGTTTCAGAGAGGGGTTTCAG
TGATGGCATGAAAATATTAAAAGTAATGCAACAGAGGAAATTGAAGCCATATGGTTCAACTCTTGGTGCTGTTTCAATAAGTTGTAGCAAGGCACTAGAACTTGATTTGG
CTGAGGCTCTACTCGAACAAATTTCTGCTTGTCCTTACCCACACCCCTTCAATGCATTTCTTTCAGCATGTGACACGATGGACCTGCCAAATGAAAGCCTTTATGATGAT
ATATTTGTTTCTGTACCATATAGTTTTGTAGCACTTGTTTGCACTTTTGGGTCCAAATCAGATTGTGGGATCCTTATTGCAGGATCAGCCGAACGTGCCATGCGTATGTT
GGTTAAAATGAAACAATTGAAGGTGCTTCCAGATGTCAAGACATATGAGCTTTTGTATTCTTTATTTGGTAACGTGAATGCTCCGTATGAGGAGGGCAACAGATTGTCAC
AAGTGGATGCTGCTAAAAGGATACGCATTATAGAGACTGATATGGCAAAACATGGGATCCGACACAGTCATTTATCTATGATGAACTTGTTGAAAGCTCTAGGCGCAGAG
GGGATGACAAAGGAGCTGCTTCAGTATTTAAATGTTGCAGAGAACCTCTTCTATTACAATAACACTGATCTGGGAACGCCTATTTACAACACGGTGTTGCATTTTTTAGT
TGAATCCAAGGAAATTCAGTTGGCAATAGAATTATTCAATAATATGAAGCATTCTGGTTTCTTTCCAGATGCTGCGACATTTGAGATGATGATTGACTGTTGTAGTGTTA
TGGGATGCTTGAAATCTGCTTTTGCCCTTCTTTCCATAATGGTCCGCATGGGGTTTTGTCCACAGATATTAACTTATACAAGTCTAGTAAAGATTGTGTTGGGATTTGAG
AGATTTGATGATGCCTTGAATCTTTTGGATCAAGCCAGTTCAGAAGGGATTGAACTTGATGTAGTTATAATCAATACCGTCTTGCAGAAAGCTAGTGAAAAGGGAAGGAT
TGATGTGATTGAGTTTGTGGTTGAGAGGATGAATCGTGAAAAGATCGAACCCGACCCTTCAACATGCCATAGTGTCTTCTCTGCATATGTGAACCTTGGCTATCATAGCA
CTGCTGTGGAAGCACTGCAAGTACTGAGCATGCGGATGTTATGCAAAGAAGACGACCCCTCTGCAGACTTGACAGAATACGTAGAAAACTTTGCCCTTGCAGAGGACTCC
GCAGCTGATTCACGTATTTTGGAATTCTTCAAAGCTCTGAAGAGAACCTGA
Protein sequenceShow/hide protein sequence
MDGMHRCSANYVPLTPITFLERSATVYDERISLVYGRVRYTWRDTLERCTRLASALAHMGIARGDVVYIRNSPIFIRLMFLIFLFESLQRHFVRLSSFLITFLNLNYATE
YDSLEQVAALAPNIPAMYELHFAVPMAGAVLCTLNIRHDAGMVSTLLTHSEAKIIAVDYQFLHIVKGAIKVMSKRREKLPLVVIIQESDQPSSHIDRTCSASEDLEFETL
LATGEPDFEIRRPKDEWDPISLNYTSGTTSRPKGVIYSHRGAYLNSLSAVLLNDICSLPVYLWTVPMFHCNGWCFTWGVAAQGGTNICQRNVTAKEIFDNISLHKVTHMG
GAPTILNMIINTPVSDQKPLPGKVTVMTGGAPPPSHVLYKMRALGFLIVHSYGLTETYGPATVCTWKPEWDSLPEDKQAKLNSRQGLQHVGMEEVDIKNPVTMESAPADG
KTMGEVMFRGNTVMNGYLKDLKATQEAFKGGWFRSGDLGVRHLDGYIELKDRSKDIIISGGENISSIEVESVLFSHPSVLEAAVVGRPDDHWGETPCAFVQLKDGCSASE
EEIIKFCKKHLPNYMAPRCVVFKVLPKTSTGKTQKFILKEEAKAMGSLPKRISKLLRLQIETRTHGFLHIASDIFCLLSSRRCLFGEVLRTMHRQVKHSPGHGSSYKCPN
LVSPLHFSSGQGLVWNQSLTRYTDLSRTNMGEERLLANWYSSDSSHLQRMVGKNEVTYSELLKLAVCQQNLSSVHEIWMDFVKNYSPSVLSLRKFIWSYTRLGDLKSAYI
ALQKMVALAIGAAGGKLPSLKLDIPIPSRTEFYHNNFDFEENGHSTDELHCKKLVTCDGDIEQFSVHGVKCGEVESGPLTLQNNYISCSVMKVLRWSFNDVIHACAHARN
CGLAEQLMQQMRDLGLQPSCHTFDGFVRSVVSERGFSDGMKILKVMQQRKLKPYGSTLGAVSISCSKALELDLAEALLEQISACPYPHPFNAFLSACDTMDLPNESLYDD
IFVSVPYSFVALVCTFGSKSDCGILIAGSAERAMRMLVKMKQLKVLPDVKTYELLYSLFGNVNAPYEEGNRLSQVDAAKRIRIIETDMAKHGIRHSHLSMMNLLKALGAE
GMTKELLQYLNVAENLFYYNNTDLGTPIYNTVLHFLVESKEIQLAIELFNNMKHSGFFPDAATFEMMIDCCSVMGCLKSAFALLSIMVRMGFCPQILTYTSLVKIVLGFE
RFDDALNLLDQASSEGIELDVVIINTVLQKASEKGRIDVIEFVVERMNREKIEPDPSTCHSVFSAYVNLGYHSTAVEALQVLSMRMLCKEDDPSADLTEYVENFALAEDS
AADSRILEFFKALKRT