; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg010404 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg010404
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Description4-coumarate--CoA ligase
Genome locationscaffold5:13474378..13476834
RNA-Seq ExpressionSpg010404
SyntenySpg010404
Gene Ontology termsGO:0009698 - phenylpropanoid metabolic process (biological process)
GO:0010584 - pollen exine formation (biological process)
GO:0016207 - 4-coumarate-CoA ligase activity (molecular function)
GO:0106290 - trans-cinnamate-CoA ligase activity (molecular function)
InterPro domainsIPR000873 - AMP-dependent synthetase/ligase
IPR042099 - ANL, N-terminal domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004151908.1 4-coumarate--CoA ligase 2 [Cucumis sativus]5.8e-12371.14Show/hide
Query:  MIFVALPCDGRQPELSPKISSPPP--PTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVI
        MI +A  CD +QP  +   SSPPP  P THVFRSKLPDI IP+HL LH+Y FQKLS+ S+RPCLIVGSTGKSYS+ ETHL SRKAAATFSKLG+K+GDVI
Subjt:  MIFVALPCDGRQPELSPKISSPPP--PTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVI

Query:  MILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRES-GEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQN
        MILL NS EF                             LK+S +K VVTYS CVD+LRES G+ LTIVT+D PPENCLSFSM YDADEND+P+VEID N
Subjt:  MILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRES-GEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQN

Query:  DAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVV
        DAV+LPFSS TTGLPKGVILTH++MVSSVAQQVDGENPN+YL++NDV LCVLPMFHIF+LSSIVLIS+ S A LLL+EKFEIE+LLRL+E H+VTVATVV
Subjt:  DAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVV

Query:  PPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR
        PPLVV+L KNPKVA+ DLSSIR+V SGAAPLRKELEEALM R+PQAIFG+
Subjt:  PPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR

XP_008462779.1 PREDICTED: 4-coumarate--CoA ligase 2 [Cucumis melo]4.3e-12672.08Show/hide
Query:  MIFVALPCDGRQPE--LSPKISSPPP--PTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGD
        MI +A  CD + P    S   SSPPP  P THVFRSKLPDI IP+HL LHSYCFQKLS+ S+RPCLIVGSTGKSYS+ ETHLFSRKAAATFSKLG+++GD
Subjt:  MIFVALPCDGRQPE--LSPKISSPPP--PTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGD

Query:  VIMILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQ
        VIMILL NS EF                             LK+S +K VVTYS CVD+LRE GE LTIVTVDDPPENCLSFSM YDA+END+P VEID 
Subjt:  VIMILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQ

Query:  NDAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATV
        NDAVSLPFSS TTGLPKGVILTH++MVSSVAQQVDGENPN+YL++ND+ LCVLPMFHIF+LSSIVLISI S A LLLMEKFEIE+LLRL+E H+VTVATV
Subjt:  NDAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATV

Query:  VPPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR
        VPPLVV+L KNPKVA+ +LSSIR+V SGAAPLRKELEEALM R+PQAIFG+
Subjt:  VPPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR

XP_022947959.1 4-coumarate--CoA ligase 3 [Cucurbita moschata]4.9e-12270.32Show/hide
Query:  MIFVALPCDGRQPELSPKISSPPPPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMI
        MI VA   DG +P+LS   SSPPP    VFRSKLPDI IP+HL LH YCF+K+SEFS+RPCLIVG+TGKSYSF +THLFS++AAATFSKLG+KKGD IMI
Subjt:  MIFVALPCDGRQPELSPKISSPPPPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMI

Query:  LLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQNDAV
        LL+NSAEF                             LK S +KLVVTYSHCVD+LRES  DLTIVTVDDPPENCLSFSM YDADEND+P VEID NDAV
Subjt:  LLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQNDAV

Query:  SLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPL
        SLPFSS TTG PKGV+LTH+SMVSS+AQQVDGENPN+YL  NDV LCVLPMFHIF+LSSIVLISI S AT+LL+EKFEIET + LIE H VTVATVVPP+
Subjt:  SLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPL

Query:  VVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR
        V+ + KNPKVA+ +LSSIRMV SGAAPL K++EEALM RIPQA+ G+
Subjt:  VVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR

XP_038901546.1 4-coumarate--CoA ligase 3 isoform X1 [Benincasa hispida]3.7e-13073.43Show/hide
Query:  MIFVALPCDGRQPELSPKISSPPPPT---THVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDV
        MI VA  C+ +QP +S ++SS PPP    TH+FRSKLPDI IP+HL LHSYCFQKLSE  + PCLIVGSTGKSYS+ ETHLFSRKAAATFSKLG+KKGDV
Subjt:  MIFVALPCDGRQPELSPKISSPPPPT---THVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDV

Query:  IMILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQN
        IMILLQNS EF                             L +S +K VVTYS CV +LRESGEDLTIVTVDDPPENCLSFSM YDADEND+P VEID N
Subjt:  IMILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQN

Query:  DAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVV
        DAVSLPFSS TTGLPKGVILTH++MVSSVAQQVDGENPN+YLR+ND+ LCVLPMFHIF+LSSIVL+SI S A LLLMEKFEIE+LLRLIE H VTVATVV
Subjt:  DAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVV

Query:  PPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR
        PPLVVAL KNP+ A+ DLSSIRMV SGAAPLRKELEEALM R+PQAIFG+
Subjt:  PPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR

XP_038901547.1 4-coumarate--CoA ligase 3 isoform X2 [Benincasa hispida]3.7e-13073.43Show/hide
Query:  MIFVALPCDGRQPELSPKISSPPPPT---THVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDV
        MI VA  C+ +QP +S ++SS PPP    TH+FRSKLPDI IP+HL LHSYCFQKLSE  + PCLIVGSTGKSYS+ ETHLFSRKAAATFSKLG+KKGDV
Subjt:  MIFVALPCDGRQPELSPKISSPPPPT---THVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDV

Query:  IMILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQN
        IMILLQNS EF                             L +S +K VVTYS CV +LRESGEDLTIVTVDDPPENCLSFSM YDADEND+P VEID N
Subjt:  IMILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQN

Query:  DAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVV
        DAVSLPFSS TTGLPKGVILTH++MVSSVAQQVDGENPN+YLR+ND+ LCVLPMFHIF+LSSIVL+SI S A LLLMEKFEIE+LLRLIE H VTVATVV
Subjt:  DAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVV

Query:  PPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR
        PPLVVAL KNP+ A+ DLSSIRMV SGAAPLRKELEEALM R+PQAIFG+
Subjt:  PPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR

TrEMBL top hitse value%identityAlignment
A0A0A0LQV1 AMP-binding domain-containing protein2.8e-12371.14Show/hide
Query:  MIFVALPCDGRQPELSPKISSPPP--PTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVI
        MI +A  CD +QP  +   SSPPP  P THVFRSKLPDI IP+HL LH+Y FQKLS+ S+RPCLIVGSTGKSYS+ ETHL SRKAAATFSKLG+K+GDVI
Subjt:  MIFVALPCDGRQPELSPKISSPPP--PTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVI

Query:  MILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRES-GEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQN
        MILL NS EF                             LK+S +K VVTYS CVD+LRES G+ LTIVT+D PPENCLSFSM YDADEND+P+VEID N
Subjt:  MILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRES-GEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQN

Query:  DAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVV
        DAV+LPFSS TTGLPKGVILTH++MVSSVAQQVDGENPN+YL++NDV LCVLPMFHIF+LSSIVLIS+ S A LLL+EKFEIE+LLRL+E H+VTVATVV
Subjt:  DAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVV

Query:  PPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR
        PPLVV+L KNPKVA+ DLSSIR+V SGAAPLRKELEEALM R+PQAIFG+
Subjt:  PPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR

A0A1S3CJB3 4-coumarate--CoA ligase 22.1e-12672.08Show/hide
Query:  MIFVALPCDGRQPE--LSPKISSPPP--PTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGD
        MI +A  CD + P    S   SSPPP  P THVFRSKLPDI IP+HL LHSYCFQKLS+ S+RPCLIVGSTGKSYS+ ETHLFSRKAAATFSKLG+++GD
Subjt:  MIFVALPCDGRQPE--LSPKISSPPP--PTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGD

Query:  VIMILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQ
        VIMILL NS EF                             LK+S +K VVTYS CVD+LRE GE LTIVTVDDPPENCLSFSM YDA+END+P VEID 
Subjt:  VIMILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQ

Query:  NDAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATV
        NDAVSLPFSS TTGLPKGVILTH++MVSSVAQQVDGENPN+YL++ND+ LCVLPMFHIF+LSSIVLISI S A LLLMEKFEIE+LLRL+E H+VTVATV
Subjt:  NDAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATV

Query:  VPPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR
        VPPLVV+L KNPKVA+ +LSSIR+V SGAAPLRKELEEALM R+PQAIFG+
Subjt:  VPPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR

A0A2H4Z8L3 4-coumarate coenzyme A ligase6.7e-10155.16Show/hide
Query:  MIFVALPCDGRQPELSPKISSPPPPTT---------HVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLG
        MI +A P + ++PE+SP +S  PPP+T         HVFRSKLPDI I NHL LH+YC++KLS F ++PCLI GS+GK+Y+F ETHL ++K AA  S LG
Subjt:  MIFVALPCDGRQPELSPKISSPPPPTT---------HVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLG

Query:  IKKGDVIMILLQNSAEFL-----------------------------KSSSSKLVVTYSHCVDELRES------------GEDLTIVTVDDPPENCLSFS
        IKKGDVIMILLQN AEF+                             K++ +KL++T S  VD+L+++            GED  ++T+DDPPENCL F+
Subjt:  IKKGDVIMILLQNSAEFL-----------------------------KSSSSKLVVTYSHCVDELRES------------GEDLTIVTVDDPPENCLSFS

Query:  MDYDADENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEI
        +  +A+EN++P V ID +D V+LPFSS TTGLPKGV+LTHRS+++SVAQQVDGENPN+YL+ +DV LCVLPMFHI++L+S++L S+ + A +LLM+KFEI
Subjt:  MDYDADENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEI

Query:  ETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR
         TLL LI+ H+V+VA VVPPLV+AL KNP VA  DLSSIR+V SGAAPL KELEEAL  R+PQA+ G+
Subjt:  ETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR

A0A5A7THZ8 4-coumarate--CoA ligase 22.1e-12672.08Show/hide
Query:  MIFVALPCDGRQPE--LSPKISSPPP--PTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGD
        MI +A  CD + P    S   SSPPP  P THVFRSKLPDI IP+HL LHSYCFQKLS+ S+RPCLIVGSTGKSYS+ ETHLFSRKAAATFSKLG+++GD
Subjt:  MIFVALPCDGRQPE--LSPKISSPPP--PTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGD

Query:  VIMILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQ
        VIMILL NS EF                             LK+S +K VVTYS CVD+LRE GE LTIVTVDDPPENCLSFSM YDA+END+P VEID 
Subjt:  VIMILLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQ

Query:  NDAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATV
        NDAVSLPFSS TTGLPKGVILTH++MVSSVAQQVDGENPN+YL++ND+ LCVLPMFHIF+LSSIVLISI S A LLLMEKFEIE+LLRL+E H+VTVATV
Subjt:  NDAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATV

Query:  VPPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR
        VPPLVV+L KNPKVA+ +LSSIR+V SGAAPLRKELEEALM R+PQAIFG+
Subjt:  VPPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR

A0A6J1G8F9 4-coumarate--CoA ligase 32.4e-12270.32Show/hide
Query:  MIFVALPCDGRQPELSPKISSPPPPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMI
        MI VA   DG +P+LS   SSPPP    VFRSKLPDI IP+HL LH YCF+K+SEFS+RPCLIVG+TGKSYSF +THLFS++AAATFSKLG+KKGD IMI
Subjt:  MIFVALPCDGRQPELSPKISSPPPPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMI

Query:  LLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQNDAV
        LL+NSAEF                             LK S +KLVVTYSHCVD+LRES  DLTIVTVDDPPENCLSFSM YDADEND+P VEID NDAV
Subjt:  LLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQNDAV

Query:  SLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPL
        SLPFSS TTG PKGV+LTH+SMVSS+AQQVDGENPN+YL  NDV LCVLPMFHIF+LSSIVLISI S AT+LL+EKFEIET + LIE H VTVATVVPP+
Subjt:  SLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPL

Query:  VVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR
        V+ + KNPKVA+ +LSSIRMV SGAAPL K++EEALM RIPQA+ G+
Subjt:  VVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR

SwissProt top hitse value%identityAlignment
M4ISH0 4-coumarate--CoA ligase CCL11.0e-7749.38Show/hide
Query:  VFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEFL-------------------
        +FRSKLPDI IPNHL LHSYCF+ +S+F +RPCLI G+TG+  ++ +  L SRK AA   KLGIK+GDVIM+LLQNS EF+                   
Subjt:  VFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEFL-------------------

Query:  ----------KSSSSKLVVTYSHCVDELRE--SGED-LTIVTVDDPP--ENCLSFSMDYDADENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILTHRSMV
                   +S +KLV+T +  +D+++E   GE  + ++ VD PP    CL FS    ADE ++P V+I  +D V+LP+SS TTGLPKGV+LTH+ +V
Subjt:  ----------KSSSSKLVVTYSHCVDELRE--SGED-LTIVTVDDPP--ENCLSFSMDYDADENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILTHRSMV

Query:  SSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRMVFS
        +SVAQQVDG+NPN+Y  +NDV LCVLP+FHI++L+SI+L  +   A +L+M+KFEI  LL LIE  +VT+A  VPP+V+++ K P +   DLSSIR V S
Subjt:  SSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRMVFS

Query:  GAAPLRKELEEALMLRIPQAIFGR
        G AP+ KELE+A+  ++P A  G+
Subjt:  GAAPLRKELEEALMLRIPQAIFGR

P31687 4-coumarate--CoA ligase 21.5e-9756.36Show/hide
Query:  LSPKISSP--------PPPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSA
        L+P + +P         P T+HVF+SKLPDI I NHL LHSYCFQ LS+F++RPCLIVG   K++++ +THL S K AA  S LGI KGDV+MILLQNSA
Subjt:  LSPKISSP--------PPPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSA

Query:  EFLKS-----------------------------SSSKLVVTYSHCVDELR-----ESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQNDAVS
        +F+ S                             S +KL++T +  VD+LR     + GED  +VTVDDPPENCL FS+  +A+E+D+P VEI  +DAV+
Subjt:  EFLKS-----------------------------SSSKLVVTYSHCVDELR-----ESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQNDAVS

Query:  LPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLV
        +PFSS TTGLPKGVILTH+S+ +SVAQQVDGENPN+YL   DV LCVLP+FHIF+L+S++L ++ + + +LLM+KFEI TLL LI+ HRV+VA VVPPLV
Subjt:  LPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLV

Query:  VALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR
        +AL KNP VA+ DLSSIR+V SGAAPL KELEEAL  R+PQA+ G+
Subjt:  VALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR

P41636 4-coumarate--CoA ligase2.6e-7847.2Show/hide
Query:  HVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEF-------------------
        H++RSKLPDI I +HL LHSYCF++++EF++RPCLI G+T ++Y F E  L SRK AA  +KLG+++G V+M+LL N  EF                   
Subjt:  HVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEF-------------------

Query:  ----------LKSSSSKLVVTYSHCVDELRE-SGEDLTIVTVDD-PPENCLSFSMDYDADENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILTHRSMVSS
                   K++ ++++VT +  V++L +    D+ ++T+DD P E C   S+  +ADE   P V+I  +D V+LP+SS TTGLPKGV+LTH+ +VSS
Subjt:  ----------LKSSSSKLVVTYSHCVDELRE-SGEDLTIVTVDD-PPENCLSFSMDYDADENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILTHRSMVSS

Query:  VAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRMVFSGA
        VAQQVDGENPN+Y   +DV LCVLP+FHI++L+S++L ++ + A  L+M+KF + T L LI+ ++VTVA +VPP+V+ + K+P V+  D+SS+R++ SGA
Subjt:  VAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRMVFSGA

Query:  APLRKELEEALMLRIPQAIFGR
        APL KELE+AL  R P+AIFG+
Subjt:  APLRKELEEALMLRIPQAIFGR

Q42982 4-coumarate--CoA ligase 21.6e-8349.86Show/hide
Query:  MIFVALPCDGRQPELSPKISSPPPPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMI
        MI VA P    QP+++  +   PP    VFRSKLPDI IP+HL LH YCF + +E  + PCLI  +TG++Y+F ET L  R+AAA   +LG+  GD +M+
Subjt:  MIFVALPCDGRQPELSPKISSPPPPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMI

Query:  LLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRES-----------GED-LTIVTVDD---PPENCLSF-SMDYDA
        LLQN  EF                              K+S  KL++T S  VD+LR+            G+D LT++T+DD    PE CL F  +  DA
Subjt:  LLQNSAEF-----------------------------LKSSSSKLVVTYSHCVDELRES-----------GED-LTIVTVDD---PPENCLSF-SMDYDA

Query:  DENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLR
        DE  +P V I  +D V+LPFSS TTGLPKGV+LTHRS+VS VAQQVDGENPN+++   DV LCVLP+FHIF+L+S++L ++ + A + LM +FE+  +L 
Subjt:  DENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLR

Query:  LIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR
         IE  RVTVA VVPPLV+AL KNP V   DLSSIR+V SGAAPL KELE+AL  R+PQAIFG+
Subjt:  LIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGR

Q9S777 4-coumarate--CoA ligase 35.1e-9858.41Show/hide
Query:  PPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEF---------------
        PPT  +FRSKLPDI IPNHL LH+YCF+KLS  S++PCLIVGSTGKSY++ ETHL  R+ A+   KLGI+KGDVIMILLQNSAEF               
Subjt:  PPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEF---------------

Query:  --------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDP-PENCLSFSMDYDADEND--MPTVEIDQNDAVSLPFSSDTTGLPKGVILTHR
                      LKSS +KL++T+S  VD+L+  GE+LT++T D+P PENCL FS     DE +    TV+I  +DA +LPFSS TTGLPKGV+LTH+
Subjt:  --------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDP-PENCLSFSMDYDADEND--MPTVEIDQNDAVSLPFSSDTTGLPKGVILTHR

Query:  SMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRM
        S+++SVAQQVDG+NPN+YL+ NDV LCVLP+FHI++L+S++L S+ S AT+LLM KFEI  LL LI+ HRVT+A +VPPLV+AL KNP V + DLSS+R 
Subjt:  SMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRM

Query:  VFSGAAPLRKELEEALMLRIPQAIFGR
        V SGAAPL KEL+++L  R+PQAI G+
Subjt:  VFSGAAPLRKELEEALMLRIPQAIFGR

Arabidopsis top hitse value%identityAlignment
AT1G51680.1 4-coumarate:CoA ligase 13.1e-7446.81Show/hide
Query:  VFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEFL-------------------
        +FRSKLPDI IPNHL LH Y FQ +SEF+ +PCLI G TG  Y++ + H+ SR+ AA F KLG+ + DV+M+LL N  EF+                   
Subjt:  VFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEFL-------------------

Query:  ----------KSSSSKLVVTYSHCVDELR--ESGEDLTIVTVDDP-----PENCLSF---SMDYDADENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILT
                  K+S++KL++T +  VD+++  ++ + + IV +DD      PE CL F   +         + +VEI  +D V+LP+SS TTGLPKGV+LT
Subjt:  ----------KSSSSKLVVTYSHCVDELR--ESGEDLTIVTVDDP-----PENCLSF---SMDYDADENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILT

Query:  HRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSI
        H+ +V+SVAQQVDGENPN+Y   +DV LCVLPMFHI+AL+SI+L  +   A +L+M KFEI  LL LI+  +VTVA +VPP+V+A+ K+ +    DLSSI
Subjt:  HRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSI

Query:  RMVFSGAAPLRKELEEALMLRIPQAIFGR
        R+V SGAAPL KELE+A+  + P A  G+
Subjt:  RMVFSGAAPLRKELEEALMLRIPQAIFGR

AT1G51680.2 4-coumarate:CoA ligase 13.1e-7446.81Show/hide
Query:  VFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEFL-------------------
        +FRSKLPDI IPNHL LH Y FQ +SEF+ +PCLI G TG  Y++ + H+ SR+ AA F KLG+ + DV+M+LL N  EF+                   
Subjt:  VFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEFL-------------------

Query:  ----------KSSSSKLVVTYSHCVDELR--ESGEDLTIVTVDDP-----PENCLSF---SMDYDADENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILT
                  K+S++KL++T +  VD+++  ++ + + IV +DD      PE CL F   +         + +VEI  +D V+LP+SS TTGLPKGV+LT
Subjt:  ----------KSSSSKLVVTYSHCVDELR--ESGEDLTIVTVDDP-----PENCLSF---SMDYDADENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILT

Query:  HRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSI
        H+ +V+SVAQQVDGENPN+Y   +DV LCVLPMFHI+AL+SI+L  +   A +L+M KFEI  LL LI+  +VTVA +VPP+V+A+ K+ +    DLSSI
Subjt:  HRSMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSI

Query:  RMVFSGAAPLRKELEEALMLRIPQAIFGR
        R+V SGAAPL KELE+A+  + P A  G+
Subjt:  RMVFSGAAPLRKELEEALMLRIPQAIFGR

AT1G65060.1 4-coumarate:CoA ligase 33.6e-9958.41Show/hide
Query:  PPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEF---------------
        PPT  +FRSKLPDI IPNHL LH+YCF+KLS  S++PCLIVGSTGKSY++ ETHL  R+ A+   KLGI+KGDVIMILLQNSAEF               
Subjt:  PPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEF---------------

Query:  --------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDP-PENCLSFSMDYDADEND--MPTVEIDQNDAVSLPFSSDTTGLPKGVILTHR
                      LKSS +KL++T+S  VD+L+  GE+LT++T D+P PENCL FS     DE +    TV+I  +DA +LPFSS TTGLPKGV+LTH+
Subjt:  --------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDP-PENCLSFSMDYDADEND--MPTVEIDQNDAVSLPFSSDTTGLPKGVILTHR

Query:  SMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRM
        S+++SVAQQVDG+NPN+YL+ NDV LCVLP+FHI++L+S++L S+ S AT+LLM KFEI  LL LI+ HRVT+A +VPPLV+AL KNP V + DLSS+R 
Subjt:  SMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRM

Query:  VFSGAAPLRKELEEALMLRIPQAIFGR
        V SGAAPL KEL+++L  R+PQAI G+
Subjt:  VFSGAAPLRKELEEALMLRIPQAIFGR

AT1G65060.2 4-coumarate:CoA ligase 33.6e-9958.41Show/hide
Query:  PPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEF---------------
        PPT  +FRSKLPDI IPNHL LH+YCF+KLS  S++PCLIVGSTGKSY++ ETHL  R+ A+   KLGI+KGDVIMILLQNSAEF               
Subjt:  PPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEF---------------

Query:  --------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDP-PENCLSFSMDYDADEND--MPTVEIDQNDAVSLPFSSDTTGLPKGVILTHR
                      LKSS +KL++T+S  VD+L+  GE+LT++T D+P PENCL FS     DE +    TV+I  +DA +LPFSS TTGLPKGV+LTH+
Subjt:  --------------LKSSSSKLVVTYSHCVDELRESGEDLTIVTVDDP-PENCLSFSMDYDADEND--MPTVEIDQNDAVSLPFSSDTTGLPKGVILTHR

Query:  SMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRM
        S+++SVAQQVDG+NPN+YL+ NDV LCVLP+FHI++L+S++L S+ S AT+LLM KFEI  LL LI+ HRVT+A +VPPLV+AL KNP V + DLSS+R 
Subjt:  SMVSSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRM

Query:  VFSGAAPLRKELEEALMLRIPQAIFGR
        V SGAAPL KEL+++L  R+PQAI G+
Subjt:  VFSGAAPLRKELEEALMLRIPQAIFGR

AT3G21240.1 4-coumarate:CoA ligase 28.7e-7749.69Show/hide
Query:  VFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEFL-------------------
        +FRS+LPDI IPNHL LH Y F+ +SEF+ +PCLI G TG+ Y++ + H+ SRK AA    LG+K+ DV+MILL NS E +                   
Subjt:  VFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEFL-------------------

Query:  ----------KSSSSKLVVTYSHCVDELRESGED-LTIVTVDDP--PENCLSFSMDYDADENDMPTV--EIDQNDAVSLPFSSDTTGLPKGVILTHRSMV
                  K+S++KL+VT S  VD+++    D + IVT D    PENCL FS    ++E  + ++  +I   D V+LPFSS TTGLPKGV+LTH+ +V
Subjt:  ----------KSSSSKLVVTYSHCVDELRESGED-LTIVTVDDP--PENCLSFSMDYDADENDMPTV--EIDQNDAVSLPFSSDTTGLPKGVILTHRSMV

Query:  SSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRMVFS
        +SVAQQVDGENPN+Y  ++DV LCVLPMFHI+AL+SI+L S+   AT+L+M KFEI  LL  I+  +VTVA VVPP+V+A+ K+P+    DLSS+RMV S
Subjt:  SSVAQQVDGENPNVYLRKNDVFLCVLPMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRMVFS

Query:  GAAPLRKELEEALMLRIPQAIFGR
        GAAPL KELE+A+  + P A  G+
Subjt:  GAAPLRKELEEALMLRIPQAIFGR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATTTTCGTTGCCCTACCGTGCGATGGCCGGCAGCCCGAACTCTCCCCCAAAATCTCCTCTCCACCGCCGCCGACGACCCATGTTTTCCGATCGAAGTTACCGGATAT
TGCAATCCCCAATCATCTCCTTCTTCATAGCTACTGTTTTCAGAAGCTCTCTGAATTCTCCAACCGCCCTTGTTTGATCGTCGGCTCCACTGGAAAATCCTATTCCTTCT
TCGAAACCCATCTGTTCTCGCGGAAGGCCGCCGCGACTTTCTCTAAACTTGGAATTAAGAAGGGCGATGTCATTATGATTCTCCTCCAGAACTCTGCGGAATTCCTGAAG
TCCTCCAGTTCCAAATTAGTCGTTACTTACTCTCATTGCGTCGACGAGCTTCGAGAATCCGGCGAGGATCTCACCATCGTCACTGTCGATGACCCGCCGGAGAACTGTCT
GAGCTTTTCAATGGATTATGACGCCGACGAAAACGACATGCCTACGGTGGAGATTGACCAAAACGACGCTGTTTCGCTGCCGTTCTCCTCCGACACGACCGGGCTCCCCA
AGGGGGTGATTTTGACCCATAGGAGTATGGTATCGAGTGTGGCTCAACAGGTAGATGGAGAGAATCCGAACGTGTATTTGAGAAAGAACGACGTATTTTTATGCGTGCTA
CCGATGTTCCACATATTTGCCTTGAGTAGCATTGTTTTGATTTCGATTTGGTCATGGGCGACACTACTATTGATGGAGAAGTTCGAAATAGAAACATTGTTACGGCTGAT
AGAGACGCATCGGGTGACGGTGGCGACGGTGGTGCCACCACTGGTGGTGGCGCTGGGGAAGAACCCTAAGGTAGCGAATTGCGACTTGAGCTCGATCAGAATGGTGTTTT
CCGGGGCGGCGCCGCTCCGAAAGGAGCTAGAGGAGGCCCTCATGCTGAGGATACCTCAAGCAATTTTTGGTCGGGTAAATATTGTAGGGCGAGTTTCAATGCTAGAGTCA
CATGGAATTGTCTCATTGATAGAATTGATTAATGCCAAAGAATCAGATAAAATCGTGACTCTAACCACATCCAATGACTTTGCCAAACGTAGACCTTCCAGGACCGCCAC
CGCTTCCACCCCCAAAGGGGACGTACAAACATCTGAAATTGTCGTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGATTTTCGTTGCCCTACCGTGCGATGGCCGGCAGCCCGAACTCTCCCCCAAAATCTCCTCTCCACCGCCGCCGACGACCCATGTTTTCCGATCGAAGTTACCGGATAT
TGCAATCCCCAATCATCTCCTTCTTCATAGCTACTGTTTTCAGAAGCTCTCTGAATTCTCCAACCGCCCTTGTTTGATCGTCGGCTCCACTGGAAAATCCTATTCCTTCT
TCGAAACCCATCTGTTCTCGCGGAAGGCCGCCGCGACTTTCTCTAAACTTGGAATTAAGAAGGGCGATGTCATTATGATTCTCCTCCAGAACTCTGCGGAATTCCTGAAG
TCCTCCAGTTCCAAATTAGTCGTTACTTACTCTCATTGCGTCGACGAGCTTCGAGAATCCGGCGAGGATCTCACCATCGTCACTGTCGATGACCCGCCGGAGAACTGTCT
GAGCTTTTCAATGGATTATGACGCCGACGAAAACGACATGCCTACGGTGGAGATTGACCAAAACGACGCTGTTTCGCTGCCGTTCTCCTCCGACACGACCGGGCTCCCCA
AGGGGGTGATTTTGACCCATAGGAGTATGGTATCGAGTGTGGCTCAACAGGTAGATGGAGAGAATCCGAACGTGTATTTGAGAAAGAACGACGTATTTTTATGCGTGCTA
CCGATGTTCCACATATTTGCCTTGAGTAGCATTGTTTTGATTTCGATTTGGTCATGGGCGACACTACTATTGATGGAGAAGTTCGAAATAGAAACATTGTTACGGCTGAT
AGAGACGCATCGGGTGACGGTGGCGACGGTGGTGCCACCACTGGTGGTGGCGCTGGGGAAGAACCCTAAGGTAGCGAATTGCGACTTGAGCTCGATCAGAATGGTGTTTT
CCGGGGCGGCGCCGCTCCGAAAGGAGCTAGAGGAGGCCCTCATGCTGAGGATACCTCAAGCAATTTTTGGTCGGGTAAATATTGTAGGGCGAGTTTCAATGCTAGAGTCA
CATGGAATTGTCTCATTGATAGAATTGATTAATGCCAAAGAATCAGATAAAATCGTGACTCTAACCACATCCAATGACTTTGCCAAACGTAGACCTTCCAGGACCGCCAC
CGCTTCCACCCCCAAAGGGGACGTACAAACATCTGAAATTGTCGTCTGA
Protein sequenceShow/hide protein sequence
MIFVALPCDGRQPELSPKISSPPPPTTHVFRSKLPDIAIPNHLLLHSYCFQKLSEFSNRPCLIVGSTGKSYSFFETHLFSRKAAATFSKLGIKKGDVIMILLQNSAEFLK
SSSSKLVVTYSHCVDELRESGEDLTIVTVDDPPENCLSFSMDYDADENDMPTVEIDQNDAVSLPFSSDTTGLPKGVILTHRSMVSSVAQQVDGENPNVYLRKNDVFLCVL
PMFHIFALSSIVLISIWSWATLLLMEKFEIETLLRLIETHRVTVATVVPPLVVALGKNPKVANCDLSSIRMVFSGAAPLRKELEEALMLRIPQAIFGRVNIVGRVSMLES
HGIVSLIELINAKESDKIVTLTTSNDFAKRRPSRTATASTPKGDVQTSEIVV