; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi02G012560 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi02G012560
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSAGA-associated factor 11
Genome locationchr02:16688484..16697931
RNA-Seq ExpressionLsi02G012560
SyntenyLsi02G012560
Gene Ontology termsGO:0006325 - chromatin organization (biological process)
GO:0009737 - response to abscisic acid (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0016487 - farnesol metabolic process (biological process)
GO:0048440 - carpel development (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
GO:0070461 - SAGA-type complex (cellular component)
GO:0016301 - kinase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0052668 - farnesol kinase activity (molecular function)
GO:0052670 - geraniol kinase activity (molecular function)
GO:0052671 - geranylgeraniol kinase activity (molecular function)
InterPro domainsIPR013246 - SAGA complex, Sgf11 subunit
IPR039606 - Phytol/farnesol kinase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060025.1 farnesol kinase [Cucumis melo var. makuwa]5.6e-21493.3Show/hide
Query:  MLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMS
        ML P+NPVVSDICATALS  VALSLL+LW ETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGY+GAI ASLIPG N+IRMLVLGFGILKDEAT+KSMS
Subjt:  MLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMS

Query:  RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRM
        RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDG ADIVGRRFGS+KIFYN NKSL GSVAMA+AGFLAS+GYMYYFSSFGYV  S  M
Subjt:  RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRM

Query:  VLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEEL
         L FL+VSLASALVESLPISTE+DDNLTVPLTSLLAYIET  ASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEEL
Subjt:  VLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEEL

Query:  RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTS
        RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTS
Subjt:  RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTS

Query:  TNRLPNGTSSLAGEEYSN
        TNRLPNGTSSLAGEEYSN
Subjt:  TNRLPNGTSSLAGEEYSN

KAG6583348.1 hypothetical protein SDJN03_19280, partial [Cucurbita argyrosperma subsp. sororia]4.0e-21284.2Show/hide
Query:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE
        MA  LQF F S  GSFGP FPFSSP   SRF PVSVSFNS   PIFRS  F LRF +KIRRE C VAAVMLLP NPVVSDICATA+SG VALSLLRLW E
Subjt:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE

Query:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP
        TAKRGLDQKLNRKLVH SIGLAFMLCWPMFSSG+RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSR GD RELLKGPLYYVATITLVCIFYWRTSP
Subjt:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL
        ISIALICNLCAGDGLADIVGRRFGS+KI YN+NKSL GSVAM SAGFLASVGYMYYFSSFGYV GSNRMVLGFLVVS+ASALVESLPISTEIDDNLTVPL
Subjt:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL

Query:  TSLL------------------------------AYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELR
        TSLL                              AYIETRPASRSMSMP+ED+ASS TQLS NLFGDLLDSVI D+ASECHRIARLGLDRNLEEEEEELR
Subjt:  TSLL------------------------------AYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELR

Query:  LSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSR
        LSAQAR RVADS NSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSR
Subjt:  LSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSR

KAG6607329.1 hypothetical protein SDJN03_00671, partial [Cucurbita argyrosperma subsp. sororia]7.8e-20079.71Show/hide
Query:  FPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLS----KIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLV
        +P  SP+F SRF  +SVS NS S P  RS +F  RF S    KIRR+  PVAA MLLP NPVVSDICA+ LSG VA SLLRLWAETAKRGLDQKLNRKLV
Subjt:  FPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLS----KIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLV

Query:  HISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGL
        HISIGLAFMLCWPMFSSG RGAILASL+PGVNIIRMLV G GI+KDEATVKSM+RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDG+
Subjt:  HISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGL

Query:  ADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAY-----IETR
        ADI+GRRFG++KI YN+NKS+VGSVAMASAGFLASVGYMYYFSSFGYV GS+RMVLGFLVVSLASALVESLPISTEIDDNLTVPLTS L+      IET 
Subjt:  ADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAY-----IETR

Query:  PASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFD
        P+SRSMSMP+EDNASS  QLSSN FGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQ HPSVA+EIFD
Subjt:  PASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFD

Query:  CMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
        CMNCGRSI+AGRFAPHLEKCMGRGRKAR KVTRSSTA QSR                            LAG EYSNG S +P
Subjt:  CMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP

KAG7019118.1 Farnesol kinase, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]2.9e-23990.47Show/hide
Query:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE
        MA  LQF F S  GSFGP FPFSSP   SRF PVSVSFNS   PIFRS  F LRF +KIRRE C VAAVMLLP NPVVSDICATA+SG VALSLLRLW E
Subjt:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE

Query:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP
        TAKRGLDQKLNRKLVH SIGLAFMLCWPMFSSG+RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSR GD RELLKGPLYYVATITLVCIFYWRTSP
Subjt:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL
        ISIALICNLCAGDGLADIVGRRFGS+KI YN+NKSL GSVAM SAGFLASVGYMYYFSSFGYV GSNRMVLGFLVVS+ASALVESLPISTEIDDNLTVPL
Subjt:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL

Query:  TSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQT
        TSLLAYIETRPASRSMSMP+ED+ASS TQLS NLFGDLLDSVI D+ASECHRIARLGLDRNLEEEEEELRLSAQAR RVADS NSSEANGKYVVDIFGQT
Subjt:  TSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQT

Query:  HPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
        HPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVS YSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
Subjt:  HPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP

KAG8471435.1 hypothetical protein CXB51_036412 [Gossypium anomalum]1.0e-17070.41Show/hide
Query:  SFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRGAI
        SF+     ++NF L +  K R      AA ML P+N + SD CA  +SG +ALS+LRLW ETAKRGL DQKLNRKLVHISIGL FMLCWP++SSGYRGAI
Subjt:  SFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRGAI

Query:  LASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVG
        LA++ PGVNIIRM+++G G+ KDEATVKSMSRYGDYRELLKGPLYY  TITL C FYWRTSPI+IA ICNLCAGDG ADIVGR+FG +K+ YN+NKS+ G
Subjt:  LASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVG

Query:  SVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL------------AYIETRPASR-----SMSMPNE
        SVAMA AGFL SVGYMYYFS FGY+  S  +V GFL+VSLASALVESLP+STE+DDNLTV LTS+L             +    P SR      +     
Subjt:  SVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL------------AYIETRPASR-----SMSMPNE

Query:  DNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAG
           +   +LSS+ FGDLLDS+IVD+ASECHRIA+LGLDRNLEEEEEE+RLS QAR RVAD SNSSE N KYVVDIFGQTHPSVA EIF+CMNCGRSI AG
Subjt:  DNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAG

Query:  RFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEE
        RFAPHLEKCMG+GRKAR KVTRSSTAAQ+RYSRG+PVSAYSPY NSTSTNRLPNGT S+AGEE
Subjt:  RFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEE

TrEMBL top hitse value%identityAlignment
A0A5D3BBT0 SAGA-associated factor 112.7e-21493.3Show/hide
Query:  MLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMS
        ML P+NPVVSDICATALS  VALSLL+LW ETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGY+GAI ASLIPG N+IRMLVLGFGILKDEAT+KSMS
Subjt:  MLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMS

Query:  RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRM
        RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDG ADIVGRRFGS+KIFYN NKSL GSVAMA+AGFLAS+GYMYYFSSFGYV  S  M
Subjt:  RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRM

Query:  VLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEEL
         L FL+VSLASALVESLPISTE+DDNLTVPLTSLLAYIET  ASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEEL
Subjt:  VLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEEL

Query:  RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTS
        RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTS
Subjt:  RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTS

Query:  TNRLPNGTSSLAGEEYSN
        TNRLPNGTSSLAGEEYSN
Subjt:  TNRLPNGTSSLAGEEYSN

A0A6J1DBM3 probable phytol kinase 3, chloroplastic4.6e-12980.84Show/hide
Query:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLS----KIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLR
        MA ILQ RF SS+G F P F    P F  +F PVSVSFN  S P      F LRF S    KIRR   PVAAVMLLP NPVVSDICATA++G +ALSLLR
Subjt:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLS----KIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLR

Query:  LWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYW
        LW ETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSG RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSRYGDYRELLKGPLYYV TITL CI YW
Subjt:  LWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYW

Query:  RTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNL
        RTSPISIAL+CNLCAGDGLAD++GRRFGS+KI YN+NKSL GSVAMASAGFLASVGYMYYFSSFGY+ GS+RM+LGFLVVS+ASALVESLPISTEIDDNL
Subjt:  RTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNL

Query:  TVPLTSLL
        +VPLTSLL
Subjt:  TVPLTSLL

A0A6J1HM59 farnesol kinase, chloroplastic-like4.6e-13787.5Show/hide
Query:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE
        MA  LQF F S  GSFGP FPFSSP   SRF PVSVSFNS   PIFRS  F LRF +KIRRE C VAAVMLLP NPVVSDICATA+SG VALSLLRLW E
Subjt:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE

Query:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP
        TAKRGLDQKLNRKLVH SIGLAFMLCWPMFSSG+RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSR GD RELLKGPLYYVATITLVCIFYWRTSP
Subjt:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL
        ISIALICNLCAGDGLADIVGRRFGS+KI YN+NKSL GSVAM SAGFLASVGYMYYFSSFGYV GSNRMVLGFLVVS+ASALVESLPISTEIDDNLTVPL
Subjt:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL

Query:  TSLL
        TSLL
Subjt:  TSLL

A0A6J1I3Z6 farnesol kinase, chloroplastic8.6e-13686.84Show/hide
Query:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE
        MA  LQF F S  GSFGP FPFSSP   SRF PVSVSFNS   PIFR   F LRF +KIRRE C VAA MLLP NPVVSDICATA+SG VALSLLRLW E
Subjt:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE

Query:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP
        TAKRGLDQKLNRKLVH SIGLAFMLCWPMFSSG+RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSR GD RELLKGPLYYVATITLVCIFYWRTSP
Subjt:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL
        ISIALICNLCAGDGLADIVGRRFGS+KI YN+NKSL GSVAM SAGFLASVGYMYYFSSFGYV GSNRMVLGFLVVS+ASALVESLPISTEIDDNLTVPL
Subjt:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL

Query:  TSLL
        TSLL
Subjt:  TSLL

D7MQE9 SAGA-associated factor 118.0e-13453.68Show/hide
Query:  PSLSSEPKMATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVAL
        P + S P +A I +F        F P   F + +FRS         + F     R S  +L   S  RR                    CA  ++ +VA 
Subjt:  PSLSSEPKMATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVAL

Query:  SLLRLWAETAKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLV
        S L  W E  KR  LDQKL RKLVHI+IGL FMLCWP+FSSG +GA+ ASL+PG+NIIRML+LG G+  DE T+KSMSR+GD RELLKGPLYY  +IT  
Subjt:  SLLRLWAETAKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLV

Query:  CIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPIST-
        CIFYW++SPI+IA+ICNLCAGDG+ADIVGRRFG++K+ YN+NKS  GS+ MA+AGFLASVGYMYYF+SFGY+  S  M+L FL++SLASALV  + +S  
Subjt:  CIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPIST-

Query:  --------------EIDDNLTVPLTSL-LAYIETRPASRSMSMPNE--DNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSA
                        D N    LT   +  + +      +S+P      A+    LSS +F DL+DSVI D+ASECHR+ARLGLDR+LE  EEELRLS 
Subjt:  --------------EIDDNLTVPLTSL-LAYIETRPASRSMSMPNE--DNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSA

Query:  QARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRL
        +AR +VAD SN+ E N K+VVDIFGQTHP VA E+F+CMNCGR I+AGRFAPHLEKCMG+GRKAR K TRS+TAAQ+R +R +P   YSPYPNS S N+L
Subjt:  QARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRL

Query:  PNGTSSLAGEEYSNGT
         +G+  +AGE+ SNGT
Subjt:  PNGTSSLAGEEYSNGT

SwissProt top hitse value%identityAlignment
Q2N2K0 Probable phytol kinase 3, chloroplastic1.1e-9267.01Show/hide
Query:  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVH
        P F F SP F S+  P  + F SFS+    SS+F   F S       P  + M L  +P+VSD+ ATA+SGVVALS LRL+ ETAKR L DQKLNRKLVH
Subjt:  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVH

Query:  ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLA
        ISIGL FMLC P+FS+    +  A+LIPG+NI RMLV+G GILKDEATVKSMSR+GDYRELLKGPLYY ATITL  I YWRTSPISIA ICNLCAGDG+A
Subjt:  ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLA

Query:  DIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL
        DIVGRR G +KI YN+NKS  GS+AMA+AGFL S+GYM+YFSSFG++ GS ++VLGFL+VS+ +A VESLPISTE+DDNLTVPLTS+L
Subjt:  DIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL

Q2N2K1 Probable phytol kinase 1, chloroplastic2.1e-4646.01Show/hide
Query:  SLLRLWAETAKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLV
        +L+R + E  +R  L Q L+RKLVHI  GL F++ WP+FS+  +    A+ +P VN +R+LV G  +  DE  +KS++R GD  ELL+GPLYYV  + L 
Subjt:  SLLRLWAETAKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLV

Query:  CIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLV-VSLASALVESLPIST
         + +WR SPI +  +  +CAGDG+ADI+GRR+GS KI YN +KSL GS++M   GFL S+G +YY+S  G+V       L  +  +S  + LVESLPI+ 
Subjt:  CIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLV-VSLASALVESLPIST

Query:  EIDDNLTVPLTSL
         +DDN++VPL ++
Subjt:  EIDDNLTVPLTSL

Q5N9J9 Probable phytol kinase 2, chloroplastic1.5e-7659.92Show/hide
Query:  RFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRML
        R ++   R    +AA +    + +  D+ + A++  VAL+LLR + E AKRG+ +QKLNRKLVHI+IG+ F+L WP+FSSG     LA++ PG+NIIRML
Subjt:  RFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRML

Query:  VLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVG
        +LG G++K+EA VKSMSR GD RELLKGPLYY  TIT     +WRTSPI+IALICNLCAGDG+ADIVGRR G +K+ YN NKS  GS+AMA AGF+AS+G
Subjt:  VLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVG

Query:  YMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL
        YM+YF SFG++  S  +  GFLVVS+ +ALVES PIST +DDNLTVPLTS L
Subjt:  YMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL

Q67ZM7 Farnesol kinase, chloroplastic8.1e-9161.59Show/hide
Query:  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVH
        P   F SP  R   + ++ SF S       SS F     +KIR+    +AAVM  P+N V+SD+CA  ++ +VA S L  W E  KRG+ DQKL RKLVH
Subjt:  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVH

Query:  ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLA
        I+IGL FMLCWP+FSSG +GA+ ASL+PG+NI+RML+LG G+  DE T+KSMSR+GD RELLKGPLYYV +IT  CI+YW++SPI+IA+ICNLCAGDG+A
Subjt:  ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLA

Query:  DIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLA
        DIVGRRFG++K+ YN+NKS  GS+ MA+AGFLASV YMYYF+SFGY+  S  M+L FLV+S+ASALVESLPIST+IDDNLT+ LTS LA
Subjt:  DIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLA

Q94BV2 SAGA-associated factor 113.1e-5868.05Show/hide
Query:  EDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA
        EDN SS  QLSS +F DL+DSVI D+ASECHR+ARLGLDR+L+  EEELRLS +AR ++AD SN+ E N KYVVDIFGQTHP VA+E+F+CMNCGR I+A
Subjt:  EDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA

Query:  GRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT
        GRFAPHLEKCMG+GRKAR K TRS+TAAQ+R +R +P   YSPYPNS S N+L +G+  +AGE+ SN T
Subjt:  GRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT

Arabidopsis top hitse value%identityAlignment
AT5G04490.1 vitamin E pathway gene 51.8e-4541.88Show/hide
Query:  NPVVSDICAT--ALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYG
        N ++ D+ AT   L G  AL +L   + T +  + Q L+RKLVHI  GL F+L WP+FS        A+ +P VN +R+++ G  I  +   +KS++R G
Subjt:  NPVVSDICAT--ALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYG

Query:  DYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYV-VGSNRMVL
           ELLKGPL+YV  +    +F+WR SPI +  +  +C GDG+ADI+GR+FGS KI YN  KS  GS++M   GF  S+  +YY+SS GY+ +     + 
Subjt:  DYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYV-VGSNRMVL

Query:  GFLVVSLASALVESLPISTEIDDNLTVPLTSLLA
           +VS+ + +VESLPI+ ++DDN++VPL ++LA
Subjt:  GFLVVSLASALVESLPISTEIDDNLTVPLTSLLA

AT5G58560.1 Phosphatidate cytidylyltransferase family protein5.8e-9261.59Show/hide
Query:  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVH
        P   F SP  R   + ++ SF S       SS F     +KIR+    +AAVM  P+N V+SD+CA  ++ +VA S L  W E  KRG+ DQKL RKLVH
Subjt:  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVH

Query:  ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLA
        I+IGL FMLCWP+FSSG +GA+ ASL+PG+NI+RML+LG G+  DE T+KSMSR+GD RELLKGPLYYV +IT  CI+YW++SPI+IA+ICNLCAGDG+A
Subjt:  ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLA

Query:  DIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLA
        DIVGRRFG++K+ YN+NKS  GS+ MA+AGFLASV YMYYF+SFGY+  S  M+L FLV+S+ASALVESLPIST+IDDNLT+ LTS LA
Subjt:  DIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLA

AT5G58575.1 CONTAINS InterPro DOMAIN/s: Sgf11, transcriptional regulation (InterPro:IPR013246); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).2.2e-5968.05Show/hide
Query:  EDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA
        EDN SS  QLSS +F DL+DSVI D+ASECHR+ARLGLDR+L+  EEELRLS +AR ++AD SN+ E N KYVVDIFGQTHP VA+E+F+CMNCGR I+A
Subjt:  EDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA

Query:  GRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT
        GRFAPHLEKCMG+GRKAR K TRS+TAAQ+R +R +P   YSPYPNS S N+L +G+  +AGE+ SN T
Subjt:  GRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTAGCCTCCATGGCAGGTTATCGCTCGATCCATCTCTTTCAAGTGAACCAAAAATGGCTACAATTCTTCAATTTCGATTCTGCTCGTCAGTTGGGTCGTTTGGCCC
TTTCTTTCCCTTCAGCTCCCCCAATTTTCGCTCTCGATTCATACCAGTTTCCGTCTCTTTCAACTCCTTTTCTACGCCAATCTTCCGCTCCAGTAACTTCGCTTTGAGAT
TTCTGTCCAAAATCCGTCGGGAACACTGCCCAGTTGCGGCAGTTATGTTGTTGCCTAAAAATCCGGTGGTCTCCGACATCTGCGCCACCGCCTTGTCCGGCGTAGTCGCC
TTGTCTTTGCTTCGATTATGGGCAGAAACGGCGAAACGTGGCCTCGACCAGAAATTGAACAGGAAGCTTGTTCATATAAGCATTGGGCTTGCTTTCATGCTTTGCTGGCC
TATGTTCAGTTCTGGTTATCGAGGAGCAATACTTGCGTCTCTAATTCCCGGTGTCAATATAATACGAATGCTCGTCTTGGGATTCGGGATATTGAAAGACGAGGCTACAG
TGAAATCAATGAGCAGATATGGAGACTACAGGGAGCTTTTGAAGGGGCCTTTGTATTATGTTGCAACAATTACATTAGTTTGTATATTCTACTGGAGGACGTCCCCCATT
TCAATTGCTCTGATATGCAACTTATGTGCCGGAGATGGGTTGGCTGATATCGTTGGAAGACGATTTGGAAGTAAAAAGATCTTTTACAACAGAAACAAGTCTCTAGTTGG
TAGTGTGGCAATGGCATCTGCTGGTTTTCTTGCATCTGTTGGGTATATGTACTATTTCTCATCATTTGGGTATGTTGTGGGAAGCAACAGAATGGTTTTGGGATTCTTAG
TTGTGTCCCTTGCCTCAGCATTGGTGGAGTCTCTCCCCATAAGCACTGAGATTGATGACAACCTTACAGTTCCACTCACTTCTTTGCTGGCCTATATTGAAACTCGTCCT
GCTTCTAGATCCATGTCAATGCCTAATGAGGACAATGCATCTTCACAAACTCAGCTTTCATCTAATTTGTTTGGAGATCTTCTGGATTCTGTGATTGTTGATATTGCATC
AGAATGTCATCGAATAGCAAGGTTAGGTCTTGATCGTAACTTAGAAGAGGAAGAAGAAGAATTAAGACTTTCAGCACAGGCACGAGTAAGAGTAGCTGATTCTAGCAATA
GTAGTGAGGCAAACGGCAAATATGTAGTTGATATTTTTGGACAAACTCATCCTTCTGTTGCGAATGAAATATTTGATTGCATGAATTGTGGTCGATCAATTATGGCTGGG
AGATTTGCTCCTCATTTAGAGAAATGCATGGGAAGGGGTAGAAAGGCTCGTCCCAAAGTAACAAGAAGTAGTACAGCTGCCCAGAGCCGGTATTCACGAGGCAATCCTGT
TTCTGCATATTCCCCTTACCCTAATTCCACCAGCACGAATCGCTTACCTAATGGAACGTCTAGTCTTGCAGGGGAGGAGTACTCAAATGGTACATCTGAAGACCCATGA
mRNA sequenceShow/hide mRNA sequence
GCGTATCCGAAAATGCCATTAAATTCAATATATACATATCCACCAATTTGTCATTTACCTGTTGGTTCATTAATTCCTTAGCCTCAAAACCCACGCAAGCGCCAGCGACG
ACGTTCATGCTTAGCCTCCATGGCAGGTTATCGCTCGATCCATCTCTTTCAAGTGAACCAAAAATGGCTACAATTCTTCAATTTCGATTCTGCTCGTCAGTTGGGTCGTT
TGGCCCTTTCTTTCCCTTCAGCTCCCCCAATTTTCGCTCTCGATTCATACCAGTTTCCGTCTCTTTCAACTCCTTTTCTACGCCAATCTTCCGCTCCAGTAACTTCGCTT
TGAGATTTCTGTCCAAAATCCGTCGGGAACACTGCCCAGTTGCGGCAGTTATGTTGTTGCCTAAAAATCCGGTGGTCTCCGACATCTGCGCCACCGCCTTGTCCGGCGTA
GTCGCCTTGTCTTTGCTTCGATTATGGGCAGAAACGGCGAAACGTGGCCTCGACCAGAAATTGAACAGGAAGCTTGTTCATATAAGCATTGGGCTTGCTTTCATGCTTTG
CTGGCCTATGTTCAGTTCTGGTTATCGAGGAGCAATACTTGCGTCTCTAATTCCCGGTGTCAATATAATACGAATGCTCGTCTTGGGATTCGGGATATTGAAAGACGAGG
CTACAGTGAAATCAATGAGCAGATATGGAGACTACAGGGAGCTTTTGAAGGGGCCTTTGTATTATGTTGCAACAATTACATTAGTTTGTATATTCTACTGGAGGACGTCC
CCCATTTCAATTGCTCTGATATGCAACTTATGTGCCGGAGATGGGTTGGCTGATATCGTTGGAAGACGATTTGGAAGTAAAAAGATCTTTTACAACAGAAACAAGTCTCT
AGTTGGTAGTGTGGCAATGGCATCTGCTGGTTTTCTTGCATCTGTTGGGTATATGTACTATTTCTCATCATTTGGGTATGTTGTGGGAAGCAACAGAATGGTTTTGGGAT
TCTTAGTTGTGTCCCTTGCCTCAGCATTGGTGGAGTCTCTCCCCATAAGCACTGAGATTGATGACAACCTTACAGTTCCACTCACTTCTTTGCTGGCCTATATTGAAACT
CGTCCTGCTTCTAGATCCATGTCAATGCCTAATGAGGACAATGCATCTTCACAAACTCAGCTTTCATCTAATTTGTTTGGAGATCTTCTGGATTCTGTGATTGTTGATAT
TGCATCAGAATGTCATCGAATAGCAAGGTTAGGTCTTGATCGTAACTTAGAAGAGGAAGAAGAAGAATTAAGACTTTCAGCACAGGCACGAGTAAGAGTAGCTGATTCTA
GCAATAGTAGTGAGGCAAACGGCAAATATGTAGTTGATATTTTTGGACAAACTCATCCTTCTGTTGCGAATGAAATATTTGATTGCATGAATTGTGGTCGATCAATTATG
GCTGGGAGATTTGCTCCTCATTTAGAGAAATGCATGGGAAGGGGTAGAAAGGCTCGTCCCAAAGTAACAAGAAGTAGTACAGCTGCCCAGAGCCGGTATTCACGAGGCAA
TCCTGTTTCTGCATATTCCCCTTACCCTAATTCCACCAGCACGAATCGCTTACCTAATGGAACGTCTAGTCTTGCAGGGGAGGAGTACTCAAATGGTACATCTGAAGACC
CATGAACAACAGAGCAATGGCTGAATTATTCCAATTTAGGAAAGCATACTGTAATCTAAATGTGCTTGCAGGAATATAATGATATCTCCTGTCATGTTCAGTTTTATTTT
ACATGACTTCTTTTTATGGGTTATCATTACATTGGAAAAAAATTATATCATGTCTGGAATCTGGATGTACTATGAGTTCATTAGTTTCGACTGTTCCAAATTTTGATGCT
GAAGATCTTGTACAGAAGTTCGAGAAAAAGGAAGTTAAATGCAATTCTAATTTCATGTCTTCAATTTCAATTATAAAATTGAGTGCAATTGGTCAATCTGAATATTTCTT
CAATGTGGGATTC
Protein sequenceShow/hide protein sequence
MLSLHGRLSLDPSLSSEPKMATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVA
LSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPI
SIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIETRP
ASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAG
RFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP