; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10021801 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10021801
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSAGA-associated factor 11
Genome locationChr05:16928240..16937180
RNA-Seq ExpressionHG10021801
SyntenyHG10021801
Gene Ontology termsGO:0006325 - chromatin organization (biological process)
GO:0009737 - response to abscisic acid (biological process)
GO:0016310 - phosphorylation (biological process)
GO:0016487 - farnesol metabolic process (biological process)
GO:0048440 - carpel development (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0031969 - chloroplast membrane (cellular component)
GO:0070461 - SAGA-type complex (cellular component)
GO:0016301 - kinase activity (molecular function)
GO:0046872 - metal ion binding (molecular function)
GO:0052668 - farnesol kinase activity (molecular function)
GO:0052670 - geraniol kinase activity (molecular function)
GO:0052671 - geranylgeraniol kinase activity (molecular function)
InterPro domainsIPR013246 - SAGA complex, Sgf11 subunit
IPR039606 - Phytol/farnesol kinase


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0060025.1 farnesol kinase [Cucumis melo var. makuwa]5.4e-21493.3Show/hide
Query:  MLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMS
        ML P+NPVVSDICATALS  VALSLL+LW ETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGY+GAI ASLIPG N+IRMLVLGFGILKDEAT+KSMS
Subjt:  MLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMS

Query:  RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRM
        RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDG ADIVGRRFGS+KIFYN NKSL GSVAMA+AGFLAS+GYMYYFSSFGYV  S  M
Subjt:  RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRM

Query:  VLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEEL
         L FL+VSLASALVESLPISTE+DDNLTVPLTSLLAYIET  ASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEEL
Subjt:  VLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEEL

Query:  RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTS
        RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTS
Subjt:  RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTS

Query:  TNRLPNGTSSLAGEEYSN
        TNRLPNGTSSLAGEEYSN
Subjt:  TNRLPNGTSSLAGEEYSN

KAG6583348.1 hypothetical protein SDJN03_19280, partial [Cucurbita argyrosperma subsp. sororia]3.8e-21284.2Show/hide
Query:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE
        MA  LQF F S  GSFGP FPFSSP   SRF PVSVSFNS   PIFRS  F LRF +KIRRE C VAAVMLLP NPVVSDICATA+SG VALSLLRLW E
Subjt:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE

Query:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP
        TAKRGLDQKLNRKLVH SIGLAFMLCWPMFSSG+RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSR GD RELLKGPLYYVATITLVCIFYWRTSP
Subjt:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL
        ISIALICNLCAGDGLADIVGRRFGS+KI YN+NKSL GSVAM SAGFLASVGYMYYFSSFGYV GSNRMVLGFLVVS+ASALVESLPISTEIDDNLTVPL
Subjt:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL

Query:  TSLL------------------------------AYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELR
        TSLL                              AYIETRPASRSMSMP+ED+ASS TQLS NLFGDLLDSVI D+ASECHRIARLGLDRNLEEEEEELR
Subjt:  TSLL------------------------------AYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELR

Query:  LSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSR
        LSAQAR RVADS NSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSR
Subjt:  LSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSR

KAG6607329.1 hypothetical protein SDJN03_00671, partial [Cucurbita argyrosperma subsp. sororia]7.5e-20079.71Show/hide
Query:  FPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLS----KIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLV
        +P  SP+F SRF  +SVS NS S P  RS +F  RF S    KIRR+  PVAA MLLP NPVVSDICA+ LSG VA SLLRLWAETAKRGLDQKLNRKLV
Subjt:  FPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLS----KIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLV

Query:  HISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGL
        HISIGLAFMLCWPMFSSG RGAILASL+PGVNIIRMLV G GI+KDEATVKSM+RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDG+
Subjt:  HISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGL

Query:  ADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAY-----IETR
        ADI+GRRFG++KI YN+NKS+VGSVAMASAGFLASVGYMYYFSSFGYV GS+RMVLGFLVVSLASALVESLPISTEIDDNLTVPLTS L+      IET 
Subjt:  ADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAY-----IETR

Query:  PASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFD
        P+SRSMSMP+EDNASS  QLSSN FGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQ HPSVA+EIFD
Subjt:  PASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFD

Query:  CMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
        CMNCGRSI+AGRFAPHLEKCMGRGRKAR KVTRSSTA QSR                            LAG EYSNG S +P
Subjt:  CMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP

KAG7019118.1 Farnesol kinase, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]2.8e-23990.47Show/hide
Query:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE
        MA  LQF F S  GSFGP FPFSSP   SRF PVSVSFNS   PIFRS  F LRF +KIRRE C VAAVMLLP NPVVSDICATA+SG VALSLLRLW E
Subjt:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE

Query:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP
        TAKRGLDQKLNRKLVH SIGLAFMLCWPMFSSG+RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSR GD RELLKGPLYYVATITLVCIFYWRTSP
Subjt:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL
        ISIALICNLCAGDGLADIVGRRFGS+KI YN+NKSL GSVAM SAGFLASVGYMYYFSSFGYV GSNRMVLGFLVVS+ASALVESLPISTEIDDNLTVPL
Subjt:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL

Query:  TSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQT
        TSLLAYIETRPASRSMSMP+ED+ASS TQLS NLFGDLLDSVI D+ASECHRIARLGLDRNLEEEEEELRLSAQAR RVADS NSSEANGKYVVDIFGQT
Subjt:  TSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQT

Query:  HPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
        HPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVS YSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP
Subjt:  HPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP

KAG8471435.1 hypothetical protein CXB51_036412 [Gossypium anomalum]9.6e-17170.41Show/hide
Query:  SFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRGAI
        SF+     ++NF L +  K R      AA ML P+N + SD CA  +SG +ALS+LRLW ETAKRGL DQKLNRKLVHISIGL FMLCWP++SSGYRGAI
Subjt:  SFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRGAI

Query:  LASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVG
        LA++ PGVNIIRM+++G G+ KDEATVKSMSRYGDYRELLKGPLYY  TITL C FYWRTSPI+IA ICNLCAGDG ADIVGR+FG +K+ YN+NKS+ G
Subjt:  LASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVG

Query:  SVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL------------AYIETRPASR-----SMSMPNE
        SVAMA AGFL SVGYMYYFS FGY+  S  +V GFL+VSLASALVESLP+STE+DDNLTV LTS+L             +    P SR      +     
Subjt:  SVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL------------AYIETRPASR-----SMSMPNE

Query:  DNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAG
           +   +LSS+ FGDLLDS+IVD+ASECHRIA+LGLDRNLEEEEEE+RLS QAR RVAD SNSSE N KYVVDIFGQTHPSVA EIF+CMNCGRSI AG
Subjt:  DNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAG

Query:  RFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEE
        RFAPHLEKCMG+GRKAR KVTRSSTAAQ+RYSRG+PVSAYSPY NSTSTNRLPNGT S+AGEE
Subjt:  RFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEE

TrEMBL top hitse value%identityAlignment
A0A5D3BBT0 SAGA-associated factor 112.6e-21493.3Show/hide
Query:  MLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMS
        ML P+NPVVSDICATALS  VALSLL+LW ETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGY+GAI ASLIPG N+IRMLVLGFGILKDEAT+KSMS
Subjt:  MLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMS

Query:  RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRM
        RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDG ADIVGRRFGS+KIFYN NKSL GSVAMA+AGFLAS+GYMYYFSSFGYV  S  M
Subjt:  RYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRM

Query:  VLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEEL
         L FL+VSLASALVESLPISTE+DDNLTVPLTSLLAYIET  ASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVD+ASECHRIARLGLDRNLEEEEEEL
Subjt:  VLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIETRPASRSMSMPNEDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEEL

Query:  RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTS
        RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTS
Subjt:  RLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTS

Query:  TNRLPNGTSSLAGEEYSN
        TNRLPNGTSSLAGEEYSN
Subjt:  TNRLPNGTSSLAGEEYSN

A0A6J1DBM3 probable phytol kinase 3, chloroplastic4.4e-12980.84Show/hide
Query:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLS----KIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLR
        MA ILQ RF SS+G F P F    P F  +F PVSVSFN  S P      F LRF S    KIRR   PVAAVMLLP NPVVSDICATA++G +ALSLLR
Subjt:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLS----KIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLR

Query:  LWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYW
        LW ETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSG RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSRYGDYRELLKGPLYYV TITL CI YW
Subjt:  LWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYW

Query:  RTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNL
        RTSPISIAL+CNLCAGDGLAD++GRRFGS+KI YN+NKSL GSVAMASAGFLASVGYMYYFSSFGY+ GS+RM+LGFLVVS+ASALVESLPISTEIDDNL
Subjt:  RTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNL

Query:  TVPLTSLL
        +VPLTSLL
Subjt:  TVPLTSLL

A0A6J1HM59 farnesol kinase, chloroplastic-like4.4e-13787.5Show/hide
Query:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE
        MA  LQF F S  GSFGP FPFSSP   SRF PVSVSFNS   PIFRS  F LRF +KIRRE C VAAVMLLP NPVVSDICATA+SG VALSLLRLW E
Subjt:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE

Query:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP
        TAKRGLDQKLNRKLVH SIGLAFMLCWPMFSSG+RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSR GD RELLKGPLYYVATITLVCIFYWRTSP
Subjt:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL
        ISIALICNLCAGDGLADIVGRRFGS+KI YN+NKSL GSVAM SAGFLASVGYMYYFSSFGYV GSNRMVLGFLVVS+ASALVESLPISTEIDDNLTVPL
Subjt:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL

Query:  TSLL
        TSLL
Subjt:  TSLL

A0A6J1I3Z6 farnesol kinase, chloroplastic8.3e-13686.84Show/hide
Query:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE
        MA  LQF F S  GSFGP FPFSSP   SRF PVSVSFNS   PIFR   F LRF +KIRRE C VAA MLLP NPVVSDICATA+SG VALSLLRLW E
Subjt:  MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAE

Query:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP
        TAKRGLDQKLNRKLVH SIGLAFMLCWPMFSSG+RGA+LASLIPGVNIIRMLVLG GILKDEATVKSMSR GD RELLKGPLYYVATITLVCIFYWRTSP
Subjt:  TAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL
        ISIALICNLCAGDGLADIVGRRFGS+KI YN+NKSL GSVAM SAGFLASVGYMYYFSSFGYV GSNRMVLGFLVVS+ASALVESLPISTEIDDNLTVPL
Subjt:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPL

Query:  TSLL
        TSLL
Subjt:  TSLL

D7MQE9 SAGA-associated factor 112.9e-13355.03Show/hide
Query:  TILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRF-LSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAET
        T L    CS + S     P S  N    F P+      F T  FRSS+   RF  ++IR+     +   L   +      CA  ++ +VA S L  W E 
Subjt:  TILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRF-LSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAET

Query:  AKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP
         KR  LDQKL RKLVHI+IGL FMLCWP+FSSG +GA+ ASL+PG+NIIRML+LG G+  DE T+KSMSR+GD RELLKGPLYY  +IT  CIFYW++SP
Subjt:  AKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSP

Query:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPIST----------
        I+IA+ICNLCAGDG+ADIVGRRFG++K+ YN+NKS  GS+ MA+AGFLASVGYMYYF+SFGY+  S  M+L FL++SLASALV  + +S           
Subjt:  ISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPIST----------

Query:  -----EIDDNLTVPLTSL-LAYIETRPASRSMSMPNE--DNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADS
               D N    LT   +  + +      +S+P      A+    LSS +F DL+DSVI D+ASECHR+ARLGLDR+LE  EEELRLS +AR +VAD 
Subjt:  -----EIDDNLTVPLTSL-LAYIETRPASRSMSMPNE--DNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADS

Query:  SNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAG
        SN+ E N K+VVDIFGQTHP VA E+F+CMNCGR I+AGRFAPHLEKCMG+GRKAR K TRS+TAAQ+R +R +P   YSPYPNS S N+L +G+  +AG
Subjt:  SNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAG

Query:  EEYSNGT
        E+ SNGT
Subjt:  EEYSNGT

SwissProt top hitse value%identityAlignment
Q2N2K0 Probable phytol kinase 3, chloroplastic1.1e-9267.01Show/hide
Query:  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVH
        P F F SP F S+  P  + F SFS+    SS+F   F S       P  + M L  +P+VSD+ ATA+SGVVALS LRL+ ETAKR L DQKLNRKLVH
Subjt:  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVH

Query:  ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLA
        ISIGL FMLC P+FS+    +  A+LIPG+NI RMLV+G GILKDEATVKSMSR+GDYRELLKGPLYY ATITL  I YWRTSPISIA ICNLCAGDG+A
Subjt:  ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLA

Query:  DIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL
        DIVGRR G +KI YN+NKS  GS+AMA+AGFL S+GYM+YFSSFG++ GS ++VLGFL+VS+ +A VESLPISTE+DDNLTVPLTS+L
Subjt:  DIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL

Q2N2K1 Probable phytol kinase 1, chloroplastic2.0e-4646.01Show/hide
Query:  SLLRLWAETAKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLV
        +L+R + E  +R  L Q L+RKLVHI  GL F++ WP+FS+  +    A+ +P VN +R+LV G  +  DE  +KS++R GD  ELL+GPLYYV  + L 
Subjt:  SLLRLWAETAKRG-LDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLV

Query:  CIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLV-VSLASALVESLPIST
         + +WR SPI +  +  +CAGDG+ADI+GRR+GS KI YN +KSL GS++M   GFL S+G +YY+S  G+V       L  +  +S  + LVESLPI+ 
Subjt:  CIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLV-VSLASALVESLPIST

Query:  EIDDNLTVPLTSL
         +DDN++VPL ++
Subjt:  EIDDNLTVPLTSL

Q5N9J9 Probable phytol kinase 2, chloroplastic1.4e-7659.92Show/hide
Query:  RFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRML
        R ++   R    +AA +    + +  D+ + A++  VAL+LLR + E AKRG+ +QKLNRKLVHI+IG+ F+L WP+FSSG     LA++ PG+NIIRML
Subjt:  RFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRML

Query:  VLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVG
        +LG G++K+EA VKSMSR GD RELLKGPLYY  TIT     +WRTSPI+IALICNLCAGDG+ADIVGRR G +K+ YN NKS  GS+AMA AGF+AS+G
Subjt:  VLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVG

Query:  YMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL
        YM+YF SFG++  S  +  GFLVVS+ +ALVES PIST +DDNLTVPLTS L
Subjt:  YMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLL

Q67ZM7 Farnesol kinase, chloroplastic7.8e-9161.59Show/hide
Query:  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVH
        P   F SP  R   + ++ SF S       SS F     +KIR+    +AAVM  P+N V+SD+CA  ++ +VA S L  W E  KRG+ DQKL RKLVH
Subjt:  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVH

Query:  ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLA
        I+IGL FMLCWP+FSSG +GA+ ASL+PG+NI+RML+LG G+  DE T+KSMSR+GD RELLKGPLYYV +IT  CI+YW++SPI+IA+ICNLCAGDG+A
Subjt:  ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLA

Query:  DIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLA
        DIVGRRFG++K+ YN+NKS  GS+ MA+AGFLASV YMYYF+SFGY+  S  M+L FLV+S+ASALVESLPIST+IDDNLT+ LTS LA
Subjt:  DIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLA

Q94BV2 SAGA-associated factor 113.0e-5868.05Show/hide
Query:  EDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA
        EDN SS  QLSS +F DL+DSVI D+ASECHR+ARLGLDR+L+  EEELRLS +AR ++AD SN+ E N KYVVDIFGQTHP VA+E+F+CMNCGR I+A
Subjt:  EDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA

Query:  GRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT
        GRFAPHLEKCMG+GRKAR K TRS+TAAQ+R +R +P   YSPYPNS S N+L +G+  +AGE+ SN T
Subjt:  GRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT

Arabidopsis top hitse value%identityAlignment
AT5G04490.1 vitamin E pathway gene 51.7e-4541.88Show/hide
Query:  NPVVSDICAT--ALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYG
        N ++ D+ AT   L G  AL +L   + T +  + Q L+RKLVHI  GL F+L WP+FS        A+ +P VN +R+++ G  I  +   +KS++R G
Subjt:  NPVVSDICAT--ALSGVVALSLLRLWAETAKRGLDQKLNRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYG

Query:  DYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYV-VGSNRMVL
           ELLKGPL+YV  +    +F+WR SPI +  +  +C GDG+ADI+GR+FGS KI YN  KS  GS++M   GF  S+  +YY+SS GY+ +     + 
Subjt:  DYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYV-VGSNRMVL

Query:  GFLVVSLASALVESLPISTEIDDNLTVPLTSLLA
           +VS+ + +VESLPI+ ++DDN++VPL ++LA
Subjt:  GFLVVSLASALVESLPISTEIDDNLTVPLTSLLA

AT5G58560.1 Phosphatidate cytidylyltransferase family protein5.5e-9261.59Show/hide
Query:  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVH
        P   F SP  R   + ++ SF S       SS F     +KIR+    +AAVM  P+N V+SD+CA  ++ +VA S L  W E  KRG+ DQKL RKLVH
Subjt:  PFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGL-DQKLNRKLVH

Query:  ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLA
        I+IGL FMLCWP+FSSG +GA+ ASL+PG+NI+RML+LG G+  DE T+KSMSR+GD RELLKGPLYYV +IT  CI+YW++SPI+IA+ICNLCAGDG+A
Subjt:  ISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLA

Query:  DIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLA
        DIVGRRFG++K+ YN+NKS  GS+ MA+AGFLASV YMYYF+SFGY+  S  M+L FLV+S+ASALVESLPIST+IDDNLT+ LTS LA
Subjt:  DIVGRRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLA

AT5G58575.1 CONTAINS InterPro DOMAIN/s: Sgf11, transcriptional regulation (InterPro:IPR013246); Has 30201 Blast hits to 17322 proteins in 780 species: Archae - 12; Bacteria - 1396; Metazoa - 17338; Fungi - 3422; Plants - 5037; Viruses - 0; Other Eukaryotes - 2996 (source: NCBI BLink).2.1e-5968.05Show/hide
Query:  EDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA
        EDN SS  QLSS +F DL+DSVI D+ASECHR+ARLGLDR+L+  EEELRLS +AR ++AD SN+ E N KYVVDIFGQTHP VA+E+F+CMNCGR I+A
Subjt:  EDNASSQTQLSSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMA

Query:  GRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT
        GRFAPHLEKCMG+GRKAR K TRS+TAAQ+R +R +P   YSPYPNS S N+L +G+  +AGE+ SN T
Subjt:  GRFAPHLEKCMGRGRKARPKVTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTACAATTCTTCAATTTCGATTCTGCTCGTCAGTTGGGTCGTTTGGCCCTTTCTTTCCCTTCAGCTCCCCCAATTTTCGCTCTCGATTCATACCAGTTTCCGTCTC
TTTCAACTCCTTTTCTACGCCAATCTTCCGCTCCAGTAACTTCGCTTTGAGATTTCTGTCCAAAATCCGTCGGGAACACTGCCCAGTTGCGGCAGTTATGTTGTTGCCTA
AAAATCCGGTGGTCTCCGACATCTGCGCCACCGCCTTGTCCGGCGTAGTCGCCTTGTCTTTGCTTCGATTATGGGCAGAAACGGCGAAACGTGGCCTCGACCAGAAATTG
AACAGGAAGCTTGTTCATATAAGCATTGGGCTTGCTTTCATGCTTTGCTGGCCTATGTTCAGTTCTGGTTATCGAGGAGCAATACTTGCGTCTCTAATTCCCGGTGTCAA
TATAATACGAATGCTCGTCTTGGGATTCGGGATATTGAAAGACGAGGCTACAGTGAAATCAATGAGCAGATATGGAGACTACAGGGAGCTTTTGAAGGGGCCTTTGTATT
ATGTTGCAACAATTACATTAGTTTGTATATTCTACTGGAGGACGTCCCCCATTTCAATTGCTCTGATATGCAACTTATGTGCCGGAGATGGGTTGGCTGATATCGTTGGA
AGACGATTTGGAAGTAAAAAGATCTTTTACAACAGAAACAAGTCTCTAGTTGGTAGTGTGGCAATGGCATCTGCTGGTTTTCTTGCATCTGTTGGGTATATGTACTATTT
CTCATCATTTGGGTATGTTGTGGGAAGCAACAGAATGGTTTTGGGATTCTTAGTTGTGTCCCTTGCCTCAGCATTGGTGGAGTCTCTCCCCATAAGCACTGAGATTGATG
ACAACCTTACAGTTCCACTCACTTCTTTGCTGGCCTATATTGAAACTCGTCCTGCTTCTAGATCCATGTCAATGCCTAATGAGGACAATGCATCTTCACAAACTCAGCTT
TCATCTAATTTGTTTGGAGATCTTCTGGATTCTGTGATTGTTGATATTGCATCAGAATGTCATCGAATAGCAAGGTTAGGTCTTGATCGTAACTTAGAAGAGGAAGAAGA
AGAATTAAGACTTTCAGCACAGGCACGAGTAAGAGTAGCTGATTCTAGCAATAGTAGTGAGGCAAACGGCAAATATGTAGTTGATATTTTTGGACAAACTCATCCTTCTG
TTGCGAATGAAATATTTGATTGCATGAATTGTGGTCGATCAATTATGGCTGGGAGATTTGCTCCTCATTTAGAGAAATGCATGGGAAGGGGTAGAAAGGCTCGTCCCAAA
GTAACAAGAAGTAGTACAGCTGCCCAGAGCCGGTATTCACGAGGCAATCCTGTTTCTGCATATTCCCCTTACCCTAATTCCACCAGCACGAATCGCTTACCTAATGGAAC
GTCTAGTCTTGCAGGGGAGGAGTACTCAAATGGTACATCTGAAGACCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTACAATTCTTCAATTTCGATTCTGCTCGTCAGTTGGGTCGTTTGGCCCTTTCTTTCCCTTCAGCTCCCCCAATTTTCGCTCTCGATTCATACCAGTTTCCGTCTC
TTTCAACTCCTTTTCTACGCCAATCTTCCGCTCCAGTAACTTCGCTTTGAGATTTCTGTCCAAAATCCGTCGGGAACACTGCCCAGTTGCGGCAGTTATGTTGTTGCCTA
AAAATCCGGTGGTCTCCGACATCTGCGCCACCGCCTTGTCCGGCGTAGTCGCCTTGTCTTTGCTTCGATTATGGGCAGAAACGGCGAAACGTGGCCTCGACCAGAAATTG
AACAGGAAGCTTGTTCATATAAGCATTGGGCTTGCTTTCATGCTTTGCTGGCCTATGTTCAGTTCTGGTTATCGAGGAGCAATACTTGCGTCTCTAATTCCCGGTGTCAA
TATAATACGAATGCTCGTCTTGGGATTCGGGATATTGAAAGACGAGGCTACAGTGAAATCAATGAGCAGATATGGAGACTACAGGGAGCTTTTGAAGGGGCCTTTGTATT
ATGTTGCAACAATTACATTAGTTTGTATATTCTACTGGAGGACGTCCCCCATTTCAATTGCTCTGATATGCAACTTATGTGCCGGAGATGGGTTGGCTGATATCGTTGGA
AGACGATTTGGAAGTAAAAAGATCTTTTACAACAGAAACAAGTCTCTAGTTGGTAGTGTGGCAATGGCATCTGCTGGTTTTCTTGCATCTGTTGGGTATATGTACTATTT
CTCATCATTTGGGTATGTTGTGGGAAGCAACAGAATGGTTTTGGGATTCTTAGTTGTGTCCCTTGCCTCAGCATTGGTGGAGTCTCTCCCCATAAGCACTGAGATTGATG
ACAACCTTACAGTTCCACTCACTTCTTTGCTGGCCTATATTGAAACTCGTCCTGCTTCTAGATCCATGTCAATGCCTAATGAGGACAATGCATCTTCACAAACTCAGCTT
TCATCTAATTTGTTTGGAGATCTTCTGGATTCTGTGATTGTTGATATTGCATCAGAATGTCATCGAATAGCAAGGTTAGGTCTTGATCGTAACTTAGAAGAGGAAGAAGA
AGAATTAAGACTTTCAGCACAGGCACGAGTAAGAGTAGCTGATTCTAGCAATAGTAGTGAGGCAAACGGCAAATATGTAGTTGATATTTTTGGACAAACTCATCCTTCTG
TTGCGAATGAAATATTTGATTGCATGAATTGTGGTCGATCAATTATGGCTGGGAGATTTGCTCCTCATTTAGAGAAATGCATGGGAAGGGGTAGAAAGGCTCGTCCCAAA
GTAACAAGAAGTAGTACAGCTGCCCAGAGCCGGTATTCACGAGGCAATCCTGTTTCTGCATATTCCCCTTACCCTAATTCCACCAGCACGAATCGCTTACCTAATGGAAC
GTCTAGTCTTGCAGGGGAGGAGTACTCAAATGGTACATCTGAAGACCCATGA
Protein sequenceShow/hide protein sequence
MATILQFRFCSSVGSFGPFFPFSSPNFRSRFIPVSVSFNSFSTPIFRSSNFALRFLSKIRREHCPVAAVMLLPKNPVVSDICATALSGVVALSLLRLWAETAKRGLDQKL
NRKLVHISIGLAFMLCWPMFSSGYRGAILASLIPGVNIIRMLVLGFGILKDEATVKSMSRYGDYRELLKGPLYYVATITLVCIFYWRTSPISIALICNLCAGDGLADIVG
RRFGSKKIFYNRNKSLVGSVAMASAGFLASVGYMYYFSSFGYVVGSNRMVLGFLVVSLASALVESLPISTEIDDNLTVPLTSLLAYIETRPASRSMSMPNEDNASSQTQL
SSNLFGDLLDSVIVDIASECHRIARLGLDRNLEEEEEELRLSAQARVRVADSSNSSEANGKYVVDIFGQTHPSVANEIFDCMNCGRSIMAGRFAPHLEKCMGRGRKARPK
VTRSSTAAQSRYSRGNPVSAYSPYPNSTSTNRLPNGTSSLAGEEYSNGTSEDP