; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G04730 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G04730
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionChalcone and stilbene synthase family protein
Genome locationClcChr02:4115549..4121263
RNA-Seq ExpressionClc02G04730
SyntenyClc02G04730
Gene Ontology termsGO:0009116 - nucleoside metabolic process (biological process)
GO:0010584 - pollen exine formation (biological process)
GO:0030639 - polyketide biosynthetic process (biological process)
GO:0016747 - transferase activity, transferring acyl groups other than amino-acyl groups (molecular function)
InterPro domainsIPR000845 - Nucleoside phosphorylase domain
IPR001099 - Chalcone/stilbene synthase, N-terminal
IPR011141 - Polyketide synthase, type III
IPR012328 - Chalcone/stilbene synthase, C-terminal
IPR016039 - Thiolase-like
IPR035994 - Nucleoside phosphorylase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF4356666.1 hypothetical protein G4B88_009643 [Cannabis sativa]3.3e-24364.27Show/hide
Query:  DNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQAL
        D   +AEE   SQ  +  PIS+ILI+IAMQTEALP+V KFQL+ED  S FP  VPWVRYHGIY++L IN++WPGKD +LGVDSVGTISASLVTYAS+QAL
Subjt:  DNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQAL

Query:  HPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIV
         PDLIINAGTAGGFKAKGA IGDVFL S+ AFHDRRIPIPVFDLYGVGL+ A  TP L KEL+LK              VGKLSTGDSLDM   DEASI 
Subjt:  HPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIV

Query:  ANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMK
        ANDAT+KDMEGAAVAYVADIFKVP IFVKAVTDIVDGEKPTAEEFLQNLA V+AAL+Q+VTQ+I                                    
Subjt:  ANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMK

Query:  EMANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEI
         M + +  +     KANPGKA ILALGKA P QLV+Q+YLVDGYF+DT+CDD+DLK+KLTRLCKTTTVKTRYVVMSEEIL+KYPELA+EG+ T+KQRL+I
Subjt:  EMANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEI

Query:  CNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG-------------------------------
        CN AVT+MAIEASQ+ I  WGRP+S ITHLVYVSSSE RLPGGDL+LA GLGL+ HTQR ML F GCSG                               
Subjt:  CNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG-------------------------------

Query:  -------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEY
                     GVALFGDGAGAM+IG++P LGIE+PLFELH A Q+F+P T   IDG+L+EEGISF + RELPQ+IEDN+E FCE  ++ +G  ++ Y
Subjt:  -------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEY

Query:  NKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKM-------GGG----------EIEEWGLILAFGPGISF
        NKMFWAVHPGGPAILNR+EKRL+L+PEKL ASRRALMDYGNASSN+IVYVLEYM+EES KM M       G G          E  EWGL+LAFGPGI+F
Subjt:  NKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKM-------GGG----------EIEEWGLILAFGPGISF

Query:  EGILARNL
        EGILARNL
Subjt:  EGILARNL

KAF4382049.1 hypothetical protein G4B88_006681 [Cannabis sativa]5.6e-24364.12Show/hide
Query:  DNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQAL
        D   +AEE   SQ  +  PIS+ILI+IAMQTEALP+V KFQL+ED  S FP  VPWVRYHGIY++L IN++WPGKD +LGVDSVGTISASLVTYAS+QAL
Subjt:  DNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQAL

Query:  HPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIV
         PDLIINAGTAGGFKAKGA IGDVFL S+ AFHDRRIPIPVFDLYGVGL+ A  TP L KEL+LK              VGKLSTGDSLDM   DEASI 
Subjt:  HPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIV

Query:  ANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMK
        ANDAT+KDMEGAAVAYVADIFKVP IFVKAVTDIVDGEKPTAEEFLQNLA V+AAL+Q+VTQ+I                                    
Subjt:  ANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMK

Query:  EMANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEI
         M + +  +     KANPGKA ILALGKA P QLV+Q+YLVDGYF+DT+CDD+DLK+KLTRLCKTTTVKTRYVVMSEEIL+KYPELA+EG+ T+KQRL+I
Subjt:  EMANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEI

Query:  CNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG-------------------------------
        CN AVT+MAIEASQ+ I  WGRP+S ITHLVYVSSSE RLPGGDL+LA GLGL+ HTQR ML F GCSG                               
Subjt:  CNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG-------------------------------

Query:  -------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEY
                     GVALFGDGAGAM+IG++P LGIE+PLFELH A Q+F+P T   IDG+L+EEGISF + RELPQ+IEDN+E FCE  ++ +G  +  Y
Subjt:  -------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEY

Query:  NKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKM-----------------GGGEIEEWGLILAFGPGISF
        NKMFWAVHPGGPAILNR+EKRL+L+PEKL ASRRALMDYGNASSN+IVYVLEYM+EES KM M                 G  E  EWGL+LAFGPGI+F
Subjt:  NKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKM-----------------GGGEIEEWGLILAFGPGISF

Query:  EGILARNL
        EGILARNL
Subjt:  EGILARNL

KAG6651619.1 hypothetical protein CIPAW_06G125600 [Carya illinoinensis]6.4e-25568.51Show/hide
Query:  AEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLI
        +E+ + SQ   + PIS+I+IIIAMQTEALPLV K QL+ED + VFPK VPWVRYHG Y++L INLIWPGKD ALGVDS+GT+SASLVTYASI+AL PDLI
Subjt:  AEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLI

Query:  INAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDAT
        INAGTAGGFKAKGA +GDVF+ S+CAFHDRRIPIPVFD+YGVGL+ A  TPNL KEL+LK              VGKLSTGDSLDMS+QDEASIVANDAT
Subjt:  INAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDAT

Query:  VKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMKEMANR
        +KDMEGAAVAYVAD+ KVP IFVKAVTDIVDGEKPTA+EFLQNLA V+AALDQAVTQ                                     +EM + 
Subjt:  VKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMKEMANR

Query:  NGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAV
        + A+G S   ANPGKA ILALGKA P QLV+QDYLVDGYF+DT+CDD  LKQKLTRLCKTTTV+TRYVVMSEEILKKYPELA+EG  T+KQRL+ICN AV
Subjt:  NGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAV

Query:  TDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG------------------------------------
        T MAIEASQA +KNWGRP+SDITHLVYVSSSEARLPGGDL+LA+GLGLSP TQR MLYF+GCSG                                    
Subjt:  TDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG------------------------------------

Query:  --------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFW
                GVALFGDGAGAMLIG++PVLG E+PLFELHTA Q+F+PDT+  IDG+L+EEGISF +ARELPQIIEDNIE FC+  ++ +G  ++EYNKMFW
Subjt:  --------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFW

Query:  AVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNL
        AVHPGGPAILNRMEKRLELLPEKL ASRRALMDYGNASSN+IVYVLEYM+EE+LK+K       EWGLILAFGPGISFEGILARNL
Subjt:  AVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNL

KAG7011746.1 Type III polyketide synthase B [Cucurbita argyrosperma subsp. argyrosperma]8.9e-24171.91Show/hide
Query:  MVLSKDNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYA
        M L +D  SSAE++MDSQTHS  PISSILIII                                                    GVDSVGTISASLVTYA
Subjt:  MVLSKDNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYA

Query:  SIQALHPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQD
        SIQAL PDLIINAGTAGGFKAKGA+IGDVFLVSECAFHDRRIPIPVFDLYGVGLK  LKTPNL   LDLK              VGKLSTGDSLDMS QD
Subjt:  SIQALHPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQD

Query:  EASIVANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFL
        EASI+ANDATVKDMEGAAVAYVAD+FKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALD+AVTQVI FISG    ++                  FL
Subjt:  EASIVANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFL

Query:  WKV-MKEMANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATI
         K  MK+M N NGAQGA  GKAN G+A ILALGKA PPQLV QDYLVDGYFKDTSCDD+DLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATI
Subjt:  WKV-MKEMANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATI

Query:  KQRLEICNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSGGVALFGDGAGAMLIGTNPVLGIERP
        KQRLEICNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLA+GLGLSPHTQR MLYFMGCSGGVA                L + + 
Subjt:  KQRLEICNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSGGVALFGDGAGAMLIGTNPVLGIERP

Query:  LFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMD
        L E +  ++  +  ++  I        I FTIAR+LPQIIEDNIE+FCE FLQT+GLQEKEYNKMFWAVHPGGPAIL+R+EKRLELLPEKLTASRRALMD
Subjt:  LFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMD

Query:  YGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV
        YGNASSN+IVYVLEYMVEESLK KM GGE EEWGLILAFGPGISFEGIL RNLAV
Subjt:  YGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV

OWM80362.1 hypothetical protein CDL15_Pgr019642 [Punica granatum]3.4e-24065.42Show/hide
Query:  DNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQAL
        D  ++AEE M +Q   K P+ +ILIIIAMQTEALP+V KFQL ED  +VFP+ VPWVRYHG+Y++L INLIWPGKD +LGVDSVGT+SASLVTYASIQAL
Subjt:  DNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQAL

Query:  HPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIV
         PDLIINAGTAGGFKAKGA IGDVFL SE AFHDRRIPIPVFDLYGVGL+ AL TP+L K+L+LK              VGKLSTGDSLDMS QDEA+I+
Subjt:  HPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIV

Query:  ANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDF---ISGNPSKQIDVATLLSFQIAKNRLIPKFLWK
        ANDATVKDMEGAAVAYVAD+ KVPAIFVKAVTD+VDG+KPTA+EFLQNLATV+AAL++ V+Q+  +   +    S+Q +  +  +    + RL    L++
Subjt:  ANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDF---ISGNPSKQIDVATLLSFQIAKNRLIPKFLWK

Query:  VMKEMANRNGAQGA-SKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQ
         M++      A+GA +K  A+PGKA ILALGKA P QLV+Q++LVDGYFK+T+CDD +L+QKLTRLCKTTTVKTRYVVM EEIL KYPELA+EG  T+KQ
Subjt:  VMKEMANRNGAQGA-SKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQ

Query:  RLEICNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG---------------------------
        RL+ICN AVT MAIEAS++ I+ WGRPVSDITHLVYVSSSEARLPGGDL+LARGLGL P T+R MLYFMGCSG                           
Subjt:  RLEICNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG---------------------------

Query:  -----------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQ
                         GVALFGDGAGA++IG +P   IE+PLFEL++++Q F+PDT+  IDG+ +EEGISF +ARELPQIIEDNIE FC+  +   G  
Subjt:  -----------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQ

Query:  EKEYNKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV
          +YNK+FWAVHPGGPAILNR+EKRLELLP KL ASRRALMDYGNASSN+IVYVLEYM+EE LKMK       EWGLILAFGPGI+ EGILARNL V
Subjt:  EKEYNKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV

TrEMBL top hitse value%identityAlignment
A0A218X7L0 Uncharacterized protein1.6e-24065.42Show/hide
Query:  DNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQAL
        D  ++AEE M +Q   K P+ +ILIIIAMQTEALP+V KFQL ED  +VFP+ VPWVRYHG+Y++L INLIWPGKD +LGVDSVGT+SASLVTYASIQAL
Subjt:  DNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQAL

Query:  HPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIV
         PDLIINAGTAGGFKAKGA IGDVFL SE AFHDRRIPIPVFDLYGVGL+ AL TP+L K+L+LK              VGKLSTGDSLDMS QDEA+I+
Subjt:  HPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIV

Query:  ANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDF---ISGNPSKQIDVATLLSFQIAKNRLIPKFLWK
        ANDATVKDMEGAAVAYVAD+ KVPAIFVKAVTD+VDG+KPTA+EFLQNLATV+AAL++ V+Q+  +   +    S+Q +  +  +    + RL    L++
Subjt:  ANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDF---ISGNPSKQIDVATLLSFQIAKNRLIPKFLWK

Query:  VMKEMANRNGAQGA-SKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQ
         M++      A+GA +K  A+PGKA ILALGKA P QLV+Q++LVDGYFK+T+CDD +L+QKLTRLCKTTTVKTRYVVM EEIL KYPELA+EG  T+KQ
Subjt:  VMKEMANRNGAQGA-SKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQ

Query:  RLEICNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG---------------------------
        RL+ICN AVT MAIEAS++ I+ WGRPVSDITHLVYVSSSEARLPGGDL+LARGLGL P T+R MLYFMGCSG                           
Subjt:  RLEICNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG---------------------------

Query:  -----------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQ
                         GVALFGDGAGA++IG +P   IE+PLFEL++++Q F+PDT+  IDG+ +EEGISF +ARELPQIIEDNIE FC+  +   G  
Subjt:  -----------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQ

Query:  EKEYNKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV
          +YNK+FWAVHPGGPAILNR+EKRLELLP KL ASRRALMDYGNASSN+IVYVLEYM+EE LKMK       EWGLILAFGPGI+ EGILARNL V
Subjt:  EKEYNKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV

A0A4Y1S0V5 Chalcone and stilbene synthase family protein3.3e-20163.12Show/hide
Query:  GVDSVGTISASLVTYASIQALHPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIM
        GVD VGT+ ASLVTYASIQAL PDLIINAGTAGGFKAKGA IGDV++ S+ AF DRRIPIPVFDLYG+GL+ AL TPNL KEL+LK              
Subjt:  GVDSVGTISASLVTYASIQALHPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIM

Query:  VGKLSTGDSLDMSAQDEASIVANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISG------NPSK
        VGKLSTGDSL MS QDEAS+VANDA VKDME AAVAYVAD+ KVP++F+K V DI DGE+ TAEE  Q+     AAL++AVTQV DFI+G       P K
Subjt:  VGKLSTGDSLDMSAQDEASIVANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISG------NPSK

Query:  QIDVATLLSFQIAKNRLIPKFLWKVMKEMANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYV
          D+  L        R   + L     +M + +  QG S  KAN G+A ILALGKA P QLV+QD+LVDGYF+DT+CDD +LKQKL RLCK       YV
Subjt:  QIDVATLLSFQIAKNRLIPKFLWKVMKEMANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYV

Query:  VMSEEILKKYPELAMEGQATIKQRLEICNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG----
        VMS+EIL+KYPEL  EG  TIKQRL ICN+AVT MAIEAS+A IKNWGRP SDITHLVYVSSSEARLPGGD++LA+GLGL P TQR +LYF GCSG    
Subjt:  VMSEEILKKYPELAMEGQATIKQRLEICNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG----

Query:  ----------------------------------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARE
                                                GVALFGDGAGAMLIG++P L  E+PLFELHTA Q+F+PDT+  IDG+++EEGISF + RE
Subjt:  ----------------------------------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARE

Query:  LPQIIEDNIESFCETFLQTIGLQEKEYNKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIE-EWG
        LPQIIED+IE FC   +  +G   KEYNKMFWAVHPGGPAILNR+EKRL+L PEKL ASRRAL DYGNASSN+IVYVLEYM+EES K+K    E + EWG
Subjt:  LPQIIEDNIESFCETFLQTIGLQEKEYNKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIE-EWG

Query:  LILAFGPGISFEGILARNLAV
        LILAFGPGI+FEGILARNLAV
Subjt:  LILAFGPGISFEGILARNLAV

A0A6N2MG63 Uncharacterized protein6.3e-23264.83Show/hide
Query:  AEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLI
        +EE M  Q  ++ PISSILI+IAMQTEA+P+V K QL ED   VFPK VPWVRYHGIY++L INL+ PGKD  LGVDSVGTISASLVTYA+IQAL PDLI
Subjt:  AEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLI

Query:  INAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDAT
        INAGTAGGFK KGA I DVFLVS+ AFHDRRIPIPVFDLYGVGL+    TPNL KEL+LK               GKLSTGDSLDMS QDEASIVANDAT
Subjt:  INAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDAT

Query:  VKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMKEMANR
        VKDMEGAAVAYVAD+FKVPAIF+KAVTDIVDG+KPTAEEFLQNLA V+AALD AV QV+DFISGN S                             M + 
Subjt:  VKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMKEMANR

Query:  NGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAV
           QG    KA+PGKA ILALGKA P QLV+Q++LVDG                            YVVMS+EILKKYPEL +EG  TIKQRL+ICN+AV
Subjt:  NGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAV

Query:  TDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG------------------------------------
        T MAIEAS+A IK WGR VSDITH+VYVSSSEARLPGGDL+LARGLGLSP TQR MLYF GCSG                                    
Subjt:  TDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG------------------------------------

Query:  --------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFW
                GVALFGDGAGAM+IGT+PV   E PLFELHTA Q F+PDT+  IDG+L+EEGISFT+ARELPQIIEDNIE FC+  +   GL  K+YNKMFW
Subjt:  --------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFW

Query:  AVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV
        AVHPGGPAILNRMEKRL+LLP+KL ASRRALMDYGNASSN+IVYVLEYM+EES KMK    +  +WGLILAFGPGI+FEGILARNLA+
Subjt:  AVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV

A0A7J6GIU9 Uncharacterized protein2.7e-24364.12Show/hide
Query:  DNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQAL
        D   +AEE   SQ  +  PIS+ILI+IAMQTEALP+V KFQL+ED  S FP  VPWVRYHGIY++L IN++WPGKD +LGVDSVGTISASLVTYAS+QAL
Subjt:  DNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQAL

Query:  HPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIV
         PDLIINAGTAGGFKAKGA IGDVFL S+ AFHDRRIPIPVFDLYGVGL+ A  TP L KEL+LK              VGKLSTGDSLDM   DEASI 
Subjt:  HPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIV

Query:  ANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMK
        ANDAT+KDMEGAAVAYVADIFKVP IFVKAVTDIVDGEKPTAEEFLQNLA V+AAL+Q+VTQ+I                                    
Subjt:  ANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMK

Query:  EMANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEI
         M + +  +     KANPGKA ILALGKA P QLV+Q+YLVDGYF+DT+CDD+DLK+KLTRLCKTTTVKTRYVVMSEEIL+KYPELA+EG+ T+KQRL+I
Subjt:  EMANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEI

Query:  CNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG-------------------------------
        CN AVT+MAIEASQ+ I  WGRP+S ITHLVYVSSSE RLPGGDL+LA GLGL+ HTQR ML F GCSG                               
Subjt:  CNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG-------------------------------

Query:  -------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEY
                     GVALFGDGAGAM+IG++P LGIE+PLFELH A Q+F+P T   IDG+L+EEGISF + RELPQ+IEDN+E FCE  ++ +G  +  Y
Subjt:  -------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEY

Query:  NKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKM-----------------GGGEIEEWGLILAFGPGISF
        NKMFWAVHPGGPAILNR+EKRL+L+PEKL ASRRALMDYGNASSN+IVYVLEYM+EES KM M                 G  E  EWGL+LAFGPGI+F
Subjt:  NKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKM-----------------GGGEIEEWGLILAFGPGISF

Query:  EGILARNL
        EGILARNL
Subjt:  EGILARNL

A0A7J6GMT7 Uncharacterized protein1.6e-24364.27Show/hide
Query:  DNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQAL
        D   +AEE   SQ  +  PIS+ILI+IAMQTEALP+V KFQL+ED  S FP  VPWVRYHGIY++L IN++WPGKD +LGVDSVGTISASLVTYAS+QAL
Subjt:  DNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQAL

Query:  HPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIV
         PDLIINAGTAGGFKAKGA IGDVFL S+ AFHDRRIPIPVFDLYGVGL+ A  TP L KEL+LK              VGKLSTGDSLDM   DEASI 
Subjt:  HPDLIINAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIV

Query:  ANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMK
        ANDAT+KDMEGAAVAYVADIFKVP IFVKAVTDIVDGEKPTAEEFLQNLA V+AAL+Q+VTQ+I                                    
Subjt:  ANDATVKDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMK

Query:  EMANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEI
         M + +  +     KANPGKA ILALGKA P QLV+Q+YLVDGYF+DT+CDD+DLK+KLTRLCKTTTVKTRYVVMSEEIL+KYPELA+EG+ T+KQRL+I
Subjt:  EMANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEI

Query:  CNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG-------------------------------
        CN AVT+MAIEASQ+ I  WGRP+S ITHLVYVSSSE RLPGGDL+LA GLGL+ HTQR ML F GCSG                               
Subjt:  CNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG-------------------------------

Query:  -------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEY
                     GVALFGDGAGAM+IG++P LGIE+PLFELH A Q+F+P T   IDG+L+EEGISF + RELPQ+IEDN+E FCE  ++ +G  ++ Y
Subjt:  -------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEY

Query:  NKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKM-------GGG----------EIEEWGLILAFGPGISF
        NKMFWAVHPGGPAILNR+EKRL+L+PEKL ASRRALMDYGNASSN+IVYVLEYM+EES KM M       G G          E  EWGL+LAFGPGI+F
Subjt:  NKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKM-------GGG----------EIEEWGLILAFGPGISF

Query:  EGILARNL
        EGILARNL
Subjt:  EGILARNL

SwissProt top hitse value%identityAlignment
O23674 Type III polyketide synthase A1.3e-10955.05Show/hide
Query:  ANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAVTDMAIEASQA
        AN GKA +LALGKA P Q+V Q+ LV+G+ +DT CDD  +K+KL  LCKTTTVKTRY V++ EIL KYPEL  EG  TIKQRLEI N+AV +MA+EAS  
Subjt:  ANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAVTDMAIEASQA

Query:  AIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSGGV--------------------------------------------
         IK WGRPV DITH+VYVSSSE RLPGGDL+L+  LGL     R MLYF+GC GGV                                            
Subjt:  AIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSGGV--------------------------------------------

Query:  ALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFWAVHPGGPAIL
        ALFGDGA A++IG +P    E P  ELH A Q+F+P TQN+I+G+L+EEGI+F + R+LPQ IE+NIE FC+  +   G +  E+N MFWAVHPGGPAIL
Subjt:  ALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFWAVHPGGPAIL

Query:  NRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNL
        NR+E +L+L  EKL +SRRAL+DYGN SSN+I+YV+EYM +E   +K  G   +EWGL LAFGPGI+FEG+L R+L
Subjt:  NRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNL

O81305 Type III polyketide synthase C2.8e-10453.83Show/hide
Query:  KGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAVTDMAIEA
        K  A  GKA +LALGKALP  +V Q+ LV+ Y ++  CD++ +K KL  LCK+TTVKTRY VMS E L KYPELA EG  TIKQRLEI N AV  MA EA
Subjt:  KGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAVTDMAIEA

Query:  SQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSGGV-----------------------------------------
        S   IK WGR V DITHLVYVSSSE RLPGGDL+L+  LGLS   QR MLYF+GC GG+                                         
Subjt:  SQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSGGV-----------------------------------------

Query:  ---ALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFWAVHPGGP
           ALFGDGA A++IG +P    E P  ELH A Q+F+P TQ +IDG+LSEEGI+F + R+LPQ IEDN+E FC+  +   G    E N +FWAVHPGGP
Subjt:  ---ALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFWAVHPGGP

Query:  AILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNL
        AIL+ +E +L+L PEKL  SRRALMDYGN SSN+I Y+++ + +E   ++  G E EEWGL LAFGPGI+FEG L RNL
Subjt:  AILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNL

Q7XA67 5'-methylthioadenosine/S-adenosylhomocysteine nucleosidase6.9e-8764.82Show/hide
Query:  KLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLIINAGTAGGFKA
        K PIS+I+ I+AMQ EA PL+ + +L E+  + FPKEV W+ + G+Y++L IN++ PGKDS LGV+SVGT+ ASLVTYASI A+ PDLIINAGTAGGFKA
Subjt:  KLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLIINAGTAGGFKA

Query:  KGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDATVKDMEGAAVAY
        KGA I DV++VS  AFHDRRIP+PV D+YGVG++    TPNL KEL+LK              VG+LSTGDS+DMS  DE SI ANDATVKDMEGAAVAY
Subjt:  KGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDATVKDMEGAAVAY

Query:  VADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISG
        VADIFKVP I +K VTDIVDG +PT+EEFL+NLA V+A LD+++T+VIDFISG
Subjt:  VADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISG

Q8LDM2 Type III polyketide synthase B2.1e-13964.54Show/hide
Query:  MANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEIC
        M + + A   S+ K+NPGKA ILALGKA P QLV+Q+YLVDGYFK T CDD +LKQKLTRLCKTTTVKTRYVVMSEEILKKYPELA+EG +T+ QRL+IC
Subjt:  MANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEIC

Query:  NKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG--------------------------------
        N AVT+MA+EAS+A IKNWGR +SDITH+VYVSSSEARLPGGDL+LA+GLGLSP T R +LYF+GCSG                                
Subjt:  NKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG--------------------------------

Query:  ------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYN
                    GVALFGDGAGAM+IG++P    E+PLFELHTA Q F+P+T+  IDG+L+E+GI+F ++RELPQIIEDN+E+FC+  +   GL  K YN
Subjt:  ------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYN

Query:  KMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV
        +MFWAVHPGGPAILNR+EKRL L PEKL+ SRRALMDYGNASSNSIVYVLEYM+EES K++    E  EWGLILAFGPG++FEGI+ARNL V
Subjt:  KMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV

Q9T0I8 5'-methylthioadenosine nucleosidase1.2e-9164.64Show/hide
Query:  EEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLII
        E  +D+Q+    PISS++ +IAMQ EALPLV KF LSE   S   K +PWV YHG++++L+IN++ PG+D+ALG+DSVGT+ ASL+T+ASIQAL PD+II
Subjt:  EEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLII

Query:  NAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDATV
        NAGT GGFK KGANIGDVFLVS+  FHDRRIPIP+FDLYGVGL+ A  TPNL KEL+LK              +G+LSTGDSLDMS QDE  I+ANDAT+
Subjt:  NAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDATV

Query:  KDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISG
        KDMEGAAVAYVAD+ K+P +F+KAVTD+VDG+KPTAEEFLQNL  V+AAL+   T+VI+FI+G
Subjt:  KDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISG

Arabidopsis top hitse value%identityAlignment
AT1G02050.1 Chalcone and stilbene synthase family protein9.2e-11155.05Show/hide
Query:  ANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAVTDMAIEASQA
        AN GKA +LALGKA P Q+V Q+ LV+G+ +DT CDD  +K+KL  LCKTTTVKTRY V++ EIL KYPEL  EG  TIKQRLEI N+AV +MA+EAS  
Subjt:  ANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAVTDMAIEASQA

Query:  AIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSGGV--------------------------------------------
         IK WGRPV DITH+VYVSSSE RLPGGDL+L+  LGL     R MLYF+GC GGV                                            
Subjt:  AIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSGGV--------------------------------------------

Query:  ALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFWAVHPGGPAIL
        ALFGDGA A++IG +P    E P  ELH A Q+F+P TQN+I+G+L+EEGI+F + R+LPQ IE+NIE FC+  +   G +  E+N MFWAVHPGGPAIL
Subjt:  ALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFWAVHPGGPAIL

Query:  NRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNL
        NR+E +L+L  EKL +SRRAL+DYGN SSN+I+YV+EYM +E   +K  G   +EWGL LAFGPGI+FEG+L R+L
Subjt:  NRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNL

AT4G00040.1 Chalcone and stilbene synthase family protein2.0e-10553.83Show/hide
Query:  KGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAVTDMAIEA
        K  A  GKA +LALGKALP  +V Q+ LV+ Y ++  CD++ +K KL  LCK+TTVKTRY VMS E L KYPELA EG  TIKQRLEI N AV  MA EA
Subjt:  KGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAVTDMAIEA

Query:  SQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSGGV-----------------------------------------
        S   IK WGR V DITHLVYVSSSE RLPGGDL+L+  LGLS   QR MLYF+GC GG+                                         
Subjt:  SQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSGGV-----------------------------------------

Query:  ---ALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFWAVHPGGP
           ALFGDGA A++IG +P    E P  ELH A Q+F+P TQ +IDG+LSEEGI+F + R+LPQ IEDN+E FC+  +   G    E N +FWAVHPGGP
Subjt:  ---ALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYNKMFWAVHPGGP

Query:  AILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNL
        AIL+ +E +L+L PEKL  SRRALMDYGN SSN+I Y+++ + +E   ++  G E EEWGL LAFGPGI+FEG L RNL
Subjt:  AILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNL

AT4G34840.1 Phosphorylase superfamily protein4.9e-8864.82Show/hide
Query:  KLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLIINAGTAGGFKA
        K PIS+I+ I+AMQ EA PL+ + +L E+  + FPKEV W+ + G+Y++L IN++ PGKDS LGV+SVGT+ ASLVTYASI A+ PDLIINAGTAGGFKA
Subjt:  KLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLIINAGTAGGFKA

Query:  KGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDATVKDMEGAAVAY
        KGA I DV++VS  AFHDRRIP+PV D+YGVG++    TPNL KEL+LK              VG+LSTGDS+DMS  DE SI ANDATVKDMEGAAVAY
Subjt:  KGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDATVKDMEGAAVAY

Query:  VADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISG
        VADIFKVP I +K VTDIVDG +PT+EEFL+NLA V+A LD+++T+VIDFISG
Subjt:  VADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISG

AT4G34850.1 Chalcone and stilbene synthase family protein1.5e-14064.54Show/hide
Query:  MANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEIC
        M + + A   S+ K+NPGKA ILALGKA P QLV+Q+YLVDGYFK T CDD +LKQKLTRLCKTTTVKTRYVVMSEEILKKYPELA+EG +T+ QRL+IC
Subjt:  MANRNGAQGASKGKANPGKAKILALGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEIC

Query:  NKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG--------------------------------
        N AVT+MA+EAS+A IKNWGR +SDITH+VYVSSSEARLPGGDL+LA+GLGLSP T R +LYF+GCSG                                
Subjt:  NKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSSSEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSG--------------------------------

Query:  ------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYN
                    GVALFGDGAGAM+IG++P    E+PLFELHTA Q F+P+T+  IDG+L+E+GI+F ++RELPQIIEDN+E+FC+  +   GL  K YN
Subjt:  ------------GVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETFLQTIGLQEKEYN

Query:  KMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV
        +MFWAVHPGGPAILNR+EKRL L PEKL+ SRRALMDYGNASSNSIVYVLEYM+EES K++    E  EWGLILAFGPG++FEGI+ARNL V
Subjt:  KMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV

AT4G38800.1 methylthioadenosine nucleosidase 18.7e-9364.64Show/hide
Query:  EEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLII
        E  +D+Q+    PISS++ +IAMQ EALPLV KF LSE   S   K +PWV YHG++++L+IN++ PG+D+ALG+DSVGT+ ASL+T+ASIQAL PD+II
Subjt:  EEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLII

Query:  NAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDATV
        NAGT GGFK KGANIGDVFLVS+  FHDRRIPIP+FDLYGVGL+ A  TPNL KEL+LK              +G+LSTGDSLDMS QDE  I+ANDAT+
Subjt:  NAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDATV

Query:  KDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISG
        KDMEGAAVAYVAD+ K+P +F+KAVTD+VDG+KPTAEEFLQNL  V+AAL+   T+VI+FI+G
Subjt:  KDMEGAAVAYVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCTTTCCAAGGACAATCCGAGCTCTGCAGAGGAATTCATGGATTCTCAGACTCACAGCAAGCTTCCGATTTCATCTATACTCATCATAATCGCTATGCAGACGGA
AGCACTTCCTTTGGTGGAGAAGTTTCAACTTTCCGAAGATAAAAAATCTGTGTTTCCAAAGGAGGTTCCGTGGGTTCGTTATCATGGGATTTATAGAAATCTTCAAATCA
ATTTAATTTGGCCTGGAAAGGATTCAGCCTTGGGAGTTGATAGTGTGGGTACGATTTCCGCATCGCTTGTGACCTATGCTTCTATTCAAGCATTGCACCCAGACCTGATC
ATAAACGCAGGCACTGCTGGTGGTTTTAAGGCGAAAGGAGCTAACATTGGCGATGTGTTTCTGGTATCCGAATGTGCCTTCCATGACAGACGTATACCGATTCCAGTTTT
CGATTTATACGGAGTCGGATTGAAGCCAGCACTGAAAACACCCAATCTCCATAAGGAACTCGACCTGAAGGTGAATTTCCTTTTTACAAAATTAATTAGAGTTAACATTA
TGGTTGGCAAATTATCAACAGGTGACTCGCTGGACATGTCTGCACAGGATGAAGCATCGATTGTAGCTAACGACGCCACGGTTAAAGATATGGAGGGAGCAGCAGTAGCC
TATGTGGCAGATATATTCAAAGTTCCTGCAATATTTGTAAAAGCTGTAACCGATATCGTCGATGGTGAAAAACCAACTGCAGAAGAATTCTTGCAGAATTTAGCTACAGT
TTCTGCTGCATTGGATCAAGCAGTCACACAAGTTATAGATTTCATCAGTGGAAACCCGAGCAAACAAATCGATGTAGCAACTTTGCTGTCCTTCCAAATTGCAAAGAATC
GTTTGATTCCCAAGTTTCTTTGGAAAGTTATGAAAGAGATGGCAAACAGAAATGGGGCACAAGGAGCTTCCAAGGGAAAGGCCAATCCTGGTAAAGCTAAAATTTTGGCT
CTAGGGAAAGCCTTACCTCCGCAACTTGTCCTTCAAGACTACCTGGTCGATGGCTATTTCAAGGACACGAGCTGCGATGATATAGACCTCAAGCAGAAGCTCACTCGACT
CTGCAAGACAACAACAGTCAAAACCAGATACGTGGTGATGTCAGAAGAGATCTTGAAGAAGTACCCAGAGCTAGCAATGGAAGGCCAAGCAACTATAAAGCAGAGACTAG
AGATTTGCAACAAAGCTGTGACAGACATGGCCATTGAAGCCTCACAAGCTGCCATTAAGAACTGGGGAAGGCCAGTTTCAGATATTACCCATTTAGTCTATGTCTCTTCA
AGTGAAGCTCGCCTCCCTGGTGGTGATCTCTTCCTAGCAAGGGGCCTTGGACTCAGCCCTCATACTCAACGAGCCATGCTTTACTTCATGGGTTGCTCTGGAGGCGTGGC
ACTATTTGGGGATGGTGCAGGGGCGATGCTAATTGGCACAAACCCTGTTTTGGGCATTGAACGGCCACTCTTTGAGCTCCACACAGCAACCCAGAAATTTATTCCGGATA
CCCAGAACATAATCGATGGAAAACTGTCGGAGGAAGGTATTAGTTTCACAATAGCAAGAGAACTTCCTCAGATAATCGAAGATAACATCGAAAGTTTCTGTGAGACATTT
CTTCAAACAATTGGTCTGCAAGAGAAAGAATACAACAAAATGTTCTGGGCAGTACATCCAGGTGGGCCAGCGATATTGAACAGGATGGAGAAGAGACTCGAACTGTTACC
TGAGAAGCTGACGGCGAGCCGGCGAGCTCTGATGGACTATGGAAATGCGAGCAGTAATTCGATTGTGTACGTACTGGAGTACATGGTGGAAGAAAGCTTGAAAATGAAGA
TGGGTGGTGGGGAAATTGAGGAATGGGGATTGATTCTGGCGTTTGGACCTGGGATTTCGTTTGAAGGGATTCTGGCGAGGAATCTTGCCGTCTGA
mRNA sequenceShow/hide mRNA sequence
AAAAGTACTTTGCTTTTGAAACAGCCGCCGTTGAAGCTGTTTTATAGTGACTCTTTGCCACGTTCTATACGATTTTCACAAATTTTCTACCCTTCAGTCGCTTCCAAGTC
CCTTGATTTTCCCTTAAATTGCTCAAAAATCTTAGCCATTGATAGCAAAGATCACCATGGTTCTTTCCAAGGACAATCCGAGCTCTGCAGAGGAATTCATGGATTCTCAG
ACTCACAGCAAGCTTCCGATTTCATCTATACTCATCATAATCGCTATGCAGACGGAAGCACTTCCTTTGGTGGAGAAGTTTCAACTTTCCGAAGATAAAAAATCTGTGTT
TCCAAAGGAGGTTCCGTGGGTTCGTTATCATGGGATTTATAGAAATCTTCAAATCAATTTAATTTGGCCTGGAAAGGATTCAGCCTTGGGAGTTGATAGTGTGGGTACGA
TTTCCGCATCGCTTGTGACCTATGCTTCTATTCAAGCATTGCACCCAGACCTGATCATAAACGCAGGCACTGCTGGTGGTTTTAAGGCGAAAGGAGCTAACATTGGCGAT
GTGTTTCTGGTATCCGAATGTGCCTTCCATGACAGACGTATACCGATTCCAGTTTTCGATTTATACGGAGTCGGATTGAAGCCAGCACTGAAAACACCCAATCTCCATAA
GGAACTCGACCTGAAGGTGAATTTCCTTTTTACAAAATTAATTAGAGTTAACATTATGGTTGGCAAATTATCAACAGGTGACTCGCTGGACATGTCTGCACAGGATGAAG
CATCGATTGTAGCTAACGACGCCACGGTTAAAGATATGGAGGGAGCAGCAGTAGCCTATGTGGCAGATATATTCAAAGTTCCTGCAATATTTGTAAAAGCTGTAACCGAT
ATCGTCGATGGTGAAAAACCAACTGCAGAAGAATTCTTGCAGAATTTAGCTACAGTTTCTGCTGCATTGGATCAAGCAGTCACACAAGTTATAGATTTCATCAGTGGAAA
CCCGAGCAAACAAATCGATGTAGCAACTTTGCTGTCCTTCCAAATTGCAAAGAATCGTTTGATTCCCAAGTTTCTTTGGAAAGTTATGAAAGAGATGGCAAACAGAAATG
GGGCACAAGGAGCTTCCAAGGGAAAGGCCAATCCTGGTAAAGCTAAAATTTTGGCTCTAGGGAAAGCCTTACCTCCGCAACTTGTCCTTCAAGACTACCTGGTCGATGGC
TATTTCAAGGACACGAGCTGCGATGATATAGACCTCAAGCAGAAGCTCACTCGACTCTGCAAGACAACAACAGTCAAAACCAGATACGTGGTGATGTCAGAAGAGATCTT
GAAGAAGTACCCAGAGCTAGCAATGGAAGGCCAAGCAACTATAAAGCAGAGACTAGAGATTTGCAACAAAGCTGTGACAGACATGGCCATTGAAGCCTCACAAGCTGCCA
TTAAGAACTGGGGAAGGCCAGTTTCAGATATTACCCATTTAGTCTATGTCTCTTCAAGTGAAGCTCGCCTCCCTGGTGGTGATCTCTTCCTAGCAAGGGGCCTTGGACTC
AGCCCTCATACTCAACGAGCCATGCTTTACTTCATGGGTTGCTCTGGAGGCGTGGCACTATTTGGGGATGGTGCAGGGGCGATGCTAATTGGCACAAACCCTGTTTTGGG
CATTGAACGGCCACTCTTTGAGCTCCACACAGCAACCCAGAAATTTATTCCGGATACCCAGAACATAATCGATGGAAAACTGTCGGAGGAAGGTATTAGTTTCACAATAG
CAAGAGAACTTCCTCAGATAATCGAAGATAACATCGAAAGTTTCTGTGAGACATTTCTTCAAACAATTGGTCTGCAAGAGAAAGAATACAACAAAATGTTCTGGGCAGTA
CATCCAGGTGGGCCAGCGATATTGAACAGGATGGAGAAGAGACTCGAACTGTTACCTGAGAAGCTGACGGCGAGCCGGCGAGCTCTGATGGACTATGGAAATGCGAGCAG
TAATTCGATTGTGTACGTACTGGAGTACATGGTGGAAGAAAGCTTGAAAATGAAGATGGGTGGTGGGGAAATTGAGGAATGGGGATTGATTCTGGCGTTTGGACCTGGGA
TTTCGTTTGAAGGGATTCTGGCGAGGAATCTTGCCGTCTGA
Protein sequenceShow/hide protein sequence
MVLSKDNPSSAEEFMDSQTHSKLPISSILIIIAMQTEALPLVEKFQLSEDKKSVFPKEVPWVRYHGIYRNLQINLIWPGKDSALGVDSVGTISASLVTYASIQALHPDLI
INAGTAGGFKAKGANIGDVFLVSECAFHDRRIPIPVFDLYGVGLKPALKTPNLHKELDLKVNFLFTKLIRVNIMVGKLSTGDSLDMSAQDEASIVANDATVKDMEGAAVA
YVADIFKVPAIFVKAVTDIVDGEKPTAEEFLQNLATVSAALDQAVTQVIDFISGNPSKQIDVATLLSFQIAKNRLIPKFLWKVMKEMANRNGAQGASKGKANPGKAKILA
LGKALPPQLVLQDYLVDGYFKDTSCDDIDLKQKLTRLCKTTTVKTRYVVMSEEILKKYPELAMEGQATIKQRLEICNKAVTDMAIEASQAAIKNWGRPVSDITHLVYVSS
SEARLPGGDLFLARGLGLSPHTQRAMLYFMGCSGGVALFGDGAGAMLIGTNPVLGIERPLFELHTATQKFIPDTQNIIDGKLSEEGISFTIARELPQIIEDNIESFCETF
LQTIGLQEKEYNKMFWAVHPGGPAILNRMEKRLELLPEKLTASRRALMDYGNASSNSIVYVLEYMVEESLKMKMGGGEIEEWGLILAFGPGISFEGILARNLAV