; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027088 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027088
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein
Genome locationtig00153048:922586..928678
RNA-Seq ExpressionSgr027088
SyntenySgr027088
Gene Ontology termsNA
InterPro domainsIPR005123 - Oxoglutarate/iron-dependent dioxygenase
IPR026992 - Non-haem dioxygenase N-terminal domain
IPR027443 - Isopenicillin N synthase-like superfamily
IPR044861 - Isopenicillin N synthase-like, Fe(2+) 2OG dioxygenase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF9662833.1 hypothetical protein SADUNF_Sadunf18G0095400 [Salix dunnii]2.4e-21253.78Show/hide
Query:  SVQDVASKG-EVPERYIHEECDRGALDA--PSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMEDKLKYS
        SVQ++A+ G E P +Y ++  D G LDA  P ++ PV+D+GLL+S S +  ELEKL+L L +WGCFQ INHGM+  FL+++R+V+KQFFA PME+K KYS
Subjt:  SVQDVASKG-EVPERYIHEECDRGALDA--PSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMEDKLKYS

Query:  REEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSARFNFYP
        RE D IEGYGND +++   T+DWTDRL+LT+ PE+    K WP NP+ FRE + EYT  ++  ++ +L+AMA+SL+L+ES FL+QYGE+  ++ARFNFYP
Subjt:  REEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSARFNFYP

Query:  RCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQV--EGIDK---------LREGEISVAVFYVPDSEKELEPLEKL
         C KPD +LGV+PH D SA+T LLQDKEVEGLQFLKDNEWF VPIIP ALLVNVGDQ+   GI K          +    ++AVF +P+S+ E++P + L
Subjt:  RCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQV--EGIDK---------LREGEISVAVFYVPDSEKELEPLEKL

Query:  VNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAKESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPLPLAEIPVVD
        ++ET+P LY+ VK++VS+  Q YQQGK PIEA   S   +  +     P+ ++    +K++QE ++ N  E P  Y Y+DG   G  +  LPL E+PV+D
Subjt:  VNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAKESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPLPLAEIPVVD

Query:  LEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQL
        +++L S  T     E+L  AL S GCF +INH I   FLD++R ++ QFF+LPM EK K+SR  D  EGYG+D+I SE QILDWTDRL   V+PED+RQL
Subjt:  LEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQL

Query:  KYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGE-RPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNN
        K WPE P+ FRE L E T KLK I+E VL AMA+ LN+E   F D+ GE R +M  RFNF+PPC RPD  LG K H+DGSAIT+VL D+EVEGL+   ++
Subjt:  KYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGE-RPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNN

Query:  QWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPIDRLK
        QWFRVP+  +  +LLIN+G+  E+MSNG FK PVHRVVTNSERERTS+A FC+P+ D +I+P++ +++ETRPRLY  +++Y   YFQ YQKG+RPI+ ++
Subjt:  QWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPIDRLK

Query:  I
        I
Subjt:  I

KAG6589813.1 Protein SRG1, partial [Cucurbita argyrosperma subsp. sororia]8.9e-22356.28Show/hide
Query:  MAGINPSGSVQDVASKGEVPERYIHEECDRGALDAPSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMED
        MAG NPSG+VQDVASKGEVPERYIH+E DRGA DAP M  PVID+ LLSSSS +GPELEKLR GLQSWGCFQ +NHGMS EFL+E+R++AKQFF LPME+
Subjt:  MAGINPSGSVQDVASKGEVPERYIHEECDRGALDAPSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMED

Query:  KLKYSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSAR
        K KYSREED IEGYGND +++ Q  LDWTDRL+LTVYP+E H FKYWP NPERFR V+ EYTANV+++SE ILKAMA+SLDL+E SF+ QYGE +KL AR
Subjt:  KLKYSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSAR

Query:  FNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE----GIDK------LREGE---ISVAVFYVPDSEKE
        FNFYPRC+ PDLVLGV+PH DGSAITILLQD+EVEGLQFL  NEWFN PI+P ALL+NVGDQ E    GI K      L   E   IS+AVFY+PD EKE
Subjt:  FNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE----GIDK------LREGE---ISVAVFYVPDSEKE

Query:  LEPLEKLVNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAKESP--AANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPL
        +EPLEKL++ET+PR     +  ++ ++         I AA   P   A P++MA S+PS         ++QE  + +  E PE Y +K    G       
Subjt:  LEPLEKLVNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAKESP--AANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPL

Query:  PLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLT
        PL +  V+D+  L++SP     LE+LR  L +WGC   INH +   FL+ +R + KQFF+ PM EK K+S E    EGYG+D + S+ QILDW  RL+LT
Subjt:  PLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLT

Query:  VNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVE
        + PE+ R++K+WP NP  FRE + E +  ++ + E +L AM + L++E   F +Q GER  ++ RFN YPPC  PDLVLG K H+D SAIT++L D++VE
Subjt:  VNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVE

Query:  GLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKG
        GL+  K ++WF  P+  + D+LL+ +G+Q EI SNGIFK PVHRV+TNS+ ERTSLA F  P+  KE+ P+E LI+ET+PR Y  VKN +D YF+ YQ+G
Subjt:  GLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKG

Query:  QRPIDRLKI
        +RPI+  +I
Subjt:  QRPIDRLKI

KAG7023484.1 putative 2-oxoglutarate-dependent dioxygenase ANS [Cucurbita argyrosperma subsp. argyrosperma]8.7e-20257.53Show/hide
Query:  MAGINPSGSVQDVASKGEVPERYIHEECDRGALDAPSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMED
        MAG NPSG+VQDVASKGEVPERYIH+E DRGA DAP M  PVID+ LLSSSS +GPELEKLR GLQSWGCFQ +NHGMS EFL+E+R++AKQFF LPME+
Subjt:  MAGINPSGSVQDVASKGEVPERYIHEECDRGALDAPSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMED

Query:  KLKYSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSAR
        K KYSREED IEGYGND +++ Q  LDWTDRL+LTVYP+E H FKYWP NPERFR V+ EYTANV+++SE ILKAMA+SLDL+E SF+ QYGE +KL AR
Subjt:  KLKYSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSAR

Query:  FNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE----GIDK------LREGE---ISVAVFYVPDSEKE
        FNFYPRC+ PDLVLGV+PH DGSAITILLQD+EVEGLQFL  NEWFN PI+P ALL+NVGDQ E    GI K      L   E   IS+AVFY+PD EKE
Subjt:  FNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE----GIDK------LREGE---ISVAVFYVPDSEKE

Query:  LEPLEKLVNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAKESP--AANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPL
        +EPLEKL++ET+PRLYKTVKNFV L  Q     +  I AA   P   A P++MA S+PS         ++QE  + +  E PE Y +K    G       
Subjt:  LEPLEKLVNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAKESP--AANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPL

Query:  PLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLT
        PL +  V+D+  L++SP     LE+LR  L +WGC   INH +   FL+ +R + KQFF+ PM EK K+S E    EGYG+D + S+ QILDW  RL+LT
Subjt:  PLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLT

Query:  VNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVE
        + PE+ R++K+WP NP  FRE + E +  ++ + E +L AM + L++E   F +Q GER  ++ RFN YPPC  PDLVLG K H+D SAIT++L D++VE
Subjt:  VNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVE

Query:  GLELQKNNQWFRVPVPAMADSLLINLGEQAE
        GL+  K ++WF  P+  + D+LL+ +G+Q E
Subjt:  GLELQKNNQWFRVPVPAMADSLLINLGEQAE

XP_022134811.1 uncharacterized protein LOC111006992 [Momordica charantia]0.0e+0074.93Show/hide
Query:  MAGINPSGSVQDVASKGEVPERYIHEECDRGALDAPSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMED
        MAG NP+GSVQDVASKGEVPERYIH+ECDRGALDAP M+ PVIDIGLLSS SN GPELEKLR GL SWGCFQAINHGMSPEFLEEVRQVAK FFALPMED
Subjt:  MAGINPSGSVQDVASKGEVPERYIHEECDRGALDAPSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMED

Query:  KLKYSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSAR
        KLK+SREED +EGYGND + + Q  LDWTDRL+LTV PEE   FKYWP NPERFREV+ EYTANV+++SE ILKAMA SLDLDE+SFLNQYG+R++L AR
Subjt:  KLKYSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSAR

Query:  FNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE----GIDKLR---------EGEISVAVFYVPDSEKE
        FNFY RC+ P+LVLGV+PH DGSAITILLQDKEVEGLQFLK NEW+N PIIPDALLVNVGDQ E    GI K R            IS+AVFY+PD +KE
Subjt:  FNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE----GIDKLR---------EGEISVAVFYVPDSEKE

Query:  LEPLEKLVNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAK--ESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGG-GFNDP
        +EPLEKL+NET PRLY+TVKNFV L  Q YQQG+ P EAAK        P+NMAE     L DYF TKT Q++LL+NGG+TPE Y YKDGYGGG   N+P
Subjt:  LEPLEKLVNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAK--ESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGG-GFNDP

Query:  LPLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYL
        LPLAEIPVVDL QL+SSP   A LE+LRLAL+SWGCFQAINHSI SSFL+K+ QIS QFFSLPM EKNK  RE+ G EGYG+D++FSEQQILDWTDRLYL
Subjt:  LPLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYL

Query:  TVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEV
         VNPEDERQLKYWP+NPQSFREDLHE TIKLKQIIETVL+AMA+ +NVE   F++QVG+RP +  RFNFYPPCSRPDLVLGLKEHSDGSAITIVLLD+EV
Subjt:  TVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEV

Query:  EGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQK
        EGL+ +K++QWFRVPVPAMADSLLIN+GEQAEIMSNG+FK  VHR VTNSE++R S+ACFC PE+D+EI+PIE LI+E RPRLY +VKNY+ +YFQ YQK
Subjt:  EGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQK

Query:  GQRPIDRLKI
        GQRP+D+LKI
Subjt:  GQRPIDRLKI

XP_028227114.1 zinc finger CCCH domain-containing protein 19-like [Glycine soja]9.9e-19852.72Show/hide
Query:  VPERYIHEECDRGALDA--PSMDG--PVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMEDKLKYSREED--GIE
        +P+ YI+EE   G  DA  PS D   PVID+  LSSSS +  EL KL   L SWGCFQAINHGM   FL++VR+V+KQFF LP E+K K +RE +   IE
Subjt:  VPERYIHEECDRGALDA--PSMDG--PVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMEDKLKYSREED--GIE

Query:  GYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSARFNFYPRCQKPDL
        GYGND + +K   LDWTDR++L V PE+   F +WP  P  FR  VL+YT ++R++SE ILKAMA SL+L+E  FLN+ GER  +  R N+YP C  PD 
Subjt:  GYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSARFNFYPRCQKPDL

Query:  VLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE--------------GIDKLREGEISVAVFYVPDSEKELEPLEKLVNET
        VLGV+PH DGS IT LLQDKEVEGLQ LKD++WF VPIIPDALL+NVGDQ+E               I+K +E  ++VA+F VPDSEKE++P++KLVNE+
Subjt:  VLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE--------------GIDKLREGEISVAVFYVPDSEKELEPLEKLVNET

Query:  QPRLYKTVKNFVSLIIQCYQQGKTPIEAAKESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPLPLAE---IPVVDL
        +P LY+ VKN+V +  Q YQQGK PIEA+K   +              +     K +Q+ L++N    P+ Y Y++  GG GF D L  ++   IPV+DL
Subjt:  QPRLYKTVKNFVSLIIQCYQQGKTPIEAAKESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPLPLAE---IPVVDL

Query:  EQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLK
         +L+S  T    L +L  AL SWGCFQAINH + SSFLDK+R++SKQFF LP  EK K +RE +  EGYG+D+I+SE Q LDWTDR+YL V PEDER+ K
Subjt:  EQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLK

Query:  YWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQW
        +WP+NP  FR  + + T  ++ + E ++ AMAK LN+E   F ++ GER  M +RFN+YPPC  PD VLGLK H+DGS IT +L DKEVEGL++ K++QW
Subjt:  YWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQW

Query:  FRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNY
        F+VP+  + D+L+IN+G+Q EIMSNGIF+ P+HR V NSE+ER ++A FC  + +KEIKP+E+L+NE+RP LY  +  Y
Subjt:  FRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNY

TrEMBL top hitse value%identityAlignment
A0A0A0LW96 Uncharacterized protein6.3e-19053.17Show/hide
Query:  MAGINPSGSVQDVASKGEVPERYIHEECDRGALDAPSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMED
        MA  NPSG+VQD ASKGEVPERYIH+E DRGA +AP+M  PVIDI LLSSSS +GPELE+LR  LQSWGCF                Q+  + F    + 
Subjt:  MAGINPSGSVQDVASKGEVPERYIHEECDRGALDAPSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMED

Query:  KLKYSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSAR
         + Y                       W D                        +EV+ EYT NV++ISE I KAMA SLDLDESSFL QYGE+IKL AR
Subjt:  KLKYSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSAR

Query:  FNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVEGIDKLREGEISVAVFYVPDSEKELEPLEKLVNETQP
        FNFYPRC+ PDLVLGV+PH DGSAITILLQDKEVEGLQF+KDNEWFN  I+PDALLVNVGDQVE                             +  ET  
Subjt:  FNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVEGIDKLREGEISVAVFYVPDSEKELEPLEKLVNETQP

Query:  RLYKTVKNFVSLIIQCYQQGKTPIEAAKESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPLPLAEIPVVDLEQLAS
            TVK+       C Q G                        L+ D+   K L +  L++   +     + + Y                     L S
Subjt:  RLYKTVKNFVSLIIQCYQQGKTPIEAAKESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPLPLAEIPVVDLEQLAS

Query:  SPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPEN
           G   L +LRLALS+WGCFQA NHSI SSFL+K+R+IS+QFFSLP+ EK ++ REVDG EGYG+D+ FS QQ LDW+DRLY   +PEDER+L  WP N
Subjt:  SPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPEN

Query:  PQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPV
        P SFREDLHE T+K+ +IIETVL+AMA+ LNVE   F+DQVGERP +  RFNFYPPCS P LVLGLKEHSDGSAITI+LLDK+VEGL+L+K++QW+RVPV
Subjt:  PQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPV

Query:  PAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPIDRLKI
        PA+ADSLL+ +GEQAE+MSNGIFK  VHR VTNSER+R S+ CFC PE+D EIKP+E LI+E RPRL+ SVKNYL+TYFQ+YQ+GQR +D L+I
Subjt:  PAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPIDRLKI

A0A498KDN2 Uncharacterized protein1.0e-17948.35Show/hide
Query:  SVQDVASKGE-VPERYIHEECDRGALDA--PSMDG-PVIDIGLLSSSS------NAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPM
        +VQ+++  GE  P  YI +E   G++D+  P  D  P+IDI L S SS          EL+KLR  L S GCFQA  HG+S   L++VR+ AKQFFALP+
Subjt:  SVQDVASKGE-VPERYIHEECDRGALDA--PSMDG-PVIDIGLLSSSS------NAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPM

Query:  EDKLKYSREED-GIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQ-YGERIK
        E+K KYSR  D G EGYGND +V+ +  LDW+ RL L V+P++      WP  P  F E++ EYT  ++++   + KAMA SL+L+E SF  Q  G++  
Subjt:  EDKLKYSREED-GIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQ-YGERIK

Query:  LSARFNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE----GIDK------LREGE---ISVAVFYVPD
        + ARFNFYP C + DLVLGV+PHTD S +T LLQDK+VEGLQ L D +W  VPI+P A+++N+GDQ++    GI K      +  GE   +SVA+F  PD
Subjt:  LSARFNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE----GIDK------LREGE---ISVAVFYVPD

Query:  SEKELEPLEKLVNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAKESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFND
         E E+ P+E L++ET+PRLYK VKN+  +  +CYQ+G+  +E  K        N+ +       +   +KT+QE +L+ G + P+ Y  K     GG   
Subjt:  SEKELEPLEKLVNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAKESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFND

Query:  PLP------LAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILD
        P+P      L ++P++DL  LA+S      LEE R AL++WGCFQ INHS+   FLD++R+I+KQFF+LP+ +K K+ R+V+  +GYG+D++FSEQQ LD
Subjt:  PLP------LAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILD

Query:  WTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITI
        W+DRLYL+V P+D R+LK+WP+NP SF   L +  ++L+ + +T+L  M + L++E   FSD  GE+  M++RFNFYPPC RPDLVLGLK HSDG+ IT+
Subjt:  WTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITI

Query:  VLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVK
        VL DKEVEGL+  K + WFR P+  + D+LLIN+G+  EI+SNGIFK PVH+VVTN  +ER SLA FC PE  KEI+P E L++E+RPRLY  +K
Subjt:  VLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVK

A0A6J0ZRW3 uncharacterized protein LOC1104115041.3e-17947.55Show/hide
Query:  SVQDVASKGEVP-ERYIHEECDRGALDAPS---MDGPVIDIGLLSSSSNAGP------ELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPM
        SVQ+++  G+ P   YI ++   G+ D PS      P++DI   SS S++ P      ELEKL+  L S GCFQAI HG+S  FL++VR VAKQFFALP 
Subjt:  SVQDVASKGEVP-ERYIHEECDRGALDAPS---MDGPVIDIGLLSSSSNAGP------ELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPM

Query:  EDKLKYSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLS
         +K KY+R  +  EGYG+D +V+ +  LDW+ RL L V+PE+   F  WP NP  FR+++ EY + ++V+++ + KAMA +L+L+E+SFLNQ+G+R  + 
Subjt:  EDKLKYSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLS

Query:  ARFNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE----GI------------DKLREGEISVAVFYVP
        ARFNFYP C +PDLVLGV+PHTD S IT+LLQD+EVEGLQ L D++W  VP+IP AL++N+GDQ++    GI            +KLR   ISVA+F  P
Subjt:  ARFNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE----GI------------DKLREGEISVAVFYVP

Query:  DSEKELEPLEKLVNETQPR-LYKTVKNFVSLIIQCYQQGKTPIEAAKESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKD-GYGGGG
        + E E+  ++ LV+E +PR   +T     S         +T  +AAK  P    +NMA   P        +K++QE + +NG E P  +  KD  +    
Subjt:  DSEKELEPLEKLVNETQPR-LYKTVKNFVSLIIQCYQQGKTPIEAAKESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKD-GYGGGG

Query:  FNDPLPLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTD
        ++D      +PV+D+  L   P+    LE+LR AL S GCFQA+ H I +SF+DK+R+++KQFF  P  EK K+SR V+  EGYG D+I SE+Q+LDW  
Subjt:  FNDPLPLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTD

Query:  RLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLL
        RL+L V PE +R+L  WPENP +FRE LHE +IKLKQ+++ +  AMAK LN+E K FSDQ G+ P+M +RFNFYPPCSRPD VLG+K HSD S +T++L 
Subjt:  RLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLL

Query:  DKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQ
        D+EVEGL++ K+++W  VPV  +  +L++NLG+Q +IMSNGIFK P+HRVVTN+++ R S+A F   E DKEI P+E LI++ RPRLY +VKNY    ++
Subjt:  DKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQ

Query:  SYQKGQRPIDRLK
         +QKG+  ++ +K
Subjt:  SYQKGQRPIDRLK

A0A6J1BYU1 uncharacterized protein LOC1110069920.0e+0074.93Show/hide
Query:  MAGINPSGSVQDVASKGEVPERYIHEECDRGALDAPSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMED
        MAG NP+GSVQDVASKGEVPERYIH+ECDRGALDAP M+ PVIDIGLLSS SN GPELEKLR GL SWGCFQAINHGMSPEFLEEVRQVAK FFALPMED
Subjt:  MAGINPSGSVQDVASKGEVPERYIHEECDRGALDAPSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMED

Query:  KLKYSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSAR
        KLK+SREED +EGYGND + + Q  LDWTDRL+LTV PEE   FKYWP NPERFREV+ EYTANV+++SE ILKAMA SLDLDE+SFLNQYG+R++L AR
Subjt:  KLKYSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSAR

Query:  FNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE----GIDKLR---------EGEISVAVFYVPDSEKE
        FNFY RC+ P+LVLGV+PH DGSAITILLQDKEVEGLQFLK NEW+N PIIPDALLVNVGDQ E    GI K R            IS+AVFY+PD +KE
Subjt:  FNFYPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVE----GIDKLR---------EGEISVAVFYVPDSEKE

Query:  LEPLEKLVNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAK--ESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGG-GFNDP
        +EPLEKL+NET PRLY+TVKNFV L  Q YQQG+ P EAAK        P+NMAE     L DYF TKT Q++LL+NGG+TPE Y YKDGYGGG   N+P
Subjt:  LEPLEKLVNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAK--ESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGG-GFNDP

Query:  LPLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYL
        LPLAEIPVVDL QL+SSP   A LE+LRLAL+SWGCFQAINHSI SSFL+K+ QIS QFFSLPM EKNK  RE+ G EGYG+D++FSEQQILDWTDRLYL
Subjt:  LPLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYL

Query:  TVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEV
         VNPEDERQLKYWP+NPQSFREDLHE TIKLKQIIETVL+AMA+ +NVE   F++QVG+RP +  RFNFYPPCSRPDLVLGLKEHSDGSAITIVLLD+EV
Subjt:  TVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEV

Query:  EGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQK
        EGL+ +K++QWFRVPVPAMADSLLIN+GEQAEIMSNG+FK  VHR VTNSE++R S+ACFC PE+D+EI+PIE LI+E RPRLY +VKNY+ +YFQ YQK
Subjt:  EGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQK

Query:  GQRPIDRLKI
        GQRP+D+LKI
Subjt:  GQRPIDRLKI

A0A6N2N704 Uncharacterized protein2.9e-18749.42Show/hide
Query:  SGSVQDVASKG-EVPERYIHEECDRGALDA--PSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMEDKLK
        S SVQ++AS G E P +Y ++  D G LD+  P ++ PV+D+GLL S S +  EL+KL+L L +WGCFQ INHGM+  FL+++R+V+KQFFA PME+K K
Subjt:  SGSVQDVASKG-EVPERYIHEECDRGALDA--PSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMEDKLK

Query:  YSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSARFNF
        YSRE D IEGYGND +++   T+DWTDRL+LT+ PE+    K+WP NPE FRE + EYT  ++  ++ +L+AMA SL+L+ES FL+QYGE+  ++ARFNF
Subjt:  YSREEDGIEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSARFNF

Query:  YPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVEGIDKLREGEISVAVFYVPDSEKELEPLEKLVNETQPRLY
        YP C KP  +LGV+PH D SA+T LLQDKEVEGLQFLKDNEWF   I+ + +  +   +V  +   +    ++AVF +P+S+ E++P + L++E++P LY
Subjt:  YPRCQKPDLVLGVRPHTDGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVEGIDKLREGEISVAVFYVPDSEKELEPLEKLVNETQPRLY

Query:  KTVKNFVSLIIQCYQQGKTPIEAAKESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPLPLAEIPVVDLEQLASSPT
        + VK++VS+  Q YQQGK PIEA                                ++ N  E P  Y Y+DG   G  +  LPL E+P++D+ +L S  T
Subjt:  KTVKNFVSLIIQCYQQGKTPIEAAKESPAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPLPLAEIPVVDLEQLASSPT

Query:  GAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQS
            +E+L  AL S GCF  ++                QFF+ PM EK K+SR  D  EGYG+D I SE QILDWTDRLY  V+PED+RQL+ WPE P+ 
Subjt:  GAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQS

Query:  FREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVG-ERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPA
        FRE L E T KLK I+E VL AMA+ LN+E   F D+ G ER +M  RFNF+PPC RPD  LG K H+DGSAITIVL DKEVEGL+   ++QWFRVP+  
Subjt:  FREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVG-ERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPA

Query:  MADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPIDRLKI
        +  +LLIN+G+  E         PVHRVVTNSERERTS+A FC+P+ D +I+P++ +++ETRPRLY  +++Y   Y Q YQKG+RPI+ +KI
Subjt:  MADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPIDRLKI

SwissProt top hitse value%identityAlignment
A2A1A0 S-norcoclaurine synthase 13.0e-4833.44Show/hide
Query:  EIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNP
        EIPV+DL +L         L +   A   WG FQ INH +    ++KM+  ++ FF LP  EKN + +  +G EGYG   + SE+Q LDW D  +L   P
Subjt:  EIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNP

Query:  EDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLE
          ER +++WP +P SFRE + + +++L+++   +   MAK L +E +  +  +  R + +      P  S     LGL  HSD + +T+++   EV GL 
Subjt:  EDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLE

Query:  LQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSV--KNYLDTYFQSYQKGQ
        ++K+ +W  VP+  +  + ++N+G+  EIMSNGI+K   HR V N+++ER S+A F  PE   +I P+ +L+ E   + Y ++  ++YL         G+
Subjt:  LQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSV--KNYLDTYFQSYQKGQ

Query:  RPIDRLKI
          +D++K+
Subjt:  RPIDRLKI

D4N502 Codeine O-demethylase8.2e-5437.15Show/hide
Query:  TLQEQLLVNGGETPERYTYK-----DGYGGGGFNDPLPLAEIPVVDLEQLASSP--TGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSL
        ++QE   +   E P RYT       +  G    +D      +PV+DL+ L S     G   L++L  A   WG FQ +NH + +  +D ++   K FF+L
Subjt:  TLQEQLLVNGGETPERYTYK-----DGYGGGGFNDPLPLAEIPVVDLEQLASSP--TGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSL

Query:  PMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLN-VEVKGFSDQVGERP
        PM EK K+ ++    EG+G   I SE Q LDWT+   +   P   R+   +PE P  FRE L     K+K++   V   + K L  VE+KG +D + E  
Subjt:  PMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLN-VEVKGFSDQVGERP

Query:  IMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFC
        + +MR N+YPPC RP+LVLGL  HSD S +TI+L   EVEGL+++K  +W  + +  + D+ ++N+G+  EIM+NGI++   HR V NS +ER S+A F 
Subjt:  IMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFC

Query:  SPERDKEIKPIEELINETRPRLY
          + + EI PI  L+    P L+
Subjt:  SPERDKEIKPIEELINETRPRLY

O80449 Jasmonate-induced oxygenase 41.2e-5235.26Show/hide
Query:  EIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNP
        EIPV+D+  +   P    GL  +R A   WG FQ +NH +  S ++++R   ++FF LP+ EK K++   D +EGYGS +   +   LDW+D  +L   P
Subjt:  EIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNP

Query:  EDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLN------VEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDK
           R    WP  P   RE + +   +++++ E +   +++ L       ++  G  D+VG     S+R NFYP C +P L LGL  HSD   ITI+L D+
Subjt:  EDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLN------VEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDK

Query:  EVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVK--NYLDTYFQ
        +V GL++++ + W  V + ++ ++L++N+G+Q +I+SNGI+K   H+V+ NS  ER SLA F +P  D  + PIEEL+   RP LY  ++   Y     Q
Subjt:  EVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYNSVK--NYLDTYFQ

Query:  SYQKGQRPIDRL
            G+  +D L
Subjt:  SYQKGQRPIDRL

Q39224 Protein SRG15.1e-5636.33Show/hide
Query:  EIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNP
        EIP++D+++L SS T  + +E+L  A   WG FQ +NH I SSFLDK++   + FF+LPM EK K  +  D  EG+G   + SE Q LDW D  + TV P
Subjt:  EIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNP

Query:  EDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNV---EVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVE
         + R+   +P+ P  FR+ L   + +++ + + ++  MA+ L +   E++   D V    + SMR N+YPPC +PD V+GL  HSD   +T+++   +VE
Subjt:  EDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNV---EVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVE

Query:  GLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYN--SVKNYLDTYFQSYQ
        GL+++K+ +W  VPV  + ++ ++N+G+  EI++NG ++   HR V NSE+ER S+A F +    KE+ P + L+   +   +   ++K Y D  F    
Subjt:  GLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPRLYN--SVKNYLDTYFQSYQ

Query:  KGQRPIDRLKI
         G+  +D L+I
Subjt:  KGQRPIDRLKI

Q94LP4 2-oxoglutarate-dependent dioxygenase 111.6e-5233.53Show/hide
Query:  PERYTYKDGYGGGGFNDPLPLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGS
        PERY   +       N+      IP++DL++L    +      +LR A   WG F  INH +P   +  +++    FFS P+  K ++++  +  EGYG 
Subjt:  PERYTYKDGYGGGGFNDPLPLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMGEKNKHSREVDGHEGYGS

Query:  DVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGL
          +FSE Q LDW D LYL V+P D R L++WP +P SFR+ +   + + K +   +   MAK +  + +   D   E+P   +R  +YPPC + D V+GL
Subjt:  DVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMRFNFYPPCSRPDLVLGL

Query:  KEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPR
          HSD   +T++L    V+GL+++K+ +WF +  P  A  L+ N+G+  EI+SNG F+   HR V N  +ER S A F  P  +  I P+ E + + + +
Subjt:  KEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELINETRPR

Query:  LYNSVKNYLDTYFQSYQK---GQRPIDRLKI
         Y S+ +YLD   Q + +   G+  ++ LK+
Subjt:  LYNSVKNYLDTYFQSYQK---GQRPIDRLKI

Arabidopsis top hitse value%identityAlignment
AT1G49390.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.4e-9348.97Show/hide
Query:  KTLQEQLLVNGGETPERYTYKDGYGGGG--FNDPLPLAEIPVVDLEQL-ASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMG
        KT+QE ++  G   PERY +     G     N  +P  +IP +DL  L +SS  G   +++L  ALS+WG  Q +NH I  +FLDK+ +++KQFF+LP  
Subjt:  KTLQEQLLVNGGETPERYTYKDGYGGGG--FNDPLPLAEIPVVDLEQL-ASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMG

Query:  EKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSM
        EK+K +RE    +GYG+D+I S+ Q+LDW DRL+LT  PED+RQLK+WP+ P  F E L E T+K + +IE    AMA+ L +E   F +  GE  +M+ 
Subjt:  EKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSM

Query:  RFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPER
        RFNF+PPC RPD V+G+K H+DGSAIT++L DK+VEGL+  K+ +W++ P+  + D++LI LG+Q EIMSNGI+K PVHRVVTN E+ER S+A FC P  
Subjt:  RFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPER

Query:  DKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPID
        DKEI P + L+ E RPRLY +V  Y+D +++ YQ+G+R I+
Subjt:  DKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPID

AT3G21420.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein1.3e-5735.82Show/hide
Query:  LQEQLLVNGGETPERY---TYKDGYGGGGFNDPLPLAEIPVVDLEQLASSPTGAAGLEELRL--ALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMG
        +QE +     + PER+    Y+ G             +IPV+DL +L+         E L+L  A   WG FQ INH I    ++ + +++ +FF +P+ 
Subjt:  LQEQLLVNGGETPERY---TYKDGYGGGGFNDPLPLAEIPVVDLEQLASSPTGAAGLEELRL--ALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMG

Query:  EKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSM
        EK K+  E    +GYG   IFSE Q LDW +   L V+P   R  K WP  P  F E L   + +++++ + +L  +A  L ++ + F +  GE  + ++
Subjt:  EKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSM

Query:  RFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDK-EVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPE
        R N+YPPCS PDLVLGL  HSDGSA+T++   K    GL++ K+N W  VPV  + ++L+IN+G+  E++SNG +K   HR VTN E+ER ++  F +P 
Subjt:  RFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDK-EVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPE

Query:  RDKEIKPIEELI-NETRPRLYNSVK--NYLDTYFQSYQKGQRPIDRLKI
         + EI+P+ EL+ +ET P  Y S    +Y   Y  +  +G++ +D  KI
Subjt:  RDKEIKPIEELI-NETRPRLYNSVK--NYLDTYFQSYQKGQRPIDRLKI

AT5G20400.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein7.3e-9850.44Show/hide
Query:  KTLQEQLLVNGGETPERYTYKDGYGG--GGFNDPLPLAEIPVVDLE-QLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMG
        KT+QE ++  G   PERY +     G     N  +P  +IP +DL   L+SS  G   L +L  ALS+WG  Q +NH I  +FLDK+ +++K+FF+LP  
Subjt:  KTLQEQLLVNGGETPERYTYKDGYGG--GGFNDPLPLAEIPVVDLE-QLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMG

Query:  EKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSM
        EK K +RE+D  +GYG+D+I  + Q+LDW DRLY+T  PED+RQL +WPE P  FRE LHE T+K + +IE    AMA+ L +E   F D  GE   +  
Subjt:  EKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSM

Query:  RFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPER
        RFN YPPC  PD V+G+K H+DGSAIT++L DK+V GL+ QK+ +W++ P+  + D++LIN+G+Q EIMSNGI+K PVHRVVTN E+ER S+A FC P  
Subjt:  RFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPER

Query:  DKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPID
        DKEI+P+ EL++E RPRLY +VK Y++ YF+ YQ+G+RPI+
Subjt:  DKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPID

AT5G20550.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein3.2e-9349.71Show/hide
Query:  KTLQEQLLVNGGETPERYTYKDGYGGGG--FNDPLPLAEIPVVDLEQLAS-SPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMG
        KT+QE ++  G   PERY         G   N  +P+ +IP +DL  L S S  G   L +L  ALS+WG  Q INH I  + LDK+ +++K+F +LP  
Subjt:  KTLQEQLLVNGGETPERYTYKDGYGGGG--FNDPLPLAEIPVVDLEQLAS-SPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPMG

Query:  EKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSM
        EK K++RE+   +GYG+D+I  + Q+LDW DRLY+T  PED+RQLK+WP+ P  FRE LHE T+K   +   V  AMA  L +E   F D  GE   M  
Subjt:  EKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSM

Query:  RFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPER
        RFN YPPC RPD V+G++ H+D SA T++L DK VEGL+  K+ +W++ PV A +D++LIN+G+Q EIMSNGI+K PVHRVVTN+E+ER S+A FC P  
Subjt:  RFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPER

Query:  DKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPI
        DKEI+P++ L++E RPRLY  VKNY+D   + Y +GQRPI
Subjt:  DKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPI

AT5G54000.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein2.2e-9447.98Show/hide
Query:  KTLQEQLLVNGGETPERYTYK---DGYGGGGFNDPLPLAEIPVVDLEQL-ASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPM
        KT+QE ++  G + PERY Y    DG G   FN  LP  +I ++DL  L +SS  G   L +L  A+S+WG  Q +NH I  + LDK+ +++KQFF LP 
Subjt:  KTLQEQLLVNGGETPERYTYK---DGYGGGGFNDPLPLAEIPVVDLEQL-ASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQISKQFFSLPM

Query:  GEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMS
         EK K++RE+   +G+G+D+I S+ Q+LDW DRLYL   PED+RQLK+WPENP  FRE LHE T+K + ++E    A+A+ L +E   F +  GE   + 
Subjt:  GEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMS

Query:  MRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPE
         RFN YPPC RPD VLGLK HSDGSA T++L DK VEGL+  K+ +W++  +  +  ++LIN+G+  E+MSNGI+K PVHRVV N ++ER  +A FC+ +
Subjt:  MRFNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPE

Query:  RDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPIDRLKI
         DKEI+P+  L++E RPRLY +VK     +F  YQ+G+RPI+   I
Subjt:  RDKEIKPIEELINETRPRLYNSVKNYLDTYFQSYQKGQRPIDRLKI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGGAATTAACCCTTCCGGCAGCGTCCAAGACGTGGCTTCGAAAGGGGAAGTGCCGGAGAGATACATCCATGAAGAATGCGATCGGGGAGCTCTAGATGCTCCTTC
AATGGACGGTCCTGTAATTGATATCGGTCTCCTCTCGTCTTCATCCAATGCAGGACCGGAACTGGAGAAACTCCGACTTGGACTTCAGTCATGGGGCTGCTTTCAGGCTA
TAAATCATGGAATGTCACCTGAATTTCTGGAAGAAGTGCGCCAAGTAGCGAAACAGTTCTTCGCTCTTCCGATGGAAGACAAACTGAAATACTCCAGAGAAGAAGACGGG
ATTGAAGGATATGGCAACGACGCGGTTGTGGCCAAGCAAGGAACTCTTGATTGGACTGATCGTTTACACCTCACTGTATATCCAGAAGAATGTCACTATTTCAAGTATTG
GCCAGCAAATCCTGAAAGATTTAGAGAAGTTGTTCTTGAGTACACTGCAAATGTGAGGGTGATAAGTGAGACAATTCTTAAGGCCATGGCAATGTCATTGGATTTAGATG
AGAGTAGCTTTCTGAATCAGTACGGGGAGCGGATTAAACTCAGCGCACGATTCAACTTCTACCCGCGATGTCAAAAGCCCGACCTTGTTCTTGGTGTTAGGCCACACACC
GATGGATCAGCAATCACCATTTTGTTGCAGGACAAAGAAGTAGAAGGTCTTCAGTTCTTGAAGGACAATGAGTGGTTCAACGTACCAATCATTCCCGACGCTCTTCTCGT
CAATGTCGGGGATCAAGTGGAGGGTATTGACAAACTCAGAGAGGGAGAGATTTCTGTGGCTGTGTTTTACGTTCCAGATTCAGAGAAAGAACTTGAACCCTTAGAGAAAC
TTGTCAATGAAACTCAGCCAAGATTGTACAAAACGGTGAAGAACTTTGTTAGCCTTATTATCCAATGCTACCAGCAGGGCAAGACACCAATAGAGGCTGCAAAAGAAAGC
CCAGCTGCTAACCCCACAAACATGGCCGAATCTCATCCTTCTCTCCTGCAAGATTACTTCACAACAAAAACACTCCAAGAACAACTGCTCGTCAACGGCGGCGAAACGCC
GGAAAGATATACCTACAAAGATGGCTATGGCGGCGGAGGCTTTAATGATCCACTTCCATTAGCGGAGATTCCGGTCGTTGACCTTGAACAACTCGCGTCGTCGCCTACCG
GCGCGGCTGGGTTAGAGGAGCTCCGGTTGGCTCTCAGTTCTTGGGGCTGCTTTCAGGCGATAAATCACAGCATTCCCAGTTCATTTTTAGACAAGATGCGTCAAATATCC
AAGCAATTTTTCTCCCTGCCAATGGGAGAGAAGAACAAGCATTCCAGAGAAGTTGATGGGCATGAAGGATATGGAAGTGATGTGATTTTTTCAGAACAACAAATTCTTGA
TTGGACCGATCGGTTATATCTCACAGTAAATCCGGAGGATGAGAGACAGCTCAAGTATTGGCCGGAAAATCCTCAATCTTTCCGGGAAGATCTACACGAGTTAACGATCA
AGTTAAAGCAGATAATTGAAACTGTGCTCGTCGCCATGGCAAAGTGCTTAAACGTGGAAGTGAAGGGCTTTTCAGACCAGGTGGGGGAGCGACCGATTATGTCCATGAGA
TTCAACTTCTATCCGCCGTGTTCGAGGCCCGATCTTGTTCTTGGACTCAAAGAACACTCCGATGGGTCGGCCATCACCATTGTTTTACTGGACAAAGAAGTCGAAGGTCT
CGAGTTGCAGAAGAACAACCAATGGTTCAGAGTTCCTGTTCCTGCCATGGCCGATTCTCTTCTTATCAATCTTGGGGAACAAGCCGAGATTATGAGCAATGGGATTTTCA
AGGGCCCAGTTCATCGGGTGGTGACGAACTCGGAGAGGGAGAGGACTTCGTTGGCATGCTTCTGTTCCCCAGAAAGAGATAAAGAGATCAAGCCGATCGAGGAGCTGATC
AATGAGACGAGGCCGAGATTATATAACAGTGTGAAGAACTATTTGGATACTTACTTCCAATCCTACCAGAAGGGTCAGAGACCGATTGATAGATTGAAAATTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGGAATTAACCCTTCCGGCAGCGTCCAAGACGTGGCTTCGAAAGGGGAAGTGCCGGAGAGATACATCCATGAAGAATGCGATCGGGGAGCTCTAGATGCTCCTTC
AATGGACGGTCCTGTAATTGATATCGGTCTCCTCTCGTCTTCATCCAATGCAGGACCGGAACTGGAGAAACTCCGACTTGGACTTCAGTCATGGGGCTGCTTTCAGGCTA
TAAATCATGGAATGTCACCTGAATTTCTGGAAGAAGTGCGCCAAGTAGCGAAACAGTTCTTCGCTCTTCCGATGGAAGACAAACTGAAATACTCCAGAGAAGAAGACGGG
ATTGAAGGATATGGCAACGACGCGGTTGTGGCCAAGCAAGGAACTCTTGATTGGACTGATCGTTTACACCTCACTGTATATCCAGAAGAATGTCACTATTTCAAGTATTG
GCCAGCAAATCCTGAAAGATTTAGAGAAGTTGTTCTTGAGTACACTGCAAATGTGAGGGTGATAAGTGAGACAATTCTTAAGGCCATGGCAATGTCATTGGATTTAGATG
AGAGTAGCTTTCTGAATCAGTACGGGGAGCGGATTAAACTCAGCGCACGATTCAACTTCTACCCGCGATGTCAAAAGCCCGACCTTGTTCTTGGTGTTAGGCCACACACC
GATGGATCAGCAATCACCATTTTGTTGCAGGACAAAGAAGTAGAAGGTCTTCAGTTCTTGAAGGACAATGAGTGGTTCAACGTACCAATCATTCCCGACGCTCTTCTCGT
CAATGTCGGGGATCAAGTGGAGGGTATTGACAAACTCAGAGAGGGAGAGATTTCTGTGGCTGTGTTTTACGTTCCAGATTCAGAGAAAGAACTTGAACCCTTAGAGAAAC
TTGTCAATGAAACTCAGCCAAGATTGTACAAAACGGTGAAGAACTTTGTTAGCCTTATTATCCAATGCTACCAGCAGGGCAAGACACCAATAGAGGCTGCAAAAGAAAGC
CCAGCTGCTAACCCCACAAACATGGCCGAATCTCATCCTTCTCTCCTGCAAGATTACTTCACAACAAAAACACTCCAAGAACAACTGCTCGTCAACGGCGGCGAAACGCC
GGAAAGATATACCTACAAAGATGGCTATGGCGGCGGAGGCTTTAATGATCCACTTCCATTAGCGGAGATTCCGGTCGTTGACCTTGAACAACTCGCGTCGTCGCCTACCG
GCGCGGCTGGGTTAGAGGAGCTCCGGTTGGCTCTCAGTTCTTGGGGCTGCTTTCAGGCGATAAATCACAGCATTCCCAGTTCATTTTTAGACAAGATGCGTCAAATATCC
AAGCAATTTTTCTCCCTGCCAATGGGAGAGAAGAACAAGCATTCCAGAGAAGTTGATGGGCATGAAGGATATGGAAGTGATGTGATTTTTTCAGAACAACAAATTCTTGA
TTGGACCGATCGGTTATATCTCACAGTAAATCCGGAGGATGAGAGACAGCTCAAGTATTGGCCGGAAAATCCTCAATCTTTCCGGGAAGATCTACACGAGTTAACGATCA
AGTTAAAGCAGATAATTGAAACTGTGCTCGTCGCCATGGCAAAGTGCTTAAACGTGGAAGTGAAGGGCTTTTCAGACCAGGTGGGGGAGCGACCGATTATGTCCATGAGA
TTCAACTTCTATCCGCCGTGTTCGAGGCCCGATCTTGTTCTTGGACTCAAAGAACACTCCGATGGGTCGGCCATCACCATTGTTTTACTGGACAAAGAAGTCGAAGGTCT
CGAGTTGCAGAAGAACAACCAATGGTTCAGAGTTCCTGTTCCTGCCATGGCCGATTCTCTTCTTATCAATCTTGGGGAACAAGCCGAGATTATGAGCAATGGGATTTTCA
AGGGCCCAGTTCATCGGGTGGTGACGAACTCGGAGAGGGAGAGGACTTCGTTGGCATGCTTCTGTTCCCCAGAAAGAGATAAAGAGATCAAGCCGATCGAGGAGCTGATC
AATGAGACGAGGCCGAGATTATATAACAGTGTGAAGAACTATTTGGATACTTACTTCCAATCCTACCAGAAGGGTCAGAGACCGATTGATAGATTGAAAATTTAG
Protein sequenceShow/hide protein sequence
MAGINPSGSVQDVASKGEVPERYIHEECDRGALDAPSMDGPVIDIGLLSSSSNAGPELEKLRLGLQSWGCFQAINHGMSPEFLEEVRQVAKQFFALPMEDKLKYSREEDG
IEGYGNDAVVAKQGTLDWTDRLHLTVYPEECHYFKYWPANPERFREVVLEYTANVRVISETILKAMAMSLDLDESSFLNQYGERIKLSARFNFYPRCQKPDLVLGVRPHT
DGSAITILLQDKEVEGLQFLKDNEWFNVPIIPDALLVNVGDQVEGIDKLREGEISVAVFYVPDSEKELEPLEKLVNETQPRLYKTVKNFVSLIIQCYQQGKTPIEAAKES
PAANPTNMAESHPSLLQDYFTTKTLQEQLLVNGGETPERYTYKDGYGGGGFNDPLPLAEIPVVDLEQLASSPTGAAGLEELRLALSSWGCFQAINHSIPSSFLDKMRQIS
KQFFSLPMGEKNKHSREVDGHEGYGSDVIFSEQQILDWTDRLYLTVNPEDERQLKYWPENPQSFREDLHELTIKLKQIIETVLVAMAKCLNVEVKGFSDQVGERPIMSMR
FNFYPPCSRPDLVLGLKEHSDGSAITIVLLDKEVEGLELQKNNQWFRVPVPAMADSLLINLGEQAEIMSNGIFKGPVHRVVTNSERERTSLACFCSPERDKEIKPIEELI
NETRPRLYNSVKNYLDTYFQSYQKGQRPIDRLKI