CuGenDBv2

Lsi04G015050 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G015050
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Aspartic proteinase-like protein 2
Genome location: chr04:22690699..22701369
RNA-Seq Expression: Lsi04G015050
Synteny: Lsi04G015050
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0016021 - integral component of membrane (cellular component)
  GO:0004190 - aspartic-type endopeptidase activity (molecular function)
  GO:0020037 - heme binding (molecular function)
  GO:0016705 - oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen (molecular function)
  GO:0005506 - iron ion binding (molecular function)
  GO:0004497 - monooxygenase activity (molecular function)
InterPro domains:
  IPR001969 - Aspartic peptidase, active site
  IPR036396 - Cytochrome P450 superfamily
  IPR034161 - Pepsin-like domain, plant
  IPR033121 - Peptidase family A1 domain
  IPR032861 - Xylanase inhibitor, N-terminal
  IPR032799 - Xylanase inhibitor, C-terminal
  IPR021109 - Aspartic peptidase domain superfamily
  IPR017972 - Cytochrome P450, conserved site
  IPR002401 - Cytochrome P450, E-class, group I
  IPR001461 - Aspartic peptidase A1 family
  IPR001128 - Cytochrome P450
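
The genome location above uses a `chromosome:start..end` string. A minimal Python sketch for splitting that string and computing the span length follows. It assumes the coordinates are 1-based and inclusive, which the record itself does not state; the function names `parse_location` and `span_length` are illustrative and not part of CuGenDB.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr04:22690699..22701369")
print(chrom, span_length(start, end))
```

Under the inclusive-coordinate assumption the locus spans 10671 bp on chr04; with a half-open convention the value would be one less.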


Homology
GenBank top hits (e value, % identity, alignment)
KAA0064428.1 aspartic proteinase-like protein 2 [Cucumis melo var. makuwa] | e value: 0.0e+00 | % identity: 80.3
Query:  AMGAIFLYIPLFLVFYILTEHFLHKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRL
        AMGA+FLYIPLFLV Y+LT+HFLH IRNLPPTPFPSLPILGHLHLL KPIYR LYNISNRYGPVVFLRLGSRSVLIVSS S AEECLTKNDI+FANRPRL
Subjt:  AMGAIFLYIPLFLVFYILTEHFLHKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRL

Query:  LISKCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCENQIVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAK
        L+SKCFGYN+TNL+WSSYGD WRNLRRICTVEILSTHRLHMLS+VRFEEVRSLIQRL+K ENQ+VNMK+ FFDL+FN MLRMIVGKRFYGDDVDDV+EAK
Subjt:  LISKCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCENQIVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAK

Query:  LFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQH-RGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALM
        LFRELQA+S+ L+GKS +GDFIPLM+WLGF STL +EM+DCQN RDALMQSLIEQH R R  +IDDSFRDGRK T+IEVLLELQESEPEQY DETIR LM
Subjt:  LFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQH-RGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALM

Query:  LLMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINI
        LLMLVAGTETSGS MEWALSLLLNHPEILKKAQ EID+QVG++R I+ESD+A LPYLRGIINETLRMYPPAPLL PHESS+DCSVGGYH+PR TMLYINI
Subjt:  LLMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINI

Query:  WAIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPII
        WAIQNDPKIWAHP++FDPDRFN +ESE YKFNLMPFGLGRRGCPGEGLGLRM+GLVLGSLIQCFEWERP++EL                           
Subjt:  WAIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPII

Query:  RLSLQLVVGLPLTVCPVVSAMALLHSVSVTWGQAERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFD
                                         A RAF+RVLSAPISVIS   APSRT HRHTMSP C NSY AIVF  LVGFNLLGMILSSSVDSRD D
Subjt:  RLSLQLVVGLPLTVCPVVSAMALLHSVSVTWGQAERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFD

Query:  YQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQ
        YQQRPVLLPLYISP NSTH+RV DRDHRLRHLQNL KPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQ
Subjt:  YQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQ

Query:  PELSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVS
        PELSSTYQPVKCNVDCNCD+NGVQCTYERRYAEMSTSSGVLAED+MSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVS
Subjt:  PELSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVS

Query:  NSFSLCYGGMDIGGGAMVLGGISSPHGMLWD----GNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQ
        NSFSLCYGGMD+GGGAMVLGGISSP GM++       SPYYNIELKEIHVAGKPLKLNP TF+GKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQ
Subjt:  NSFSLCYGGMDIGGGAMVLGGISSPHGMLWD----GNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQ

Query:  INGPDPNFKDICFSGAG----------------------------RDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTL
        INGPDPNFKDICFSGAG                            RDVTEL KVFPEVDMVFA+GQKISLSPENYLFRHTKVS AYCLGIFKN NDQTTL
Subjt:  INGPDPNFKDICFSGAG----------------------------RDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTL

Query:  LGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVIT
        LG                      GIIVRNTLVTY+REN+TIGFWKTNCSELWKNLHYLSPAPPPAPLPS+  NTSKE+PPPGSP+ PFLSGEFQVGVIT
Subjt:  LGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVIT

Query:  FNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDK
        FNMMLHVNKSSVKLNITELAE IA+ELEV +                 + VHVLNFTS E DFFIRWAIFPADSAGYISNSTAMDIISRLKE  LQLP+K
Subjt:  FNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDK

Query:  FGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVGVVGPEQELQPL
        FGSYQLVELNVEPPLKKTWMEQHFWS+MTIG+AVTLVVGLAAGSTWLIWRYRRR++SSYEPVGVVGPEQELQP+
Subjt:  FGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVGVVGPEQELQPL
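
In BLAST-style pairwise output, the unlabelled middle row marks each column: a letter is an identical residue, `+` is a conservative substitution, and a space is a mismatch or gap. The Python sketch below counts those marks for a single block, using the last block of the alignment above as input. The helper name `midline_counts` is hypothetical, and a per-block figure will not in general match the page's reported % identity, which covers the whole alignment.

```python
def midline_counts(query: str, midline: str) -> dict:
    """Count identities and positives in one alignment block.

    `query` is the residue string from a 'Query:' row and `midline`
    is the unlabelled row beneath it.
    """
    # Scraped midlines can lose trailing spaces, so pad to the query length.
    midline = midline.ljust(len(query))
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return {"columns": len(query), "identities": identities, "positives": positives}


query = "FGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVGVVGPEQELQPL"
midline = "FGSYQLVELNVEPPLKKTWMEQHFWS+MTIG+AVTLVVGLAAGSTWLIWRYRRR++SSYEPVGVVGPEQELQP+"

counts = midline_counts(query, midline)
print(counts, round(100 * counts["identities"] / counts["columns"], 2))
```

Summing the counts over every block of a hit, and dividing total identities by total columns, gives an estimate of the overall identity for that hit.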

KAG6591408.1 Aspartic proteinase-like protein 2, partial [Cucurbita argyrosperma subsp. sororia] | e value: 0.0e+00 | % identity: 84.39
Query:  RAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNL
        RA YRVLSAPI+V+S P APSRTVH+HTMSPP  + + AI  Y LVGFNLLG I SSSVDSRDFDYQQRPV+LPLYISP NSTHQRVLDRDHRLRHL NL
Subjt:  RAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNL

Query:  DKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMS
        +KPHSSNARMRL+DDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCV+CGNHQDPRFQP+LSS+YQPVKC++DC+CDDNGVQCTYERRYAEMS
Subjt:  DKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMS

Query:  TSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLW----D
        TSSGVLAEDIMSFGKESEL+PQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKG VSNSFSLCYGGMD+GGGAMVLGGISSP GM++     
Subjt:  TSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLW----D

Query:  GNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMVFA
          SPYYNIELKEIHVAGKPLKLNP TF+GKYG++LDSGTTYAYFPEKAYYAFKDA+MKKISFLKQINGPDPNFKDICFSGAGRDV+ELSKVFPEV MVFA
Subjt:  GNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMVFA

Query:  NGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAP
        NGQKISLSPENYLFRHTKVS AYCLGIFKN NDQTTLLG                      GIIVRNTLVTYDRENT IGFWKTNCSELWKNLHYLSPAP
Subjt:  NGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAP

Query:  PPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADF
        PPAPLPSYGQNTS EIPPP SPT PFLSGEFQVGVITFNM+LH NKSSVKLNITELAE IA+ELEV +                 + VHVLNFTS E DF
Subjt:  PPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADF

Query:  FIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVG
        FIRWAIFPADSAGYISNSTAMDIISRLKE  LQLP+KFGSYQLVELNVEPPLKKTWMEQHFWS+M+I  AVTLVVGLAAGSTWLIWRYRRRELSSYEPVG
Subjt:  FIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVG

Query:  VVGPEQELQPL
         VGPEQELQPL
Subjt:  VVGPEQELQPL

XP_022936107.1 aspartic proteinase-like protein 2 [Cucurbita moschata] | e value: 0.0e+00 | % identity: 83.54
Query:  AERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQ
        A+RA YRVLSAPI+V+S P APSRTVH+HTMSPP  + + A+  Y LVGFNLLG I SSSVDSRDF YQQRPV+LPLYISP NSTHQRVLDRDHRLRHL 
Subjt:  AERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQ

Query:  NLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAE
        NL+KPHSSNARMRL+DDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCV+CGNHQDPRFQP+LSS+YQPVKCN+DC+CDDNGVQCTYERRYAE
Subjt:  NLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAE

Query:  MSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLW---
        MSTSSGVLAEDIMSFGKESEL+PQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKG VSNSFSLCYGGMD+GGGAMVLGGISSP GM++   
Subjt:  MSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLW---

Query:  -DGNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMV
            SPYYNIELKEIHVAGKPLKLNP TF+GKYG++LDSGTTYAYFPEKAYYAFKDA+MKKISFLKQI+GPDPNFKDICFSGAGRDV+ELSKVFPEV MV
Subjt:  -DGNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMV

Query:  FANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLS-
        FANGQKISLSPENYLFRHTKVS AYCLGIFKN NDQTTLLG                      GIIVRNTLVTYDRENT IGFWKTNCSELWKNLHYLS 
Subjt:  FANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLS-

Query:  ---PAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFT
           PAPPPAPLPSYGQNTS EIPPP SPT PFLSGEFQVGVITFNM+LH NKSSVKLNITELAE IA+ELEV +                 + VHVLNFT
Subjt:  ---PAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFT

Query:  SREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELS
        S E DFFIRWAIFPADSAGYISNSTAMDIISRLKE  LQLP+KFGSYQLVELNVEPPLKKTWMEQHFWS+M+I  AVTLVVGLAAGSTWLIWRYRRRELS
Subjt:  SREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELS

Query:  SYEPVGVVGPEQELQPL
        SYEPVG VGPEQELQPL
Subjt:  SYEPVGVVGPEQELQPL

XP_022976987.1 aspartic proteinase-like protein 2 [Cucurbita maxima] | e value: 0.0e+00 | % identity: 83.73
Query:  AERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQ
        A+RA YRVLSAPI+V+ LP APSRTVH+HTMSPP  + + AI  Y LVGFN LG I SSSVDSRDFDYQQRPV+LPLYISP NSTHQRVLDRDHRLRHL 
Subjt:  AERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQ

Query:  NLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAE
        NL+KPHSSNARMRL+DDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCV+CGNHQDPRFQP+LSS+YQPVKCN+DC+CDDNGVQCTYERRYAE
Subjt:  NLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAE

Query:  MSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLW---
        MSTSSGVLAEDIMSFGKESEL+PQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKG VSNSFSLCYGGMD+GGGAMVLGGISSP GM++   
Subjt:  MSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLW---

Query:  -DGNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMV
            SPYYNIELKEIHVAGKPLKLNP TF+GKYG++LDSGTTYAYFPEKAYYAFKDA+MKKISFLKQINGPDPNFKDICFSGAGRDV+ELSKVFPEV MV
Subjt:  -DGNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMV

Query:  FANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSP
        FANGQKISLSPENYLFRHTKVS AYCLGIFKN NDQTTLLG                      GIIVRNTLVTYDRENT IGFWKTNCSELWKNLHYLSP
Subjt:  FANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSP

Query:  APPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREA
        APPPAPLPSYGQNTS EIPPP SPT PFLSGEFQVGVITFNM+LH NKSSVKLNITELAE IA+ELEV +                 + VH+LNFTS   
Subjt:  APPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREA

Query:  DFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEP
        DFFIRWAIFPADSAGYISNSTAMDIISRLK+  LQLP+KFGSYQLVELNVEPPLKKTWME HFWS+M+I  AVTLVVGLAAGSTWLIWRYRRRELSSYEP
Subjt:  DFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEP

Query:  VGVVGPEQELQPL
        VG VGPEQELQPL
Subjt:  VGVVGPEQELQPL

XP_038898702.1 aspartic proteinase 39-like [Benincasa hispida] | e value: 0.0e+00 | % identity: 86.16
Query:  MALLHSVSVTWGQAERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQ
        MAL HSVSV WG+AERA+YRVLS   SVIS+P APSRTVHRHTMSPP A+SY AIVFY LVGFNLLG ILSSSV +RDFDYQQ+PV+LPLYISPMNSTHQ
Subjt:  MALLHSVSVTWGQAERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQ

Query:  RVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDD
        RVLDRDHRLRHLQNL KPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDD
Subjt:  RVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDD

Query:  NGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLG
        NGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLV K V+SNSFSLCYGGMD+GGGAMVLG
Subjt:  NGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLG

Query:  GISSPHGMLW----DGNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDV
        GISSP GM++       SPYYNIELKEIHVAGKPLKLNPSTF+GKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQI GPDPNFKDICFSGAGRDV
Subjt:  GISSPHGMLW----DGNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDV

Query:  TELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTN
        TELSKVFPEVDMVFANGQKISLSPENYLFRHTKVS AYCLG+FKN NDQTTLLG                      GIIVRNTLVTYDRENTTIGFWKTN
Subjt:  TELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTN

Query:  CSELWKNLHYLS----PAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVIS
        CSELWKNLHYLS    PAPPPAPLPSYGQNTSKE+PPPGSP+ PFLSGEFQVGVITFNMMLHVNKSSVKLNITELAE IA+ELEV +             
Subjt:  CSELWKNLHYLS----PAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVIS

Query:  CRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGS
            + VHVLNFT  E +FFIRWAIFPAD AGYISN+TAMDIISRLKE  LQLP+KFGSYQLVELNVEPPLKKTWMEQHFWS+MTIGVAVTLVVGLAA S
Subjt:  CRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGS

Query:  TWLIWRYRRRELSSYEPVGVVGPEQELQPL
        TWLIWRYRRRELSSYEPVGVVGPEQELQPL
Subjt:  TWLIWRYRRRELSSYEPVGVVGPEQELQPL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L518 Peptidase A1 domain-containing protein | e value: 0.0e+00 | % identity: 84.79
Query:  NSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQE
        NSY A +   L+GFNLL +ILSSSVDSRDFDYQQR V+LPL+ISP NS+H+RVLDRDHRLRHLQNL KPHSSNARMRLHDDLLTNGYYTTRLWIG+PPQE
Subjt:  NSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQE

Query:  FALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETG
        FALIVDTGSTVTYVPCSNCV+CGNHQDPRFQPELSSTYQPVKCN DCNCD+NGVQCTYERRYAEMSTSSGVLAED+MSFGKESELVPQRAVFGCETME+G
Subjt:  FALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETG

Query:  DLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLWD----GNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAI
        DLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMD+GGGAMVLGGISSP GM++       SPYYNIELKEIHVAGKPLKLNP TF+GKYGAI
Subjt:  DLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLWD----GNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAI

Query:  LDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQ
        LDSGTTYAYFPEKAYYAFKDAIMKKISFLKQI+GPDPNFKDICFSGAGRDVTEL KVFPEVDMVFANGQKISLSPENYLFRHTKVS AYCLGIFKN NDQ
Subjt:  LDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQ

Query:  TTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVG
        TTLLG                      GIIVRNTLVTY+REN+TIGFWKTNCSELWKNLHYLSPAPPPAPLPS+  NTSKE+PPPGSP+ PFLSGEFQVG
Subjt:  TTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVG

Query:  VITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQL
        VITFNMMLHVN+SSVKLNITELAE IA+ELEV +                 + VHVLNFTS E D FIRWAIFPADSAGYISNSTAMDIISRLKE  LQL
Subjt:  VITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQL

Query:  PDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVGVVGPEQELQPL
        P+KFGSYQLVELNVEPPLKKTWMEQHFWS+ TIGVAVTLVVGLAAGSTWLIWRYRRR+ SSYEPVGVVGPEQELQPL
Subjt:  PDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVGVVGPEQELQPL

A0A1S3BTN6 aspartic proteinase-like protein 2 | e value: 0.0e+00 | % identity: 86.53
Query:  MSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWI
        MSP C NSY AIVF  LVGFNLLGMILSSSVDSRD DYQQRPVLLPLYISP NSTH+RV DRDHRLRHLQNL KPHSSNARMRLHDDLLTNGYYTTRLWI
Subjt:  MSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWI

Query:  GTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGC
        GTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCD+NGVQCTYERRYAEMSTSSGVLAED+MSFGKESELVPQRAVFGC
Subjt:  GTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGC

Query:  ETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLWD----GNSPYYNIELKEIHVAGKPLKLNPSTFN
        ETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMD+GGGAMVLGGISSP GM++       SPYYNIELKEIHVAGKPLKLNP TF+
Subjt:  ETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLWD----GNSPYYNIELKEIHVAGKPLKLNPSTFN

Query:  GKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIF
        GKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTEL KVFPEVDMVFA+GQKISLSPENYLFRHTKVS AYCLGIF
Subjt:  GKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIF

Query:  KNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEIPPPGSPTGPFLS
        KN NDQTTLLG                      GIIVRNTLVTY+REN+TIGFWKTNCSELWKNLHYLSPAPPPAPLPS+  NTSKE+PPPGSP+ PFLS
Subjt:  KNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEIPPPGSPTGPFLS

Query:  GEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLK
        GEFQVGVITFNMMLHVNKSSVKLNITELAE IA+ELEV +                 + VHVLNFTS E DFFIRWAIFPADSAGYISNSTAMDIISRLK
Subjt:  GEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLK

Query:  ERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVGVVGPEQELQPL
        E  LQLP+KFGSYQLVELNVEPPLKKTWMEQHFWS+MTIG+AVTLVVGLAAGSTWLIWRYRRR++SSYEPVGVVGPEQELQP+
Subjt:  ERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVGVVGPEQELQPL

A0A5A7VA17 Aspartic proteinase-like protein 2 | e value: 0.0e+00 | % identity: 80.3
Query:  AMGAIFLYIPLFLVFYILTEHFLHKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRL
        AMGA+FLYIPLFLV Y+LT+HFLH IRNLPPTPFPSLPILGHLHLL KPIYR LYNISNRYGPVVFLRLGSRSVLIVSS S AEECLTKNDI+FANRPRL
Subjt:  AMGAIFLYIPLFLVFYILTEHFLHKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRL

Query:  LISKCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCENQIVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAK
        L+SKCFGYN+TNL+WSSYGD WRNLRRICTVEILSTHRLHMLS+VRFEEVRSLIQRL+K ENQ+VNMK+ FFDL+FN MLRMIVGKRFYGDDVDDV+EAK
Subjt:  LISKCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCENQIVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAK

Query:  LFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQH-RGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALM
        LFRELQA+S+ L+GKS +GDFIPLM+WLGF STL +EM+DCQN RDALMQSLIEQH R R  +IDDSFRDGRK T+IEVLLELQESEPEQY DETIR LM
Subjt:  LFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQH-RGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALM

Query:  LLMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINI
        LLMLVAGTETSGS MEWALSLLLNHPEILKKAQ EID+QVG++R I+ESD+A LPYLRGIINETLRMYPPAPLL PHESS+DCSVGGYH+PR TMLYINI
Subjt:  LLMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINI

Query:  WAIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPII
        WAIQNDPKIWAHP++FDPDRFN +ESE YKFNLMPFGLGRRGCPGEGLGLRM+GLVLGSLIQCFEWERP++EL                           
Subjt:  WAIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPII

Query:  RLSLQLVVGLPLTVCPVVSAMALLHSVSVTWGQAERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFD
                                         A RAF+RVLSAPISVIS   APSRT HRHTMSP C NSY AIVF  LVGFNLLGMILSSSVDSRD D
Subjt:  RLSLQLVVGLPLTVCPVVSAMALLHSVSVTWGQAERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFD

Query:  YQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQ
        YQQRPVLLPLYISP NSTH+RV DRDHRLRHLQNL KPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQ
Subjt:  YQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQ

Query:  PELSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVS
        PELSSTYQPVKCNVDCNCD+NGVQCTYERRYAEMSTSSGVLAED+MSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVS
Subjt:  PELSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVS

Query:  NSFSLCYGGMDIGGGAMVLGGISSPHGMLWD----GNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQ
        NSFSLCYGGMD+GGGAMVLGGISSP GM++       SPYYNIELKEIHVAGKPLKLNP TF+GKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQ
Subjt:  NSFSLCYGGMDIGGGAMVLGGISSPHGMLWD----GNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQ

Query:  INGPDPNFKDICFSGAG----------------------------RDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTL
        INGPDPNFKDICFSGAG                            RDVTEL KVFPEVDMVFA+GQKISLSPENYLFRHTKVS AYCLGIFKN NDQTTL
Subjt:  INGPDPNFKDICFSGAG----------------------------RDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTL

Query:  LGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVIT
        LG                      GIIVRNTLVTY+REN+TIGFWKTNCSELWKNLHYLSPAPPPAPLPS+  NTSKE+PPPGSP+ PFLSGEFQVGVIT
Subjt:  LGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVIT

Query:  FNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDK
        FNMMLHVNKSSVKLNITELAE IA+ELEV +                 + VHVLNFTS E DFFIRWAIFPADSAGYISNSTAMDIISRLKE  LQLP+K
Subjt:  FNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDK

Query:  FGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVGVVGPEQELQPL
        FGSYQLVELNVEPPLKKTWMEQHFWS+MTIG+AVTLVVGLAAGSTWLIWRYRRR++SSYEPVGVVGPEQELQP+
Subjt:  FGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVGVVGPEQELQPL

A0A6J1F7I1 aspartic proteinase-like protein 2 | e value: 0.0e+00 | % identity: 83.54
Query:  AERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQ
        A+RA YRVLSAPI+V+S P APSRTVH+HTMSPP  + + A+  Y LVGFNLLG I SSSVDSRDF YQQRPV+LPLYISP NSTHQRVLDRDHRLRHL 
Subjt:  AERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQ

Query:  NLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAE
        NL+KPHSSNARMRL+DDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCV+CGNHQDPRFQP+LSS+YQPVKCN+DC+CDDNGVQCTYERRYAE
Subjt:  NLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAE

Query:  MSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLW---
        MSTSSGVLAEDIMSFGKESEL+PQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKG VSNSFSLCYGGMD+GGGAMVLGGISSP GM++   
Subjt:  MSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLW---

Query:  -DGNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMV
            SPYYNIELKEIHVAGKPLKLNP TF+GKYG++LDSGTTYAYFPEKAYYAFKDA+MKKISFLKQI+GPDPNFKDICFSGAGRDV+ELSKVFPEV MV
Subjt:  -DGNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMV

Query:  FANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLS-
        FANGQKISLSPENYLFRHTKVS AYCLGIFKN NDQTTLLG                      GIIVRNTLVTYDRENT IGFWKTNCSELWKNLHYLS 
Subjt:  FANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLS-

Query:  ---PAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFT
           PAPPPAPLPSYGQNTS EIPPP SPT PFLSGEFQVGVITFNM+LH NKSSVKLNITELAE IA+ELEV +                 + VHVLNFT
Subjt:  ---PAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFT

Query:  SREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELS
        S E DFFIRWAIFPADSAGYISNSTAMDIISRLKE  LQLP+KFGSYQLVELNVEPPLKKTWMEQHFWS+M+I  AVTLVVGLAAGSTWLIWRYRRRELS
Subjt:  SREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELS

Query:  SYEPVGVVGPEQELQPL
        SYEPVG VGPEQELQPL
Subjt:  SYEPVGVVGPEQELQPL

A0A6J1IH74 aspartic proteinase-like protein 2 | e value: 0.0e+00 | % identity: 83.73
Query:  AERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQ
        A+RA YRVLSAPI+V+ LP APSRTVH+HTMSPP  + + AI  Y LVGFN LG I SSSVDSRDFDYQQRPV+LPLYISP NSTHQRVLDRDHRLRHL 
Subjt:  AERAFYRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQ

Query:  NLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAE
        NL+KPHSSNARMRL+DDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCV+CGNHQDPRFQP+LSS+YQPVKCN+DC+CDDNGVQCTYERRYAE
Subjt:  NLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAE

Query:  MSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLW---
        MSTSSGVLAEDIMSFGKESEL+PQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKG VSNSFSLCYGGMD+GGGAMVLGGISSP GM++   
Subjt:  MSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLW---

Query:  -DGNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMV
            SPYYNIELKEIHVAGKPLKLNP TF+GKYG++LDSGTTYAYFPEKAYYAFKDA+MKKISFLKQINGPDPNFKDICFSGAGRDV+ELSKVFPEV MV
Subjt:  -DGNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMV

Query:  FANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSP
        FANGQKISLSPENYLFRHTKVS AYCLGIFKN NDQTTLLG                      GIIVRNTLVTYDRENT IGFWKTNCSELWKNLHYLSP
Subjt:  FANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSP

Query:  APPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREA
        APPPAPLPSYGQNTS EIPPP SPT PFLSGEFQVGVITFNM+LH NKSSVKLNITELAE IA+ELEV +                 + VH+LNFTS   
Subjt:  APPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREA

Query:  DFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEP
        DFFIRWAIFPADSAGYISNSTAMDIISRLK+  LQLP+KFGSYQLVELNVEPPLKKTWME HFWS+M+I  AVTLVVGLAAGSTWLIWRYRRRELSSYEP
Subjt:  DFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEP

Query:  VGVVGPEQELQPL
        VG VGPEQELQPL
Subjt:  VGVVGPEQELQPL

SwissProt top hits (e value, % identity, alignment)
Q6WNQ8 Cytochrome P450 81E8 | e value: 2.7e-127 | % identity: 47.12
Query:  LYIPLFLVFYILTEHF--LHKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRLLISK
        L I LF +   L   F    K +NLPP P   LPI+G+LH L +P++ T + +S +YG +  L  GSR V++VSS ++A+EC TKNDIV ANRP  L  K
Subjt:  LYIPLFLVFYILTEHF--LHKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRLLISK

Query:  CFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLV-KCENQI--VNMKNAFFDLTFNSMLRMIVGKRFYGDDVD--DVDEA
          GYN+T +  S YGD+WRNLRRI ++EILS+HRL+    +R +E+  LIQ+L  K  N    V ++  F ++TFN+++RM+ GKR+YG+D D  DV+EA
Subjt:  CFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLV-KCENQI--VNMKNAFFDLTFNSMLRMIVGKRFYGDDVD--DVDEA

Query:  KLFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALM
        +LFR +  E   L G + +GDF+  + W  F   L + +     R DA +Q LI++HR         F     NTMI+ LL  Q+S+PE Y D+ I+ LM
Subjt:  KLFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALM

Query:  LLMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINI
        ++ML+AGT+TS  T+EWA+S LLNHPEI+KKA+NE+D  +GH+R +DE D++ LPYL+ I+ ETLR++  APLL PH SSED S+GGY+IP+ T+L +N 
Subjt:  LLMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINI

Query:  WAIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPII
        W I  DP +W+ P  F P+RF   E E     L+ FGLGRR CPGE L  R  GL LG LIQCFEW+R  +E +DM E   +T  K   L+A C+ R  +
Subjt:  WAIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPII

Query:  RLS
        +++
Subjt:  RLS

Q6WNQ9 Isoflavone 3'-hydroxylase (Fragment) | e value: 5.5e-125 | % identity: 47.34
Query:  AIFLYIPLFLVFYILTEHFL----HKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPR
        A+F Y  L L F I  +  L     +++NLPP P P++PI+G+LH L  P++RT   +S  YG +  L  GSR V++VSSPS+A EC TKNDI+ ANRPR
Subjt:  AIFLYIPLFLVFYILTEHFL----HKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPR

Query:  LLISKCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVK------CENQIVNMKNAFFDLTFNSMLRMIVGKRFYGD--
         L  K   YN T L  +SYGD+WRNLRRI T+++LS +RL+    VR +E   LIQ+L+K           V ++    ++TFN+M+RMI GKR+YGD  
Subjt:  LLISKCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVK------CENQIVNMKNAFFDLTFNSMLRMIVGKRFYGD--

Query:  DVDDVDEAKLFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYK
        DV DV+EAK FRE+ +E   L G +  GDF+PL+  +     L +       R +A ++ LIE+HR      D         TMI+ LL+L ES+PE Y 
Subjt:  DVDDVDEAKLFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYK

Query:  DETIRALMLLMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPR
        D  I+ L+  ML+AGT+TS  T+EW +S LLNHPE+LKKA+ E+D Q+G  +L+DE D++ LPYL+ II+ETLR++PPAPLL PH SSEDC++G +++P+
Subjt:  DETIRALMLLMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPR

Query:  ATMLYINIWAIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHA
         T++  N+W I  DPK W     F P+RF + E  N    +M FGLGRR CPG  L  R VG  +G LIQCFEWER S+E +DM EG  +TMP    L A
Subjt:  ATMLYINIWAIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHA

Query:  KCRPRPI
         C+  PI
Subjt:  KCRPRPI

Q9FG65 Cytochrome P450 81D1 | e value: 5.0e-126 | % identity: 48.12
Query:  IFLYIPLFLVFYILTEHFLH-KIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNIS-----NRYGPVVFLRLGSRSVLIVSSPSV-AEECLTKNDIVFANR
        + LY    L+F I++  FL  K +NLPP+P   LPI+GHL LL  PI+RTL + S     N  G V+ LRLGSR V +VSS  V AEEC  KND+V ANR
Subjt:  IFLYIPLFLVFYILTEHFLH-KIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNIS-----NRYGPVVFLRLGSRSVLIVSSPSV-AEECLTKNDIVFANR

Query:  PRLLISKCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVK---CENQIVNMKNAFFDLTFNSMLRMIVGKRFYGDDVD
        P+++I K  GYN+TN+I + YGD+WRNLRR+CT+EI STHRL+    VR +EVR LI RL +    +  +V +K    DLTFN+++RM+ GKR+YG++  
Subjt:  PRLLISKCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVK---CENQIVNMKNAFFDLTFNSMLRMIVGKRFYGDDVD

Query:  DVDEAKLFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDET
        D +EAK  R+L A+    T      D++P+   L   S+    +       D  +Q LI+  RG         +     TMI+ LL LQ+S+ E Y D+ 
Subjt:  DVDEAKLFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDET

Query:  IRALMLLMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATM
        I+ ++L+M++AGT TS  T+EWALS LLNHP+++ KA++EID++VG +RLI+E+D++ LPYL+ I+ ETLR++P  PLL PH +SEDC +G Y +PR T 
Subjt:  IRALMLLMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATM

Query:  LYINIWAIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCR
        L +N WAI  DP  W  P  F P+RF   E E     L+ FGLGRR CPG GL  R+VGL LGSLIQCFEWER  +  VDM EG+  T+PKA  L A C+
Subjt:  LYINIWAIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCR

Query:  PRPII
         RP +
Subjt:  PRPII

Q9LHA1 Cytochrome P450 81D11 | e value: 3.8e-126 | % identity: 46.58
Query:  RTAMGAIFLYIPLFLVFYILTEHFLHKIR----NLPPTPFPSLPILGHLHLLTKPIYRTLYNISN--RYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDI
        +T M  I+L + LF  F  L+   L   R    NLPP+P    PI+GHLHLL  P++R   ++S       +  L LGSR V +VSS +VAEEC TKND+
Subjt:  RTAMGAIFLYIPLFLVFYILTEHFLHKIR----NLPPTPFPSLPILGHLHLLTKPIYRTLYNISN--RYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDI

Query:  VFANRPRLLISKCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCENQ---IVNMKNAFFDLTFNSMLRMIVGKRFY
        V ANRP  L+ K  GYNST ++ ++YGD+WRNLRRI T+EI S+ RL+    +R +E+R LI  L K        V MK  F  LT N+++RM+ GKRFY
Subjt:  VFANRPRLLISKCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCENQ---IVNMKNAFFDLTFNSMLRMIVGKRFY

Query:  GDDVDDVDEAKLFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQ
        GD  ++ +EAK  R+L AE     G     D+ P++ ++   +   + +     R D  +QSL+ + R    K          NTMI+ LL LQE++P+ 
Subjt:  GDDVDDVDEAKLFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQ

Query:  YKDETIRALMLLMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHI
        Y D  I+ ++L+M++AGT+TS  T+EWA+S LLNHPE+L+KA+ EIDDQ+G +RL++E D+  LPYL+ I++ETLR+YP AP+L PH +SEDC V GY +
Subjt:  YKDETIRALMLLMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHI

Query:  PRATMLYINIWAIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEG-IAVTMPKAQH
        PR T++ +N WAI  DPK+W  P KF P+RF + + E+ K  LMPFG+GRR CPG GL  R+V L LGSL+QCFEWER  ++ +DM E     TM KA  
Subjt:  PRATMLYINIWAIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEG-IAVTMPKAQH

Query:  LHAKCRPRPII
        L A C+ RPI+
Subjt:  LHAKCRPRPII

W8JMU7 Cytochrome P450 81Q32 | e value: 4.7e-132 | % identity: 49.58
Query:  RNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRLLISKCFGYNSTNLIWSSYGDNWRNLR
        RNLPP+P  +LP++GHLHL+ K ++R+LY++S +YG V  L+LG+R VL+VSSP+ AEEC TKNDIVFANRP  ++ K  GYN T ++ S YG++WRNLR
Subjt:  RNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRLLISKCFGYNSTNLIWSSYGDNWRNLR

Query:  RICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCENQ---IVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAKLFRELQAESTRLTGKSYIGDFIP
        R+  VEI S   L+    +R +EV+ L+  L +   Q    V MK+   +L+FN  +RM+ GKR++G DVD  DEAKLFR L  E     G S  GDF+P
Subjt:  RICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCENQ---IVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAKLFRELQAESTRLTGKSYIGDFIP

Query:  LMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALMLLMLVAGTETSGSTMEWALSLLLN
         + W+ F     +++       DA +Q LI + R     +          TMI+ LL LQES+PE Y D+ I+ +++++L+AGT+TS  T+EWA+SLLLN
Subjt:  LMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALMLLMLVAGTETSGSTMEWALSLLLN

Query:  HPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINIWAIQNDPKIWAHPRKFDPDRFNRL
        HPE L+KA+ EI+ QVG  RLI+E D+  L YL  II+ET R+ P AP+L PHESS+DC V GY +P+ T+L +N WAI  DP+ W  P  F P+R   +
Subjt:  HPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINIWAIQNDPKIWAHPRKFDPDRFNRL

Query:  ESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPII
        E E  K  LMPFG+GRR CPG GL  R+VGL LG+LIQCFEW+R  +  +DM EG  +TMPKAQ L A C+PR I+
Subjt:  ESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPII

Arabidopsis top hits (e value, % identity, alignment)
AT3G50050.1 Eukaryotic aspartyl protease family protein | e value: 2.0e-186 | % identity: 52.17
Query:  QRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPE
        +RP++ PL++S  NS+ + +     +L    +   PHS   RMRL+DDLL NGYYTTRLWIGTPPQ FALIVD+GSTVTYVPCS+C +CG HQDP+FQPE
Subjt:  QRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPE

Query:  LSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNS
        +SSTYQPVKCN+DCNCDD+  QC YER YAE S+S GVL ED++SFG ES+L PQRAVFGCET+ETGDLY+QRADGI+GLG+G LS++DQLV KG++SNS
Subjt:  LSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNS

Query:  FSLCYGGMDIGGGAMVLGGISSPHGMLWDGN----SPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQIN
        F LCYGGMD+GGG+M+LGG   P  M++  +    SPYYNI+L  I VAGK L L+   F+G++GA+LDSGTTYAY P+ A+ AF++A+M+++S LKQI+
Subjt:  FSLCYGGMDIGGGAMVLGGISSPHGMLWDGN----SPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQIN

Query:  GPDPNFKDICFS-GAGRDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVR
        GPDPNFKD CF   A   V+ELSK+FP V+MVF +GQ   LSPENY+FRH+KV  AYCLG+F N  D TTLLG                      GI+VR
Subjt:  GPDPNFKDICFS-GAGRDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVR

Query:  NTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEV
        NTLV YDREN+ +GFW+TNCSEL   LH +  APPPA LPS   N       P   +   LSG  QVG I  ++ L VN S +K  I +L+++ + EL+V
Subjt:  NTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADELEV

Query:  IMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMT
                          S+ V + N TS+  +  +R  + P + + + SN TA +I+SR     ++LP+ FG+YQLV   +EPP K+T    +   V+ 
Subjt:  IMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMT

Query:  IGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVG-VVGPEQELQPL
        IG+ + ++VGL+A   WLIW+ R++    Y+PV   +  EQELQP+
Subjt:  IGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVG-VVGPEQELQPL

AT4G37320.1 cytochrome P450, family 81, subfamily D, polypeptide 5 | e value: 9.0e-131 | % identity: 48.43
Query:  NLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYG--PVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRLLISKCFGYNSTNLIWSSYGDNWRNL
        NLPP+P   LP++GHLHLL +P++RT ++IS   G  P+  LRLG+R V ++SS S+AEEC TKND+V ANRP ++++K  GYN TN+I +SYGD+WRNL
Subjt:  NLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYG--PVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRLLISKCFGYNSTNLIWSSYGDNWRNL

Query:  RRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCENQ---IVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAKLFRELQAESTRLTGKSYIGDFI
        RRI  VEI S+HR+   S +R +E+R LI  L +        V +K+   +L FN+++ M+ GKR+YG   +D DEAKL REL AE     G   + D++
Subjt:  RRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCENQ---IVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAKLFRELQAESTRLTGKSYIGDFI

Query:  PLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALMLLMLVAGTETSGSTMEWALSLLL
        P + W+   +    +     NR D ++Q L+++ R    K           T+I+ LL  QE+EPE Y D  I+ ++L +++AGT+TS  T+EWA+S LL
Subjt:  PLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALMLLMLVAGTETSGSTMEWALSLLL

Query:  NHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINIWAIQNDPKIWAHPRKFDPDRFNR
        NHPEIL+KA+ EIDD++G +RL++ESD+ +L YL+ I++ETLR+YP  PLL PH SS++C V GY +PR T+L  N+WA+  DP +W  P +F P+RF  
Subjt:  NHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINIWAIQNDPKIWAHPRKFDPDRFNR

Query:  LESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPII
         E E     LMPFG+GRR CPG  LG R+V L LG LIQ FEWER   ELVDMTEG  +TMPKA  L A C+ R I+
Subjt:  LESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPII

AT4G37340.1 cytochrome P450, family 81, subfamily D, polypeptide 3 | e value: 5.9e-130 | % identity: 48.8
Query:  FLYIPLFLVFYILTEHFLHKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYG--PVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRLLIS
        FL+I L L F I     + +  NLPP+P  +LP++GHL LL  P++R   ++S   G  P++ LRLG+R V +VSS S+AEEC TKND+V ANR   L S
Subjt:  FLYIPLFLVFYILTEHFLHKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYG--PVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRLLIS

Query:  KCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLI---QRLVKCENQIVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAK
        K   Y  T ++ +SYGD+WRNLRRI  VEI S HRL+  S +R +E+  LI    R    E   V MK+ F +LTFN+++RM+ GK +YGD  +D  EAK
Subjt:  KCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLI---QRLVKCENQIVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAK

Query:  LFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALML
          REL AE     G     D++P++ W+  GS   + +    +R D  +Q L+++ R          ++ R+NTM++ LL LQE++PE Y D  I+ +ML
Subjt:  LFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALML

Query:  LMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINIW
         +++AGT+TS  T+EW LS LLNHP+IL KA++EID++VG  RL++ESD++HLPYL+ I++E+LR+YP +PLL PH +SEDC VGGYH+PR TML  N W
Subjt:  LMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINIW

Query:  AIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTE-GIAVTMPKAQHLHAKCRPRPII
        AI  DPKIW  P  F P+RF   E E     L+ FGLGRR CPG GL  R+  L +GSLIQCFEWER  +E VDMTE G  V MPKA  L A C+ RP++
Subjt:  AIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTE-GIAVTMPKAQHLHAKCRPRPII

AT4G37360.1 cytochrome P450, family 81, subfamily D, polypeptide 2 | e value: 7.2e-128 | % identity: 46.49
Query:  FLYIPLFLVFYILTEHFLHKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYG--PVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRLLIS
        F +I L L+F I     + +  NLPP+P  +LP++GHL LL  P++R   ++S   G  P++ LRLG+R + +VSS S+AEEC TKND++ ANR   + +
Subjt:  FLYIPLFLVFYILTEHFLHKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYG--PVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANRPRLLIS

Query:  KCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCEN---QIVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAK
        K   Y ++ ++ +SY ++WRNLRRI  +EI S HRL+  S +R +E+R LI RL++  +     V MK+ F DLTFN+++RM+ GK +YGD  +D  EAK
Subjt:  KCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCEN---QIVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAK

Query:  LFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALML
          R L AE+   +G     D+IP++ W+ +  T  +++     R D  +Q L+++ R          ++ ++NTM++ LL LQE++PE Y D  I+  ML
Subjt:  LFRELQAESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALML

Query:  LMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINIW
         ++  GT+T+  T+EWALS LLN+PE+L KA++EID  +G +RL++ESD+ +LPYL+ I++ETLR+YP AP+L PH +S+DC VGGY +PR TML  N W
Subjt:  LMLVAGTETSGSTMEWALSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINIW

Query:  AIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPII
        AI  DP +W  P  F P+RF   E E     LMPFGLGRR CPG GL  R+V L LGSLIQCFEWER  +E VDMTEG  +TMPKA+ L A CR R  +
Subjt:  AIQNDPKIWAHPRKFDPDRFNRLESENYKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPII

AT5G43100.1 Eukaryotic aspartyl protease family protein | e value: 2.0e-199 | % identity: 55.64
Query:  QRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPE
        + P++ PL  S   S   R    D R R L     P   NA M+L+DDLL+NGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCS C +CG HQDP+FQPE
Subjt:  QRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLHDDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPE

Query:  LSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNS
        LS++YQ +KCN DCNCDD G  C YERRYAEMS+SSGVL+ED++SFG ES+L PQRAVFGCE  ETGDL++QRADGIMGLGRG LSV+DQLV KGV+ + 
Subjt:  LSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQRAVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNS

Query:  FSLCYGGMDIGGGAMVLGGISSPHGMLWDGN----SPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQIN
        FSLCYGGM++GGGAMVLG IS P GM++  +    SPYYNI+LK++HVAGK LKLNP  FNGK+G +LDSGTTYAYFP++A+ A KDA++K+I  LK+I+
Subjt:  FSLCYGGMDIGGGAMVLGGISSPHGMLWDGN----SPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDSGTTYAYFPEKAYYAFKDAIMKKISFLKQIN

Query:  GPDPNFKDICFSGAGRDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRN
        GPDPN+ D+CFSGAGRDV E+   FPE+ M F NGQK+ LSPENYLFRHTKV  AYCLGIF +R D TTLLG                      GI+VRN
Subjt:  GPDPNFKDICFSGAGRDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWIYFFRVKLHGFFPSAGIIVRN

Query:  TLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEI-PPPGSPTGP--FLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADEL
        TLVTYDREN  +GF KTNCS++W+    L+    PAP     QN S  I P P +   P   L G F+VGVITF + + VN SS+K   +E+A+ IA EL
Subjt:  TLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEI-PPPGSPTGP--FLSGEFQVGVITFNMMLHVNKSSVKLNITELAEVIADEL

Query:  EVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSV
        ++                  S  V +LNF+S   ++ ++W +FP  S+ YISN+TA++I+  LKE  L+LP +FGSY+L+E   E   K++W E+H   V
Subjt:  EVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSV

Query:  MTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVGVVGPEQELQPL
        +  G  ++L+V        L+WR R++E ++YEPV     EQELQPL
Subjt:  MTIGVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVGVVGPEQELQPL


Sequences
CDS sequence
ATGTTTTGTTGCCAAATGGCTTCAGGCTACAGACGAACAGCTATGGGAGCTATCTTTCTTTACATCCCTCTGTTTCTTGTGTTTTACATTTTAACAGAGCACTTCCTTCA
CAAGATCAGAAACCTCCCGCCCACCCCTTTCCCATCACTACCCATCCTTGGCCATCTCCACCTCTTGACCAAACCCATCTACCGCACTCTATACAACATCTCGAACCGCT
ACGGTCCAGTCGTGTTCCTCCGGTTGGGGTCTCGCTCGGTCCTGATCGTGTCGTCTCCTTCTGTAGCTGAAGAATGTTTGACTAAAAACGACATCGTGTTTGCAAACCGG
CCTCGTCTCCTCATATCCAAGTGCTTCGGATACAACAGTACGAATCTCATTTGGTCATCCTATGGGGACAATTGGCGCAACCTCCGCCGAATCTGCACCGTTGAAATTTT
GTCCACTCACCGCCTTCATATGCTTTCCATTGTTCGTTTTGAAGAGGTTAGGTCTTTAATTCAACGGCTTGTGAAATGTGAAAACCAAATAGTTAACATGAAAAATGCAT
TTTTCGATCTGACTTTCAACTCGATGTTGAGGATGATTGTTGGAAAACGTTTCTATGGCGATGACGTAGACGATGTTGACGAGGCAAAATTATTCAGGGAGTTACAAGCT
GAGTCAACTCGGTTGACTGGTAAATCATATATAGGAGATTTTATTCCATTAATGGCATGGTTGGGATTTGGTTCGACTTTAGGGGAAGAGATGTTGGATTGTCAAAATCG
AAGAGATGCTTTGATGCAGAGTTTAATCGAGCAACATCGAGGAAGAACGACAAAAATCGATGATTCGTTTCGTGATGGAAGAAAGAACACGATGATTGAAGTATTGTTAG
AATTGCAAGAATCCGAACCTGAACAATATAAAGACGAAACTATCAGAGCGTTGATGCTACTTATGTTAGTAGCAGGTACAGAAACATCAGGAAGCACAATGGAGTGGGCT
CTATCTCTTTTATTAAACCATCCAGAAATTTTAAAGAAAGCACAAAACGAAATTGACGATCAAGTTGGCCACGAACGTCTGATTGACGAATCCGACGTGGCCCACCTTCC
TTACCTACGTGGCATCATCAACGAGACCCTTCGGATGTACCCTCCAGCTCCATTACTAGCACCTCATGAATCATCAGAAGACTGCTCTGTTGGAGGCTACCATATTCCAC
GTGCCACTATGTTATACATCAATATATGGGCCATTCAGAACGACCCCAAAATTTGGGCCCACCCGAGAAAATTTGACCCCGATAGGTTTAACCGGTTGGAAAGTGAAAAT
TACAAGTTCAATTTGATGCCATTTGGGTTGGGGAGGAGGGGGTGTCCAGGAGAGGGTCTGGGCCTACGAATGGTTGGACTTGTATTGGGCTCATTGATTCAATGCTTTGA
ATGGGAGCGTCCTAGTGACGAATTGGTGGACATGACAGAAGGGATTGCAGTGACAATGCCCAAAGCCCAACACTTACATGCTAAATGCAGGCCCAGGCCCATAATTAGGT
TGAGCTTGCAACTTGTCGTCGGCTTGCCGTTGACCGTCTGCCCCGTCGTCAGTGCAATGGCTTTACTTCATTCAGTCTCAGTAACTTGGGGGCAAGCGGAGAGGGCATTT
TACCGGGTCCTCTCTGCGCCAATTTCCGTCATTTCTCTTCCGTTTGCACCGTCGCGTACGGTTCACCGGCACACCATGTCTCCGCCCTGCGCCAATTCATACTATGCCAT
CGTCTTCTATTATCTAGTTGGTTTTAATTTGCTAGGCATGATTCTTTCCTCTTCTGTGGACTCGAGAGATTTCGATTATCAGCAACGGCCGGTGCTTCTTCCTCTTTATA
TCTCGCCCATGAATTCTACTCATCAGCGCGTTTTAGACCGTGATCACCGTCTCCGGCACCTGCAGAACTTGGACAAGCCCCATTCGTCAAATGCTCGAATGAGGCTTCAC
GATGACCTCCTTACCAACGGGTATTACACGACGCGTCTATGGATTGGGACTCCTCCACAAGAATTTGCTCTTATTGTGGATACCGGAAGTACTGTGACATACGTTCCATG
CTCTAACTGCGTAGAGTGCGGGAATCATCAGGATCCAAGGTTCCAACCAGAGTTGTCTAGCACATATCAACCTGTTAAGTGCAATGTTGATTGTAACTGTGATGACAACG
GAGTCCAGTGTACATACGAGAGAAGGTATGCAGAGATGAGTACTAGCAGCGGTGTGCTTGCTGAGGACATCATGTCATTTGGAAAAGAAAGTGAACTCGTACCCCAGCGT
GCTGTCTTTGGCTGTGAAACTATGGAAACTGGTGATCTTTATACCCAACGTGCTGATGGGATTATGGGTTTGGGCCGTGGTACACTCAGTGTGATGGACCAACTTGTTGG
CAAGGGTGTTGTGAGCAATTCATTTTCATTATGTTATGGTGGGATGGATATTGGTGGGGGTGCAATGGTTCTTGGTGGGATCTCTTCACCCCATGGCATGCTTTGGGATG
GAAACAGCCCATATTATAATATAGAGTTGAAGGAAATACATGTTGCTGGGAAGCCATTGAAGTTGAATCCGAGCACTTTTAATGGGAAGTATGGTGCTATCTTGGATAGT
GGAACTACATATGCTTATTTTCCAGAAAAAGCCTACTATGCATTCAAGGACGCTATAATGAAGAAAATTAGTTTTCTGAAACAAATCAATGGTCCTGACCCGAATTTTAA
AGATATTTGTTTCTCTGGTGCTGGAAGGGATGTCACTGAACTTTCAAAAGTTTTTCCAGAGGTTGATATGGTTTTTGCTAATGGACAGAAAATTTCTCTTTCTCCAGAGA
ACTACTTGTTCCGGCATACTAAGGTTAGTCGTGCATATTGTCTGGGGATCTTTAAAAACAGGAATGATCAGACGACACTTTTGGGAGGTATGGCAAAGTTTCCCTGGATT
TACTTCTTTAGAGTAAAGCTGCATGGTTTCTTTCCATCGGCAGGAATTATTGTTCGTAATACTCTTGTCACTTATGATCGAGAGAACACTACAATTGGGTTTTGGAAGAC
TAACTGTTCTGAACTTTGGAAGAATCTGCATTATCTTTCTCCTGCTCCTCCTCCAGCTCCTTTACCTTCCTATGGTCAGAATACAAGTAAAGAAATCCCTCCACCTGGTT
CTCCAACTGGGCCATTTCTTTCTGGTGAATTTCAGGTTGGAGTCATAACATTTAATATGATGCTTCATGTCAACAAATCCTCCGTGAAACTTAACATCACTGAACTTGCA
GAAGTTATTGCCGATGAACTTGAAGTAATAATGAATTTAGAGTCAAATCTTTTCACCGTTCATGTTATCTCTTGTCGACTCTCCACCCTTGTTCATGTGCTGAACTTTAC
ATCGAGGGAAGCTGATTTCTTCATTAGATGGGCCATCTTCCCTGCTGACTCTGCTGGTTATATATCTAATTCCACGGCAATGGACATAATATCTCGCTTAAAGGAACGTG
GCTTGCAACTTCCTGATAAATTCGGAAGTTATCAGCTGGTTGAATTGAATGTTGAACCCCCGTTAAAGAAGACATGGATGGAGCAGCACTTCTGGTCTGTAATGACTATT
GGAGTAGCAGTTACCTTAGTAGTTGGATTGGCAGCCGGAAGCACATGGTTGATTTGGAGATACAGACGGAGGGAACTGAGTTCGTATGAGCCTGTCGGTGTAGTCGGACC
TGAGCAAGAGCTTCAGCCACTATAG
mRNA sequence
ATGTTTTGTTGCCAAATGGCTTCAGGCTACAGACGAACAGCTATGGGAGCTATCTTTCTTTACATCCCTCTGTTTCTTGTGTTTTACATTTTAACAGAGCACTTCCTTCA
CAAGATCAGAAACCTCCCGCCCACCCCTTTCCCATCACTACCCATCCTTGGCCATCTCCACCTCTTGACCAAACCCATCTACCGCACTCTATACAACATCTCGAACCGCT
ACGGTCCAGTCGTGTTCCTCCGGTTGGGGTCTCGCTCGGTCCTGATCGTGTCGTCTCCTTCTGTAGCTGAAGAATGTTTGACTAAAAACGACATCGTGTTTGCAAACCGG
CCTCGTCTCCTCATATCCAAGTGCTTCGGATACAACAGTACGAATCTCATTTGGTCATCCTATGGGGACAATTGGCGCAACCTCCGCCGAATCTGCACCGTTGAAATTTT
GTCCACTCACCGCCTTCATATGCTTTCCATTGTTCGTTTTGAAGAGGTTAGGTCTTTAATTCAACGGCTTGTGAAATGTGAAAACCAAATAGTTAACATGAAAAATGCAT
TTTTCGATCTGACTTTCAACTCGATGTTGAGGATGATTGTTGGAAAACGTTTCTATGGCGATGACGTAGACGATGTTGACGAGGCAAAATTATTCAGGGAGTTACAAGCT
GAGTCAACTCGGTTGACTGGTAAATCATATATAGGAGATTTTATTCCATTAATGGCATGGTTGGGATTTGGTTCGACTTTAGGGGAAGAGATGTTGGATTGTCAAAATCG
AAGAGATGCTTTGATGCAGAGTTTAATCGAGCAACATCGAGGAAGAACGACAAAAATCGATGATTCGTTTCGTGATGGAAGAAAGAACACGATGATTGAAGTATTGTTAG
AATTGCAAGAATCCGAACCTGAACAATATAAAGACGAAACTATCAGAGCGTTGATGCTACTTATGTTAGTAGCAGGTACAGAAACATCAGGAAGCACAATGGAGTGGGCT
CTATCTCTTTTATTAAACCATCCAGAAATTTTAAAGAAAGCACAAAACGAAATTGACGATCAAGTTGGCCACGAACGTCTGATTGACGAATCCGACGTGGCCCACCTTCC
TTACCTACGTGGCATCATCAACGAGACCCTTCGGATGTACCCTCCAGCTCCATTACTAGCACCTCATGAATCATCAGAAGACTGCTCTGTTGGAGGCTACCATATTCCAC
GTGCCACTATGTTATACATCAATATATGGGCCATTCAGAACGACCCCAAAATTTGGGCCCACCCGAGAAAATTTGACCCCGATAGGTTTAACCGGTTGGAAAGTGAAAAT
TACAAGTTCAATTTGATGCCATTTGGGTTGGGGAGGAGGGGGTGTCCAGGAGAGGGTCTGGGCCTACGAATGGTTGGACTTGTATTGGGCTCATTGATTCAATGCTTTGA
ATGGGAGCGTCCTAGTGACGAATTGGTGGACATGACAGAAGGGATTGCAGTGACAATGCCCAAAGCCCAACACTTACATGCTAAATGCAGGCCCAGGCCCATAATTAGGT
TGAGCTTGCAACTTGTCGTCGGCTTGCCGTTGACCGTCTGCCCCGTCGTCAGTGCAATGGCTTTACTTCATTCAGTCTCAGTAACTTGGGGGCAAGCGGAGAGGGCATTT
TACCGGGTCCTCTCTGCGCCAATTTCCGTCATTTCTCTTCCGTTTGCACCGTCGCGTACGGTTCACCGGCACACCATGTCTCCGCCCTGCGCCAATTCATACTATGCCAT
CGTCTTCTATTATCTAGTTGGTTTTAATTTGCTAGGCATGATTCTTTCCTCTTCTGTGGACTCGAGAGATTTCGATTATCAGCAACGGCCGGTGCTTCTTCCTCTTTATA
TCTCGCCCATGAATTCTACTCATCAGCGCGTTTTAGACCGTGATCACCGTCTCCGGCACCTGCAGAACTTGGACAAGCCCCATTCGTCAAATGCTCGAATGAGGCTTCAC
GATGACCTCCTTACCAACGGGTATTACACGACGCGTCTATGGATTGGGACTCCTCCACAAGAATTTGCTCTTATTGTGGATACCGGAAGTACTGTGACATACGTTCCATG
CTCTAACTGCGTAGAGTGCGGGAATCATCAGGATCCAAGGTTCCAACCAGAGTTGTCTAGCACATATCAACCTGTTAAGTGCAATGTTGATTGTAACTGTGATGACAACG
GAGTCCAGTGTACATACGAGAGAAGGTATGCAGAGATGAGTACTAGCAGCGGTGTGCTTGCTGAGGACATCATGTCATTTGGAAAAGAAAGTGAACTCGTACCCCAGCGT
GCTGTCTTTGGCTGTGAAACTATGGAAACTGGTGATCTTTATACCCAACGTGCTGATGGGATTATGGGTTTGGGCCGTGGTACACTCAGTGTGATGGACCAACTTGTTGG
CAAGGGTGTTGTGAGCAATTCATTTTCATTATGTTATGGTGGGATGGATATTGGTGGGGGTGCAATGGTTCTTGGTGGGATCTCTTCACCCCATGGCATGCTTTGGGATG
GAAACAGCCCATATTATAATATAGAGTTGAAGGAAATACATGTTGCTGGGAAGCCATTGAAGTTGAATCCGAGCACTTTTAATGGGAAGTATGGTGCTATCTTGGATAGT
GGAACTACATATGCTTATTTTCCAGAAAAAGCCTACTATGCATTCAAGGACGCTATAATGAAGAAAATTAGTTTTCTGAAACAAATCAATGGTCCTGACCCGAATTTTAA
AGATATTTGTTTCTCTGGTGCTGGAAGGGATGTCACTGAACTTTCAAAAGTTTTTCCAGAGGTTGATATGGTTTTTGCTAATGGACAGAAAATTTCTCTTTCTCCAGAGA
ACTACTTGTTCCGGCATACTAAGGTTAGTCGTGCATATTGTCTGGGGATCTTTAAAAACAGGAATGATCAGACGACACTTTTGGGAGGTATGGCAAAGTTTCCCTGGATT
TACTTCTTTAGAGTAAAGCTGCATGGTTTCTTTCCATCGGCAGGAATTATTGTTCGTAATACTCTTGTCACTTATGATCGAGAGAACACTACAATTGGGTTTTGGAAGAC
TAACTGTTCTGAACTTTGGAAGAATCTGCATTATCTTTCTCCTGCTCCTCCTCCAGCTCCTTTACCTTCCTATGGTCAGAATACAAGTAAAGAAATCCCTCCACCTGGTT
CTCCAACTGGGCCATTTCTTTCTGGTGAATTTCAGGTTGGAGTCATAACATTTAATATGATGCTTCATGTCAACAAATCCTCCGTGAAACTTAACATCACTGAACTTGCA
GAAGTTATTGCCGATGAACTTGAAGTAATAATGAATTTAGAGTCAAATCTTTTCACCGTTCATGTTATCTCTTGTCGACTCTCCACCCTTGTTCATGTGCTGAACTTTAC
ATCGAGGGAAGCTGATTTCTTCATTAGATGGGCCATCTTCCCTGCTGACTCTGCTGGTTATATATCTAATTCCACGGCAATGGACATAATATCTCGCTTAAAGGAACGTG
GCTTGCAACTTCCTGATAAATTCGGAAGTTATCAGCTGGTTGAATTGAATGTTGAACCCCCGTTAAAGAAGACATGGATGGAGCAGCACTTCTGGTCTGTAATGACTATT
GGAGTAGCAGTTACCTTAGTAGTTGGATTGGCAGCCGGAAGCACATGGTTGATTTGGAGATACAGACGGAGGGAACTGAGTTCGTATGAGCCTGTCGGTGTAGTCGGACC
TGAGCAAGAGCTTCAGCCACTATAGATCC
Protein sequence
MFCCQMASGYRRTAMGAIFLYIPLFLVFYILTEHFLHKIRNLPPTPFPSLPILGHLHLLTKPIYRTLYNISNRYGPVVFLRLGSRSVLIVSSPSVAEECLTKNDIVFANR
PRLLISKCFGYNSTNLIWSSYGDNWRNLRRICTVEILSTHRLHMLSIVRFEEVRSLIQRLVKCENQIVNMKNAFFDLTFNSMLRMIVGKRFYGDDVDDVDEAKLFRELQA
ESTRLTGKSYIGDFIPLMAWLGFGSTLGEEMLDCQNRRDALMQSLIEQHRGRTTKIDDSFRDGRKNTMIEVLLELQESEPEQYKDETIRALMLLMLVAGTETSGSTMEWA
LSLLLNHPEILKKAQNEIDDQVGHERLIDESDVAHLPYLRGIINETLRMYPPAPLLAPHESSEDCSVGGYHIPRATMLYINIWAIQNDPKIWAHPRKFDPDRFNRLESEN
YKFNLMPFGLGRRGCPGEGLGLRMVGLVLGSLIQCFEWERPSDELVDMTEGIAVTMPKAQHLHAKCRPRPIIRLSLQLVVGLPLTVCPVVSAMALLHSVSVTWGQAERAF
YRVLSAPISVISLPFAPSRTVHRHTMSPPCANSYYAIVFYYLVGFNLLGMILSSSVDSRDFDYQQRPVLLPLYISPMNSTHQRVLDRDHRLRHLQNLDKPHSSNARMRLH
DDLLTNGYYTTRLWIGTPPQEFALIVDTGSTVTYVPCSNCVECGNHQDPRFQPELSSTYQPVKCNVDCNCDDNGVQCTYERRYAEMSTSSGVLAEDIMSFGKESELVPQR
AVFGCETMETGDLYTQRADGIMGLGRGTLSVMDQLVGKGVVSNSFSLCYGGMDIGGGAMVLGGISSPHGMLWDGNSPYYNIELKEIHVAGKPLKLNPSTFNGKYGAILDS
GTTYAYFPEKAYYAFKDAIMKKISFLKQINGPDPNFKDICFSGAGRDVTELSKVFPEVDMVFANGQKISLSPENYLFRHTKVSRAYCLGIFKNRNDQTTLLGGMAKFPWI
YFFRVKLHGFFPSAGIIVRNTLVTYDRENTTIGFWKTNCSELWKNLHYLSPAPPPAPLPSYGQNTSKEIPPPGSPTGPFLSGEFQVGVITFNMMLHVNKSSVKLNITELA
EVIADELEVIMNLESNLFTVHVISCRLSTLVHVLNFTSREADFFIRWAIFPADSAGYISNSTAMDIISRLKERGLQLPDKFGSYQLVELNVEPPLKKTWMEQHFWSVMTI
GVAVTLVVGLAAGSTWLIWRYRRRELSSYEPVGVVGPEQELQPL