; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018174 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018174
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionGlycosyltransferase
Genome locationtig00153113:784338..799213
RNA-Seq ExpressionSgr018174
SyntenySgr018174
Gene Ontology termsGO:0008194 - UDP-glycosyltransferase activity (molecular function)
InterPro domainsIPR002213 - UDP-glucuronosyl/UDP-glucosyltransferase
IPR035595 - UDP-glycosyltransferase family, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575213.1 hypothetical protein SDJN03_25852, partial [Cucurbita argyrosperma subsp. sororia]5.3e-24260.05Show/hide
Query:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKR--QLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGP
        ME+KL A  SKRSS ++QP++         LQAG LH PRKKPK     QP+E  SKDGDGVS F+AKDLR+KRVFSPNL  EN S V SGE ISDKEGP
Subjt:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKR--QLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGP

Query:  IIANGSWPIE----MKASEIAP-------RLQRY----------------------------GKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTG
        + ANG+   E     K SEI            RY                            G ++ A  +GC  SSNG V GD  A  D RI S++RTG
Subjt:  IIANGSWPIE----MKASEIAP-------RLQRY----------------------------GKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTG

Query:  SVLEPCIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISF
        SVL+PC KRKLFKAPGSIAYKR+LP+LLD+DN+TL   P+SKRE NLEK+ NIESNLCN A  SS VDSD CVKNA F S  +CK MKLNLP PD     
Subjt:  SVLEPCIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISF

Query:  SKPSVNGETEIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD-----------------
             NG+T+ F+NG        SD N+DPT+V+EGS LK DNVVCAS  D+R      PTK+ TE  S KEQ KTSG+ERLD                 
Subjt:  SKPSVNGETEIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD-----------------

Query:  ----------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVN
                                          VKEERRDEKV CTQGADQ LGSSTVGEN  NI T SDKK+G+YVRNK+VRN LV+LKL  SQASV+
Subjt:  ----------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVN

Query:  YRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSP
        YRRMLPFLE+LFKDN  NCALGNINCPR EKELAT NL P SSNSHN+ +KS  LVSC  PCDG+SD LSM  SN++N+ VCE +  LM A V+D LLSP
Subjt:  YRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSP

Query:  P-KLQMHQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACI
        P       LHS+QE L++C+L++DPQ       LNDQAVS  Y A +YEP+T EGSRM ++QS  TSEDCT+LT+ VSDG KLSERNSL P    +EACI
Subjt:  P-KLQMHQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACI

Query:  LPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKD-NAGYHSNKVKEACRKASEAESI
        LPEN  NVRKGILKRNPRGCRGIC CLNCSSFRLHAERAFEFSRNQLQDAEEV SDLMKEL  LR VL+K++D  K+ +AGYHSNKVKEACRKASEAE I
Subjt:  LPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKD-NAGYHSNKVKEACRKASEAESI

Query:  AKGRLLQMNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK
        AK RLLQMNYEL IHCRITCSQRPNVRFS +VE IEIEDGK
Subjt:  AKGRLLQMNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK

XP_022147234.1 uncharacterized protein LOC111016232 [Momordica charantia]1.5e-25762.71Show/hide
Query:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKRQLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGPII
        MES+LGAKTSKR S I+QPQ+         LQ+G L SPRK+P+RQ QP EFPSKDGDGVS F+AKDLRIKRVFS NLPD +GS   SGEQISDKEG II
Subjt:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKRQLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGPII

Query:  ANGSWPIE----MKASEIAPRL-------------QRY-GK--------------WMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTGSVLEPCIKR
        ANGS P E     K SE  P +             QR+ GK                 A  DGC  SSN      TYA  D R+ S++RTGSVL+PC+KR
Subjt:  ANGSWPIE----MKASEIAPRL-------------QRY-GK--------------WMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTGSVLEPCIKR

Query:  KLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISFSKPSVNGET
        KLFKAPGS+AYKRLLP+LLD+DNYT+QG P+ KRE NLEKK NIESNL N  KE   VDSD CVK + F S MSCKA K NL PPD          N +T
Subjt:  KLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISFSKPSVNGET

Query:  EIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD--------------------------
        + F+NGNPSGNQKCSDSNN+  +VK  SGLK D+ VCASSHDK  TENGG TKH  EH+SY EQSKTS LER+D                          
Subjt:  EIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD--------------------------

Query:  ------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVNYRRM
                                      + EERRDEKVECT+GADQ LG ST+GEN SN+   SDK+HGS VRNK VRN LV+LK F  Q SV+YRRM
Subjt:  ------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVNYRRM

Query:  LPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSPPKLQ
        LPFLE+LFKDNS NC L NI+CPR + ELAT +LQ  SS+SHN+L+KS GLVSC KPCDG+SDTL    S  MNE VCETE  L  AEVND+LLSP KLQ
Subjt:  LPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSPPKLQ

Query:  MHQLHSEQELDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACILPENPA
        MHQLHSEQ LDE K+E+DPQLHGAA + NDQAVS   P+ +YE              P+T EDCTSLTDNVSDGAKLS+ NSLGPNSS   ACILPEN  
Subjt:  MHQLHSEQELDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACILPENPA

Query:  NVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKDNAGYHSNKVKEACRKASEAESIAKGRLLQ
        NV+KGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEV SDLMKELSFLRDVL++FSDGVK +AGYHS KVKEACRKA+EAE IAK R LQ
Subjt:  NVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKDNAGYHSNKVKEACRKASEAESIAKGRLLQ

Query:  MNYELDIHCRITCSQRPNVRFSGDVE
        MNYELDIHCRITCSQRP+VRFS D+E
Subjt:  MNYELDIHCRITCSQRPNVRFSGDVE

XP_022959043.1 uncharacterized protein LOC111460150 [Cucurbita moschata]3.2e-23959.45Show/hide
Query:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKR--QLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGP
        ME+KL A  SKRSS ++QP++         LQAG LH PRKKPK     QP+E  SKDGDGVS F+AKDLR+KRVFSPNL  EN S V SGE ISDKEGP
Subjt:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKR--QLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGP

Query:  IIANGSWPIE----MKASEIAP-------RLQRY----------------------------GKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTG
        + ANG+   E     K SEI            RY                            G ++ A  +GC  SSNG V GD  A  D R+ S++RTG
Subjt:  IIANGSWPIE----MKASEIAP-------RLQRY----------------------------GKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTG

Query:  SVLEPCIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISF
        SVL+PC KRKLFKAPGSIAYKR+LP+LLD+DN+TL   P+SKRE NLEKK NIESNLCN A  SS VDSD CVKNA F S  +CK MKLNLP PD     
Subjt:  SVLEPCIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISF

Query:  SKPSVNGETEIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD-----------------
             NG+T+ F+NG        SD N+DPT+V+EGS LK DNVVCAS  D+R      PTK+ TE  S KEQSKTSG+ERLD                 
Subjt:  SKPSVNGETEIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD-----------------

Query:  ----------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVN
                                          VKEERR+EKV C++GADQ LGS TVGEN  NI T SDKK+G+YVRNK+VRN LV+LKL  SQASV+
Subjt:  ----------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVN

Query:  YRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSP
        YRRMLPFLE+LFKDN  NCALGNINCPR EKELAT NL   SSNS+N+ +KS  LVSC  PCDG+SD LS+  SN++N+ VCE +  LM A VND LLSP
Subjt:  YRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSP

Query:  P-KLQMHQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACI
        P       L S+QE L++CKL++DPQ       LNDQAVS  Y A +YEP+T EGSRM ++QSP TSEDCT+LT+ VSDG KL ERNSL P    +EACI
Subjt:  P-KLQMHQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACI

Query:  LPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKD-NAGYHSNKVKEACRKASEAESI
        LPEN  NVRKGILKRNPRGCRGIC CLNCSSFRLHAERAFEFSRNQLQDAEEV SDLMKEL  LR VL+K++D  K+ +AGYHSNKVKEACRKASEAE I
Subjt:  LPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKD-NAGYHSNKVKEACRKASEAESI

Query:  AKGRLLQMNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK
        AK RLLQMNYEL IHCRITCSQRPNVRFS +VE IEIEDGK
Subjt:  AKGRLLQMNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK

XP_023006513.1 uncharacterized protein LOC111499219 [Cucurbita maxima]7.9e-23859.33Show/hide
Query:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKR--QLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGP
        MESKL A  SKRSS ++QP++         LQAG LH PRKKPK     Q +E  SKDGDGVS F+AKDLR+KRVFSPNL  EN S V SGE ISDKEGP
Subjt:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKR--QLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGP

Query:  IIANGSWPIE----MKASEIAP-------RLQRY----------------------------GKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTG
        + ANG+   E     K SEI            RY                            G ++ A   GC  SSNG V GD  A  D RI S++RTG
Subjt:  IIANGSWPIE----MKASEIAP-------RLQRY----------------------------GKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTG

Query:  SVLEPCIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISF
        SVL+PC KRKLFKAPGSIAYKR+LP+LLD+DN+TL   P+ KRE NLEKK NIESNLCN A  SS VDSD CVKNA F S  +CK MKL+LPPPD     
Subjt:  SVLEPCIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISF

Query:  SKPSVNGETEIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD-----------------
             NG+T+ F+NG        SD ++DPT+V+EGS LK DNVVCAS  D+R      PTK+  E  S +EQSKTSG+ERLD                 
Subjt:  SKPSVNGETEIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD-----------------

Query:  ----------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVN
                                          VKEERRDEKV CT+GAD+ LGSSTVGEN  NI T SDKK+G+YVRNK+VRN L +LKL  SQASV+
Subjt:  ----------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVN

Query:  YRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSP
        YRRMLPFLE+LFKDN  NCA GNINCPR EKEL T NL P SSNSHN+ +KS  LVSC  PCDG+SD LSM  SN++N+ VCE +  LM A VND LLSP
Subjt:  YRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSP

Query:  P-KLQMHQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACI
        P       LHS+ E L++CKL++DPQ       LNDQAVS  Y A +YEP+T EGSRM +KQSP TSEDCT+LT+ VSDG KL+ERNSL P    VEACI
Subjt:  P-KLQMHQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACI

Query:  LPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKD-NAGYHSNKVKEACRKASEAESI
        LPEN  N+RKGILKRN RGCRGIC CLNCSSFRLHAERAFEFSRNQLQDAEEV SDLMKEL  LR VL+K++D  K+ +AGYHSNKVKEACRKASEAE I
Subjt:  LPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKD-NAGYHSNKVKEACRKASEAESI

Query:  AKGRLLQMNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK
        AK RLLQMNYEL IHCRITCSQRPNVRFS +VE IEIEDGK
Subjt:  AKGRLLQMNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK

XP_023547530.1 uncharacterized protein LOC111806447 [Cucurbita pepo subsp. pepo]1.6e-23859.45Show/hide
Query:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKR--QLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGP
        MESKL A  SKRSS ++QP++         LQAG LH PRKKPK     Q +E  SKDGDGVS F+AKDLR+KRVFSPNL  EN S V SGE ISDKEGP
Subjt:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKR--QLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGP

Query:  IIANGSWPIE----MKASEIAP-------RLQRY----------------------------GKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTG
        + ANG+   E     K SEI         +  RY                            G ++ A  +GC  SSNG V GD  A  D RI S++RTG
Subjt:  IIANGSWPIE----MKASEIAP-------RLQRY----------------------------GKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTG

Query:  SVLEPCIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISF
        SVL+PC KRKLFKAPGSIAYKR+LP+LLD+D +TL   P+SKRE NLEKK NIESNLCN A  SS VDSD CVKNA F    +CK MKLNLPPPD     
Subjt:  SVLEPCIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISF

Query:  SKPSVNGETEIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD-----------------
             NG+T+ F+NG        SD N+DPT+V+EGS LK DNVVCAS  D+R      PTK+ TE  S KEQSKTSG+ERLD                 
Subjt:  SKPSVNGETEIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD-----------------

Query:  ----------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVN
                                          VKEERRDEKV CT+GAD+ LGSSTVGEN  NI T SDKK+G+YVRNK+VRN LV+LKL  SQASV+
Subjt:  ----------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVN

Query:  YRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSP
        YRRMLPFLE+LFKDN  NCA GNI+CPR EKEL T NL P SSNSHN+ +KS  LVSC  PCDG+SD LSM  SN++N+ VCE +  LM A VND LLSP
Subjt:  YRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSP

Query:  P-KLQMHQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACI
        P       LHS+QE L++CKL++D Q       LNDQA S  Y A +YEP++ EGSRM  +QSP TSEDCT+ T+ VSDG KLSERNSL P    +EACI
Subjt:  P-KLQMHQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACI

Query:  LPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKD-NAGYHSNKVKEACRKASEAESI
        LPEN  NVRKGILKRNPRGCRGIC CLNCSSFRLHAERAFEFSRNQLQDAEEV SDLMKEL  LR VL+K++D  K+ +AGYHSNKVKEACRKASEAE I
Subjt:  LPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKD-NAGYHSNKVKEACRKASEAESI

Query:  AKGRLLQMNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK
        AK RLLQMNYEL IHCRITCSQRPNVRFS +VE IEIEDGK
Subjt:  AKGRLLQMNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK

TrEMBL top hitse value%identityAlignment
A0A6J1D0E4 uncharacterized protein LOC1110162327.4e-25862.71Show/hide
Query:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKRQLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGPII
        MES+LGAKTSKR S I+QPQ+         LQ+G L SPRK+P+RQ QP EFPSKDGDGVS F+AKDLRIKRVFS NLPD +GS   SGEQISDKEG II
Subjt:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKRQLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGPII

Query:  ANGSWPIE----MKASEIAPRL-------------QRY-GK--------------WMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTGSVLEPCIKR
        ANGS P E     K SE  P +             QR+ GK                 A  DGC  SSN      TYA  D R+ S++RTGSVL+PC+KR
Subjt:  ANGSWPIE----MKASEIAPRL-------------QRY-GK--------------WMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTGSVLEPCIKR

Query:  KLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISFSKPSVNGET
        KLFKAPGS+AYKRLLP+LLD+DNYT+QG P+ KRE NLEKK NIESNL N  KE   VDSD CVK + F S MSCKA K NL PPD          N +T
Subjt:  KLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISFSKPSVNGET

Query:  EIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD--------------------------
        + F+NGNPSGNQKCSDSNN+  +VK  SGLK D+ VCASSHDK  TENGG TKH  EH+SY EQSKTS LER+D                          
Subjt:  EIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD--------------------------

Query:  ------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVNYRRM
                                      + EERRDEKVECT+GADQ LG ST+GEN SN+   SDK+HGS VRNK VRN LV+LK F  Q SV+YRRM
Subjt:  ------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVNYRRM

Query:  LPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSPPKLQ
        LPFLE+LFKDNS NC L NI+CPR + ELAT +LQ  SS+SHN+L+KS GLVSC KPCDG+SDTL    S  MNE VCETE  L  AEVND+LLSP KLQ
Subjt:  LPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSPPKLQ

Query:  MHQLHSEQELDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACILPENPA
        MHQLHSEQ LDE K+E+DPQLHGAA + NDQAVS   P+ +YE              P+T EDCTSLTDNVSDGAKLS+ NSLGPNSS   ACILPEN  
Subjt:  MHQLHSEQELDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACILPENPA

Query:  NVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKDNAGYHSNKVKEACRKASEAESIAKGRLLQ
        NV+KGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEV SDLMKELSFLRDVL++FSDGVK +AGYHS KVKEACRKA+EAE IAK R LQ
Subjt:  NVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKDNAGYHSNKVKEACRKASEAESIAKGRLLQ

Query:  MNYELDIHCRITCSQRPNVRFSGDVE
        MNYELDIHCRITCSQRP+VRFS D+E
Subjt:  MNYELDIHCRITCSQRPNVRFSGDVE

A0A6J1H4T6 uncharacterized protein LOC1114601501.5e-23959.45Show/hide
Query:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKR--QLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGP
        ME+KL A  SKRSS ++QP++         LQAG LH PRKKPK     QP+E  SKDGDGVS F+AKDLR+KRVFSPNL  EN S V SGE ISDKEGP
Subjt:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKR--QLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGP

Query:  IIANGSWPIE----MKASEIAP-------RLQRY----------------------------GKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTG
        + ANG+   E     K SEI            RY                            G ++ A  +GC  SSNG V GD  A  D R+ S++RTG
Subjt:  IIANGSWPIE----MKASEIAP-------RLQRY----------------------------GKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTG

Query:  SVLEPCIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISF
        SVL+PC KRKLFKAPGSIAYKR+LP+LLD+DN+TL   P+SKRE NLEKK NIESNLCN A  SS VDSD CVKNA F S  +CK MKLNLP PD     
Subjt:  SVLEPCIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISF

Query:  SKPSVNGETEIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD-----------------
             NG+T+ F+NG        SD N+DPT+V+EGS LK DNVVCAS  D+R      PTK+ TE  S KEQSKTSG+ERLD                 
Subjt:  SKPSVNGETEIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD-----------------

Query:  ----------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVN
                                          VKEERR+EKV C++GADQ LGS TVGEN  NI T SDKK+G+YVRNK+VRN LV+LKL  SQASV+
Subjt:  ----------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVN

Query:  YRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSP
        YRRMLPFLE+LFKDN  NCALGNINCPR EKELAT NL   SSNS+N+ +KS  LVSC  PCDG+SD LS+  SN++N+ VCE +  LM A VND LLSP
Subjt:  YRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSP

Query:  P-KLQMHQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACI
        P       L S+QE L++CKL++DPQ       LNDQAVS  Y A +YEP+T EGSRM ++QSP TSEDCT+LT+ VSDG KL ERNSL P    +EACI
Subjt:  P-KLQMHQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACI

Query:  LPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKD-NAGYHSNKVKEACRKASEAESI
        LPEN  NVRKGILKRNPRGCRGIC CLNCSSFRLHAERAFEFSRNQLQDAEEV SDLMKEL  LR VL+K++D  K+ +AGYHSNKVKEACRKASEAE I
Subjt:  LPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKD-NAGYHSNKVKEACRKASEAESI

Query:  AKGRLLQMNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK
        AK RLLQMNYEL IHCRITCSQRPNVRFS +VE IEIEDGK
Subjt:  AKGRLLQMNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK

A0A6J1HGP9 uncharacterized protein LOC111464172 isoform X18.0e-22055.76Show/hide
Query:  SKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKRQLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGPIIANGSWP---
        SKRSS +HQPQ          LQAG LH PRKKPK     DE  SK GD + ++ AKDLRIKRVFSPN   +N S +PS  QISD+E  I  NG+ P   
Subjt:  SKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKRQLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGPIIANGSWP---

Query:  -----------------------IEMK------ASEIAPRLQ--------RYGKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTGSVLEPCIKRK
                               +E+       + +I  R+           G  + A  +GC  SSNG V GDT A  D RI +++RTGSVL+PC KRK
Subjt:  -----------------------IEMK------ASEIAPRLQ--------RYGKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTGSVLEPCIKRK

Query:  LFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISFSKPSVNGETE
        LFKAPGSIAYKRLLP+L D DNY L+G   SKREK LEKK NIESNLCNRA ESS +DSD  V NA     +SC  MKLN  PP           NG+ +
Subjt:  LFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISFSKPSVNGETE

Query:  IFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD---------------------------
         F+NG        SDS NDPT+VKE SGLK D VV  SS DK+ TENGGP++HQ E     EQSKT  +ERLD                           
Subjt:  IFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD---------------------------

Query:  -----------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVNYRRML
                                     VKEER++EKV CT+G DQNLGSSTVGEN  N  T SDKK+G  VRNKVVRN LV+LK   SQ SV++RRML
Subjt:  -----------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVNYRRML

Query:  PFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSPPKLQM
        PFL +LFKDN  NCA GNI+ PR EKEL T NLQ  SSNSHN+ ++S GL SC  PCDGS DT SM  SNTMNE VCETE  L+   +ND+LLS PKLQM
Subjt:  PFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSPPKLQM

Query:  HQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACILPENPA
        H LHSEQE LD C L+VDPQLH       DQAV      ++Y+P+T +GSRM+++QSPITSE CT+LTDNVSD AKLSERNSL PNS CVE C+ P    
Subjt:  HQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACILPENPA

Query:  NVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKDNAGYHSNKVKEACRKASEAESIAKGRLLQ
        NV KGI K+NPRGCRGIC CLNCSSFRLHAERAFEFSRNQLQDAE V SDLMKELSF+RDVL+K SDG   +AGY+SNKVKEACRKASEAE +AK RLLQ
Subjt:  NVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKDNAGYHSNKVKEACRKASEAESIAKGRLLQ

Query:  MNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK
        MN +LDI  RI C QRPNVRFS +++  +IE GK
Subjt:  MNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK

A0A6J1KHD4 uncharacterized protein LOC111494390 isoform X17.0e-22456.59Show/hide
Query:  SKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKRQLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGPIIANGSWP---
        SKRSS +HQPQS         LQAG LH PRKKPKR    DE  S  GD +S++ AKDLRIKRVFSPNL  +N S VPS  QISD+E PI ANG+ P   
Subjt:  SKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKRQLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGPIIANGSWP---

Query:  -------------------------------------IEMKASEIAPRLQRYGKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTGSVLEPCIKRK
                                             +E+  S         G  + A  +GC  SS+G V GD  A  D RI S++RTGSVL+PC KRK
Subjt:  -------------------------------------IEMKASEIAPRLQRYGKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTGSVLEPCIKRK

Query:  LFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISFSKPSVNGETE
        LFKAPGSIAYKRLLP+LLD DNY LQG   SKRE NLEKK NIESN CNRA ESS VDSD  VK A     +SC  MKLNL PPD          NG+T+
Subjt:  LFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISFSKPSVNGETE

Query:  IFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD---------------------------
         F NG        SDS NDPT+VKE SGLK D VV  SS DKR TENG             +QSKT G+ERLD                           
Subjt:  IFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD---------------------------

Query:  -----------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVNYRRML
                                     VKEER++EKV CT+G DQNLGSSTVGEN  NI T SDKK+G  VRNKV+RN LV+LK   SQ  V+YRRML
Subjt:  -----------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVNYRRML

Query:  PFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSPPKLQM
        PFLE+LFKDN  NCA  NI+ PR EKEL T NLQ  SSNSHN+ +KS  L SC  PC+G+ DT SM   NTMNE VCETE  L+   + D+LLS PKLQM
Subjt:  PFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSPPKLQM

Query:  HQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACILPENPA
        H  HSEQE LD+C L+VDPQLH       DQAV  LY AA+Y+P+T EGSRM+++QSPITSE CT+LTDNVSD AKLSERNSL PNS CVE C+LPE+  
Subjt:  HQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACILPENPA

Query:  NVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKDNAGYHSNKVKEACRKASEAESIAKGRLLQ
        NV KGI K+NPRGCRGIC CLNCSSFRLHAERAFEFSRNQLQDAE V SDLMKELSF+RDVL+K S+G   +AGY+SNKVKEACRKASEAE +AK RL Q
Subjt:  NVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKDNAGYHSNKVKEACRKASEAESIAKGRLLQ

Query:  MNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK
        MN +LDIH RI C QRPNVRFS +++  EIE GK
Subjt:  MNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK

A0A6J1L546 uncharacterized protein LOC1114992193.8e-23859.33Show/hide
Query:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKR--QLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGP
        MESKL A  SKRSS ++QP++         LQAG LH PRKKPK     Q +E  SKDGDGVS F+AKDLR+KRVFSPNL  EN S V SGE ISDKEGP
Subjt:  MESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKR--QLQPDEFPSKDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGP

Query:  IIANGSWPIE----MKASEIAP-------RLQRY----------------------------GKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTG
        + ANG+   E     K SEI            RY                            G ++ A   GC  SSNG V GD  A  D RI S++RTG
Subjt:  IIANGSWPIE----MKASEIAP-------RLQRY----------------------------GKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTG

Query:  SVLEPCIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISF
        SVL+PC KRKLFKAPGSIAYKR+LP+LLD+DN+TL   P+ KRE NLEKK NIESNLCN A  SS VDSD CVKNA F S  +CK MKL+LPPPD     
Subjt:  SVLEPCIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISF

Query:  SKPSVNGETEIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD-----------------
             NG+T+ F+NG        SD ++DPT+V+EGS LK DNVVCAS  D+R      PTK+  E  S +EQSKTSG+ERLD                 
Subjt:  SKPSVNGETEIFRNGNPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLD-----------------

Query:  ----------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVN
                                          VKEERRDEKV CT+GAD+ LGSSTVGEN  NI T SDKK+G+YVRNK+VRN L +LKL  SQASV+
Subjt:  ----------------------------------VKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGSYVRNKVVRNQLVRLKLFNSQASVN

Query:  YRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSP
        YRRMLPFLE+LFKDN  NCA GNINCPR EKEL T NL P SSNSHN+ +KS  LVSC  PCDG+SD LSM  SN++N+ VCE +  LM A VND LLSP
Subjt:  YRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETENFLMLAEVNDKLLSP

Query:  P-KLQMHQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACI
        P       LHS+ E L++CKL++DPQ       LNDQAVS  Y A +YEP+T EGSRM +KQSP TSEDCT+LT+ VSDG KL+ERNSL P    VEACI
Subjt:  P-KLQMHQLHSEQE-LDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEACI

Query:  LPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKD-NAGYHSNKVKEACRKASEAESI
        LPEN  N+RKGILKRN RGCRGIC CLNCSSFRLHAERAFEFSRNQLQDAEEV SDLMKEL  LR VL+K++D  K+ +AGYHSNKVKEACRKASEAE I
Subjt:  LPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKD-NAGYHSNKVKEACRKASEAESI

Query:  AKGRLLQMNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK
        AK RLLQMNYEL IHCRITCSQRPNVRFS +VE IEIEDGK
Subjt:  AKGRLLQMNYELDIHCRITCSQRPNVRFSGDVENIEIEDGK

SwissProt top hitse value%identityAlignment
A7MAS5 Phloretin 4'-O-glucosyltransferase1.0e-12351.9Show/hide
Query:  MRNHHFLLVCFPSQGYINPSLQLARRLI-SLGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLIT
        M  H FLLV FP+QG+INPSLQ A+RLI + G +VT+ T++ A RR+ N     +   GL++A FSDG+DD   KP  ++  Y SELRR G +++T L+ 
Subjt:  MRNHHFLLVCFPSQGYINPSLQLARRLI-SLGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLIT

Query:  FAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSS-----YIELPGLPL-LTSQDMPSFFSPSGPHAFI
         +ANEG P T ++YSLLL W+A +A    +PS L + QPATV  +Y+YYF+GY D I       ++      IELPGLPL  TS+D+PSF   + P+ F 
Subjt:  FAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSS-----YIELPGLPL-LTSQDMPSFFSPSGPHAFI

Query:  LPPMREQAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSE-DYIEWLNSKPDSSVVYISVGSICVLSD
        LP  +EQ E L R++ P +LVNTFDALE +AL+AIDK  ++ +GPLIPSA L G D SD SF GDLFQ S +  Y+EWLNSKP+ SV+Y+S GSI VL  
Subjt:  LPPMREQAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSE-DYIEWLNSKPDSSVVYISVGSICVLSD

Query:  EQEDELVHALLNSG---------------------HTQEKLK---KLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQA
         Q +E+   LL+ G                       +E L+   +LEE G +V WC QVEVL  P+LGCF+THCGWNS++ESLVSG+PVVAFPQ  DQ 
Subjt:  EQEDELVHALLNSG---------------------HTQEKLK---KLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQA

Query:  TNAKLIEDVWKTGVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGEETSMESKLGA
        TNAKLIED WKTGVRV  N EGIV  EE++RCLDLV+GS +   E++ RNAKKWK+LAR+A+ E  S +  L A
Subjt:  TNAKLIEDVWKTGVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGEETSMESKLGA

F8WKW0 Crocetin glucosyltransferase, chloroplastic1.0e-12350.54Show/hide
Query:  MMRNHHFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLIT
        M++  H LL+ +P+QG+INP+LQ A+RL+ +G+ VT AT+V A  RMK  +   +T  GL+FATFSDG+DD       D T Y S L + GS +L ++I 
Subjt:  MMRNHHFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLIT

Query:  FAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGP--HAFILPPM
         +A++G P+T ++Y+LLL WAA +A    IPSAL + QP  V+ +Y+YYF GY D + +   DP+  I+ PGLP + ++D+PSF  PS    ++F LP  
Subjt:  FAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGP--HAFILPPM

Query:  REQAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIEWLNSKPDSSVVYISVGSICVLSDEQEDE
        ++Q E L  + +PKVLVNTFDALE  AL+AI+   ++AIGPL PSA L G D S+ SF GDLFQ  S+DY EWLNS+P  SVVY+S GS+  L  +Q +E
Subjt:  REQAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIEWLNSKPDSSVVYISVGSICVLSDEQEDE

Query:  LVHALLNSGH------------TQEK-------LKKLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDVW
        +   LL SG              +EK       +++LEEQG +V WC Q+EVL HP+LGCF+THCGWNST+E+LV G+PVVAFP   DQ TNAKLIEDVW
Subjt:  LVHALLNSGH------------TQEK-------LKKLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDVW

Query:  KTGVRVKANTEGIVEREEIRRCLDLVMGSRDGQK-EEIERNAKKWKELARQAIGEETSMESKLGA
        +TGVRV  N +G VE +EI+RC++ VM   DG+K  E++RNAKKWKELAR+A+ E+ S +  L A
Subjt:  KTGVRVKANTEGIVEREEIRRCLDLVMGSRDGQK-EEIERNAKKWKELARQAIGEETSMESKLGA

K4CWS6 UDP-glycosyltransferase 75C12.4e-12853.48Show/hide
Query:  HFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLITFAANE
        H LLV FP+QG+INPSLQ A+RLI +G+ VTF T+V A RRM      TA   GL+ A FSDGFDD       D   Y SE+R  GS++L  +I  +++E
Subjt:  HFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLITFAANE

Query:  GRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTI-CSKLQDPSSYIELPGLPLLTSQDMPSFF----SPSGPHAFILPPMRE
        GRP+T ++Y+LLL WAA++A    IPSAL + QPATVL +Y+YYF+GY D + CS   DP+  I+LP LPLL SQD+PSF     S    ++F LP  +E
Subjt:  GRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTI-CSKLQDPSSYIELPGLPLLTSQDMPSFF----SPSGPHAFILPPMRE

Query:  QAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIEWLNSKPDSSVVYISVGSICVLSDEQEDELV
        Q + L  +  PKVLVNTFDALE + L+AI+K  ++ IGPLIPS+ LGG DS ++SF GDLFQ S++DY+EWLN+KP SS+VYIS GS+  LS  Q++E+ 
Subjt:  QAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIEWLNSKPDSSVVYISVGSICVLSDEQEDELV

Query:  HALL-------------NSGHTQEKLK---KLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDVWKTGVR
          L+                  +EKL    +LE+QGK+V WC Q+EVL HP+LGCF++HCGWNST+ESL SG+PVVAFP   DQ TNAKLIEDVWKTGVR
Subjt:  HALL-------------NSGHTQEKLK---KLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDVWKTGVR

Query:  VKANTEGIVEREEIRRCLDLVMGSRDGQK-EEIERNAKKWKELARQAIGEETSMESKLGA
        ++ N +G+VE +EI+RC+++VM    G+K EE+ +NA+KWKELAR A+ E  S E  L A
Subjt:  VKANTEGIVEREEIRRCLDLVMGSRDGQK-EEIERNAKKWKELARQAIGEETSMESKLGA

Q9ZR26 Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 27.9e-10848.68Show/hide
Query:  MRNHHFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNKTHQTA-TTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLIT
        M     LL  FP+QG+INP+LQ A+RL+  G +VTF T+V A RRM N     A   PGL F  FSDG+DD  LKP GD   Y SE++  GSE+L +L+ 
Subjt:  MRNHHFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNKTHQTA-TTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLIT

Query:  FAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMRE
                +TFV+YS L +WAA++A    +P+AL + +PATVL +Y +YF+GY D    ++   S+ I+LP LP L  + +P+F  P+ P  F L  M+E
Subjt:  FAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMRE

Query:  QAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSE-DYIEWLNSKPDSSVVYISVGSICVLSDEQEDEL
        + E L  + + KVLVNTFDALE DAL AID+ +++ IGPLIPSA L G D S+ S+ GDLF+ S E + +EWLNSKP SSVVY+S GS+      Q +E+
Subjt:  QAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSE-DYIEWLNSKPDSSVVYISVGSICVLSDEQEDEL

Query:  VHALL---------------NSGHTQEK------LKKLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDV
           LL               + G  +E+      + +L++ GK+VSWC Q+EVL HPALGCF+THCGWNS +ESL  G+PVVA PQ  DQ TNAKLIED 
Subjt:  VHALL---------------NSGHTQEK------LKKLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDV

Query:  WKTGVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIG
        W TGVRV+ N  G V+  EI RC+++VM   D  K  +  NA KWK LARQA+G
Subjt:  WKTGVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIG

Q9ZR27 Anthocyanidin 3-O-glucoside 5-O-glucosyltransferase 11.2e-10848.28Show/hide
Query:  MRNHHFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNKTHQTA-TTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLIT
        M     LL  FP+QG+INP+LQ A+RL+  G +VTF T+V A RRM N     A   PGL F  FSDG+DD  LKP GD   Y SE++  GSE+L +L+ 
Subjt:  MRNHHFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNKTHQTA-TTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLIT

Query:  FAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMRE
                +TFV+YS L +WAA++A    +PSAL + +PATVL +Y++YF+GY D    ++   S  I+LP LP L  + +P+F  P  P  F L  M+E
Subjt:  FAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMRE

Query:  QAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSE-DYIEWLNSKPDSSVVYISVGSICVLSDEQEDEL
        + E L  + + KVLVNTFDALE DAL AID+ +++ IGPLIPSA L G D S+ S+ GDLF+ S E + +EWL++KP SSVVY+S GS+      Q +E+
Subjt:  QAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSE-DYIEWLNSKPDSSVVYISVGSICVLSDEQEDEL

Query:  VHALL---------------NSGHTQEK----LKKLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDVWK
           LL               + G  +E+    + +L++ GK+VSWC Q+EVL HPALGCF+THCGWNS +ESL  G+PVVA PQ  DQ TNAKLIED W 
Subjt:  VHALL---------------NSGHTQEK----LKKLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDVWK

Query:  TGVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIER-NAKKWKELARQAIGEETSMESKLGA
        TGVRV+ N  G V+  EI RC+++VM    G+K ++ R NA KWK LAR+A+GE+ S    L A
Subjt:  TGVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIER-NAKKWKELARQAIGEETSMESKLGA

Arabidopsis top hitse value%identityAlignment
AT1G05530.1 UDP-glucosyl transferase 75B21.2e-9041.28Show/hide
Query:  MRNHHFLLVCFPSQGYINPSLQLARRLI-SLGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLIT
        M   HFLLV FP+QG++NPSL+ ARRLI + G  VTFAT +    R     H       LSF TFSDGFDD  +    D+ +      R G ++L+  I 
Subjt:  MRNHHFLLVCFPSQGYINPSLQLARRLI-SLGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLIT

Query:  FAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMRE
           N   P++ +IY++L +W   +A  + +PS   + QPA    +Y+ Y  G            +S  E P LP L  +D+PSF SPS  +       +E
Subjt:  FAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMRE

Query:  QAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQ-VSSEDYIEWLNSKPDSSVVYISVGSICVLSDEQEDEL
          +FL  +S PK+LVNTFD+LE + L AI  ++M+A+GPL+P+ +  G++S       DL +   S  Y  WL+SK +SSV+Y+S G++  LS +Q +EL
Subjt:  QAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQ-VSSEDYIEWLNSKPDSSVVYISVGSICVLSDEQEDEL

Query:  VHALLNSGH-----TQEKLKK---------------------LEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAK
          AL+  G        +KL +                     LEE G +VSWC Q+EVL+H A+GCFLTHCGW+S++ESLV G+PVVAFP   DQ  NAK
Subjt:  VHALLNSGH-----TQEKLKK---------------------LEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAK

Query:  LIEDVWKTGVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGEETSMESKLGA
        L+E++WKTGVRV+ N+EG+VER EI RCL+ VM   + +  E+  NA+KWK LA +A  E  S +  + A
Subjt:  LIEDVWKTGVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGEETSMESKLGA

AT1G05560.1 UDP-glucosyltransferase 75B11.1e-9141.72Show/hide
Query:  HFLLVCFPSQGYINPSLQLARRLIS-LGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLITFAAN
        HFLLV FP+QG++NPSL+ ARRLI   G  VTF T V          H       LSF TFSDGFDD  +    D       L+  G ++L+  I    N
Subjt:  HFLLVCFPSQGYINPSLQLARRLIS-LGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLITFAAN

Query:  EGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMREQAEF
           P+T +IY++LL+WA  +A  + +PSAL + QPA V  +Y+ +F G             S  ELP L  L  +D+PSF +PS  +       +E  EF
Subjt:  EGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMREQAEF

Query:  LGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIEWLNSKPDSSVVYISVGSICVLSDEQEDELVHALL
        L ++++PK+L+NTFD+LE +AL A   + M+A+GPL+P+ +  G+ +          +  S  Y  WL+SK +SSV+Y+S G++  LS +Q +EL  AL+
Subjt:  LGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIEWLNSKPDSSVVYISVGSICVLSDEQEDELVHALL

Query:  -------------------NSGHTQEKLKK-------LEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDV
                             G  + +++K       LEE G +VSWC Q+EVL H A+GCF+THCGW+ST+ESLV G+PVVAFP   DQ TNAKL+E+ 
Subjt:  -------------------NSGHTQEKLKK-------LEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDV

Query:  WKTGVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGEETSMESKLGA
        WKTGVRV+ N +G+VER EIRRCL+ VM   + +  E+  NAKKWK LA +A  E  S +  + A
Subjt:  WKTGVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGEETSMESKLGA

AT1G05675.1 UDP-Glycosyltransferase superfamily protein3.6e-7136.18Show/hide
Query:  MMRNHHFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLIT
        M    H +++ FP+QG+I P  Q  +RL S  + +T               H T T         S+GF  E  + + DL  Y   +       L  LI 
Subjt:  MMRNHHFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLIT

Query:  FAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMRE
             G P   ++Y   + W  D+A +Y +  A+FF QP  V A+Y++ F G   ++ S     S+    P LP+L + D+PSF   S  + +IL  + +
Subjt:  FAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMRE

Query:  QAEFLGRQSQPKVLVNTFDALEADALRAIDKL-KMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIEWLNSKPDSSVVYISVGSICVLSDEQEDEL
        Q   + R     VL NTFD LE   L+ I  +  +L IGP +PS  L    + D ++   LF     + +EWLNSK  SSVVY+S GS+ VL  +Q  EL
Subjt:  QAEFLGRQSQPKVLVNTFDALEADALRAIDKL-KMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIEWLNSKPDSSVVYISVGSICVLSDEQEDEL

Query:  VHALLNSGH--------------TQEKLKKLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDVWKTGVRV
           L  SGH               +  ++++ E+G  VSW  Q+EVL H ++GCF+THCGWNST+E L  G+P++  P   DQ TNAK +EDVWK GVRV
Subjt:  VHALLNSGH--------------TQEKLKKLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDVWKTGVRV

Query:  KANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGEETSMESKL
        KA+++G V REE  R ++ VM +  G  +EI +NA+KWK LA++A+ E  S +  +
Subjt:  KANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGEETSMESKL

AT4G14090.1 UDP-Glycosyltransferase superfamily protein1.0e-9746.68Show/hide
Query:  RNHHFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLITF-
        R  H+LLV FP+QG+INP+LQLA RLI  G  VT++T V A RRM     +  +T GLSFA F+DGFDD  LK   D   Y SEL+RCGS +L  +I   
Subjt:  RNHHFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLITF-

Query:  --AANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMR
          A  E  PIT VIYS+L+ W + +A  + +P+ L + +PATVL +Y+YYF    +T    L D    I+LP LPL+T+ D+PSF  PS      L  +R
Subjt:  --AANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMR

Query:  EQAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIEWLNSKPDSSVVYISVGSIC-VLSDEQEDE
        E  E L  +S PK+LVNTF ALE DAL +++KLKM+ IGPL+          S +    DLF+ S EDY +WL+SK + SV+YIS+G+    L ++  + 
Subjt:  EQAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIEWLNSKPDSSVVYISVGSIC-VLSDEQEDE

Query:  LVHALLNSG----------HTQEKLK-------KLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDVWKT
        L H +L +           + +EK K       +  ++G +V WC Q  VL H A+GCF+THCGWNST+ESL SG+PVVAFPQ  DQ T AKL+ED W+ 
Subjt:  LVHALLNSG----------HTQEKLK-------KLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDVWKT

Query:  GVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGE
        GV+VK   EG V+ EEIRRCL+ VM   + + EE+  NA+KWK +A  A  E
Subjt:  GVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGE

AT4G15550.1 indole-3-acetate beta-D-glucosyltransferase2.9e-9744.75Show/hide
Query:  HFLLVCFPSQGYINPSLQLARRLISL--GVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDD------ETLKPNGDLT-HYFSELRRCGSESLT
        HFL V FP+QG+INPSL+LA+RL     G  VTFA ++ A  R    T     T  L FAT+SDG DD       + K   D T ++ SE+RR G E+LT
Subjt:  HFLLVCFPSQGYINPSLQLARRLISL--GVNVTFATTVLAGRRMKNKTHQTATTPGLSFATFSDGFDD------ETLKPNGDLT-HYFSELRRCGSESLT

Query:  HLITFAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILP
         LI     + RP T V+Y++LL+W A++A  + +PSAL + QP TV +++++YF+GY D I      PSS I+LP LPLLT +D+PSF   S  +AF+LP
Subjt:  HLITFAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSKLQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILP

Query:  PMREQAEFLGRQSQPKVLVNTFDALEADALRAI-DKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIEWLNSKPDSSVVYISVGSICVLSDEQ
          REQ + L  +  PK+L+NTF  LE +A+ ++ D  K++ +GPL+          +D S        S  +YIEWL++K DSSV+Y+S G++ VLS +Q
Subjt:  PMREQAEFLGRQSQPKVLVNTFDALEADALRAI-DKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIEWLNSKPDSSVVYISVGSICVLSDEQ

Query:  EDELVHALL----------------NSGHTQEK--------LKKLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATN
          EL  AL+                N    QEK         ++L+E G +VSWC Q  VL H ++GCF+THCGWNST+ESLVSG+PVVAFPQ  DQ  N
Subjt:  EDELVHALL----------------NSGHTQEK--------LKKLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATN

Query:  AKLIEDVWKTGVRV--KANTEG--IVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGEETSMESKLGA
        AKL+ED WKTGVRV  K   EG  +V+ EEIRRC++ VM   + + EE   NA +WK+LA +A+ E  S  + L A
Subjt:  AKLIEDVWKTGVRV--KANTEG--IVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGEETSMESKLGA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACAACAGAGATGGAGGAGAAAAGAAAGAGCTAAAGGAAAATGTCAAAAAGGGCAAAGAATTGGGTAGACAAGCTATTGGTGAAGGTGGGTCTTCGTACTTGAA
TCTTAAAACCTTTGTTAAGGAGGTAGATGAAGAAGGGGTAGCAGCTGCTTCTGAAATCAGTGCTTGGGATATGATGAGAAACCACCATTTCCTTCTGGTATGTTTCCCTT
CTCAAGGCTATATAAACCCTTCCCTTCAACTCGCCAGGCGACTGATAAGCCTCGGCGTTAATGTCACCTTCGCCACCACCGTCCTCGCCGGCCGCCGCATGAAGAACAAA
ACCCACCAAACTGCAACAACACCAGGCTTGTCTTTTGCTACTTTCTCCGATGGCTTCGATGACGAAACCCTCAAACCCAACGGCGACTTGACCCACTACTTCTCGGAGCT
CAGGCGCTGCGGCTCTGAATCTCTAACCCATCTCATTACTTTTGCAGCAAACGAAGGTCGTCCAATTACCTTCGTAATCTATAGCCTCCTGCTCTCTTGGGCGGCTGATA
TTGCCACCACATATGACATCCCGTCAGCACTTTTTTTTGCTCAGCCTGCGACGGTTTTGGCTTTGTACTTCTATTACTTCCATGGTTATGGTGATACCATTTGCAGCAAA
CTCCAAGACCCATCTTCGTACATAGAATTACCAGGTTTGCCGTTGCTCACTAGTCAGGACATGCCCTCTTTTTTCTCCCCTTCCGGCCCCCATGCTTTCATTCTCCCTCC
AATGAGAGAGCAGGCTGAATTCCTCGGCCGACAAAGCCAACCAAAAGTACTAGTGAACACCTTCGACGCGTTAGAGGCAGACGCCTTGAGAGCCATTGATAAGTTGAAGA
TGTTGGCGATTGGACCCTTGATTCCATCTGCTTTACTGGGTGGAAACGATTCCTCTGATGCATCATTTTGTGGTGATCTTTTTCAAGTCTCGTCAGAGGATTATATAGAA
TGGTTGAACTCCAAGCCTGACTCGTCGGTCGTTTACATATCAGTTGGATCCATCTGCGTGCTGTCTGATGAACAAGAGGACGAGCTTGTGCATGCTTTATTAAACAGTGG
CCACACACAGGAGAAGTTGAAGAAGCTGGAAGAGCAAGGGAAAATGGTGTCGTGGTGCCGTCAAGTTGAAGTGTTGAAACACCCTGCGTTGGGTTGTTTTCTGACGCACT
GTGGGTGGAACTCGACTATTGAAAGCTTGGTTTCAGGGCTGCCGGTGGTTGCTTTTCCGCAGCAGATAGATCAAGCCACCAACGCGAAGCTCATAGAGGACGTGTGGAAG
ACGGGAGTGAGGGTGAAGGCCAATACAGAAGGAATTGTGGAGAGGGAAGAAATCAGGAGGTGCTTGGATTTGGTGATGGGGAGCAGAGATGGGCAAAAGGAAGAGATAGA
GAGAAATGCCAAAAAGTGGAAAGAATTGGCTAGACAGGCCATCGGTGAAGAAACGTCCATGGAATCAAAGCTTGGTGCTAAGACTTCCAAACGCTCCTCCACCATCCATC
AACCTCAGTCCCGGTTCCATGGGACTTCGTTTAAAGATCTACAAGCTGGACTTCTCCACTCACCCCGCAAAAAACCTAAAAGGCAGCTGCAGCCGGATGAATTTCCTTCG
AAAGACGGGGATGGGGTTTCGCATTTTATAGCAAAGGATCTACGCATCAAGCGAGTCTTTTCCCCCAATTTACCTGATGAGAATGGCTCCCCGGTGCCATCGGGGGAGCA
GATTAGCGACAAAGAAGGGCCGATTATCGCGAATGGAAGTTGGCCGATTGAAATGAAGGCGTCAGAAATAGCTCCGAGGCTGCAGAGGTACGGAAAGTGGATGTTTGCAA
TTCTGGATGGATGTCATCCTTCAAGTAATGGATGTGTTGAAGGGGATACTTATGCGAACACTGACGGTAGAATTGGCTCTCTTTCTAGAACCGGATCGGTGCTCGAACCT
TGTATTAAACGGAAGCTATTCAAAGCACCTGGCTCCATTGCCTACAAAAGATTGCTGCCTTATTTGCTGGATACCGATAATTACACCCTACAAGGTTATCCACACTCTAA
ACGTGAAAAGAATTTGGAGAAGAAGGCAAACATTGAATCTAATCTATGTAATCGTGCCAAGGAATCATCTTCTGTTGATTCAGATAATTGCGTAAAGAATGCAGCTTTTC
CCTCCAGCATGTCATGTAAGGCTATGAAGCTAAATTTACCACCTCCAGATAAAATTATCTCCTTTAGCAAGCCATCAGTGAATGGAGAAACTGAAATTTTTCGAAATGGC
AATCCTTCTGGAAATCAGAAGTGCAGTGATTCAAATAATGACCCGACTATAGTGAAAGAGGGTTCTGGTTTGAAAACAGATAATGTAGTTTGTGCCTCTTCTCATGACAA
AAGGCCGACAGAGAATGGAGGGCCCACAAAGCATCAGACAGAGCATCTTTCTTATAAAGAACAATCCAAAACCTCTGGATTGGAGAGGCTTGATGTGAAGGAAGAAAGGA
GAGATGAAAAAGTGGAATGTACTCAAGGTGCAGATCAAAATCTTGGTAGTTCCACTGTTGGTGAAAATCAAAGCAACATTTGTACAGTGAGTGACAAGAAACATGGCTCC
TATGTTAGAAACAAAGTGGTTCGCAATCAACTTGTACGACTGAAGTTATTCAACAGCCAAGCTTCAGTCAACTATCGAAGGATGCTTCCATTCCTTGAGAATCTTTTCAA
AGATAATTCCGGGAACTGCGCCTTGGGAAACATCAACTGTCCAAGACTAGAGAAAGAATTGGCAACAACGAATTTGCAACCACTGAGTTCGAATTCTCACAATGCCCTGG
AAAAATCAGTAGGTTTGGTATCTTGCAAAAAGCCATGCGATGGAAGTTCAGATACTCTTTCAATGACTGAGTCGAACACTATGAATGAAAAGGTTTGTGAAACAGAAAAT
TTTTTAATGCTTGCTGAAGTCAATGATAAACTTCTATCACCACCAAAATTACAGATGCATCAGCTGCACTCTGAACAGGAGTTGGATGAATGTAAGTTGGAAGTGGACCC
TCAATTGCATGGTGCAGCACTTTTATTAAATGATCAAGCAGTCTCCCCATTGTATCCTGCTGCCACTTATGAGCCTGTGACACGAGAAGGATCCAGAATGATCACTAAAC
AATCACCAATTACTTCAGAAGACTGCACTAGTTTGACAGATAATGTTTCCGATGGTGCCAAACTCTCTGAGAGAAACAGCTTAGGACCTAATTCTTCATGCGTAGAAGCA
TGTATTCTGCCAGAAAATCCTGCTAATGTTAGAAAGGGAATTCTTAAACGAAACCCACGAGGATGCCGAGGAATCTGCACTTGTCTGAACTGTTCCTCTTTTCGTCTCCA
TGCCGAAAGAGCATTCGAATTTTCTAGGAATCAACTGCAAGATGCTGAAGAAGTTACTTCAGATTTGATGAAAGAATTGTCATTTCTTCGTGATGTGCTACAAAAATTTT
CTGATGGTGTAAAAGACAACGCTGGATATCATTCAAATAAGGTGAAAGAAGCTTGTAGGAAAGCATCTGAAGCAGAGTCGATTGCAAAAGGCCGGCTTCTACAGATGAAC
TATGAACTTGACATTCATTGCAGAATCACGTGCTCCCAACGACCAAATGTTAGATTTTCTGGTGACGTTGAGAATATAGAGATTGAAGATGGAAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACAACAGAGATGGAGGAGAAAAGAAAGAGCTAAAGGAAAATGTCAAAAAGGGCAAAGAATTGGGTAGACAAGCTATTGGTGAAGGTGGGTCTTCGTACTTGAA
TCTTAAAACCTTTGTTAAGGAGGTAGATGAAGAAGGGGTAGCAGCTGCTTCTGAAATCAGTGCTTGGGATATGATGAGAAACCACCATTTCCTTCTGGTATGTTTCCCTT
CTCAAGGCTATATAAACCCTTCCCTTCAACTCGCCAGGCGACTGATAAGCCTCGGCGTTAATGTCACCTTCGCCACCACCGTCCTCGCCGGCCGCCGCATGAAGAACAAA
ACCCACCAAACTGCAACAACACCAGGCTTGTCTTTTGCTACTTTCTCCGATGGCTTCGATGACGAAACCCTCAAACCCAACGGCGACTTGACCCACTACTTCTCGGAGCT
CAGGCGCTGCGGCTCTGAATCTCTAACCCATCTCATTACTTTTGCAGCAAACGAAGGTCGTCCAATTACCTTCGTAATCTATAGCCTCCTGCTCTCTTGGGCGGCTGATA
TTGCCACCACATATGACATCCCGTCAGCACTTTTTTTTGCTCAGCCTGCGACGGTTTTGGCTTTGTACTTCTATTACTTCCATGGTTATGGTGATACCATTTGCAGCAAA
CTCCAAGACCCATCTTCGTACATAGAATTACCAGGTTTGCCGTTGCTCACTAGTCAGGACATGCCCTCTTTTTTCTCCCCTTCCGGCCCCCATGCTTTCATTCTCCCTCC
AATGAGAGAGCAGGCTGAATTCCTCGGCCGACAAAGCCAACCAAAAGTACTAGTGAACACCTTCGACGCGTTAGAGGCAGACGCCTTGAGAGCCATTGATAAGTTGAAGA
TGTTGGCGATTGGACCCTTGATTCCATCTGCTTTACTGGGTGGAAACGATTCCTCTGATGCATCATTTTGTGGTGATCTTTTTCAAGTCTCGTCAGAGGATTATATAGAA
TGGTTGAACTCCAAGCCTGACTCGTCGGTCGTTTACATATCAGTTGGATCCATCTGCGTGCTGTCTGATGAACAAGAGGACGAGCTTGTGCATGCTTTATTAAACAGTGG
CCACACACAGGAGAAGTTGAAGAAGCTGGAAGAGCAAGGGAAAATGGTGTCGTGGTGCCGTCAAGTTGAAGTGTTGAAACACCCTGCGTTGGGTTGTTTTCTGACGCACT
GTGGGTGGAACTCGACTATTGAAAGCTTGGTTTCAGGGCTGCCGGTGGTTGCTTTTCCGCAGCAGATAGATCAAGCCACCAACGCGAAGCTCATAGAGGACGTGTGGAAG
ACGGGAGTGAGGGTGAAGGCCAATACAGAAGGAATTGTGGAGAGGGAAGAAATCAGGAGGTGCTTGGATTTGGTGATGGGGAGCAGAGATGGGCAAAAGGAAGAGATAGA
GAGAAATGCCAAAAAGTGGAAAGAATTGGCTAGACAGGCCATCGGTGAAGAAACGTCCATGGAATCAAAGCTTGGTGCTAAGACTTCCAAACGCTCCTCCACCATCCATC
AACCTCAGTCCCGGTTCCATGGGACTTCGTTTAAAGATCTACAAGCTGGACTTCTCCACTCACCCCGCAAAAAACCTAAAAGGCAGCTGCAGCCGGATGAATTTCCTTCG
AAAGACGGGGATGGGGTTTCGCATTTTATAGCAAAGGATCTACGCATCAAGCGAGTCTTTTCCCCCAATTTACCTGATGAGAATGGCTCCCCGGTGCCATCGGGGGAGCA
GATTAGCGACAAAGAAGGGCCGATTATCGCGAATGGAAGTTGGCCGATTGAAATGAAGGCGTCAGAAATAGCTCCGAGGCTGCAGAGGTACGGAAAGTGGATGTTTGCAA
TTCTGGATGGATGTCATCCTTCAAGTAATGGATGTGTTGAAGGGGATACTTATGCGAACACTGACGGTAGAATTGGCTCTCTTTCTAGAACCGGATCGGTGCTCGAACCT
TGTATTAAACGGAAGCTATTCAAAGCACCTGGCTCCATTGCCTACAAAAGATTGCTGCCTTATTTGCTGGATACCGATAATTACACCCTACAAGGTTATCCACACTCTAA
ACGTGAAAAGAATTTGGAGAAGAAGGCAAACATTGAATCTAATCTATGTAATCGTGCCAAGGAATCATCTTCTGTTGATTCAGATAATTGCGTAAAGAATGCAGCTTTTC
CCTCCAGCATGTCATGTAAGGCTATGAAGCTAAATTTACCACCTCCAGATAAAATTATCTCCTTTAGCAAGCCATCAGTGAATGGAGAAACTGAAATTTTTCGAAATGGC
AATCCTTCTGGAAATCAGAAGTGCAGTGATTCAAATAATGACCCGACTATAGTGAAAGAGGGTTCTGGTTTGAAAACAGATAATGTAGTTTGTGCCTCTTCTCATGACAA
AAGGCCGACAGAGAATGGAGGGCCCACAAAGCATCAGACAGAGCATCTTTCTTATAAAGAACAATCCAAAACCTCTGGATTGGAGAGGCTTGATGTGAAGGAAGAAAGGA
GAGATGAAAAAGTGGAATGTACTCAAGGTGCAGATCAAAATCTTGGTAGTTCCACTGTTGGTGAAAATCAAAGCAACATTTGTACAGTGAGTGACAAGAAACATGGCTCC
TATGTTAGAAACAAAGTGGTTCGCAATCAACTTGTACGACTGAAGTTATTCAACAGCCAAGCTTCAGTCAACTATCGAAGGATGCTTCCATTCCTTGAGAATCTTTTCAA
AGATAATTCCGGGAACTGCGCCTTGGGAAACATCAACTGTCCAAGACTAGAGAAAGAATTGGCAACAACGAATTTGCAACCACTGAGTTCGAATTCTCACAATGCCCTGG
AAAAATCAGTAGGTTTGGTATCTTGCAAAAAGCCATGCGATGGAAGTTCAGATACTCTTTCAATGACTGAGTCGAACACTATGAATGAAAAGGTTTGTGAAACAGAAAAT
TTTTTAATGCTTGCTGAAGTCAATGATAAACTTCTATCACCACCAAAATTACAGATGCATCAGCTGCACTCTGAACAGGAGTTGGATGAATGTAAGTTGGAAGTGGACCC
TCAATTGCATGGTGCAGCACTTTTATTAAATGATCAAGCAGTCTCCCCATTGTATCCTGCTGCCACTTATGAGCCTGTGACACGAGAAGGATCCAGAATGATCACTAAAC
AATCACCAATTACTTCAGAAGACTGCACTAGTTTGACAGATAATGTTTCCGATGGTGCCAAACTCTCTGAGAGAAACAGCTTAGGACCTAATTCTTCATGCGTAGAAGCA
TGTATTCTGCCAGAAAATCCTGCTAATGTTAGAAAGGGAATTCTTAAACGAAACCCACGAGGATGCCGAGGAATCTGCACTTGTCTGAACTGTTCCTCTTTTCGTCTCCA
TGCCGAAAGAGCATTCGAATTTTCTAGGAATCAACTGCAAGATGCTGAAGAAGTTACTTCAGATTTGATGAAAGAATTGTCATTTCTTCGTGATGTGCTACAAAAATTTT
CTGATGGTGTAAAAGACAACGCTGGATATCATTCAAATAAGGTGAAAGAAGCTTGTAGGAAAGCATCTGAAGCAGAGTCGATTGCAAAAGGCCGGCTTCTACAGATGAAC
TATGAACTTGACATTCATTGCAGAATCACGTGCTCCCAACGACCAAATGTTAGATTTTCTGGTGACGTTGAGAATATAGAGATTGAAGATGGAAAATAG
Protein sequenceShow/hide protein sequence
MGDNRDGGEKKELKENVKKGKELGRQAIGEGGSSYLNLKTFVKEVDEEGVAAASEISAWDMMRNHHFLLVCFPSQGYINPSLQLARRLISLGVNVTFATTVLAGRRMKNK
THQTATTPGLSFATFSDGFDDETLKPNGDLTHYFSELRRCGSESLTHLITFAANEGRPITFVIYSLLLSWAADIATTYDIPSALFFAQPATVLALYFYYFHGYGDTICSK
LQDPSSYIELPGLPLLTSQDMPSFFSPSGPHAFILPPMREQAEFLGRQSQPKVLVNTFDALEADALRAIDKLKMLAIGPLIPSALLGGNDSSDASFCGDLFQVSSEDYIE
WLNSKPDSSVVYISVGSICVLSDEQEDELVHALLNSGHTQEKLKKLEEQGKMVSWCRQVEVLKHPALGCFLTHCGWNSTIESLVSGLPVVAFPQQIDQATNAKLIEDVWK
TGVRVKANTEGIVEREEIRRCLDLVMGSRDGQKEEIERNAKKWKELARQAIGEETSMESKLGAKTSKRSSTIHQPQSRFHGTSFKDLQAGLLHSPRKKPKRQLQPDEFPS
KDGDGVSHFIAKDLRIKRVFSPNLPDENGSPVPSGEQISDKEGPIIANGSWPIEMKASEIAPRLQRYGKWMFAILDGCHPSSNGCVEGDTYANTDGRIGSLSRTGSVLEP
CIKRKLFKAPGSIAYKRLLPYLLDTDNYTLQGYPHSKREKNLEKKANIESNLCNRAKESSSVDSDNCVKNAAFPSSMSCKAMKLNLPPPDKIISFSKPSVNGETEIFRNG
NPSGNQKCSDSNNDPTIVKEGSGLKTDNVVCASSHDKRPTENGGPTKHQTEHLSYKEQSKTSGLERLDVKEERRDEKVECTQGADQNLGSSTVGENQSNICTVSDKKHGS
YVRNKVVRNQLVRLKLFNSQASVNYRRMLPFLENLFKDNSGNCALGNINCPRLEKELATTNLQPLSSNSHNALEKSVGLVSCKKPCDGSSDTLSMTESNTMNEKVCETEN
FLMLAEVNDKLLSPPKLQMHQLHSEQELDECKLEVDPQLHGAALLLNDQAVSPLYPAATYEPVTREGSRMITKQSPITSEDCTSLTDNVSDGAKLSERNSLGPNSSCVEA
CILPENPANVRKGILKRNPRGCRGICTCLNCSSFRLHAERAFEFSRNQLQDAEEVTSDLMKELSFLRDVLQKFSDGVKDNAGYHSNKVKEACRKASEAESIAKGRLLQMN
YELDIHCRITCSQRPNVRFSGDVENIEIEDGK