; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G07730 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G07730
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionGlycos_transf_1 domain-containing protein
Genome locationClcChr08:19301441..19311186
RNA-Seq ExpressionClc08G07730
SyntenyClc08G07730
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR001296 - Glycosyl transferase, family 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032959.1 hypothetical protein SDJN02_07010 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0089.34Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSS+EIDDNGSGNAVP  HSIRDRFPFKRNSSHFRLRAKDSLDHA  RSRSHQSRINRKGLLWW+PARGQT FYFVVVFAVF FV+GSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        MSS GSE+ RWLMERIKFGSSLKF PGRISRRLVEG GLDE+RK DR+GVRAPRLALILGSME++PQSLMLITVMKNIQKLGYV E            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGN+HSMW+QIGGQPS+LSP HYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCS+PLIWI+REDTLANRLPMYEQRGWKHLISHWK SFRRAN+VV
Subjt:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDF+LPMLYS LDNGNF+VIPGSPADVYAAE+YKNVHSKSQLREKNGF+E+DILV+VVGSLFFPNELSWDYAVAMHSIGPLL+ YA R+EV GSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSH AL+EI SRLGLPD SITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI
        L SFS+MISDGKLSRF+QAIASSG+LLAKNILASECV  YA+LLENVLNFPSDVKLPGSVSQLQLGAWEWNLFR+E V+TI +  D EERIAA SKSSVI
Subjt:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
        FALE Q+TN VNLTN SETENG LEQDIPT  DWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ VSIY
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIY+G+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYY DTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL  KAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
        IRDN KGDVIYFWAHLQVNR    GS   TFWSVCDILNGGLCRT FE+TFREMFGLSSNMEALPPMP+DGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
Subjt:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY

Query:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG
        LDA N+N SQP GCL+ASSELEKKHCYCRILE+LVNVWAYHSGRRMVYI+P SGFLEEQHPVEQR+EFMWAKYFN TLLKSMDEDLAEAADDEGGS +MG
Subjt:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG

Query:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSL G
Subjt:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

XP_004138684.1 uncharacterized protein LOC101206364 isoform X1 [Cucumis sativus]0.0e+0092.45Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSSSEIDDN S NAV GTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        +SS GS++ERWLMERIKFGSSLKFVPGRIS+RLVEGDGL+E+RK DR+GVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFE            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVE GNK SMWEQI GQPS+LSPGHYG VDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWI+REDTLA+RLPMYEQRGWKHLISHWKRSFRRANVVV
Subjt:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDFALPMLYS LDNGNFHVIPGSPADVYAAE Y NVHSKSQLREKNGF+E+DILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR+EVEGSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSHDALKEI SRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP L+NYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI
        LSSFSQMISDGKLSRFAQ+IASSGRLLAKNILASECV GYAQLLENVLNFPSDVKLPG VSQLQLGAWEWNLFRKEMVKTIDENAD+EERIA ISK+SVI
Subjt:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
        FALE QLTNSVNLT LSE ENG LEQDIPT QDWDILE+IE+AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFE+NERDEGELERTGQTVSIY
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLL+DSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSW+ASGRKVSL KKAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH
        I+DNPKGDVIYFWAHLQVNRG+ P TFWSVCDILNGGLCRTTF STFREMFGLSSNM ALPPMPEDGG WSALHSWVMPTPSFLEFIMFSRMFTHYLDA 
Subjt:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH

Query:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL
        N+NQSQPNGCLLASSE+EKKHCYCRILEMLVNVWAYHSGRRMVYINP SGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGK+GLWPL
Subjt:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL

Query:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

XP_008456559.1 PREDICTED: uncharacterized protein LOC103496475 isoform X1 [Cucumis melo]0.0e+0093.03Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSSSEIDDN S NAVPGTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        +SS GS++ERWLMERIKFGSSLKFVPGRISRRLVEGDGL+E+RK DR+GVRAPRLALILGSMENDPQSLMLITVMKN+QKLGYVFE            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGNK SMWEQI GQPS+LSPGHYG VDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWI+REDTLA+RLPMYEQRGWKHLISHWKRSFRRANVVV
Subjt:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDFALPMLYS LDNGNFHVIPGSPADVYAAE+Y NVHSKSQLREKNGF+ +DILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR+EVEGSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSHDALKEI SRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI
        LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECV GY QLLENVLNFPSDVKLPG  SQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISK+SVI
Subjt:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
        FALE QLTNSVNLT LSE ENG LEQDIPT QDWDILEEIE+AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL KKAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH
        IRDNP+GDVIYFWAHLQVNRG+ P TFWSVCDILNGGLCRTTF STFREMFGLSSNM ALPPMPEDGG WSALHSWVMPTPSFLEFIMFSRMFTHYLDA 
Subjt:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH

Query:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL
        N+NQSQPNGCL A SE+EKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGK+GLWPL
Subjt:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL

Query:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

XP_022958089.1 uncharacterized protein LOC111459418 isoform X1 [Cucurbita moschata]0.0e+0089.44Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSS+EIDDNGSGNAVP  HSIRDRFPFKRNSSHFRLRAKDSLDHA  RSRSHQSRINRKGLLWW+PARGQT FYFVVVFAVF FV+GSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        MSS GSE+ RWLMERIKFGSSLKF PGRISRRLVEG GLDE+RK DR+GVRAPRLALILGSME++PQSLMLITVMKNIQKLGYV E            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGN+HSMW+QIGGQPS+LSP HYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCS+PLIWI+REDTLANRLPMYEQRGWKHLISHWK SFRRAN+VV
Subjt:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDF+LPMLYS LDNGNF+VIPGSPADVYAAE+YKNVHSKSQLREKNGF+E+DILV+VVGSLFFPNELSWDYAVAMHSIGPLL+ YA R+EV GSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSH AL+EI SRLGLPD SITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI
        L SFS+MISDGKLSRF+QAIASSG+LLAKNILASECV  YA+LLENVLNFPSDVKLPGSVSQLQLGAWEWNLFR+E V+TI +  D EERIAA SKSSVI
Subjt:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
        FALE Q+TN VNLTN SETENG LEQDIPT  DWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ VSIY
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIY+G+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYY DTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL  KAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
        IRDN KGDVIYFWAHLQVNR    GS   TFWSVCDILNGGLCRT FE+TFREMFGLSSNMEALPPMP+DGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
Subjt:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY

Query:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG
        LDA N+NQSQP GCL+ASSELEKKHCYCRILE+LVNVWAYHSGRRMVYI+P SGFLEEQHPVEQR+EFMWAKYFN TLLKSMDEDLAEAADDEGGS +MG
Subjt:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG

Query:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSL G
Subjt:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

XP_038884759.1 uncharacterized protein LOC120075439 isoform X1 [Benincasa hispida]0.0e+0092.84Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYF+VVFAVFGFVTGSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        MSS GSE+ERWLMERIKFGSSLKFVPG ISR+LVEGDGLDE+RK DR+GVR+PRLALILGSMENDPQSLMLITVMKNIQKLGY+ E            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGNKHS+WEQIGGQPS+LSP HYG VDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWI+REDTLANRLP+YEQRGWKHLISHWK SFRRANVVV
Subjt:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDFALPMLYSTLD+GNFHVIPGSPADVYAAE+YKN HSKSQLREKNGF E+DILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEV GSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSHDALKEI SRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI
        LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECV GYAQLLENVLNFP DVKLP S SQLQLGAWEWNLFRKEMVK IDE ADDEERIAA +K+SVI
Subjt:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
        FALE QLTNSVNLT LSE ENG LE DIPTSQDWD+LEEIENAEEYETVEMEEFQERMERDLGAWD+IYRNARKSEKLKFEANERDEGELERTGQTVSIY
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPFMHHGSLYRGLSLST+ALRL+SDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAEN LED 
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH
        IRDNPKGDVIYFWAHLQVNRG  P TFWSVCDILNGGLCRTTF+STFR+M+GLSSNM ALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDA 
Subjt:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH

Query:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL
        N+NQS PNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRR+VYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEA DDEG SGK GLWPL
Subjt:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL

Query:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        TGEVHWQGIYEREREERYRVKMDKKRTTKVKL ERMKFGYKQKSLGG
Subjt:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

TrEMBL top hitse value%identityAlignment
A0A0A0LMB5 Glycos_transf_1 domain-containing protein0.0e+0092.45Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSSSEIDDN S NAV GTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        +SS GS++ERWLMERIKFGSSLKFVPGRIS+RLVEGDGL+E+RK DR+GVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFE            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVE GNK SMWEQI GQPS+LSPGHYG VDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWI+REDTLA+RLPMYEQRGWKHLISHWKRSFRRANVVV
Subjt:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDFALPMLYS LDNGNFHVIPGSPADVYAAE Y NVHSKSQLREKNGF+E+DILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR+EVEGSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSHDALKEI SRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP L+NYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI
        LSSFSQMISDGKLSRFAQ+IASSGRLLAKNILASECV GYAQLLENVLNFPSDVKLPG VSQLQLGAWEWNLFRKEMVKTIDENAD+EERIA ISK+SVI
Subjt:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
        FALE QLTNSVNLT LSE ENG LEQDIPT QDWDILE+IE+AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFE+NERDEGELERTGQTVSIY
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLL+DSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSW+ASGRKVSL KKAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH
        I+DNPKGDVIYFWAHLQVNRG+ P TFWSVCDILNGGLCRTTF STFREMFGLSSNM ALPPMPEDGG WSALHSWVMPTPSFLEFIMFSRMFTHYLDA 
Subjt:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH

Query:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL
        N+NQSQPNGCLLASSE+EKKHCYCRILEMLVNVWAYHSGRRMVYINP SGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGK+GLWPL
Subjt:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL

Query:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

A0A1S3C3I4 uncharacterized protein LOC103496475 isoform X10.0e+0093.03Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSSSEIDDN S NAVPGTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        +SS GS++ERWLMERIKFGSSLKFVPGRISRRLVEGDGL+E+RK DR+GVRAPRLALILGSMENDPQSLMLITVMKN+QKLGYVFE            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGNK SMWEQI GQPS+LSPGHYG VDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWI+REDTLA+RLPMYEQRGWKHLISHWKRSFRRANVVV
Subjt:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDFALPMLYS LDNGNFHVIPGSPADVYAAE+Y NVHSKSQLREKNGF+ +DILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR+EVEGSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSHDALKEI SRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI
        LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECV GY QLLENVLNFPSDVKLPG  SQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISK+SVI
Subjt:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
        FALE QLTNSVNLT LSE ENG LEQDIPT QDWDILEEIE+AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL KKAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH
        IRDNP+GDVIYFWAHLQVNRG+ P TFWSVCDILNGGLCRTTF STFREMFGLSSNM ALPPMPEDGG WSALHSWVMPTPSFLEFIMFSRMFTHYLDA 
Subjt:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH

Query:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL
        N+NQSQPNGCL A SE+EKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGK+GLWPL
Subjt:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL

Query:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

A0A5A7UUA8 UDP-Glycosyltransferase superfamily protein isoform 30.0e+0093.03Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSSSEIDDN S NAVPGTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        +SS GS++ERWLMERIKFGSSLKFVPGRISRRLVEGDGL+E+RK DR+GVRAPRLALILGSMENDPQSLMLITVMKN+QKLGYVFE            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGNK SMWEQI GQPS+LSPGHYG VDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWI+REDTLA+RLPMYEQRGWKHLISHWKRSFRRANVVV
Subjt:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDFALPMLYS LDNGNFHVIPGSPADVYAAE+Y NVHSKSQLREKNGF+ +DILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR+EVEGSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSHDALKEI SRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI
        LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECV GY QLLENVLNFPSDVKLPG  SQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISK+SVI
Subjt:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
        FALE QLTNSVNLT LSE ENG LEQDIPT QDWDILEEIE+AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL KKAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH
        IRDNP+GDVIYFWAHLQVNRG+ P TFWSVCDILNGGLCRTTF STFREMFGLSSNM ALPPMPEDGG WSALHSWVMPTPSFLEFIMFSRMFTHYLDA 
Subjt:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH

Query:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL
        N+NQSQPNGCL A SE+EKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGK+GLWPL
Subjt:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL

Query:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

A0A6J1H431 uncharacterized protein LOC111459418 isoform X10.0e+0089.44Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSS+EIDDNGSGNAVP  HSIRDRFPFKRNSSHFRLRAKDSLDHA  RSRSHQSRINRKGLLWW+PARGQT FYFVVVFAVF FV+GSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        MSS GSE+ RWLMERIKFGSSLKF PGRISRRLVEG GLDE+RK DR+GVRAPRLALILGSME++PQSLMLITVMKNIQKLGYV E            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGN+HSMW+QIGGQPS+LSP HYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCS+PLIWI+REDTLANRLPMYEQRGWKHLISHWK SFRRAN+VV
Subjt:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDF+LPMLYS LDNGNF+VIPGSPADVYAAE+YKNVHSKSQLREKNGF+E+DILV+VVGSLFFPNELSWDYAVAMHSIGPLL+ YA R+EV GSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSH AL+EI SRLGLPD SITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI
        L SFS+MISDGKLSRF+QAIASSG+LLAKNILASECV  YA+LLENVLNFPSDVKLPGSVSQLQLGAWEWNLFR+E V+TI +  D EERIAA SKSSVI
Subjt:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
        FALE Q+TN VNLTN SETENG LEQDIPT  DWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ VSIY
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIY+G+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYY DTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL  KAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
        IRDN KGDVIYFWAHLQVNR    GS   TFWSVCDILNGGLCRT FE+TFREMFGLSSNMEALPPMP+DGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
Subjt:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY

Query:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG
        LDA N+NQSQP GCL+ASSELEKKHCYCRILE+LVNVWAYHSGRRMVYI+P SGFLEEQHPVEQR+EFMWAKYFN TLLKSMDEDLAEAADDEGGS +MG
Subjt:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG

Query:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSL G
Subjt:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

A0A6J1JPJ0 uncharacterized protein LOC111487177 isoform X10.0e+0088.77Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSS+EIDDNGSGNAVP  HS RDRFPFKRNSSHFRLRAKDSLDHA  RSRSHQSRINRKGLLWW+PARGQT FYFVVVFAVF FV+GSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        MSS GSE+ RWLMERIKFGSSLKF PGRISRRLVEG GLDE+RK DR+GVRAPRLALILGSME++PQSLMLITVMKNIQKLGYV E            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGN+HSMW+QIGGQPS+LSP HYGHVDWSIYDGIIADSLEAEG IASLMQEPFCS+PLIWI+REDTLANRLPMYEQRGWKHLISHWK SFRRAN+VV
Subjt:  AVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDF+LPMLYS LDNGNF+VIPGSPADVYAAE+YKNVHSKSQLREKNGF+E+DILV+VVGSLFFPNELSWDYAVAMHSIGPLL+ YA R+EV GSFKF+F
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSH AL+EI SRLGLPD SITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI
        L SFS+MISDGKLSRF+QAIASSG+LLAKNILASECV  YA+LLENVLNFPSDVKLPGSVSQLQL AWEWNLFR+E+V+TI +  D EERIAA SKSSVI
Subjt:  LSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY
        FALE Q+TN VNLTN SET NG LEQDIPT  DWDILEEIEN EEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ VSIY
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPF+HHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYY DTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL  KAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
        IRDN KGDVIYFW HLQVNR    GS   TFWSVCDILNGGLCRT FE+TFREMFGLSSNMEALPPMP++GGRWSALHSWVMPTPSFLEFIMFSRMFTHY
Subjt:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY

Query:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG
        LDA N+NQSQP GCLLASSELEKKHCYCRILE+LVNVWAYHSGRRMVYI+P SGFLEEQHPVEQR+EFMWAKYFN TLLKSMDEDLAEAADDEGGS +MG
Subjt:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG

Query:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSL G
Subjt:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G01210.1 glycosyl transferase family 1 protein1.9e-19039.78Show/hide
Query:  MDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIFAVESGNKHSMWEQIGGQPSVLSPGHYGH--VDWSIYDGIIADSL
        + R G R P+LAL+ G +  DP+ ++++++ K +Q++GY  E            ++++E G  +S+W+++G   ++L P       +DW  YDGII +SL
Subjt:  MDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIFAVESGNKHSMWEQIGGQPSVLSPGHYGH--VDWSIYDGIIADSL

Query:  EAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQ
         A       MQEPF SLPLIW+I E+TLA R   Y   G   L++ WK+ F RA+VVVF ++ LP+LY+  D GNF+VIPGSP +V  A   KN+    Q
Subjt:  EAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQ

Query:  LREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGS-FKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLM
                ++D+++ +VGS F       ++A+ + ++ PL S      E + S  K + L   +      A++ I   L  P  ++ H  + G+V+ +L 
Subjt:  LREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGS-FKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLM

Query:  MADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYA
         +D+V+YGS  E QSFP +L++AMS G PI+ PDL  +R Y+ D V G +FPK N   L     ++I++GK+S  AQ IA  G+   KN++A E + GYA
Subjt:  MADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYA

Query:  QLLENVLNFPSDVKLPGSVSQLQ---LGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILE
         LLEN+L F S+V  P  V ++       W W+ F   M      +     RIA     S  F  +V+     N T     + G +  D    + W+   
Subjt:  QLLENVLNFPSDVKLPGSVSQLQ---LGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILE

Query:  EIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAV
         ++     +  E EE + R+ +  G W+++Y++A+++++ K + +ERDEGEL RTGQ + IYE Y G G W F+H   LYRG+ LS +  R R DDV+A 
Subjt:  EIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAV

Query:  GRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN-RGSTPTTFWSVCDILNGG
         RLPL N+ YY D L + G  FAI+NKID +HK  WIGFQSWRA+ RK SL K AE+ L + I+    GD +YFW  +  + R      FWS CD +N G
Subjt:  GRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN-RGSTPTTFWSVCDILNGG

Query:  LCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH-NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAY
         CR  +  T ++M+ +  N+++LPPMPEDG  WS + SW +PT SFLEF+MFSRMF   LDA   +   + N C L  S  + KHCY R+LE+LVNVWAY
Subjt:  LCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH-NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAY

Query:  HSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERM
        HS RR+VYI+P++G ++EQH  + R+  MW K+F++T LK+MDEDLAE AD +   G   LWP TGE+ W+G  E+E++++   K +KK+ ++ KL    
Subjt:  HSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERM

Query:  KFGYKQKSLG
            +QK +G
Subjt:  KFGYKQKSLG

AT5G04480.1 UDP-Glycosyltransferase superfamily protein0.0e+0060.47Show/hide
Query:  MRRSSSSEIDDNG-------------SGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQ--SRINRKGLLWWIPARGQTLFYFVVVFAVF
        +R S S EIDDNG             +GN     HSIRDR   KRNSS  R R+   LD  + R+R H     +NRKGLL  +  RG  L YF+V F V 
Subjt:  MRRSSSSEIDDNG-------------SGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQ--SRINRKGLLWWIPARGQTLFYFVVVFAVF

Query:  GFVTGSMLLQSSISLMSSGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFE
         FV  S+LLQ+SI+    G+ K   +  +I  GS+LK+VPG I+R L+EG GLD LR   RIGVR PRLAL+LG+M+ DP++LML+TVMKN+QKLGYVF+
Subjt:  GFVTGSMLLQSSISLMSSGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFE

Query:  CSTMFPFSNVHMIFAVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLI
                    +FAVE+G   S+WEQ+ G   VL     GH DW+I++G+IADSLEA+ AI+SLMQEPF S+PLIWI+ ED LANRLP+Y++ G   LI
Subjt:  CSTMFPFSNVHMIFAVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLI

Query:  SHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIY
        SHW+ +F RA+VVVFP F LPML+S LD+GNF VIP S  DV+AAESY   H+K  LRE N F E+D+++LV+GS FF +E SWD AVAMH +GPLL+ Y
Subjt:  SHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIY

Query:  ARRKEVEGSFKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDG
         RRK+  GSFKFVFL  NST G  DA++E+ SRLGL +G++ H+GLN DVN VL MADI++Y SSQE Q+FPPL++RAMSFGIPI+ PD P ++ Y+ D 
Subjt:  ARRKEVEGSFKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDG

Query:  VHGVIFPKHNSDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENAD
        VHG+ F +++ DALL +FS +ISDG+LS+FAQ IASSGRLL KN++A+EC+ GYA+LLEN+L+FPSD  LPGS+SQLQ+ AWEWN FR E+ +      D
Subjt:  VHGVIFPKHNSDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENAD

Query:  DEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERD
             A I KS ++F +E +    +  TN  +     +  ++P+  DWD+LEEIE AEEYE VE EE ++RMERD+  W+EIYRNARKSEKLKFE NERD
Subjt:  DEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERD

Query:  EGELERTGQTVSIYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK
        EGELERTG+ + IYEIY+G+GAWPF+HHGSLYRGLSLS++  RL SDDV+A  RLPLLND+YY D LCEIGGMF++ANK+D+IH RPWIGFQSWRA+GRK
Subjt:  EGELERTGQTVSIYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN----RGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPS
        VSL  KAE  LE+ I+   KG++IYFW  L ++          TFWS+CDILN G CRTTFE  FR M+GL  ++EALPPMPEDG  WS+LH+WVMPTPS
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN----RGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPS

Query:  FLEFIMFSRMFTHYLDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDL
        FLEF+MFSRMF+  LDA + N +    C LASS LE+KHCYCR+LE+LVNVWAYHSGR+MVYINP+ G LEEQHP++QRK  MWAKYFNFTLLKSMDEDL
Subjt:  FLEFIMFSRMFTHYLDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDL

Query:  AEAADDEGGSGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        AEAADD+    +  LWPLTGEVHW+G+YEREREERYR+KMDKKR TK KL +R+K GYKQKSLGG
Subjt:  AEAADDEGGSGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

AT5G04480.2 UDP-Glycosyltransferase superfamily protein0.0e+0059.34Show/hide
Query:  MRRSSSSEIDDNG-------------SGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQ--SRINRKGLLWWIPARGQTLFYFVVVFAVF
        +R S S EIDDNG             +GN     HSIRDR   KRNSS  R R+   LD  + R+R H     +NRKGLL  +  RG  L YF+V F V 
Subjt:  MRRSSSSEIDDNG-------------SGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQ--SRINRKGLLWWIPARGQTLFYFVVVFAVF

Query:  GFVTGSMLLQSSISLMSSGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFE
         FV  S+LLQ+SI+    G+ K   +  +I  GS+LK+VPG I+R L+EG GLD LR   RIGVR PRLAL+LG+M+ DP++LML               
Subjt:  GFVTGSMLLQSSISLMSSGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFE

Query:  CSTMFPFSNVHMIFAVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLI
                    +FAVE+G   S+WEQ+ G   VL     GH DW+I++G+IADSLEA+ AI+SLMQEPF S+PLIWI+ ED LANRLP+Y++ G   LI
Subjt:  CSTMFPFSNVHMIFAVESGNKHSMWEQIGGQPSVLSPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLI

Query:  SHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIY
        SHW+ +F RA+VVVFP F LPML+S LD+GNF VIP S  DV+AAESY   H+K  LRE N F E+D+++LV+GS FF +E SWD AVAMH +GPLL+ Y
Subjt:  SHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIY

Query:  ARRKEVEGSFKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDG
         RRK+  GSFKFVFL  NST G  DA++E+ SRLGL +G++ H+GLN DVN VL MADI++Y SSQE Q+FPPL++RAMSFGIPI+ PD P ++ Y+ D 
Subjt:  ARRKEVEGSFKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDG

Query:  VHGVIFPKHNSDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENAD
        VHG+ F +++ DALL +FS +ISDG+LS+FAQ IASSGRLL KN++A+EC+ GYA+LLEN+L+FPSD  LPGS+SQLQ+ AWEWN FR E+ +      D
Subjt:  VHGVIFPKHNSDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENAD

Query:  DEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERD
             A I KS ++F +E +    +  TN  +     +  ++P+  DWD+LEEIE AEEYE VE EE ++RMERD+  W+EIYRNARKSEKLKFE NERD
Subjt:  DEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERD

Query:  EGELERTGQTVSIYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK
        EGELERTG+ + IYEIY+G+GAWPF+HHGSLYRGLSLS++  RL SDDV+A  RLPLLND+YY D LCEIGGMF++ANK+D+IH RPWIGFQSWRA+GRK
Subjt:  EGELERTGQTVSIYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN----RGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPS
        VSL  KAE  LE+ I+   KG++IYFW  L ++          TFWS+CDILN G CRTTFE  FR M+GL  ++EALPPMPEDG  WS+LH+WVMPTPS
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN----RGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTPS

Query:  FLEFIMFSRMFTHYLDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDL
        FLEF+MFSRMF+  LDA + N +    C LASS LE+KHCYCR+LE+LVNVWAYHSGR+MVYINP+ G LEEQHP++QRK  MWAKYFNFTLLKSMDEDL
Subjt:  FLEFIMFSRMFTHYLDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDL

Query:  AEAADDEGGSGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        AEAADD+    +  LWPLTGEVHW+G+YEREREERYR+KMDKKR TK KL +R+K GYKQKSLGG
Subjt:  AEAADDEGGSGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACGAAGTTCATCTTCAGAGATCGACGACAATGGGAGTGGAAATGCCGTTCCCGGCACTCACTCGATTCGTGATCGTTTTCCTTTCAAGCGGAATTCCAGTCACTT
CCGTTTGCGAGCCAAGGACTCATTGGATCATGCAGCCTCTCGCTCCCGATCTCACCAGAGCCGGATCAATCGCAAGGGCTTGCTCTGGTGGATTCCAGCTAGAGGGCAAA
CGCTTTTCTACTTTGTTGTCGTTTTTGCGGTATTCGGGTTTGTTACCGGGTCTATGCTGTTGCAGAGCTCGATTAGCTTGATGAGCTCTGGAAGTGAAAAGGAGCGGTGG
CTTATGGAGCGTATTAAGTTTGGGAGCTCGCTGAAGTTTGTGCCAGGGAGGATTTCCAGGAGGCTGGTGGAAGGTGATGGGCTTGACGAGTTGAGAAAGATGGATCGAAT
TGGCGTTCGTGCACCGAGGCTTGCTCTAATCTTGGGAAGCATGGAGAACGATCCACAATCATTAATGTTGATTACTGTGATGAAGAACATACAGAAACTTGGATATGTAT
TTGAGTGTAGTACTATGTTTCCATTTTCCAATGTTCATATGATTTTTGCAGTAGAGAGTGGAAATAAACATTCAATGTGGGAACAGATAGGTGGTCAGCCTTCAGTATTA
AGTCCAGGACATTATGGTCATGTTGATTGGTCAATATATGATGGTATTATTGCTGACTCCTTGGAAGCAGAGGGGGCGATTGCAAGCCTTATGCAGGAACCTTTTTGTTC
TCTACCACTCATATGGATAATTCGAGAAGATACACTAGCCAACCGCTTGCCTATGTATGAACAAAGGGGCTGGAAGCATCTCATTTCACATTGGAAGAGGTCTTTTAGAA
GGGCTAATGTTGTCGTGTTCCCTGATTTTGCCCTCCCAATGTTGTATAGCACTTTGGACAATGGAAACTTCCACGTGATTCCTGGATCCCCAGCAGATGTTTATGCTGCA
GAAAGCTACAAGAATGTTCACTCCAAAAGTCAATTGAGAGAGAAAAATGGATTTGATGAAAATGATATACTGGTTCTTGTCGTTGGAAGTTTGTTCTTCCCAAATGAGTT
GTCCTGGGACTATGCTGTTGCTATGCACAGCATTGGACCTCTACTCTCAATATATGCAAGGAGGAAAGAAGTAGAAGGATCATTTAAATTTGTTTTCTTATGTTGTAATT
CAACTGATGGGTCCCATGATGCTTTAAAGGAAATTGTTTCACGTCTAGGACTTCCTGATGGATCTATAACACATTATGGCTTAAATGGAGATGTCAACAATGTACTGATG
ATGGCTGACATTGTGCTTTATGGATCTTCACAAGAAATCCAGAGTTTTCCTCCTCTACTTATTCGAGCCATGTCCTTTGGAATCCCAATCATGGTGCCTGATTTACCTGG
CTTGAGAAATTATATTGTTGATGGTGTCCATGGGGTTATCTTTCCAAAACATAATTCTGATGCTTTATTGAGCTCTTTCTCACAAATGATATCAGATGGGAAGCTCTCGA
GATTTGCACAAGCAATAGCTTCCTCTGGAAGATTGCTTGCTAAAAATATACTTGCATCAGAATGTGTTGCTGGTTATGCACAGCTCCTGGAGAATGTTCTGAATTTCCCA
TCAGATGTTAAGCTTCCAGGTTCTGTCTCCCAGCTTCAACTAGGGGCATGGGAATGGAATTTGTTCAGGAAGGAAATGGTGAAGACAATTGACGAAAATGCAGATGATGA
AGAACGAATTGCAGCAATAAGTAAATCCAGTGTTATTTTTGCTCTTGAAGTGCAATTAACTAATTCTGTTAATTTAACAAATTTGTCTGAGACTGAAAATGGGCCTCTGG
AGCAAGATATTCCAACTTCTCAAGACTGGGATATTTTGGAGGAAATTGAAAATGCTGAAGAGTATGAAACTGTTGAAATGGAAGAGTTTCAAGAAAGAATGGAGAGAGAT
CTAGGTGCATGGGATGAAATATATCGAAATGCTCGGAAATCAGAAAAGCTCAAGTTTGAAGCAAATGAACGGGATGAGGGTGAGCTCGAAAGGACGGGACAGACTGTATC
CATATATGAGATATACAGTGGTTCTGGAGCTTGGCCATTCATGCACCATGGTTCTTTGTACCGTGGACTAAGTCTTTCCACGAGAGCACTGAGGTTAAGATCTGATGATG
TCAATGCTGTTGGACGGCTTCCTCTACTCAATGACTCCTACTATCTGGACACTCTCTGTGAGATTGGAGGAATGTTTGCTATTGCAAATAAGATTGATAACATTCATAAG
AGACCTTGGATTGGGTTCCAATCATGGCGGGCTTCTGGAAGAAAGGTTTCCTTGTGCAAAAAAGCTGAAAATGTTTTGGAAGACACTATACGGGACAACCCTAAAGGAGA
TGTTATATACTTCTGGGCACACTTGCAAGTGAATCGTGGAAGCACTCCTACTACTTTCTGGTCGGTGTGTGATATCTTGAACGGTGGTCTCTGCAGAACCACCTTCGAGA
GCACCTTCCGCGAGATGTTTGGATTGTCATCAAATATGGAAGCTCTTCCGCCTATGCCAGAAGATGGTGGTCGCTGGTCTGCCCTCCATAGTTGGGTGATGCCAACCCCT
TCCTTCCTGGAGTTCATCATGTTTTCCAGGATGTTCACCCATTACCTTGATGCTCATAATAAAAATCAGAGTCAGCCAAATGGATGTTTGTTGGCTTCCTCAGAGCTTGA
GAAAAAACACTGTTACTGTCGGATATTGGAAATGCTGGTCAATGTCTGGGCTTACCACAGTGGCCGGAGAATGGTTTATATCAATCCTCAATCTGGTTTCCTCGAAGAGC
AGCATCCAGTTGAACAACGCAAGGAATTTATGTGGGCAAAATATTTCAACTTCACGTTGTTGAAAAGTATGGACGAAGATTTAGCAGAAGCCGCTGACGACGAAGGCGGT
TCAGGTAAAATGGGGTTATGGCCATTAACAGGGGAAGTGCATTGGCAAGGAATTTATGAAAGAGAGAGAGAAGAAAGGTATAGGGTGAAAATGGACAAGAAGAGAACTAC
AAAAGTAAAACTAATGGAGAGGATGAAATTTGGATACAAACAAAAATCACTTGGAGGATAA
mRNA sequenceShow/hide mRNA sequence
CTTGGCATATTTTCTTCGATGTGAAAAGTCATTGTCATTTCATCTTGCATTTGTCTTCTTGTAAATGGAAACCATGGCAGTGAGAGCAATTCACATCTGAGTTTCGCTTC
ATTTTATCTCCATTATCAAATTCTTCCTCCTCAATCATTGGAAATCCATTTTCCATTAACGTTCTGTGTAGAAAACTCGGACCACAGAATCATCTGAATGAGACGAAGTT
CATCTTCAGAGATCGACGACAATGGGAGTGGAAATGCCGTTCCCGGCACTCACTCGATTCGTGATCGTTTTCCTTTCAAGCGGAATTCCAGTCACTTCCGTTTGCGAGCC
AAGGACTCATTGGATCATGCAGCCTCTCGCTCCCGATCTCACCAGAGCCGGATCAATCGCAAGGGCTTGCTCTGGTGGATTCCAGCTAGAGGGCAAACGCTTTTCTACTT
TGTTGTCGTTTTTGCGGTATTCGGGTTTGTTACCGGGTCTATGCTGTTGCAGAGCTCGATTAGCTTGATGAGCTCTGGAAGTGAAAAGGAGCGGTGGCTTATGGAGCGTA
TTAAGTTTGGGAGCTCGCTGAAGTTTGTGCCAGGGAGGATTTCCAGGAGGCTGGTGGAAGGTGATGGGCTTGACGAGTTGAGAAAGATGGATCGAATTGGCGTTCGTGCA
CCGAGGCTTGCTCTAATCTTGGGAAGCATGGAGAACGATCCACAATCATTAATGTTGATTACTGTGATGAAGAACATACAGAAACTTGGATATGTATTTGAGTGTAGTAC
TATGTTTCCATTTTCCAATGTTCATATGATTTTTGCAGTAGAGAGTGGAAATAAACATTCAATGTGGGAACAGATAGGTGGTCAGCCTTCAGTATTAAGTCCAGGACATT
ATGGTCATGTTGATTGGTCAATATATGATGGTATTATTGCTGACTCCTTGGAAGCAGAGGGGGCGATTGCAAGCCTTATGCAGGAACCTTTTTGTTCTCTACCACTCATA
TGGATAATTCGAGAAGATACACTAGCCAACCGCTTGCCTATGTATGAACAAAGGGGCTGGAAGCATCTCATTTCACATTGGAAGAGGTCTTTTAGAAGGGCTAATGTTGT
CGTGTTCCCTGATTTTGCCCTCCCAATGTTGTATAGCACTTTGGACAATGGAAACTTCCACGTGATTCCTGGATCCCCAGCAGATGTTTATGCTGCAGAAAGCTACAAGA
ATGTTCACTCCAAAAGTCAATTGAGAGAGAAAAATGGATTTGATGAAAATGATATACTGGTTCTTGTCGTTGGAAGTTTGTTCTTCCCAAATGAGTTGTCCTGGGACTAT
GCTGTTGCTATGCACAGCATTGGACCTCTACTCTCAATATATGCAAGGAGGAAAGAAGTAGAAGGATCATTTAAATTTGTTTTCTTATGTTGTAATTCAACTGATGGGTC
CCATGATGCTTTAAAGGAAATTGTTTCACGTCTAGGACTTCCTGATGGATCTATAACACATTATGGCTTAAATGGAGATGTCAACAATGTACTGATGATGGCTGACATTG
TGCTTTATGGATCTTCACAAGAAATCCAGAGTTTTCCTCCTCTACTTATTCGAGCCATGTCCTTTGGAATCCCAATCATGGTGCCTGATTTACCTGGCTTGAGAAATTAT
ATTGTTGATGGTGTCCATGGGGTTATCTTTCCAAAACATAATTCTGATGCTTTATTGAGCTCTTTCTCACAAATGATATCAGATGGGAAGCTCTCGAGATTTGCACAAGC
AATAGCTTCCTCTGGAAGATTGCTTGCTAAAAATATACTTGCATCAGAATGTGTTGCTGGTTATGCACAGCTCCTGGAGAATGTTCTGAATTTCCCATCAGATGTTAAGC
TTCCAGGTTCTGTCTCCCAGCTTCAACTAGGGGCATGGGAATGGAATTTGTTCAGGAAGGAAATGGTGAAGACAATTGACGAAAATGCAGATGATGAAGAACGAATTGCA
GCAATAAGTAAATCCAGTGTTATTTTTGCTCTTGAAGTGCAATTAACTAATTCTGTTAATTTAACAAATTTGTCTGAGACTGAAAATGGGCCTCTGGAGCAAGATATTCC
AACTTCTCAAGACTGGGATATTTTGGAGGAAATTGAAAATGCTGAAGAGTATGAAACTGTTGAAATGGAAGAGTTTCAAGAAAGAATGGAGAGAGATCTAGGTGCATGGG
ATGAAATATATCGAAATGCTCGGAAATCAGAAAAGCTCAAGTTTGAAGCAAATGAACGGGATGAGGGTGAGCTCGAAAGGACGGGACAGACTGTATCCATATATGAGATA
TACAGTGGTTCTGGAGCTTGGCCATTCATGCACCATGGTTCTTTGTACCGTGGACTAAGTCTTTCCACGAGAGCACTGAGGTTAAGATCTGATGATGTCAATGCTGTTGG
ACGGCTTCCTCTACTCAATGACTCCTACTATCTGGACACTCTCTGTGAGATTGGAGGAATGTTTGCTATTGCAAATAAGATTGATAACATTCATAAGAGACCTTGGATTG
GGTTCCAATCATGGCGGGCTTCTGGAAGAAAGGTTTCCTTGTGCAAAAAAGCTGAAAATGTTTTGGAAGACACTATACGGGACAACCCTAAAGGAGATGTTATATACTTC
TGGGCACACTTGCAAGTGAATCGTGGAAGCACTCCTACTACTTTCTGGTCGGTGTGTGATATCTTGAACGGTGGTCTCTGCAGAACCACCTTCGAGAGCACCTTCCGCGA
GATGTTTGGATTGTCATCAAATATGGAAGCTCTTCCGCCTATGCCAGAAGATGGTGGTCGCTGGTCTGCCCTCCATAGTTGGGTGATGCCAACCCCTTCCTTCCTGGAGT
TCATCATGTTTTCCAGGATGTTCACCCATTACCTTGATGCTCATAATAAAAATCAGAGTCAGCCAAATGGATGTTTGTTGGCTTCCTCAGAGCTTGAGAAAAAACACTGT
TACTGTCGGATATTGGAAATGCTGGTCAATGTCTGGGCTTACCACAGTGGCCGGAGAATGGTTTATATCAATCCTCAATCTGGTTTCCTCGAAGAGCAGCATCCAGTTGA
ACAACGCAAGGAATTTATGTGGGCAAAATATTTCAACTTCACGTTGTTGAAAAGTATGGACGAAGATTTAGCAGAAGCCGCTGACGACGAAGGCGGTTCAGGTAAAATGG
GGTTATGGCCATTAACAGGGGAAGTGCATTGGCAAGGAATTTATGAAAGAGAGAGAGAAGAAAGGTATAGGGTGAAAATGGACAAGAAGAGAACTACAAAAGTAAAACTA
ATGGAGAGGATGAAATTTGGATACAAACAAAAATCACTTGGAGGATAAAGAAGGGCTGGAATTCTGAAAGCACTTCTAACTGGCTGACTTTGGGTAGAAGAAGAAGAGAT
TGAAAGAAAGAAAGAAAGAAAGACTCCTATTATATTTGGTAAGTAAAAGAAATTGAGTTCTTTCTGAATTATAATACTCTCATCACTCTACAGAGCAATATTTCTAGGCA
ACAGAAACTGGGGGAAAGGGAAAGCTGAAAAATTGCAGCTCGGTTTATTAGAAGCAATATTACCAAAGAGAGGTACTAAATATTCTTTTTAAGATTCTTTCTTTTATATT
CAACATACCTCATATAATAGCACAACAAGAATATAGGGAATTTCATTTTTGCTGTTCTGTTGTACAACCAAAATAGCAATTATTAAGGCAATTTTGGTCACACATAGTGA
TAATAATAATAGATTGGTTTCCACTTTCAGATATGTTTGAGATATGTCCTGATTTTGCTTTTAGTTGTGTTTTCTCCTCTGCCAATAACTATGGTGGAGAAATCTGCAGC
AAACTTTGTCCATTTTATGACAAAATTTCCCCATATCTCCAATTGTACCTGAGGAGGACTTGTCTTGTAACTTTGTCCTTGTCAATAAAACACTGACGAGACATTTTG
Protein sequenceShow/hide protein sequence
MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISLMSSGSEKERW
LMERIKFGSSLKFVPGRISRRLVEGDGLDELRKMDRIGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIFAVESGNKHSMWEQIGGQPSVL
SPGHYGHVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAA
ESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLM
MADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFP
SDVKLPGSVSQLQLGAWEWNLFRKEMVKTIDENADDEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERD
LGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHK
RPWIGFQSWRASGRKVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMEALPPMPEDGGRWSALHSWVMPTP
SFLEFIMFSRMFTHYLDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHPVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGG
SGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG