; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC08G152000 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC08G152000
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionGlycos_transf_1 domain-containing protein
Genome locationCicolChr08:20273054..20282867
RNA-Seq ExpressionCcUC08G152000
SyntenyCcUC08G152000
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR001296 - Glycosyl transferase, family 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032959.1 hypothetical protein SDJN02_07010 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0088.87Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSS+EIDDNGSGNAVP  HSIRDRFPFKRNSSHFRLRAKDSLDHA  RSRSHQSRINRKGLLWW+PARGQT FYFVVVFAVF FV+GSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        MSS GSE+ RWLMERIKFGSSLKF PGRISRRLVEG GLDE+RKKDRVGVRAPRLALILGSM ++PQSLMLITVMKNIQKLGYV E            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGN+HS+W+QIGGQPS+LSP HYG VDWSIYDGIIADSLEAEGAIASLMQEPFCS+PLIWI+REDTLANRLPMYEQRGWKHLISHWK SFRRAN+VV
Subjt:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDF+LPMLYS LDNGNF+VIPGSPADVYAAE+YKNVHSKSQLREKNGF+E+DILV+VVGSLFFPNELSWDYAVAMHSIGPLL+ YA R+EV GSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSH AL+EI SRLGLPD SITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI
        L SFS MISDGKLSRF+QAIASSG+LLAKNILASECV  YA+LLENVLNFPSDVKLPGSVSQLQLG WEWNLFR+E V+TI +  D EERIAA SKSSVI
Subjt:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY
        FALE Q+TN VNLTN SETENG LEQDIPT  DWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ VS+Y
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIY+G+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYY DTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL  KAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
        IRDN KGDVIYFWAHLQVNR    GS   TFWSVCDILNGGLCRT FE+TFREMFGLSSNM ALPPMP+DGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
Subjt:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY

Query:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG
        LDA N+N SQP GCL+ASSELEKKHCYCRILE+LVNVWAYHSGRRMVYI+P SGFLEEQH VEQR+EFMWAKYFN TLLKSMDEDLAEAADDEGGS +MG
Subjt:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG

Query:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSL G
Subjt:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

XP_004138684.1 uncharacterized protein LOC101206364 isoform X1 [Cucumis sativus]0.0e+0092.07Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSSSEIDDN S NAV GTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        +SS GS++ERWLMERIKFGSSLKFVPGRIS+RLVEGDGL+E+RKKDRVGVRAPRLALILGSM NDPQSLMLITVMKNIQKLGYVFE            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVE GNK S+WEQI GQPS+LSPGHYG VDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWI+REDTLA+RLPMYEQRGWKHLISHWKRSFRRANVVV
Subjt:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDFALPMLYS LDNGNFHVIPGSPADVYAAE Y NVHSKSQLREKNGF+E+DILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR+EVEGSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSHDALKEI SRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP L+NYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI
        LSSFS MISDGKLSRFAQ+IASSGRLLAKNILASECV GYAQLLENVLNFPSDVKLPG VSQLQLG WEWNLFRKEMVKTIDE+AD+EERIA ISK+SVI
Subjt:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY
        FALE QLTNSVNLT LSE ENG LEQDIPT QDWDILE+IE+AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFE+NERDEGELERTGQTVS+Y
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLL+DSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSW+ASGRKVSL KKAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH
        I+DNPKGDVIYFWAHLQVNRG+ P TFWSVCDILNGGLCRTTF STFREMFGLSSNMGALPPMPEDGG WSALHSWVMPTPSFLEFIMFSRMFTHYLDA 
Subjt:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH

Query:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL
        N+NQSQPNGCLLASSE+EKKHCYCRILEMLVNVWAYHSGRRMVYINP SGFLEEQH VEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGK+GLWPL
Subjt:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL

Query:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

XP_008456559.1 PREDICTED: uncharacterized protein LOC103496475 isoform X1 [Cucumis melo]0.0e+0092.65Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSSSEIDDN S NAVPGTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        +SS GS++ERWLMERIKFGSSLKFVPGRISRRLVEGDGL+E+RKKDRVGVRAPRLALILGSM NDPQSLMLITVMKN+QKLGYVFE            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGNK S+WEQI GQPS+LSPGHYG VDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWI+REDTLA+RLPMYEQRGWKHLISHWKRSFRRANVVV
Subjt:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDFALPMLYS LDNGNFHVIPGSPADVYAAE+Y NVHSKSQLREKNGF+ +DILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR+EVEGSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSHDALKEI SRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI
        LSSFS MISDGKLSRFAQAIASSGRLLAKNILASECV GY QLLENVLNFPSDVKLPG  SQLQLG WEWNLFRKEMVKTIDE+ADDEERIAAISK+SVI
Subjt:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY
        FALE QLTNSVNLT LSE ENG LEQDIPT QDWDILEEIE+AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVS+Y
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL KKAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH
        IRDNP+GDVIYFWAHLQVNRG+ P TFWSVCDILNGGLCRTTF STFREMFGLSSNMGALPPMPEDGG WSALHSWVMPTPSFLEFIMFSRMFTHYLDA 
Subjt:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH

Query:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL
        N+NQSQPNGCL A SE+EKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQH VEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGK+GLWPL
Subjt:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL

Query:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

XP_022958089.1 uncharacterized protein LOC111459418 isoform X1 [Cucurbita moschata]0.0e+0088.96Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSS+EIDDNGSGNAVP  HSIRDRFPFKRNSSHFRLRAKDSLDHA  RSRSHQSRINRKGLLWW+PARGQT FYFVVVFAVF FV+GSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        MSS GSE+ RWLMERIKFGSSLKF PGRISRRLVEG GLDE+RKKDRVGVRAPRLALILGSM ++PQSLMLITVMKNIQKLGYV E            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGN+HS+W+QIGGQPS+LSP HYG VDWSIYDGIIADSLEAEGAIASLMQEPFCS+PLIWI+REDTLANRLPMYEQRGWKHLISHWK SFRRAN+VV
Subjt:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDF+LPMLYS LDNGNF+VIPGSPADVYAAE+YKNVHSKSQLREKNGF+E+DILV+VVGSLFFPNELSWDYAVAMHSIGPLL+ YA R+EV GSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSH AL+EI SRLGLPD SITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI
        L SFS MISDGKLSRF+QAIASSG+LLAKNILASECV  YA+LLENVLNFPSDVKLPGSVSQLQLG WEWNLFR+E V+TI +  D EERIAA SKSSVI
Subjt:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY
        FALE Q+TN VNLTN SETENG LEQDIPT  DWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ VS+Y
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIY+G+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYY DTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL  KAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
        IRDN KGDVIYFWAHLQVNR    GS   TFWSVCDILNGGLCRT FE+TFREMFGLSSNM ALPPMP+DGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
Subjt:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY

Query:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG
        LDA N+NQSQP GCL+ASSELEKKHCYCRILE+LVNVWAYHSGRRMVYI+P SGFLEEQH VEQR+EFMWAKYFN TLLKSMDEDLAEAADDEGGS +MG
Subjt:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG

Query:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSL G
Subjt:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

XP_038884759.1 uncharacterized protein LOC120075439 isoform X1 [Benincasa hispida]0.0e+0092.74Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYF+VVFAVFGFVTGSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        MSS GSE+ERWLMERIKFGSSLKFVPG ISR+LVEGDGLDE+RKKDRVGVR+PRLALILGSM NDPQSLMLITVMKNIQKLGY+ E            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGNKHSIWEQIGGQPS+LSP HYG VDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWI+REDTLANRLP+YEQRGWKHLISHWK SFRRANVVV
Subjt:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDFALPMLYSTLD+GNFHVIPGSPADVYAAE+YKN HSKSQLREKNGF E+DILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEV GSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSHDALKEI SRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI
        LSSFS MISDGKLSRFAQAIASSGRLLAKNILASECV GYAQLLENVLNFP DVKLP S SQLQLG WEWNLFRKEMVK IDE ADDEERIAA +K+SVI
Subjt:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY
        FALE QLTNSVNLT LSE ENG LE DIPTSQDWD+LEEIENAEEYETVEMEEFQERMERDLGAWD+IYRNARKSEKLKFEANERDEGELERTGQTVS+Y
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPFMHHGSLYRGLSLST+ALRL+SDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAEN LED 
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH
        IRDNPKGDVIYFWAHLQVNRG  P TFWSVCDILNGGLCRTTF+STFR+M+GLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDA 
Subjt:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH

Query:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL
        N+NQS PNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRR+VYINPQSGFLEEQH VEQRKEFMWAKYFNFTLLKSMDEDLAEA DDEG SGK GLWPL
Subjt:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL

Query:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        TGEVHWQGIYEREREERYRVKMDKKRTTKVKL ERMKFGYKQKSLGG
Subjt:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

TrEMBL top hitse value%identityAlignment
A0A0A0LMB5 Glycos_transf_1 domain-containing protein0.0e+0092.07Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSSSEIDDN S NAV GTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        +SS GS++ERWLMERIKFGSSLKFVPGRIS+RLVEGDGL+E+RKKDRVGVRAPRLALILGSM NDPQSLMLITVMKNIQKLGYVFE            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVE GNK S+WEQI GQPS+LSPGHYG VDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWI+REDTLA+RLPMYEQRGWKHLISHWKRSFRRANVVV
Subjt:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDFALPMLYS LDNGNFHVIPGSPADVYAAE Y NVHSKSQLREKNGF+E+DILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR+EVEGSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSHDALKEI SRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP L+NYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI
        LSSFS MISDGKLSRFAQ+IASSGRLLAKNILASECV GYAQLLENVLNFPSDVKLPG VSQLQLG WEWNLFRKEMVKTIDE+AD+EERIA ISK+SVI
Subjt:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY
        FALE QLTNSVNLT LSE ENG LEQDIPT QDWDILE+IE+AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFE+NERDEGELERTGQTVS+Y
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLL+DSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSW+ASGRKVSL KKAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH
        I+DNPKGDVIYFWAHLQVNRG+ P TFWSVCDILNGGLCRTTF STFREMFGLSSNMGALPPMPEDGG WSALHSWVMPTPSFLEFIMFSRMFTHYLDA 
Subjt:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH

Query:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL
        N+NQSQPNGCLLASSE+EKKHCYCRILEMLVNVWAYHSGRRMVYINP SGFLEEQH VEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGK+GLWPL
Subjt:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL

Query:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

A0A1S3C3I4 uncharacterized protein LOC103496475 isoform X10.0e+0092.65Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSSSEIDDN S NAVPGTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        +SS GS++ERWLMERIKFGSSLKFVPGRISRRLVEGDGL+E+RKKDRVGVRAPRLALILGSM NDPQSLMLITVMKN+QKLGYVFE            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGNK S+WEQI GQPS+LSPGHYG VDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWI+REDTLA+RLPMYEQRGWKHLISHWKRSFRRANVVV
Subjt:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDFALPMLYS LDNGNFHVIPGSPADVYAAE+Y NVHSKSQLREKNGF+ +DILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR+EVEGSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSHDALKEI SRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI
        LSSFS MISDGKLSRFAQAIASSGRLLAKNILASECV GY QLLENVLNFPSDVKLPG  SQLQLG WEWNLFRKEMVKTIDE+ADDEERIAAISK+SVI
Subjt:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY
        FALE QLTNSVNLT LSE ENG LEQDIPT QDWDILEEIE+AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVS+Y
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL KKAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH
        IRDNP+GDVIYFWAHLQVNRG+ P TFWSVCDILNGGLCRTTF STFREMFGLSSNMGALPPMPEDGG WSALHSWVMPTPSFLEFIMFSRMFTHYLDA 
Subjt:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH

Query:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL
        N+NQSQPNGCL A SE+EKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQH VEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGK+GLWPL
Subjt:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL

Query:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

A0A5A7UUA8 UDP-Glycosyltransferase superfamily protein isoform 30.0e+0092.65Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSSSEIDDN S NAVPGTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        +SS GS++ERWLMERIKFGSSLKFVPGRISRRLVEGDGL+E+RKKDRVGVRAPRLALILGSM NDPQSLMLITVMKN+QKLGYVFE            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGNK S+WEQI GQPS+LSPGHYG VDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWI+REDTLA+RLPMYEQRGWKHLISHWKRSFRRANVVV
Subjt:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDFALPMLYS LDNGNFHVIPGSPADVYAAE+Y NVHSKSQLREKNGF+ +DILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR+EVEGSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSHDALKEI SRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI
        LSSFS MISDGKLSRFAQAIASSGRLLAKNILASECV GY QLLENVLNFPSDVKLPG  SQLQLG WEWNLFRKEMVKTIDE+ADDEERIAAISK+SVI
Subjt:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY
        FALE QLTNSVNLT LSE ENG LEQDIPT QDWDILEEIE+AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVS+Y
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL KKAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH
        IRDNP+GDVIYFWAHLQVNRG+ P TFWSVCDILNGGLCRTTF STFREMFGLSSNMGALPPMPEDGG WSALHSWVMPTPSFLEFIMFSRMFTHYLDA 
Subjt:  IRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH

Query:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL
        N+NQSQPNGCL A SE+EKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQH VEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGK+GLWPL
Subjt:  NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPL

Query:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  TGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

A0A6J1H431 uncharacterized protein LOC111459418 isoform X10.0e+0088.96Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSS+EIDDNGSGNAVP  HSIRDRFPFKRNSSHFRLRAKDSLDHA  RSRSHQSRINRKGLLWW+PARGQT FYFVVVFAVF FV+GSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        MSS GSE+ RWLMERIKFGSSLKF PGRISRRLVEG GLDE+RKKDRVGVRAPRLALILGSM ++PQSLMLITVMKNIQKLGYV E            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGN+HS+W+QIGGQPS+LSP HYG VDWSIYDGIIADSLEAEGAIASLMQEPFCS+PLIWI+REDTLANRLPMYEQRGWKHLISHWK SFRRAN+VV
Subjt:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDF+LPMLYS LDNGNF+VIPGSPADVYAAE+YKNVHSKSQLREKNGF+E+DILV+VVGSLFFPNELSWDYAVAMHSIGPLL+ YA R+EV GSFKFVF
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSH AL+EI SRLGLPD SITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI
        L SFS MISDGKLSRF+QAIASSG+LLAKNILASECV  YA+LLENVLNFPSDVKLPGSVSQLQLG WEWNLFR+E V+TI +  D EERIAA SKSSVI
Subjt:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY
        FALE Q+TN VNLTN SETENG LEQDIPT  DWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ VS+Y
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIY+G+GAWPFMHHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYY DTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL  KAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
        IRDN KGDVIYFWAHLQVNR    GS   TFWSVCDILNGGLCRT FE+TFREMFGLSSNM ALPPMP+DGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
Subjt:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY

Query:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG
        LDA N+NQSQP GCL+ASSELEKKHCYCRILE+LVNVWAYHSGRRMVYI+P SGFLEEQH VEQR+EFMWAKYFN TLLKSMDEDLAEAADDEGGS +MG
Subjt:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG

Query:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSL G
Subjt:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

A0A6J1JPJ0 uncharacterized protein LOC111487177 isoform X10.0e+0088.3Show/hide
Query:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRSSS+EIDDNGSGNAVP  HS RDRFPFKRNSSHFRLRAKDSLDHA  RSRSHQSRINRKGLLWW+PARGQT FYFVVVFAVF FV+GSMLLQSSISL
Subjt:  MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF
        MSS GSE+ RWLMERIKFGSSLKF PGRISRRLVEG GLDE+RKKDRVGVRAPRLALILGSM ++PQSLMLITVMKNIQKLGYV E            IF
Subjt:  MSS-GSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIF

Query:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV
        AVESGN+HS+W+QIGGQPS+LSP HYG VDWSIYDGIIADSLEAEG IASLMQEPFCS+PLIWI+REDTLANRLPMYEQRGWKHLISHWK SFRRAN+VV
Subjt:  AVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVV

Query:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF
        FPDF+LPMLYS LDNGNF+VIPGSPADVYAAE+YKNVHSKSQLREKNGF+E+DILV+VVGSLFFPNELSWDYAVAMHSIGPLL+ YA R+EV GSFKF+F
Subjt:  FPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVF

Query:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL
        LCCNSTDGSH AL+EI SRLGLPD SITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHN DAL
Subjt:  LCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDAL

Query:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI
        L SFS MISDGKLSRF+QAIASSG+LLAKNILASECV  YA+LLENVLNFPSDVKLPGSVSQLQL  WEWNLFR+E+V+TI +  D EERIAA SKSSVI
Subjt:  LSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVI

Query:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY
        FALE Q+TN VNLTN SET NG LEQDIPT  DWDILEEIEN EEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ VS+Y
Subjt:  FALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVY

Query:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT
        EIYSG+GAWPF+HHGSLYRGLSLSTRALRL+SDDVNAVGRLPLLNDSYY DTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSL  KAENVLEDT
Subjt:  EIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDT

Query:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY
        IRDN KGDVIYFW HLQVNR    GS   TFWSVCDILNGGLCRT FE+TFREMFGLSSNM ALPPMP++GGRWSALHSWVMPTPSFLEFIMFSRMFTHY
Subjt:  IRDNPKGDVIYFWAHLQVNR----GSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHY

Query:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG
        LDA N+NQSQP GCLLASSELEKKHCYCRILE+LVNVWAYHSGRRMVYI+P SGFLEEQH VEQR+EFMWAKYFN TLLKSMDEDLAEAADDEGGS +MG
Subjt:  LDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMG

Query:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSL G
Subjt:  LWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G01210.1 glycosyl transferase family 1 protein8.5e-19140.09Show/hide
Query:  RVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIFAVESGNKHSIWEQIGGQPSVLSPGHYGD--VDWSIYDGIIADSLEA
        R G R P+LAL+ G +  DP+ ++++++ K +Q++GY  E            ++++E G  +SIW+++G   ++L P       +DW  YDGII +SL A
Subjt:  RVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIFAVESGNKHSIWEQIGGQPSVLSPGHYGD--VDWSIYDGIIADSLEA

Query:  EGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLR
               MQEPF SLPLIW+I E+TLA R   Y   G   L++ WK+ F RA+VVVF ++ LP+LY+  D GNF+VIPGSP +V  A   KN+    Q  
Subjt:  EGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLR

Query:  EKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGS-FKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMA
              ++D+++ +VGS F       ++A+ + ++ PL S      E + S  K + L   +      A++ I   L  P  ++ H  + G+V+ +L  +
Subjt:  EKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGS-FKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMA

Query:  DIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDALLSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQL
        D+V+YGS  E QSFP +L++AMS G PI+ PDL  +R Y+ D V G +FPK N   L      +I++GK+S  AQ IA  G+   KN++A E + GYA L
Subjt:  DIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDALLSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQL

Query:  LENVLNFPSDVKLPGSVSQLQ---LGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEI
        LEN+L F S+V  P  V ++      EW W+ F   M      D     RIA     S  F  +V+     N T     + G +  D    + W+    +
Subjt:  LENVLNFPSDVKLPGSVSQLQ---LGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEI

Query:  ENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGR
        +     +  E EE + R+ +  G W+++Y++A+++++ K + +ERDEGEL RTGQ + +YE Y G G W F+H   LYRG+ LS +  R R DDV+A  R
Subjt:  ENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGR

Query:  LPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN-RGSTPTTFWSVCDILNGGLC
        LPL N+ YY D L + G  FAI+NKID +HK  WIGFQSWRA+ RK SL K AE+ L + I+    GD +YFW  +  + R      FWS CD +N G C
Subjt:  LPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN-RGSTPTTFWSVCDILNGGLC

Query:  RTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH-NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHS
        R  +  T ++M+ +  N+ +LPPMPEDG  WS + SW +PT SFLEF+MFSRMF   LDA   +   + N C L  S  + KHCY R+LE+LVNVWAYHS
Subjt:  RTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPSFLEFIMFSRMFTHYLDAH-NKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHS

Query:  GRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKF
         RR+VYI+P++G ++EQH  + R+  MW K+F++T LK+MDEDLAE AD +   G   LWP TGE+ W+G  E+E++++   K +KK+ ++ KL      
Subjt:  GRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKF

Query:  GYKQKSLG
          +QK +G
Subjt:  GYKQKSLG

AT5G04480.1 UDP-Glycosyltransferase superfamily protein0.0e+0059.91Show/hide
Query:  MRRSSSSEIDDNG-------------SGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQ--SRINRKGLLWWIPARGQTLFYFVVVFAVF
        +R S S EIDDNG             +GN     HSIRDR   KRNSS  R R+   LD  + R+R H     +NRKGLL  +  RG  L YF+V F V 
Subjt:  MRRSSSSEIDDNG-------------SGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQ--SRINRKGLLWWIPARGQTLFYFVVVFAVF

Query:  GFVTGSMLLQSSISLMSSGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFE
         FV  S+LLQ+SI+    G+ K   +  +I  GS+LK+VPG I+R L+EG GLD LR   R+GVR PRLAL+LG+M  DP++LML+TVMKN+QKLGYVF+
Subjt:  GFVTGSMLLQSSISLMSSGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFE

Query:  CSTMFPFSNVHMIFAVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLI
                    +FAVE+G   S+WEQ+ G   VL     G  DW+I++G+IADSLEA+ AI+SLMQEPF S+PLIWI+ ED LANRLP+Y++ G   LI
Subjt:  CSTMFPFSNVHMIFAVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLI

Query:  SHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIY
        SHW+ +F RA+VVVFP F LPML+S LD+GNF VIP S  DV+AAESY   H+K  LRE N F E+D+++LV+GS FF +E SWD AVAMH +GPLL+ Y
Subjt:  SHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIY

Query:  ARRKEVEGSFKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDG
         RRK+  GSFKFVFL  NST G  DA++E+ SRLGL +G++ H+GLN DVN VL MADI++Y SSQE Q+FPPL++RAMSFGIPI+ PD P ++ Y+ D 
Subjt:  ARRKEVEGSFKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDG

Query:  VHGVIFPKHNSDALLSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDAD
        VHG+ F +++ DALL +FS +ISDG+LS+FAQ IASSGRLL KN++A+EC+ GYA+LLEN+L+FPSD  LPGS+SQLQ+  WEWN FR E+ +      D
Subjt:  VHGVIFPKHNSDALLSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDAD

Query:  DEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERD
             A I KS ++F +E +    +  TN  +     +  ++P+  DWD+LEEIE AEEYE VE EE ++RMERD+  W+EIYRNARKSEKLKFE NERD
Subjt:  DEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERD

Query:  EGELERTGQTVSVYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK
        EGELERTG+ + +YEIY+G+GAWPF+HHGSLYRGLSLS++  RL SDDV+A  RLPLLND+YY D LCEIGGMF++ANK+D+IH RPWIGFQSWRA+GRK
Subjt:  EGELERTGQTVSVYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN----RGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPS
        VSL  KAE  LE+ I+   KG++IYFW  L ++          TFWS+CDILN G CRTTFE  FR M+GL  ++ ALPPMPEDG  WS+LH+WVMPTPS
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN----RGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPS

Query:  FLEFIMFSRMFTHYLDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDL
        FLEF+MFSRMF+  LDA + N +    C LASS LE+KHCYCR+LE+LVNVWAYHSGR+MVYINP+ G LEEQH ++QRK  MWAKYFNFTLLKSMDEDL
Subjt:  FLEFIMFSRMFTHYLDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDL

Query:  AEAADDEGGSGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        AEAADD+    +  LWPLTGEVHW+G+YEREREERYR+KMDKKR TK KL +R+K GYKQKSLGG
Subjt:  AEAADDEGGSGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

AT5G04480.2 UDP-Glycosyltransferase superfamily protein0.0e+0058.78Show/hide
Query:  MRRSSSSEIDDNG-------------SGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQ--SRINRKGLLWWIPARGQTLFYFVVVFAVF
        +R S S EIDDNG             +GN     HSIRDR   KRNSS  R R+   LD  + R+R H     +NRKGLL  +  RG  L YF+V F V 
Subjt:  MRRSSSSEIDDNG-------------SGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQ--SRINRKGLLWWIPARGQTLFYFVVVFAVF

Query:  GFVTGSMLLQSSISLMSSGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFE
         FV  S+LLQ+SI+    G+ K   +  +I  GS+LK+VPG I+R L+EG GLD LR   R+GVR PRLAL+LG+M  DP++LML               
Subjt:  GFVTGSMLLQSSISLMSSGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFE

Query:  CSTMFPFSNVHMIFAVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLI
                    +FAVE+G   S+WEQ+ G   VL     G  DW+I++G+IADSLEA+ AI+SLMQEPF S+PLIWI+ ED LANRLP+Y++ G   LI
Subjt:  CSTMFPFSNVHMIFAVESGNKHSIWEQIGGQPSVLSPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLI

Query:  SHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIY
        SHW+ +F RA+VVVFP F LPML+S LD+GNF VIP S  DV+AAESY   H+K  LRE N F E+D+++LV+GS FF +E SWD AVAMH +GPLL+ Y
Subjt:  SHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIY

Query:  ARRKEVEGSFKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDG
         RRK+  GSFKFVFL  NST G  DA++E+ SRLGL +G++ H+GLN DVN VL MADI++Y SSQE Q+FPPL++RAMSFGIPI+ PD P ++ Y+ D 
Subjt:  ARRKEVEGSFKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDG

Query:  VHGVIFPKHNSDALLSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDAD
        VHG+ F +++ DALL +FS +ISDG+LS+FAQ IASSGRLL KN++A+EC+ GYA+LLEN+L+FPSD  LPGS+SQLQ+  WEWN FR E+ +      D
Subjt:  VHGVIFPKHNSDALLSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDAD

Query:  DEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERD
             A I KS ++F +E +    +  TN  +     +  ++P+  DWD+LEEIE AEEYE VE EE ++RMERD+  W+EIYRNARKSEKLKFE NERD
Subjt:  DEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERD

Query:  EGELERTGQTVSVYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK
        EGELERTG+ + +YEIY+G+GAWPF+HHGSLYRGLSLS++  RL SDDV+A  RLPLLND+YY D LCEIGGMF++ANK+D+IH RPWIGFQSWRA+GRK
Subjt:  EGELERTGQTVSVYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN----RGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPS
        VSL  KAE  LE+ I+   KG++IYFW  L ++          TFWS+CDILN G CRTTFE  FR M+GL  ++ ALPPMPEDG  WS+LH+WVMPTPS
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN----RGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTPS

Query:  FLEFIMFSRMFTHYLDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDL
        FLEF+MFSRMF+  LDA + N +    C LASS LE+KHCYCR+LE+LVNVWAYHSGR+MVYINP+ G LEEQH ++QRK  MWAKYFNFTLLKSMDEDL
Subjt:  FLEFIMFSRMFTHYLDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDL

Query:  AEAADDEGGSGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        AEAADD+    +  LWPLTGEVHW+G+YEREREERYR+KMDKKR TK KL +R+K GYKQKSLGG
Subjt:  AEAADDEGGSGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACGAAGTTCATCTTCAGAGATCGACGACAATGGGAGTGGAAATGCCGTTCCCGGCACTCACTCGATTCGTGATCGTTTTCCTTTCAAGCGGAATTCCAGTCACTT
CCGTTTGCGAGCCAAGGACTCACTGGATCATGCAGCCTCTCGCTCCCGATCTCACCAGAGCCGGATCAATCGCAAGGGCTTGCTCTGGTGGATTCCAGCTAGAGGGCAAA
CGCTTTTCTACTTTGTTGTCGTTTTTGCGGTTTTCGGGTTTGTTACCGGGTCTATGCTGTTGCAGAGCTCGATTAGCTTGATGAGCTCTGGAAGTGAAAAGGAGCGGTGG
CTTATGGAGCGTATTAAGTTTGGGAGCTCGCTGAAGTTTGTGCCAGGGAGGATTTCCAGGAGGCTGGTGGAAGGTGATGGGCTTGATGAGTTGAGAAAGAAGGATCGAGT
TGGCGTTCGTGCACCGAGGCTTGCCCTAATCTTGGGAAGCATGGGGAATGATCCACAATCATTAATGTTGATTACTGTGATGAAGAACATACAGAAACTTGGATATGTAT
TTGAGTGTAGTACCATGTTTCCATTTTCCAATGTTCATATGATTTTTGCAGTAGAGAGTGGAAATAAACATTCAATCTGGGAACAGATAGGTGGTCAGCCTTCAGTATTA
AGTCCAGGACATTATGGTGATGTTGATTGGTCAATATATGATGGTATTATTGCTGACTCCTTGGAAGCAGAGGGGGCAATTGCAAGCCTTATGCAGGAACCTTTTTGTTC
TCTACCACTCATATGGATAATTCGAGAAGATACACTAGCCAACCGCTTGCCTATGTATGAACAAAGGGGCTGGAAGCATCTCATTTCACATTGGAAGAGGTCTTTTAGAA
GGGCTAATGTTGTCGTGTTCCCTGATTTTGCCCTCCCAATGTTGTATAGCACTTTGGACAATGGAAACTTCCACGTGATTCCTGGATCCCCAGCAGATGTTTATGCTGCA
GAAAGCTACAAGAATGTTCACTCCAAAAGTCAATTGAGAGAGAAAAATGGATTTGATGAAAATGATATACTGGTTCTTGTTGTTGGAAGTTTGTTCTTCCCAAATGAGTT
GTCGTGGGACTATGCTGTGGCTATGCACAGCATTGGACCTCTACTCTCAATATATGCAAGGAGGAAAGAAGTAGAAGGATCATTTAAATTTGTTTTCTTATGTTGTAATT
CAACTGATGGGTCCCATGATGCTTTAAAGGAAATTGTTTCACGTCTAGGACTTCCTGATGGATCTATAACACATTATGGCTTAAATGGAGATGTCAACAATGTACTGATG
ATGGCTGACATTGTGCTTTATGGATCTTCACAAGAAATCCAGAGTTTTCCTCCTCTACTTATTCGAGCCATGTCCTTTGGAATCCCAATCATGGTGCCTGATTTACCTGG
CTTGAGAAATTATATTGTTGATGGTGTCCATGGGGTTATCTTCCCAAAACATAATTCTGATGCTTTATTGAGCTCTTTCTCACATATGATATCAGATGGGAAGCTCTCGA
GATTTGCACAAGCAATAGCTTCCTCTGGAAGATTGCTTGCTAAAAATATACTTGCATCAGAATGTGTTGCCGGTTATGCACAGCTCCTGGAGAATGTTCTGAATTTCCCA
TCAGATGTTAAGCTTCCAGGTTCTGTCTCCCAGCTTCAACTAGGGGAATGGGAATGGAATTTGTTCAGGAAGGAAATGGTGAAGACAATTGACGAAGATGCAGATGATGA
AGAACGAATTGCAGCAATAAGTAAATCCAGTGTCATTTTTGCTCTTGAAGTGCAATTAACTAATTCTGTTAATTTAACAAATTTGTCTGAGACTGAAAATGGGCCTCTGG
AGCAAGATATTCCAACTTCTCAAGACTGGGATATTTTGGAGGAAATAGAAAATGCTGAAGAGTATGAAACTGTTGAAATGGAAGAGTTTCAAGAAAGAATGGAGAGAGAT
CTAGGTGCATGGGATGAAATATATCGAAATGCTCGGAAATCAGAAAAGCTCAAGTTTGAAGCAAATGAACGGGATGAGGGTGAGCTTGAAAGGACGGGACAGACTGTATC
CGTATATGAGATATACAGTGGTTCTGGAGCTTGGCCATTCATGCACCATGGTTCTTTGTACCGAGGACTAAGTCTTTCCACGAGAGCACTGAGGTTAAGATCTGATGATG
TCAATGCTGTTGGACGGCTTCCTCTACTCAATGACTCCTACTATCTGGACACTCTCTGTGAGATTGGAGGAATGTTTGCTATTGCAAATAAGATTGATAACATTCATAAG
AGACCTTGGATTGGGTTCCAATCATGGCGGGCTTCTGGAAGAAAGGTTTCCTTGTGCAAAAAAGCTGAAAATGTTTTGGAAGACACTATACGGGACAACCCTAAAGGAGA
TGTTATATACTTCTGGGCACACTTGCAAGTGAATCGTGGAAGCACTCCTACTACTTTCTGGTCTGTGTGTGATATCTTGAACGGTGGTCTCTGCAGAACCACCTTCGAGA
GCACCTTCCGCGAGATGTTTGGATTGTCATCAAATATGGGAGCTCTTCCGCCTATGCCAGAAGATGGTGGTCGCTGGTCTGCCCTCCATAGTTGGGTGATGCCAACCCCT
TCCTTCCTGGAGTTCATCATGTTTTCCAGGATGTTCACCCATTACCTTGATGCTCATAATAAAAATCAGAGTCAGCCAAATGGATGTTTGTTGGCTTCCTCAGAGCTTGA
GAAAAAACACTGTTACTGTCGGATATTGGAAATGCTGGTCAATGTCTGGGCTTACCACAGTGGCCGGAGAATGGTCTATATCAATCCTCAATCCGGTTTCCTCGAAGAGC
AGCATCTAGTTGAACAACGCAAGGAATTTATGTGGGCAAAATATTTCAACTTCACGTTGTTGAAAAGTATGGACGAAGATTTAGCAGAAGCCGCCGACGACGAAGGCGGT
TCAGGTAAAATGGGGTTATGGCCATTAACAGGGGAAGTGCATTGGCAAGGAATTTATGAAAGAGAGAGAGAAGAAAGGTATAGGGTGAAAATGGACAAGAAGAGAACTAC
AAAAGTAAAACTAATGGAGAGGATGAAATTTGGATACAAACAAAAATCACTTGGAGGATAA
mRNA sequenceShow/hide mRNA sequence
CTTGGCATATTTTCTTCGATGTGAAAAGTTATTGTCATTTCATCTTGCATTTGTCTTCTTGTAAATGGAGGCCATGGCAGTGAGAGCAATTCACATCTGAGTTTCGCTTC
ATTTTATCTCCATTATCAAATTCTTCCTCCTCAATCATTGGAAATCCATTTTCCATTAACGTTCTGTAGAAAATTCGGACCACAGAATCATCTGAATGAGACGAAGTTCA
TCTTCAGAGATCGACGACAATGGGAGTGGAAATGCCGTTCCCGGCACTCACTCGATTCGTGATCGTTTTCCTTTCAAGCGGAATTCCAGTCACTTCCGTTTGCGAGCCAA
GGACTCACTGGATCATGCAGCCTCTCGCTCCCGATCTCACCAGAGCCGGATCAATCGCAAGGGCTTGCTCTGGTGGATTCCAGCTAGAGGGCAAACGCTTTTCTACTTTG
TTGTCGTTTTTGCGGTTTTCGGGTTTGTTACCGGGTCTATGCTGTTGCAGAGCTCGATTAGCTTGATGAGCTCTGGAAGTGAAAAGGAGCGGTGGCTTATGGAGCGTATT
AAGTTTGGGAGCTCGCTGAAGTTTGTGCCAGGGAGGATTTCCAGGAGGCTGGTGGAAGGTGATGGGCTTGATGAGTTGAGAAAGAAGGATCGAGTTGGCGTTCGTGCACC
GAGGCTTGCCCTAATCTTGGGAAGCATGGGGAATGATCCACAATCATTAATGTTGATTACTGTGATGAAGAACATACAGAAACTTGGATATGTATTTGAGTGTAGTACCA
TGTTTCCATTTTCCAATGTTCATATGATTTTTGCAGTAGAGAGTGGAAATAAACATTCAATCTGGGAACAGATAGGTGGTCAGCCTTCAGTATTAAGTCCAGGACATTAT
GGTGATGTTGATTGGTCAATATATGATGGTATTATTGCTGACTCCTTGGAAGCAGAGGGGGCAATTGCAAGCCTTATGCAGGAACCTTTTTGTTCTCTACCACTCATATG
GATAATTCGAGAAGATACACTAGCCAACCGCTTGCCTATGTATGAACAAAGGGGCTGGAAGCATCTCATTTCACATTGGAAGAGGTCTTTTAGAAGGGCTAATGTTGTCG
TGTTCCCTGATTTTGCCCTCCCAATGTTGTATAGCACTTTGGACAATGGAAACTTCCACGTGATTCCTGGATCCCCAGCAGATGTTTATGCTGCAGAAAGCTACAAGAAT
GTTCACTCCAAAAGTCAATTGAGAGAGAAAAATGGATTTGATGAAAATGATATACTGGTTCTTGTTGTTGGAAGTTTGTTCTTCCCAAATGAGTTGTCGTGGGACTATGC
TGTGGCTATGCACAGCATTGGACCTCTACTCTCAATATATGCAAGGAGGAAAGAAGTAGAAGGATCATTTAAATTTGTTTTCTTATGTTGTAATTCAACTGATGGGTCCC
ATGATGCTTTAAAGGAAATTGTTTCACGTCTAGGACTTCCTGATGGATCTATAACACATTATGGCTTAAATGGAGATGTCAACAATGTACTGATGATGGCTGACATTGTG
CTTTATGGATCTTCACAAGAAATCCAGAGTTTTCCTCCTCTACTTATTCGAGCCATGTCCTTTGGAATCCCAATCATGGTGCCTGATTTACCTGGCTTGAGAAATTATAT
TGTTGATGGTGTCCATGGGGTTATCTTCCCAAAACATAATTCTGATGCTTTATTGAGCTCTTTCTCACATATGATATCAGATGGGAAGCTCTCGAGATTTGCACAAGCAA
TAGCTTCCTCTGGAAGATTGCTTGCTAAAAATATACTTGCATCAGAATGTGTTGCCGGTTATGCACAGCTCCTGGAGAATGTTCTGAATTTCCCATCAGATGTTAAGCTT
CCAGGTTCTGTCTCCCAGCTTCAACTAGGGGAATGGGAATGGAATTTGTTCAGGAAGGAAATGGTGAAGACAATTGACGAAGATGCAGATGATGAAGAACGAATTGCAGC
AATAAGTAAATCCAGTGTCATTTTTGCTCTTGAAGTGCAATTAACTAATTCTGTTAATTTAACAAATTTGTCTGAGACTGAAAATGGGCCTCTGGAGCAAGATATTCCAA
CTTCTCAAGACTGGGATATTTTGGAGGAAATAGAAAATGCTGAAGAGTATGAAACTGTTGAAATGGAAGAGTTTCAAGAAAGAATGGAGAGAGATCTAGGTGCATGGGAT
GAAATATATCGAAATGCTCGGAAATCAGAAAAGCTCAAGTTTGAAGCAAATGAACGGGATGAGGGTGAGCTTGAAAGGACGGGACAGACTGTATCCGTATATGAGATATA
CAGTGGTTCTGGAGCTTGGCCATTCATGCACCATGGTTCTTTGTACCGAGGACTAAGTCTTTCCACGAGAGCACTGAGGTTAAGATCTGATGATGTCAATGCTGTTGGAC
GGCTTCCTCTACTCAATGACTCCTACTATCTGGACACTCTCTGTGAGATTGGAGGAATGTTTGCTATTGCAAATAAGATTGATAACATTCATAAGAGACCTTGGATTGGG
TTCCAATCATGGCGGGCTTCTGGAAGAAAGGTTTCCTTGTGCAAAAAAGCTGAAAATGTTTTGGAAGACACTATACGGGACAACCCTAAAGGAGATGTTATATACTTCTG
GGCACACTTGCAAGTGAATCGTGGAAGCACTCCTACTACTTTCTGGTCTGTGTGTGATATCTTGAACGGTGGTCTCTGCAGAACCACCTTCGAGAGCACCTTCCGCGAGA
TGTTTGGATTGTCATCAAATATGGGAGCTCTTCCGCCTATGCCAGAAGATGGTGGTCGCTGGTCTGCCCTCCATAGTTGGGTGATGCCAACCCCTTCCTTCCTGGAGTTC
ATCATGTTTTCCAGGATGTTCACCCATTACCTTGATGCTCATAATAAAAATCAGAGTCAGCCAAATGGATGTTTGTTGGCTTCCTCAGAGCTTGAGAAAAAACACTGTTA
CTGTCGGATATTGGAAATGCTGGTCAATGTCTGGGCTTACCACAGTGGCCGGAGAATGGTCTATATCAATCCTCAATCCGGTTTCCTCGAAGAGCAGCATCTAGTTGAAC
AACGCAAGGAATTTATGTGGGCAAAATATTTCAACTTCACGTTGTTGAAAAGTATGGACGAAGATTTAGCAGAAGCCGCCGACGACGAAGGCGGTTCAGGTAAAATGGGG
TTATGGCCATTAACAGGGGAAGTGCATTGGCAAGGAATTTATGAAAGAGAGAGAGAAGAAAGGTATAGGGTGAAAATGGACAAGAAGAGAACTACAAAAGTAAAACTAAT
GGAGAGGATGAAATTTGGATACAAACAAAAATCACTTGGAGGATAAGAAGGGCTGGGATTCTGAAAGCACTTCTAACTGGCTGACTTTGGGTAGAAGAAGAAGAGATTGA
AAGAAAGAAAGAAAGAAAGAAAGACTCCTCCTATTATATTTGGTAAGTAAAAGAAATTGAGTTCTTTCTGAATTATAATACTCTCATCACGCTACAGAGCAATATTTCTA
GGCAACAGAAACTGGGGGAAAGGGAAAGCTGAAAAAATTGCAGCTCGGTTTATTAGAAGCAATATTACCAAAGAGAGGTACTAAATATTCTTTTCAAGATTCTTTCTTTT
ATATTCAACATACCTCATATAATAGCACAACAAGAATATAGGGAATTTCATTTTTGCTGTTCTGTTGTACAACCAAAATAGCAATTATTAAGGCAATTTTGGTCACACAT
AGTGATAATAATAATAGATTGGTTTCCACTTTCAGATATGTTTGAGATATGTCCTGATTTTGCTTTTAGTTGTGTTTTCTCCTCTGCCAATAACTATGGTGGAGAAATCT
GCAGCAAACTTTGTCCATTTTATGACAAAATTTCCCCATATCTCCAATTGTACCTGAGGAGGAGTTGTCTTGTAACTTTGTCCTTGTCAATAAAACACTGACAAGACTTT
TTG
Protein sequenceShow/hide protein sequence
MRRSSSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISLMSSGSEKERW
LMERIKFGSSLKFVPGRISRRLVEGDGLDELRKKDRVGVRAPRLALILGSMGNDPQSLMLITVMKNIQKLGYVFECSTMFPFSNVHMIFAVESGNKHSIWEQIGGQPSVL
SPGHYGDVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIIREDTLANRLPMYEQRGWKHLISHWKRSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAA
ESYKNVHSKSQLREKNGFDENDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRKEVEGSFKFVFLCCNSTDGSHDALKEIVSRLGLPDGSITHYGLNGDVNNVLM
MADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNSDALLSSFSHMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFP
SDVKLPGSVSQLQLGEWEWNLFRKEMVKTIDEDADDEERIAAISKSSVIFALEVQLTNSVNLTNLSETENGPLEQDIPTSQDWDILEEIENAEEYETVEMEEFQERMERD
LGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSVYEIYSGSGAWPFMHHGSLYRGLSLSTRALRLRSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHK
RPWIGFQSWRASGRKVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGSTPTTFWSVCDILNGGLCRTTFESTFREMFGLSSNMGALPPMPEDGGRWSALHSWVMPTP
SFLEFIMFSRMFTHYLDAHNKNQSQPNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFLEEQHLVEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGG
SGKMGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG