; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004156 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004156
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionGlycos_transf_1 domain-containing protein
Genome locationChr08:14267285..14276733
RNA-Seq ExpressionHG10004156
SyntenyHG10004156
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR001296 - Glycosyl transferase, family 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7032959.1 hypothetical protein SDJN02_07010 [Cucurbita argyrosperma subsp. argyrosperma]0.0e+0087.79Show/hide
Query:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRS S+EIDDNGSGNAVP  HSIRDRFPFKRNSSHFRLRAKDSLDHA  RSRSHQSRINRKGLLWW+PARGQT FYFVVVFAVF FV+GSMLLQSSISL
Subjt:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE
        MSSPGSE+ RWLMERIKFGSSLKF PGRISRRLVEG GL+EVRKKDRVGVRAPRLALILGSME++PQSLMLITVMKNIQKLGY+LEIFAVESGN+ SMW+
Subjt:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE

Query:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST
        QIGGQPSILSP HYG VDWSIYDGIIADSLEAEGAIASLMQEPFCS+PLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRAN+VVFPDF+LPMLYS 
Subjt:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST

Query:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA
        LDNGNF+VIPGSPADVYAAENYKNVHSKSQLREKNGF+EDDILV+VVGSLFFPNELSWDYAVAMHSIGPLL+ YAR+ EV GSFKFVFLCCNS+DGSH A
Subjt:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA

Query:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
        L+E+ASRLGLPD SITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHNPDALL SFS+MISDGK
Subjt:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK

Query:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN
        LSRF+QAIASSG+LLAKNILASECV  YA+LLENVLNFPSDVKLP SVSQLQLGAWEWNLFR+E V+TI +  D EERIAA SKSSVIFALE Q+TN VN
Subjt:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN

Query:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
        LT  SETENGTLEQDIPT  DWDILEEIE AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ VSIYEIY+GAGAWPFM
Subjt:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM

Query:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ
        HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYY DTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK                          
Subjt:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIA----PPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPS
        VSL  KAENVLEDTIRDN KGDVIYFWAHLQVNRGI      PTFWS+CDILN GLCRT F++TFREMFGLSSNM ALPPMP+DGG WSALHSWVMPTPS
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIA----PPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPS

Query:  FLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDL
        FLEFIMFSRMFTHYLDA N+N S+  GCL+ASSELEKKHCYCRILE+LVNVWAYHSGRRMVYI+P SGF+EEQHP+EQR+EFMWAKYFN TLLKSMDEDL
Subjt:  FLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDL

Query:  AEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        AEAADDEGGS ++GLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSL G
Subjt:  AEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

XP_004138684.1 uncharacterized protein LOC101206364 isoform X1 [Cucumis sativus]0.0e+0091.52Show/hide
Query:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRS SSEIDDN S NAV GTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE
        +SS GS++ERWLMERIKFGSSLKFVPGRIS+RLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGY+ EIFAVE GNK+SMWE
Subjt:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE

Query:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST
        QI GQPSILSPGHYGRVDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWIVREDTLA+RLPMYEQRGWKHLISHWK SFRRANVVVFPDFALPMLYS 
Subjt:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST

Query:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA
        LDNGNFHVIPGSPADVYAAE+Y NVHSKSQLREKNGF+EDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR EVEGSFKFVFLCCNS+DGSHDA
Subjt:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA

Query:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
        LKE+ASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP L+NYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
Subjt:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK

Query:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN
        LSRFAQ+IASSGRLLAKNILASECV GYAQLLENVLNFPSDVKLP  VSQLQLGAWEWNLFRKEMVKTIDE ADNEERIA ISK+SVIFALE QLTNSVN
Subjt:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN

Query:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
        LTILSE ENGTLEQDIPT QDWDILE+IE AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFE+NERDEGELERTGQTVSIYEIYSGAGAWPFM
Subjt:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM

Query:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ
        HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLL+DSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSW+ASGRKV  G                      
Subjt:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEF
            KKAENVLEDTI+DNPKGDVIYFWAHLQVNRG  PPTFWS+CDILN GLCRTTF+STFREMFGLSSNMGALPPMPEDGG WSALHSWVMPTPSFLEF
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEF

Query:  IMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAA
        IMFSRMFTHYLDA N+NQS+ NGCLLASSE+EKKHCYCRILEMLVNVWAYHSGRRMVYINP SGF+EEQHP+EQRKEFMWAKYFNFTLLKSMDEDLAEAA
Subjt:  IMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAA

Query:  DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

XP_008456559.1 PREDICTED: uncharacterized protein LOC103496475 isoform X1 [Cucumis melo]0.0e+0091.99Show/hide
Query:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRS SSEIDDN S NAVPGTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE
        +SS GS++ERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKN+QKLGY+ EIFAVESGNK+SMWE
Subjt:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE

Query:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST
        QI GQPSILSPGHYGRVDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWIVREDTLA+RLPMYEQRGWKHLISHWK SFRRANVVVFPDFALPMLYS 
Subjt:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST

Query:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA
        LDNGNFHVIPGSPADVYAAENY NVHSKSQLREKNGF+ DDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR EVEGSFKFVFLCCNS+DGSHDA
Subjt:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA

Query:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
        LKE+ASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
Subjt:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK

Query:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN
        LSRFAQAIASSGRLLAKNILASECV GY QLLENVLNFPSDVKLP   SQLQLGAWEWNLFRKEMVKTIDE AD+EERIAAISK+SVIFALE QLTNSVN
Subjt:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN

Query:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
        LTILSE ENGTLEQDIPT QDWDILEEIE AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
Subjt:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM

Query:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ
        HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKV  G                      
Subjt:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEF
            KKAENVLEDTIRDNP+GDVIYFWAHLQVNRG  PPTFWS+CDILN GLCRTTF STFREMFGLSSNMGALPPMPEDGG WSALHSWVMPTPSFLEF
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEF

Query:  IMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAA
        IMFSRMFTHYLDA N+NQS+ NGCL A SE+EKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGF+EEQHP+EQRKEFMWAKYFNFTLLKSMDEDLAEAA
Subjt:  IMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAA

Query:  DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

XP_022958089.1 uncharacterized protein LOC111459418 isoform X1 [Cucurbita moschata]0.0e+0087.89Show/hide
Query:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRS S+EIDDNGSGNAVP  HSIRDRFPFKRNSSHFRLRAKDSLDHA  RSRSHQSRINRKGLLWW+PARGQT FYFVVVFAVF FV+GSMLLQSSISL
Subjt:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE
        MSSPGSE+ RWLMERIKFGSSLKF PGRISRRLVEG GL+EVRKKDRVGVRAPRLALILGSME++PQSLMLITVMKNIQKLGY+LEIFAVESGN+ SMW+
Subjt:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE

Query:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST
        QIGGQPSILSP HYG VDWSIYDGIIADSLEAEGAIASLMQEPFCS+PLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRAN+VVFPDF+LPMLYS 
Subjt:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST

Query:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA
        LDNGNF+VIPGSPADVYAAENYKNVHSKSQLREKNGF+EDDILV+VVGSLFFPNELSWDYAVAMHSIGPLL+ YAR+ EV GSFKFVFLCCNS+DGSH A
Subjt:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA

Query:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
        L+E+ASRLGLPD SITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHNPDALL SFS+MISDGK
Subjt:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK

Query:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN
        LSRF+QAIASSG+LLAKNILASECV  YA+LLENVLNFPSDVKLP SVSQLQLGAWEWNLFR+E V+TI +  D EERIAA SKSSVIFALE Q+TN VN
Subjt:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN

Query:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
        LT  SETENGTLEQDIPT  DWDILEEIE AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ VSIYEIY+GAGAWPFM
Subjt:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM

Query:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ
        HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYY DTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK                          
Subjt:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIA----PPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPS
        VSL  KAENVLEDTIRDN KGDVIYFWAHLQVNRGI      PTFWS+CDILN GLCRT F++TFREMFGLSSNM ALPPMP+DGG WSALHSWVMPTPS
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIA----PPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPS

Query:  FLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDL
        FLEFIMFSRMFTHYLDA N+NQS+  GCL+ASSELEKKHCYCRILE+LVNVWAYHSGRRMVYI+P SGF+EEQHP+EQR+EFMWAKYFN TLLKSMDEDL
Subjt:  FLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDL

Query:  AEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        AEAADDEGGS ++GLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSL G
Subjt:  AEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

XP_038884759.1 uncharacterized protein LOC120075439 isoform X1 [Benincasa hispida]0.0e+0092.18Show/hide
Query:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRS SSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYF+VVFAVFGFVTGSMLLQSSISL
Subjt:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE
        MSSPGSE+ERWLMERIKFGSSLKFVPG ISR+LVEGDGL+E+RKKDRVGVR+PRLALILGSMENDPQSLMLITVMKNIQKLGY+LEIFAVESGNK S+WE
Subjt:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE

Query:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST
        QIGGQPSILSP HYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLP+YEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST
Subjt:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST

Query:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA
        LD+GNFHVIPGSPADVYAAENYKN HSKSQLREKNGF EDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR EV GSFKFVFLCCNS+DGSHDA
Subjt:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA

Query:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
        LKE+ASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
Subjt:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK

Query:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN
        LSRFAQAIASSGRLLAKNILASECV GYAQLLENVLNFP DVKLPSS SQLQLGAWEWNLFRKEMVK IDE AD+EERIAA +K+SVIFALE QLTNSVN
Subjt:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN

Query:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
        LTILSE ENGTLE DIPTSQDWD+LEEIE AEEYETVEMEEFQERMERDLGAWD+IYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
Subjt:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM

Query:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ
        HHGSLYRGLSLST+ALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK                          
Subjt:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEF
        VSLCKKAEN LED IRDNPKGDVIYFWAHLQVNRGI P TFWS+CDILN GLCRTTFKSTFR+M+GLSSNMGALPPMPEDGG WSALHSWVMPTPSFLEF
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEF

Query:  IMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAA
        IMFSRMFTHYLDA N+NQS  NGCLLASSELEKKHCYCRILEMLVNVWAYHSGRR+VYINPQSGF+EEQHP+EQRKEFMWAKYFNFTLLKSMDEDLAEA 
Subjt:  IMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAA

Query:  DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        DDEG SGK GLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKL ERMKFGYKQKSLGG
Subjt:  DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

TrEMBL top hitse value%identityAlignment
A0A0A0LMB5 Glycos_transf_1 domain-containing protein0.0e+0091.52Show/hide
Query:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRS SSEIDDN S NAV GTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE
        +SS GS++ERWLMERIKFGSSLKFVPGRIS+RLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGY+ EIFAVE GNK+SMWE
Subjt:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE

Query:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST
        QI GQPSILSPGHYGRVDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWIVREDTLA+RLPMYEQRGWKHLISHWK SFRRANVVVFPDFALPMLYS 
Subjt:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST

Query:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA
        LDNGNFHVIPGSPADVYAAE+Y NVHSKSQLREKNGF+EDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR EVEGSFKFVFLCCNS+DGSHDA
Subjt:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA

Query:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
        LKE+ASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP L+NYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
Subjt:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK

Query:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN
        LSRFAQ+IASSGRLLAKNILASECV GYAQLLENVLNFPSDVKLP  VSQLQLGAWEWNLFRKEMVKTIDE ADNEERIA ISK+SVIFALE QLTNSVN
Subjt:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN

Query:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
        LTILSE ENGTLEQDIPT QDWDILE+IE AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFE+NERDEGELERTGQTVSIYEIYSGAGAWPFM
Subjt:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM

Query:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ
        HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLL+DSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSW+ASGRKV  G                      
Subjt:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEF
            KKAENVLEDTI+DNPKGDVIYFWAHLQVNRG  PPTFWS+CDILN GLCRTTF+STFREMFGLSSNMGALPPMPEDGG WSALHSWVMPTPSFLEF
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEF

Query:  IMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAA
        IMFSRMFTHYLDA N+NQS+ NGCLLASSE+EKKHCYCRILEMLVNVWAYHSGRRMVYINP SGF+EEQHP+EQRKEFMWAKYFNFTLLKSMDEDLAEAA
Subjt:  IMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAA

Query:  DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

A0A1S3C3I4 uncharacterized protein LOC103496475 isoform X10.0e+0091.99Show/hide
Query:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRS SSEIDDN S NAVPGTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE
        +SS GS++ERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKN+QKLGY+ EIFAVESGNK+SMWE
Subjt:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE

Query:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST
        QI GQPSILSPGHYGRVDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWIVREDTLA+RLPMYEQRGWKHLISHWK SFRRANVVVFPDFALPMLYS 
Subjt:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST

Query:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA
        LDNGNFHVIPGSPADVYAAENY NVHSKSQLREKNGF+ DDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR EVEGSFKFVFLCCNS+DGSHDA
Subjt:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA

Query:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
        LKE+ASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
Subjt:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK

Query:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN
        LSRFAQAIASSGRLLAKNILASECV GY QLLENVLNFPSDVKLP   SQLQLGAWEWNLFRKEMVKTIDE AD+EERIAAISK+SVIFALE QLTNSVN
Subjt:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN

Query:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
        LTILSE ENGTLEQDIPT QDWDILEEIE AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
Subjt:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM

Query:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ
        HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKV  G                      
Subjt:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEF
            KKAENVLEDTIRDNP+GDVIYFWAHLQVNRG  PPTFWS+CDILN GLCRTTF STFREMFGLSSNMGALPPMPEDGG WSALHSWVMPTPSFLEF
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEF

Query:  IMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAA
        IMFSRMFTHYLDA N+NQS+ NGCL A SE+EKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGF+EEQHP+EQRKEFMWAKYFNFTLLKSMDEDLAEAA
Subjt:  IMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAA

Query:  DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

A0A5A7UUA8 UDP-Glycosyltransferase superfamily protein isoform 30.0e+0091.99Show/hide
Query:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRS SSEIDDN S NAVPGTHSIRDRFPFKRNSSHFRLR KDSLDHAASRSRSHQ+RINRKGLL WIPARGQTLFYF+VVFAVFGF TGSMLLQSSISL
Subjt:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE
        +SS GS++ERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKN+QKLGY+ EIFAVESGNK+SMWE
Subjt:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE

Query:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST
        QI GQPSILSPGHYGRVDWSIYDGIIADSLE EGAIASLMQEPFCSLPLIWIVREDTLA+RLPMYEQRGWKHLISHWK SFRRANVVVFPDFALPMLYS 
Subjt:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST

Query:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA
        LDNGNFHVIPGSPADVYAAENY NVHSKSQLREKNGF+ DDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARR EVEGSFKFVFLCCNS+DGSHDA
Subjt:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA

Query:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
        LKE+ASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
Subjt:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK

Query:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN
        LSRFAQAIASSGRLLAKNILASECV GY QLLENVLNFPSDVKLP   SQLQLGAWEWNLFRKEMVKTIDE AD+EERIAAISK+SVIFALE QLTNSVN
Subjt:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN

Query:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
        LTILSE ENGTLEQDIPT QDWDILEEIE AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
Subjt:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM

Query:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ
        HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLD LCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKV  G                      
Subjt:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEF
            KKAENVLEDTIRDNP+GDVIYFWAHLQVNRG  PPTFWS+CDILN GLCRTTF STFREMFGLSSNMGALPPMPEDGG WSALHSWVMPTPSFLEF
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEF

Query:  IMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAA
        IMFSRMFTHYLDA N+NQS+ NGCL A SE+EKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGF+EEQHP+EQRKEFMWAKYFNFTLLKSMDEDLAEAA
Subjt:  IMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAA

Query:  DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
Subjt:  DDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

A0A6J1H431 uncharacterized protein LOC111459418 isoform X10.0e+0087.89Show/hide
Query:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRS S+EIDDNGSGNAVP  HSIRDRFPFKRNSSHFRLRAKDSLDHA  RSRSHQSRINRKGLLWW+PARGQT FYFVVVFAVF FV+GSMLLQSSISL
Subjt:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE
        MSSPGSE+ RWLMERIKFGSSLKF PGRISRRLVEG GL+EVRKKDRVGVRAPRLALILGSME++PQSLMLITVMKNIQKLGY+LEIFAVESGN+ SMW+
Subjt:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE

Query:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST
        QIGGQPSILSP HYG VDWSIYDGIIADSLEAEGAIASLMQEPFCS+PLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRAN+VVFPDF+LPMLYS 
Subjt:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST

Query:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA
        LDNGNF+VIPGSPADVYAAENYKNVHSKSQLREKNGF+EDDILV+VVGSLFFPNELSWDYAVAMHSIGPLL+ YAR+ EV GSFKFVFLCCNS+DGSH A
Subjt:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA

Query:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
        L+E+ASRLGLPD SITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHNPDALL SFS+MISDGK
Subjt:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK

Query:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN
        LSRF+QAIASSG+LLAKNILASECV  YA+LLENVLNFPSDVKLP SVSQLQLGAWEWNLFR+E V+TI +  D EERIAA SKSSVIFALE Q+TN VN
Subjt:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN

Query:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
        LT  SETENGTLEQDIPT  DWDILEEIE AEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ VSIYEIY+GAGAWPFM
Subjt:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM

Query:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ
        HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYY DTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK                          
Subjt:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIA----PPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPS
        VSL  KAENVLEDTIRDN KGDVIYFWAHLQVNRGI      PTFWS+CDILN GLCRT F++TFREMFGLSSNM ALPPMP+DGG WSALHSWVMPTPS
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIA----PPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPS

Query:  FLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDL
        FLEFIMFSRMFTHYLDA N+NQS+  GCL+ASSELEKKHCYCRILE+LVNVWAYHSGRRMVYI+P SGF+EEQHP+EQR+EFMWAKYFN TLLKSMDEDL
Subjt:  FLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDL

Query:  AEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        AEAADDEGGS ++GLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSL G
Subjt:  AEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

A0A6J1JPJ0 uncharacterized protein LOC111487177 isoform X10.0e+0087.23Show/hide
Query:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL
        MRRS S+EIDDNGSGNAVP  HS RDRFPFKRNSSHFRLRAKDSLDHA  RSRSHQSRINRKGLLWW+PARGQT FYFVVVFAVF FV+GSMLLQSSISL
Subjt:  MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISL

Query:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE
        MSSPGSE+ RWLMERIKFGSSLKF PGRISRRLVEG GL+EVRKKDRVGVRAPRLALILGSME++PQSLMLITVMKNIQKLGY+LEIFAVESGN+ SMW+
Subjt:  MSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWE

Query:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST
        QIGGQPSILSP HYG VDWSIYDGIIADSLEAEG IASLMQEPFCS+PLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRAN+VVFPDF+LPMLYS 
Subjt:  QIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYST

Query:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA
        LDNGNF+VIPGSPADVYAAENYKNVHSKSQLREKNGF+EDDILV+VVGSLFFPNELSWDYAVAMHSIGPLL+ YAR+ EV GSFKF+FLCCNS+DGSH A
Subjt:  LDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDA

Query:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK
        L+E+ASRLGLPD SITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLP LRNYIVDGVHGVIFPKHNPDALL SFS+MISDGK
Subjt:  LKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGK

Query:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN
        LSRF+QAIASSG+LLAKNILASECV  YA+LLENVLNFPSDVKLP SVSQLQL AWEWNLFR+E+V+TI +  D EERIAA SKSSVIFALE Q+TN VN
Subjt:  LSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVN

Query:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM
        LT  SET NGTLEQDIPT  DWDILEEIE  EEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ VSIYEIYSGAGAWPF+
Subjt:  LTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFM

Query:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ
        HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYY DTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRK                          
Subjt:  HHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQ

Query:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIA----PPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPS
        VSL  KAENVLEDTIRDN KGDVIYFW HLQVNRGI      PTFWS+CDILN GLCRT F++TFREMFGLSSNM ALPPMP++GG WSALHSWVMPTPS
Subjt:  VSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIA----PPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPS

Query:  FLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDL
        FLEFIMFSRMFTHYLDA N+NQS+  GCLLASSELEKKHCYCRILE+LVNVWAYHSGRRMVYI+P SGF+EEQHP+EQR+EFMWAKYFN TLLKSMDEDL
Subjt:  FLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDL

Query:  AEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        AEAADDEGGS ++GLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSL G
Subjt:  AEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G01210.1 glycosyl transferase family 1 protein1.6e-18938.98Show/hide
Query:  RVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWEQIGGQPSILSPGHYGR--VDWSIYDGIIADSLEAEGAIASLMQEPF
        R G R P+LAL+ G +  DP+ ++++++ K +Q++GY +E++++E G   S+W+++G   +IL P       +DW  YDGII +SL A       MQEPF
Subjt:  RVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWEQIGGQPSILSPGHYGR--VDWSIYDGIIADSLEAEGAIASLMQEPF

Query:  CSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILV
         SLPLIW++ E+TLA R   Y   G   L++ WK  F RA+VVVF ++ LP+LY+  D GNF+VIPGSP +V  A+N +    K           DD+++
Subjt:  CSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILV

Query:  LVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDALKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQS
         +VGS F       ++A+ + ++ PL S     ++     K + L   ++     A++ ++  L  P  ++ H  + G+V+ +L  +D+V+YGS  E QS
Subjt:  LVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDALKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQS

Query:  FPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKL
        FP +L++AMS G PI+ PDL  +R Y+ D V G +FPK N   L     ++I++GK+S  AQ IA  G+   KN++A E + GYA LLEN+L F S+V  
Subjt:  FPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKL

Query:  PSSVSQLQ---LGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVNLTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEE
        P  V ++       W W+ F   M     +T+ N      I++S    A   ++    N T     + G +  D    + W+    ++     +  E EE
Subjt:  PSSVSQLQ---LGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVNLTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEE

Query:  FQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFMHHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTL
         + R+ +  G W+++Y++A+++++ K + +ERDEGEL RTGQ + IYE Y G G W F+H   LYRG+ LS +  R + DDV+A  RLPL N+ YY D L
Subjt:  FQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFMHHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTL

Query:  CEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN-RGIAPPT
         + G  FAI+NKID +HK  WIGFQSWRA+ RK                           SL K AE+ L + I+    GD +YFW  +  + R      
Subjt:  CEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWEEVPYRTTEIIQPYIIKQVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVN-RGIAPPT

Query:  FWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEFIMFSRMFTHYLDAQ-NKNQSRSNGCLLASSELEKKHCYCR
        FWS CD +N+G CR  +  T ++M+ +  N+ +LPPMPEDG  WS + SW +PT SFLEF+MFSRMF   LDAQ  +   R+N C L  S  + KHCY R
Subjt:  FWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPEDGGLWSALHSWVMPTPSFLEFIMFSRMFTHYLDAQ-NKNQSRSNGCLLASSELEKKHCYCR

Query:  ILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKK
        +LE+LVNVWAYHS RR+VYI+P++G ++EQH  + R+  MW K+F++T LK+MDEDLAE AD +   G   LWP TGE+ W+G  E+E++++   K +KK
Subjt:  ILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLKSMDEDLAEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKK

Query:  RTTKVKLMERMKFGYKQKSLG
        + ++ KL        +QK +G
Subjt:  RTTKVKLMERMKFGYKQKSLG

AT5G04480.1 UDP-Glycosyltransferase superfamily protein0.0e+0059.33Show/hide
Query:  MRRSPSSEIDDNG-------------SGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQ--SRINRKGLLWWIPARGQTLFYFVVVFAVF
        +R S S EIDDNG             +GN     HSIRDR   KRNSS  R R+   LD  + R+R H     +NRKGLL  +  RG  L YF+V F V 
Subjt:  MRRSPSSEIDDNG-------------SGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQ--SRINRKGLLWWIPARGQTLFYFVVVFAVF

Query:  GFVTGSMLLQSSISLMSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYIL
         FV  S+LLQ+SI+     G+ K   +  +I  GS+LK+VPG I+R L+EG GL+ +R   R+GVR PRLAL+LG+M+ DP++LML+TVMKN+QKLGY+ 
Subjt:  GFVTGSMLLQSSISLMSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYIL

Query:  EIFAVESGNKRSMWEQIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRAN
        ++FAVE+G  RS+WEQ+ G   +L     G  DW+I++G+IADSLEA+ AI+SLMQEPF S+PLIWIV ED LANRLP+Y++ G   LISHW+S+F RA+
Subjt:  EIFAVESGNKRSMWEQIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRAN

Query:  VVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFK
        VVVFP F LPML+S LD+GNF VIP S  DV+AAE+Y   H+K  LRE N F EDD+++LV+GS FF +E SWD AVAMH +GPLL+ Y RR +  GSFK
Subjt:  VVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFK

Query:  FVFLCCNSSDGSHDALKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNP
        FVFL  NS+ G  DA++EVASRLGL +G++ H+GLN DVN VL MADI++Y SSQE Q+FPPL++RAMSFGIPI+ PD P ++ Y+ D VHG+ F +++P
Subjt:  FVFLCCNSSDGSHDALKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNP

Query:  DALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKS
        DALL +FS +ISDG+LS+FAQ IASSGRLL KN++A+EC+ GYA+LLEN+L+FPSD  LP S+SQLQ+ AWEWN FR E+ +      D+    A I KS
Subjt:  DALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKS

Query:  SVIFALEVQLTNSVNLTILSETENGTL--EQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ
         ++F +E +    +  T  +  +N TL    ++P+  DWD+LEEIE AEEYE VE EE ++RMERD+  W+EIYRNARKSEKLKFE NERDEGELERTG+
Subjt:  SVIFALEVQLTNSVNLTILSETENGTL--EQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ

Query:  TVSIYEIYSGAGAWPFMHHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWE
         + IYEIY+GAGAWPF+HHGSLYRGLSLS++  RL SDDV+A  RLPLLND+YY D LCEIGGMF++ANK+D+IH RPWIGFQSWRA+GRK         
Subjt:  TVSIYEIYSGAGAWPFMHHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWE

Query:  EVPYRTTEIIQPYIIKQVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPP----TFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPE
                         VSL  KAE  LE+ I+   KG++IYFW  L ++          TFWSMCDILN G CRTTF+  FR M+GL  ++ ALPPMPE
Subjt:  EVPYRTTEIIQPYIIKQVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPP----TFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPE

Query:  DGGLWSALHSWVMPTPSFLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFM
        DG  WS+LH+WVMPTPSFLEF+MFSRMF+  LDA + N + S  C LASS LE+KHCYCR+LE+LVNVWAYHSGR+MVYINP+ G +EEQHP++QRK  M
Subjt:  DGGLWSALHSWVMPTPSFLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFM

Query:  WAKYFNFTLLKSMDEDLAEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        WAKYFNFTLLKSMDEDLAEAADD+    +  LWPLTGEVHW+G+YEREREERYR+KMDKKR TK KL +R+K GYKQKSLGG
Subjt:  WAKYFNFTLLKSMDEDLAEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG

AT5G04480.2 UDP-Glycosyltransferase superfamily protein0.0e+0058.41Show/hide
Query:  MRRSPSSEIDDNG-------------SGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQ--SRINRKGLLWWIPARGQTLFYFVVVFAVF
        +R S S EIDDNG             +GN     HSIRDR   KRNSS  R R+   LD  + R+R H     +NRKGLL  +  RG  L YF+V F V 
Subjt:  MRRSPSSEIDDNG-------------SGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQ--SRINRKGLLWWIPARGQTLFYFVVVFAVF

Query:  GFVTGSMLLQSSISLMSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYIL
         FV  S+LLQ+SI+     G+ K   +  +I  GS+LK+VPG I+R L+EG GL+ +R   R+GVR PRLAL+LG+M+ DP++LML              
Subjt:  GFVTGSMLLQSSISLMSSPGSEKERWLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYIL

Query:  EIFAVESGNKRSMWEQIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRAN
         +FAVE+G  RS+WEQ+ G   +L     G  DW+I++G+IADSLEA+ AI+SLMQEPF S+PLIWIV ED LANRLP+Y++ G   LISHW+S+F RA+
Subjt:  EIFAVESGNKRSMWEQIGGQPSILSPGHYGRVDWSIYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRAN

Query:  VVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFK
        VVVFP F LPML+S LD+GNF VIP S  DV+AAE+Y   H+K  LRE N F EDD+++LV+GS FF +E SWD AVAMH +GPLL+ Y RR +  GSFK
Subjt:  VVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAENYKNVHSKSQLREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFK

Query:  FVFLCCNSSDGSHDALKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNP
        FVFL  NS+ G  DA++EVASRLGL +G++ H+GLN DVN VL MADI++Y SSQE Q+FPPL++RAMSFGIPI+ PD P ++ Y+ D VHG+ F +++P
Subjt:  FVFLCCNSSDGSHDALKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQEIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNP

Query:  DALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKS
        DALL +FS +ISDG+LS+FAQ IASSGRLL KN++A+EC+ GYA+LLEN+L+FPSD  LP S+SQLQ+ AWEWN FR E+ +      D+    A I KS
Subjt:  DALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQLQLGAWEWNLFRKEMVKTIDETADNEERIAAISKS

Query:  SVIFALEVQLTNSVNLTILSETENGTL--EQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ
         ++F +E +    +  T  +  +N TL    ++P+  DWD+LEEIE AEEYE VE EE ++RMERD+  W+EIYRNARKSEKLKFE NERDEGELERTG+
Subjt:  SVIFALEVQLTNSVNLTILSETENGTL--EQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNARKSEKLKFEANERDEGELERTGQ

Query:  TVSIYEIYSGAGAWPFMHHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWE
         + IYEIY+GAGAWPF+HHGSLYRGLSLS++  RL SDDV+A  RLPLLND+YY D LCEIGGMF++ANK+D+IH RPWIGFQSWRA+GRK         
Subjt:  TVSIYEIYSGAGAWPFMHHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRASGRKVKFGPPPWE

Query:  EVPYRTTEIIQPYIIKQVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPP----TFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPE
                         VSL  KAE  LE+ I+   KG++IYFW  L ++          TFWSMCDILN G CRTTF+  FR M+GL  ++ ALPPMPE
Subjt:  EVPYRTTEIIQPYIIKQVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPP----TFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPE

Query:  DGGLWSALHSWVMPTPSFLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFM
        DG  WS+LH+WVMPTPSFLEF+MFSRMF+  LDA + N + S  C LASS LE+KHCYCR+LE+LVNVWAYHSGR+MVYINP+ G +EEQHP++QRK  M
Subjt:  DGGLWSALHSWVMPTPSFLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFM

Query:  WAKYFNFTLLKSMDEDLAEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG
        WAKYFNFTLLKSMDEDLAEAADD+    +  LWPLTGEVHW+G+YEREREERYR+KMDKKR TK KL +R+K GYKQKSLGG
Subjt:  WAKYFNFTLLKSMDEDLAEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGACGAAGTCCATCTTCAGAGATCGACGACAATGGGAGTGGAAATGCCGTTCCTGGCACTCACTCGATTCGTGATCGTTTTCCTTTCAAACGGAATTCAAGTCACTT
CCGTTTGCGAGCCAAGGACTCATTGGATCATGCAGCCTCTCGCTCCCGATCTCACCAGAGCCGGATCAATCGCAAGGGCTTGCTCTGGTGGATTCCGGCTAGAGGGCAAA
CGCTTTTCTACTTCGTTGTCGTTTTTGCGGTATTTGGGTTTGTTACCGGGTCTATGCTGTTGCAGAGCTCGATTAGCTTGATGTCTAGTCCTGGAAGTGAAAAGGAGCGG
TGGCTTATGGAGCGTATTAAGTTTGGGAGCTCGCTGAAGTTTGTGCCAGGGAGAATTTCCAGGAGGCTGGTGGAAGGTGATGGGCTCGAAGAGGTGCGGAAGAAGGACCG
AGTTGGCGTTCGTGCACCGAGGCTTGCTTTAATCTTGGGAAGCATGGAGAACGATCCACAATCATTGATGTTGATTACTGTGATGAAGAACATACAGAAACTTGGATATA
TACTTGAGATTTTTGCAGTAGAGAGTGGAAATAAACGTTCAATGTGGGAACAGATAGGTGGCCAGCCTTCAATATTAAGTCCAGGGCATTATGGTCGTGTTGATTGGTCA
ATATATGATGGTATTATTGCTGACTCCTTGGAAGCAGAGGGGGCAATTGCAAGCCTTATGCAGGAACCTTTTTGTTCTCTACCACTCATATGGATAGTTCGAGAAGATAC
ACTAGCCAATCGGTTGCCTATGTATGAACAAAGGGGCTGGAAGCATCTCATTTCACATTGGAAGAGTTCTTTTCGAAGGGCTAATGTGGTCGTGTTCCCTGATTTTGCCC
TCCCAATGTTGTATAGCACTCTGGACAATGGAAACTTCCACGTGATTCCTGGATCCCCAGCAGATGTTTATGCTGCAGAAAACTACAAGAACGTTCACTCCAAAAGTCAA
TTAAGAGAGAAAAATGGATTCGATGAAGATGATATACTGGTCCTCGTCGTTGGAAGTTTGTTCTTCCCGAATGAGCTGTCGTGGGACTATGCTGTGGCTATGCACAGCAT
TGGACCACTACTCTCAATATATGCAAGGAGGAATGAAGTAGAAGGATCATTTAAATTTGTTTTCTTATGTTGTAATTCAAGCGACGGGTCCCATGATGCTTTAAAGGAAG
TTGCTTCACGTCTAGGACTTCCTGATGGATCTATAACACATTATGGCTTAAATGGAGATGTCAACAATGTACTGATGATGGCTGACATTGTGCTTTATGGGTCTTCACAA
GAAATTCAGAGTTTTCCTCCTCTACTTATTCGAGCCATGTCCTTTGGAATCCCAATCATGGTGCCTGATTTACCTGGCTTGAGAAATTATATCGTTGATGGTGTCCATGG
GGTTATCTTCCCAAAACATAATCCTGATGCTTTATTGAGCTCTTTCTCACAAATGATATCAGATGGGAAGCTCTCGAGATTTGCACAAGCAATAGCTTCTTCTGGAAGAT
TGCTTGCTAAGAATATACTTGCATCAGAATGTGTTGCCGGTTATGCACAGCTCCTGGAGAATGTCCTGAATTTCCCATCAGATGTTAAGCTTCCAAGCTCTGTCTCCCAG
CTTCAACTAGGGGCATGGGAATGGAATTTGTTCAGGAAGGAAATGGTGAAGACGATTGACGAAACTGCAGATAATGAAGAAAGAATAGCAGCAATAAGTAAATCCAGTGT
TATTTTTGCTCTTGAAGTGCAATTAACTAATTCTGTTAATTTAACAATTTTGTCTGAGACTGAAAATGGGACTCTGGAGCAAGATATTCCAACTTCTCAAGACTGGGATA
TTTTGGAGGAAATAGAAAAAGCCGAAGAGTATGAAACTGTTGAAATGGAAGAGTTTCAAGAAAGAATGGAGAGAGATCTAGGTGCATGGGATGAAATATATCGAAATGCT
CGGAAATCAGAAAAACTCAAGTTTGAAGCAAATGAACGGGATGAGGGTGAGCTTGAAAGGACAGGACAGACTGTATCCATATATGAGATATACAGTGGTGCTGGAGCTTG
GCCATTCATGCACCATGGTTCTTTGTACCGTGGACTAAGTCTTTCCACGAGAGCACTGAGGTTAAAATCTGATGATGTCAATGCTGTTGGACGGCTTCCTCTTCTCAATG
ACTCTTACTATCTGGACACTCTATGTGAGATTGGAGGAATGTTTGCTATTGCAAATAAGATTGATAACATTCATAAGAGACCTTGGATTGGGTTCCAGTCATGGCGGGCT
TCTGGAAGAAAGGTGAAATTTGGTCCTCCTCCTTGGGAGGAGGTGCCCTACAGAACTACAGAGATCATTCAGCCTTATATCATTAAGCAGGTTTCCTTGTGCAAAAAAGC
TGAAAATGTTTTGGAAGACACTATACGGGACAACCCTAAAGGAGATGTTATATACTTCTGGGCACATTTGCAAGTGAATCGTGGAATCGCTCCTCCTACTTTCTGGTCGA
TGTGTGATATCTTGAACAGTGGTCTCTGCAGAACCACCTTCAAGAGCACCTTTCGCGAGATGTTTGGATTGTCATCAAATATGGGAGCTCTTCCGCCTATGCCAGAAGAT
GGTGGTCTCTGGTCTGCCCTCCATAGTTGGGTGATGCCAACCCCTTCCTTCCTGGAGTTCATCATGTTTTCCAGGATGTTCACCCATTACCTTGATGCTCAGAATAAAAA
TCAGAGTCGGTCAAATGGATGTTTGTTGGCTTCCTCAGAGCTTGAGAAAAAACACTGTTACTGTCGGATATTGGAAATGCTGGTCAATGTCTGGGCTTACCACAGTGGCC
GGAGAATGGTTTATATCAATCCTCAATCCGGTTTTGTCGAAGAGCAGCATCCAATTGAACAACGCAAGGAATTTATGTGGGCAAAGTATTTCAACTTCACGTTGTTGAAG
AGTATGGATGAAGATCTAGCAGAAGCCGCTGACGATGAAGGCGGTTCAGGTAAAATAGGGTTGTGGCCATTAACAGGAGAAGTGCATTGGCAAGGAATTTATGAAAGAGA
GAGAGAAGAAAGGTATAGGGTGAAAATGGACAAGAAGAGAACTACAAAAGTAAAACTAATGGAGAGGATGAAATTTGGATACAAACAAAAATCACTTGGAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGAGACGAAGTCCATCTTCAGAGATCGACGACAATGGGAGTGGAAATGCCGTTCCTGGCACTCACTCGATTCGTGATCGTTTTCCTTTCAAACGGAATTCAAGTCACTT
CCGTTTGCGAGCCAAGGACTCATTGGATCATGCAGCCTCTCGCTCCCGATCTCACCAGAGCCGGATCAATCGCAAGGGCTTGCTCTGGTGGATTCCGGCTAGAGGGCAAA
CGCTTTTCTACTTCGTTGTCGTTTTTGCGGTATTTGGGTTTGTTACCGGGTCTATGCTGTTGCAGAGCTCGATTAGCTTGATGTCTAGTCCTGGAAGTGAAAAGGAGCGG
TGGCTTATGGAGCGTATTAAGTTTGGGAGCTCGCTGAAGTTTGTGCCAGGGAGAATTTCCAGGAGGCTGGTGGAAGGTGATGGGCTCGAAGAGGTGCGGAAGAAGGACCG
AGTTGGCGTTCGTGCACCGAGGCTTGCTTTAATCTTGGGAAGCATGGAGAACGATCCACAATCATTGATGTTGATTACTGTGATGAAGAACATACAGAAACTTGGATATA
TACTTGAGATTTTTGCAGTAGAGAGTGGAAATAAACGTTCAATGTGGGAACAGATAGGTGGCCAGCCTTCAATATTAAGTCCAGGGCATTATGGTCGTGTTGATTGGTCA
ATATATGATGGTATTATTGCTGACTCCTTGGAAGCAGAGGGGGCAATTGCAAGCCTTATGCAGGAACCTTTTTGTTCTCTACCACTCATATGGATAGTTCGAGAAGATAC
ACTAGCCAATCGGTTGCCTATGTATGAACAAAGGGGCTGGAAGCATCTCATTTCACATTGGAAGAGTTCTTTTCGAAGGGCTAATGTGGTCGTGTTCCCTGATTTTGCCC
TCCCAATGTTGTATAGCACTCTGGACAATGGAAACTTCCACGTGATTCCTGGATCCCCAGCAGATGTTTATGCTGCAGAAAACTACAAGAACGTTCACTCCAAAAGTCAA
TTAAGAGAGAAAAATGGATTCGATGAAGATGATATACTGGTCCTCGTCGTTGGAAGTTTGTTCTTCCCGAATGAGCTGTCGTGGGACTATGCTGTGGCTATGCACAGCAT
TGGACCACTACTCTCAATATATGCAAGGAGGAATGAAGTAGAAGGATCATTTAAATTTGTTTTCTTATGTTGTAATTCAAGCGACGGGTCCCATGATGCTTTAAAGGAAG
TTGCTTCACGTCTAGGACTTCCTGATGGATCTATAACACATTATGGCTTAAATGGAGATGTCAACAATGTACTGATGATGGCTGACATTGTGCTTTATGGGTCTTCACAA
GAAATTCAGAGTTTTCCTCCTCTACTTATTCGAGCCATGTCCTTTGGAATCCCAATCATGGTGCCTGATTTACCTGGCTTGAGAAATTATATCGTTGATGGTGTCCATGG
GGTTATCTTCCCAAAACATAATCCTGATGCTTTATTGAGCTCTTTCTCACAAATGATATCAGATGGGAAGCTCTCGAGATTTGCACAAGCAATAGCTTCTTCTGGAAGAT
TGCTTGCTAAGAATATACTTGCATCAGAATGTGTTGCCGGTTATGCACAGCTCCTGGAGAATGTCCTGAATTTCCCATCAGATGTTAAGCTTCCAAGCTCTGTCTCCCAG
CTTCAACTAGGGGCATGGGAATGGAATTTGTTCAGGAAGGAAATGGTGAAGACGATTGACGAAACTGCAGATAATGAAGAAAGAATAGCAGCAATAAGTAAATCCAGTGT
TATTTTTGCTCTTGAAGTGCAATTAACTAATTCTGTTAATTTAACAATTTTGTCTGAGACTGAAAATGGGACTCTGGAGCAAGATATTCCAACTTCTCAAGACTGGGATA
TTTTGGAGGAAATAGAAAAAGCCGAAGAGTATGAAACTGTTGAAATGGAAGAGTTTCAAGAAAGAATGGAGAGAGATCTAGGTGCATGGGATGAAATATATCGAAATGCT
CGGAAATCAGAAAAACTCAAGTTTGAAGCAAATGAACGGGATGAGGGTGAGCTTGAAAGGACAGGACAGACTGTATCCATATATGAGATATACAGTGGTGCTGGAGCTTG
GCCATTCATGCACCATGGTTCTTTGTACCGTGGACTAAGTCTTTCCACGAGAGCACTGAGGTTAAAATCTGATGATGTCAATGCTGTTGGACGGCTTCCTCTTCTCAATG
ACTCTTACTATCTGGACACTCTATGTGAGATTGGAGGAATGTTTGCTATTGCAAATAAGATTGATAACATTCATAAGAGACCTTGGATTGGGTTCCAGTCATGGCGGGCT
TCTGGAAGAAAGGTGAAATTTGGTCCTCCTCCTTGGGAGGAGGTGCCCTACAGAACTACAGAGATCATTCAGCCTTATATCATTAAGCAGGTTTCCTTGTGCAAAAAAGC
TGAAAATGTTTTGGAAGACACTATACGGGACAACCCTAAAGGAGATGTTATATACTTCTGGGCACATTTGCAAGTGAATCGTGGAATCGCTCCTCCTACTTTCTGGTCGA
TGTGTGATATCTTGAACAGTGGTCTCTGCAGAACCACCTTCAAGAGCACCTTTCGCGAGATGTTTGGATTGTCATCAAATATGGGAGCTCTTCCGCCTATGCCAGAAGAT
GGTGGTCTCTGGTCTGCCCTCCATAGTTGGGTGATGCCAACCCCTTCCTTCCTGGAGTTCATCATGTTTTCCAGGATGTTCACCCATTACCTTGATGCTCAGAATAAAAA
TCAGAGTCGGTCAAATGGATGTTTGTTGGCTTCCTCAGAGCTTGAGAAAAAACACTGTTACTGTCGGATATTGGAAATGCTGGTCAATGTCTGGGCTTACCACAGTGGCC
GGAGAATGGTTTATATCAATCCTCAATCCGGTTTTGTCGAAGAGCAGCATCCAATTGAACAACGCAAGGAATTTATGTGGGCAAAGTATTTCAACTTCACGTTGTTGAAG
AGTATGGATGAAGATCTAGCAGAAGCCGCTGACGATGAAGGCGGTTCAGGTAAAATAGGGTTGTGGCCATTAACAGGAGAAGTGCATTGGCAAGGAATTTATGAAAGAGA
GAGAGAAGAAAGGTATAGGGTGAAAATGGACAAGAAGAGAACTACAAAAGTAAAACTAATGGAGAGGATGAAATTTGGATACAAACAAAAATCACTTGGAGGATGA
Protein sequenceShow/hide protein sequence
MRRSPSSEIDDNGSGNAVPGTHSIRDRFPFKRNSSHFRLRAKDSLDHAASRSRSHQSRINRKGLLWWIPARGQTLFYFVVVFAVFGFVTGSMLLQSSISLMSSPGSEKER
WLMERIKFGSSLKFVPGRISRRLVEGDGLEEVRKKDRVGVRAPRLALILGSMENDPQSLMLITVMKNIQKLGYILEIFAVESGNKRSMWEQIGGQPSILSPGHYGRVDWS
IYDGIIADSLEAEGAIASLMQEPFCSLPLIWIVREDTLANRLPMYEQRGWKHLISHWKSSFRRANVVVFPDFALPMLYSTLDNGNFHVIPGSPADVYAAENYKNVHSKSQ
LREKNGFDEDDILVLVVGSLFFPNELSWDYAVAMHSIGPLLSIYARRNEVEGSFKFVFLCCNSSDGSHDALKEVASRLGLPDGSITHYGLNGDVNNVLMMADIVLYGSSQ
EIQSFPPLLIRAMSFGIPIMVPDLPGLRNYIVDGVHGVIFPKHNPDALLSSFSQMISDGKLSRFAQAIASSGRLLAKNILASECVAGYAQLLENVLNFPSDVKLPSSVSQ
LQLGAWEWNLFRKEMVKTIDETADNEERIAAISKSSVIFALEVQLTNSVNLTILSETENGTLEQDIPTSQDWDILEEIEKAEEYETVEMEEFQERMERDLGAWDEIYRNA
RKSEKLKFEANERDEGELERTGQTVSIYEIYSGAGAWPFMHHGSLYRGLSLSTRALRLKSDDVNAVGRLPLLNDSYYLDTLCEIGGMFAIANKIDNIHKRPWIGFQSWRA
SGRKVKFGPPPWEEVPYRTTEIIQPYIIKQVSLCKKAENVLEDTIRDNPKGDVIYFWAHLQVNRGIAPPTFWSMCDILNSGLCRTTFKSTFREMFGLSSNMGALPPMPED
GGLWSALHSWVMPTPSFLEFIMFSRMFTHYLDAQNKNQSRSNGCLLASSELEKKHCYCRILEMLVNVWAYHSGRRMVYINPQSGFVEEQHPIEQRKEFMWAKYFNFTLLK
SMDEDLAEAADDEGGSGKIGLWPLTGEVHWQGIYEREREERYRVKMDKKRTTKVKLMERMKFGYKQKSLGG