; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi04G005970 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi04G005970
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Descriptionbeta-1,4-N-acetylglucosaminyltransferase family protein
Genome locationchr04:5531250..5533543
RNA-Seq ExpressionLsi04G005970
SyntenyLsi04G005970
Gene Ontology termsGO:0006044 - N-acetylglucosamine metabolic process (biological process)
GO:0006487 - protein N-linked glycosylation (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003830 - beta-1,4-mannosylglycoprotein 4-beta-N-acetylglucosaminyltransferase activity (molecular function)
InterPro domainsIPR006813 - Glycosyl transferase, family 17


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0034269.1 hypothetical protein E6C27_scaffold65G002680 [Cucumis melo var. makuwa]3.5e-23691.14Show/hide
Query:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME
        MWWMMGEGGGHYCSKKSDDICG+VCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYD NVSME
Subjt:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME

Query:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV
        N CKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVE RFTYGTVGGRFKKGENPFVEEAFQRV
Subjt:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV

Query:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
        ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPE+LHLQLKNYLYSFEFH+DDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
Subjt:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR

Query:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKLQIPFAWELRTREWLISNFNVYLPVKVSCTVACLTPRGRSIPIVVDRYMANRGDPLPPDKLMLILLEVV
        RISDF+FKMKAYSHNDRV           R+LQIPFAWEL TREWLISNF VYLPVKVSC VACLTPRG SI IVVD+Y  NRG  LPPDKLMLILLEVV
Subjt:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKLQIPFAWELRTREWLISNFNVYLPVKVSCTVACLTPRGRSIPIVVDRYMANRGDPLPPDKLMLILLEVV

Query:  TAWTVELIPWVEVMCRYHLFSLSWFGSR-SLSIHWVGELA
        TAWTVELIPWVEVMCRY L SL WFGSR  L I WV  L+
Subjt:  TAWTVELIPWVEVMCRYHLFSLSWFGSR-SLSIHWVGELA

TYK15651.1 hypothetical protein E5676_scaffold35G001480 [Cucumis melo var. makuwa]4.1e-21390.64Show/hide
Query:  MSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSMENQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRW
        MSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYD NVSMEN CKLHGWKVREFPRRVYDAVLFSNEIEMLTLRW
Subjt:  MSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSMENQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRW

Query:  KELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRVALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLL
        KELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVE RFTYGTVGGRFKKGENPFVEEAFQRVALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLL
Subjt:  KELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRVALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLL

Query:  RWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDFIFKMKAYSHNDRVSSLAIVSSGKCRKLQI
        RWCDDIPE+LHLQLKNYLYSFEFH+DDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDF+FKMKAYSHNDRV           R+LQI
Subjt:  RWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDFIFKMKAYSHNDRVSSLAIVSSGKCRKLQI

Query:  PFAWELRTREWLISNFNVYLPVKVSCTVACLTPRGRSIPIVVDRYMANRGDPLPPDKLMLILLEVVTAWTVELIPWVEVMCRYHLFSLSWFGSR-SLSIH
        PFAWEL TREWLISNF VYLPVKVSC VACLTPRG SI IVVD+Y  NRG  LPPDKLMLILLEVVTAWTVELIPWVEVMCRY L SL WFGSR  L I 
Subjt:  PFAWELRTREWLISNFNVYLPVKVSCTVACLTPRGRSIPIVVDRYMANRGDPLPPDKLMLILLEVVTAWTVELIPWVEVMCRYHLFSLSWFGSR-SLSIH

Query:  WVGELA
        WV  L+
Subjt:  WVGELA

XP_004135275.1 uncharacterized LOC101222690 isoform X1 [Cucumis sativus]3.9e-18793.67Show/hide
Query:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME
        MWWMMGEGGGHYCSKKSDDICG+VCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYD NVSME
Subjt:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME

Query:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV
        N CKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRD+FKFVE RFTYGTVGGRFKKGENPFVEEAFQRV
Subjt:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV

Query:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
        ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPE+LHLQLKNYLYSFEFH+DDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
Subjt:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR

Query:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKL
        RISDF+FKMKAYSHNDRV   + ++  + +K+
Subjt:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKL

XP_008446156.1 PREDICTED: uncharacterized protein LOC103488964 [Cucumis melo]1.3e-18793.98Show/hide
Query:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME
        MWWMMGEGGGHYCSKKSDDICG+VCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYD NVSME
Subjt:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME

Query:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV
        N CKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVE RFTYGTVGGRFKKGENPFVEEAFQRV
Subjt:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV

Query:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
        ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPE+LHLQLKNYLYSFEFH+DDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
Subjt:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR

Query:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKL
        RISDF+FKMKAYSHNDRV   + ++  + +K+
Subjt:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKL

XP_038892016.1 uncharacterized protein LOC120081333 [Benincasa hispida]2.3e-18797.8Show/hide
Query:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME
        MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYD NVSME
Subjt:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME

Query:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV
        N CKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV
Subjt:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV

Query:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
        ALDQLLRIAGI+DDDLLIMSDVDEIPSRHTINLLRWCDDIPE+LHLQL+NYLYSFEFH+DDNSWRASVHRYKSGKT+YVHYRQSDDLLADSGWHCSFCFR
Subjt:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR

Query:  RISDFIFKMKAYSHNDRV
        RISDFIFKMKAYSHNDRV
Subjt:  RISDFIFKMKAYSHNDRV

TrEMBL top hitse value%identityAlignment
A0A0A0KS65 Uncharacterized protein1.9e-18793.67Show/hide
Query:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME
        MWWMMGEGGGHYCSKKSDDICG+VCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYD NVSME
Subjt:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME

Query:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV
        N CKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRD+FKFVE RFTYGTVGGRFKKGENPFVEEAFQRV
Subjt:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV

Query:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
        ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPE+LHLQLKNYLYSFEFH+DDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
Subjt:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR

Query:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKL
        RISDF+FKMKAYSHNDRV   + ++  + +K+
Subjt:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKL

A0A1S3BF80 uncharacterized protein LOC1034889646.4e-18893.98Show/hide
Query:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME
        MWWMMGEGGGHYCSKKSDDICG+VCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYD NVSME
Subjt:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME

Query:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV
        N CKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVE RFTYGTVGGRFKKGENPFVEEAFQRV
Subjt:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV

Query:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
        ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPE+LHLQLKNYLYSFEFH+DDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
Subjt:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR

Query:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKL
        RISDF+FKMKAYSHNDRV   + ++  + +K+
Subjt:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKL

A0A5A7SYS3 Uncharacterized protein1.7e-23691.14Show/hide
Query:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME
        MWWMMGEGGGHYCSKKSDDICG+VCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYD NVSME
Subjt:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME

Query:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV
        N CKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVE RFTYGTVGGRFKKGENPFVEEAFQRV
Subjt:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV

Query:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
        ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPE+LHLQLKNYLYSFEFH+DDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
Subjt:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR

Query:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKLQIPFAWELRTREWLISNFNVYLPVKVSCTVACLTPRGRSIPIVVDRYMANRGDPLPPDKLMLILLEVV
        RISDF+FKMKAYSHNDRV           R+LQIPFAWEL TREWLISNF VYLPVKVSC VACLTPRG SI IVVD+Y  NRG  LPPDKLMLILLEVV
Subjt:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKLQIPFAWELRTREWLISNFNVYLPVKVSCTVACLTPRGRSIPIVVDRYMANRGDPLPPDKLMLILLEVV

Query:  TAWTVELIPWVEVMCRYHLFSLSWFGSR-SLSIHWVGELA
        TAWTVELIPWVEVMCRY L SL WFGSR  L I WV  L+
Subjt:  TAWTVELIPWVEVMCRYHLFSLSWFGSR-SLSIHWVGELA

A0A5D3CUS1 Uncharacterized protein2.0e-21390.64Show/hide
Query:  MSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSMENQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRW
        MSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYD NVSMEN CKLHGWKVREFPRRVYDAVLFSNEIEMLTLRW
Subjt:  MSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSMENQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRW

Query:  KELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRVALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLL
        KELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVE RFTYGTVGGRFKKGENPFVEEAFQRVALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLL
Subjt:  KELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRVALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLL

Query:  RWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDFIFKMKAYSHNDRVSSLAIVSSGKCRKLQI
        RWCDDIPE+LHLQLKNYLYSFEFH+DDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDF+FKMKAYSHNDRV           R+LQI
Subjt:  RWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDFIFKMKAYSHNDRVSSLAIVSSGKCRKLQI

Query:  PFAWELRTREWLISNFNVYLPVKVSCTVACLTPRGRSIPIVVDRYMANRGDPLPPDKLMLILLEVVTAWTVELIPWVEVMCRYHLFSLSWFGSR-SLSIH
        PFAWEL TREWLISNF VYLPVKVSC VACLTPRG SI IVVD+Y  NRG  LPPDKLMLILLEVVTAWTVELIPWVEVMCRY L SL WFGSR  L I 
Subjt:  PFAWELRTREWLISNFNVYLPVKVSCTVACLTPRGRSIPIVVDRYMANRGDPLPPDKLMLILLEVVTAWTVELIPWVEVMCRYHLFSLSWFGSR-SLSIH

Query:  WVGELA
        WV  L+
Subjt:  WVGELA

Q700J8 Putative N-acetylglucosaminyltransferase III6.7e-18592.47Show/hide
Query:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME
        MWWMMGEGGGHYCSKKSDDICG+VCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYD NVSM+
Subjt:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME

Query:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV
        N CKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYF   RD+FKFVE RFTYGTVGGRFKKGENPFVEEAFQRV
Subjt:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV

Query:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
        ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPE+LHLQLKNYLYSFEFH+DDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
Subjt:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR

Query:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKL
        RISDF+FKMKAYSHNDRV   + ++  + +K+
Subjt:  RISDFIFKMKAYSHNDRVSSLAIVSSGKCRKL

SwissProt top hitse value%identityAlignment
Q02527 Beta-1,4-mannosyl-glycoprotein 4-beta-N-acetylglucosaminyltransferase6.5e-1229.87Show/hide
Query:  REFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFAR--NRDQFKFVEPRFTYGTV-----GGRFKKGENPFVEEAFQRVAL-
        RE PRRV +A+  ++E ++L +R+ EL   +  FV+ E+N T  G+P+PL F        F+++  +  Y  +     GGR    ++ ++ + + R  L 
Subjt:  REFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFAR--NRDQFKFVEPRFTYGTV-----GGRFKKGENPFVEEAFQRVAL-

Query:  -DQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSF
         D + R+  +  DD+ I+ D DEIP+R  +  L+  D   E     ++  LY F
Subjt:  -DQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSF

Q09327 Beta-1,4-mannosyl-glycoprotein 4-beta-N-acetylglucosaminyltransferase8.5e-1229.87Show/hide
Query:  REFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFAR--NRDQFKFVEPRFTYGTV-----GGRFKKGENPFVEEAFQRVAL-
        RE PRRV +A+  ++E ++L +R+ EL   +  FV+ E+N T  G+P+PL F        F+++  +  Y  +     GGR    ++ ++ + + R  L 
Subjt:  REFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFAR--NRDQFKFVEPRFTYGTV-----GGRFKKGENPFVEEAFQRVAL-

Query:  -DQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSF
         D + R+  +  DD+ I+ D DEIP+R  +  L+  D   E     ++  LY F
Subjt:  -DQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSF

Q10470 Beta-1,4-mannosyl-glycoprotein 4-beta-N-acetylglucosaminyltransferase1.5e-1129.22Show/hide
Query:  REFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFAR--NRDQFKFVEPRFTYGTV-----GGRFKKGENPFVEEAFQRVAL-
        RE PRRV +A+  ++E ++L +R+ EL   +  FV+ ++N T  G+P+PL F        F+++  +  Y  +     GGR    ++ ++ + + R  L 
Subjt:  REFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFAR--NRDQFKFVEPRFTYGTV-----GGRFKKGENPFVEEAFQRVAL-

Query:  -DQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSF
         D + R+  +  DD+ I+ D DEIP+R  +  L+  D   E     ++  LY F
Subjt:  -DQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSF

Arabidopsis top hitse value%identityAlignment
AT1G12990.1 beta-1,4-N-acetylglucosaminyltransferase family protein2.4e-15577.04Show/hide
Query:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME
        MWWMMGE GGHYCSKK+DDICG VC QE  R    SRL C  RG D+KT++ L  +VPTC+L  Y+HGQKISYFLRPLWESPPK F+ I HYY EN SME
Subjt:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME

Query:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV
          CKLHGW VR++PRRVYDAVLFSNE+++L +RW+EL+PYITQFVLLE+N+TFTG PKPL FA +RD+FKF+E R TYGTVGGRF KG+NPF EEA+QRV
Subjt:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRV

Query:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR
        ALDQLLRIAGITDDDLL+MSDVDEIPSRHTINLLRWCD+IP+ILHL+LKNYLYSFEF +D+ SWRAS+HRY++GKTRY HYRQSD++LAD+GWHCSFCFR
Subjt:  ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFR

Query:  RISDFIFKMKAYSHNDRV
        RIS+FIFKMKAYSHNDRV
Subjt:  RISDFIFKMKAYSHNDRV

AT1G67880.1 beta-1,4-N-acetylglucosaminyltransferase family protein1.5e-15277.12Show/hide
Query:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME
        MWWMMGE GGHYCSKKSDD+CG    QES+R  G+SRL CI RG D+K+ L L  ++P C+L +Y++  KISYFLRPLWESPPK F+ I HY+ EN SME
Subjt:  MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSME

Query:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGE-NPFVEEAFQR
        + CKLHGW+ RE+PRRVYDAVLFS E+E+LT+RWKELYPY+TQFVLLE+NSTFTG PKPL FA +RD+FKF+EPR TYG++GGRFKKGE NPF EEA+QR
Subjt:  NQCKLHGWKVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGE-NPFVEEAFQR

Query:  VALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCF
        +ALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIP+ILHL+LKNYLYSFEF +DD SWRASVHRY++GKTRY HYRQSD +LADSGWHCSFCF
Subjt:  VALDQLLRIAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCF

Query:  RRISDFIFKMKAYSHNDRV
        RRIS+F+FKMKAYSH DRV
Subjt:  RRISDFIFKMKAYSHNDRV

AT3G01620.1 beta-1,4-N-acetylglucosaminyltransferase family protein9.7e-13672.61Show/hide
Query:  GHYCSKKSDDICGEVCDQESNRV-LGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSMENQCKLHGW
        G+  SKK+D IC +VC QE +R    +SRLRC+ RG D KTFL LF L+P  I  IYLHGQKI+YFLRPLWESPPK FN++ HYY EN SME  C LHGW
Subjt:  GHYCSKKSDDICGEVCDQESNRV-LGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSMENQCKLHGW

Query:  KVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQ-FKFVEPRFTYGTVGGRFKKGENPFVEEAFQRVALDQLLR
        K+RE PRRV+DA LFSNEI+MLTLRW EL PYITQFVLLE+NSTFTG  K L FA NR++ FKFVEPR TYG VGGRFKKGENPFVEE+FQR+ALDQL++
Subjt:  KVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQ-FKFVEPRFTYGTVGGRFKKGENPFVEEAFQRVALDQLLR

Query:  IAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDFIF
        +AGI +DDLLIMSDVDEIPS HTINLLRWCD  P ILHLQL+NYLYS+E+++D  SWRASVH YK GKTR  H+RQS++LL DSGWHCSFCFR I+DF+F
Subjt:  IAGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDFIF

Query:  KMKAYSHNDRVSSL
        KMKAYSH DRV  L
Subjt:  KMKAYSHNDRVSSL

AT3G27540.1 beta-1,4-N-acetylglucosaminyltransferase family protein1.2e-14173.14Show/hide
Query:  GHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSMENQCKLHGWK
        G+  SKK+DDIC +VC Q S     +SRL+C+ +G+D++T+L LF L+P  IL IYLHGQK +YF RPLWESPPK F  I HYY+ENV+ME+ C LHGW 
Subjt:  GHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSMENQCKLHGWK

Query:  VREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRVALDQLLRIA
        +R+ PRRV+DAVLFSNE ++LT+RW ELYPY+TQFV+LE+NSTFTG PKPL F  N+DQFKFVEPR TYGT+GGRF+KGENPFVEEA+QRVALDQLLRIA
Subjt:  VREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRVALDQLLRIA

Query:  GITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDFIFKM
        GI +DDLLIMSDVDEIPS HTINLLRWCDDIP +LHLQLKNYLYSFE+++D  SWRAS+HRY  GKTRY H+RQS+ +LADSGWHCSFCFR IS+FIFKM
Subjt:  GITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDFIFKM

Query:  KAYSHNDRV
        KAYSH+DRV
Subjt:  KAYSHNDRV

AT5G14480.1 beta-1,4-N-acetylglucosaminyltransferase family protein1.2e-13871.29Show/hide
Query:  GHYCSKKSDDICGEVCDQESNRVL-GMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSMENQCKLHGW
        G+Y SKK+DDIC +VC Q+ +R     SR+RC+ RG+D KT++  F +VP  I  +YLHGQK++YFLRPLWESPPK F  + HYY EN SM   C LHGW
Subjt:  GHYCSKKSDDICGEVCDQESNRVL-GMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSMENQCKLHGW

Query:  KVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRVALDQLLRI
        K RE PRRV+DAVLFSNE++MLT+RWKELYPYITQFV+LE+NSTFTG PKPL F  NR +F+F EPR +YG + GRFKKGENPFVEEA+QR+ALDQL+R+
Subjt:  KVREFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRVALDQLLRI

Query:  AGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDFIFK
        AGI +DDLLIMSDVDEIPS HTINLLRWCD  P ILHLQLKNYLYSFE+ +D+ SWRAS+H+YK GKTRY H+RQ + LLADSGWHCSFCFR IS+FIFK
Subjt:  AGITDDDLLIMSDVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDFIFK

Query:  MKAYSHNDRV
        MKAYSHNDRV
Subjt:  MKAYSHNDRV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGGTGGATGATGGGCGAAGGTGGAGGCCATTACTGTTCCAAGAAATCCGATGATATCTGTGGCGAAGTTTGTGATCAGGAATCCAATCGAGTATTAGGCATGTCTAG
ACTTCGCTGCATTTTTCGTGGATATGATGTGAAAACGTTCCTTATTCTTTTTGCACTGGTGCCGACGTGCATCTTGATCATTTACCTGCATGGACAAAAGATCTCGTACT
TCTTACGGCCATTATGGGAATCCCCACCAAAAGAATTCAATATGATTACTCACTACTATGATGAGAATGTATCTATGGAGAACCAATGCAAACTCCATGGTTGGAAAGTC
CGTGAATTCCCAAGGCGTGTTTATGATGCTGTGCTGTTCAGTAATGAGATTGAGATGCTCACCTTGCGATGGAAAGAACTCTACCCTTACATTACACAGTTTGTTCTTCT
TGAGGCTAATTCAACATTTACTGGGAAGCCAAAGCCATTATACTTTGCCCGCAACAGAGACCAATTCAAATTTGTGGAGCCAAGATTTACCTATGGCACTGTTGGAGGGA
GATTTAAGAAAGGTGAAAATCCGTTTGTTGAAGAGGCATTTCAGCGGGTAGCACTCGATCAGCTTCTCAGAATTGCTGGTATCACTGATGATGACTTGTTGATAATGTCT
GATGTCGACGAAATTCCAAGCAGGCACACAATTAATCTCTTGAGATGGTGTGATGACATACCAGAAATTCTTCATTTACAACTTAAGAACTATTTGTACTCTTTCGAGTT
CCATCTTGATGACAATAGCTGGAGGGCTTCAGTTCATAGATACAAATCTGGTAAGACAAGGTACGTTCATTATCGCCAGTCGGATGACCTGTTGGCAGATTCAGGGTGGC
ACTGTAGCTTCTGCTTCCGTCGTATCAGTGACTTCATCTTTAAGATGAAAGCATACAGCCATAATGACAGAGTTAGTTCACTTGCCATCGTATCTTCTGGAAAATGCAGA
AAATTACAAATTCCTTTTGCCTGGGAATTGCGTACGAGAGAGTGGCTGATCTCTAACTTCAATGTCTACCTCCCTGTGAAGGTTAGTTGCACTGTAGCTTGTCTGACACC
TCGTGGACGTTCCATCCCGATAGTTGTTGACCGATACATGGCCAACCGTGGCGACCCTTTGCCTCCAGATAAGCTAATGTTAATTCTTTTGGAAGTTGTAACTGCTTGGA
CAGTTGAACTAATTCCATGGGTGGAAGTTATGTGTAGATATCATCTGTTTTCTTTGAGTTGGTTTGGTTCAAGGAGTTTATCTATTCATTGGGTGGGTGAGCTTGCTGTC
ACAGGAGCAGCAGCATATTAG
mRNA sequenceShow/hide mRNA sequence
ATGTGGTGGATGATGGGCGAAGGTGGAGGCCATTACTGTTCCAAGAAATCCGATGATATCTGTGGCGAAGTTTGTGATCAGGAATCCAATCGAGTATTAGGCATGTCTAG
ACTTCGCTGCATTTTTCGTGGATATGATGTGAAAACGTTCCTTATTCTTTTTGCACTGGTGCCGACGTGCATCTTGATCATTTACCTGCATGGACAAAAGATCTCGTACT
TCTTACGGCCATTATGGGAATCCCCACCAAAAGAATTCAATATGATTACTCACTACTATGATGAGAATGTATCTATGGAGAACCAATGCAAACTCCATGGTTGGAAAGTC
CGTGAATTCCCAAGGCGTGTTTATGATGCTGTGCTGTTCAGTAATGAGATTGAGATGCTCACCTTGCGATGGAAAGAACTCTACCCTTACATTACACAGTTTGTTCTTCT
TGAGGCTAATTCAACATTTACTGGGAAGCCAAAGCCATTATACTTTGCCCGCAACAGAGACCAATTCAAATTTGTGGAGCCAAGATTTACCTATGGCACTGTTGGAGGGA
GATTTAAGAAAGGTGAAAATCCGTTTGTTGAAGAGGCATTTCAGCGGGTAGCACTCGATCAGCTTCTCAGAATTGCTGGTATCACTGATGATGACTTGTTGATAATGTCT
GATGTCGACGAAATTCCAAGCAGGCACACAATTAATCTCTTGAGATGGTGTGATGACATACCAGAAATTCTTCATTTACAACTTAAGAACTATTTGTACTCTTTCGAGTT
CCATCTTGATGACAATAGCTGGAGGGCTTCAGTTCATAGATACAAATCTGGTAAGACAAGGTACGTTCATTATCGCCAGTCGGATGACCTGTTGGCAGATTCAGGGTGGC
ACTGTAGCTTCTGCTTCCGTCGTATCAGTGACTTCATCTTTAAGATGAAAGCATACAGCCATAATGACAGAGTTAGTTCACTTGCCATCGTATCTTCTGGAAAATGCAGA
AAATTACAAATTCCTTTTGCCTGGGAATTGCGTACGAGAGAGTGGCTGATCTCTAACTTCAATGTCTACCTCCCTGTGAAGGTTAGTTGCACTGTAGCTTGTCTGACACC
TCGTGGACGTTCCATCCCGATAGTTGTTGACCGATACATGGCCAACCGTGGCGACCCTTTGCCTCCAGATAAGCTAATGTTAATTCTTTTGGAAGTTGTAACTGCTTGGA
CAGTTGAACTAATTCCATGGGTGGAAGTTATGTGTAGATATCATCTGTTTTCTTTGAGTTGGTTTGGTTCAAGGAGTTTATCTATTCATTGGGTGGGTGAGCTTGCTGTC
ACAGGAGCAGCAGCATATTAG
Protein sequenceShow/hide protein sequence
MWWMMGEGGGHYCSKKSDDICGEVCDQESNRVLGMSRLRCIFRGYDVKTFLILFALVPTCILIIYLHGQKISYFLRPLWESPPKEFNMITHYYDENVSMENQCKLHGWKV
REFPRRVYDAVLFSNEIEMLTLRWKELYPYITQFVLLEANSTFTGKPKPLYFARNRDQFKFVEPRFTYGTVGGRFKKGENPFVEEAFQRVALDQLLRIAGITDDDLLIMS
DVDEIPSRHTINLLRWCDDIPEILHLQLKNYLYSFEFHLDDNSWRASVHRYKSGKTRYVHYRQSDDLLADSGWHCSFCFRRISDFIFKMKAYSHNDRVSSLAIVSSGKCR
KLQIPFAWELRTREWLISNFNVYLPVKVSCTVACLTPRGRSIPIVVDRYMANRGDPLPPDKLMLILLEVVTAWTVELIPWVEVMCRYHLFSLSWFGSRSLSIHWVGELAV
TGAAAY