; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018355 (gene) of Snake gourd v1 genome

Gene IDTan0018355
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationLG02:94881726..94889989
RNA-Seq ExpressionTan0018355
SyntenyTan0018355
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008445103.1 PREDICTED: uncharacterized protein LOC103488245 [Cucumis melo]4.1e-18988.1Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH
        SL     YVYP R++LLCYIFSSGC+NGAF+Q  P+A RELTD ETAAQV+M+EILK+PLAQSKNPKIAFMFLTP  LP EKLWHKFFDGHDDRFSIYVH
Subjt:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH

Query:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI
        ASR K  H SPHFV RDIRSEKVAWGEISMVDAEKRLLANAL DPDNQHFVLLSESC+PLHDFEYIYNYLIFTNVSYIDCFEDPGPHG+GRYSE MLPEI
Subjt:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI

Query:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL
        E+KDFRKGSQWFSMKRQHAII+MADSLYYTKFKRYCKRTKDGPNCYADEHYF TLF+MIDPGGIANWSVTHVDWSEGKWHPK+YR QDVTYELLKNITSL
Subjt:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL

Query:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTV
        DEI+H+T+T P KRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLH+FSNYTV
Subjt:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTV

XP_011649768.1 glycosyltransferase BC10 isoform X1 [Cucumis sativus]2.5e-18685.63Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH
        S+   G YVYP R+SLLCYIFSSGC+NGAF++  P+A RELTD ETA +V+M+EILK+PLAQSKNPKIAFMFLTP SLPFEKLWHKF DGHDDRFSIYVH
Subjt:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH

Query:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI
        ASREK   ASPHF+GRDIRSEKVAWGEISMVDAEKRLLANAL DP+NQHFVLLSESC+PLHDFEYIYNYLIFTNVSYIDCFEDPGPHG+GRYSE MLPEI
Subjt:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI

Query:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL
        EKKDFRKGSQWFSMKR+HAIIVMADSLYY KFK YCKRTK+GPNCYADEHYFPTLF+MIDPGGIANWSVTHVDWSEGKWHPK+YR QDVTYELL+NITS+
Subjt:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL

Query:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV
        DEI+HIT+T P KR  LRPC+WNGVKRPCHLFARKFYPETLGRLLH+FSNY  A+
Subjt:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV

XP_022951968.1 uncharacterized protein LOC111454695 [Cucurbita moschata]3.2e-18687.04Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH
        S+   G YV+PPRSS +CYIF SGCING F+QRRP ASRELTDAE A++VV+REILKRPLAQSKNPKIAFMFLTP  LPFEKLWHKFFDGHD RFSIYVH
Subjt:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH

Query:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI
        ASREK +H+SPHFVGRDIRSEKV WGE+SMVDAEKRLLANAL DPDNQHF L SESCVPLH+FEY+YNYLIFTNVSYIDCF+DPGPHGSGRYSERMLPEI
Subjt:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI

Query:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL
        EKKDFRKGSQWFSMKR HAIIVMADSLYYTKFK YCKRTKDGPNCYADEHYFPT FNMIDPGGI+NWSVTHVDW+EGKWHPKSYRNQDVTYELLKNI SL
Subjt:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL

Query:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV
        D+   ITSTAP KR M +PCLWNGVKRPCHLFARKFYPETLGRL HLFSNYTV V
Subjt:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV

XP_023002387.1 uncharacterized protein LOC111496244 [Cucurbita maxima]2.4e-18988.45Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH
        S+   G YV+PPRSS +CYIFS  CING F+QRRPIASRELTDAE A++VV+REILKRPLAQSKNPKIAFMFLTP  LPFEKLWHKFFDGHDDRFSIYVH
Subjt:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH

Query:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI
        ASREK +H+SPHFVGRDIRSEKVAWGE+SMVDAEKRLLANAL DPDNQHFVLLSESCVPLH+FEY+YNYLIFTNVSYIDCF+DPGPHGSGRYSERMLPEI
Subjt:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI

Query:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL
        EKKDFRKGSQWFSMKRQHA+IVMADSLYY KFK YCKRTKDGPNCYADEHYFPT FNMIDPGGIANWSVTHVDW+EGKWHPKSYRNQDVTYELLKNI SL
Subjt:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL

Query:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV
        D+  HITSTAP KR M +PCLWNGVKRPCHLFARKFYPETLGRL HLFSNYTV V
Subjt:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV

XP_038886720.1 glycosyltransferase BC10 isoform X1 [Benincasa hispida]3.9e-19290.62Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH
        S+  VGVYVYPPR+SLLCYIFSSGCINGAF+Q  P+ASRELTD ETAA+VVM+EILKRPLA SKNPKIAFMFLTP SLPFEKLWHKFFDGHDDRFS+YVH
Subjt:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH

Query:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI
        ASREK    SPHFVGRDIRSEKVAWGEISMVDAEKRLLANAL DPDNQ+FVL+SE+CVPLHDFEYIYNYL+FTNVSYIDCF DPGPHGSGRYSERMLPEI
Subjt:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI

Query:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL
        EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRT DGPNCYADEHYFPTLF+MIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELL+NITSL
Subjt:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL

Query:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYT
        DE VHITSTAP +  +LRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYT
Subjt:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYT

TrEMBL top hitse value%identityAlignment
A0A0A0LSA2 Uncharacterized protein1.2e-18685.63Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH
        S+   G YVYP R+SLLCYIFSSGC+NGAF++  P+A RELTD ETA +V+M+EILK+PLAQSKNPKIAFMFLTP SLPFEKLWHKF DGHDDRFSIYVH
Subjt:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH

Query:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI
        ASREK   ASPHF+GRDIRSEKVAWGEISMVDAEKRLLANAL DP+NQHFVLLSESC+PLHDFEYIYNYLIFTNVSYIDCFEDPGPHG+GRYSE MLPEI
Subjt:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI

Query:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL
        EKKDFRKGSQWFSMKR+HAIIVMADSLYY KFK YCKRTK+GPNCYADEHYFPTLF+MIDPGGIANWSVTHVDWSEGKWHPK+YR QDVTYELL+NITS+
Subjt:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL

Query:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV
        DEI+HIT+T P KR  LRPC+WNGVKRPCHLFARKFYPETLGRLLH+FSNY  A+
Subjt:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV

A0A1S3BCM6 uncharacterized protein LOC1034882452.0e-18988.1Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH
        SL     YVYP R++LLCYIFSSGC+NGAF+Q  P+A RELTD ETAAQV+M+EILK+PLAQSKNPKIAFMFLTP  LP EKLWHKFFDGHDDRFSIYVH
Subjt:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH

Query:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI
        ASR K  H SPHFV RDIRSEKVAWGEISMVDAEKRLLANAL DPDNQHFVLLSESC+PLHDFEYIYNYLIFTNVSYIDCFEDPGPHG+GRYSE MLPEI
Subjt:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI

Query:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL
        E+KDFRKGSQWFSMKRQHAII+MADSLYYTKFKRYCKRTKDGPNCYADEHYF TLF+MIDPGGIANWSVTHVDWSEGKWHPK+YR QDVTYELLKNITSL
Subjt:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL

Query:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTV
        DEI+H+T+T P KRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLH+FSNYTV
Subjt:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTV

A0A6J1BPK3 uncharacterized protein LOC111004663 isoform X17.3e-18486.08Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH
        S+  VG YVY P+SSLLCYIFSSGC+NGAF+Q RP A RELTD ETAA+VV++EILKRPLAQSKNPKIAFMFLTP  LPFEKLWHKFF GHDDRFS+YVH
Subjt:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH

Query:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI
        ASR+K  + SP+FVG  IRSEKVAWGEISMVDAEKRLLANAL DPDN+HFVLLSESCVPLHDFEY+YNYLIFTNVSYIDCF D GPHG+GRYSERM PEI
Subjt:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI

Query:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL
        EKKDFRKGSQWFSMKRQHAII+MADSLY+TKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVT+VDWSEGKWHP+SYRNQDVTYELLKN+TS 
Subjt:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL

Query:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYT
        DE VHITST P +R +L+ CLWNGV+RPCHLFARKFYPETLGRLLHLFSNYT
Subjt:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYT

A0A6J1GJ69 uncharacterized protein LOC1114546951.6e-18687.04Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH
        S+   G YV+PPRSS +CYIF SGCING F+QRRP ASRELTDAE A++VV+REILKRPLAQSKNPKIAFMFLTP  LPFEKLWHKFFDGHD RFSIYVH
Subjt:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH

Query:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI
        ASREK +H+SPHFVGRDIRSEKV WGE+SMVDAEKRLLANAL DPDNQHF L SESCVPLH+FEY+YNYLIFTNVSYIDCF+DPGPHGSGRYSERMLPEI
Subjt:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI

Query:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL
        EKKDFRKGSQWFSMKR HAIIVMADSLYYTKFK YCKRTKDGPNCYADEHYFPT FNMIDPGGI+NWSVTHVDW+EGKWHPKSYRNQDVTYELLKNI SL
Subjt:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL

Query:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV
        D+   ITSTAP KR M +PCLWNGVKRPCHLFARKFYPETLGRL HLFSNYTV V
Subjt:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV

A0A6J1KJD2 uncharacterized protein LOC1114962441.2e-18988.45Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH
        S+   G YV+PPRSS +CYIFS  CING F+QRRPIASRELTDAE A++VV+REILKRPLAQSKNPKIAFMFLTP  LPFEKLWHKFFDGHDDRFSIYVH
Subjt:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH

Query:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI
        ASREK +H+SPHFVGRDIRSEKVAWGE+SMVDAEKRLLANAL DPDNQHFVLLSESCVPLH+FEY+YNYLIFTNVSYIDCF+DPGPHGSGRYSERMLPEI
Subjt:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI

Query:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL
        EKKDFRKGSQWFSMKRQHA+IVMADSLYY KFK YCKRTKDGPNCYADEHYFPT FNMIDPGGIANWSVTHVDW+EGKWHPKSYRNQDVTYELLKNI SL
Subjt:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSL

Query:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV
        D+  HITSTAP KR M +PCLWNGVKRPCHLFARKFYPETLGRL HLFSNYTV V
Subjt:  DEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC108.1e-4734.6Show/hide
Query:  LITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDG-HDDRFSIYVH
        L+ +GV++      LL    SS  + G   +   + +      E    V   E+ + PL    N ++AF+F+    LP + +W  FF G  + RFSI+VH
Subjt:  LITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDG-HDDRFSIYVH

Query:  AS----REKAAHASPHFVGRDI-RSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSER
        +       +A   S  F  R +  S +V WGE SM++AE+ LLA+AL+DP N+ FV +S+SCVPL++F Y Y+Y++ ++ S++D F D     +GRY+ R
Subjt:  AS----REKAAHASPHFVGRDI-RSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSER

Query:  MLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTK--------DGP---------NCYADEHYFPTLFNMID-PGGIANWSVTHVDW--S
        M P I  +++RKGSQW  + R+HA +V+ D     +F+++C+R          D P         NC  DEHY  TL         +   SVTH  W  S
Subjt:  MLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTK--------DGP---------NCYADEHYFPTLFNMID-PGGIANWSVTHVDW--S

Query:  EGK------WHPKSYRNQDVTYELLKNITSLDEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKF
          K      WHP +Y+  D T  L+K+I  +D I + T    +       C  NG   PC LFARKF
Subjt:  EGK------WHPKSYRNQDVTYELLKNITSLDEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKF

Arabidopsis top hitse value%identityAlignment
AT2G19160.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.5e-12556.42Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH
        ++  +G Y++P  S   CY+FSS    G  D   P + RE +D E AA+VV+ EIL  P    K+ KIAFMFLTP +LPFEKLW  FF GH+ +FS+Y+H
Subjt:  SLITVGVYVYPPRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVH

Query:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI
        AS++   H S +F+ R+IRS++V WG ISM+DAE+RLL NAL+DP+NQ FVLLS+SCVPL  FEY+YNY++ +NVSY+DCF+DPGPHG+GR+ + MLPEI
Subjt:  ASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEI

Query:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDG-PNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITS
         ++DFRKG+QWFSMKRQHA++ +AD+LYY+KF+ YC    +G  NC ADEHY PT F M+DP GIANW+VT+VDWSE KWHP+ Y  +D+T EL+KNI+S
Subjt:  EKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDG-PNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITS

Query:  LDEIVHITSTAPQKRAML--RPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV
        +D +  +TS   +K  ++    C+WNG+KRPC+LF RKF+ +TL +L+ LF NYT  V
Subjt:  LDEIVHITSTAPQKRAML--RPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV

AT4G30060.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.0e-12960.28Show/hide
Query:  SLIT---VGVYVYPPRSSLLCYIFSS-GCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFS
        SLIT   +  Y+YP  S   CY+ SS GC   A     P + RE +D E AA+VV+REIL  P    KN KIAFMFLTP +LPFE+LW +FF GH+ +FS
Subjt:  SLIT---VGVYVYPPRSSLLCYIFSS-GCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFS

Query:  IYVHASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERM
        +Y+HAS+E+  H S +F+ R+IRS++V WG ISMVDAE+RLLANAL+D  NQ FVLLS+SCVPL  FEYIYNYL+ +N+SY+DCF+DPG HG+GR+   M
Subjt:  IYVHASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERM

Query:  LPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYC-KRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLK
        LPEI KKDFRKG+QWF+MKRQHA+  MADSLYY+KF+ YC    ++  NC ADEHY PT F+M+DPGGIANW+VT VDWSE KWHPK+Y  +D+T+ELL 
Subjt:  LPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYC-KRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLK

Query:  NITSLDEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV
        N+TS D +VH+TS    +   + PC+WNG++RPC+LF RKF+P+TL +LL LFSNYT +V
Subjt:  NITSLDEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV

AT4G31350.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.7e-14566.01Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSS-GCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYV
        +++ +  ++YPPR+S+ CY+FS  GC    + Q   + +RELTD+E AAQVVM EI+  P +++ NPK+AFMFLTP +LPFE LW  FF GH+++FS+YV
Subjt:  SLITVGVYVYPPRSSLLCYIFSS-GCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYV

Query:  HASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPE
        HAS++   H S +FVGRDI S KVAWG+ISMVDAE+RLLA+AL DPDNQHF+LLS+SCVPL DF YIYN+LIF N+S+IDCFEDPGPHGSGRYS+ MLPE
Subjt:  HASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPE

Query:  IEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITS
        +EKKDFRKGSQWFSMKR+HAI+VMADSLYYTKFK YC+   +G NCYADEHYFPTLFNMIDP GIANWSVTHVDWSEGKWHPK Y  +D+T  L++ I S
Subjt:  IEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITS

Query:  LDEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV
        +    H+TS   +K   ++PCLW G +RPC+LFARKF PETL RL++LF NYT  V
Subjt:  LDEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV

AT4G31350.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.7e-14566.01Show/hide
Query:  SLITVGVYVYPPRSSLLCYIFSS-GCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYV
        +++ +  ++YPPR+S+ CY+FS  GC    + Q   + +RELTD+E AAQVVM EI+  P +++ NPK+AFMFLTP +LPFE LW  FF GH+++FS+YV
Subjt:  SLITVGVYVYPPRSSLLCYIFSS-GCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYV

Query:  HASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPE
        HAS++   H S +FVGRDI S KVAWG+ISMVDAE+RLLA+AL DPDNQHF+LLS+SCVPL DF YIYN+LIF N+S+IDCFEDPGPHGSGRYS+ MLPE
Subjt:  HASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPE

Query:  IEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITS
        +EKKDFRKGSQWFSMKR+HAI+VMADSLYYTKFK YC+   +G NCYADEHYFPTLFNMIDP GIANWSVTHVDWSEGKWHPK Y  +D+T  L++ I S
Subjt:  IEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITS

Query:  LDEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV
        +    H+TS   +K   ++PCLW G +RPC+LFARKF PETL RL++LF NYT  V
Subjt:  LDEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV

AT5G57270.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.7e-12458.06Show/hide
Query:  SLITVGVYVYP----PRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFS
        S++ V  Y+YP     +SS    + S GC   A     P+  R+ TD E AA+VV+++IL+ P A +   KIAFMFLTP +LPFEKLW KFF G + RFS
Subjt:  SLITVGVYVYP----PRSSLLCYIFSSGCINGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFS

Query:  IYVHASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERM
        IY+H SR +  H S HF  R+I S+ V WG ISMVDAE+RLLANAL+DPDNQHFVLLSESC+PLH F+Y Y YL+  NVS+ID FED GPHG+GR+ + M
Subjt:  IYVHASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERM

Query:  LPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCK-RTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLK
        LPEI ++DFRKG+QWF+MKRQHA+IVMAD LYY+KF+ YC+   +   NC ADEHY PT F+M+DPGGI+NWSVT+VDWSE +WHPK+YR +DV+ +LLK
Subjt:  LPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCK-RTKDGPNCYADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLK

Query:  NITSLDEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV
        NITS D  VH+TS   +   +  PC W G++RPC+LFARK + + L +L+ LF NYT  V
Subjt:  NITSLDEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTATTTTCAGAATTTCTACTATTTGGATTACAGAATCAAAAAACAGTTTTCGAAAATATATCTGCCAAACACAATAATGCTCTCCACTGTTCACTCAGATCTCTGCA
ACTTTCACCAACTTTACTCTTCGCTTTTCCAGCCACGGTACTTCATTCTCATTCCGAATCGTGGTCGTTGCGTTTCGTTAATTCTGCTTCCTTCTTCATCCGCATCGTGA
TTACTGTTTTTCGTTTATTGGACTGCGGTTCTCGTAGTTTGATCACCGTTGGCGTCTATGTGTATCCACCAAGAAGCTCTCTGCTTTGTTATATATTTTCTAGTGGGTGT
ATCAATGGTGCATTTGATCAACGCCGACCCATTGCTTCCCGGGAATTAACTGATGCAGAGACTGCAGCTCAGGTTGTAATGAGGGAAATTTTGAAGAGACCTCTAGCCCA
ATCTAAGAATCCGAAAATTGCCTTTATGTTTTTGACTCCAGCTTCACTACCCTTTGAGAAGCTCTGGCATAAATTTTTTGATGGTCACGACGACAGATTTTCCATATATG
TGCATGCATCTAGAGAAAAAGCAGCACATGCAAGCCCCCATTTTGTAGGTCGTGACATTCGCAGTGAAAAGGTAGCATGGGGAGAAATTTCTATGGTTGATGCAGAGAAG
AGACTTTTGGCAAATGCACTTCAAGACCCTGATAATCAGCACTTTGTTTTGTTATCTGAAAGTTGTGTGCCTCTTCACGACTTTGAGTATATCTATAACTATTTGATATT
TACAAATGTCAGCTATATTGATTGTTTTGAAGACCCTGGTCCCCATGGAAGTGGTAGATATTCGGAGCGCATGTTACCTGAAATTGAAAAGAAGGACTTTCGTAAAGGTT
CTCAGTGGTTTTCCATGAAGCGACAACATGCTATTATTGTAATGGCTGACAGTCTTTACTATACAAAATTCAAGCGTTACTGCAAGCGAACTAAGGACGGCCCCAACTGC
TATGCTGACGAGCACTATTTTCCAACCCTTTTCAACATGATCGACCCTGGTGGAATTGCAAATTGGTCAGTAACACATGTAGATTGGTCTGAAGGAAAGTGGCATCCAAA
ATCGTATAGGAATCAAGATGTCACATATGAGCTTCTGAAGAACATTACTTCCCTAGACGAGATCGTCCACATTACGAGTACTGCTCCGCAGAAGAGGGCGATGTTGAGGC
CCTGCTTGTGGAACGGCGTGAAAAGACCATGCCATTTATTTGCGAGGAAATTTTATCCAGAAACCCTGGGAAGATTGTTGCACCTTTTCTCTAACTACACAGTGGCTGTT
TAG
mRNA sequenceShow/hide mRNA sequence
ATGTTATTTTCAGAATTTCTACTATTTGGATTACAGAATCAAAAAACAGTTTTCGAAAATATATCTGCCAAACACAATAATGCTCTCCACTGTTCACTCAGATCTCTGCA
ACTTTCACCAACTTTACTCTTCGCTTTTCCAGCCACGGTACTTCATTCTCATTCCGAATCGTGGTCGTTGCGTTTCGTTAATTCTGCTTCCTTCTTCATCCGCATCGTGA
TTACTGTTTTTCGTTTATTGGACTGCGGTTCTCGTAGTTTGATCACCGTTGGCGTCTATGTGTATCCACCAAGAAGCTCTCTGCTTTGTTATATATTTTCTAGTGGGTGT
ATCAATGGTGCATTTGATCAACGCCGACCCATTGCTTCCCGGGAATTAACTGATGCAGAGACTGCAGCTCAGGTTGTAATGAGGGAAATTTTGAAGAGACCTCTAGCCCA
ATCTAAGAATCCGAAAATTGCCTTTATGTTTTTGACTCCAGCTTCACTACCCTTTGAGAAGCTCTGGCATAAATTTTTTGATGGTCACGACGACAGATTTTCCATATATG
TGCATGCATCTAGAGAAAAAGCAGCACATGCAAGCCCCCATTTTGTAGGTCGTGACATTCGCAGTGAAAAGGTAGCATGGGGAGAAATTTCTATGGTTGATGCAGAGAAG
AGACTTTTGGCAAATGCACTTCAAGACCCTGATAATCAGCACTTTGTTTTGTTATCTGAAAGTTGTGTGCCTCTTCACGACTTTGAGTATATCTATAACTATTTGATATT
TACAAATGTCAGCTATATTGATTGTTTTGAAGACCCTGGTCCCCATGGAAGTGGTAGATATTCGGAGCGCATGTTACCTGAAATTGAAAAGAAGGACTTTCGTAAAGGTT
CTCAGTGGTTTTCCATGAAGCGACAACATGCTATTATTGTAATGGCTGACAGTCTTTACTATACAAAATTCAAGCGTTACTGCAAGCGAACTAAGGACGGCCCCAACTGC
TATGCTGACGAGCACTATTTTCCAACCCTTTTCAACATGATCGACCCTGGTGGAATTGCAAATTGGTCAGTAACACATGTAGATTGGTCTGAAGGAAAGTGGCATCCAAA
ATCGTATAGGAATCAAGATGTCACATATGAGCTTCTGAAGAACATTACTTCCCTAGACGAGATCGTCCACATTACGAGTACTGCTCCGCAGAAGAGGGCGATGTTGAGGC
CCTGCTTGTGGAACGGCGTGAAAAGACCATGCCATTTATTTGCGAGGAAATTTTATCCAGAAACCCTGGGAAGATTGTTGCACCTTTTCTCTAACTACACAGTGGCTGTT
TAGGCCATCTCGTCGGATCAAGACATTGTTTTAAGAGGTTAGAATAATGTGGTTGATAATTTTATACCGTTGATTTCATTATACGATAGGAAAATTAAAAATTATATTTT
GACTTCGAATACGGCTTTTCACAGAATTCTATATGTATCTATGAGATCGAGTATAGAATTCTATTTACTCACTTTAAATTGAGTATATGTTACCCTCTCTTTCCTGTCAG
GATCGAAAATCTGATATTTTACATATCCATATGATTGAGTCTTTAGGTTTCCGACAGTGTAAAAAGAGGTGTTTCGAATCATTACTATTTGACTCGGACTAGTTCTTAAA
AAAGAGACATTTCAACTGAAAA
Protein sequenceShow/hide protein sequence
MLFSEFLLFGLQNQKTVFENISAKHNNALHCSLRSLQLSPTLLFAFPATVLHSHSESWSLRFVNSASFFIRIVITVFRLLDCGSRSLITVGVYVYPPRSSLLCYIFSSGC
INGAFDQRRPIASRELTDAETAAQVVMREILKRPLAQSKNPKIAFMFLTPASLPFEKLWHKFFDGHDDRFSIYVHASREKAAHASPHFVGRDIRSEKVAWGEISMVDAEK
RLLANALQDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKRYCKRTKDGPNC
YADEHYFPTLFNMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLKNITSLDEIVHITSTAPQKRAMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTVAV