; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG08G012760 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG08G012760
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationCG_Chr08:25645997..25660571
RNA-Seq ExpressionClCG08G012760
SyntenyClCG08G012760
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008445103.1 PREDICTED: uncharacterized protein LOC103488245 [Cucumis melo]7.1e-20187.77Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        MPGSRQRPYLKPS+  +ILVSLVS+FL   YVYP R +LLCYIFSSGC+NGAFEQ L VA RELTDEETAA+V+MKEILK+PLAQSKNPKIAFMFLTPG 
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY
        LP EKLWHKF DGHDDRFSIYVHASR KV H SPHFV RDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESC+PLHDF+YIYNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCFEDPGPHG+GRYSE MLPEIE+KDFRKGSQWFSMKRQHAII+MADSLYYT FKR+CKRTKDGPNCYADEHYF TLFHMIDPGGIANWSVTHVDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAI
        KWHPK+YR QDVTYELL+NITSLDEI+H+T+T  KR MLRPCLWNGVKRPCHLFARKFYPETLGRLLH+FSNYT +
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAI

XP_011649768.1 glycosyltransferase BC10 isoform X1 [Cucumis sativus]1.1e-20189.28Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        MPGSRQRP LKPS+ I+ILVSLVSIF  G YVYP RTSLLCYIFSSGC+NGAFE+ L VA RELTDEETA RV+MKEILK+PLAQSKNPKIAFMFLTPGS
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY
        LPFEKLWHKFLDGHDDRFSIYVHASREKV  ASPHF+GRDIRSEKVAWGEISMVDAEKRLLANALLDP+NQHFVLLSESC+PLHDF+YIYNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCFEDPGPHG+GRYSE MLPEIEKKDFRKGSQWFSMKR+HAIIVMADSLYY  FK +CKRTK+GPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNY
        KWHPK+YR QDVTYELLRNITS+DEI+HIT+T  KRM LRPC+WNGVKRPCHLFARKFYPETLGRLLH+FSNY
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNY

XP_022131460.1 uncharacterized protein LOC111004663 isoform X1 [Momordica charantia]1.5e-19585.11Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        MPGSRQRPYLKP++YI ++VSLVS++LVG YVY P++SLLCYIFSSGC+NGAFEQH   A RELTDEETAARVV+KEILKRPLAQSKNPKIAFMFLTPG 
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY
        LPFEKLWHKF  GHDDRFS+YVHASR+K  + SP+FVG  IRSEKVAWGEISMVDAEKRLLANALLDPDN+HFVLLSESCVPLHDF+Y+YNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCF D GPHG+GRYSERM PEIEKKDFRKGSQWFSMKRQHAII+MADSLY+T FKR+CKRTKDGPNCYADEHYFPTLF+MIDPGGIANWSVT+VDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAI
        KWHP+SYRNQDVTYELL+N+TS DE VHITST  +R++L+ CLWNGV+RPCHLFARKFYPETLGRLLHLFSNYTA+
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAI

XP_023002387.1 uncharacterized protein LOC111496244 [Cucurbita maxima]3.1e-19686.21Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        M GSRQRPYLKP IYIIILVS+VSIFL G YV+PPR+S +CYIFS  CING F Q   +ASRELTD E A+RVV++EILKRPLAQSKNPKIAFMFLTPG 
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY
        LPFEKLWHKF DGHDDRFSIYVHASREKV H+SPHFVGRDIRSEKVAWGE+SMVDAEKRLLANAL+DPDNQHFVLLSESCVPLH+F+Y+YNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCF+DPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHA+IVMADSLYY  FK +CKRTKDGPNCYADEHYFPT F+MIDPGGIANWSVTHVDW+EG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV
        KWHPKSYRNQDVTYELL+NI SLD+  HITSTA KR+M +PCLWNGVKRPCHLFARKFYPETLGRL HLFSNYT +V
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV

XP_038886720.1 glycosyltransferase BC10 isoform X1 [Benincasa hispida]1.1e-21294.41Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHL VASRELTDEETAARVVMKEILKRPLA SKNPKIAFMFLTPGS
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY
        LPFEKLWHKF DGHDDRFS+YVHASREKVP  SPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQ+FVL+SE+CVPLHDF+YIYNYL+FTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCF DPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYT FKR+CKRT DGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAI
        KWHPKSYRNQDVTYELLRNITSLDE VHITSTA + M+LRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTA+
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAI

TrEMBL top hitse value%identityAlignment
A0A0A0LSA2 Uncharacterized protein5.3e-20289.28Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        MPGSRQRP LKPS+ I+ILVSLVSIF  G YVYP RTSLLCYIFSSGC+NGAFE+ L VA RELTDEETA RV+MKEILK+PLAQSKNPKIAFMFLTPGS
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY
        LPFEKLWHKFLDGHDDRFSIYVHASREKV  ASPHF+GRDIRSEKVAWGEISMVDAEKRLLANALLDP+NQHFVLLSESC+PLHDF+YIYNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCFEDPGPHG+GRYSE MLPEIEKKDFRKGSQWFSMKR+HAIIVMADSLYY  FK +CKRTK+GPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNY
        KWHPK+YR QDVTYELLRNITS+DEI+HIT+T  KRM LRPC+WNGVKRPCHLFARKFYPETLGRLLH+FSNY
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNY

A0A1S3BCM6 uncharacterized protein LOC1034882453.4e-20187.77Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        MPGSRQRPYLKPS+  +ILVSLVS+FL   YVYP R +LLCYIFSSGC+NGAFEQ L VA RELTDEETAA+V+MKEILK+PLAQSKNPKIAFMFLTPG 
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY
        LP EKLWHKF DGHDDRFSIYVHASR KV H SPHFV RDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESC+PLHDF+YIYNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCFEDPGPHG+GRYSE MLPEIE+KDFRKGSQWFSMKRQHAII+MADSLYYT FKR+CKRTKDGPNCYADEHYF TLFHMIDPGGIANWSVTHVDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAI
        KWHPK+YR QDVTYELL+NITSLDEI+H+T+T  KR MLRPCLWNGVKRPCHLFARKFYPETLGRLLH+FSNYT +
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAI

A0A6J1BPK3 uncharacterized protein LOC111004663 isoform X17.4e-19685.11Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        MPGSRQRPYLKP++YI ++VSLVS++LVG YVY P++SLLCYIFSSGC+NGAFEQH   A RELTDEETAARVV+KEILKRPLAQSKNPKIAFMFLTPG 
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY
        LPFEKLWHKF  GHDDRFS+YVHASR+K  + SP+FVG  IRSEKVAWGEISMVDAEKRLLANALLDPDN+HFVLLSESCVPLHDF+Y+YNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCF D GPHG+GRYSERM PEIEKKDFRKGSQWFSMKRQHAII+MADSLY+T FKR+CKRTKDGPNCYADEHYFPTLF+MIDPGGIANWSVT+VDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAI
        KWHP+SYRNQDVTYELL+N+TS DE VHITST  +R++L+ CLWNGV+RPCHLFARKFYPETLGRLLHLFSNYTA+
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAI

A0A6J1GJ69 uncharacterized protein LOC1114546951.5e-19384.88Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        M GSRQRPYLKP +YIIILVS+VSIFL G YV+PPR+S +CYIF SGCING F Q    ASRELTD E A+RVV++EILKRPLAQSKNPKIAFMFLTPG 
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY
        LPFEKLWHKF DGHD RFSIYVHASREKV H+SPHFVGRDIRSEKV WGE+SMVDAEKRLLANAL+DPDNQHF L SESCVPLH+F+Y+YNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCF+DPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKR HAIIVMADSLYYT FK +CKRTKDGPNCYADEHYFPT F+MIDPGGI+NWSVTHVDW+EG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV
        KWHPKSYRNQDVTYELL+NI SLD+   ITSTA KR+M +PCLWNGVKRPCHLFARKFYPETLGRL HLFSNYT +V
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV

A0A6J1KJD2 uncharacterized protein LOC1114962441.5e-19686.21Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        M GSRQRPYLKP IYIIILVS+VSIFL G YV+PPR+S +CYIFS  CING F Q   +ASRELTD E A+RVV++EILKRPLAQSKNPKIAFMFLTPG 
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY
        LPFEKLWHKF DGHDDRFSIYVHASREKV H+SPHFVGRDIRSEKVAWGE+SMVDAEKRLLANAL+DPDNQHFVLLSESCVPLH+F+Y+YNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCF+DPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHA+IVMADSLYY  FK +CKRTKDGPNCYADEHYFPT F+MIDPGGIANWSVTHVDW+EG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV
        KWHPKSYRNQDVTYELL+NI SLD+  HITSTA KR+M +PCLWNGVKRPCHLFARKFYPETLGRL HLFSNYT +V
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV

SwissProt top hitse value%identityAlignment
Q3UQA7 Selenoprotein H1.4e-0531.25Show/hide
Query:  LEKLGKDAKNRTVVIEHCKQCQSFKKRAIQVQNGLENGVPGITVLLNPDKPRRGCFEI---RTEDGEKFISLLDMKRPFTRMKELDMEEVISDIIK
        ++K  K A+  TVVIEHC   + + + A  +   L+   P + V +NP KPRRG FE+   R+++    +     K P  ++K  + +EV+ ++ K
Subjt:  LEKLGKDAKNRTVVIEHCKQCQSFKKRAIQVQNGLENGVPGITVLLNPDKPRRGCFEI---RTEDGEKFISLLDMKRPFTRMKELDMEEVISDIIK

Q65XS5 Glycosyltransferase BC104.7e-4634.88Show/hide
Query:  IFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGSLPFEKLWHKFLDG-HDDRFSIYVH
        + L+GV++      LL    SS  + G   +  AV +     EE    V   E+ + PL    N ++AF+F+    LP + +W  F  G  + RFSI+VH
Subjt:  IFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGSLPFEKLWHKFLDG-HDDRFSIYVH

Query:  AS----REKVPHASPHFVGRDI-RSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSYIDCFEDPGPHGSGRYSER
        +       +    S  F  R +  S +V WGE SM++AE+ LLA+AL DP N+ FV +S+SCVPL++F+Y Y+Y++ ++ S++D F D     +GRY+ R
Subjt:  AS----REKVPHASPHFVGRDI-RSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSYIDCFEDPGPHGSGRYSER

Query:  MLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTK--------DGP---------NCYADEHYFPTLF--HMIDPGGIANWSVTHVDW--
        M P I  +++RKGSQW  + R+HA +V+ D      F++ C+R          D P         NC  DEHY  TL   H ++   +   SVTH  W  
Subjt:  MLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTK--------DGP---------NCYADEHYFPTLF--HMIDPGGIANWSVTHVDW--

Query:  SEGK------WHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKF
        S  K      WHP +Y+  D T  L+++I  +D I + T    +      C  NG   PC LFARKF
Subjt:  SEGK------WHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKF

Q8IZQ5 Selenoprotein H6.8e-0531.3Show/hide
Query:  GKVVEVEKETVKVDPALEKL---GKDAKNRTVVIEHCKQCQSFKKRAIQVQNGLENGVPGITVLLNPDKPRRGCFEIR--TEDGEKFISLLDMKR-PFTR
        G+  + E   V V    EKL   G+  +  TVVIEHC   + + + A  +   L    P + V +NP KPRRG FE+     DG        +K+ P  +
Subjt:  GKVVEVEKETVKVDPALEKL---GKDAKNRTVVIEHCKQCQSFKKRAIQVQNGLENGVPGITVLLNPDKPRRGCFEIR--TEDGEKFISLLDMKR-PFTR

Query:  MKELDMEEVISDIIK
        +K  + +EV+ ++ K
Subjt:  MKELDMEEVISDIIK

Arabidopsis top hitse value%identityAlignment
AT2G19160.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.4e-13054.88Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPG
        +PG+R R   +  I+II ++SL+++F++G Y++P  +   CY+FSS GC        L  + RE +D+E AARVV+ EIL  P    K+ KIAFMFLTPG
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPG

Query:  SLPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVS
        +LPFEKLW  F  GH+ +FS+Y+HAS++   H S +F+ R+IRS++V WG ISM+DAE+RLL NAL DP+NQ FVLLS+SCVPL  F+Y+YNY++ +NVS
Subjt:  SLPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVS

Query:  YIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDG-PNCYADEHYFPTLFHMIDPGGIANWSVTHVDWS
        Y+DCF+DPGPHG+GR+ + MLPEI ++DFRKG+QWFSMKRQHA++ +AD+LYY+ F+ +C    +G  NC ADEHY PT F+M+DP GIANW+VT+VDWS
Subjt:  YIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDG-PNCYADEHYFPTLFHMIDPGGIANWSVTHVDWS

Query:  EGKWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV
        E KWHP+ Y  +D+T EL++NI+S+D +  +TS     +    C+WNG+KRPC+LF RKF+ +TL +L+ LF NYT+IV
Subjt:  EGKWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV

AT4G30060.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein6.1e-13457.74Show/hide
Query:  MPGSRQR--PYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLT
        +PG+R R  P  +  ++II+++SL+++F +  Y+YP  +   CY+ SS GC   A    L  + RE +D+E AARVV++EIL  P    KN KIAFMFLT
Subjt:  MPGSRQR--PYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLT

Query:  PGSLPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTN
        PG+LPFE+LW +F  GH+ +FS+Y+HAS+E+  H S +F+ R+IRS++V WG ISMVDAE+RLLANAL D  NQ FVLLS+SCVPL  F+YIYNYL+ +N
Subjt:  PGSLPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTN

Query:  VSYIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFC-KRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVD
        +SY+DCF+DPG HG+GR+   MLPEI KKDFRKG+QWF+MKRQHA+  MADSLYY+ F+ +C    ++  NC ADEHY PT FHM+DPGGIANW+VT VD
Subjt:  VSYIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFC-KRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVD

Query:  WSEGKWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV
        WSE KWHPK+Y  +D+T+ELL N+TS D +VH+TS  +   +  PC+WNG++RPC+LF RKF+P+TL +LL LFSNYT  V
Subjt:  WSEGKWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV

AT4G31350.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein4.2e-15165.34Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPG
        M  SRQRP  K   +II LV LV++ ++  ++YPPR S+ CY+FS  GC    ++Q L V +RELTD E AA+VVM EI+  P +++ NPK+AFMFLTPG
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPG

Query:  SLPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVS
        +LPFE LW  F  GH+++FS+YVHAS++   H S +FVGRDI S KVAWG+ISMVDAE+RLLA+AL+DPDNQHF+LLS+SCVPL DF+YIYN+LIF N+S
Subjt:  SLPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVS

Query:  YIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSE
        +IDCFEDPGPHGSGRYS+ MLPE+EKKDFRKGSQWFSMKR+HAI+VMADSLYYT FK +C+   +G NCYADEHYFPTLF+MIDP GIANWSVTHVDWSE
Subjt:  YIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSE

Query:  GKWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV
        GKWHPK Y  +D+T  L+R I S+    H+TS   K   ++PCLW G +RPC+LFARKF PETL RL++LF NYT++V
Subjt:  GKWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV

AT4G31350.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein4.2e-15165.34Show/hide
Query:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPG
        M  SRQRP  K   +II LV LV++ ++  ++YPPR S+ CY+FS  GC    ++Q L V +RELTD E AA+VVM EI+  P +++ NPK+AFMFLTPG
Subjt:  MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPG

Query:  SLPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVS
        +LPFE LW  F  GH+++FS+YVHAS++   H S +FVGRDI S KVAWG+ISMVDAE+RLLA+AL+DPDNQHF+LLS+SCVPL DF+YIYN+LIF N+S
Subjt:  SLPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVS

Query:  YIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSE
        +IDCFEDPGPHGSGRYS+ MLPE+EKKDFRKGSQWFSMKR+HAI+VMADSLYYT FK +C+   +G NCYADEHYFPTLF+MIDP GIANWSVTHVDWSE
Subjt:  YIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSE

Query:  GKWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV
        GKWHPK Y  +D+T  L+R I S+    H+TS   K   ++PCLW G +RPC+LFARKF PETL RL++LF NYT++V
Subjt:  GKWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV

AT5G57270.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein8.5e-12858.31Show/hide
Query:  RQRPYLKPSIYIIILVSLVSIFLVGVYVYP----PRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        R R  LK  + I++LV + S+ LV  Y+YP     ++S    + S GC   A    L V  R+ TDEE AARVV+K+IL+ P A +   KIAFMFLTPG+
Subjt:  RQRPYLKPSIYIIILVSLVSIFLVGVYVYP----PRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY
        LPFEKLW KF  G + RFSIY+H SR +  H S HF  R+I S+ V WG ISMVDAE+RLLANAL DPDNQHFVLLSESC+PLH FDY Y YL+  NVS+
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCK-RTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSE
        ID FED GPHG+GR+ + MLPEI ++DFRKG+QWF+MKRQHA+IVMAD LYY+ F+ +C+   +   NC ADEHY PT FHM+DPGGI+NWSVT+VDWSE
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCK-RTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSE

Query:  GKWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLR-PCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV
         +WHPK+YR +DV+ +LL+NITS D  VH+TS   +   LR PC W G++RPC+LFARK + + L +L+ LF NYT+ V
Subjt:  GKWHPKSYRNQDVTYELLRNITSLDEIVHITSTALKRMMLR-PCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAGGATCACGCCAAAGGCCATACTTAAAGCCATCTATTTACATTATTATATTGGTTTCGTTGGTCAGCATATTTCTAGTTGGCGTCTATGTTTATCCACCTAGAAC
CTCCCTGCTTTGTTATATCTTTTCTAGTGGTTGTATCAATGGTGCATTTGAACAGCACCTGGCAGTTGCTTCTCGGGAATTAACCGATGAGGAAACTGCAGCTCGGGTTG
TAATGAAGGAAATTTTGAAGAGACCTCTGGCTCAGTCTAAGAATCCGAAAATTGCCTTTATGTTTTTGACTCCAGGTTCATTACCTTTTGAGAAGCTATGGCATAAATTT
CTTGATGGTCACGATGACAGATTTTCTATATATGTGCACGCATCTAGAGAGAAAGTACCCCACGCGAGCCCCCATTTTGTTGGTCGTGACATTCGCAGTGAAAAGGTAGC
CTGGGGAGAAATTTCTATGGTTGATGCAGAGAAGAGACTTTTGGCAAATGCACTTTTAGACCCTGATAATCAGCATTTTGTTTTATTATCCGAAAGCTGTGTGCCTCTTC
ACGACTTTGATTATATTTATAACTATTTGATATTTACAAACGTCAGCTATATTGATTGTTTTGAAGACCCTGGTCCCCATGGAAGTGGCAGGTATTCAGAGCGCATGTTA
CCTGAAATTGAAAAGAAAGATTTCCGTAAAGGTTCTCAGTGGTTTTCTATGAAGCGACAACATGCTATTATTGTAATGGCTGACAGTCTTTACTATACAATATTCAAGCG
TTTCTGCAAGCGAACTAAGGACGGGCCCAATTGCTATGCTGACGAGCACTATTTTCCAACCCTTTTCCATATGATCGACCCTGGTGGAATTGCAAATTGGTCAGTAACGC
ATGTTGATTGGTCTGAGGGAAAGTGGCATCCAAAATCATATAGGAACCAAGATGTCACCTATGAGCTTCTGAGGAACATTACCTCGCTAGACGAAATCGTCCACATTACA
AGTACTGCTCTGAAGAGGATGATGTTGAGGCCCTGCTTGTGGAACGGAGTGAAAAGACCATGCCATCTGTTTGCGAGAAAATTTTATCCAGAAACTCTGGGAAGATTGTT
GCACCTTTTCTCTAACTACACAGCTATAGTTGAAGACTATGAGCCTGGGGCTGCCACGATTAGCAGCTTTTCATTGGTTTTTGCCCTGTCTTCTCAGCCATATTTGAACT
CTAATATATTTTTGTTTCTTAGAAATGAAAATCTGGCGCCATTCTTGGGACTTCGTGCCGCCACTTTCTTCCATAGAAGAAACCAAGTGGAACTGCGTGGGCGTCGTACG
ACCACCCACTCGACGGCCACGATCCTTCCTCACACTCCCAATCCATCTCCACCGTCTCTTCAAAATCCCACCGCCGTCCCAGAGCGCGGGAAAAACCCCCAACCCAACCT
CTGTATATCACTGCAAACCAACGGTCAAGGTTTCAGGTTTGCTTTCATGGCGCCGAGGAAGCGGAGCAAGAATCAAGAAGACGAACCGGCGGCGGAGAAGCCGGCCCCGG
CGTCGTCGAGGGTTACTCGGAGTTCTGCTCGGCTAGCGGCGAACTCCAAAGCTGATTCGGCGGTGGATGAGGCTGTGACGGAGGTGCCGAAGAGTAAGAAGGCGAAACGT
GCTCCGAAGGAGAATGGGAAGGTGGTGGAAGTTGAAAAGGAGACGGTGAAAGTTGATCCTGCTTTGGAGAAACTTGGCAAGGATGCGAAGAATAGAACGGTGGTGATTGA
ACATTGCAAACAGTGCCAATCATTCAAGAAAAGGGCAATCCAGGTGCAAAATGGTCTAGAGAATGGTGTTCCTGGAATCACTGTGCTGCTTAACCCTGATAAGCCAAGAA
GGGGTTGCTTTGAAATCCGGACTGAAGATGGCGAGAAGTTTATTAGTCTTCTGGACATGAAGCGTCCATTTACTCGCATGAAGGAACTGGACATGGAAGAAGTCATTTCA
GATATCATCAAGAAGATAAAAGGATGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAGGATCACGCCAAAGGCCATACTTAAAGCCATCTATTTACATTATTATATTGGTTTCGTTGGTCAGCATATTTCTAGTTGGCGTCTATGTTTATCCACCTAGAAC
CTCCCTGCTTTGTTATATCTTTTCTAGTGGTTGTATCAATGGTGCATTTGAACAGCACCTGGCAGTTGCTTCTCGGGAATTAACCGATGAGGAAACTGCAGCTCGGGTTG
TAATGAAGGAAATTTTGAAGAGACCTCTGGCTCAGTCTAAGAATCCGAAAATTGCCTTTATGTTTTTGACTCCAGGTTCATTACCTTTTGAGAAGCTATGGCATAAATTT
CTTGATGGTCACGATGACAGATTTTCTATATATGTGCACGCATCTAGAGAGAAAGTACCCCACGCGAGCCCCCATTTTGTTGGTCGTGACATTCGCAGTGAAAAGGTAGC
CTGGGGAGAAATTTCTATGGTTGATGCAGAGAAGAGACTTTTGGCAAATGCACTTTTAGACCCTGATAATCAGCATTTTGTTTTATTATCCGAAAGCTGTGTGCCTCTTC
ACGACTTTGATTATATTTATAACTATTTGATATTTACAAACGTCAGCTATATTGATTGTTTTGAAGACCCTGGTCCCCATGGAAGTGGCAGGTATTCAGAGCGCATGTTA
CCTGAAATTGAAAAGAAAGATTTCCGTAAAGGTTCTCAGTGGTTTTCTATGAAGCGACAACATGCTATTATTGTAATGGCTGACAGTCTTTACTATACAATATTCAAGCG
TTTCTGCAAGCGAACTAAGGACGGGCCCAATTGCTATGCTGACGAGCACTATTTTCCAACCCTTTTCCATATGATCGACCCTGGTGGAATTGCAAATTGGTCAGTAACGC
ATGTTGATTGGTCTGAGGGAAAGTGGCATCCAAAATCATATAGGAACCAAGATGTCACCTATGAGCTTCTGAGGAACATTACCTCGCTAGACGAAATCGTCCACATTACA
AGTACTGCTCTGAAGAGGATGATGTTGAGGCCCTGCTTGTGGAACGGAGTGAAAAGACCATGCCATCTGTTTGCGAGAAAATTTTATCCAGAAACTCTGGGAAGATTGTT
GCACCTTTTCTCTAACTACACAGCTATAGTTGAAGACTATGAGCCTGGGGCTGCCACGATTAGCAGCTTTTCATTGGTTTTTGCCCTGTCTTCTCAGCCATATTTGAACT
CTAATATATTTTTGTTTCTTAGAAATGAAAATCTGGCGCCATTCTTGGGACTTCGTGCCGCCACTTTCTTCCATAGAAGAAACCAAGTGGAACTGCGTGGGCGTCGTACG
ACCACCCACTCGACGGCCACGATCCTTCCTCACACTCCCAATCCATCTCCACCGTCTCTTCAAAATCCCACCGCCGTCCCAGAGCGCGGGAAAAACCCCCAACCCAACCT
CTGTATATCACTGCAAACCAACGGTCAAGGTTTCAGGTTTGCTTTCATGGCGCCGAGGAAGCGGAGCAAGAATCAAGAAGACGAACCGGCGGCGGAGAAGCCGGCCCCGG
CGTCGTCGAGGGTTACTCGGAGTTCTGCTCGGCTAGCGGCGAACTCCAAAGCTGATTCGGCGGTGGATGAGGCTGTGACGGAGGTGCCGAAGAGTAAGAAGGCGAAACGT
GCTCCGAAGGAGAATGGGAAGGTGGTGGAAGTTGAAAAGGAGACGGTGAAAGTTGATCCTGCTTTGGAGAAACTTGGCAAGGATGCGAAGAATAGAACGGTGGTGATTGA
ACATTGCAAACAGTGCCAATCATTCAAGAAAAGGGCAATCCAGGTGCAAAATGGTCTAGAGAATGGTGTTCCTGGAATCACTGTGCTGCTTAACCCTGATAAGCCAAGAA
GGGGTTGCTTTGAAATCCGGACTGAAGATGGCGAGAAGTTTATTAGTCTTCTGGACATGAAGCGTCCATTTACTCGCATGAAGGAACTGGACATGGAAGAAGTCATTTCA
GATATCATCAAGAAGATAAAAGGATGA
Protein sequenceShow/hide protein sequence
MPGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLAVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGSLPFEKLWHKF
LDGHDDRFSIYVHASREKVPHASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFDYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERML
PEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTIFKRFCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLRNITSLDEIVHIT
STALKRMMLRPCLWNGVKRPCHLFARKFYPETLGRLLHLFSNYTAIVEDYEPGAATISSFSLVFALSSQPYLNSNIFLFLRNENLAPFLGLRAATFFHRRNQVELRGRRT
TTHSTATILPHTPNPSPPSLQNPTAVPERGKNPQPNLCISLQTNGQGFRFAFMAPRKRSKNQEDEPAAEKPAPASSRVTRSSARLAANSKADSAVDEAVTEVPKSKKAKR
APKENGKVVEVEKETVKVDPALEKLGKDAKNRTVVIEHCKQCQSFKKRAIQVQNGLENGVPGITVLLNPDKPRRGCFEIRTEDGEKFISLLDMKRPFTRMKELDMEEVIS
DIIKKIKG