; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10004615 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10004615
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationChr08:18888196..18904254
RNA-Seq ExpressionHG10004615
SyntenyHG10004615
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR036378 - FAS1 domain superfamily
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008445103.1 PREDICTED: uncharacterized protein LOC103488245 [Cucumis melo]8.8e-17787.5Show/hide
Query:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        M GSRQRPYLKPS+  +ILVSLVS+FL   YVYP R +LLCYIFSSGC+NGAFEQ LPVA RELTDEETAA+V+MKEILK+PLAQSKNPKIAFMFLTPG 
Subjt:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY
        LP EKLWHKF DGHDDRFSIYVHASR K+ + SPHFV RDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESC+PLHDFEYIYNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCFEDPGPHG+GRYSE MLPEIE+KDFRKGSQWFSMKRQHAII+MADSLYYTKFK YCKRTKDGPNCYADEHYF TLFHMIDPGGIANWSVTHVDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK
        KWHPK+YR QDVTYELL+NITSLDEI+H+T+T PK+
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK

XP_011649768.1 glycosyltransferase BC10 isoform X1 [Cucumis sativus]1.0e-18089.88Show/hide
Query:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        M GSRQRP LKPS+ I+ILVSLVSIF  G YVYP RTSLLCYIFSSGC+NGAFE+ LPVA RELTDEETA RV+MKEILK+PLAQSKNPKIAFMFLTPGS
Subjt:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY
        LPFEKLWHKFLDGHDDRFSIYVHASREK+  ASPHF+GRDIRSEKVAWGEISMVDAEKRLLANALLDP+NQHFVLLSESC+PLHDFEYIYNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCFEDPGPHG+GRYSE MLPEIEKKDFRKGSQWFSMKR+HAIIVMADSLYY KFKHYCKRTK+GPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK
        KWHPK+YR QDVTYELLRNITS+DEI+HIT+T PK+
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK

XP_023002387.1 uncharacterized protein LOC111496244 [Cucurbita maxima]2.8e-17587.2Show/hide
Query:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        MLGSRQRPYLKP IYIIILVS+VSIFL G YV+PPR+S +CYIFS  CING F Q  P+ASRELTD E A+RVV++EILKRPLAQSKNPKIAFMFLTPG 
Subjt:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY
        LPFEKLWHKF DGHDDRFSIYVHASREK+ ++SPHFVGRDIRSEKVAWGE+SMVDAEKRLLANAL+DPDNQHFVLLSESCVPLH+FEY+YNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCF+DPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHA+IVMADSLYY KFK YCKRTKDGPNCYADEHYFPT F+MIDPGGIANWSVTHVDW+EG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK
        KWHPKSYRNQDVTYELL+NI SLD+  HITSTAPK+
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK

XP_038886720.1 glycosyltransferase BC10 isoform X1 [Benincasa hispida]7.7e-18995.22Show/hide
Query:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        M GSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLA SKNPKIAFMFLTPGS
Subjt:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY
        LPFEKLWHKF DGHDDRFS+YVHASREK+P  SPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQ+FVL+SE+CVPLHDFEYIYNYL+FTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCF DPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFK YCKRT DGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPK
        KWHPKSYRNQDVTYELLRNITSLDE VHITSTAP+
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPK

XP_038886723.1 glycosyltransferase BC10 isoform X2 [Benincasa hispida]1.3e-18895.51Show/hide
Query:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        M GSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLA SKNPKIAFMFLTPGS
Subjt:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY
        LPFEKLWHKF DGHDDRFS+YVHASREK+P  SPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQ+FVL+SE+CVPLHDFEYIYNYL+FTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCF DPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFK YCKRT DGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAP
        KWHPKSYRNQDVTYELLRNITSLDE VHITSTAP
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAP

TrEMBL top hitse value%identityAlignment
A0A0A0LSA2 Uncharacterized protein4.9e-18189.88Show/hide
Query:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        M GSRQRP LKPS+ I+ILVSLVSIF  G YVYP RTSLLCYIFSSGC+NGAFE+ LPVA RELTDEETA RV+MKEILK+PLAQSKNPKIAFMFLTPGS
Subjt:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY
        LPFEKLWHKFLDGHDDRFSIYVHASREK+  ASPHF+GRDIRSEKVAWGEISMVDAEKRLLANALLDP+NQHFVLLSESC+PLHDFEYIYNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCFEDPGPHG+GRYSE MLPEIEKKDFRKGSQWFSMKR+HAIIVMADSLYY KFKHYCKRTK+GPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK
        KWHPK+YR QDVTYELLRNITS+DEI+HIT+T PK+
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK

A0A1S3BCM6 uncharacterized protein LOC1034882454.3e-17787.5Show/hide
Query:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        M GSRQRPYLKPS+  +ILVSLVS+FL   YVYP R +LLCYIFSSGC+NGAFEQ LPVA RELTDEETAA+V+MKEILK+PLAQSKNPKIAFMFLTPG 
Subjt:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY
        LP EKLWHKF DGHDDRFSIYVHASR K+ + SPHFV RDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESC+PLHDFEYIYNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCFEDPGPHG+GRYSE MLPEIE+KDFRKGSQWFSMKRQHAII+MADSLYYTKFK YCKRTKDGPNCYADEHYF TLFHMIDPGGIANWSVTHVDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK
        KWHPK+YR QDVTYELL+NITSLDEI+H+T+T PK+
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK

A0A6J1BPK3 uncharacterized protein LOC111004663 isoform X12.0e-17486.01Show/hide
Query:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        M GSRQRPYLKP++YI ++VSLVS++LVG YVY P++SLLCYIFSSGC+NGAFEQH P A RELTDEETAARVV+KEILKRPLAQSKNPKIAFMFLTPG 
Subjt:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY
        LPFEKLWHKF  GHDDRFS+YVHASR+K  Y SP+FVG  IRSEKVAWGEISMVDAEKRLLANALLDPDN+HFVLLSESCVPLHDFEY+YNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCF D GPHG+GRYSERM PEIEKKDFRKGSQWFSMKRQHAII+MADSLY+TKFK YCKRTKDGPNCYADEHYFPTLF+MIDPGGIANWSVT+VDWSEG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK
        KWHP+SYRNQDVTYELL+N+TS DE VHITST P++
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK

A0A6J1GJ69 uncharacterized protein LOC1114546951.4e-17285.71Show/hide
Query:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        MLGSRQRPYLKP +YIIILVS+VSIFL G YV+PPR+S +CYIF SGCING F Q  P ASRELTD E A+RVV++EILKRPLAQSKNPKIAFMFLTPG 
Subjt:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY
        LPFEKLWHKF DGHD RFSIYVHASREK+ ++SPHFVGRDIRSEKV WGE+SMVDAEKRLLANAL+DPDNQHF L SESCVPLH+FEY+YNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCF+DPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKR HAIIVMADSLYYTKFK YCKRTKDGPNCYADEHYFPT F+MIDPGGI+NWSVTHVDW+EG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK
        KWHPKSYRNQDVTYELL+NI SLD+   ITSTAPK+
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK

A0A6J1KJD2 uncharacterized protein LOC1114962441.4e-17587.2Show/hide
Query:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS
        MLGSRQRPYLKP IYIIILVS+VSIFL G YV+PPR+S +CYIFS  CING F Q  P+ASRELTD E A+RVV++EILKRPLAQSKNPKIAFMFLTPG 
Subjt:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGS

Query:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY
        LPFEKLWHKF DGHDDRFSIYVHASREK+ ++SPHFVGRDIRSEKVAWGE+SMVDAEKRLLANAL+DPDNQHFVLLSESCVPLH+FEY+YNYLIFTNVSY
Subjt:  LPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSY

Query:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        IDCF+DPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHA+IVMADSLYY KFK YCKRTKDGPNCYADEHYFPT F+MIDPGGIANWSVTHVDW+EG
Subjt:  IDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK
        KWHPKSYRNQDVTYELL+NI SLD+  HITSTAPK+
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPKK

SwissProt top hitse value%identityAlignment
O22126 Fasciclin-like arabinogalactan protein 87.1e-2844.72Show/hide
Query:  RLLNQFPEFGAFNDYLTKTRLFEQINTRQTITVLALDNATVSSIAG-NSLDVIKQILDAHVILDYYDVAKMRKLSTNKATVLTTMFQSTGDAVNQQGFLK
        ++L   P++ +FN YL++T+L ++IN+R TITVL L+N  +S++AG + L VIK  L   V+LDYYD  K+ K+S    T+ TT++Q+TG+A    GF+ 
Subjt:  RLLNQFPEFGAFNDYLTKTRLFEQINTRQTITVLALDNATVSSIAG-NSLDVIKQILDAHVILDYYDVAKMRKLSTNKATVLTTMFQSTGDAVNQQGFLK

Query:  VLLNKRGQIEFGSAAKGAPLSAKLVKPVASQPYNISVLQISAPIVIPGIGVYNLPPPAPEA
        +   K G++ FGSAA G+ L +   K V   PYNIS+L+I API+ PG+    L  PAP A
Subjt:  VLLNKRGQIEFGSAAKGAPLSAKLVKPVASQPYNISVLQISAPIVIPGIGVYNLPPPAPEA

O49586 Fasciclin-like arabinogalactan protein 51.2e-2737.45Show/hide
Query:  ISLLDRTVGWLVLEGNVIGVVGKMKRLLNQFPEFGAFNDYLTKTRLFEQINTRQTITVLALDNATVSSIAGNSLDVIKQILDAHVILDYYDVAKMRKLST
        +SLL  T+   +L  + +     +     ++ +F    D   KT+L   I+  QTITVLA+ N  +SSI   S   ++ IL  HVILDYYD  K++ +  
Subjt:  ISLLDRTVGWLVLEGNVIGVVGKMKRLLNQFPEFGAFNDYLTKTRLFEQINTRQTITVLALDNATVSSIAGNSLDVIKQILDAHVILDYYDVAKMRKLST

Query:  NKATVLTTMFQSTGDAVNQQGFLKVLLNKRGQIEFGSAAKGAPLSAKLVKPVASQPYNISVLQISAPIVIPGIGVYNLPPPAPEAPYVAPVEAPAPSADA
         K+ +LTT++Q+TG      GFL V  +K G++ FGS  K +PL+A+ V  V   PYN+S++QI+ PIV PG+ +   PPP P   +VAP   P  ++  
Subjt:  NKATVLTTMFQSTGDAVNQQGFLKVLLNKRGQIEFGSAAKGAPLSAKLVKPVASQPYNISVLQISAPIVIPGIGVYNLPPPAPEAPYVAPVEAPAPSADA

Query:  PAPADDDDADSPSDAPSPASKAPAPAADAPDAPASSPSEAADEDDADAPGPADDASDNTSDKKSGGS
        PAP    D +SP  A       PAPA D P+A + +P+ +AD +  +A   AD A  ++S  K+G S
Subjt:  PAPADDDDADSPSDAPSPASKAPAPAADAPDAPASSPSEAADEDDADAPGPADDASDNTSDKKSGGS

Q65XS5 Glycosyltransferase BC101.9e-4134.22Show/hide
Query:  IFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGSLPFEKLWHKFLDG-HDDRFSIYVH
        + L+GV++      LL    SS  + G   +   V +     EE    V   E+ + PL    N ++AF+F+    LP + +W  F  G  + RFSI+VH
Subjt:  IFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGSLPFEKLWHKFLDG-HDDRFSIYVH

Query:  AS----REKLPYASPHFVGRDI-RSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSER
        +       +    S  F  R +  S +V WGE SM++AE+ LLA+AL DP N+ FV +S+SCVPL++F Y Y+Y++ ++ S++D F D     +GRY+ R
Subjt:  AS----REKLPYASPHFVGRDI-RSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSER

Query:  MLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTK--------DGP---------NCYADEHYFPTLF--HMIDPGGIANWSVTHVDW--
        M P I  +++RKGSQW  + R+HA +V+ D     +F+ +C+R          D P         NC  DEHY  TL   H ++   +   SVTH  W  
Subjt:  MLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTK--------DGP---------NCYADEHYFPTLF--HMIDPGGIANWSVTHVDW--

Query:  SEGK------WHPKSYRNQDVTYELLRNITSLDEIVHIT
        S  K      WHP +Y+  D T  L+++I  +D I + T
Subjt:  SEGK------WHPKSYRNQDVTYELLRNITSLDEIVHIT

Q9LZX4 Fasciclin-like arabinogalactan protein 106.4e-2943.01Show/hide
Query:  TVGWLVLEGNVIGVVGKMKRLLNQFPEFGAFNDYLTKTRLFEQINTRQTITVLALDNATVSSIAG-NSLDVIKQILDAHVILDYYDVAKMRKLSTNKATV
        T+  L +   V G    + ++L+  PE+ +FN+YL++T+L ++IN+R TITVL L+N  +SS+AG + L V+K  L   V+LDYYD  K+ +LS    T+
Subjt:  TVGWLVLEGNVIGVVGKMKRLLNQFPEFGAFNDYLTKTRLFEQINTRQTITVLALDNATVSSIAG-NSLDVIKQILDAHVILDYYDVAKMRKLSTNKATV

Query:  LTTMFQSTGDAVNQQGFLKVLLNKRGQIEFGSAAKGAPLSAKLVKPVASQPYNISVLQISAPIVIPGIGVYNLPPPAPEAPYVAPV
         TT++Q+TG A+   GF+ V   K G++ FGSAA G+ L +   K V   PYNISVL+I+API+ PGI    L  PAP +  V+ +
Subjt:  LTTMFQSTGDAVNQQGFLKVLLNKRGQIEFGSAAKGAPLSAKLVKPVASQPYNISVLQISAPIVIPGIGVYNLPPPAPEAPYVAPV

Q9ZQ23 Fasciclin-like arabinogalactan protein 32.1e-3240.89Show/hide
Query:  SLLDRTVGWLVLEGNVIGVVGKMKRLLNQFPEFGAFNDYLTKTRLFEQINTRQTITVLALDNATVSSIAGNSLDVIKQILDAHVILDYYDVAKMRKLSTN
        SLL  T+  L+   +++  V  + R+L ++PEF    + L KT L   IN RQTITVLAL+N  + SI+G   + +K IL  HV+LDY+D  K++ L   
Subjt:  SLLDRTVGWLVLEGNVIGVVGKMKRLLNQFPEFGAFNDYLTKTRLFEQINTRQTITVLALDNATVSSIAGNSLDVIKQILDAHVILDYYDVAKMRKLSTN

Query:  KATVLTTMFQSTGDAVNQQGFLKVLLNKRGQIEFGSAAKGAPLSAKLVKPVASQPYNISVLQISAPIVIPGIG-VYNLPPPAPEAPYVAPVEAPAPSADA
        K+T+LTT++QSTG    Q GFL       G+I FGS  KGAP +A+ +  V   PYN+SV+QIS PIV PG+G    +PPP P +   AP      +  A
Subjt:  KATVLTTMFQSTGDAVNQQGFLKVLLNKRGQIEFGSAAKGAPLSAKLVKPVASQPYNISVLQISAPIVIPGIG-VYNLPPPAPEAPYVAPVEAPAPSADA

Query:  PAPADDDDADSPSDAPSPASKAPAPAADAPD-APASSPSEAADEDDADAPGPADD--------ASDNTSDKKSGGSRGQIGGAGVVVAGLV
        PAPAD+ D             A AP   AP+ APAS+PSE      +D+P PA D        A+D      S  + G   GA V+V G V
Subjt:  PAPADDDDADSPSDAPSPASKAPAPAADAPD-APASSPSEAADEDDADAPGPADD--------ASDNTSDKKSGGSRGQIGGAGVVVAGLV

Arabidopsis top hitse value%identityAlignment
AT2G19160.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein4.2e-11656.5Show/hide
Query:  GSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGSL
        G+R R   +  I+II ++SL+++F++G Y++P  +   CY+FSS GC        LP + RE +D+E AARVV+ EIL  P    K+ KIAFMFLTPG+L
Subjt:  GSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGSL

Query:  PFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYI
        PFEKLW  F  GH+ +FS+Y+HAS++   + S +F+ R+IRS++V WG ISM+DAE+RLL NAL DP+NQ FVLLS+SCVPL  FEY+YNY++ +NVSY+
Subjt:  PFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYI

Query:  DCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDG-PNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG
        DCF+DPGPHG+GR+ + MLPEI ++DFRKG+QWFSMKRQHA++ +AD+LYY+KF+ YC    +G  NC ADEHY PT F+M+DP GIANW+VT+VDWSE 
Subjt:  DCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDG-PNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEG

Query:  KWHPKSYRNQDVTYELLRNITSLDEIVHITS
        KWHP+ Y  +D+T EL++NI+S+D +  +TS
Subjt:  KWHPKSYRNQDVTYELLRNITSLDEIVHITS

AT4G30060.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.2e-11759.16Show/hide
Query:  GSRQR--PYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPG
        G+R R  P  +  ++II+++SL+++F +  Y+YP  +   CY+ SS GC   A    LP + RE +D+E AARVV++EIL  P    KN KIAFMFLTPG
Subjt:  GSRQR--PYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPG

Query:  SLPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVS
        +LPFE+LW +F  GH+ +FS+Y+HAS+E+  + S +F+ R+IRS++V WG ISMVDAE+RLLANAL D  NQ FVLLS+SCVPL  FEYIYNYL+ +N+S
Subjt:  SLPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVS

Query:  YIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYC-KRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWS
        Y+DCF+DPG HG+GR+   MLPEI KKDFRKG+QWF+MKRQHA+  MADSLYY+KF+ YC    ++  NC ADEHY PT FHM+DPGGIANW+VT VDWS
Subjt:  YIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYC-KRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWS

Query:  EGKWHPKSYRNQDVTYELLRNITSLDEIVHITS
        E KWHPK+Y  +D+T+ELL N+TS D +VH+TS
Subjt:  EGKWHPKSYRNQDVTYELLRNITSLDEIVHITS

AT4G31350.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.2e-13366.07Show/hide
Query:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPG
        M  SRQRP  K   +II LV LV++ ++  ++YPPR S+ CY+FS  GC    ++Q L V +RELTD E AA+VVM EI+  P +++ NPK+AFMFLTPG
Subjt:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPG

Query:  SLPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVS
        +LPFE LW  F  GH+++FS+YVHAS++   + S +FVGRDI S KVAWG+ISMVDAE+RLLA+AL+DPDNQHF+LLS+SCVPL DF YIYN+LIF N+S
Subjt:  SLPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVS

Query:  YIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSE
        +IDCFEDPGPHGSGRYS+ MLPE+EKKDFRKGSQWFSMKR+HAI+VMADSLYYTKFK YC+   +G NCYADEHYFPTLF+MIDP GIANWSVTHVDWSE
Subjt:  YIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSE

Query:  GKWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPK
        GKWHPK Y  +D+T  L+R I S+    H+TS   K
Subjt:  GKWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPK

AT4G31350.2 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein2.2e-13366.07Show/hide
Query:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPG
        M  SRQRP  K   +II LV LV++ ++  ++YPPR S+ CY+FS  GC    ++Q L V +RELTD E AA+VVM EI+  P +++ NPK+AFMFLTPG
Subjt:  MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSS-GCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPG

Query:  SLPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVS
        +LPFE LW  F  GH+++FS+YVHAS++   + S +FVGRDI S KVAWG+ISMVDAE+RLLA+AL+DPDNQHF+LLS+SCVPL DF YIYN+LIF N+S
Subjt:  SLPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVS

Query:  YIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSE
        +IDCFEDPGPHGSGRYS+ MLPE+EKKDFRKGSQWFSMKR+HAI+VMADSLYYTKFK YC+   +G NCYADEHYFPTLF+MIDP GIANWSVTHVDWSE
Subjt:  YIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSE

Query:  GKWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPK
        GKWHPK Y  +D+T  L+R I S+    H+TS   K
Subjt:  GKWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPK

AT5G57270.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein9.3e-11659.29Show/hide
Query:  LGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYP----PRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLT
        L  R R  LK  + I++LV + S+ LV  Y+YP     ++S    + S GC   A    LPV  R+ TDEE AARVV+K+IL+ P A +   KIAFMFLT
Subjt:  LGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYP----PRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLT

Query:  PGSLPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTN
        PG+LPFEKLW KF  G + RFSIY+H SR +  + S HF  R+I S+ V WG ISMVDAE+RLLANAL DPDNQHFVLLSESC+PLH F+Y Y YL+  N
Subjt:  PGSLPFEKLWHKFLDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTN

Query:  VSYIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCK-RTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVD
        VS+ID FED GPHG+GR+ + MLPEI ++DFRKG+QWF+MKRQHA+IVMAD LYY+KF+ YC+   +   NC ADEHY PT FHM+DPGGI+NWSVT+VD
Subjt:  VSYIDCFEDPGPHGSGRYSERMLPEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCK-RTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVD

Query:  WSEGKWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPK
        WSE +WHPK+YR +DV+ +LL+NITS D  VH+TS   +
Subjt:  WSEGKWHPKSYRNQDVTYELLRNITSLDEIVHITSTAPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTAGGATCACGTCAAAGGCCATACTTAAAGCCATCTATATACATTATTATATTGGTTTCGTTGGTCAGCATATTTCTAGTTGGCGTCTATGTTTATCCACCTAGAAC
CTCCCTGCTTTGTTATATCTTCTCCAGTGGTTGTATCAATGGTGCATTTGAACAGCACCTGCCAGTTGCTTCTCGGGAATTAACTGATGAAGAAACTGCAGCTCGGGTTG
TAATGAAGGAAATTTTGAAGAGACCTCTGGCTCAGTCTAAGAATCCAAAAATTGCCTTTATGTTTTTGACTCCAGGTTCATTACCTTTTGAGAAGCTATGGCATAAATTT
CTTGATGGTCACGATGACAGATTTTCTATATATGTGCATGCATCTAGAGAGAAATTACCATACGCGAGCCCCCATTTTGTTGGTCGTGACATTCGCAGTGAAAAGGTAGC
CTGGGGAGAAATTTCTATGGTTGATGCAGAGAAGAGACTTTTGGCAAATGCACTTTTAGACCCTGATAATCAGCACTTTGTTTTATTATCTGAAAGTTGTGTACCTCTTC
ACGACTTTGAGTATATTTATAACTATTTGATATTTACAAACGTCAGCTATATTGATTGTTTTGAAGACCCTGGTCCCCATGGAAGTGGCAGGTATTCAGAGCGCATGTTA
CCGGAAATTGAAAAGAAAGATTTCCGTAAAGGTTCTCAGTGGTTTTCTATGAAGCGACAACATGCTATTATTGTAATGGCCGACAGTCTTTACTATACAAAATTCAAGCA
TTACTGCAAGCGAACTAAGGACGGCCCCAATTGCTACGCTGACGAGCACTATTTTCCAACTCTTTTCCATATGATCGACCCTGGTGGAATTGCAAACTGGTCAGTTACGC
ATGTTGATTGGTCTGAGGGAAAGTGGCATCCAAAATCATATAGGAACCAAGATGTCACCTATGAGCTTCTGAGGAACATTACTTCGCTAGATGAGATCGTTCACATTACA
AGTACTGCTCCGAAGAAACCAAGTGGAACTGCGTGGGCGTCGTACGACCACCCACTCGACGGCCACGATCCTTCCTCACACTCCCAATCCATCTCCACCGTCTCTTCAAA
ATCCCACCGCCGTCCCAGAGCGCGGGAAAAAGCCCCAACCCAACCTCTGTATATCACTGCAAACCAACGGTCACGGTTTTCTGCTTTCATGGCGCCGAGGAAGCGAAGCA
AGAACCAAGAAGACGAGCCGGCGGCGGAGAAGCCGGCACCGGCGTCGTCGAGGGTGACTCGGAGCTCGGCTCGGCTGGCGGCGAACTCCAGGGCTGATTTGGCGGTGGAT
GATGCTGTGACGAAGTTGCCGAAGAGTAAGAAGGCGAAACGTGCTCCGAAGGAGAATGGGAAGGTGGAGGAGGTTGAAAATGAAGGAGTGGAAGTTGATGCTGCTTTGGA
GAAACTTGACAAGGATGCGAAGAATAGAACGGTTGTGATTGAACATTGCAAACAGTGCCAATCATTCAAGAAAAGGGCCATCCAGGTGCAAAATGGTCTAGAGAATGGTG
TTCCTGGAATCACTGTGCTGCTTAACCCTGATAAGCCAAGAAGGGGTTGCTTTGAAATCCGGAGTGAAGATGGCGAGAAGTTTATCAGTCTTCTGGATCGAACGGTTGGT
TGGTTGGTTTTGGAAGGAAACGTCATCGGAGTTGTTGGGAAGATGAAAAGACTCCTCAATCAATTCCCTGAATTTGGAGCTTTCAATGATTATCTCACTAAAACTCGTCT
CTTCGAACAAATCAATACTCGTCAAACCATCACCGTTCTCGCTCTTGATAACGCCACCGTTTCTAGCATTGCCGGAAATTCTCTGGATGTAATTAAGCAGATTCTTGATG
CTCATGTTATTCTTGATTATTACGATGTTGCAAAGATGAGGAAACTTTCTACTAATAAAGCTACTGTACTTACTACTATGTTTCAATCCACCGGCGATGCTGTGAATCAA
CAAGGTTTCTTGAAAGTTTTGCTTAATAAAAGAGGTCAAATCGAATTCGGATCTGCAGCGAAAGGCGCTCCTCTTAGCGCTAAGCTTGTGAAACCTGTTGCTTCTCAGCC
TTATAATATCTCTGTTTTGCAAATTAGTGCTCCGATTGTGATCCCTGGGATTGGTGTTTACAATTTGCCGCCTCCTGCTCCTGAAGCACCGTATGTAGCGCCGGTTGAAG
CTCCGGCTCCATCGGCTGATGCTCCGGCGCCGGCGGACGATGATGATGCAGATTCTCCATCTGATGCTCCTTCACCGGCGTCCAAGGCACCGGCACCAGCGGCGGATGCT
CCTGATGCACCGGCGAGTTCTCCTTCAGAGGCGGCTGATGAGGATGATGCAGATGCGCCGGGTCCAGCCGACGATGCCTCGGACAATACCTCGGACAAGAAATCGGGGGG
TTCACGGGGTCAGATCGGCGGCGCCGGAGTTGTGGTGGCCGGATTGGTGTGGCTTTGGTTGGTTAATTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGCTAGGATCACGTCAAAGGCCATACTTAAAGCCATCTATATACATTATTATATTGGTTTCGTTGGTCAGCATATTTCTAGTTGGCGTCTATGTTTATCCACCTAGAAC
CTCCCTGCTTTGTTATATCTTCTCCAGTGGTTGTATCAATGGTGCATTTGAACAGCACCTGCCAGTTGCTTCTCGGGAATTAACTGATGAAGAAACTGCAGCTCGGGTTG
TAATGAAGGAAATTTTGAAGAGACCTCTGGCTCAGTCTAAGAATCCAAAAATTGCCTTTATGTTTTTGACTCCAGGTTCATTACCTTTTGAGAAGCTATGGCATAAATTT
CTTGATGGTCACGATGACAGATTTTCTATATATGTGCATGCATCTAGAGAGAAATTACCATACGCGAGCCCCCATTTTGTTGGTCGTGACATTCGCAGTGAAAAGGTAGC
CTGGGGAGAAATTTCTATGGTTGATGCAGAGAAGAGACTTTTGGCAAATGCACTTTTAGACCCTGATAATCAGCACTTTGTTTTATTATCTGAAAGTTGTGTACCTCTTC
ACGACTTTGAGTATATTTATAACTATTTGATATTTACAAACGTCAGCTATATTGATTGTTTTGAAGACCCTGGTCCCCATGGAAGTGGCAGGTATTCAGAGCGCATGTTA
CCGGAAATTGAAAAGAAAGATTTCCGTAAAGGTTCTCAGTGGTTTTCTATGAAGCGACAACATGCTATTATTGTAATGGCCGACAGTCTTTACTATACAAAATTCAAGCA
TTACTGCAAGCGAACTAAGGACGGCCCCAATTGCTACGCTGACGAGCACTATTTTCCAACTCTTTTCCATATGATCGACCCTGGTGGAATTGCAAACTGGTCAGTTACGC
ATGTTGATTGGTCTGAGGGAAAGTGGCATCCAAAATCATATAGGAACCAAGATGTCACCTATGAGCTTCTGAGGAACATTACTTCGCTAGATGAGATCGTTCACATTACA
AGTACTGCTCCGAAGAAACCAAGTGGAACTGCGTGGGCGTCGTACGACCACCCACTCGACGGCCACGATCCTTCCTCACACTCCCAATCCATCTCCACCGTCTCTTCAAA
ATCCCACCGCCGTCCCAGAGCGCGGGAAAAAGCCCCAACCCAACCTCTGTATATCACTGCAAACCAACGGTCACGGTTTTCTGCTTTCATGGCGCCGAGGAAGCGAAGCA
AGAACCAAGAAGACGAGCCGGCGGCGGAGAAGCCGGCACCGGCGTCGTCGAGGGTGACTCGGAGCTCGGCTCGGCTGGCGGCGAACTCCAGGGCTGATTTGGCGGTGGAT
GATGCTGTGACGAAGTTGCCGAAGAGTAAGAAGGCGAAACGTGCTCCGAAGGAGAATGGGAAGGTGGAGGAGGTTGAAAATGAAGGAGTGGAAGTTGATGCTGCTTTGGA
GAAACTTGACAAGGATGCGAAGAATAGAACGGTTGTGATTGAACATTGCAAACAGTGCCAATCATTCAAGAAAAGGGCCATCCAGGTGCAAAATGGTCTAGAGAATGGTG
TTCCTGGAATCACTGTGCTGCTTAACCCTGATAAGCCAAGAAGGGGTTGCTTTGAAATCCGGAGTGAAGATGGCGAGAAGTTTATCAGTCTTCTGGATCGAACGGTTGGT
TGGTTGGTTTTGGAAGGAAACGTCATCGGAGTTGTTGGGAAGATGAAAAGACTCCTCAATCAATTCCCTGAATTTGGAGCTTTCAATGATTATCTCACTAAAACTCGTCT
CTTCGAACAAATCAATACTCGTCAAACCATCACCGTTCTCGCTCTTGATAACGCCACCGTTTCTAGCATTGCCGGAAATTCTCTGGATGTAATTAAGCAGATTCTTGATG
CTCATGTTATTCTTGATTATTACGATGTTGCAAAGATGAGGAAACTTTCTACTAATAAAGCTACTGTACTTACTACTATGTTTCAATCCACCGGCGATGCTGTGAATCAA
CAAGGTTTCTTGAAAGTTTTGCTTAATAAAAGAGGTCAAATCGAATTCGGATCTGCAGCGAAAGGCGCTCCTCTTAGCGCTAAGCTTGTGAAACCTGTTGCTTCTCAGCC
TTATAATATCTCTGTTTTGCAAATTAGTGCTCCGATTGTGATCCCTGGGATTGGTGTTTACAATTTGCCGCCTCCTGCTCCTGAAGCACCGTATGTAGCGCCGGTTGAAG
CTCCGGCTCCATCGGCTGATGCTCCGGCGCCGGCGGACGATGATGATGCAGATTCTCCATCTGATGCTCCTTCACCGGCGTCCAAGGCACCGGCACCAGCGGCGGATGCT
CCTGATGCACCGGCGAGTTCTCCTTCAGAGGCGGCTGATGAGGATGATGCAGATGCGCCGGGTCCAGCCGACGATGCCTCGGACAATACCTCGGACAAGAAATCGGGGGG
TTCACGGGGTCAGATCGGCGGCGCCGGAGTTGTGGTGGCCGGATTGGTGTGGCTTTGGTTGGTTAATTTCTAA
Protein sequenceShow/hide protein sequence
MLGSRQRPYLKPSIYIIILVSLVSIFLVGVYVYPPRTSLLCYIFSSGCINGAFEQHLPVASRELTDEETAARVVMKEILKRPLAQSKNPKIAFMFLTPGSLPFEKLWHKF
LDGHDDRFSIYVHASREKLPYASPHFVGRDIRSEKVAWGEISMVDAEKRLLANALLDPDNQHFVLLSESCVPLHDFEYIYNYLIFTNVSYIDCFEDPGPHGSGRYSERML
PEIEKKDFRKGSQWFSMKRQHAIIVMADSLYYTKFKHYCKRTKDGPNCYADEHYFPTLFHMIDPGGIANWSVTHVDWSEGKWHPKSYRNQDVTYELLRNITSLDEIVHIT
STAPKKPSGTAWASYDHPLDGHDPSSHSQSISTVSSKSHRRPRAREKAPTQPLYITANQRSRFSAFMAPRKRSKNQEDEPAAEKPAPASSRVTRSSARLAANSRADLAVD
DAVTKLPKSKKAKRAPKENGKVEEVENEGVEVDAALEKLDKDAKNRTVVIEHCKQCQSFKKRAIQVQNGLENGVPGITVLLNPDKPRRGCFEIRSEDGEKFISLLDRTVG
WLVLEGNVIGVVGKMKRLLNQFPEFGAFNDYLTKTRLFEQINTRQTITVLALDNATVSSIAGNSLDVIKQILDAHVILDYYDVAKMRKLSTNKATVLTTMFQSTGDAVNQ
QGFLKVLLNKRGQIEFGSAAKGAPLSAKLVKPVASQPYNISVLQISAPIVIPGIGVYNLPPPAPEAPYVAPVEAPAPSADAPAPADDDDADSPSDAPSPASKAPAPAADA
PDAPASSPSEAADEDDADAPGPADDASDNTSDKKSGGSRGQIGGAGVVVAGLVWLWLVNF