; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g0326 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g0326
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationMC06:2596405..2602162
RNA-Seq ExpressionMC06g0326
SyntenyMC06g0326
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589613.1 Glycosyltransferase BC10, partial [Cucurbita argyrosperma subsp. sororia]2.43e-19172.68Show/hide
Query:  KSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYS--------PPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNFGPS
        KSL S LLLFA+GLA GF+L+LF FP    S  L+   S+S        PPP PPPPPPPS + L     PPP +HDM++EELLWRASL PR +P     
Subjt:  KSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYS--------PPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNFGPS

Query:  PPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESCI
           PKIAFLFLTK+GV+LAPLWE FFK  H  L+SIYVH +S +N+T+ S  SVF+ R+IPSKGVKWG PSMMEAERRLLANALLDFSNQRF+LLSESCI
Subjt:  PPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESCI

Query:  PLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSAT
        PLFNFST+Y+YLM SKTTF+E+YDLPGPVGRGRY+P+MRPTINLHQWRKGSQWFQIDRPLA++VVSD K+FPVF + C P CYMDEHYLPTLVGI FS T
Subjt:  PLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSAT

Query:  NSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH
        NSNRTLTWVDWS+GG HPT+F R DVNV LL+RLRTGSHC YNG  TNVCHLFARKFMPNSLNRLL+FAPKLM F+H
Subjt:  NSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH

KAG7023304.1 hypothetical protein SDJN02_14329, partial [Cucurbita argyrosperma subsp. argyrosperma]4.21e-19272.94Show/hide
Query:  KSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYS--------PPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNFGPS
        KSL S LLLFA+GLA GF+L+LF FP    S  L+   S+S        PPP PPPPPPPS + L     PPP +HDM++EELLWRASL PR +P     
Subjt:  KSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYS--------PPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNFGPS

Query:  PPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESCI
           PKIAFLFLTK+GV+LAPLWE FFK  H  L+SIYVH +S +N+T+ S  SVF+ R+IPSKGVKWG PSMMEAERRLLANALLDFSNQRF+LLSESCI
Subjt:  PPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESCI

Query:  PLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSAT
        PLFNFST+YNYLM SKTTF+E+YDLPGPVGRGRY+P+MRPTINLHQWRKGSQWFQIDRPLA++VVSD K+FPVF + C P CYMDEHYLPTLVGI FS T
Subjt:  PLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSAT

Query:  NSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH
        NSNRTLTWVDWS+GG HPT+F R DVNV LL+RLRTGSHC YNG  TNVCHLFARKFMPNSLNRLL+FAPKLM F+H
Subjt:  NSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH

XP_022134723.1 uncharacterized protein LOC111006926 [Momordica charantia]2.18e-295100Show/hide
Query:  MTIKQQQQQQSDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYSPPPRPPPPPPPSLLRLPDDDTPPPPMHDMS
        MTIKQQQQQQSDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYSPPPRPPPPPPPSLLRLPDDDTPPPPMHDMS
Subjt:  MTIKQQQQQQSDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYSPPPRPPPPPPPSLLRLPDDDTPPPPMHDMS

Query:  DEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRL
        DEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRL
Subjt:  DEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRL

Query:  LANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCT
        LANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCT
Subjt:  LANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCT

Query:  PPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH
        PPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH
Subjt:  PPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH

XP_022988481.1 uncharacterized protein LOC111485710 [Cucurbita maxima]4.14e-19072.22Show/hide
Query:  KSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYS---------PPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNFGP
        KSL S LLLFA+GLA GF+L+LF FP    S  L+   S+S         PPP P PPPPPS + L     PPP +HDM++EELLWRASL PR +P    
Subjt:  KSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYS---------PPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNFGP

Query:  SPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESC
            PKIAFLFLTK+GV+LAPLWE FFK  H  L+SIYVH +S +N+T+ S  SVF+ R+IPSKGVKWG PSMMEAERRLLANALLDFSNQRF+LLSESC
Subjt:  SPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESC

Query:  IPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSA
        IPLFNFST+Y+YLM SKTTF+E+YDLPGPVGRGRY+P+MRPTINLHQWRKGSQWFQIDRPLA++VVSD K+FPVF + C P CYMDEHYLPTLVGI FS 
Subjt:  IPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSA

Query:  TNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH
        +NSNRTLTWVDWS+GG HPT+F R DVNV LL+RLRTGSHC YNGV TNVCHLFARKFMPNSLNRLL+FAPKLM F+H
Subjt:  TNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH

XP_038895081.1 glycosyltransferase BC10-like [Benincasa hispida]3.14e-19373.82Show/hide
Query:  MHFKLKSLFSQLLLFASGLAIGFSLSLF--NFPLLQISPGLAATSSYS----PPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNF
        MHF+  SL S LL FA+GLA GF+L+LF   FP  Q S  L+   ++S    P P PPP  PPS + L D   PPP +HDM++EELLWRASL PR +PNF
Subjt:  MHFKLKSLFSQLLLFASGLAIGFSLSLF--NFPLLQISPGLAATSSYS----PPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNF

Query:  GPSPPPP--KIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLL
         PS      KIAFLFLTK+GV LAPLWELFFK  H   +SIYVH +  SN+T+ S SSVF+ R+IPSKGVKWGEPSMMEAERRLLANALLDFSNQRF+LL
Subjt:  GPSPPPP--KIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLL

Query:  SESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGI
        SESCIPLFNFST+YNYLMGSK+TFIEAYDLPGPVGRGRY P+MRPTINLHQWRKGSQWF++DR +A++VVSDHKFFPVF KFC P CYMDEHYLPT VGI
Subjt:  SESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGI

Query:  KFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH
        +FS TNSNRTLTWVDWSRGG HPTRFIR DV V LL+RLR GSHC+YNGV TN+CHLFARKFMPNSLNRLLMFAPKLM+F+H
Subjt:  KFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH

TrEMBL top hitse value%identityAlignment
A0A0A0LUY5 Uncharacterized protein1.64e-18166.92Show/hide
Query:  SDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSLF--NFPLLQISPGLAATSSY-----SPPPRP----PPPPPPSLLRLPDDDTPPPPMHDM
        S+PK H+ F          KS FS  LLF++GLA GF+L+LF   FP  Q S  L+   ++     SP P P    PPPPPPS + L +   PPP +HDM
Subjt:  SDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSLF--NFPLLQISPGLAATSSY-----SPPPRP----PPPPPPSLLRLPDDDTPPPPMHDM

Query:  SDEELLWRASLAPRVVPNFGPSPPPP---KIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLP-STSSVFFRRTIPSKGVKWGEPSMME
        ++EELLWRASL PR +P    +       KIAFLFLTK+GV+LAPLWELFFK  +  L+SIYVH    S+ST    +SSVF+ R+IPSKGVKWGEPSMME
Subjt:  SDEELLWRASLAPRVVPNFGPSPPPP---KIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLP-STSSVFFRRTIPSKGVKWGEPSMME

Query:  AERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVF
        AERRLLANALLDFSN+RF+LLSESCIPLFNFSTVYNYLMGSK+TFIEAYDLPGPVGRGRY P+MRP I LHQWRKGSQWF++DR +A++V+SD K+F VF
Subjt:  AERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVF

Query:  HKFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMR
         KFC P CYMDEHYLPT VGI+F  TNSNRTLTWVDWSRGG HPTRF+R DV +ELL+RLR G HC+YNGV TN+CHLFARKFM NSLNRLLMFAPKLM 
Subjt:  HKFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMR

Query:  FD
        F+
Subjt:  FD

A0A1S3BX15 uncharacterized protein LOC1034941401.41e-17078.41Show/hide
Query:  MSDEELLWRASLAPRVVPNFGPSPPPP--KIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEA
        M++EELLWRASL PR +P   PS      KIAFLFLTK+GV+LAPLWELFFK  +  L+SIYVH +  SNST+ S SSVF+ R+IPSKGVKWGEPSMMEA
Subjt:  MSDEELLWRASLAPRVVPNFGPSPPPP--KIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEA

Query:  ERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFH
        ERRLLANALLDFSNQRF+LLSESCIPLFNFST+YNYLM SK+TFIEAYDLPGPVGRGRY P+MRP INLHQWRKGSQWF++DR +A++VVSD K+FPVF 
Subjt:  ERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFH

Query:  KFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRF
        KFC P CYMDEHYLPT VGI+FS +NSNRTLTWVDWSRGG HPTRFIR DV VELL+RLR+GSHC+YNGV TN+CHLFARKFM NSLNRLLMFAPKLM F
Subjt:  KFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRF

Query:  D
        +
Subjt:  D

A0A6J1C2T7 uncharacterized protein LOC1110069261.06e-295100Show/hide
Query:  MTIKQQQQQQSDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYSPPPRPPPPPPPSLLRLPDDDTPPPPMHDMS
        MTIKQQQQQQSDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYSPPPRPPPPPPPSLLRLPDDDTPPPPMHDMS
Subjt:  MTIKQQQQQQSDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYSPPPRPPPPPPPSLLRLPDDDTPPPPMHDMS

Query:  DEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRL
        DEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRL
Subjt:  DEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRL

Query:  LANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCT
        LANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCT
Subjt:  LANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCT

Query:  PPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH
        PPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH
Subjt:  PPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH

A0A6J1JHA6 uncharacterized protein LOC1114857102.01e-19072.22Show/hide
Query:  KSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYS---------PPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNFGP
        KSL S LLLFA+GLA GF+L+LF FP    S  L+   S+S         PPP P PPPPPS + L     PPP +HDM++EELLWRASL PR +P    
Subjt:  KSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYS---------PPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNFGP

Query:  SPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESC
            PKIAFLFLTK+GV+LAPLWE FFK  H  L+SIYVH +S +N+T+ S  SVF+ R+IPSKGVKWG PSMMEAERRLLANALLDFSNQRF+LLSESC
Subjt:  SPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESC

Query:  IPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSA
        IPLFNFST+Y+YLM SKTTF+E+YDLPGPVGRGRY+P+MRPTINLHQWRKGSQWFQIDRPLA++VVSD K+FPVF + C P CYMDEHYLPTLVGI FS 
Subjt:  IPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSA

Query:  TNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH
        +NSNRTLTWVDWS+GG HPT+F R DVNV LL+RLRTGSHC YNGV TNVCHLFARKFMPNSLNRLL+FAPKLM F+H
Subjt:  TNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH

A0A6P5S157 uncharacterized protein LOC1107540596.31e-16058.92Show/hide
Query:  MTIKQQQQQQSDPKAHLI--FLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSL----FNFPLLQISPGLAATS------SYSPPPRPPPP----------
        M  KQQQQQQ  P + +   FL +Q+    L  + S  L+FA GLAIG S+S     F F L       + +S      S +PPP PPPP          
Subjt:  MTIKQQQQQQSDPKAHLI--FLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSL----FNFPLLQISPGLAATS------SYSPPPRPPPP----------

Query:  ------PPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNFG-PSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPS
              PPP  + L +   PP  MHDM D ELLWRASL PR     G P    PK+AF+FLT+  +ALAP WE+FFK  H  L+SIYVH+N   N T+P 
Subjt:  ------PPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNFG-PSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPS

Query:  TSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKG
          SVF+ R +PSK V WGEP+M++AERRLLANALLDFSNQRFVLLSESCIPLFNF  +YNYLM S  TF+EAYDLPGPVGR RYRP+M P I L+QWRKG
Subjt:  TSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKG

Query:  SQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVC
        SQWF++DR +A EVVSD K+FP+F K+C P CY DEHYLPT V IKF   NSNRTLTWVDWSRGGPHP++F+R DV VE LE+LR G+ C+YNG ST+VC
Subjt:  SQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVC

Query:  HLFARKFMPNSLNRLLMFAPKLMRFD
        HLFARKF+PN+L+RLL FAPKLM+F+
Subjt:  HLFARKFMPNSLNRLLMFAPKLMRFD

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC104.3e-3935.83Show/hide
Query:  SPPPP---KIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFF--RRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVL
        +P PP   ++AFLF+ +N + L  +W+ FF+      FSI+VHS      T  +T S FF  R+   S  V WGE SM+EAER LLA+AL D  N+RFV 
Subjt:  SPPPP---KIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFF--RRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVL

Query:  LSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFC----------------
        +S+SC+PL+NF+  Y+Y+M S T+F++++        GRY PRM P I +  WRKGSQW  + R  A  VV D +  P F K C                
Subjt:  LSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFC----------------

Query:  ------TPPCYMDEHYLPTLV---GIKFSATNSNRTLTWVDW--------SRGGPHPTRFIRNDVNVELLERLRTGSH-----------CDYNGVSTNVC
                 C  DEHY+ TL+   G++   T   R++T   W         R G HP  +  +D    L++ ++   +           C  NG     C
Subjt:  ------TPPCYMDEHYLPTLV---GIKFSATNSNRTLTWVDW--------SRGGPHPTRFIRNDVNVELLERLRTGSH-----------CDYNGVSTNVC

Query:  HLFARKF
         LFARKF
Subjt:  HLFARKF

Arabidopsis top hitse value%identityAlignment
AT1G51770.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein9.0e-8545.48Show/hide
Query:  QQQQQSDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYSPPPRPPPPPPPSLLRLPDDDT---------PPPPM
        ++   S PK+ +      L+  +L  +    L+   G+++  S+ +  F  +Q               R  P  P +LL   + ++         P    
Subjt:  QQQQQSDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYSPPPRPPPPPPPSLLRLPDDDT---------PPPPM

Query:  HDMSDEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEA
        H M+D ELLWRAS+ P+   N  P    PK+AF+FL K  +  APLWE F K  H  L+SIYVHS     S   S SSVF+RR IPS+ V WGE SM EA
Subjt:  HDMSDEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEA

Query:  ERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFH
        ERRLLANALLD SN+ FVLLSESCIPL  FS +Y+Y+  S+ +F+ A D  GP GRGRYR  M P I L QWRKGSQWF+I+R LA E+V D  ++P F 
Subjt:  ERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFH

Query:  KFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLM
        +FC PPCY+DEHY PT++ +K     +NRTLTW DWSRGG HP  F + DV    L++L     C YN   + +C+LFARKF P++L  LL  APK++
Subjt:  KFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLM

AT1G68390.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein3.9e-10452.49Show/hide
Query:  SDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSL--SLFNFP-----LLQISPGLAATSSYSPPPRPPPPPPPSLLRLPDD-------DTPPPPM
        S P      L  Q  HF   +L S  L+   G+ IG  L  SL NF       +Q    L   SS   PP PPPPPPPS    P+        + P   M
Subjt:  SDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSL--SLFNFP-----LLQISPGLAATSSYSPPPRPPPPPPPSLLRLPDD-------DTPPPPM

Query:  HDMSDEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEA
        HDM DEELLWRAS+AP+ + N+ P P  PK+AF+F+TK  + LA LWE FF+  H  LF+IYVHS    N + P   SVF  R IPSK V WG  +M+EA
Subjt:  HDMSDEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEA

Query:  ERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFH
        E+RLLANALLD SN+RFVLLSESCIPLFNF+TVY+YL+ S  T +E+YD  G VGRGRY P M+P + L  WRKGSQW ++DR +A E++SD  ++P+F+
Subjt:  ERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFH

Query:  KFCTPPCYMDEHYLPTLVGIKFS--ATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLM
         +C   CY DEHY+PTL+ IK S    NSNRTLTWVDWS+GGPHP RFIR++V  E +E LR+G  C YNG  TN+C+LFARKF+P +L+RLL  +  ++
Subjt:  KFCTPPCYMDEHYLPTLVGIKFS--ATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLM

Query:  RF
         F
Subjt:  RF

AT3G21310.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein3.5e-8952.61Show/hide
Query:  PPM---HDMSDEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGE
        PP+   H M+D ELLWRAS+ PR++    P    PK+AF+FLTK  +  APLWE FFK  H   +SIYVH+     S  PS SSVF+RR IPS+ V WGE
Subjt:  PPM---HDMSDEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGE

Query:  PSMMEAERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHK
         SM +AERRLLANALLD SN+ FVLLSE+CIPL  F+ VY Y+  S+ +F+ + D  GP GRGRY   M P ++L++WRKGSQWF+I+R LA ++V D  
Subjt:  PSMMEAERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHK

Query:  FFPVFHKFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFA
        ++  F +FC PPCY+DEHY PT++ I +    +NRTLTW DWSRGG HP  F + D+  + +++L  G  C YN   + VC+LFARKF P++L  LL  A
Subjt:  FFPVFHKFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFA

Query:  PKLMRF
        PK++ F
Subjt:  PKLMRF

AT5G11730.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein4.6e-8948.11Show/hide
Query:  FSQLLLFASGLAIGFSLSLFNFPLLQI------SPGLAATSSYSPPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNFGPSPPPPK
        F   LL   GL + FS+++F   +  I      S     TSS+ P     P      ++      P   MH+MSDEELLWRAS  PR      P    PK
Subjt:  FSQLLLFASGLAIGFSLSLFNFPLLQI------SPGLAATSSYSPPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASLAPRVVPNFGPSPPPPK

Query:  IAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESCIPLFNF
        +AF+FLTK  + LA LWE F K  H  L+S+Y+H +    +  P+ SSVF RR IPS+  +WG  SM +AE+RLLANALLD SN+ FVL+SESCIPL+NF
Subjt:  IAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESCIPLFNF

Query:  STVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSATNSNRT
        +T+Y+YL  SK +F+ A+D PGP GRGRY   M P + L +WRKGSQWF+++R LAA +V D  ++P F +FC P CY+DEHY PT++ I+     +NR+
Subjt:  STVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSATNSNRT

Query:  LTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRF
        LTWVDWSRGGPHP  F R+D+      ++  G +C YNG +T++C+LFARKF P++L  LL  APK++ F
Subjt:  LTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRF

AT5G25970.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.3e-9153.85Show/hide
Query:  MHDMSDEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMME
        MH+MSDEELLW AS  PR      P    PKIAF+FLT   + LAPLWE   K  H  L+S+Y+HS   S++  P+ SSVF+RR IPS+  +WG  +M +
Subjt:  MHDMSDEELLWRASLAPRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMME

Query:  AERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVF
        AERRLLANALLD SN+ FVLLSESCIPLFNF+T+Y Y+  S+ +F+ ++D PG  GRGRY   M P + + QWRKGSQWF+I+R LA  +V D  ++P F
Subjt:  AERRLLANALLDFSNQRFVLLSESCIPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVF

Query:  HKFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLM
         +FC P CY+DEHY PT++ I+  A  +NR++TWVDWSRGG HP  F   D+N E   R+  G +C YNG  T++C+LFARKF P++L  L+  APKL+
Subjt:  HKFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWVDWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGATTAAACAGCAGCAGCAGCAGCAATCGGATCCGAAGGCGCACCTTATTTTCTTGAAACTCCAATTAATGCACTTCAAATTGAAATCCCTCTTCTCCCAGCTCCT
CCTCTTCGCCTCCGGCCTCGCCATCGGCTTCTCCCTCTCCCTCTTCAATTTCCCTCTCCTCCAGATCTCCCCCGGCCTCGCCGCCACCTCTTCCTACTCCCCTCCGCCGC
GTCCGCCTCCGCCGCCGCCGCCATCTCTCCTCCGCTTACCCGACGACGACACGCCGCCCCCGCCCATGCACGACATGTCGGACGAGGAGCTTCTCTGGAGGGCCTCTCTC
GCCCCCCGTGTTGTTCCAAACTTCGGCCCCTCCCCGCCGCCCCCCAAAATCGCCTTCTTATTTCTTACCAAAAACGGCGTCGCTTTGGCTCCCCTCTGGGAATTGTTCTT
CAAACCCGCCCACCATTCCCTCTTCTCCATTTACGTCCATTCCAATTCCATTTCCAATTCTACCCTCCCCTCCACCTCCTCCGTCTTCTTCCGCCGCACCATCCCCAGCA
AGGGAGTGAAATGGGGGGAGCCGAGCATGATGGAGGCGGAGCGGCGGCTGCTAGCGAATGCGCTGCTAGACTTCTCAAACCAGAGATTCGTCCTCCTCTCCGAATCCTGC
ATCCCTCTCTTCAACTTCTCCACCGTCTACAACTACTTAATGGGCTCCAAAACCACCTTCATCGAGGCCTACGACTTACCCGGCCCAGTGGGCCGCGGCCGCTACAGGCC
TCGCATGCGGCCCACTATCAACCTCCACCAGTGGCGCAAGGGCTCCCAGTGGTTCCAAATCGACCGCCCCCTCGCCGCCGAGGTCGTCTCCGACCACAAATTCTTCCCCG
TCTTCCACAAATTCTGCACGCCCCCCTGCTACATGGACGAGCACTACCTCCCCACCCTCGTCGGGATCAAATTCTCCGCCACCAACTCCAACCGGACCCTCACCTGGGTC
GACTGGTCCCGGGGCGGGCCCCACCCGACCCGCTTCATCCGGAATGACGTCAACGTTGAATTGCTCGAGCGGCTCAGGACCGGTTCCCACTGCGACTACAATGGAGTGAG
CACCAATGTCTGCCATTTGTTTGCCAGGAAATTCATGCCCAATTCTTTGAATAGACTCCTAATGTTTGCCCCTAAGCTCATGCGGTTCGATCATTGA
mRNA sequenceShow/hide mRNA sequence
CGCTTGTTGCAGTGTGGAAAACTTTTGAGTTCTTCTTCTCTTCATCGATCTCTGTGTTTCATCTTTCAATGACGATTAAACAGCAGCAGCAGCAGCAATCGGATCCGAAG
GCGCACCTTATTTTCTTGAAACTCCAATTAATGCACTTCAAATTGAAATCCCTCTTCTCCCAGCTCCTCCTCTTCGCCTCCGGCCTCGCCATCGGCTTCTCCCTCTCCCT
CTTCAATTTCCCTCTCCTCCAGATCTCCCCCGGCCTCGCCGCCACCTCTTCCTACTCCCCTCCGCCGCGTCCGCCTCCGCCGCCGCCGCCATCTCTCCTCCGCTTACCCG
ACGACGACACGCCGCCCCCGCCCATGCACGACATGTCGGACGAGGAGCTTCTCTGGAGGGCCTCTCTCGCCCCCCGTGTTGTTCCAAACTTCGGCCCCTCCCCGCCGCCC
CCCAAAATCGCCTTCTTATTTCTTACCAAAAACGGCGTCGCTTTGGCTCCCCTCTGGGAATTGTTCTTCAAACCCGCCCACCATTCCCTCTTCTCCATTTACGTCCATTC
CAATTCCATTTCCAATTCTACCCTCCCCTCCACCTCCTCCGTCTTCTTCCGCCGCACCATCCCCAGCAAGGGAGTGAAATGGGGGGAGCCGAGCATGATGGAGGCGGAGC
GGCGGCTGCTAGCGAATGCGCTGCTAGACTTCTCAAACCAGAGATTCGTCCTCCTCTCCGAATCCTGCATCCCTCTCTTCAACTTCTCCACCGTCTACAACTACTTAATG
GGCTCCAAAACCACCTTCATCGAGGCCTACGACTTACCCGGCCCAGTGGGCCGCGGCCGCTACAGGCCTCGCATGCGGCCCACTATCAACCTCCACCAGTGGCGCAAGGG
CTCCCAGTGGTTCCAAATCGACCGCCCCCTCGCCGCCGAGGTCGTCTCCGACCACAAATTCTTCCCCGTCTTCCACAAATTCTGCACGCCCCCCTGCTACATGGACGAGC
ACTACCTCCCCACCCTCGTCGGGATCAAATTCTCCGCCACCAACTCCAACCGGACCCTCACCTGGGTCGACTGGTCCCGGGGCGGGCCCCACCCGACCCGCTTCATCCGG
AATGACGTCAACGTTGAATTGCTCGAGCGGCTCAGGACCGGTTCCCACTGCGACTACAATGGAGTGAGCACCAATGTCTGCCATTTGTTTGCCAGGAAATTCATGCCCAA
TTCTTTGAATAGACTCCTAATGTTTGCCCCTAAGCTCATGCGGTTCGATCATTGAACCTTATGCGCCCATACTTTCTATATAGTTTATATCTTCATTCCTTTCAATTTTT
GAGCCAATATCATTCGTTCGCATGTAATTTAACAACCTGGGACAAGTAAAAACCTTGGCCTTTTTGTTCTATTAGGTTCTAATATTCTTAAAGACTGAAGAATCAATTTA
ATGAACACTAACAAAACATATATGCAGCAACCCATCATTGAAAATGGAATGATTATGATGAACTACTTGAGTTTACAGTGGTGGTATAATGGTGGGAGGAGAGAGCTTGA
TCAAGCGCTTACGGTTTCGGTAGTTGCTGTGGATGTGCTGCTAACCCAAGATGTAGATAGGGAATTTGGAGAAGAAGCATAGCCATTCTTCAACAGCACGGCGTCGTCCT
TCGCCGAAATTTCGGATTCTTGATCCTTCCCTTTATCGGTGTTTTGTTTCTGTATGGTGTTCAAGGTCTCGAGAACCTCCATCATTGATGGCCTCTCGTCTTTCATACCC
TGCAGGCATCGAAATGCCAACTCGGCTACACTGGTTATCATCTCTCGGATTTTGTAGTCCGACTCGAAACCAAGCGTGGGGTCGACGAGCTCGTGCAATGCGTGGCTTTG
GATCTTGTTGATGGCCATGTTGAACAGGTTGATCTCTTGTCTATGCCTTGTGATGTCAACGGCGGGCATTGAGGACATGAGCTCAACTAGTACCACTCCAAAGCTGAAGA
CATCACTTCTATCGGAGAGTTGGTAGCACTGATGATACTCGGGATCAACGTAGCCGGGCGTGCCTTGTGGAGTGGTCGAAATGTGTGAGACATCGAGAGGGAAGAGACGG
GACAGTCCGAAATCAGCTACTTTAACACAAAAGTTGTTGTCAAGGAGAATGTTGTTGGTTTTGACATCACGGTGGATGATTTCAGAAGCATGGAGGTACACCAAAGCGCT
TGCAGTCTCTATAGCAATCTTCATTCTTGTAGACCATGGAAGCTTGCCAGGCTTTGCTAGTTTGCCATGGAGGTGATCAGCCACGGTGCCGTTCGGGATGTACTCATACA
CAAGCAAAAGCTCACGGCTATGGCGGGAGGTGCAACCGTAAAGCGAAACGAGGTTGCGGTGGCGTAGGCGGGCAAGGATATCCACCTCATTCATGAACTGCTCAACTCTC
TTAAAATTGCTTTCAAACAAACGTTTTACAGCAACTGCACGCCCATCTTTGAGTAATCCTAAACAGAGATATGAACAAAAAGATCCATCAAACAATCACCATCTGATAAT
TAAGCAAGCACTATGTTTAAGATTAGATTTAAACGGCTGCTTTAAGCTCTAACCATGATAAACTGTGCCAAAACCTCCATCTCCAAGCTCTTTATTGGAGTCAAAATGGT
GGGTGGCTTCTTCAAGCTCTTTATAGGAGAAGAGGTGAACACCCAAGTAAGTACCACCCTTCTCAAGTTCTTCCACAGAAGGCGGATTCGGCGGGTTCAACGGATCCGAG
GGAATGCTTCGTTGCACATACGGCAGCGTGCGAGAAGTTTTATGGCGGAGCTGCCGCCGGTACCAGATGCCGAGAACAAGTAAAGTTAAAAGCAAGGTTCCAAAACCAAC
ACATGTGCCTGCGCACGAATGAAAACAATGGAGTTTTTAATCAATTGTTAGACAGGCAAACATGTGAACAGAGGACGAAAAACAGAGCAGCTTTCGTGTATTACCAATTA
TAATCTTATTTCTGATATCATGCTTATTATCATTCCTGCGCTCTGAAACATTAAGCAATAAGGAAAGTTAACAACAATATCAGACTAAACGTAAGAAGAAAACTTGGAAA
AGAAACCTTTGAAACAACCACTGCCAAAGAATGCCCATATCTGTTCATCAGTTCATGCTAAGTAAAACTTTCAAGCAAAAACAATCTTAGCAGTCTCAAGAAATATGTAA
ATCAGAAAAAGAAGAGAGACACACAACCCAAAATATCAGGTTTGCAGTTAACGAAATTAGGCACATCGCTACTGATTATTATTTTCAAATCCAGAGAGCCCACCACTTTG
ATCGCAGCCGCTGCAGTACTGTGCAGTCCAATTCAGAACGAAACCCATCTTCAAAATCTCAACATAATTCATTTCCCGTAGGACATTTGGACTAACATTCGGATTCTTGC
GAATAGGCAGCCGAATCGAAGAGTGGCATGAATGCTCCGAAAGGTCCTGGTGGAAAGTCGCAAAAGAAAAGAGGGAGGAATCGTCGCTAGAACAGTTGAGTTCATAAATC
TGATCCGCGGGCTTCGATAGGCAATCATAGAAGAAGAAGAAATCTTCATTATCCAAAGTACAACGGAATGGTGTTCGAGTGAGGCTGGTGTTATGAGAAGGGCATGAATG
TTCATAGCCGTAAGCTGAAGGACTGACCAACAGAAACGAATGGTTTTGGGGGAAAATGTCCTCGATGATAAAACCCTCGCCGGAAATTTCAAGAACAGGGTATTTTTCAT
CATTACATACGATTTTGAAATCTGGGTATCCGCAGAAAGGTTCGTGGACACCAGATATCCAAAAGGGGTAGCTTATATTTGGACCATTTCCACAGCTTCGAGGCGCACAG
GCCTTGAATTTCTTGTCCAGAGAGGCGATTTCAGAAGCAAAAGTCGAGAAGAAGACAGTGAAGAGGCAAATAAGCGTGGGATTGAAGAGGGTTTGGAGTTTGAAGTCCAT
AGGAAGAAGTTGAGAAGAGTGAAGAAGATGAAAATGGGGTAGAGAAATGGGAATCGAAGGAAGGGAAGCACCACATGCCTTTCGAAGAAGTTTCGAAATTGCTTTTCAAT
CTGTTGAATTTGGTAAACGGAGAATGGAAAAAGGAAAGAAACAGAGGACAAACTTTACCAGTTCTTCAACAACCACCCACGACCCCCAAATTTCCATAAACGAAAACTTC
AGTTTTCAGACCAAACAACGGCTCAACAAGACGGTGTTTTTCAAACTTTTACTGGGTCCGGCTGAAATTGGTGGGATTTTGAGTGGAAAATACCCATTGGAAGACAGGGA
CAAAATTGTCATTATAAATGCGGATGATTTGGTAGAGGAGACCGACTTCCATGGTTTCCATGTGGGAAGGTTTGGTTGGTTGAAGAAGAAAGAAAATGCACAGAAAAACT
AGATGCAGAGTTTTTTAC
Protein sequenceShow/hide protein sequence
MTIKQQQQQQSDPKAHLIFLKLQLMHFKLKSLFSQLLLFASGLAIGFSLSLFNFPLLQISPGLAATSSYSPPPRPPPPPPPSLLRLPDDDTPPPPMHDMSDEELLWRASL
APRVVPNFGPSPPPPKIAFLFLTKNGVALAPLWELFFKPAHHSLFSIYVHSNSISNSTLPSTSSVFFRRTIPSKGVKWGEPSMMEAERRLLANALLDFSNQRFVLLSESC
IPLFNFSTVYNYLMGSKTTFIEAYDLPGPVGRGRYRPRMRPTINLHQWRKGSQWFQIDRPLAAEVVSDHKFFPVFHKFCTPPCYMDEHYLPTLVGIKFSATNSNRTLTWV
DWSRGGPHPTRFIRNDVNVELLERLRTGSHCDYNGVSTNVCHLFARKFMPNSLNRLLMFAPKLMRFDH