; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012737 (gene) of Snake gourd v1 genome

Gene IDTan0012737
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCore-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein
Genome locationLG09:70016754..70019353
RNA-Seq ExpressionTan0012737
SyntenyTan0012737
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016757 - transferase activity, transferring glycosyl groups (molecular function)
InterPro domainsIPR003406 - Glycosyl transferase, family 14
IPR044174 - Glycosyltransferase BC10-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6589613.1 Glycosyltransferase BC10, partial [Cucurbita argyrosperma subsp. sororia]1.1e-16677.49Show/hide
Query:  LFKLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP---SPLSLRLTFNQLS--------SPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVPR
        LFKLQ  FKSL S  LLFAAGLA GFTLTLF F  P   SPLSL  +F+QL          PPP  PPSRVGL+ FL PPPALHDMTEEELLWRASLVPR
Subjt:  LFKLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP---SPLSLRLTFNQLS--------SPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVPR

Query:  RIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFIL
        RIP  P      KIAFLFLTKDGVSLAPLWE FFKGHR LYSIYVH S S+NATV S+SVFY RSIPSKGVKWG  SMMEAERRLLANALLDFSNQRFIL
Subjt:  RIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFIL

Query:  LSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVG
        LSESCIPLFNFSTIY+YLM SK+TF+E+YDLPGPVGRGRY P+MRPTI L QWRKGSQWF++DR +A++VVSDQKYFPVF++HCKPSCYMDEHYLPTLVG
Subjt:  LSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVG

Query:  IKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFN
        I FS  NSNRTLTWVDWS+GG HPT+F R DV VGLL++LR+G+ C+YNG  TNVCHLFARKF+PNSLNRLL+FAPKLM FN
Subjt:  IKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFN

KAG7023304.1 hypothetical protein SDJN02_14329, partial [Cucurbita argyrosperma subsp. argyrosperma]3.0e-16777.75Show/hide
Query:  LFKLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP---SPLSLRLTFNQLS--------SPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVPR
        LFKLQ  FKSL S  LLFAAGLA GFTLTLF F  P   SPLSL  +F+QL          PPP  PPSRVGL+ FL PPPALHDMTEEELLWRASLVPR
Subjt:  LFKLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP---SPLSLRLTFNQLS--------SPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVPR

Query:  RIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFIL
        RIP  P      KIAFLFLTKDGVSLAPLWE FFKGHR LYSIYVH S S+NATV S+SVFY RSIPSKGVKWG  SMMEAERRLLANALLDFSNQRFIL
Subjt:  RIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFIL

Query:  LSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVG
        LSESCIPLFNFSTIYNYLM SK+TF+E+YDLPGPVGRGRY P+MRPTI L QWRKGSQWF++DR +A++VVSDQKYFPVF++HCKPSCYMDEHYLPTLVG
Subjt:  LSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVG

Query:  IKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFN
        I FS  NSNRTLTWVDWS+GG HPT+F R DV VGLL++LR+G+ C+YNG  TNVCHLFARKF+PNSLNRLL+FAPKLM FN
Subjt:  IKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFN

XP_004137544.2 glycosyltransferase BC10 [Cucumis sativus]1.1e-16978.15Show/hide
Query:  KLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP-----SPLSLRLTFNQL---------SSPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVP
        KL +HFKS FS FLLF+AGLA GFTLTLFIF  P     S LSL  TFNQL         S PPP  PPSRVGL++FL PPP LHDMTEEELLWRASLVP
Subjt:  KLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP-----SPLSLRLTFNQL---------SSPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVP

Query:  RRIPNLPA---KSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSN--ATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFS
        RRIP LP+    + T+KIAFLFLTKDGVSLAPLWELFFKG+ GLYSIYVH +PSS+  +TV SSSVFY RSIPSKGVKWGE SMMEAERRLLANALLDFS
Subjt:  RRIPNLPA---KSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSN--ATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFS

Query:  NQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHY
        N+RFILLSESCIPLFNFST+YNYLMGSKSTFIEAYDLPGPVGRGRY+P+MRP IKL QWRKGSQWFEMDR IA++V+SDQKYF VFQK CKPSCYMDEHY
Subjt:  NQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHY

Query:  LPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFNR
        LPT VGI+F   NSNRTLTWVDWSRGG HPTRF+R DVT+ LL++LR+G  C+YNGVKTN+CHLFARKF+ NSLNRLL+FAPKLM FNR
Subjt:  LPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFNR

XP_022988481.1 uncharacterized protein LOC111485710 [Cucurbita maxima]3.3e-16677.28Show/hide
Query:  LFKLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP---SPLSLRLTFNQLSSPPPS---------LPPSRVGLRDFLKPPPALHDMTEEELLWRASLVP
        LFKLQ  FKSL S  LLFAAGLA GFTLTLF F  P   SPLSL  +F+QL  P PS          PPSRVGL+ F  PPPALHDMTEEELLWRASLVP
Subjt:  LFKLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP---SPLSLRLTFNQLSSPPPS---------LPPSRVGLRDFLKPPPALHDMTEEELLWRASLVP

Query:  RRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFI
        RRIP  P      KIAFLFLTKDGVSLAPLWE FFKGHR LYSIYVH S S+NATV S+SVFY RSIPSKGVKWG  SMMEAERRLLANALLDFSNQRFI
Subjt:  RRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFI

Query:  LLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLV
        LLSESCIPLFNFSTIY+YLM SK+TF+E+YDLPGPVGRGRY P+MRPTI L QWRKGSQWF++DR +A++VVSDQKYFPVF++HCKPSCYMDEHYLPTLV
Subjt:  LLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLV

Query:  GIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFN
        GI FS  NSNRTLTWVDWS+GG HPT+F R DV VGLL++LR+G+ C+YNGV TNVCHLFARKF+PNSLNRLL+FAPKLM FN
Subjt:  GIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFN

XP_038895081.1 glycosyltransferase BC10-like [Benincasa hispida]3.7e-17882.85Show/hide
Query:  KLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP-----SPLSLRLTFNQLS----SPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVPRRIPN
        KLQMHF+SL S  L FAAGLA GFTLTLFIF  P     SPLSLR  F+Q      SPPPS PPSRVGL+DFLKPPPALHDMTEEELLWRASLVPRRIPN
Subjt:  KLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP-----SPLSLRLTFNQLS----SPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVPRRIPN

Query:  LPA-KSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFILLSE
         P+ +   +KIAFLFLTKDGV LAPLWELFFKGHRG YSIYVH SPSSNATV SSSVFY RSIPSKGVKWGE SMMEAERRLLANALLDFSNQRFILLSE
Subjt:  LPA-KSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFILLSE

Query:  SCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVGIKF
        SCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRY+P+MRPTI L QWRKGSQWFEMDR IA++VVSD K+FPVFQK CKPSCYMDEHYLPT VGI+F
Subjt:  SCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVGIKF

Query:  SNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFN
        S  NSNRTLTWVDWSRGG HPTRFIR DVTVGLL++LR+G+ C+YNGV TN+CHLFARKF+PNSLNRLL+FAPKLM FN
Subjt:  SNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFN

TrEMBL top hitse value%identityAlignment
A0A0A0LUY5 Uncharacterized protein5.3e-17078.15Show/hide
Query:  KLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP-----SPLSLRLTFNQL---------SSPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVP
        KL +HFKS FS FLLF+AGLA GFTLTLFIF  P     S LSL  TFNQL         S PPP  PPSRVGL++FL PPP LHDMTEEELLWRASLVP
Subjt:  KLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP-----SPLSLRLTFNQL---------SSPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVP

Query:  RRIPNLPA---KSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSN--ATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFS
        RRIP LP+    + T+KIAFLFLTKDGVSLAPLWELFFKG+ GLYSIYVH +PSS+  +TV SSSVFY RSIPSKGVKWGE SMMEAERRLLANALLDFS
Subjt:  RRIPNLPA---KSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSN--ATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFS

Query:  NQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHY
        N+RFILLSESCIPLFNFST+YNYLMGSKSTFIEAYDLPGPVGRGRY+P+MRP IKL QWRKGSQWFEMDR IA++V+SDQKYF VFQK CKPSCYMDEHY
Subjt:  NQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHY

Query:  LPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFNR
        LPT VGI+F   NSNRTLTWVDWSRGG HPTRF+R DVT+ LL++LR+G  C+YNGVKTN+CHLFARKF+ NSLNRLL+FAPKLM FNR
Subjt:  LPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFNR

A0A1S3BX15 uncharacterized protein LOC1034941408.7e-14986.29Show/hide
Query:  MTEEELLWRASLVPRRIPNLPA-KSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERR
        MTEEELLWRASLVPRRIP LP+ ++  +KIAFLFLTKDGVSLAPLWELFFKG+ GLYSIYVH SPSSN+TV SSSVFY RSIPSKGVKWGE SMMEAERR
Subjt:  MTEEELLWRASLVPRRIPNLPA-KSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERR

Query:  LLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHC
        LLANALLDFSNQRFILLSESCIPLFNFSTIYNYLM SKSTFIEAYDLPGPVGRGRYSP+MRP I L QWRKGSQWFEMDR IA++VVSDQKYFPVFQK C
Subjt:  LLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHC

Query:  KPSCYMDEHYLPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFNR
        KPSCYMDEHYLPT VGI+FS  NSNRTLTWVDWSRGG HPTRFIR DVTV LL++LRSG+ C+YNGVKTN+CHLFARKF+ NSLNRLL+FAPKLM FNR
Subjt:  KPSCYMDEHYLPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFNR

A0A2I4FFX6 glycosyltransferase BC10-like1.7e-13962.2Show/hide
Query:  MTSSIKQSSS-SDSKFLFKLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSPSPLSLRLTFNQL----------SSPPP--SLPP---------------
        M + ++Q S+ S  K L   QMH  SL   FLLF +GLALG TL+ ++       S  L  NQL           SPPP  S PP               
Subjt:  MTSSIKQSSS-SDSKFLFKLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSPSPLSLRLTFNQL----------SSPPP--SLPP---------------

Query:  ----SRVGLRDFLKPPPALHDMTEEELLWRASLVPRRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYAR
            SRVGL +FLKPP  +HDM +EELLWRAS+VP RI + P K  T K+AF+FLTK  V LAPLWE+FFKGH GLYS+YVH +PS N TVP SSVF+ R
Subjt:  ----SRVGLRDFLKPPPALHDMTEEELLWRASLVPRRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYAR

Query:  SIPSKGVKWGEASMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDR
         IPSK V+WGE +M+EAERRLLANALLD SNQRF+LLSESCIPLFNFST+YNYLMGS   F+EAYDLP  VGRGRYSP+M+PTIKL QWRKGSQWFEMDR
Subjt:  SIPSKGVKWGEASMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDR

Query:  LIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFL
         +A EV+SD++YFP+F+KHCK SCY DEHYLPTLV I+F  +NSNRTLTWVDWS+GGPHP+R++R DVT+  L++LRSG+QC+YNG  T++C+LFARKF 
Subjt:  LIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFL

Query:  PNSLNRLLIFAPKLMGFN
         ++L+RLL FAPKLM FN
Subjt:  PNSLNRLLIFAPKLMGFN

A0A6J1C2T7 uncharacterized protein LOC1110069265.8e-15371.28Show/hide
Query:  MTSSIKQSSSSDSK---FLFKLQ-MHF--KSLFSQFLLFAAGLALGFTLTLFIF---QSPSPLSLRLTFNQLSSPPPSLPPSRVGLRDFLKPPPALHDMT
        MT   +Q   SD K      KLQ MHF  KSLFSQ LLFA+GLA+GF+L+LF F   Q    L+   +++    PPP  PPS + L D   PPP +HDM+
Subjt:  MTSSIKQSSSSDSK---FLFKLQ-MHF--KSLFSQFLLFAAGLALGFTLTLFIF---QSPSPLSLRLTFNQLSSPPPSLPPSRVGLRDFLKPPPALHDMT

Query:  EEELLWRASLVPRRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFK-GHRGLYSIYVHCSPSSNATVPS-SSVFYARSIPSKGVKWGEASMMEAERRL
        +EELLWRASL PR +PN     P  KIAFLFLTK+GV+LAPLWELFFK  H  L+SIYVH +  SN+T+PS SSVF+ R+IPSKGVKWGE SMMEAERRL
Subjt:  EEELLWRASLVPRRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFK-GHRGLYSIYVHCSPSSNATVPS-SSVFYARSIPSKGVKWGEASMMEAERRL

Query:  LANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCK
        LANALLDFSNQRF+LLSESCIPLFNFST+YNYLMGSK+TFIEAYDLPGPVGRGRY PRMRPTI L QWRKGSQWF++DR +A EVVSD K+FPVF K C 
Subjt:  LANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCK

Query:  PSCYMDEHYLPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFN
        P CYMDEHYLPTLVGIKFS  NSNRTLTWVDWSRGGPHPTRFIR DV V LLE+LR+G+ C YNGV TNVCHLFARKF+PNSLNRLL+FAPKLM F+
Subjt:  PSCYMDEHYLPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFN

A0A6J1JHA6 uncharacterized protein LOC1114857101.6e-16677.28Show/hide
Query:  LFKLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP---SPLSLRLTFNQLSSPPPS---------LPPSRVGLRDFLKPPPALHDMTEEELLWRASLVP
        LFKLQ  FKSL S  LLFAAGLA GFTLTLF F  P   SPLSL  +F+QL  P PS          PPSRVGL+ F  PPPALHDMTEEELLWRASLVP
Subjt:  LFKLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSP---SPLSLRLTFNQLSSPPPS---------LPPSRVGLRDFLKPPPALHDMTEEELLWRASLVP

Query:  RRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFI
        RRIP  P      KIAFLFLTKDGVSLAPLWE FFKGHR LYSIYVH S S+NATV S+SVFY RSIPSKGVKWG  SMMEAERRLLANALLDFSNQRFI
Subjt:  RRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFI

Query:  LLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLV
        LLSESCIPLFNFSTIY+YLM SK+TF+E+YDLPGPVGRGRY P+MRPTI L QWRKGSQWF++DR +A++VVSDQKYFPVF++HCKPSCYMDEHYLPTLV
Subjt:  LLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLV

Query:  GIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFN
        GI FS  NSNRTLTWVDWS+GG HPT+F R DV VGLL++LR+G+ C+YNGV TNVCHLFARKF+PNSLNRLL+FAPKLM FN
Subjt:  GIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFN

SwissProt top hitse value%identityAlignment
Q65XS5 Glycosyltransferase BC101.8e-4235.06Show/hide
Query:  IPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHR-GLYSIYVHCSPSSNAT--VPSSSVFYARSI-PSKGVKWGEASMMEAERRLLANALLDFSNQR
        +   P      ++AFLF+ ++ + L  +W+ FF+G + G +SI+VH  P    T     S  FY R +  S  V WGEASM+EAER LLA+AL D  N+R
Subjt:  IPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHR-GLYSIYVHCSPSSNAT--VPSSSVFYARSI-PSKGVKWGEASMMEAERRLLANALLDFSNQR

Query:  FILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCK------------
        F+ +S+SC+PL+NF+  Y+Y+M S ++F++++        GRY+PRM P I ++ WRKGSQW  + R  A  VV D++  P FQKHC+            
Subjt:  FILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCK------------

Query:  ----------PSCYMDEHYLPTLVGIK-FSNKNSNRTLTWVDW--------SRGGPHPTRFIRKDVTVGLLEKLRS-----------GTQCQYNGVKTNV
                   +C  DEHY+ TL+       + + R++T   W         R G HP  +   D T  L++ ++               C  NG K   
Subjt:  ----------PSCYMDEHYLPTLVGIK-FSNKNSNRTLTWVDW--------SRGGPHPTRFIRKDVTVGLLEKLRS-----------GTQCQYNGVKTNV

Query:  CHLFARKF
        C LFARKF
Subjt:  CHLFARKF

Arabidopsis top hitse value%identityAlignment
AT1G10880.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.1e-9550.43Show/hide
Query:  FKSLFSQFLLFAAGLALGFTLTLFIFQSPSPLSLRLTFNQLSSPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVPRRIPNLPAKSPT-KKIAFL
        F S+ S  +L A  L      +LF+ +    LS     N L+SP  S PPS         P  +  ++ +EEL+WRA++ PR     P K+ T  K+AF+
Subjt:  FKSLFSQFLLFAAGLALGFTLTLFIFQSPSPLSLRLTFNQLSSPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVPRRIPNLPAKSPT-KKIAFL

Query:  FLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNY
        FLT+  + L+PLWE+FFKGH G YSIYVH SP      P SSVFY + IPSK V+WG+ SMM+AE+RL+++ALL+ SN RF+LLSE+CIPLFNF+TIY Y
Subjt:  FLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNY

Query:  LMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVGIKFSNKNSNRTLTWVDW
        L  S  +F+ ++D P P+GRGRY+P+M P + L  WRKG+QWFE+ R +A E+VSD++Y+ VF+ HC+P CY+DEHYLPTLV       NSNRT+TWVDW
Subjt:  LMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVGIKFSNKNSNRTLTWVDW

Query:  SRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARK
        SRGG HP RF+RKD+ VG L+++R G+ C Y G    V  +  +K
Subjt:  SRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARK

AT1G51770.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein3.6e-9452.25Show/hide
Query:  SPLSLRLTFNQLSSPPPSLPPSRVGLRDFLKPPPAL-HDMTEEELLWRASLVPRRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVH
        +P++L  T+N  S          V L  F++PP  + H M + ELLWRAS+ P+R      + P  K+AF+FL K  +  APLWE F KGH GLYSIYVH
Subjt:  SPLSLRLTFNQLSSPPPSLPPSRVGLRDFLKPPPAL-HDMTEEELLWRASLVPRRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVH

Query:  CSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRP
          PS  +    SSVFY R IPS+ V WGE SM EAERRLLANALLD SN+ F+LLSESCIPL  FS IY+Y+  S+ +F+ A D  GP GRGRY   M P
Subjt:  CSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRP

Query:  TIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQC
         I L QWRKGSQWFE++R +A E+V D  Y+P F++ C+P CY+DEHY PT++ +K     +NRTLTW DWSRGG HP  F + DVT   L+KL     C
Subjt:  TIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQC

Query:  QYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLM
         YN  ++ +C+LFARKF P++L  LL  APK++
Subjt:  QYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLM

AT1G68390.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.0e-11253.23Show/hide
Query:  TSSIKQSSSS-DSKFLFKLQMHFKSLFSQFLLFAAGLALGFTL--TLFIFQSPSPLSLR------LTFNQLSSPPPSLPPS------RVGLRDFLKPPPA
        +SS+  SS S  +K L     HF +L S  L+   G+ +G  L  +L  F S S LS++      +  +   SPPP  PPS      + GL+ F++PP  
Subjt:  TSSIKQSSSS-DSKFLFKLQMHFKSLFSQFLLFAAGLALGFTL--TLFIFQSPSPLSLR------LTFNQLSSPPPSLPPS------RVGLRDFLKPPPA

Query:  L-HDMTEEELLWRASLVPRRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEA
        L HDM +EELLWRAS+ P +I N P    T K+AF+F+TK  + LA LWE FF+GH GL++IYVH  PS N + P  SVF  R IPSK V WG  +M+EA
Subjt:  L-HDMTEEELLWRASLVPRRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEA

Query:  ERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQ
        E+RLLANALLD SN+RF+LLSESCIPLFNF+T+Y+YL+ S  T +E+YD  G VGRGRYSP M+P ++L+ WRKGSQW E+DR +A E++SD+ Y+P+F 
Subjt:  ERRLLANALLDFSNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQ

Query:  KHCKPSCYMDEHYLPTLVGIKFS--NKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLM
         +C   CY DEHY+PTL+ IK S   +NSNRTLTWVDWS+GGPHP RFIR +VT   +E LRSG +C YNG +TN+C+LFARKFLP +L+RLL  +  ++
Subjt:  KHCKPSCYMDEHYLPTLVGIKFS--NKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLM

Query:  GF
         F
Subjt:  GF

AT1G73810.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.6e-9449.36Show/hide
Query:  FKSLFSQFLLFAAGLALGFTLTLFIFQ-SPSPLSLRLTF---NQLSSPPPSLPPS-----------------RVGLRD--FLKPPPALHDMTEEELLWRA
        F++L     L   G  LGF L + I   S +P   RL+    +  S+PP    P                   VG +    + P   +H+MTEEELL RA
Subjt:  FKSLFSQFLLFAAGLALGFTLTLFIFQ-SPSPLSLRLTF---NQLSSPPPSLPPS-----------------RVGLRD--FLKPPPALHDMTEEELLWRA

Query:  SLVPRRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPS--SNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDF
        S +  +   +     TKK AF+FLT+  + LA LWE FFKGH GL+SIY+H S     +   P +S FY R IPSK V WG  SM+ AERRLLANALLD 
Subjt:  SLVPRRIPNLPAKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPS--SNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDF

Query:  SNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEH
         N RF+LLSES IPLFNFSTIY+YL+ S+ ++++ YDLPGP GRGRY+ RM P I    WRKGSQWFE+DR +A  VVSD  YFPVF+K+C  +CY DEH
Subjt:  SNQRFILLSESCIPLFNFSTIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEH

Query:  YLPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQ-CQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGF
        YL T V   F  KN+NR+LTW DWSR GPHP ++ R+ VT   L ++R+  Q C YNG K+  C+LFARKF  ++L++LL FA  +MGF
Subjt:  YLPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQ-CQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGF

AT5G11730.1 Core-2/I-branching beta-1,6-N-acetylglucosaminyltransferase family protein1.0e-10151.22Show/hide
Query:  HFKSLFSQFLLFAAGLALGFTLTLFIFQSPSPLSLRLTFNQLSSPPPSLPPSRVG----LRDFLKPPPAL-HDMTEEELLWRASLVPRRIPNLPAKSPTK
        H+K  F   LL   GL L F++T+FI  S S +      + +++   S  P R G    L  +++PP  L H+M++EELLWRAS  PRR    P K    
Subjt:  HFKSLFSQFLLFAAGLALGFTLTLFIFQSPSPLSLRLTFNQLSSPPPSLPPSRVG----LRDFLKPPPAL-HDMTEEELLWRASLVPRRIPNLPAKSPTK

Query:  KIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFILLSESCIPLFNFS
        K+AF+FLTK  + LA LWE F KGH+GLYS+Y+H  PS  A  P+SSVF+ R IPS+  +WG  SM +AE+RLLANALLD SN+ F+L+SESCIPL+NF+
Subjt:  KIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFILLSESCIPLFNFS

Query:  TIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVGIKFSNKNSNRTL
        TIY+YL  SK +F+ A+D PGP GRGRY+  M P + L +WRKGSQWFE++R +A  +V D  Y+P F++ C+P+CY+DEHY PT++ I+     +NR+L
Subjt:  TIYNYLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVGIKFSNKNSNRTL

Query:  TWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGF
        TWVDWSRGGPHP  F R D+T     K+  G  C YNG  T++C+LFARKF P++L  LL  APK++GF
Subjt:  TWVDWSRGGPHPTRFIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGTCGTCGATTAAACAATCTTCCTCCTCAGATTCAAAATTCTTGTTCAAACTCCAAATGCACTTCAAATCCCTCTTCTCTCAGTTCCTCCTCTTCGCCGCCGGCCT
CGCTCTCGGCTTCACTCTCACTCTCTTCATCTTCCAATCGCCCTCTCCTCTCTCCCTCCGCCTCACCTTCAACCAACTCTCTTCTCCGCCGCCCTCTCTTCCGCCGTCGC
GTGTGGGGTTGAGAGACTTTTTGAAGCCCCCGCCGGCTCTCCACGACATGACGGAGGAGGAGCTTCTATGGAGGGCCTCGCTGGTTCCTCGCCGGATTCCTAACTTACCG
GCGAAGTCACCGACGAAGAAGATCGCATTCTTGTTTCTGACGAAAGACGGCGTTAGTTTGGCTCCGCTTTGGGAACTGTTCTTCAAAGGCCACCGTGGGCTCTACTCCAT
TTACGTTCATTGCAGCCCCTCCTCCAACGCCACCGTCCCTTCCTCCTCCGTCTTCTACGCCCGCTCAATCCCCAGTAAGGGAGTGAAATGGGGAGAAGCAAGCATGATGG
AGGCAGAACGGCGGCTACTCGCAAACGCACTTCTAGACTTTTCCAATCAAAGATTCATCCTCCTCTCTGAATCCTGCATTCCACTCTTCAACTTCTCCACCATTTACAAC
TACTTAATGGGCTCCAAATCTACCTTCATCGAGGCCTACGACCTCCCAGGCCCAGTGGGCCGAGGCCGCTACAGCCCACGTATGCGGCCCACCATCAAGCTCCAACAATG
GCGCAAGGGTTCCCAGTGGTTCGAAATGGACCGACTCATCGCCAACGAAGTCGTCTCCGACCAAAAATACTTCCCCGTCTTCCAAAAACACTGCAAGCCCTCCTGTTACA
TGGACGAACACTACCTCCCCACGCTCGTCGGAATCAAGTTCTCCAACAAGAATTCCAACCGGACTTTGACTTGGGTTGACTGGTCCAGAGGCGGTCCCCACCCGACCCGG
TTCATCCGAAAGGATGTCACCGTTGGATTGCTCGAGAAGCTGAGAAGTGGAACCCAGTGCCAGTACAATGGAGTTAAGACAAATGTGTGTCATTTGTTTGCTAGGAAATT
CCTGCCCAATTCTTTGAATAGACTCCTCATCTTTGCTCCTAAGCTCATGGGGTTTAATCGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGACGTCGTCGATTAAACAATCTTCCTCCTCAGATTCAAAATTCTTGTTCAAACTCCAAATGCACTTCAAATCCCTCTTCTCTCAGTTCCTCCTCTTCGCCGCCGGCCT
CGCTCTCGGCTTCACTCTCACTCTCTTCATCTTCCAATCGCCCTCTCCTCTCTCCCTCCGCCTCACCTTCAACCAACTCTCTTCTCCGCCGCCCTCTCTTCCGCCGTCGC
GTGTGGGGTTGAGAGACTTTTTGAAGCCCCCGCCGGCTCTCCACGACATGACGGAGGAGGAGCTTCTATGGAGGGCCTCGCTGGTTCCTCGCCGGATTCCTAACTTACCG
GCGAAGTCACCGACGAAGAAGATCGCATTCTTGTTTCTGACGAAAGACGGCGTTAGTTTGGCTCCGCTTTGGGAACTGTTCTTCAAAGGCCACCGTGGGCTCTACTCCAT
TTACGTTCATTGCAGCCCCTCCTCCAACGCCACCGTCCCTTCCTCCTCCGTCTTCTACGCCCGCTCAATCCCCAGTAAGGGAGTGAAATGGGGAGAAGCAAGCATGATGG
AGGCAGAACGGCGGCTACTCGCAAACGCACTTCTAGACTTTTCCAATCAAAGATTCATCCTCCTCTCTGAATCCTGCATTCCACTCTTCAACTTCTCCACCATTTACAAC
TACTTAATGGGCTCCAAATCTACCTTCATCGAGGCCTACGACCTCCCAGGCCCAGTGGGCCGAGGCCGCTACAGCCCACGTATGCGGCCCACCATCAAGCTCCAACAATG
GCGCAAGGGTTCCCAGTGGTTCGAAATGGACCGACTCATCGCCAACGAAGTCGTCTCCGACCAAAAATACTTCCCCGTCTTCCAAAAACACTGCAAGCCCTCCTGTTACA
TGGACGAACACTACCTCCCCACGCTCGTCGGAATCAAGTTCTCCAACAAGAATTCCAACCGGACTTTGACTTGGGTTGACTGGTCCAGAGGCGGTCCCCACCCGACCCGG
TTCATCCGAAAGGATGTCACCGTTGGATTGCTCGAGAAGCTGAGAAGTGGAACCCAGTGCCAGTACAATGGAGTTAAGACAAATGTGTGTCATTTGTTTGCTAGGAAATT
CCTGCCCAATTCTTTGAATAGACTCCTCATCTTTGCTCCTAAGCTCATGGGGTTTAATCGTTGA
Protein sequenceShow/hide protein sequence
MTSSIKQSSSSDSKFLFKLQMHFKSLFSQFLLFAAGLALGFTLTLFIFQSPSPLSLRLTFNQLSSPPPSLPPSRVGLRDFLKPPPALHDMTEEELLWRASLVPRRIPNLP
AKSPTKKIAFLFLTKDGVSLAPLWELFFKGHRGLYSIYVHCSPSSNATVPSSSVFYARSIPSKGVKWGEASMMEAERRLLANALLDFSNQRFILLSESCIPLFNFSTIYN
YLMGSKSTFIEAYDLPGPVGRGRYSPRMRPTIKLQQWRKGSQWFEMDRLIANEVVSDQKYFPVFQKHCKPSCYMDEHYLPTLVGIKFSNKNSNRTLTWVDWSRGGPHPTR
FIRKDVTVGLLEKLRSGTQCQYNGVKTNVCHLFARKFLPNSLNRLLIFAPKLMGFNR