; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC10G191040 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC10G191040
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionE3 ubiquitin-protein ligase BRE1A-like
Genome locationCicolChr10:6208682..6218390
RNA-Seq ExpressionCcUC10G191040
SyntenyCcUC10G191040
Gene Ontology termsGO:0006486 - protein glycosylation (biological process)
GO:0016020 - membrane (cellular component)
GO:0016758 - transferase activity, transferring hexosyl groups (molecular function)
InterPro domainsIPR002659 - Glycosyl transferase, family 31
IPR025298 - Domain of unknown function DUF4094
IPR043424 - Protein BRANCHLESS TRICHOME-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033992.1 E3 ubiquitin-protein ligase BRE1A-like [Cucumis melo var. makuwa]0.0e+0092.01Show/hide
Query:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG
        VSAKKPQLPAPFPPTLNQLPDLDLHY+PGSSR TRRRIRSPARVRRVVAPVGKRSRPETPLLKWK++DPGS  +GVQ EEDEKK  +E  QRGRFRGSKG
Subjt:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG

Query:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGG-HRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEG
        RK VVSARKLAAGIWRLQLQEAVASEGRNGG  RRTEDLLGFQPR GHSG  A HPDDKIAFNSEMNDLLHSP SVSDSRNG LCK EPSFRY NSAMEG
Subjt:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGG-HRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEG

Query:  ATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKK
        ATKWEPACLKT VEARQIYNQMRL+DQQGA+SALS LEAELEQAHLRIEELQAERNASKKKLEYFLRKV +EKALWRSREHEKVRAFIDDIKAELNREKK
Subjt:  ATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKK

Query:  TRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA
        TRQR+EMINSKLVNELADAKLS KR MQDCEKERKERSLVEEVCDELAKEIGEDKARIE+LKRE++KLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA
Subjt:  TRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA

Query:  LEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLE
        LEEKYSQM NLVADLEDFLR +SETSDVSEMKKALLLREAAATVNIQDV EFVYEPSNPDDIFSVFEDVNFGESNERE+GQC+TYSPPNHASK+QT  LE
Subjt:  LEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLE

Query:  ANVTDRIGIQRQTNSFIAHN--GDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSI
        AN+TDRIGIQ+ TNSFIAHN  GDIEEDESGWETVSHLEDQGSSNSPEESIASV KNRRESNASVSGTEWEGNGGGDSP TEISEVCSVPSKQLKK+SSI
Subjt:  ANVTDRIGIQRQTNSFIAHN--GDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSI

Query:  ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR
        ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASI+SADGGSVRSGISPPEL  QWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR
Subjt:  ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR

Query:  Q
        Q
Subjt:  Q

XP_004149354.1 uncharacterized protein LOC101221647 [Cucumis sativus]0.0e+0090.44Show/hide
Query:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG
        +SAK P LPAPFPPTLNQLPDLDLHY+PGS+R TRRRIRSPARVRRVVAP+G+RSRPETPLLKWK++DPGS  +GVQQEE++ K  +E+ QRGRFRGSKG
Subjt:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG

Query:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGG-HRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEG
        RK VVSARKLAAGIWRLQL EAVASEGRNGG  RRTEDLLGFQ RTGHSG  A HPDDKIAFNSEMNDLLHSP SVSDSRNG LCK EPSFRY NSAMEG
Subjt:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGG-HRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEG

Query:  ATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKK
        ATKWEPACLKTPVEARQIYNQMRL+DQQGA+SALS LEAELE+AHLRIEELQAERNASKKKLEYFLRKV +EKALWRSREHEKVRAFIDDIKAELNREKK
Subjt:  ATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKK

Query:  TRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA
        TRQR+EMINSKLVNELADAKLS KR MQDCEKE+KERSLVEEVCDELAKEIGEDKARIE+LKRE+MKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA
Subjt:  TRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA

Query:  LEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLE
        +EEKYSQM NLVADLEDFLR +SETSDVSEMKKALLLREAAATVNIQDV EFVYEPSNPDDIFSVFEDVNFGESNERE+GQC+TYSPPNHASK+QT  LE
Subjt:  LEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLE

Query:  ANVTDRIGIQRQTNSFIAHN--GDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSI
        ANVTDRIGIQ+ TNSFIAHN  GDIEEDESGWETVSHLEDQGSSNSPEESIASV KNRRESNASVSGTEWEGNGGGDSP TEISEVCSVPSKQLKK+SSI
Subjt:  ANVTDRIGIQRQTNSFIAHN--GDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSI

Query:  ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR
        ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASI+SADGGSVRSGISPPEL  QWSSPDSGNGHA RGKKGCIPRNT+KGSLKAKLLEARMESHKVQLR
Subjt:  ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR

Query:  Q
        Q
Subjt:  Q

XP_008463408.1 PREDICTED: uncharacterized protein LOC103501579 [Cucumis melo]0.0e+0091.87Show/hide
Query:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG
        VSAKKPQLPAPFPPTLNQLPDLDLHY+PGSSR TRRRIRSPARVRRVVAPVGKRSRPETPLLKWK++DPGS  +GVQ EEDEKK  +E  QRGRFRGSKG
Subjt:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG

Query:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGG-HRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEG
        RK VVSARKLAAGIWRLQLQEAVASEGRNGG  RRTEDLLGFQPR GHSG  A HPDDKIAFNSEMNDLLHSP SVSDSRNG LCK EPSFRY NSAMEG
Subjt:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGG-HRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEG

Query:  ATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKK
        ATKWEPACLKT VEARQIYNQMRL+DQQGA+SALS LEAELEQAHLRIEELQAERNASKKKLEYFLRKV +EKALWRSREHEKVRAFIDDIKAELNREKK
Subjt:  ATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKK

Query:  TRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA
        TRQR+EMINSKLVNELADAKLS KR MQDCEKERKERSLVEEVCDELAKEIGEDKARIE+LKRE++KLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA
Subjt:  TRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA

Query:  LEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLE
        LEEKYSQM NLVADLEDFL  +SETSDVSEMKKALLLREAAATVNIQDV EFVYEPSNPDDIFSVFEDVNFGESNERE+GQC+TYSPPNHASK+QT  LE
Subjt:  LEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLE

Query:  ANVTDRIGIQRQTNSFIAHN--GDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSI
        AN+TDRIGIQ+ TNSFIAHN  GDIEEDESGWETVSHLEDQGSSNSPEESIASV KNRRESNASVSGTEWEGNGGGDSP TEISEVCSVPSKQLKK+SSI
Subjt:  ANVTDRIGIQRQTNSFIAHN--GDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSI

Query:  ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR
        ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASI+SADGGSVRSGISPPEL  QWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR
Subjt:  ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR

Query:  Q
        Q
Subjt:  Q

XP_023552694.1 coiled-coil domain-containing protein 30-like [Cucurbita pepo subsp. pepo]0.0e+0090.4Show/hide
Query:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG
        VS KKPQLPAPFPPTLNQLPDLDLHYRPGS+R TRRRIRSPAR+RRVVAPVGKRSRPETPLLKWKI+D GS ANGVQQEEDEK+ PLESSQRGRF G KG
Subjt:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG

Query:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGGHRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEGA
        RK VVSARKLAAGIWRLQLQEA ASEGRNGG RR EDL+GFQPRTGH  A  LHPD+KIAFNSEMNDLLHSPRSVSDSRN LLCK EPSFRYSNSAMEGA
Subjt:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGGHRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEGA

Query:  TKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKKT
        TKWEPACLKTPVEARQIYNQMRLLDQQGA+ ALS LE+ELEQAHL+IEELQAERNASKKKLEYFLRKV +EKALWRSREHEK+RA +DDIKAELNREKKT
Subjt:  TKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKKT

Query:  RQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVAL
        RQR+EMINSKLVNELADAKLSVKR MQD EKERKERSLVEEVCDELAKE+GEDKARIEALKRESMKLRDE DEERRMLQMAEVWREERVQMKLVDAKVAL
Subjt:  RQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVAL

Query:  EEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLEA
        EEKYSQM NLVADLEDFLR KSET DVSEMKKALLLREAAA+VNIQDV EFVYEPSNPDDIFSVFEDVNFGESNERE+GQCV YSPP HASK++T  LEA
Subjt:  EEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLEA

Query:  NVTDRIGIQRQTNSFIAHNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSIARL
        NVTDRIGIQR T+SF+A N  IEEDESGWETVSHLEDQGSSNSPE+S+AS+KKN RES ASVSG EWEGNGGGDSP TEISEVCSVPSKQLKK+SSIARL
Subjt:  NVTDRIGIQRQTNSFIAHNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSIARL

Query:  WKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQ
        WKSCSNNEGYKLISLEGINARLSNGRLSSASI SADGGSVRSGISPPELA QWSSPDSGNGH TRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQ
Subjt:  WKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQ

XP_038904987.1 uncharacterized protein LOC120091168 [Benincasa hispida]0.0e+0094.27Show/hide
Query:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG
        VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKI+DPGS ANGVQQE+DEKK PLESSQR RFRGSKG
Subjt:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG

Query:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGGHRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEGA
        RK VVSARKLAAGIWRLQLQEAVAS GRNGGHRRTEDLLGFQP TGHSGAPA HPDDKIAFNSEMNDLL SPRSVS SRNGLLCK EPSF+Y+NSAMEGA
Subjt:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGGHRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEGA

Query:  TKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKKT
        TKWEPACLKTPVEARQIYNQMRLLDQQGA+SALS LEAELEQAHLRIEELQAER ASKKKLEYFLRKVG+EKALWRSREHEKVRAFIDDIKAELNREKKT
Subjt:  TKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKKT

Query:  RQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVAL
        RQR+EMIN KLVNELADAKLS+KR MQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMK+RDEVDEERRMLQMAEVWREERVQMKLVDAKVAL
Subjt:  RQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVAL

Query:  EEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLEA
        EEKYSQM NLVADLEDFLR +SETSDVSEMKKALLLREAAATVNIQDV EFVYEPSNPDDIFSVFEDVNFGESNERE+GQCV YSPPNHASK+QTV LEA
Subjt:  EEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLEA

Query:  NVTDRIGIQRQTNSFIAHNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSIARL
        NVTDRIGIQR TNSFIAHNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKKN RESNASVSGTEWEGNGGGDSP TEISEVCSVPSKQLKK+SSIARL
Subjt:  NVTDRIGIQRQTNSFIAHNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSIARL

Query:  WKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQ
        WKSCSNNEGYKLISL+GINARLSNGRLSSASI+SADGGSVRSGISPPELA QWSSPDS NGH TRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQ
Subjt:  WKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQ

TrEMBL top hitse value%identityAlignment
A0A0A0L516 Uncharacterized protein0.0e+0090.44Show/hide
Query:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG
        +SAK P LPAPFPPTLNQLPDLDLHY+PGS+R TRRRIRSPARVRRVVAP+G+RSRPETPLLKWK++DPGS  +GVQQEE++ K  +E+ QRGRFRGSKG
Subjt:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG

Query:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGG-HRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEG
        RK VVSARKLAAGIWRLQL EAVASEGRNGG  RRTEDLLGFQ RTGHSG  A HPDDKIAFNSEMNDLLHSP SVSDSRNG LCK EPSFRY NSAMEG
Subjt:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGG-HRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEG

Query:  ATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKK
        ATKWEPACLKTPVEARQIYNQMRL+DQQGA+SALS LEAELE+AHLRIEELQAERNASKKKLEYFLRKV +EKALWRSREHEKVRAFIDDIKAELNREKK
Subjt:  ATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKK

Query:  TRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA
        TRQR+EMINSKLVNELADAKLS KR MQDCEKE+KERSLVEEVCDELAKEIGEDKARIE+LKRE+MKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA
Subjt:  TRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA

Query:  LEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLE
        +EEKYSQM NLVADLEDFLR +SETSDVSEMKKALLLREAAATVNIQDV EFVYEPSNPDDIFSVFEDVNFGESNERE+GQC+TYSPPNHASK+QT  LE
Subjt:  LEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLE

Query:  ANVTDRIGIQRQTNSFIAHN--GDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSI
        ANVTDRIGIQ+ TNSFIAHN  GDIEEDESGWETVSHLEDQGSSNSPEESIASV KNRRESNASVSGTEWEGNGGGDSP TEISEVCSVPSKQLKK+SSI
Subjt:  ANVTDRIGIQRQTNSFIAHN--GDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSI

Query:  ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR
        ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASI+SADGGSVRSGISPPEL  QWSSPDSGNGHA RGKKGCIPRNT+KGSLKAKLLEARMESHKVQLR
Subjt:  ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR

Query:  Q
        Q
Subjt:  Q

A0A1S3CKR8 uncharacterized protein LOC1035015790.0e+0091.87Show/hide
Query:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG
        VSAKKPQLPAPFPPTLNQLPDLDLHY+PGSSR TRRRIRSPARVRRVVAPVGKRSRPETPLLKWK++DPGS  +GVQ EEDEKK  +E  QRGRFRGSKG
Subjt:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG

Query:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGG-HRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEG
        RK VVSARKLAAGIWRLQLQEAVASEGRNGG  RRTEDLLGFQPR GHSG  A HPDDKIAFNSEMNDLLHSP SVSDSRNG LCK EPSFRY NSAMEG
Subjt:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGG-HRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEG

Query:  ATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKK
        ATKWEPACLKT VEARQIYNQMRL+DQQGA+SALS LEAELEQAHLRIEELQAERNASKKKLEYFLRKV +EKALWRSREHEKVRAFIDDIKAELNREKK
Subjt:  ATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKK

Query:  TRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA
        TRQR+EMINSKLVNELADAKLS KR MQDCEKERKERSLVEEVCDELAKEIGEDKARIE+LKRE++KLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA
Subjt:  TRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA

Query:  LEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLE
        LEEKYSQM NLVADLEDFL  +SETSDVSEMKKALLLREAAATVNIQDV EFVYEPSNPDDIFSVFEDVNFGESNERE+GQC+TYSPPNHASK+QT  LE
Subjt:  LEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLE

Query:  ANVTDRIGIQRQTNSFIAHN--GDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSI
        AN+TDRIGIQ+ TNSFIAHN  GDIEEDESGWETVSHLEDQGSSNSPEESIASV KNRRESNASVSGTEWEGNGGGDSP TEISEVCSVPSKQLKK+SSI
Subjt:  ANVTDRIGIQRQTNSFIAHN--GDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSI

Query:  ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR
        ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASI+SADGGSVRSGISPPEL  QWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR
Subjt:  ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR

Query:  Q
        Q
Subjt:  Q

A0A5A7SS70 E3 ubiquitin-protein ligase BRE1A-like0.0e+0092.01Show/hide
Query:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG
        VSAKKPQLPAPFPPTLNQLPDLDLHY+PGSSR TRRRIRSPARVRRVVAPVGKRSRPETPLLKWK++DPGS  +GVQ EEDEKK  +E  QRGRFRGSKG
Subjt:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG

Query:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGG-HRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEG
        RK VVSARKLAAGIWRLQLQEAVASEGRNGG  RRTEDLLGFQPR GHSG  A HPDDKIAFNSEMNDLLHSP SVSDSRNG LCK EPSFRY NSAMEG
Subjt:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGG-HRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEG

Query:  ATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKK
        ATKWEPACLKT VEARQIYNQMRL+DQQGA+SALS LEAELEQAHLRIEELQAERNASKKKLEYFLRKV +EKALWRSREHEKVRAFIDDIKAELNREKK
Subjt:  ATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKK

Query:  TRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA
        TRQR+EMINSKLVNELADAKLS KR MQDCEKERKERSLVEEVCDELAKEIGEDKARIE+LKRE++KLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA
Subjt:  TRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVA

Query:  LEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLE
        LEEKYSQM NLVADLEDFLR +SETSDVSEMKKALLLREAAATVNIQDV EFVYEPSNPDDIFSVFEDVNFGESNERE+GQC+TYSPPNHASK+QT  LE
Subjt:  LEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLE

Query:  ANVTDRIGIQRQTNSFIAHN--GDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSI
        AN+TDRIGIQ+ TNSFIAHN  GDIEEDESGWETVSHLEDQGSSNSPEESIASV KNRRESNASVSGTEWEGNGGGDSP TEISEVCSVPSKQLKK+SSI
Subjt:  ANVTDRIGIQRQTNSFIAHN--GDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSI

Query:  ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR
        ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASI+SADGGSVRSGISPPEL  QWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR
Subjt:  ARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLR

Query:  Q
        Q
Subjt:  Q

A0A6J1EVK0 coiled-coil domain-containing protein 30-like0.0e+0090.4Show/hide
Query:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG
        VS KKPQLPAPFPPTLNQLPDLDLHYRPGSSR TRRRIRSPARVRRVVAP+GKRSRPETPLLKWKI+D GS ANGVQQEEDEK+ PLESSQRGR RG KG
Subjt:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG

Query:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGGHRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEGA
        RK VVSARKLAAGIWRL LQEA ASEGRNGG  RTEDL+GFQPRTGH  APALHPD+KIAFNSEMNDLLHSP SVSDSRN LLCK EPSFRYSNSAMEGA
Subjt:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGGHRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEGA

Query:  TKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKKT
        TKWEPACLKTPVEARQIYNQMRLLDQ GA+ ALS LE+ELEQAHL+IEELQAERNASKKKLEYFLRKV +EK LWRSREHEK+RA +DDIKAELNREKKT
Subjt:  TKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKKT

Query:  RQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVAL
        RQR+EMINSKLVNELADAKLSVKR MQD EKERKERSLVEEVCDELAKE+GEDKARIEALKRESMKLRDE DEERRMLQMAEVWREERVQMKLVDAKVAL
Subjt:  RQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVAL

Query:  EEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLEA
        EEKYSQM NLVADLEDFLR KSET DVSEMKKALLLREAAA+VNIQDV EFVYEPSNPDDIFSVFEDVNFGESNERE+GQC  YSPP HASK++T  LEA
Subjt:  EEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLEA

Query:  NVTDRIGIQRQTNSFIAHNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSIARL
        NVTDRIGIQR T+SF+A N  IEEDESGWETVSHLEDQGSSNSPE+S+AS+KKN RESNASVSG EWEGNGGGDSP TEISEVCSVPSKQLKK+SSIARL
Subjt:  NVTDRIGIQRQTNSFIAHNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSIARL

Query:  WKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQ
        WKSCSNNEGYKLISLEGINARLSNGRLSSASI SADGGSVRSGISPPELA QWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQ
Subjt:  WKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQ

A0A6J1J7F3 uncharacterized protein LOC1114832640.0e+0089.83Show/hide
Query:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG
        VS KKPQLPAPFPPTLNQLPDLDLHYRPGSSR TRRRIRSPARVRRVVAPVGKRSRPETPLLKWKI+D GS ANGVQQEEDEK+ PLESSQRGR RG KG
Subjt:  VSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKG

Query:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGGHRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEGA
        RK VVSARKL AGIWRLQLQEA  SEGRNGG RRTEDL+GFQPRTGH  APALHPD+KIAFNSEMNDLLHSPRSVSDSRN LLCK EPSFRYSNSAMEGA
Subjt:  RKAVVSARKLAAGIWRLQLQEAVASEGRNGGHRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEGA

Query:  TKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKKT
        TKWEPACLKTPVEARQIYNQMRLLDQQGA+ ALS LE+ELEQAHL+IEELQAERNAS+KKLEYFLRKV +EKALWRSREHEK+RA +DDIKAELNREKKT
Subjt:  TKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKKT

Query:  RQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVAL
        RQR+EMINSKLVNELAD+KLSVKR MQD EKERKERSLVEEVCDELAKE+GEDKARIEALKRESMKLRDE DEER+MLQMAEVWREERVQMKLVDAKVAL
Subjt:  RQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVAL

Query:  EEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLEA
        EEKYSQM NLVADLEDFLR K ET DVSEMKKALLLREAAA+VNIQDV EFVYEPSNPDDIFSVFEDVNFGESNERE+GQC+ YSPP+HASK+QT  LEA
Subjt:  EEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLEA

Query:  NVTDRIGIQRQTNSFIAHNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSIARL
        NVTDRIGIQR T+ F+A N  IEEDESGWETVSHLEDQGSSNSPE+S+AS+KKN R+SNASVSG EWEGNGGGDSP TE+SEVCSVPSKQLKK+SSIARL
Subjt:  NVTDRIGIQRQTNSFIAHNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSIARL

Query:  WKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQ
        WKSCSNNEGYKLISLEGINARLSNGRLSSASI SADGGSVRSGISPPELA QWSS DSGNGH TRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQ
Subjt:  WKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQ

SwissProt top hitse value%identityAlignment
A8MRC7 Probable beta-1,3-galactosyltransferase 22.4e-12857.07Show/hide
Query:  MSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAGMLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIF----------LYHRLQNTY-------
        MS K +G  E S+R+ VSRKWT L C+G FC GM F++RMW +PE + M   S  +E E+LK+VSEGCN   ++          L+  + NT+       
Subjt:  MSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAGMLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIF----------LYHRLQNTY-------

Query:  --------------------------------------HLFMSVI-------------------------------------------STTPGGILDRAI
                                                F+ V+                                           S T GGILDRAI
Subjt:  --------------------------------------HLFMSVI-------------------------------------------STTPGGILDRAI

Query:  EAEDKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGN
        EAED++HGDF+RLDHVEGYLELS KTK+YF+TA ++WDADFYVKVDDDVHVNIATLG TL RHR K RVYIGCMKSGPVL+QKGVRYHEPEYWKFGE GN
Subjt:  EAEDKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGN

Query:  KYFRHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGE
        KYFRHATGQLYAIS+DLA+YISINQH+LHKYANEDVSLG+WFIG+D +HIDDRRLCCGTPPDCEWKAQAGNIC+ASFDWSCSGIC+SA+R+KEVHRRCGE
Subjt:  KYFRHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGE

Query:  GENALLSAVF
        GE AL SA F
Subjt:  GENALLSAVF

Q8LEJ9 Probable beta-1,3-galactosyltransferase 43.8e-12156.06Show/hide
Query:  STRTSVSRKWTFLFCIGCFCAGMLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIFLYHRLQN------------------------------
        ++++ VS+KWT   CIG FCAG+LFSDRMW  PE  N+    T + +E+L++ SE C+SS   L    ++                              
Subjt:  STRTSVSRKWTFLFCIGCFCAGMLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIFLYHRLQN------------------------------

Query:  ----------------------TYHLFMSVI-------------------------------------------STTPGGILDRAIEAEDKRHGDFMRLD
                              T   ++ V+                                           S+TPGGILDRAI+AE+ +HGDF+RLD
Subjt:  ----------------------TYHLFMSVI-------------------------------------------STTPGGILDRAIEAEDKRHGDFMRLD

Query:  HVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGNKYFRHATGQLYAIS
        HVEGYLELSAKTK+YF TA A+WDADFYVKVDDDVHVNIATLG+ LAR+R K RVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGNKYFRHATGQLYAIS
Subjt:  HVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGNKYFRHATGQLYAIS

Query:  KDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGEGENALLSAVF
        ++LA+YISINQ++LHKY NEDVSLGSWF+GLD EH+DDRRLCCGT  DCEWKAQAGNIC+ASFDWSCSGIC+SA+RMK+VHRRCGEGE ALL+A F
Subjt:  KDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGEGENALLSAVF

Q9MAP8 Beta-1,6-galactosyltransferase GALT31A6.2e-10878.83Show/hide
Query:  STTPGGILDRAIEAEDKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYH
        S++PGG+LD  IEAE+++H DF RL+H+EGY ELS+KT+ YF++AVA WDADFY+KVDDDVHVN+  LGSTLARHRSK RVYIGCMKSGPVLAQKGV+YH
Subjt:  STTPGGILDRAIEAEDKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYH

Query:  EPEYWKFGEEGNKYFRHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSA
        EPEYWKFGEEGNKYFRHATGQ+YAISKDLATYIS+N+ +LHKYANEDVSLGSWFIGLD EHIDDR LCCGTP DCEWK QAGN C ASFDWSCSGICKS 
Subjt:  EPEYWKFGEEGNKYFRHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSA

Query:  ERMKEVHRRCGEGENALLSAVF
        +RM EVH+RCGEG+ A+  + F
Subjt:  ERMKEVHRRCGEGENALLSAVF

Q9SAA4 Probable beta-1,3-galactosyltransferase 16.0e-10349.61Show/hide
Query:  MSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAGMLFSDRMW-AVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIFLY-----------------------
        MS+K+RG +  + R  VSR   F  C+  FC GM F++RMW  VPE   +      S   KL + S  C+  N+  Y                       
Subjt:  MSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAGMLFSDRMW-AVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIFLY-----------------------

Query:  ---HRLQNTYHL---------------------------------------------------FMSVISTTPGGILDRAIEAEDKRHGDFMRLDHVEGYL
             L   +++                                                   F+   S    GILD+AIEAE+K HGDF+RL+H EGY+
Subjt:  ---HRLQNTYHL---------------------------------------------------FMSVISTTPGGILDRAIEAEDKRHGDFMRLDHVEGYL

Query:  ELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGNKYFRHATGQLYAISKDLATY
        +LSAKTK++FATAV+LWDA+FY+KVDDDVHVN+A+L   L+ H++K RVY+GCMKSGPVLA+K V+YHEPEYWKFGE GNKYFRHATGQ YAISKDLATY
Subjt:  ELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGNKYFRHATGQLYAISKDLATY

Query:  ISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGEGENAL
        I INQ +LHKYANEDVSLGSWFIGL+ EH+D++RLCC T  DCE KA  G++C ASFDW CSGIC+SAERM +VH RCGE +NAL
Subjt:  ISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGEGENAL

Q9ZV71 Probable beta-1,3-galactosyltransferase 34.6e-12755.12Show/hide
Query:  MSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAGMLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIF----------LYHRLQNTYHL-----
        MS K +G   PS R+ VS+KWTFL C G FC G+LF+DRMW +PE ++MP  S  +E E+LK++SEGC+   ++          L+  +  T++      
Subjt:  MSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAGMLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIF----------LYHRLQNTYHL-----

Query:  ----------------------------------------FMSVI-------------------------------------------STTPGGILDRAI
                                                ++ V+                                           S T GGILDR+I
Subjt:  ----------------------------------------FMSVI-------------------------------------------STTPGGILDRAI

Query:  EAEDKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGN
        EAEDK+HGDF+RLDHVEGYLELS KTK+YF+TAV+ WDA+FYVKVDDDVHVNIATLG TL RHR K RVY+GCMKSGPVL+QKGVRYHEPEYWKFGE GN
Subjt:  EAEDKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGN

Query:  KYFRHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGE
        KYFRHATGQLYAIS+DLA+YIS+NQH+LHKYANEDV+LG+WFIGLD  HIDDRRLCCGTPPDCEWKAQAGNIC+ASFDW+CSGIC+SA+R+KEVH+RCGE
Subjt:  KYFRHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGE

Query:  GENALLSAVF
         ENA+  A F
Subjt:  GENALLSAVF

Arabidopsis top hitse value%identityAlignment
AT1G05170.1 Galactosyltransferase family protein8.6e-12958.23Show/hide
Query:  MSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAGMLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCN------------------SSNIFL-------
        MS K +G  E S+R+ VSRKWT L C+G FC GM F++RMW +PE + M   S  +E E+LK+VSEGCN                  +++I L       
Subjt:  MSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAGMLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCN------------------SSNIFL-------

Query:  -------------YHRLQNTYHL--------------FMSVI-------------------------------------------STTPGGILDRAIEAE
                        LQN   L              F+ V+                                           S T GGILDRAIEAE
Subjt:  -------------YHRLQNTYHL--------------FMSVI-------------------------------------------STTPGGILDRAIEAE

Query:  DKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGNKYF
        D++HGDF+RLDHVEGYLELS KTK+YF+TA ++WDADFYVKVDDDVHVNIATLG TL RHR K RVYIGCMKSGPVL+QKGVRYHEPEYWKFGE GNKYF
Subjt:  DKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGNKYF

Query:  RHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGEGEN
        RHATGQLYAIS+DLA+YISINQH+LHKYANEDVSLG+WFIG+D +HIDDRRLCCGTPPDCEWKAQAGNIC+ASFDWSCSGIC+SA+R+KEVHRRCGEGE 
Subjt:  RHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGEGEN

Query:  ALLSAVF
        AL SA F
Subjt:  ALLSAVF

AT1G05170.2 Galactosyltransferase family protein1.7e-12957.07Show/hide
Query:  MSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAGMLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIF----------LYHRLQNTY-------
        MS K +G  E S+R+ VSRKWT L C+G FC GM F++RMW +PE + M   S  +E E+LK+VSEGCN   ++          L+  + NT+       
Subjt:  MSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAGMLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIF----------LYHRLQNTY-------

Query:  --------------------------------------HLFMSVI-------------------------------------------STTPGGILDRAI
                                                F+ V+                                           S T GGILDRAI
Subjt:  --------------------------------------HLFMSVI-------------------------------------------STTPGGILDRAI

Query:  EAEDKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGN
        EAED++HGDF+RLDHVEGYLELS KTK+YF+TA ++WDADFYVKVDDDVHVNIATLG TL RHR K RVYIGCMKSGPVL+QKGVRYHEPEYWKFGE GN
Subjt:  EAEDKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGN

Query:  KYFRHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGE
        KYFRHATGQLYAIS+DLA+YISINQH+LHKYANEDVSLG+WFIG+D +HIDDRRLCCGTPPDCEWKAQAGNIC+ASFDWSCSGIC+SA+R+KEVHRRCGE
Subjt:  KYFRHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGE

Query:  GENALLSAVF
        GE AL SA F
Subjt:  GENALLSAVF

AT1G50660.1 unknown protein1.7e-19356.81Show/hide
Query:  DLHYRPGSSRATRRRIRSPA--RVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEED--EKKFPLESSQRGRFRGSKGRKAVVSARKLAAGIWRLQ
        DL     ++  T+RR R+P+  R RR     G+RSRPETPLLKWK+ED   E +GV +++D  +    +  S+  R +  +     VS RKLAAG+WRLQ
Subjt:  DLHYRPGSSRATRRRIRSPA--RVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEED--EKKFPLESSQRGRFRGSKGRKAVVSARKLAAGIWRLQ

Query:  LQEAVASEGRNGGHRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEGATKWEPACLKTPVEARQIY
        + +A +S    GG R+ ++ LGFQ   G+ G P L+         + N +  +P +++ ++NG LCKLEPS  + +SAMEGATKW+P CL T  E  QIY
Subjt:  LQEAVASEGRNGGHRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLCKLEPSFRYSNSAMEGATKWEPACLKTPVEARQIY

Query:  NQMRLLDQQ-GAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKKTRQRLEMINSKLVNELAD
        + M+ +DQQ  A+S +S LEAELE+AH RIE+L++E+ + KKKLE FLRKV +E+A WRSREHEKVRA IDD+K ++NREKKTRQRLE++N KLVNELAD
Subjt:  NQMRLLDQQ-GAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAELNREKKTRQRLEMINSKLVNELAD

Query:  AKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVALEEKYSQMCNLVADLEDF
        +KL+VKR MQD EKERK R L+EEVCDELAKEIGEDKA IEALKRESM LR+EVD+ERRMLQMAEVWREERVQMKL+DAKVALEE+YSQM  LV DLE F
Subjt:  AKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVALEEKYSQMCNLVADLEDF

Query:  LRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLEANVTDRIGIQRQTNSFIA
        LRS+   +DV E+++A LLRE AA+VNIQ++KEF Y P+NPDDI++VFE++N GE+++REM + V YSP +H SK+ TV L+AN+ ++ G  R ++++  
Subjt:  LRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLEANVTDRIGIQRQTNSFIA

Query:  HNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKK---NRRESNASVSGTEWEGNGGGD--SPSTEISEVCSVPSKQLKKMSSIARLWKSCSNNEG---
         NGDIEED+SGWETVSHLE+QGSS SP+ SI SV     N R SNAS  GTE  G    D  +P+TEISEVCS+P +  KK+SSIA+LW+S   + G   
Subjt:  HNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKK---NRRESNASVSGTEWEGNGGGD--SPSTEISEVCSVPSKQLKKMSSIARLWKSCSNNEG---

Query:  --YKLISLEGIN-ARLSNGRLSSASIVSADGGSVRSGISP-PELAEQW-SSPDSGN-GHATR-GKKGCI--PRNTIKGSLKAKLLEARMESHKVQLR
          YK+IS+EG+N  R+SNGR SSA +VS D  S + G SP  +L  QW SSP+S N  H  R G KGCI  PR   K SLK+KL+EAR+ES KVQL+
Subjt:  --YKLISLEGIN-ARLSNGRLSSASIVSADGGSVRSGISP-PELAEQW-SSPDSGN-GHATR-GKKGCI--PRNTIKGSLKAKLLEARMESHKVQLR

AT2G32430.1 Galactosyltransferase family protein3.3e-12855.12Show/hide
Query:  MSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAGMLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIF----------LYHRLQNTYHL-----
        MS K +G   PS R+ VS+KWTFL C G FC G+LF+DRMW +PE ++MP  S  +E E+LK++SEGC+   ++          L+  +  T++      
Subjt:  MSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAGMLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIF----------LYHRLQNTYHL-----

Query:  ----------------------------------------FMSVI-------------------------------------------STTPGGILDRAI
                                                ++ V+                                           S T GGILDR+I
Subjt:  ----------------------------------------FMSVI-------------------------------------------STTPGGILDRAI

Query:  EAEDKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGN
        EAEDK+HGDF+RLDHVEGYLELS KTK+YF+TAV+ WDA+FYVKVDDDVHVNIATLG TL RHR K RVY+GCMKSGPVL+QKGVRYHEPEYWKFGE GN
Subjt:  EAEDKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADFYVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGN

Query:  KYFRHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGE
        KYFRHATGQLYAIS+DLA+YIS+NQH+LHKYANEDV+LG+WFIGLD  HIDDRRLCCGTPPDCEWKAQAGNIC+ASFDW+CSGIC+SA+R+KEVH+RCGE
Subjt:  KYFRHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHIDDRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGE

Query:  GENALLSAVF
         ENA+  A F
Subjt:  GENALLSAVF

AT3G20350.1 unknown protein2.0e-15750.14Show/hide
Query:  KPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPA--RVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKGRK
        K  L A  P T+  + D+            RRR R P+  R RR    V + SRPETP LK K+ED   E  G  ++ D      E     + R  +  +
Subjt:  KPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPA--RVRRVVAPVGKRSRPETPLLKWKIEDPGSEANGVQQEEDEKKFPLESSQRGRFRGSKGRK

Query:  AV--VSARKLAAGIWRLQLQEAVASEGRNGGHRRTEDLLGFQPRTGHSG--APAL---HPDDKIAFNSEMNDLLHSPRSVSDSRN---GLLCKLEPSFRY
        +V   + RKLAAG+WRL++ +AV+S    GG +R++D L FQ   G +G   P     H DDK           HS    ++SRN     LCK EPS  +
Subjt:  AV--VSARKLAAGIWRLQLQEAVASEGRNGGHRRTEDLLGFQPRTGHSG--APAL---HPDDKIAFNSEMNDLLHSPRSVSDSRN---GLLCKLEPSFRY

Query:  SNSAMEGATKWEPACLKTPVEARQIYNQMRLLDQQ-GAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIK
         + AMEGATKW+P CL T  +  QIY  ++  +QQ   +S  S +E +L++A   I++L++E+ + KKKLE FL+KV +E+A WRSREHEKVRA IDD+K
Subjt:  SNSAMEGATKWEPACLKTPVEARQIYNQMRLLDQQ-GAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIK

Query:  AELNREKKTRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQM
        A++N+EKKTRQRLE++NSKLVNELAD+KL+VKR M D ++ERK R L+EEVCDELAKEI EDKA IEALK ESM LR+EVD+ERRMLQMAEVWREERVQM
Subjt:  AELNREKKTRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQM

Query:  KLVDAKVALEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATV-NIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHA
        KL+DAKV LEEKYSQM  LV D+E FL S++ T+ V E++ A LLRE AA+V NIQ++KEF YEP+ PDDI  +FE +N GE+ +RE  Q V YSP +HA
Subjt:  KLVDAKVALEEKYSQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATV-NIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHA

Query:  SKLQTVGLEANVTDRIGIQRQTNSFIAHNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKK-NRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSK
        SK  TV  + N+ ++    R +N+F   NG+ EED+SGWETVSH E+ GSS SP+ESI ++   + R SN S++GTE+E      +   EI EVCSVP +
Subjt:  SKLQTVGLEANVTDRIGIQRQTNSFIAHNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKK-NRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSK

Query:  QLKKMSSIARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQW-SSPDSGNGHATR-GKKGCI--PRNTIKGSLKAKLL
        Q KK+ S+A+LW            SLEG+N R+SN R S+  +VS + GS + G +  +L  QW SSPDS N +  R G+KGCI  PR   K SLK KL+
Subjt:  QLKKMSSIARLWKSCSNNEGYKLISLEGINARLSNGRLSSASIVSADGGSVRSGISPPELAEQW-SSPDSGNGHATR-GKKGCI--PRNTIKGSLKAKLL

Query:  EARMESHKVQLR
        EA++ES KVQL+
Subjt:  EARMESHKVQLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCCTGTCGTTTTCCCCATCACCATCTTCAAACTAAATCCTCTCATACCCTTTTCTTCTTCTTCTTCTCTCTCACAATCCTTCACCCTTCCTCTCCGGCCTCCATAGT
ATCCGCGAAGAAGCCGCAGCTTCCGGCGCCATTCCCGCCCACACTCAACCAACTCCCAGATTTGGATCTTCATTACAGGCCTGGTTCCAGTCGGGCCACGCGGCGGAGAA
TCCGAAGTCCAGCTCGCGTCAGAAGAGTTGTTGCTCCGGTTGGGAAGCGGAGCCGTCCTGAGACGCCCTTGCTTAAGTGGAAGATCGAAGACCCAGGGAGTGAAGCTAAT
GGAGTTCAACAAGAGGAAGATGAGAAGAAGTTCCCTCTGGAAAGTAGCCAGAGAGGTCGCTTTCGAGGAAGTAAAGGAAGGAAAGCTGTTGTTTCGGCTAGGAAGCTCGC
GGCTGGAATCTGGAGATTGCAACTGCAGGAGGCGGTCGCTAGTGAAGGGAGGAATGGCGGCCACCGGAGGACGGAGGATCTCTTAGGGTTTCAGCCTCGCACTGGTCATT
CTGGCGCTCCAGCTCTCCATCCTGATGATAAGATAGCATTTAATTCTGAAATGAATGATCTATTACATAGTCCGCGTTCTGTCTCGGATTCAAGAAATGGGCTATTGTGC
AAGTTGGAGCCCTCATTTCGATATTCAAACTCAGCAATGGAGGGAGCAACAAAGTGGGAACCTGCATGCCTGAAAACACCAGTTGAAGCACGTCAGATTTACAACCAAAT
GAGACTTCTTGATCAACAAGGTGCTATTTCGGCCCTTTCTGTGCTTGAAGCTGAACTAGAGCAGGCCCACTTGCGGATTGAGGAACTTCAAGCTGAGCGTAATGCTTCAA
AAAAGAAGCTCGAATACTTCTTGAGGAAAGTTGGTGATGAAAAAGCCTTATGGCGGAGCAGAGAGCATGAGAAAGTAAGGGCATTTATTGATGATATAAAGGCTGAGCTG
AACAGAGAAAAGAAAACCAGACAAAGGTTGGAAATGATCAATTCGAAATTGGTAAACGAGTTGGCCGATGCTAAGCTATCTGTAAAGCGGCTTATGCAAGACTGTGAAAA
AGAAAGGAAGGAAAGATCCTTGGTTGAGGAAGTATGTGATGAACTCGCAAAGGAAATTGGTGAAGACAAAGCAAGAATAGAAGCATTGAAGAGAGAATCCATGAAATTGA
GAGACGAAGTGGACGAAGAACGAAGAATGTTACAGATGGCTGAGGTCTGGCGTGAGGAACGTGTTCAAATGAAGCTGGTTGATGCCAAAGTCGCACTTGAAGAGAAATAC
TCACAGATGTGTAATCTAGTAGCTGACCTTGAAGACTTCTTAAGGTCGAAAAGTGAAACCTCAGATGTATCAGAGATGAAAAAAGCTTTGCTACTTAGAGAGGCTGCTGC
TACTGTGAATATTCAAGATGTCAAAGAATTTGTTTATGAACCTTCAAACCCAGATGATATCTTCTCTGTGTTTGAAGATGTCAATTTTGGAGAATCAAATGAGAGGGAAA
TGGGACAGTGCGTTACTTACAGCCCACCTAATCATGCTTCTAAACTCCAGACTGTGGGTCTTGAAGCTAATGTGACCGATAGGATTGGCATCCAGAGACAAACGAATTCC
TTTATTGCTCATAACGGTGATATCGAAGAAGATGAGAGTGGATGGGAAACTGTGAGTCATCTAGAGGATCAGGGCTCAAGCAATTCCCCAGAAGAGAGTATTGCATCGGT
CAAAAAGAATCGTCGAGAGAGTAATGCTTCGGTGAGCGGGACAGAATGGGAAGGAAATGGAGGTGGAGACTCACCCTCTACCGAAATCAGCGAAGTCTGCTCGGTTCCAT
CAAAGCAATTAAAGAAGATGTCATCCATTGCACGACTTTGGAAATCGTGCTCAAACAATGAAGGATACAAACTAATCTCACTTGAGGGCATAAATGCCAGGCTCTCAAAT
GGAAGATTATCAAGTGCAAGTATCGTATCCGCAGATGGGGGCTCGGTGAGAAGTGGTATTAGTCCTCCAGAATTAGCAGAACAGTGGAGCTCTCCTGACTCGGGCAATGG
TCATGCAACACGAGGGAAGAAAGGATGCATCCCTCGCAACACAATCAAGGGTAGTTTAAAGGCAAAGCTCTTGGAGGCAAGAATGGAAAGCCATAAGGTCCAGTTGCGCC
AGAAGAGCTTTCAACTGGCTTCGATACCATTTCATCAGGCTCAGCTCCCAGCGGTAGTCCCGGACTCTGTCGCAAAGGGTAAAGTCGTAGGTCTAATCCTAATTGCCAAA
CGAGGAGACGGCAGTGTACACTGTCAACACCGATTGGAGACAGCCATCTTCGATGGAATGGGATGTGTTGGTTCCAAAGTTTCCATTTATAAGCTCATGAAATTCATTGT
CATGCTTAGAGAAAAACGAAACAGACTTAACTTTGTTGATATGGGATTTATAAAAATACAATTTGATATCCATCCCACTGTTGAACTAGCTACCAAAAAGAAAAGAAAAA
GTAATGATGTTATTGTGTTTGCACGTGCTCCATGTGCAAGAAGCATATCTGATTGTCGGTATCATTTGCAATATTCAAAGAACACAGCACTGAGTTGGAGCGACAACAGT
CACTTTTCAGCTCTCCACCGACGCTCTAACCACTACTGTCTACTGTTCAAGATCCACCTCTTCTTAAGTTTAAACCCATTTCCCATTCAGGTCTTTTTGAGTTTTGACTT
TGGGAATTCTGCAACTTGGGCTTCAATTTCTCCACTGGATTTGGTTGTTTTGCGATCTGATTTGGATTTGAACTGCTGGGTTGACTTGGGTAGAAGAAGGGGAAAAAAAA
GGAAGAACATTATGTCGTGGAAGAGCAGAGGAGGGTTTGAGCCAAGTACGAGGACCTCTGTTTCTCGTAAATGGACTTTTCTGTTTTGTATTGGATGCTTTTGTGCTGGG
ATGCTCTTCTCTGATAGAATGTGGGCTGTGCCTGAAGTTGAAAATATGCCAGGAGGGTCAACAGGTTCTGAAGAAGAGAAGTTAAAGATGGTATCAGAGGGTTGTAACTC
ATCAAATATATTTTTATACCATAGATTACAAAACACTTATCATCTATTTATGTCTGTTATTAGTACCACACCGGGTGGTATTCTCGATAGAGCTATTGAAGCCGAGGACA
AAAGACACGGTGATTTTATGAGGCTGGATCATGTCGAAGGTTACCTCGAATTGTCAGCCAAGACGAAATCTTACTTTGCCACCGCTGTTGCATTGTGGGATGCTGATTTT
TATGTGAAGGTTGACGATGATGTGCATGTCAACATAGCAACACTTGGATCAACATTAGCTAGACATCGGTCGAAGTCCCGGGTGTACATTGGATGCATGAAATCTGGTCC
CGTGCTTGCACAAAAAGGAGTCAGATACCATGAACCCGAGTACTGGAAATTTGGGGAAGAAGGAAACAAGTACTTCCGTCACGCCACTGGGCAGCTTTATGCCATATCAA
AGGATTTAGCAACGTACATATCAATCAACCAGCATATACTTCACAAGTATGCGAACGAGGACGTTTCGTTGGGTTCTTGGTTTATCGGATTGGATGCGGAGCATATCGAT
GACCGGAGATTATGTTGTGGCACCCCACCTGATTGTGAGTGGAAGGCTCAGGCAGGAAACATTTGCATTGCGTCGTTCGACTGGAGTTGCAGCGGGATATGCAAGTCGGC
GGAGCGAATGAAGGAGGTTCACCGCCGCTGCGGAGAGGGGGAGAATGCATTGTTGAGTGCAGTTTTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCCTGTCGTTTTCCCCATCACCATCTTCAAACTAAATCCTCTCATACCCTTTTCTTCTTCTTCTTCTCTCTCACAATCCTTCACCCTTCCTCTCCGGCCTCCATAGT
ATCCGCGAAGAAGCCGCAGCTTCCGGCGCCATTCCCGCCCACACTCAACCAACTCCCAGATTTGGATCTTCATTACAGGCCTGGTTCCAGTCGGGCCACGCGGCGGAGAA
TCCGAAGTCCAGCTCGCGTCAGAAGAGTTGTTGCTCCGGTTGGGAAGCGGAGCCGTCCTGAGACGCCCTTGCTTAAGTGGAAGATCGAAGACCCAGGGAGTGAAGCTAAT
GGAGTTCAACAAGAGGAAGATGAGAAGAAGTTCCCTCTGGAAAGTAGCCAGAGAGGTCGCTTTCGAGGAAGTAAAGGAAGGAAAGCTGTTGTTTCGGCTAGGAAGCTCGC
GGCTGGAATCTGGAGATTGCAACTGCAGGAGGCGGTCGCTAGTGAAGGGAGGAATGGCGGCCACCGGAGGACGGAGGATCTCTTAGGGTTTCAGCCTCGCACTGGTCATT
CTGGCGCTCCAGCTCTCCATCCTGATGATAAGATAGCATTTAATTCTGAAATGAATGATCTATTACATAGTCCGCGTTCTGTCTCGGATTCAAGAAATGGGCTATTGTGC
AAGTTGGAGCCCTCATTTCGATATTCAAACTCAGCAATGGAGGGAGCAACAAAGTGGGAACCTGCATGCCTGAAAACACCAGTTGAAGCACGTCAGATTTACAACCAAAT
GAGACTTCTTGATCAACAAGGTGCTATTTCGGCCCTTTCTGTGCTTGAAGCTGAACTAGAGCAGGCCCACTTGCGGATTGAGGAACTTCAAGCTGAGCGTAATGCTTCAA
AAAAGAAGCTCGAATACTTCTTGAGGAAAGTTGGTGATGAAAAAGCCTTATGGCGGAGCAGAGAGCATGAGAAAGTAAGGGCATTTATTGATGATATAAAGGCTGAGCTG
AACAGAGAAAAGAAAACCAGACAAAGGTTGGAAATGATCAATTCGAAATTGGTAAACGAGTTGGCCGATGCTAAGCTATCTGTAAAGCGGCTTATGCAAGACTGTGAAAA
AGAAAGGAAGGAAAGATCCTTGGTTGAGGAAGTATGTGATGAACTCGCAAAGGAAATTGGTGAAGACAAAGCAAGAATAGAAGCATTGAAGAGAGAATCCATGAAATTGA
GAGACGAAGTGGACGAAGAACGAAGAATGTTACAGATGGCTGAGGTCTGGCGTGAGGAACGTGTTCAAATGAAGCTGGTTGATGCCAAAGTCGCACTTGAAGAGAAATAC
TCACAGATGTGTAATCTAGTAGCTGACCTTGAAGACTTCTTAAGGTCGAAAAGTGAAACCTCAGATGTATCAGAGATGAAAAAAGCTTTGCTACTTAGAGAGGCTGCTGC
TACTGTGAATATTCAAGATGTCAAAGAATTTGTTTATGAACCTTCAAACCCAGATGATATCTTCTCTGTGTTTGAAGATGTCAATTTTGGAGAATCAAATGAGAGGGAAA
TGGGACAGTGCGTTACTTACAGCCCACCTAATCATGCTTCTAAACTCCAGACTGTGGGTCTTGAAGCTAATGTGACCGATAGGATTGGCATCCAGAGACAAACGAATTCC
TTTATTGCTCATAACGGTGATATCGAAGAAGATGAGAGTGGATGGGAAACTGTGAGTCATCTAGAGGATCAGGGCTCAAGCAATTCCCCAGAAGAGAGTATTGCATCGGT
CAAAAAGAATCGTCGAGAGAGTAATGCTTCGGTGAGCGGGACAGAATGGGAAGGAAATGGAGGTGGAGACTCACCCTCTACCGAAATCAGCGAAGTCTGCTCGGTTCCAT
CAAAGCAATTAAAGAAGATGTCATCCATTGCACGACTTTGGAAATCGTGCTCAAACAATGAAGGATACAAACTAATCTCACTTGAGGGCATAAATGCCAGGCTCTCAAAT
GGAAGATTATCAAGTGCAAGTATCGTATCCGCAGATGGGGGCTCGGTGAGAAGTGGTATTAGTCCTCCAGAATTAGCAGAACAGTGGAGCTCTCCTGACTCGGGCAATGG
TCATGCAACACGAGGGAAGAAAGGATGCATCCCTCGCAACACAATCAAGGGTAGTTTAAAGGCAAAGCTCTTGGAGGCAAGAATGGAAAGCCATAAGGTCCAGTTGCGCC
AGAAGAGCTTTCAACTGGCTTCGATACCATTTCATCAGGCTCAGCTCCCAGCGGTAGTCCCGGACTCTGTCGCAAAGGGTAAAGTCGTAGGTCTAATCCTAATTGCCAAA
CGAGGAGACGGCAGTGTACACTGTCAACACCGATTGGAGACAGCCATCTTCGATGGAATGGGATGTGTTGGTTCCAAAGTTTCCATTTATAAGCTCATGAAATTCATTGT
CATGCTTAGAGAAAAACGAAACAGACTTAACTTTGTTGATATGGGATTTATAAAAATACAATTTGATATCCATCCCACTGTTGAACTAGCTACCAAAAAGAAAAGAAAAA
GTAATGATGTTATTGTGTTTGCACGTGCTCCATGTGCAAGAAGCATATCTGATTGTCGGTATCATTTGCAATATTCAAAGAACACAGCACTGAGTTGGAGCGACAACAGT
CACTTTTCAGCTCTCCACCGACGCTCTAACCACTACTGTCTACTGTTCAAGATCCACCTCTTCTTAAGTTTAAACCCATTTCCCATTCAGGTCTTTTTGAGTTTTGACTT
TGGGAATTCTGCAACTTGGGCTTCAATTTCTCCACTGGATTTGGTTGTTTTGCGATCTGATTTGGATTTGAACTGCTGGGTTGACTTGGGTAGAAGAAGGGGAAAAAAAA
GGAAGAACATTATGTCGTGGAAGAGCAGAGGAGGGTTTGAGCCAAGTACGAGGACCTCTGTTTCTCGTAAATGGACTTTTCTGTTTTGTATTGGATGCTTTTGTGCTGGG
ATGCTCTTCTCTGATAGAATGTGGGCTGTGCCTGAAGTTGAAAATATGCCAGGAGGGTCAACAGGTTCTGAAGAAGAGAAGTTAAAGATGGTATCAGAGGGTTGTAACTC
ATCAAATATATTTTTATACCATAGATTACAAAACACTTATCATCTATTTATGTCTGTTATTAGTACCACACCGGGTGGTATTCTCGATAGAGCTATTGAAGCCGAGGACA
AAAGACACGGTGATTTTATGAGGCTGGATCATGTCGAAGGTTACCTCGAATTGTCAGCCAAGACGAAATCTTACTTTGCCACCGCTGTTGCATTGTGGGATGCTGATTTT
TATGTGAAGGTTGACGATGATGTGCATGTCAACATAGCAACACTTGGATCAACATTAGCTAGACATCGGTCGAAGTCCCGGGTGTACATTGGATGCATGAAATCTGGTCC
CGTGCTTGCACAAAAAGGAGTCAGATACCATGAACCCGAGTACTGGAAATTTGGGGAAGAAGGAAACAAGTACTTCCGTCACGCCACTGGGCAGCTTTATGCCATATCAA
AGGATTTAGCAACGTACATATCAATCAACCAGCATATACTTCACAAGTATGCGAACGAGGACGTTTCGTTGGGTTCTTGGTTTATCGGATTGGATGCGGAGCATATCGAT
GACCGGAGATTATGTTGTGGCACCCCACCTGATTGTGAGTGGAAGGCTCAGGCAGGAAACATTTGCATTGCGTCGTTCGACTGGAGTTGCAGCGGGATATGCAAGTCGGC
GGAGCGAATGAAGGAGGTTCACCGCCGCTGCGGAGAGGGGGAGAATGCATTGTTGAGTGCAGTTTTCTGATAAGCAACAGTTTTTTTTTCTTACAAGCTTAGAAGGAAAC
ATGAAGAGAGAATTCCATGGGGTTCTCAAGAGATATTTATTCAAAATGTGGAATTAAAGATGGATTAGTTTGAATTTGAGAAAGAAAAAATAGGCATTTTGTTGGGATTT
TTTAATATGAGAGTAAATAGATAGTAATTGTAAAATACCATTCTTTCAGCTCATAGGGGGAGAGAGGCTTTTCTCATGCCTGAGTCTATGTGGCTAATCAATCTCCATAA
GAAGTGCTGCTATTGTTTTCACATGAACCATTTTATAGTTCATCTACTTCAGCTCTGTAATTTTTCTACTTTATTTTAGGTTGTAAATTATATATTGCTTTTATACCTAT
TTCATTTTGGTAAATTCTATATAAATATGTCCCAAATTATTAATCATGAAACTTGG
Protein sequenceShow/hide protein sequence
MPCRFPHHHLQTKSSHTLFFFFFSLTILHPSSPASIVSAKKPQLPAPFPPTLNQLPDLDLHYRPGSSRATRRRIRSPARVRRVVAPVGKRSRPETPLLKWKIEDPGSEAN
GVQQEEDEKKFPLESSQRGRFRGSKGRKAVVSARKLAAGIWRLQLQEAVASEGRNGGHRRTEDLLGFQPRTGHSGAPALHPDDKIAFNSEMNDLLHSPRSVSDSRNGLLC
KLEPSFRYSNSAMEGATKWEPACLKTPVEARQIYNQMRLLDQQGAISALSVLEAELEQAHLRIEELQAERNASKKKLEYFLRKVGDEKALWRSREHEKVRAFIDDIKAEL
NREKKTRQRLEMINSKLVNELADAKLSVKRLMQDCEKERKERSLVEEVCDELAKEIGEDKARIEALKRESMKLRDEVDEERRMLQMAEVWREERVQMKLVDAKVALEEKY
SQMCNLVADLEDFLRSKSETSDVSEMKKALLLREAAATVNIQDVKEFVYEPSNPDDIFSVFEDVNFGESNEREMGQCVTYSPPNHASKLQTVGLEANVTDRIGIQRQTNS
FIAHNGDIEEDESGWETVSHLEDQGSSNSPEESIASVKKNRRESNASVSGTEWEGNGGGDSPSTEISEVCSVPSKQLKKMSSIARLWKSCSNNEGYKLISLEGINARLSN
GRLSSASIVSADGGSVRSGISPPELAEQWSSPDSGNGHATRGKKGCIPRNTIKGSLKAKLLEARMESHKVQLRQKSFQLASIPFHQAQLPAVVPDSVAKGKVVGLILIAK
RGDGSVHCQHRLETAIFDGMGCVGSKVSIYKLMKFIVMLREKRNRLNFVDMGFIKIQFDIHPTVELATKKKRKSNDVIVFARAPCARSISDCRYHLQYSKNTALSWSDNS
HFSALHRRSNHYCLLFKIHLFLSLNPFPIQVFLSFDFGNSATWASISPLDLVVLRSDLDLNCWVDLGRRRGKKRKNIMSWKSRGGFEPSTRTSVSRKWTFLFCIGCFCAG
MLFSDRMWAVPEVENMPGGSTGSEEEKLKMVSEGCNSSNIFLYHRLQNTYHLFMSVISTTPGGILDRAIEAEDKRHGDFMRLDHVEGYLELSAKTKSYFATAVALWDADF
YVKVDDDVHVNIATLGSTLARHRSKSRVYIGCMKSGPVLAQKGVRYHEPEYWKFGEEGNKYFRHATGQLYAISKDLATYISINQHILHKYANEDVSLGSWFIGLDAEHID
DRRLCCGTPPDCEWKAQAGNICIASFDWSCSGICKSAERMKEVHRRCGEGENALLSAVF