; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr016897 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr016897
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionHistone acetyltransferase
Genome locationtig00153016:250901..278844
RNA-Seq ExpressionSgr016897
SyntenySgr016897
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0016573 - histone acetylation (biological process)
GO:0046872 - metal ion binding (molecular function)
GO:0004402 - histone acetyltransferase activity (molecular function)
InterPro domainsIPR040706 - MYST, zinc finger domain
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR034732 - Extended PHD (ePHD) domain
IPR025995 - RNA binding activity-knot of a chromodomain
IPR019787 - Zinc finger, PHD-finger
IPR019786 - Zinc finger, PHD-type, conserved site
IPR016197 - Chromo-like domain superfamily
IPR016181 - Acyl-CoA N-acyltransferase
IPR013083 - Zinc finger, RING/FYVE/PHD-type
IPR011011 - Zinc finger, FYVE/PHD-type
IPR002717 - Histone acetyltransferase domain, MYST-type
IPR001965 - Zinc finger, PHD-type
IPR000953 - Chromo/chromo shadow domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022141725.1 uncharacterized protein LOC111012014 isoform X2 [Momordica charantia]0.0e+0087.58Show/hide
Query:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK
        MTRDRC LGKKMMGRGADGGCGTEERPCP+SRVPNRIPLTQTQ   EKL TVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFL RHSDGKK
Subjt:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK

Query:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE
        R++KSNSGADKKSSRQGERSRGSNIWVETEEYFRDL L DVDTLRAASSFS LVARKCF+IPS GN+P+ NVGGSENV+++N DG +VK EV DDRLCSE
Subjt:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE

Query:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC
        M + EANGSPP+DKGCLNLD S G+EWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCS CHVVVHH C
Subjt:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC

Query:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------
        YGIREKV GSW CSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNV SGFS+EFAHLFC QW+PEVYIEDLTQM PVMN+G                   
Subjt:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------

Query:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK
               GTCRTSFHPICAREA+HRMEVWAKYGCDNVELRAFCSKHSE RDF+SSGQ QD SEAVN +SYVVNHLPVTLS+NRPQKL GRRN+DNLL+CK
Subjt:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK

Query:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT---ADNLVPDLKSKI
        EASD NSGKLGDGEL DIGSTD  LNAD V THKSTIQGVED NPLDSLKF+SIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT   ADNLVPDLKSKI
Subjt:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT---ADNLVPDLKSKI

Query:  VRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASE
        VRWLKNHAYIG LQKNLRVKLKSAVLSKAVVG ADRSDSLSVPES+NSDLIADKLV +RRKTKSN+SHLKNDEIKCSSEETLGG GLAM SD+LDQQ SE
Subjt:  VRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASE

Query:  EPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLL----
        EPGDSNKECIQD+ EKHLNEHDNSRDSSSRIFPNDVG TV+PLNCSL E D LEAAVSGH SSI A HG  GE+P+S FHPYVQEK+SH LDG+LL    
Subjt:  EPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLL----

Query:  ----NVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPK
            ++  GEMSRW+ASSNAGVCCDH+YQ LECNDV C+SGGFN KQLVNKKI  I+KLSPEDEIEGEIIFYQHRLLANAVSRK+FTDHLIC+VAKSLPK
Subjt:  ----NVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPK

Query:  ELDESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARS
        E+DE+RSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK+         LMPRAKETLTKVALPKTSLESDF KEHARS
Subjt:  ELDESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARS

Query:  CDICRRPETILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFES
        CDICRRPETILKPILVCSSCKVAVHLDCY  VKESSGPWCCELCEELS SRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFES
Subjt:  CDICRRPETILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFES

Query:  TFRRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIR
        T+RRGQANPV GMETVSKGVDSCYICHRKHGVCLKCNYGHC+STFHPSCARSAGC+MTVKTSGGKLQHKAYCEKHSSEQRAKAENQ HGIEELNRVKQIR
Subjt:  TFRRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIR

Query:  VELERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPL
        VELERLRLLC+RIIKREKIKRDLVLCSHDVLAFKRDHVARSVLV TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDD+TVD TVSIKHWNKVPLPL
Subjt:  VELERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPL

Query:  DIEQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQE
        D EQKT+DDSSTSQN FS KFAERAQFSGKQIPQRSS TTSR+L+DVGGLR KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQE
Subjt:  DIEQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQE

Query:  T
        T
Subjt:  T

XP_022141727.1 uncharacterized protein LOC111012014 isoform X3 [Momordica charantia]0.0e+0087.71Show/hide
Query:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK
        MTRDRC LGKKMMGRGADGGCGTEERPCP+SRVPNRIPLTQTQ   EKL TVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFL RHSDGKK
Subjt:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK

Query:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE
        R++KSNSGADKKSSRQGERSRGSNIWVETEEYFRDL L DVDTLRAASSFS LVARKCF+IPS GN+P+ NVGGSENV+++N DG +VK EV DDRLCSE
Subjt:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE

Query:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC
        M + EANGSPP+DKGCLNLD S G+EWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCS CHVVVHH C
Subjt:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC

Query:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------
        YGIREKV GSW CSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNV SGFS+EFAHLFC QW+PEVYIEDLTQM PVMN+G                   
Subjt:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------

Query:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK
               GTCRTSFHPICAREA+HRMEVWAKYGCDNVELRAFCSKHSE RDF+SSGQ QD SEAVN +SYVVNHLPVTLS+NRPQKL GRRN+DNLL+CK
Subjt:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK

Query:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRW
        EASD NSGKLGDGEL DIGSTD  LNAD V THKSTIQGVED NPLDSLKF+SIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRW
Subjt:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRW

Query:  LKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASEEPG
        LKNHAYIG LQKNLRVKLKSAVLSKAVVG ADRSDSLSVPES+NSDLIADKLV +RRKTKSN+SHLKNDEIKCSSEETLGG GLAM SD+LDQQ SEEPG
Subjt:  LKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASEEPG

Query:  DSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLL-------
        DSNKECIQD+ EKHLNEHDNSRDSSSRIFPNDVG TV+PLNCSL E D LEAAVSGH SSI A HG  GE+P+S FHPYVQEK+SH LDG+LL       
Subjt:  DSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLL-------

Query:  -NVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELD
         ++  GEMSRW+ASSNAGVCCDH+YQ LECNDV C+SGGFN KQLVNKKI  I+KLSPEDEIEGEIIFYQHRLLANAVSRK+FTDHLIC+VAKSLPKE+D
Subjt:  -NVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELD

Query:  ESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDI
        E+RSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK+         LMPRAKETLTKVALPKTSLESDF KEHARSCDI
Subjt:  ESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDI

Query:  CRRPETILKPILVCSSCK-VAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTF
        CRRPETILKPILVCSSCK VAVHLDCY  VKESSGPWCCELCEELS SRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFEST+
Subjt:  CRRPETILKPILVCSSCK-VAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTF

Query:  RRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVE
        RRGQANPV GMETVSKGVDSCYICHRKHGVCLKCNYGHC+STFHPSCARSAGC+MTVKTSGGKLQHKAYCEKHSSEQRAKAENQ HGIEELNRVKQIRVE
Subjt:  RRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVE

Query:  LERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDI
        LERLRLLC+RIIKREKIKRDLVLCSHDVLAFKRDHVARSVLV TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDD+TVD TVSIKHWNKVPLPLD 
Subjt:  LERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDI

Query:  EQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
        EQKT+DDSSTSQN FS KFAERAQFSGKQIPQRSS TTSR+L+DVGGLR KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
Subjt:  EQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET

XP_022141729.1 uncharacterized protein LOC111012014 isoform X5 [Momordica charantia]0.0e+0088.09Show/hide
Query:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK
        MTRDRC LGKKMMGRGADGGCGTEERPCP+SRVPNRIPLTQTQ   EKL TVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFL RHSDGKK
Subjt:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK

Query:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE
        R++KSNSGADKKSSRQGERSRGSNIWVETEEYFRDL L DVDTLRAASSFS LVARKCF+IPS GN+P+ NVGGSENV+++N DG +VK EV DDRLCSE
Subjt:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE

Query:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC
        M + EANGSPP+DKGCLNLD S G+EWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCS CHVVVHH C
Subjt:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC

Query:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------
        YGIREKV GSW CSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNV SGFS+EFAHLFC QW+PEVYIEDLTQM PVMN+G                   
Subjt:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------

Query:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK
               GTCRTSFHPICAREA+HRMEVWAKYGCDNVELRAFCSKHSE RDF+SSGQ QD SEAVN +SYVVNHLPVTLS+NRPQKL GRRN+DNLL+CK
Subjt:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK

Query:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT---ADNLVPDLKSKI
        EASD NSGKLGDGEL DIGSTD  LNAD V THKSTIQGVED NPLDSLKF+SIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT   ADNLVPDLKSKI
Subjt:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT---ADNLVPDLKSKI

Query:  VRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASE
        VRWLKNHAYIG LQKNLRVKLKSAVLSKAVVG ADRSDSLSVPES+NSDLIADKLV +RRKTKSN+SHLKNDEIKCSSEETLGG GLAM SD+LDQQ SE
Subjt:  VRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASE

Query:  EPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLLNVSE
        EPGDSNKECIQD+ EKHLNEHDNSRDSSSRIFPNDVG TV+PLNCSL E D LEAAVSGH SSI A HG  GE+P+S FHPYVQEK+SH LDG+LLN+  
Subjt:  EPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLLNVSE

Query:  GEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRST
        GEMSRW+ASSNAGVCCDH+YQ LECNDV C+SGGFN KQLVNKKI  I+KLSPEDEIEGEIIFYQHRLLANAVSRK+FTDHLIC+VAKSLPKE+DE+RST
Subjt:  GEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRST

Query:  RWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDICRRPE
        RWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK+         LMPRAKETLTKVALPKTSLESDF KEHARSCDICRRPE
Subjt:  RWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDICRRPE

Query:  TILKPILVCSSCK-VAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQA
        TILKPILVCSSCK VAVHLDCY  VKESSGPWCCELCEELS SRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFEST+RRGQA
Subjt:  TILKPILVCSSCK-VAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQA

Query:  NPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLR
        NPV GMETVSKGVDSCYICHRKHGVCLKCNYGHC+STFHPSCARSAGC+MTVKTSGGKLQHKAYCEKHSSEQRAKAENQ HGIEELNRVKQIRVELERLR
Subjt:  NPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLR

Query:  LLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTE
        LLC+RIIKREKIKRDLVLCSHDVLAFKRDHVARSVLV TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDD+TVD TVSIKHWNKVPLPLD EQKT+
Subjt:  LLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTE

Query:  DDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
        DDSSTSQN FS KFAERAQFSGKQIPQRSS TTSR+L+DVGGLR KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
Subjt:  DDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET

XP_022141730.1 uncharacterized protein LOC111012014 isoform X6 [Momordica charantia]0.0e+0088.16Show/hide
Query:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK
        MTRDRC LGKKMMGRGADGGCGTEERPCP+SRVPNRIPLTQTQ   EKL TVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFL RHSDGKK
Subjt:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK

Query:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE
        R++KSNSGADKKSSRQGERSRGSNIWVETEEYFRDL L DVDTLRAASSFS LVARKCF+IPS GN+P+ NVGGSENV+++N DG +VK EV DDRLCSE
Subjt:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE

Query:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC
        M + EANGSPP+DKGCLNLD S G+EWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCS CHVVVHH C
Subjt:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC

Query:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------
        YGIREKV GSW CSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNV SGFS+EFAHLFC QW+PEVYIEDLTQM PVMN+G                   
Subjt:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------

Query:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK
               GTCRTSFHPICAREA+HRMEVWAKYGCDNVELRAFCSKHSE RDF+SSGQ QD SEAVN +SYVVNHLPVTLS+NRPQKL GRRN+DNLL+CK
Subjt:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK

Query:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT---ADNLVPDLKSKI
        EASD NSGKLGDGEL DIGSTD  LNAD V THKSTIQGVED NPLDSLKF+SIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT   ADNLVPDLKSKI
Subjt:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT---ADNLVPDLKSKI

Query:  VRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASE
        VRWLKNHAYIG LQKNLRVKLKSAVLSKAVVG ADRSDSLSVPES+NSDLIADKLV +RRKTKSN+SHLKNDEIKCSSEETLGG GLAM SD+LDQQ SE
Subjt:  VRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASE

Query:  EPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLLNVSE
        EPGDSNKECIQD+ EKHLNEHDNSRDSSSRIFPNDVG TV+PLNCSL E D LEAAVSGH SSI A HG  GE+P+S FHPYVQEK+SH LDG+LLN+  
Subjt:  EPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLLNVSE

Query:  GEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRST
        GEMSRW+ASSNAGVCCDH+YQ LECNDV C+SGGFN KQLVNKKI  I+KLSPEDEIEGEIIFYQHRLLANAVSRK+FTDHLIC+VAKSLPKE+DE+RST
Subjt:  GEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRST

Query:  RWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDICRRPE
        RWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK+         LMPRAKETLTKVALPKTSLESDF KEHARSCDICRRPE
Subjt:  RWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDICRRPE

Query:  TILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQAN
        TILKPILVCSSCKVAVHLDCY  VKESSGPWCCELCEELS SRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFEST+RRGQAN
Subjt:  TILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQAN

Query:  PVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRL
        PV GMETVSKGVDSCYICHRKHGVCLKCNYGHC+STFHPSCARSAGC+MTVKTSGGKLQHKAYCEKHSSEQRAKAENQ HGIEELNRVKQIRVELERLRL
Subjt:  PVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRL

Query:  LCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTED
        LC+RIIKREKIKRDLVLCSHDVLAFKRDHVARSVLV TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDD+TVD TVSIKHWNKVPLPLD EQKT+D
Subjt:  LCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTED

Query:  DSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
        DSSTSQN FS KFAERAQFSGKQIPQRSS TTSR+L+DVGGLR KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
Subjt:  DSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET

XP_022141731.1 uncharacterized protein LOC111012014 isoform X7 [Momordica charantia]0.0e+0087.77Show/hide
Query:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK
        MTRDRC LGKKMMGRGADGGCGTEERPCP+SRVPNRIPLTQTQ   EKL TVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFL RHSDGKK
Subjt:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK

Query:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE
        R++KSNSGADKKSSRQGERSRGSNIWVETEEYFRDL L DVDTLRAASSFS LVARKCF+IPS GN+P+ NVGGSENV+++N DG +VK EV DDRLCSE
Subjt:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE

Query:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC
        M + EANGSPP+DKGCLNLD S G+EWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCS CHVVVHH C
Subjt:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC

Query:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------
        YGIREKV GSW CSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNV SGFS+EFAHLFC QW+PEVYIEDLTQM PVMN+G                   
Subjt:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------

Query:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK
               GTCRTSFHPICAREA+HRMEVWAKYGCDNVELRAFCSKHSE RDF+SSGQ QD SEAVN +SYVVNHLPVTLS+NRPQKL GRRN+DNLL+CK
Subjt:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK

Query:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRW
        EASD NSGKLGDGEL DIGSTD  LNAD V THKSTIQGVED NPLDSLKF+SIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRW
Subjt:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRW

Query:  LKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASEEPG
        LKNHAYIG LQKNLRVKLKSAVLSKAVVG ADRSDSLSVPES+NSDLIADKLV +RRKTKSN+SHLKNDEIKCSSEETLGG GLAM SD+LDQQ SEEPG
Subjt:  LKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASEEPG

Query:  DSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLL-------
        DSNKECIQD+ EKHLNEHDNSRDSSSRIFPNDVG TV+PLNCSL E D LEAAVSGH SSI A HG  GE+P+S FHPYVQEK+SH LDG+LL       
Subjt:  DSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLL-------

Query:  -NVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELD
         ++  GEMSRW+ASSNAGVCCDH+YQ LECNDV C+SGGFN KQLVNKKI  I+KLSPEDEIEGEIIFYQHRLLANAVSRK+FTDHLIC+VAKSLPKE+D
Subjt:  -NVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELD

Query:  ESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDI
        E+RSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK+         LMPRAKETLTKVALPKTSLESDF KEHARSCDI
Subjt:  ESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDI

Query:  CRRPETILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFR
        CRRPETILKPILVCSSCKVAVHLDCY  VKESSGPWCCELCEELS SRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFEST+R
Subjt:  CRRPETILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFR

Query:  RGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVEL
        RGQANPV GMETVSKGVDSCYICHRKHGVCLKCNYGHC+STFHPSCARSAGC+MTVKTSGGKLQHKAYCEKHSSEQRAKAENQ HGIEELNRVKQIRVEL
Subjt:  RGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVEL

Query:  ERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIE
        ERLRLLC+RIIKREKIKRDLVLCSHDVLAFKRDHVARSVLV TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDD+TVD TVSIKHWNKVPLPLD E
Subjt:  ERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIE

Query:  QKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
        QKT+DDSSTSQN FS KFAERAQFSGKQIPQRSS TTSR+L+DVGGLR KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
Subjt:  QKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET

TrEMBL top hitse value%identityAlignment
A0A6J1CIX1 uncharacterized protein LOC111012014 isoform X20.0e+0087.58Show/hide
Query:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK
        MTRDRC LGKKMMGRGADGGCGTEERPCP+SRVPNRIPLTQTQ   EKL TVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFL RHSDGKK
Subjt:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK

Query:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE
        R++KSNSGADKKSSRQGERSRGSNIWVETEEYFRDL L DVDTLRAASSFS LVARKCF+IPS GN+P+ NVGGSENV+++N DG +VK EV DDRLCSE
Subjt:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE

Query:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC
        M + EANGSPP+DKGCLNLD S G+EWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCS CHVVVHH C
Subjt:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC

Query:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------
        YGIREKV GSW CSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNV SGFS+EFAHLFC QW+PEVYIEDLTQM PVMN+G                   
Subjt:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------

Query:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK
               GTCRTSFHPICAREA+HRMEVWAKYGCDNVELRAFCSKHSE RDF+SSGQ QD SEAVN +SYVVNHLPVTLS+NRPQKL GRRN+DNLL+CK
Subjt:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK

Query:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT---ADNLVPDLKSKI
        EASD NSGKLGDGEL DIGSTD  LNAD V THKSTIQGVED NPLDSLKF+SIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT   ADNLVPDLKSKI
Subjt:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT---ADNLVPDLKSKI

Query:  VRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASE
        VRWLKNHAYIG LQKNLRVKLKSAVLSKAVVG ADRSDSLSVPES+NSDLIADKLV +RRKTKSN+SHLKNDEIKCSSEETLGG GLAM SD+LDQQ SE
Subjt:  VRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASE

Query:  EPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLL----
        EPGDSNKECIQD+ EKHLNEHDNSRDSSSRIFPNDVG TV+PLNCSL E D LEAAVSGH SSI A HG  GE+P+S FHPYVQEK+SH LDG+LL    
Subjt:  EPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLL----

Query:  ----NVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPK
            ++  GEMSRW+ASSNAGVCCDH+YQ LECNDV C+SGGFN KQLVNKKI  I+KLSPEDEIEGEIIFYQHRLLANAVSRK+FTDHLIC+VAKSLPK
Subjt:  ----NVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPK

Query:  ELDESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARS
        E+DE+RSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK+         LMPRAKETLTKVALPKTSLESDF KEHARS
Subjt:  ELDESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARS

Query:  CDICRRPETILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFES
        CDICRRPETILKPILVCSSCKVAVHLDCY  VKESSGPWCCELCEELS SRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFES
Subjt:  CDICRRPETILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFES

Query:  TFRRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIR
        T+RRGQANPV GMETVSKGVDSCYICHRKHGVCLKCNYGHC+STFHPSCARSAGC+MTVKTSGGKLQHKAYCEKHSSEQRAKAENQ HGIEELNRVKQIR
Subjt:  TFRRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIR

Query:  VELERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPL
        VELERLRLLC+RIIKREKIKRDLVLCSHDVLAFKRDHVARSVLV TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDD+TVD TVSIKHWNKVPLPL
Subjt:  VELERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPL

Query:  DIEQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQE
        D EQKT+DDSSTSQN FS KFAERAQFSGKQIPQRSS TTSR+L+DVGGLR KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQE
Subjt:  DIEQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQE

Query:  T
        T
Subjt:  T

A0A6J1CIX7 uncharacterized protein LOC111012014 isoform X60.0e+0088.16Show/hide
Query:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK
        MTRDRC LGKKMMGRGADGGCGTEERPCP+SRVPNRIPLTQTQ   EKL TVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFL RHSDGKK
Subjt:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK

Query:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE
        R++KSNSGADKKSSRQGERSRGSNIWVETEEYFRDL L DVDTLRAASSFS LVARKCF+IPS GN+P+ NVGGSENV+++N DG +VK EV DDRLCSE
Subjt:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE

Query:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC
        M + EANGSPP+DKGCLNLD S G+EWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCS CHVVVHH C
Subjt:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC

Query:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------
        YGIREKV GSW CSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNV SGFS+EFAHLFC QW+PEVYIEDLTQM PVMN+G                   
Subjt:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------

Query:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK
               GTCRTSFHPICAREA+HRMEVWAKYGCDNVELRAFCSKHSE RDF+SSGQ QD SEAVN +SYVVNHLPVTLS+NRPQKL GRRN+DNLL+CK
Subjt:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK

Query:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT---ADNLVPDLKSKI
        EASD NSGKLGDGEL DIGSTD  LNAD V THKSTIQGVED NPLDSLKF+SIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT   ADNLVPDLKSKI
Subjt:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT---ADNLVPDLKSKI

Query:  VRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASE
        VRWLKNHAYIG LQKNLRVKLKSAVLSKAVVG ADRSDSLSVPES+NSDLIADKLV +RRKTKSN+SHLKNDEIKCSSEETLGG GLAM SD+LDQQ SE
Subjt:  VRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASE

Query:  EPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLLNVSE
        EPGDSNKECIQD+ EKHLNEHDNSRDSSSRIFPNDVG TV+PLNCSL E D LEAAVSGH SSI A HG  GE+P+S FHPYVQEK+SH LDG+LLN+  
Subjt:  EPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLLNVSE

Query:  GEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRST
        GEMSRW+ASSNAGVCCDH+YQ LECNDV C+SGGFN KQLVNKKI  I+KLSPEDEIEGEIIFYQHRLLANAVSRK+FTDHLIC+VAKSLPKE+DE+RST
Subjt:  GEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRST

Query:  RWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDICRRPE
        RWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK+         LMPRAKETLTKVALPKTSLESDF KEHARSCDICRRPE
Subjt:  RWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDICRRPE

Query:  TILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQAN
        TILKPILVCSSCKVAVHLDCY  VKESSGPWCCELCEELS SRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFEST+RRGQAN
Subjt:  TILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQAN

Query:  PVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRL
        PV GMETVSKGVDSCYICHRKHGVCLKCNYGHC+STFHPSCARSAGC+MTVKTSGGKLQHKAYCEKHSSEQRAKAENQ HGIEELNRVKQIRVELERLRL
Subjt:  PVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRL

Query:  LCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTED
        LC+RIIKREKIKRDLVLCSHDVLAFKRDHVARSVLV TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDD+TVD TVSIKHWNKVPLPLD EQKT+D
Subjt:  LCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTED

Query:  DSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
        DSSTSQN FS KFAERAQFSGKQIPQRSS TTSR+L+DVGGLR KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
Subjt:  DSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET

A0A6J1CJK8 uncharacterized protein LOC111012014 isoform X50.0e+0088.09Show/hide
Query:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK
        MTRDRC LGKKMMGRGADGGCGTEERPCP+SRVPNRIPLTQTQ   EKL TVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFL RHSDGKK
Subjt:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK

Query:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE
        R++KSNSGADKKSSRQGERSRGSNIWVETEEYFRDL L DVDTLRAASSFS LVARKCF+IPS GN+P+ NVGGSENV+++N DG +VK EV DDRLCSE
Subjt:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE

Query:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC
        M + EANGSPP+DKGCLNLD S G+EWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCS CHVVVHH C
Subjt:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC

Query:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------
        YGIREKV GSW CSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNV SGFS+EFAHLFC QW+PEVYIEDLTQM PVMN+G                   
Subjt:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------

Query:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK
               GTCRTSFHPICAREA+HRMEVWAKYGCDNVELRAFCSKHSE RDF+SSGQ QD SEAVN +SYVVNHLPVTLS+NRPQKL GRRN+DNLL+CK
Subjt:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK

Query:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT---ADNLVPDLKSKI
        EASD NSGKLGDGEL DIGSTD  LNAD V THKSTIQGVED NPLDSLKF+SIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT   ADNLVPDLKSKI
Subjt:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLT---ADNLVPDLKSKI

Query:  VRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASE
        VRWLKNHAYIG LQKNLRVKLKSAVLSKAVVG ADRSDSLSVPES+NSDLIADKLV +RRKTKSN+SHLKNDEIKCSSEETLGG GLAM SD+LDQQ SE
Subjt:  VRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASE

Query:  EPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLLNVSE
        EPGDSNKECIQD+ EKHLNEHDNSRDSSSRIFPNDVG TV+PLNCSL E D LEAAVSGH SSI A HG  GE+P+S FHPYVQEK+SH LDG+LLN+  
Subjt:  EPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLLNVSE

Query:  GEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRST
        GEMSRW+ASSNAGVCCDH+YQ LECNDV C+SGGFN KQLVNKKI  I+KLSPEDEIEGEIIFYQHRLLANAVSRK+FTDHLIC+VAKSLPKE+DE+RST
Subjt:  GEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRST

Query:  RWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDICRRPE
        RWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK+         LMPRAKETLTKVALPKTSLESDF KEHARSCDICRRPE
Subjt:  RWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDICRRPE

Query:  TILKPILVCSSCK-VAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQA
        TILKPILVCSSCK VAVHLDCY  VKESSGPWCCELCEELS SRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFEST+RRGQA
Subjt:  TILKPILVCSSCK-VAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQA

Query:  NPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLR
        NPV GMETVSKGVDSCYICHRKHGVCLKCNYGHC+STFHPSCARSAGC+MTVKTSGGKLQHKAYCEKHSSEQRAKAENQ HGIEELNRVKQIRVELERLR
Subjt:  NPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLR

Query:  LLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTE
        LLC+RIIKREKIKRDLVLCSHDVLAFKRDHVARSVLV TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDD+TVD TVSIKHWNKVPLPLD EQKT+
Subjt:  LLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTE

Query:  DDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
        DDSSTSQN FS KFAERAQFSGKQIPQRSS TTSR+L+DVGGLR KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
Subjt:  DDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET

A0A6J1CK45 uncharacterized protein LOC111012014 isoform X70.0e+0087.77Show/hide
Query:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK
        MTRDRC LGKKMMGRGADGGCGTEERPCP+SRVPNRIPLTQTQ   EKL TVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFL RHSDGKK
Subjt:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK

Query:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE
        R++KSNSGADKKSSRQGERSRGSNIWVETEEYFRDL L DVDTLRAASSFS LVARKCF+IPS GN+P+ NVGGSENV+++N DG +VK EV DDRLCSE
Subjt:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE

Query:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC
        M + EANGSPP+DKGCLNLD S G+EWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCS CHVVVHH C
Subjt:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC

Query:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------
        YGIREKV GSW CSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNV SGFS+EFAHLFC QW+PEVYIEDLTQM PVMN+G                   
Subjt:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------

Query:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK
               GTCRTSFHPICAREA+HRMEVWAKYGCDNVELRAFCSKHSE RDF+SSGQ QD SEAVN +SYVVNHLPVTLS+NRPQKL GRRN+DNLL+CK
Subjt:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK

Query:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRW
        EASD NSGKLGDGEL DIGSTD  LNAD V THKSTIQGVED NPLDSLKF+SIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRW
Subjt:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRW

Query:  LKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASEEPG
        LKNHAYIG LQKNLRVKLKSAVLSKAVVG ADRSDSLSVPES+NSDLIADKLV +RRKTKSN+SHLKNDEIKCSSEETLGG GLAM SD+LDQQ SEEPG
Subjt:  LKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASEEPG

Query:  DSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLL-------
        DSNKECIQD+ EKHLNEHDNSRDSSSRIFPNDVG TV+PLNCSL E D LEAAVSGH SSI A HG  GE+P+S FHPYVQEK+SH LDG+LL       
Subjt:  DSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLL-------

Query:  -NVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELD
         ++  GEMSRW+ASSNAGVCCDH+YQ LECNDV C+SGGFN KQLVNKKI  I+KLSPEDEIEGEIIFYQHRLLANAVSRK+FTDHLIC+VAKSLPKE+D
Subjt:  -NVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELD

Query:  ESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDI
        E+RSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK+         LMPRAKETLTKVALPKTSLESDF KEHARSCDI
Subjt:  ESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDI

Query:  CRRPETILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFR
        CRRPETILKPILVCSSCKVAVHLDCY  VKESSGPWCCELCEELS SRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFEST+R
Subjt:  CRRPETILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFR

Query:  RGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVEL
        RGQANPV GMETVSKGVDSCYICHRKHGVCLKCNYGHC+STFHPSCARSAGC+MTVKTSGGKLQHKAYCEKHSSEQRAKAENQ HGIEELNRVKQIRVEL
Subjt:  RGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVEL

Query:  ERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIE
        ERLRLLC+RIIKREKIKRDLVLCSHDVLAFKRDHVARSVLV TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDD+TVD TVSIKHWNKVPLPLD E
Subjt:  ERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIE

Query:  QKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
        QKT+DDSSTSQN FS KFAERAQFSGKQIPQRSS TTSR+L+DVGGLR KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
Subjt:  QKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET

A0A6J1CKN4 uncharacterized protein LOC111012014 isoform X30.0e+0087.71Show/hide
Query:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK
        MTRDRC LGKKMMGRGADGGCGTEERPCP+SRVPNRIPLTQTQ   EKL TVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFL RHSDGKK
Subjt:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQT-HEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKK

Query:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE
        R++KSNSGADKKSSRQGERSRGSNIWVETEEYFRDL L DVDTLRAASSFS LVARKCF+IPS GN+P+ NVGGSENV+++N DG +VK EV DDRLCSE
Subjt:  RRKKSNSGADKKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSE

Query:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC
        M + EANGSPP+DKGCLNLD S G+EWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCS CHVVVHH C
Subjt:  MGLVEANGSPPQDKGCLNLDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNC

Query:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------
        YGIREKV GSW CSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNV SGFS+EFAHLFC QW+PEVYIEDLTQM PVMN+G                   
Subjt:  YGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG-------------------

Query:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK
               GTCRTSFHPICAREA+HRMEVWAKYGCDNVELRAFCSKHSE RDF+SSGQ QD SEAVN +SYVVNHLPVTLS+NRPQKL GRRN+DNLL+CK
Subjt:  -------GTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCK

Query:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRW
        EASD NSGKLGDGEL DIGSTD  LNAD V THKSTIQGVED NPLDSLKF+SIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRW
Subjt:  EASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRW

Query:  LKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASEEPG
        LKNHAYIG LQKNLRVKLKSAVLSKAVVG ADRSDSLSVPES+NSDLIADKLV +RRKTKSN+SHLKNDEIKCSSEETLGG GLAM SD+LDQQ SEEPG
Subjt:  LKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASEEPG

Query:  DSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLL-------
        DSNKECIQD+ EKHLNEHDNSRDSSSRIFPNDVG TV+PLNCSL E D LEAAVSGH SSI A HG  GE+P+S FHPYVQEK+SH LDG+LL       
Subjt:  DSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLL-------

Query:  -NVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELD
         ++  GEMSRW+ASSNAGVCCDH+YQ LECNDV C+SGGFN KQLVNKKI  I+KLSPEDEIEGEIIFYQHRLLANAVSRK+FTDHLIC+VAKSLPKE+D
Subjt:  -NVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELD

Query:  ESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDI
        E+RSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK+         LMPRAKETLTKVALPKTSLESDF KEHARSCDI
Subjt:  ESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKN---------LMPRAKETLTKVALPKTSLESDFCKEHARSCDI

Query:  CRRPETILKPILVCSSCK-VAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTF
        CRRPETILKPILVCSSCK VAVHLDCY  VKESSGPWCCELCEELS SRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFEST+
Subjt:  CRRPETILKPILVCSSCK-VAVHLDCYRTVKESSGPWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTF

Query:  RRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVE
        RRGQANPV GMETVSKGVDSCYICHRKHGVCLKCNYGHC+STFHPSCARSAGC+MTVKTSGGKLQHKAYCEKHSSEQRAKAENQ HGIEELNRVKQIRVE
Subjt:  RRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVE

Query:  LERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDI
        LERLRLLC+RIIKREKIKRDLVLCSHDVLAFKRDHVARSVLV TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDD+TVD TVSIKHWNKVPLPLD 
Subjt:  LERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDI

Query:  EQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
        EQKT+DDSSTSQN FS KFAERAQFSGKQIPQRSS TTSR+L+DVGGLR KSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET
Subjt:  EQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQET

SwissProt top hitse value%identityAlignment
Q8LI34 Putative MYST-like histone acetyltransferase 17.1e-21583.33Show/hide
Query:  MGSIDVQSTNADNGFASASADGRQKPV-AADGMTQTSNDPDGTAPEKPKELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGF-SDYE
        MGS++  ST  +NG A+A+A         A G    +   +G   E+     A     +S LPLEVGTRVMCRWRD K HPVKVIERRK   +   +DYE
Subjt:  MGSIDVQSTNADNGFASASADGRQKPV-AADGMTQTSNDPDGTAPEKPKELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGF-SDYE

Query:  YYVHYTEFNRRLDEWVKLDQLDLDSVETVVDEKVEDKVTGLKMTRHQKRKIDETHVE-GHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPP
        YYVHYTEFNRRLDEWVKL+QLDL++VET VDEKVEDK T LKMTRHQKRKIDETHVE GHEELDAASLREHEEFTKVKNIA IELGRYEI+TWYFSPFPP
Subjt:  YYVHYTEFNRRLDEWVKLDQLDLDSVETVVDEKVEDKVTGLKMTRHQKRKIDETHVE-GHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPP

Query:  EYNDTTKLYFCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMV
        EYND+ KL+FCEFCLNFMKRKEQLQRHM+KCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMV
Subjt:  EYNDTTKLYFCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMV

Query:  GYFSKEKHSEESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNT
        GYFSKEKHSEESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGL+SYRGYWTRVLL+ILKKHK NISIKELSDMTAIKA+DIL+T
Subjt:  GYFSKEKHSEESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNT

Query:  LQSLELIQYRKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQ
        LQSL+LIQYRKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPY+EQ
Subjt:  LQSLELIQYRKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQ

Q9D1P2 Histone acetyltransferase KAT83.3e-11150.72Show/hide
Query:  DGTAPEKPKELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEFNRRLDEWVKLDQLDL-DSVETVVDEKVEDKVTG
        +G A    +    T  R    + +E+G   +CR  D  +H  +VI+ R     G    E+YVHY  FNRRLDEWV  ++L L  +V+  V +  E  ++ 
Subjt:  DGTAPEKPKELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEFNRRLDEWVKLDQLDL-DSVETVVDEKVEDKVTG

Query:  L------KMTRHQKRKIDE-THVE-GHEELD---AASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLYFCEFCLNFMKRKEQLQRHMRK
        L      K+TR+QKRK DE  HV+  + E+D   AA  +EHE  TKVK +  I +G YEI+ WYFSPFP +Y    KL+ CE+CL +MK ++  + H+ +
Subjt:  L------KMTRHQKRKIDE-THVE-GHEELD---AASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLYFCEFCLNFMKRKEQLQRHMRK

Query:  CDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHSEESYNLACILTLPPYQRKGYG
        C  + PPG EIYR   +S++EVDGK +K+Y QNLC LAKLFLDHKTLY+DV+ F+FY+L E D +G H+VGYFSKEK S +  N+ACILTLPPYQR+GYG
Subjt:  CDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHSEESYNLACILTLPPYQRKGYG

Query:  KFLIAFSYELSKKEGKVGTPERPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQYRKGQHVICADPKVLDRHLKAA
        KFLIAFSYELSK E  VG+PE+PLSDLG +SYR YW+ VLL+IL+  +G +SIK+LS MT+I   DI++TLQSL +++Y KGQHVIC  PK+++ HLK+A
Subjt:  KFLIAFSYELSKKEGKVGTPERPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQYRKGQHVICADPKVLDRHLKAA

Query:  --GRGGLEVDVSKLIWTP
           +  + VD   L W P
Subjt:  --GRGGLEVDVSKLIWTP

Q9FLF7 Histone acetyltransferase of the MYST family 14.3e-22091.23Show/hide
Query:  ELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEFNRRLDEWVKLDQLDLDSVETVVDEKVEDKVTGLKMTRHQKRK
        E  A++KR   +LPLEVGTRVMC+WRDGKYHPVKVIERRK    G +DYEYYVHYTEFNRRLDEW+KL+QLDLDSVE  +DEKVEDKVT LKMTRHQKRK
Subjt:  ELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEFNRRLDEWVKLDQLDLDSVETVVDEKVEDKVTGLKMTRHQKRK

Query:  IDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLYFCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFE
        IDETHVEGHEELDAASLREHEEFTKVKNIATIELG+YEIETWYFSPFPPEYND  KL+FCEFCL+FMKRKEQLQRHMRKCDLKHPPGDEIYRS TLSMFE
Subjt:  IDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLYFCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFE

Query:  VDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHSEESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPE
        VDGKKNKVY QNLCYLAKLFLDHKTLYYDVDLFLFY+LCECDDRGCHMVGYFSKEKHSEE+YNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPE
Subjt:  VDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHSEESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPE

Query:  RPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQYRKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQ
        RPLSDLGLVSYRGYWTR+LLDILKKHKGNISIKELSDMTAIKAEDIL+TLQSLELIQYRKGQHVICADPKVLDRHLKAAGRGGL+VDVSK+IWTPY+EQ
Subjt:  RPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQYRKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQ

Q9H7Z6 Histone acetyltransferase KAT83.0e-11249.67Show/hide
Query:  ADNGFASASADGRQKPVAADGMTQTSND--PDGTAPEKPKELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEFNR
        A  G A+A A G    VA +G      +   +GTAP   +    T  R    + +E+G   +CR  D  +H  +VI+ R     G    E+YVHY  FNR
Subjt:  ADNGFASASADGRQKPVAADGMTQTSND--PDGTAPEKPKELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEFNR

Query:  RLDEWVKLDQLDL-DSVETVVDEKVEDKVTGL------KMTRHQKRKIDE-THVE-GHEELD---AASLREHEEFTKVKNIATIELGRYEIETWYFSPFP
        RLDEWV  ++L L  +V+  V +  E  ++ L      K+TR+QKRK DE  HV+  + E+D   AA  +EHE  TKVK +  I +G YEI+ WYFSPFP
Subjt:  RLDEWVKLDQLDL-DSVETVVDEKVEDKVTGL------KMTRHQKRKIDE-THVE-GHEELD---AASLREHEEFTKVKNIATIELGRYEIETWYFSPFP

Query:  PEYNDTTKLYFCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHM
         +Y    KL+ CE+CL +MK ++  + H+ +C  + PPG EIYR   +S++EVDGK +K+Y QNLC LAKLFLDHKTLY+DV+ F+FY+L E D +G H+
Subjt:  PEYNDTTKLYFCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHM

Query:  VGYFSKEKHSEESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILN
        VGYFSKEK S +  N+ACILTLPPYQR+GYGKFLIAFSYELSK E  VG+PE+PLSDLG +SYR YW+ VLL+IL+  +G +SIK+LS MT+I   DI++
Subjt:  VGYFSKEKHSEESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILN

Query:  TLQSLELIQYRKGQHVICADPKVLDRHLKAA--GRGGLEVDVSKLIWTP
        TLQSL +++Y KGQHVIC  PK+++ HLK+A   +  + VD   L W P
Subjt:  TLQSLELIQYRKGQHVICADPKVLDRHLKAA--GRGGLEVDVSKLIWTP

Q9LXD7 Histone acetyltransferase of the MYST family 22.1e-21983.9Show/hide
Query:  STNADNGFASASADGRQKPVAADGMTQTSNDPDGTAPEKP-KELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEF
        S N +    +      QKP A +G+  +   P    P++   E   ++KR   MLPLEVGTRVMCRWRDGK+HPVKVIERR++   G +DYEYYVHYTEF
Subjt:  STNADNGFASASADGRQKPVAADGMTQTSNDPDGTAPEKP-KELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEF

Query:  NRRLDEWVKLDQLDLDSVETVVDEKVEDKVTGLKMTRHQKRKIDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLY
        NRRLDEW +LDQLDLDSVE  VDEKVEDKVT LKMTRHQKRKIDETH+EGHEELDAASLREHEEFTKVKNI+TIELG+YEIETWYFSPFPPEYND  KL+
Subjt:  NRRLDEWVKLDQLDLDSVETVVDEKVEDKVTGLKMTRHQKRKIDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLY

Query:  FCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHS
        FCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVY QNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHS
Subjt:  FCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHS

Query:  EESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQY
        EE+YNLACILTLP YQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGL+SYRGYWTRVLL+ILKKHKGNISIKELSD+TAIKAEDIL+TLQSLELIQY
Subjt:  EESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQY

Query:  RKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQ
        RKGQHVICADPKVLDRHLKAAGRGGL+VD SKLIWTPY++Q
Subjt:  RKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQ

Arabidopsis top hitse value%identityAlignment
AT1G77800.1 PHD finger family protein2.6e-29743.44Show/hide
Query:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQTHEKLSTVNIDYYAQAQKALCERSPFDVAEE--TSASCVPTLPSRLGSFLGRHSDGK
        M  D+C   KKMMGRG DGGCG EE+P    R   R+ L +   +E + ++ ID+ AQA K L ERSPFDV E+  TS   VPTLP  L + L  HSD K
Subjt:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQTHEKLSTVNIDYYAQAQKALCERSPFDVAEE--TSASCVPTLPSRLGSFLGRHSDGK

Query:  KRRKKSNSGAD---KKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPS----SGNIPDTNVGGSENVNNENT---DGAIVK
        KR KKS+SGAD   KKSSRQG++ R  +IW+E E+YFR L   D++TL   +S   L +R CFSIPS    S +I       S    NE+    DG +++
Subjt:  KRRKKSNSGAD---KKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPS----SGNIPDTNVGGSENVNNENT---DGAIVK

Query:  H-------EVEDDRLCSEMGLVEANGSPPQDKGCLN-LDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDK
                ++ +  +  E  +V+  G      G  +  D S  +EW+LG RN++ LTSERPSKKRKLLG DAGL K+++A+PCEGN+ LC FC  G    
Subjt:  H-------EVEDDRLCSEMGLVEANGSPPQDKGCLN-LDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDK

Query:  GLNPLITCSSCHVVVHHNCYGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG
          + LI C+SC   VH  CYG+ E     W CSWC+ ++   DS +PCLLCPK+GG LKPV    E+G   EFAHLFCS W+PEVYIEDL +MEP++N  
Subjt:  GLNPLITCSSCHVVVHHNCYGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG

Query:  GTCRTSFHPIC---------------------------------------------AREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPS
        G   T    +C                                             A EA +R+EVW K+GCD VELRAFCSKHS   D + SG+  +  
Subjt:  GTCRTSFHPIC---------------------------------------------AREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPS

Query:  EAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCKEASDPNSGK--LGDGELADIGSTDPRLNADFVDTHKSTIQG--VEDANPLD-SLKFSSIMKK
        E+  ++S      P+    + P +  G  ++ N  +  +   P +G     + +L ++ S   + N    D  +S + G   ED   L  SL F  I+KK
Subjt:  EAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCKEASDPNSGK--LGDGELADIGSTDPRLNADFVDTHKSTIQG--VEDANPLD-SLKFSSIMKK

Query:  LIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRWLKNHAYIGTLQ--KNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTS
        LID GKV+VKDVA+EIGI PD L AKLT  +L+PDL  K+V+WL  HA++G+    KNL+ K   +VL K   G+   +   S PE  NS ++  K    
Subjt:  LIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRWLKNHAYIGTLQ--KNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTS

Query:  RRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASEEPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVS
               + H K+  I                    D    +    S+   +++A     N   N  +                LNC             
Subjt:  RRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASEEPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVS

Query:  GHNSSIRADHGKTGESPNSDFHPYVQEKMSHM-LDGKLLNVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEI
           + I  D       P  + H Y+ +++S +  +  L + ++ +++R   + +     +   +HL+        G     QL   +  GI+ LSPEDE+
Subjt:  GHNSSIRADHGKTGESPNSDFHPYVQEKMSHM-LDGKLLNVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEI

Query:  EGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK-----
        EGE+++YQ +LL  AVSRK+ +D+L+  VAK LP E+DE    RWD VL+N+YF ++REA+KQG+KE+R+K+AQAVLAAATAAAA SSR +S RK     
Subjt:  EGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK-----

Query:  -----------------NLMPRAKETLTKVAL--PKTSLES-----DFCKEHARSCDICRRPETILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCE
                         +L+P+ KE+L K+A+  P +   S     DF  E+ R+CDICRR ETI   I+VCSSCKVAVH+DCY+  KES+GPW CELC 
Subjt:  -----------------NLMPRAKETLTKVAL--PKTSLES-----DFCKEHARSCDICRRPETILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCE

Query:  ELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTF
        E      S  P  NF EK     EC LCGGTTGAFRK+++GQWVHAFCAEW  ESTFRRGQ NPV+GME+++K  D+C +C R +G C KC+YG+CQ+TF
Subjt:  ELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTF

Query:  HPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVH
        HPSCARSAG  M   T GGK  HKAYCEKHS EQ+AKAE+Q HG EEL  +K  RVELERLRLLC+RI+KREK+KR+L + SH++LA KRDH AR + V 
Subjt:  HPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVH

Query:  TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLV
         PF  PEVSS+SATTS+KGH +   S SEA+QRSDD+T+DSTV+ K   K PL +D +QKT DDS+TS++ FS K  ER   SGK +P++    +     
Subjt:  TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLV

Query:  DVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQE
        D        ++H ETF KELVMTSD+AS KN  LPK Y YVP D L ++K  NQ+
Subjt:  DVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQE

AT1G77800.2 PHD finger family protein1.1e-29843.71Show/hide
Query:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQTHEKLSTVNIDYYAQAQKALCERSPFDVAEE--TSASCVPTLPSRLGSFLGRHSDGK
        M  D+C   KKMMGRG DGGCG EE+P    R   R+ L +   +E + ++ ID+ AQA K L ERSPFDV E+  TS   VPTLP  L + L  HSD K
Subjt:  MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQTHEKLSTVNIDYYAQAQKALCERSPFDVAEE--TSASCVPTLPSRLGSFLGRHSDGK

Query:  KRRKKSNSGAD---KKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPS----SGNIPDTNVGGSENVNNENT---DGAIVK
        KR KKS+SGAD   KKSSRQG++ R  +IW+E E+YFR L   D++TL   +S   L +R CFSIPS    S +I       S    NE+    DG +++
Subjt:  KRRKKSNSGAD---KKSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPS----SGNIPDTNVGGSENVNNENT---DGAIVK

Query:  H-------EVEDDRLCSEMGLVEANGSPPQDKGCLN-LDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDK
                ++ +  +  E  +V+  G      G  +  D S  +EW+LG RN++ LTSERPSKKRKLLG DAGL K+++A+PCEGN+ LC FC  G    
Subjt:  H-------EVEDDRLCSEMGLVEANGSPPQDKGCLN-LDCSPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDK

Query:  GLNPLITCSSCHVVVHHNCYGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG
          + LI C+SC   VH  CYG+ E     W CSWC+ ++   DS +PCLLCPK+GG LKPV    E+G   EFAHLFCS W+PEVYIEDL +MEP++N  
Subjt:  GLNPLITCSSCHVVVHHNCYGIREKVNGSWSCSWCKQKDKTNDSTKPCLLCPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLG

Query:  GTCRTSFHPIC---------------------------------------------AREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPS
        G   T    +C                                             A EA +R+EVW K+GCD VELRAFCSKHS   D + SG+  +  
Subjt:  GTCRTSFHPIC---------------------------------------------AREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPS

Query:  EAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCKEASDPNSGK--LGDGELADIGSTDPRLNADFVDTHKSTIQG--VEDANPLD-SLKFSSIMKK
        E+  ++S      P+    + P +  G  ++ N  +  +   P +G     + +L ++ S   + N    D  +S + G   ED   L  SL F  I+KK
Subjt:  EAVNSDSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCKEASDPNSGK--LGDGELADIGSTDPRLNADFVDTHKSTIQG--VEDANPLD-SLKFSSIMKK

Query:  LIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRWLKNHAYIGTLQ--KNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTS
        LID GKV+VKDVA+EIGI PD L AKLT  +L+PDL  K+V+WL  HA++G+    KNL+ K   +VL K   G+   +   S PE  NS ++  K    
Subjt:  LIDQGKVNVKDVASEIGIPPDLLSAKLTADNLVPDLKSKIVRWLKNHAYIGTLQ--KNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTS

Query:  RRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASEEPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVS
               + H K+  I                    D    +    S+   +++A     N   N  +                LNC             
Subjt:  RRKTKSNISHLKNDEIKCSSEETLGGYGLAMQSDTLDQQASEEPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVS

Query:  GHNSSIRADHGKTGESPNSDFHPYVQEKMSHM-LDGKLLNVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEI
           + I  D       P  + H Y+ +++S +  +  L + ++ +++R   + +     +   +HL+        G     QL   +  GI+ LSPEDE+
Subjt:  GHNSSIRADHGKTGESPNSDFHPYVQEKMSHM-LDGKLLNVSEGEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEI

Query:  EGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK-----
        EGE+++YQ +LL  AVSRK+ +D+L+  VAK LP E+DE    RWD VL+N+YF ++REA+KQG+KE+R+K+AQAVLAAATAAAA SSR +S RK     
Subjt:  EGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRSTRWDAVLINQYFSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRK-----

Query:  -----------------NLMPRAKETLTKVAL--PKTSLES-----DFCKEHARSCDICRRPETILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCE
                         +L+P+ KE+L K+A+  P +   S     DF  E+ R+CDICRR ETI   I+VCSSCKVAVH+DCY+  KES+GPW CELC 
Subjt:  -----------------NLMPRAKETLTKVAL--PKTSLES-----DFCKEHARSCDICRRPETILKPILVCSSCKVAVHLDCYRTVKESSGPWCCELCE

Query:  ELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTF
        E      S  P  NF EK     EC LCGGTTGAFRK+++GQWVHAFCAEW  ESTFRRGQ NPV+GME+++K  D+C +C R +G C KC+YG+CQ+TF
Subjt:  ELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTF

Query:  HPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVH
        HPSCARSAG  M   T GGK  HKAYCEKHS EQ+AKAE+Q HG EEL  +K  RVELERLRLLC+RI+KREK+KR+L + SH++LA KRDH AR + V 
Subjt:  HPSCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVH

Query:  TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLV
         PF  PEVSS+SATTS+KGH +   S SEA+QRSDD+T+DSTV+ K   K PL +D +QKT DDS+TS++ FS K  ER   SGK +P R     S ++ 
Subjt:  TPFFLPEVSSESATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLV

Query:  DVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQE
        + G    K +KH ETF KELVMTSD+AS KN  LPK Y YVP D L ++K  NQ+
Subjt:  DVGGLRLKSRKHAETFQKELVMTSDQASMKNSLLPKQYLYVPADVLAKEKQVNQE

AT5G09740.1 histone acetyltransferase of the MYST family 21.5e-22083.9Show/hide
Query:  STNADNGFASASADGRQKPVAADGMTQTSNDPDGTAPEKP-KELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEF
        S N +    +      QKP A +G+  +   P    P++   E   ++KR   MLPLEVGTRVMCRWRDGK+HPVKVIERR++   G +DYEYYVHYTEF
Subjt:  STNADNGFASASADGRQKPVAADGMTQTSNDPDGTAPEKP-KELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEF

Query:  NRRLDEWVKLDQLDLDSVETVVDEKVEDKVTGLKMTRHQKRKIDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLY
        NRRLDEW +LDQLDLDSVE  VDEKVEDKVT LKMTRHQKRKIDETH+EGHEELDAASLREHEEFTKVKNI+TIELG+YEIETWYFSPFPPEYND  KL+
Subjt:  NRRLDEWVKLDQLDLDSVETVVDEKVEDKVTGLKMTRHQKRKIDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLY

Query:  FCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHS
        FCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVY QNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHS
Subjt:  FCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHS

Query:  EESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQY
        EE+YNLACILTLP YQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGL+SYRGYWTRVLL+ILKKHKGNISIKELSD+TAIKAEDIL+TLQSLELIQY
Subjt:  EESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQY

Query:  RKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQ
        RKGQHVICADPKVLDRHLKAAGRGGL+VD SKLIWTPY++Q
Subjt:  RKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQ

AT5G09740.2 histone acetyltransferase of the MYST family 27.2e-16267.35Show/hide
Query:  STNADNGFASASADGRQKPVAADGMTQTSNDPDGTAPEKP-KELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEF
        S N +    +      QKP A +G+  +   P    P++   E   ++KR   MLPLEVGTRVMCRWRDGK+HPVKVIERR++   G +DYEYYVHYTEF
Subjt:  STNADNGFASASADGRQKPVAADGMTQTSNDPDGTAPEKP-KELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEF

Query:  NRRLDEWVKLDQLDLDSVETVVDEKVEDKVTGLKMTRHQKRKIDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLY
        NRRLDEW +LDQLDLDSVE  VDEKVEDKVT LKMTRHQKRKIDETH+EGHEELDAASLREHEEFTKV                                
Subjt:  NRRLDEWVKLDQLDLDSVETVVDEKVEDKVTGLKMTRHQKRKIDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLY

Query:  FCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHS
                                                      KNKVY QNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHS
Subjt:  FCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFEVDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHS

Query:  EESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQY
        EE+YNLACILTLP YQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGL+SYRGYWTRVLL+ILKKHKGNISIKELSD+TAIKAEDIL+TLQSLELIQY
Subjt:  EESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQY

Query:  RKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQ
        RKGQHVICADPKVLDRHLKAAGRGGL+VD SKLIWTPY++Q
Subjt:  RKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQ

AT5G64610.1 histone acetyltransferase of the MYST family 13.1e-22191.23Show/hide
Query:  ELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEFNRRLDEWVKLDQLDLDSVETVVDEKVEDKVTGLKMTRHQKRK
        E  A++KR   +LPLEVGTRVMC+WRDGKYHPVKVIERRK    G +DYEYYVHYTEFNRRLDEW+KL+QLDLDSVE  +DEKVEDKVT LKMTRHQKRK
Subjt:  ELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEFNRRLDEWVKLDQLDLDSVETVVDEKVEDKVTGLKMTRHQKRK

Query:  IDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLYFCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFE
        IDETHVEGHEELDAASLREHEEFTKVKNIATIELG+YEIETWYFSPFPPEYND  KL+FCEFCL+FMKRKEQLQRHMRKCDLKHPPGDEIYRS TLSMFE
Subjt:  IDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLYFCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFE

Query:  VDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHSEESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPE
        VDGKKNKVY QNLCYLAKLFLDHKTLYYDVDLFLFY+LCECDDRGCHMVGYFSKEKHSEE+YNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPE
Subjt:  VDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHSEESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPE

Query:  RPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQYRKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQ
        RPLSDLGLVSYRGYWTR+LLDILKKHKGNISIKELSDMTAIKAEDIL+TLQSLELIQYRKGQHVICADPKVLDRHLKAAGRGGL+VDVSK+IWTPY+EQ
Subjt:  RPLSDLGLVSYRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQYRKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTAGAGACCGATGTCACCTGGGAAAGAAGATGATGGGTAGGGGTGCGGACGGAGGTTGTGGTACTGAGGAGAGGCCTTGTCCAGTGTCAAGAGTGCCGAACAGAAT
ACCATTGACCCAAACTCAAACACACGAGAAGCTGTCTACTGTGAACATTGATTATTACGCGCAGGCTCAGAAGGCTTTGTGTGAGCGCTCGCCATTTGATGTAGCGGAAG
AGACTTCAGCTTCTTGCGTGCCTACTTTGCCCAGTAGGCTGGGTAGCTTTTTGGGTAGACATTCGGATGGTAAAAAACGGCGGAAGAAATCCAATTCTGGGGCAGACAAG
AAGTCTTCTCGGCAAGGTGAAAGGTCGCGTGGTTCCAACATTTGGGTTGAGACAGAGGAATATTTTAGGGACTTGACGTTGTCTGATGTAGACACTTTGCGTGCAGCGTC
TTCCTTTAGTGGTTTAGTGGCTAGGAAATGTTTTTCAATTCCGTCTTCGGGAAATATTCCAGACACAAATGTAGGTGGCAGTGAGAATGTTAATAATGAGAATACGGATG
GGGCCATTGTTAAACACGAGGTTGAGGATGATCGTCTGTGCTCGGAGATGGGCTTGGTTGAAGCCAATGGTTCACCTCCGCAAGACAAGGGTTGTTTGAATCTGGATTGT
TCTCCCGGCATAGAATGGCTCTTAGGTTGTAGGAATAAGGTGTCTTTAACTTCGGAGCGGCCTTCAAAGAAACGGAAGCTTCTAGGTGGTGACGCAGGATTGGAAAAAGT
TCTGATTGCTTCTCCTTGCGAAGGAAATTCATCTTTATGTCACTTTTGCTCTAAGGGCGATACAGACAAAGGGCTAAATCCGTTAATTACTTGTAGTAGTTGTCATGTGG
TGGTTCACCATAATTGCTATGGTATAAGAGAGAAAGTTAATGGATCCTGGTCCTGTTCTTGGTGTAAGCAGAAGGATAAAACCAATGATTCCACGAAGCCTTGTTTGCTC
TGTCCCAAGCAGGGTGGTGCTCTGAAACCTGTACATAAGAATGTTGAGAGTGGATTTTCCATCGAATTCGCCCATTTGTTTTGCAGTCAGTGGATACCAGAGGTATATAT
TGAAGACTTGACGCAGATGGAGCCGGTTATGAATCTAGGAGGAACTTGCAGAACCTCCTTTCATCCAATTTGTGCAAGGGAGGCTAGACATAGAATGGAGGTCTGGGCAA
AATATGGGTGTGATAATGTTGAACTTCGAGCTTTCTGTTCCAAGCACTCAGAAGCACGAGATTTTAAGAGCAGTGGTCAATTTCAAGACCCTTCTGAAGCTGTTAACAGT
GACTCATATGTGGTCAACCACCTTCCAGTGACATTGTCAATTAACAGACCGCAGAAATTAGCTGGGAGAAGGAATGTTGACAATCTTTTAGTCTGTAAGGAAGCTTCGGA
TCCCAACTCTGGAAAATTGGGCGATGGTGAATTAGCTGATATTGGATCCACGGATCCCCGTTTGAATGCTGACTTTGTCGATACACACAAGTCTACTATCCAGGGGGTTG
AGGATGCCAACCCACTCGACTCCCTTAAATTTTCATCAATTATGAAAAAGTTGATTGATCAAGGAAAAGTCAATGTTAAAGATGTAGCCTCAGAAATTGGCATACCGCCT
GATTTATTAAGTGCAAAACTTACTGCTGACAACCTTGTCCCTGACTTGAAATCCAAAATTGTCAGATGGCTAAAAAATCATGCTTATATTGGTACCCTGCAGAAAAATTT
GAGAGTCAAACTTAAATCGGCTGTTTTATCCAAGGCTGTTGTTGGAACTGCTGATCGTTCTGACTCGTTATCAGTACCTGAATCTGACAATTCAGATCTTATTGCTGATA
AGTTGGTTACATCTCGAAGAAAAACTAAAAGCAATATAAGCCATTTGAAGAATGATGAAATCAAATGTTCATCTGAAGAAACTTTAGGTGGCTATGGTCTGGCGATGCAA
AGTGATACCTTGGATCAGCAAGCTAGTGAAGAACCAGGTGATTCAAATAAGGAATGTATTCAAGACGCTGGTGAGAAGCATTTAAACGAGCATGATAATTCTCGAGATTC
TTCTTCAAGGATTTTTCCAAATGATGTAGGCATTACAGTTAAGCCCTTAAATTGTAGCCTCGTAGAAGGTGATCATTTAGAGGCTGCCGTTTCTGGCCATAATTCTTCAA
TAAGGGCAGATCATGGAAAGACTGGAGAGTCCCCTAATTCAGATTTTCATCCATATGTCCAAGAGAAGATGTCACATATGTTGGATGGAAAGCTTCTAAATGTGTCTGAG
GGAGAGATGTCACGGTGGCGAGCATCTTCCAATGCTGGCGTTTGCTGTGATCACCAATACCAGCATTTAGAGTGCAATGACGTGAGCTGCAAATCTGGTGGATTTAATTC
AAAGCAGCTAGTCAATAAAAAAATAGGTGGAATTATAAAGTTGTCTCCAGAAGATGAAATTGAAGGAGAAATTATATTTTATCAGCATAGATTACTTGCAAATGCAGTTT
CAAGAAAGCGGTTCACTGATCATTTAATTTGTAATGTTGCTAAGAGTCTTCCAAAGGAGCTTGATGAATCAAGAAGTACTAGATGGGATGCTGTGCTTATTAATCAGTAT
TTTAGTGAGCTTAGAGAAGCAAAGAAACAGGGTAAGAAAGAGAGAAGACACAAGGAAGCACAGGCTGTACTAGCTGCTGCGACTGCTGCTGCTGCAGCCTCTTCTAGGAT
GTCATCATTCAGAAAAAATTTGATGCCTCGTGCCAAAGAAACGCTTACAAAGGTTGCTCTCCCAAAGACTTCGTTAGAGTCAGACTTCTGTAAAGAACATGCTAGATCCT
GTGACATCTGCAGGCGGCCAGAGACAATATTAAAGCCAATTTTAGTCTGCTCTAGTTGCAAGGTTGCAGTGCATCTGGATTGCTATCGTACTGTTAAAGAGTCCTCAGGT
CCATGGTGTTGTGAATTATGTGAAGAATTGTCATTATCAAGAGGCTCTGGAGCTCCGGTTGTCAATTTTTGGGAGAAATCATATTTTGTTGCAGAATGTGGTCTATGTGG
TGGCACAACAGGTGCATTCAGGAAATCATCTGATGGCCAGTGGGTCCATGCCTTCTGTGCGGAGTGGGTCTTTGAATCAACTTTCAGAAGAGGACAAGCAAATCCTGTGG
AAGGAATGGAGACGGTTTCAAAGGGGGTGGACTCTTGCTATATTTGTCACCGCAAACATGGTGTCTGTTTGAAGTGCAATTATGGTCATTGTCAGTCCACTTTTCATCCC
TCGTGCGCTAGAAGTGCTGGATGTTTCATGACTGTAAAGACTTCTGGTGGTAAGTTGCAGCACAAGGCATATTGTGAAAAGCATAGTTCAGAGCAGAGAGCAAAGGCTGA
GAACCAAACACATGGAATCGAGGAATTGAATAGAGTCAAGCAAATTAGGGTTGAACTAGAGAGGCTACGCCTCCTCTGCGATAGAATCATCAAACGTGAGAAGATTAAGA
GGGACCTGGTTCTTTGTTCACATGATGTTCTTGCTTTTAAAAGAGACCATGTAGCACGATCCGTACTAGTTCACACTCCTTTCTTCCTACCTGAAGTTAGTTCTGAATCA
GCTACCACATCCCTAAAGGGGCATGTGGAAGATTTAAAATCGTGTAGTGAAGCTGTGCAGAGATCAGATGATGTAACTGTGGACAGCACAGTTTCTATCAAGCACTGGAA
TAAAGTACCTTTGCCTTTGGATATCGAACAAAAGACTGAGGACGACAGTTCCACATCTCAGAACCCTTTCTCCCTAAAGTTTGCAGAGAGGGCTCAGTTTTCTGGGAAGC
AAATACCTCAGAGATCTTCGACCACTACATCACGTAATCTTGTAGATGTTGGAGGATTGAGGTTAAAGTCTAGAAAGCATGCCGAGACATTTCAGAAAGAGCTGGTTATG
ACGTCAGACCAAGCATCAATGAAGAACTCTCTTTTGCCTAAGCAGTACCTGTATGTTCCTGCTGATGTCCTTGCCAAGGAGAAGCAGGTCAACCAGGAAACAGATTACCG
TATCAGAGACCAAACATCCATATGCCTGAGGGATCCATATCCTGCTCTGAAACTGAGTCCGAGACCGAGCAACATGGTCGGTAGGGAAGAGCAGGGGGAATGCTGGGATG
GACTCCAGCGGGGCACCAGGCTCGCGATGGGCTCGATAGACGTCCAATCAACGAATGCCGATAATGGATTTGCCTCCGCTTCTGCTGATGGCCGTCAAAAGCCAGTTGCC
GCCGACGGTATGACCCAGACATCTAATGACCCAGACGGAACGGCGCCTGAGAAGCCTAAAGAATTGGGGGCCACGAGGAAGCGTATGTCCTCCATGCTTCCTCTGGAGGT
CGGGACCCGCGTCATGTGCCGCTGGAGGGATGGGAAGTACCATCCCGTAAAGGTCATCGAACGTCGGAAGTTGCAGATAACTGGTTTTAGTGATTACGAATATTATGTCC
ATTACACCGAGTTCAATAGGAGACTTGATGAGTGGGTGAAGCTTGACCAACTTGATCTTGATTCAGTTGAGACGGTTGTTGACGAGAAAGTGGAGGACAAGGTAACAGGA
TTGAAGATGACCCGACACCAGAAGCGGAAGATTGATGAAACGCACGTTGAGGGCCATGAGGAACTTGATGCTGCCAGCTTGAGGGAGCATGAAGAATTTACAAAAGTGAA
AAATATAGCAACGATTGAACTTGGAAGATATGAGATTGAGACATGGTATTTCTCCCCCTTTCCACCAGAATACAATGACACTACAAAGCTCTATTTTTGTGAGTTTTGTC
TCAATTTCATGAAGCGCAAAGAACAGCTCCAGAGACATATGAGGAAGTGTGATCTCAAGCATCCTCCTGGTGATGAAATATATCGAAGTGGTACTTTGTCAATGTTTGAG
GTTGATGGAAAAAAGAACAAGGTTTATGGGCAGAATCTCTGTTACTTAGCAAAGTTATTTCTGGACCACAAGACTCTATATTACGATGTTGACCTTTTTCTGTTTTATGT
CCTGTGTGAGTGTGATGATCGAGGGTGTCACATGGTTGGATATTTTTCAAAGGAAAAGCATTCAGAGGAATCCTATAATTTGGCATGCATTCTCACCCTTCCTCCTTATC
AAAGGAAGGGCTATGGAAAGTTTTTAATAGCATTTTCTTATGAACTCTCCAAGAAGGAAGGTAAAGTTGGCACACCTGAAAGACCTCTTTCTGACCTTGGGTTGGTGAGC
TACAGAGGATACTGGACCAGGGTTCTTTTAGACATCTTGAAAAAGCATAAGGGCAATATCTCTATCAAGGAGTTAAGTGACATGACAGCTATCAAAGCCGAAGATATTTT
GAACACACTTCAGAGCCTAGAGTTGATTCAATACAGGAAAGGGCAGCATGTTATATGTGCTGACCCCAAGGTACTGGATCGTCATCTCAAAGCTGCTGGTCGTGGGGGCC
TAGAGGTTGATGTTAGCAAATTGATCTGGACTCCATACAGAGAACAGGACGGTGCTTTGGAAGGTGGGTATGCTGAATCTGTAAGCTTTGTAATACCATCCCACCGGACC
TCCGCTTCCAAAGTTCGAAGCCGCTCTTCTTCTACTTCCACAACCGTTGGCAGCCGCCGTAGATATCGAGGAGGCGGCGGCGGAGGACCTGGTGGTGGCCGTGGCGGAGG
GGGAGGCCAAGAGATTTTTGCTCGAGAAGAATTTCTTCAGCTTGAGCCGATTCTCGGCGATAAAGGCTCGCCGGAAAAGGACTCCGCATGGTCTCGGTCGCATTTCGGCT
GCCTAACGTCTTTGATCGCCGGCTTCGAACATAGCATCGGCAAGAACATGTTTTTCTGGCGATGGTTTTGGTGGTTGAAGCGATTGTATCTGTCACCCATTCTTCTTCCT
GCTTCTTCAGACGTACTGAGAGAGAAGACAGCTTGGACCTTGATACACAACAATGGCGTCTCGAGAATCCTGCCCCCCAAGAGCTCCTCTGGTGCTGCTCTGTGCAGTAG
AGAGAGAGAGAGAGAGAGAGAGAGGGTTTATTGGGAAGGCAAATAG
mRNA sequenceShow/hide mRNA sequence
ATGACTAGAGACCGATGTCACCTGGGAAAGAAGATGATGGGTAGGGGTGCGGACGGAGGTTGTGGTACTGAGGAGAGGCCTTGTCCAGTGTCAAGAGTGCCGAACAGAAT
ACCATTGACCCAAACTCAAACACACGAGAAGCTGTCTACTGTGAACATTGATTATTACGCGCAGGCTCAGAAGGCTTTGTGTGAGCGCTCGCCATTTGATGTAGCGGAAG
AGACTTCAGCTTCTTGCGTGCCTACTTTGCCCAGTAGGCTGGGTAGCTTTTTGGGTAGACATTCGGATGGTAAAAAACGGCGGAAGAAATCCAATTCTGGGGCAGACAAG
AAGTCTTCTCGGCAAGGTGAAAGGTCGCGTGGTTCCAACATTTGGGTTGAGACAGAGGAATATTTTAGGGACTTGACGTTGTCTGATGTAGACACTTTGCGTGCAGCGTC
TTCCTTTAGTGGTTTAGTGGCTAGGAAATGTTTTTCAATTCCGTCTTCGGGAAATATTCCAGACACAAATGTAGGTGGCAGTGAGAATGTTAATAATGAGAATACGGATG
GGGCCATTGTTAAACACGAGGTTGAGGATGATCGTCTGTGCTCGGAGATGGGCTTGGTTGAAGCCAATGGTTCACCTCCGCAAGACAAGGGTTGTTTGAATCTGGATTGT
TCTCCCGGCATAGAATGGCTCTTAGGTTGTAGGAATAAGGTGTCTTTAACTTCGGAGCGGCCTTCAAAGAAACGGAAGCTTCTAGGTGGTGACGCAGGATTGGAAAAAGT
TCTGATTGCTTCTCCTTGCGAAGGAAATTCATCTTTATGTCACTTTTGCTCTAAGGGCGATACAGACAAAGGGCTAAATCCGTTAATTACTTGTAGTAGTTGTCATGTGG
TGGTTCACCATAATTGCTATGGTATAAGAGAGAAAGTTAATGGATCCTGGTCCTGTTCTTGGTGTAAGCAGAAGGATAAAACCAATGATTCCACGAAGCCTTGTTTGCTC
TGTCCCAAGCAGGGTGGTGCTCTGAAACCTGTACATAAGAATGTTGAGAGTGGATTTTCCATCGAATTCGCCCATTTGTTTTGCAGTCAGTGGATACCAGAGGTATATAT
TGAAGACTTGACGCAGATGGAGCCGGTTATGAATCTAGGAGGAACTTGCAGAACCTCCTTTCATCCAATTTGTGCAAGGGAGGCTAGACATAGAATGGAGGTCTGGGCAA
AATATGGGTGTGATAATGTTGAACTTCGAGCTTTCTGTTCCAAGCACTCAGAAGCACGAGATTTTAAGAGCAGTGGTCAATTTCAAGACCCTTCTGAAGCTGTTAACAGT
GACTCATATGTGGTCAACCACCTTCCAGTGACATTGTCAATTAACAGACCGCAGAAATTAGCTGGGAGAAGGAATGTTGACAATCTTTTAGTCTGTAAGGAAGCTTCGGA
TCCCAACTCTGGAAAATTGGGCGATGGTGAATTAGCTGATATTGGATCCACGGATCCCCGTTTGAATGCTGACTTTGTCGATACACACAAGTCTACTATCCAGGGGGTTG
AGGATGCCAACCCACTCGACTCCCTTAAATTTTCATCAATTATGAAAAAGTTGATTGATCAAGGAAAAGTCAATGTTAAAGATGTAGCCTCAGAAATTGGCATACCGCCT
GATTTATTAAGTGCAAAACTTACTGCTGACAACCTTGTCCCTGACTTGAAATCCAAAATTGTCAGATGGCTAAAAAATCATGCTTATATTGGTACCCTGCAGAAAAATTT
GAGAGTCAAACTTAAATCGGCTGTTTTATCCAAGGCTGTTGTTGGAACTGCTGATCGTTCTGACTCGTTATCAGTACCTGAATCTGACAATTCAGATCTTATTGCTGATA
AGTTGGTTACATCTCGAAGAAAAACTAAAAGCAATATAAGCCATTTGAAGAATGATGAAATCAAATGTTCATCTGAAGAAACTTTAGGTGGCTATGGTCTGGCGATGCAA
AGTGATACCTTGGATCAGCAAGCTAGTGAAGAACCAGGTGATTCAAATAAGGAATGTATTCAAGACGCTGGTGAGAAGCATTTAAACGAGCATGATAATTCTCGAGATTC
TTCTTCAAGGATTTTTCCAAATGATGTAGGCATTACAGTTAAGCCCTTAAATTGTAGCCTCGTAGAAGGTGATCATTTAGAGGCTGCCGTTTCTGGCCATAATTCTTCAA
TAAGGGCAGATCATGGAAAGACTGGAGAGTCCCCTAATTCAGATTTTCATCCATATGTCCAAGAGAAGATGTCACATATGTTGGATGGAAAGCTTCTAAATGTGTCTGAG
GGAGAGATGTCACGGTGGCGAGCATCTTCCAATGCTGGCGTTTGCTGTGATCACCAATACCAGCATTTAGAGTGCAATGACGTGAGCTGCAAATCTGGTGGATTTAATTC
AAAGCAGCTAGTCAATAAAAAAATAGGTGGAATTATAAAGTTGTCTCCAGAAGATGAAATTGAAGGAGAAATTATATTTTATCAGCATAGATTACTTGCAAATGCAGTTT
CAAGAAAGCGGTTCACTGATCATTTAATTTGTAATGTTGCTAAGAGTCTTCCAAAGGAGCTTGATGAATCAAGAAGTACTAGATGGGATGCTGTGCTTATTAATCAGTAT
TTTAGTGAGCTTAGAGAAGCAAAGAAACAGGGTAAGAAAGAGAGAAGACACAAGGAAGCACAGGCTGTACTAGCTGCTGCGACTGCTGCTGCTGCAGCCTCTTCTAGGAT
GTCATCATTCAGAAAAAATTTGATGCCTCGTGCCAAAGAAACGCTTACAAAGGTTGCTCTCCCAAAGACTTCGTTAGAGTCAGACTTCTGTAAAGAACATGCTAGATCCT
GTGACATCTGCAGGCGGCCAGAGACAATATTAAAGCCAATTTTAGTCTGCTCTAGTTGCAAGGTTGCAGTGCATCTGGATTGCTATCGTACTGTTAAAGAGTCCTCAGGT
CCATGGTGTTGTGAATTATGTGAAGAATTGTCATTATCAAGAGGCTCTGGAGCTCCGGTTGTCAATTTTTGGGAGAAATCATATTTTGTTGCAGAATGTGGTCTATGTGG
TGGCACAACAGGTGCATTCAGGAAATCATCTGATGGCCAGTGGGTCCATGCCTTCTGTGCGGAGTGGGTCTTTGAATCAACTTTCAGAAGAGGACAAGCAAATCCTGTGG
AAGGAATGGAGACGGTTTCAAAGGGGGTGGACTCTTGCTATATTTGTCACCGCAAACATGGTGTCTGTTTGAAGTGCAATTATGGTCATTGTCAGTCCACTTTTCATCCC
TCGTGCGCTAGAAGTGCTGGATGTTTCATGACTGTAAAGACTTCTGGTGGTAAGTTGCAGCACAAGGCATATTGTGAAAAGCATAGTTCAGAGCAGAGAGCAAAGGCTGA
GAACCAAACACATGGAATCGAGGAATTGAATAGAGTCAAGCAAATTAGGGTTGAACTAGAGAGGCTACGCCTCCTCTGCGATAGAATCATCAAACGTGAGAAGATTAAGA
GGGACCTGGTTCTTTGTTCACATGATGTTCTTGCTTTTAAAAGAGACCATGTAGCACGATCCGTACTAGTTCACACTCCTTTCTTCCTACCTGAAGTTAGTTCTGAATCA
GCTACCACATCCCTAAAGGGGCATGTGGAAGATTTAAAATCGTGTAGTGAAGCTGTGCAGAGATCAGATGATGTAACTGTGGACAGCACAGTTTCTATCAAGCACTGGAA
TAAAGTACCTTTGCCTTTGGATATCGAACAAAAGACTGAGGACGACAGTTCCACATCTCAGAACCCTTTCTCCCTAAAGTTTGCAGAGAGGGCTCAGTTTTCTGGGAAGC
AAATACCTCAGAGATCTTCGACCACTACATCACGTAATCTTGTAGATGTTGGAGGATTGAGGTTAAAGTCTAGAAAGCATGCCGAGACATTTCAGAAAGAGCTGGTTATG
ACGTCAGACCAAGCATCAATGAAGAACTCTCTTTTGCCTAAGCAGTACCTGTATGTTCCTGCTGATGTCCTTGCCAAGGAGAAGCAGGTCAACCAGGAAACAGATTACCG
TATCAGAGACCAAACATCCATATGCCTGAGGGATCCATATCCTGCTCTGAAACTGAGTCCGAGACCGAGCAACATGGTCGGTAGGGAAGAGCAGGGGGAATGCTGGGATG
GACTCCAGCGGGGCACCAGGCTCGCGATGGGCTCGATAGACGTCCAATCAACGAATGCCGATAATGGATTTGCCTCCGCTTCTGCTGATGGCCGTCAAAAGCCAGTTGCC
GCCGACGGTATGACCCAGACATCTAATGACCCAGACGGAACGGCGCCTGAGAAGCCTAAAGAATTGGGGGCCACGAGGAAGCGTATGTCCTCCATGCTTCCTCTGGAGGT
CGGGACCCGCGTCATGTGCCGCTGGAGGGATGGGAAGTACCATCCCGTAAAGGTCATCGAACGTCGGAAGTTGCAGATAACTGGTTTTAGTGATTACGAATATTATGTCC
ATTACACCGAGTTCAATAGGAGACTTGATGAGTGGGTGAAGCTTGACCAACTTGATCTTGATTCAGTTGAGACGGTTGTTGACGAGAAAGTGGAGGACAAGGTAACAGGA
TTGAAGATGACCCGACACCAGAAGCGGAAGATTGATGAAACGCACGTTGAGGGCCATGAGGAACTTGATGCTGCCAGCTTGAGGGAGCATGAAGAATTTACAAAAGTGAA
AAATATAGCAACGATTGAACTTGGAAGATATGAGATTGAGACATGGTATTTCTCCCCCTTTCCACCAGAATACAATGACACTACAAAGCTCTATTTTTGTGAGTTTTGTC
TCAATTTCATGAAGCGCAAAGAACAGCTCCAGAGACATATGAGGAAGTGTGATCTCAAGCATCCTCCTGGTGATGAAATATATCGAAGTGGTACTTTGTCAATGTTTGAG
GTTGATGGAAAAAAGAACAAGGTTTATGGGCAGAATCTCTGTTACTTAGCAAAGTTATTTCTGGACCACAAGACTCTATATTACGATGTTGACCTTTTTCTGTTTTATGT
CCTGTGTGAGTGTGATGATCGAGGGTGTCACATGGTTGGATATTTTTCAAAGGAAAAGCATTCAGAGGAATCCTATAATTTGGCATGCATTCTCACCCTTCCTCCTTATC
AAAGGAAGGGCTATGGAAAGTTTTTAATAGCATTTTCTTATGAACTCTCCAAGAAGGAAGGTAAAGTTGGCACACCTGAAAGACCTCTTTCTGACCTTGGGTTGGTGAGC
TACAGAGGATACTGGACCAGGGTTCTTTTAGACATCTTGAAAAAGCATAAGGGCAATATCTCTATCAAGGAGTTAAGTGACATGACAGCTATCAAAGCCGAAGATATTTT
GAACACACTTCAGAGCCTAGAGTTGATTCAATACAGGAAAGGGCAGCATGTTATATGTGCTGACCCCAAGGTACTGGATCGTCATCTCAAAGCTGCTGGTCGTGGGGGCC
TAGAGGTTGATGTTAGCAAATTGATCTGGACTCCATACAGAGAACAGGACGGTGCTTTGGAAGGTGGGTATGCTGAATCTGTAAGCTTTGTAATACCATCCCACCGGACC
TCCGCTTCCAAAGTTCGAAGCCGCTCTTCTTCTACTTCCACAACCGTTGGCAGCCGCCGTAGATATCGAGGAGGCGGCGGCGGAGGACCTGGTGGTGGCCGTGGCGGAGG
GGGAGGCCAAGAGATTTTTGCTCGAGAAGAATTTCTTCAGCTTGAGCCGATTCTCGGCGATAAAGGCTCGCCGGAAAAGGACTCCGCATGGTCTCGGTCGCATTTCGGCT
GCCTAACGTCTTTGATCGCCGGCTTCGAACATAGCATCGGCAAGAACATGTTTTTCTGGCGATGGTTTTGGTGGTTGAAGCGATTGTATCTGTCACCCATTCTTCTTCCT
GCTTCTTCAGACGTACTGAGAGAGAAGACAGCTTGGACCTTGATACACAACAATGGCGTCTCGAGAATCCTGCCCCCCAAGAGCTCCTCTGGTGCTGCTCTGTGCAGTAG
AGAGAGAGAGAGAGAGAGAGAGAGGGTTTATTGGGAAGGCAAATAG
Protein sequenceShow/hide protein sequence
MTRDRCHLGKKMMGRGADGGCGTEERPCPVSRVPNRIPLTQTQTHEKLSTVNIDYYAQAQKALCERSPFDVAEETSASCVPTLPSRLGSFLGRHSDGKKRRKKSNSGADK
KSSRQGERSRGSNIWVETEEYFRDLTLSDVDTLRAASSFSGLVARKCFSIPSSGNIPDTNVGGSENVNNENTDGAIVKHEVEDDRLCSEMGLVEANGSPPQDKGCLNLDC
SPGIEWLLGCRNKVSLTSERPSKKRKLLGGDAGLEKVLIASPCEGNSSLCHFCSKGDTDKGLNPLITCSSCHVVVHHNCYGIREKVNGSWSCSWCKQKDKTNDSTKPCLL
CPKQGGALKPVHKNVESGFSIEFAHLFCSQWIPEVYIEDLTQMEPVMNLGGTCRTSFHPICAREARHRMEVWAKYGCDNVELRAFCSKHSEARDFKSSGQFQDPSEAVNS
DSYVVNHLPVTLSINRPQKLAGRRNVDNLLVCKEASDPNSGKLGDGELADIGSTDPRLNADFVDTHKSTIQGVEDANPLDSLKFSSIMKKLIDQGKVNVKDVASEIGIPP
DLLSAKLTADNLVPDLKSKIVRWLKNHAYIGTLQKNLRVKLKSAVLSKAVVGTADRSDSLSVPESDNSDLIADKLVTSRRKTKSNISHLKNDEIKCSSEETLGGYGLAMQ
SDTLDQQASEEPGDSNKECIQDAGEKHLNEHDNSRDSSSRIFPNDVGITVKPLNCSLVEGDHLEAAVSGHNSSIRADHGKTGESPNSDFHPYVQEKMSHMLDGKLLNVSE
GEMSRWRASSNAGVCCDHQYQHLECNDVSCKSGGFNSKQLVNKKIGGIIKLSPEDEIEGEIIFYQHRLLANAVSRKRFTDHLICNVAKSLPKELDESRSTRWDAVLINQY
FSELREAKKQGKKERRHKEAQAVLAAATAAAAASSRMSSFRKNLMPRAKETLTKVALPKTSLESDFCKEHARSCDICRRPETILKPILVCSSCKVAVHLDCYRTVKESSG
PWCCELCEELSLSRGSGAPVVNFWEKSYFVAECGLCGGTTGAFRKSSDGQWVHAFCAEWVFESTFRRGQANPVEGMETVSKGVDSCYICHRKHGVCLKCNYGHCQSTFHP
SCARSAGCFMTVKTSGGKLQHKAYCEKHSSEQRAKAENQTHGIEELNRVKQIRVELERLRLLCDRIIKREKIKRDLVLCSHDVLAFKRDHVARSVLVHTPFFLPEVSSES
ATTSLKGHVEDLKSCSEAVQRSDDVTVDSTVSIKHWNKVPLPLDIEQKTEDDSSTSQNPFSLKFAERAQFSGKQIPQRSSTTTSRNLVDVGGLRLKSRKHAETFQKELVM
TSDQASMKNSLLPKQYLYVPADVLAKEKQVNQETDYRIRDQTSICLRDPYPALKLSPRPSNMVGREEQGECWDGLQRGTRLAMGSIDVQSTNADNGFASASADGRQKPVA
ADGMTQTSNDPDGTAPEKPKELGATRKRMSSMLPLEVGTRVMCRWRDGKYHPVKVIERRKLQITGFSDYEYYVHYTEFNRRLDEWVKLDQLDLDSVETVVDEKVEDKVTG
LKMTRHQKRKIDETHVEGHEELDAASLREHEEFTKVKNIATIELGRYEIETWYFSPFPPEYNDTTKLYFCEFCLNFMKRKEQLQRHMRKCDLKHPPGDEIYRSGTLSMFE
VDGKKNKVYGQNLCYLAKLFLDHKTLYYDVDLFLFYVLCECDDRGCHMVGYFSKEKHSEESYNLACILTLPPYQRKGYGKFLIAFSYELSKKEGKVGTPERPLSDLGLVS
YRGYWTRVLLDILKKHKGNISIKELSDMTAIKAEDILNTLQSLELIQYRKGQHVICADPKVLDRHLKAAGRGGLEVDVSKLIWTPYREQDGALEGGYAESVSFVIPSHRT
SASKVRSRSSSTSTTVGSRRRYRGGGGGGPGGGRGGGGGQEIFAREEFLQLEPILGDKGSPEKDSAWSRSHFGCLTSLIAGFEHSIGKNMFFWRWFWWLKRLYLSPILLP
ASSDVLREKTAWTLIHNNGVSRILPPKSSSGAALCSRERERERERVYWEGK