; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh04G006660 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh04G006660
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionNHL domain-containing protein
Genome locationCma_Chr04:3396771..3403990
RNA-Seq ExpressionCmaCh04G006660
SyntenyCmaCh04G006660
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001258 - NHL repeat
IPR011042 - Six-bladed beta-propeller, TolB-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600498.1 hypothetical protein SDJN03_05731, partial [Cucurbita argyrosperma subsp. sororia]2.9e-24896.9Show/hide
Query:  GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
        GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVE NEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
Subjt:  GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK

Query:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD
        PNDARFNHPKGVTVDD+GNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLN EDCEYQD
Subjt:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD

Query:  SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN
        SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEEST  VDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR+
Subjt:  SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN

Query:  IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE
         PKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNA+DQ PEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE
Subjt:  IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE

Query:  RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS
        RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNK   ++S
Subjt:  RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS

XP_022942820.1 uncharacterized protein LOC111447735 isoform X2 [Cucurbita moschata]2.0e-24997.12Show/hide
Query:  GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
        GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVE NEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
Subjt:  GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK

Query:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD
        PNDARFNHPKGVTVDD+GNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD
Subjt:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD

Query:  SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN
        SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEEST  VDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR+
Subjt:  SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN

Query:  IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE
         PKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNA+DQ PEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE
Subjt:  IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE

Query:  RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS
        RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNK   ++S
Subjt:  RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS

XP_022980612.1 uncharacterized protein LOC111479928 isoform X1 [Cucurbita maxima]1.5e-25298.67Show/hide
Query:  GPLIKHLSSIVKWTRSSYKAPPS-DGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDG
        GPLIKHLSSIVKWTRSSYKAPPS DGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDG
Subjt:  GPLIKHLSSIVKWTRSSYKAPPS-DGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDG

Query:  KPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQ
        KPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQ
Subjt:  KPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQ

Query:  DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR
        DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR
Subjt:  DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR

Query:  NIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQK
        NIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQK
Subjt:  NIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQK

Query:  ERPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS
        ERPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNK   ++S
Subjt:  ERPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS

XP_022980631.1 uncharacterized protein LOC111479928 isoform X2 [Cucurbita maxima]6.0e-25498.89Show/hide
Query:  GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
        GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
Subjt:  GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK

Query:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD
        PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD
Subjt:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD

Query:  SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN
        SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN
Subjt:  SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN

Query:  IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE
        IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE
Subjt:  IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE

Query:  RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS
        RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNK   ++S
Subjt:  RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS

XP_023547910.1 uncharacterized protein LOC111806707 isoform X2 [Cucurbita pepo subsp. pepo]1.3e-24896.67Show/hide
Query:  GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
        GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETV+E NEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
Subjt:  GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK

Query:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD
        PNDARFNHPKGVTVDD+GNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD
Subjt:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD

Query:  SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN
        SSISNSDVLMIIGAVLAGYATYMLQQGFGASS+SQTYSPLETEYREKPNKEEST  VDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR+
Subjt:  SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN

Query:  IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE
         PKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNA+DQ PEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE
Subjt:  IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE

Query:  RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS
        RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNK   ++S
Subjt:  RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS

TrEMBL top hitse value%identityAlignment
A0A1S3BT05 uncharacterized protein LOC103492945 isoform X21.8e-21183.51Show/hide
Query:  LTTHLQKMA-EIGPLIKHLSSIVKWTRSSYK---APPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARL
        LT  +Q  A   GPLIKHLSS+VKWTRSSYK   APP DG+VLQFENGYLVETVVE NEIGVLPHKIHVSK+GELFVVDSVNSNIVKI+PPLSKYTRARL
Subjt:  LTTHLQKMA-EIGPLIKHLSSIVKWTRSSYK---APPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARL

Query:  VAGSFQSHTGHVDGKPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAA
        VAGSFQSHTGH+DGKPNDARFNHP+GVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNV GYRDGPGEDAKFSNDFDVMY+RSTCSLLVIDRGNAA
Subjt:  VAGSFQSHTGHVDGKPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAA

Query:  IRQIFLNPEDCEYQDSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVA
        IRQI LN EDCEYQDSSISNSDVLMIIGAVLAGYATYMLQQGFG S+ SQTY PLETEY EKP KE S+  +DSV+E PGWPSFGRLIIDLSKLALEAVA
Subjt:  IRQIFLNPEDCEYQDSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVA

Query:  SIFLSFVPARFRARNIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHR-SKRHDRADFYE
        SIFLSFVPARFRARN  KGLTPLKDSL MPEDEP +P  QMQR PVPLTETRQAH  +  D  PEV MKP KL +SSFKDPSLQSKHR SKR + ADFY 
Subjt:  SIFLSFVPARFRARNIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHR-SKRHDRADFYE

Query:  SGEIPPPYSRSKSQKERPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS
        SGEI PPYSRSKSQKERPRHRQREKSAEI +GA G EPKP+EMK  +YDN   EHYNIRNK   + S
Subjt:  SGEIPPPYSRSKSQKERPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS

A0A6J1FSF5 uncharacterized protein LOC111447735 isoform X12.4e-24896.9Show/hide
Query:  GPLIKHLSSIVKWTRSSYKAPPS-DGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDG
        GPLIKHLSSIVKWTRSSYKAPPS DGNVLQFENGYLVETVVE NEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDG
Subjt:  GPLIKHLSSIVKWTRSSYKAPPS-DGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDG

Query:  KPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQ
        KPNDARFNHPKGVTVDD+GNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQ
Subjt:  KPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQ

Query:  DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR
        DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEEST  VDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR
Subjt:  DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR

Query:  NIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQK
        + PKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNA+DQ PEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQK
Subjt:  NIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQK

Query:  ERPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS
        ERPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNK   ++S
Subjt:  ERPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS

A0A6J1FWZ8 uncharacterized protein LOC111447735 isoform X29.6e-25097.12Show/hide
Query:  GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
        GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVE NEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
Subjt:  GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK

Query:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD
        PNDARFNHPKGVTVDD+GNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD
Subjt:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD

Query:  SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN
        SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEEST  VDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR+
Subjt:  SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN

Query:  IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE
         PKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNA+DQ PEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE
Subjt:  IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE

Query:  RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS
        RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNK   ++S
Subjt:  RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS

A0A6J1IRS7 uncharacterized protein LOC111479928 isoform X17.1e-25398.67Show/hide
Query:  GPLIKHLSSIVKWTRSSYKAPPS-DGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDG
        GPLIKHLSSIVKWTRSSYKAPPS DGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDG
Subjt:  GPLIKHLSSIVKWTRSSYKAPPS-DGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDG

Query:  KPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQ
        KPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQ
Subjt:  KPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQ

Query:  DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR
        DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR
Subjt:  DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRAR

Query:  NIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQK
        NIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQK
Subjt:  NIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQK

Query:  ERPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS
        ERPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNK   ++S
Subjt:  ERPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS

A0A6J1IU41 uncharacterized protein LOC111479928 isoform X22.9e-25498.89Show/hide
Query:  GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
        GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
Subjt:  GPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK

Query:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD
        PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD
Subjt:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQD

Query:  SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN
        SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN
Subjt:  SSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARN

Query:  IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE
        IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE
Subjt:  IPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKE

Query:  RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS
        RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNK   ++S
Subjt:  RPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSS

SwissProt top hitse value%identityAlignment
Q8VZ10 Protein SUPPRESSOR OF QUENCHING 1, chloroplastic3.7e-0431.08Show/hide
Query:  ELFVVDSVNSNIVKITPPLSKYTRARLVAGS---FQSHT---GHVDGKPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDA--GVTTIAGGKSNVAG
        E ++ DS +S+I  +     +   +RL+AG    F  +    G  DG   +    HP GV   + G +Y+ D+ N  I+K+      V T+AG  +  AG
Subjt:  ELFVVDSVNSNIVKITPPLSKYTRARLVAGS---FQSHT---GHVDGKPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDA--GVTTIAGGKSNVAG

Query:  YRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNP-EDCE
        ++DG  + A+ S     + I     L V D  N+ IR I LN  ED E
Subjt:  YRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNP-EDCE

Arabidopsis top hitse value%identityAlignment
AT1G23880.1 NHL domain-containing protein1.1e-5951.98Show/hide
Query:  HLSSIVKWT-----RSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK
        H +S++KW      +++ K      ++++FENGY VETV++ +++G+ P+ I V   GEL ++DS NSNI +I+  LS Y+R RLV GS + + GHVDG+
Subjt:  HLSSIVKWT-----RSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGK

Query:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKS-NVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQ
          DAR N+PKG+TVDD+GN+YVADT+N AIRKI +AGVTTIAGGK     G+ DGP EDAKFSNDFDV+Y+ S+CSLLVIDRGN AIR+I L+ +DC  Q
Subjt:  PNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKS-NVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQ

Query:  DSSISNSDVLMIIGAVLAGYATYMLQQ
          S     + +++ AV  GY   +LQ+
Subjt:  DSSISNSDVLMIIGAVLAGYATYMLQQ

AT1G70280.1 NHL domain-containing protein3.5e-5837.95Show/hide
Query:  VLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGKPNDARFNHPKGVTVDDKGNVYVADTL
        +++FENGY VETV + +++G+ P+ I V   GEL ++DS NSNI KI+  LS Y+R RLV GS + + GHVDG+  DA+ NHPKG+TVDD+GN+YVADT+
Subjt:  VLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGKPNDARFNHPKGVTVDDKGNVYVADTL

Query:  NLAIRKIGDAGVTTIAGGKS-NVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQDSSISNSDVLMIIGAVLAGYATYMLQ
        N AIRKI + GVTTIAGGK+    G+ DGP EDAKFSNDFDV+Y+ S+CSLLVIDRGN AIR+I L+ +DC YQ  S     + +++ A   GY   +LQ
Subjt:  NLAIRKIGDAGVTTIAGGKS-NVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQDSSISNSDVLMIIGAVLAGYATYMLQ

Query:  QGFGASSMSQTYSPL-ETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARNIPKGLTPLKDSLWMPEDEPDRPPA
        +  G+   S     + E +  +KP K      + +  E          ++ L KL   A  S+ +  +  +    +  +     K S            A
Subjt:  QGFGASSMSQTYSPL-ETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARNIPKGLTPLKDSLWMPEDEPDRPPA

Query:  QMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKERPRHRQREKSAEIH
             P P+ E+    I + +   P     P   K+ +F     + K +  R  RA FY S +   P  + + QK+  +H+ +++  + H
Subjt:  QMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKERPRHRQREKSAEIH

AT1G70280.2 NHL domain-containing protein9.7e-6136.99Show/hide
Query:  GPLIKHLSSIVKW---TRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHV
        G +  H SS++KW    +++ K   +  ++++FENGY VETV + +++G+ P+ I V   GEL ++DS NSNI KI+  LS Y+R RLV GS + + GHV
Subjt:  GPLIKHLSSIVKW---TRSSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHV

Query:  DGKPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKS-NVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDC
        DG+  DA+ NHPKG+TVDD+GN+YVADT+N AIRKI + GVTTIAGGK+    G+ DGP EDAKFSNDFDV+Y+ S+CSLLVIDRGN AIR+I L+ +DC
Subjt:  DGKPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKS-NVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDC

Query:  EYQDSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPL-ETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPAR
         YQ  S     + +++ A   GY   +LQ+  G+   S     + E +  +KP K      + +  E          ++ L KL   A  S+ +  +  +
Subjt:  EYQDSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPL-ETEYREKPNKEESTLTVDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPAR

Query:  FRARNIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRS
            +  +     K S            A     P P+ E+    I + +   P     P   K+ +F     + K +  R  RA FY S +   P  + 
Subjt:  FRARNIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSSFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRS

Query:  KSQKERPRHRQREKSAEIH
        + QK+  +H+ +++  + H
Subjt:  KSQKERPRHRQREKSAEIH

AT3G14860.1 NHL domain-containing protein5.6e-14161.74Show/hide
Query:  AEIGPLIKHLSSIVKWTR-SSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGH
        A  G LIKH+SS++KWT  SS K   SD NVLQFENGYLVETVVE N+IGV+P+KI VS +GEL+ VD +NSNI+KITPPLS+Y+R RLVAGSFQ  TGH
Subjt:  AEIGPLIKHLSSIVKWTR-SSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGH

Query:  VDGKPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDC
         DGKP++ARFNHP+GVT+DDKGNVYVADTLNLAIRKIGD+GVTTIAGGKSN+AGYRDGP EDAKFSNDFDV+Y+R TCSLLVIDRGNAA+RQI L+ EDC
Subjt:  VDGKPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDC

Query:  EYQ-DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREK-PNKEESTLTVDS---VRETPGWPSFGRLIIDLSKLALEAVASIFLSF
        +YQ DSSIS +D+L++IGAVL GYAT MLQQGFG S  S+T    ET Y E+ P KE+ +  V      +E PGWPSFG+L+ DL KLALE + S     
Subjt:  EYQ-DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREK-PNKEESTLTVDS---VRETPGWPSFGRLIIDLSKLALEAVASIFLSF

Query:  VPARFRARNIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSS-FKDPSLQSK--HR--SKRHDRADFYESG
        VPARF+       L PLKD L MPEDE + P  Q   AP P++E+R AH+  A+D  PE   K PKL+SSS  KDP+L S   HR  SKR D A FY SG
Subjt:  VPARFRARNIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSS-FKDPSLQSK--HR--SKRHDRADFYESG

Query:  EIPPPYSRSKSQKERPRHRQREKSAEIHHGAAGPEPKPME-MKPVNYDNSN-LEHYNIRN
        E+  P    K  KER R R R+K+ E       P+P P + +KPV Y NS+  +HYN+R+
Subjt:  EIPPPYSRSKSQKERPRHRQREKSAEIHHGAAGPEPKPME-MKPVNYDNSN-LEHYNIRN

AT3G14860.2 NHL domain-containing protein8.6e-14261.74Show/hide
Query:  AEIGPLIKHLSSIVKWTR-SSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGH
        A  G LIKH+SS++KWT  SS K   SD NVLQFENGYLVETVVE N+IGV+P+KI VS +GEL+ VD +NSNI+KITPPLS+Y+R RLVAGSFQ  TGH
Subjt:  AEIGPLIKHLSSIVKWTR-SSYKAPPSDGNVLQFENGYLVETVVEANEIGVLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGH

Query:  VDGKPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDC
         DGKP++ARFNHP+GVT+DDKGNVYVADTLNLAIRKIGD+GVTTIAGGKSN+AGYRDGP EDAKFSNDFDV+Y+R TCSLLVIDRGNAA+RQI L+ EDC
Subjt:  VDGKPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGYRDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDC

Query:  EYQ-DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREK-PNKEESTLTVDS---VRETPGWPSFGRLIIDLSKLALEAVASIFLSF
        +YQ DSSIS +D+L++IGAVL GYAT MLQQGFG S  S+T    ET Y E+ P KE+ +  V      +E PGWPSFG+L+ DL KLALE + S     
Subjt:  EYQ-DSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREK-PNKEESTLTVDS---VRETPGWPSFGRLIIDLSKLALEAVASIFLSF

Query:  VPARFRARNIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSS-FKDPSLQSK--HR--SKRHDRADFYESG
        VPARF+       L PLKD L MPEDE + P  Q   AP P++E+R AH+  A+D  PE   K PKL+SSS  KDP+L S   HR  SKR D A FY SG
Subjt:  VPARFRARNIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSSS-FKDPSLQSK--HR--SKRHDRADFYESG

Query:  EIPPPYSRSKSQKERPRHRQREKSAEIHHGAAGPEPKPME-MKPVNYDNSN-LEHYNIRN
        E+  P    K  KER R R R+K+ E       P+P P + +KPV Y NS+  +HYN+R+
Subjt:  EIPPPYSRSKSQKERPRHRQREKSAEIHHGAAGPEPKPME-MKPVNYDNSN-LEHYNIRN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGCTCAAGGAGATCAAGGAAGGCGGAGATATAGAAACAGACCCTGCCGCTGCGCGAAGAAGAGTCAAACGCGACGGAAAATCGCGCGGATCGATTGAATTA
GCCGAAGAGCCGAATCAAAACTCCATAGCTACTGATTCTCTCACTACCCATCTGCAGAAAATGGCAGAAATCGGACCATTGATTAAACACTTGTCTTCTATTGTC
AAATGGACTAGGTCTTCATATAAAGCCCCCCCATCAGATGGTAATGTCCTGCAATTTGAGAATGGGTACTTGGTTGAGACTGTAGTGGAGGCAAATGAAATTGGA
GTTCTTCCACATAAGATCCATGTCTCAAAGGAGGGCGAGCTTTTCGTCGTTGATTCGGTTAATAGCAATATTGTGAAGATCACCCCTCCATTATCCAAATATACT
CGAGCAAGATTGGTTGCTGGGTCATTTCAAAGCCACACGGGGCATGTTGATGGAAAACCAAACGACGCCCGTTTTAATCATCCAAAGGGTGTAACCGTGGACGAT
AAAGGGAATGTGTATGTTGCTGATACCTTGAATTTGGCCATCAGAAAGATTGGAGATGCTGGTGTGACAACCATTGCAGGGGGCAAGTCAAATGTCGCAGGCTAC
AGAGATGGACCAGGTGAAGATGCGAAGTTCTCAAACGATTTTGATGTAATGTACATCCGTTCTACTTGTTCCTTGTTAGTTATTGACCGTGGAAATGCTGCGATT
CGGCAAATATTTCTAAATCCGGAGGATTGTGAATATCAAGATAGTTCAATTTCCAACAGTGATGTTCTCATGATCATTGGTGCTGTTTTGGCGGGATACGCAACG
TATATGCTTCAACAGGGTTTTGGAGCCTCGAGCATGTCTCAGACATATTCTCCATTAGAGACTGAGTATAGGGAAAAACCAAACAAGGAAGAATCGACCTTGACT
GTGGATAGTGTAAGGGAGACACCAGGATGGCCATCATTTGGACGACTCATCATTGATCTCTCCAAACTGGCTCTTGAAGCCGTGGCTAGCATTTTCCTTTCTTTT
GTTCCTGCCCGTTTCAGAGCTCGTAACATCCCGAAAGGCCTGACCCCGTTGAAAGATTCTCTCTGGATGCCTGAAGATGAACCCGATCGACCACCAGCTCAAATG
CAGAGAGCTCCTGTTCCTCTGACTGAAACAAGACAAGCTCATATAGGCAATGCAAATGATCAAGTTCCTGAGGTGATGATGAAGCCTCCAAAGCTCAAATCAAGT
AGTTTCAAAGATCCCTCACTGCAAAGTAAGCACCGATCGAAACGTCACGACCGTGCTGATTTTTACGAATCTGGTGAGATACCTCCACCTTATAGCAGGTCCAAG
AGCCAAAAAGAAAGACCACGACACCGCCAGCGCGAGAAAAGTGCAGAGATCCATCACGGAGCTGCAGGACCTGAGCCAAAGCCCATGGAGATGAAGCCAGTCAAT
TACGACAATTCAAACTTGGAACATTACAACATCAGGAATAAAGCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTTCCTCCTCCTCCTCCTCCTCCTCCTCTACAAA
AAGCTTATTCTTCGGTTCTGCAAAATTCTTCCATTCCAACTCTCCACCGAGTTTCTGCTTTTCTCTGGCTTCGTCTTTGTATCTGTCTACTTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGACGCTCAAGGAGATCAAGGAAGGCGGAGATATAGAAACAGACCCTGCCGCTGCGCGAAGAAGAGTCAAACGCGACGGAAAATCGCGCGGATCGATTGAATTA
GCCGAAGAGCCGAATCAAAACTCCATAGCTACTGATTCTCTCACTACCCATCTGCAGAAAATGGCAGAAATCGGACCATTGATTAAACACTTGTCTTCTATTGTC
AAATGGACTAGGTCTTCATATAAAGCCCCCCCATCAGATGGTAATGTCCTGCAATTTGAGAATGGGTACTTGGTTGAGACTGTAGTGGAGGCAAATGAAATTGGA
GTTCTTCCACATAAGATCCATGTCTCAAAGGAGGGCGAGCTTTTCGTCGTTGATTCGGTTAATAGCAATATTGTGAAGATCACCCCTCCATTATCCAAATATACT
CGAGCAAGATTGGTTGCTGGGTCATTTCAAAGCCACACGGGGCATGTTGATGGAAAACCAAACGACGCCCGTTTTAATCATCCAAAGGGTGTAACCGTGGACGAT
AAAGGGAATGTGTATGTTGCTGATACCTTGAATTTGGCCATCAGAAAGATTGGAGATGCTGGTGTGACAACCATTGCAGGGGGCAAGTCAAATGTCGCAGGCTAC
AGAGATGGACCAGGTGAAGATGCGAAGTTCTCAAACGATTTTGATGTAATGTACATCCGTTCTACTTGTTCCTTGTTAGTTATTGACCGTGGAAATGCTGCGATT
CGGCAAATATTTCTAAATCCGGAGGATTGTGAATATCAAGATAGTTCAATTTCCAACAGTGATGTTCTCATGATCATTGGTGCTGTTTTGGCGGGATACGCAACG
TATATGCTTCAACAGGGTTTTGGAGCCTCGAGCATGTCTCAGACATATTCTCCATTAGAGACTGAGTATAGGGAAAAACCAAACAAGGAAGAATCGACCTTGACT
GTGGATAGTGTAAGGGAGACACCAGGATGGCCATCATTTGGACGACTCATCATTGATCTCTCCAAACTGGCTCTTGAAGCCGTGGCTAGCATTTTCCTTTCTTTT
GTTCCTGCCCGTTTCAGAGCTCGTAACATCCCGAAAGGCCTGACCCCGTTGAAAGATTCTCTCTGGATGCCTGAAGATGAACCCGATCGACCACCAGCTCAAATG
CAGAGAGCTCCTGTTCCTCTGACTGAAACAAGACAAGCTCATATAGGCAATGCAAATGATCAAGTTCCTGAGGTGATGATGAAGCCTCCAAAGCTCAAATCAAGT
AGTTTCAAAGATCCCTCACTGCAAAGTAAGCACCGATCGAAACGTCACGACCGTGCTGATTTTTACGAATCTGGTGAGATACCTCCACCTTATAGCAGGTCCAAG
AGCCAAAAAGAAAGACCACGACACCGCCAGCGCGAGAAAAGTGCAGAGATCCATCACGGAGCTGCAGGACCTGAGCCAAAGCCCATGGAGATGAAGCCAGTCAAT
TACGACAATTCAAACTTGGAACATTACAACATCAGGAATAAAGCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTTCCTCCTCCTCCTCCTCCTCCTCCTCTACAAA
AAGCTTATTCTTCGGTTCTGCAAAATTCTTCCATTCCAACTCTCCACCGAGTTTCTGCTTTTCTCTGGCTTCGTCTTTGTATCTGTCTACTTGTAG
Protein sequenceShow/hide protein sequence
MTLKEIKEGGDIETDPAAARRRVKRDGKSRGSIELAEEPNQNSIATDSLTTHLQKMAEIGPLIKHLSSIVKWTRSSYKAPPSDGNVLQFENGYLVETVVEANEIG
VLPHKIHVSKEGELFVVDSVNSNIVKITPPLSKYTRARLVAGSFQSHTGHVDGKPNDARFNHPKGVTVDDKGNVYVADTLNLAIRKIGDAGVTTIAGGKSNVAGY
RDGPGEDAKFSNDFDVMYIRSTCSLLVIDRGNAAIRQIFLNPEDCEYQDSSISNSDVLMIIGAVLAGYATYMLQQGFGASSMSQTYSPLETEYREKPNKEESTLT
VDSVRETPGWPSFGRLIIDLSKLALEAVASIFLSFVPARFRARNIPKGLTPLKDSLWMPEDEPDRPPAQMQRAPVPLTETRQAHIGNANDQVPEVMMKPPKLKSS
SFKDPSLQSKHRSKRHDRADFYESGEIPPPYSRSKSQKERPRHRQREKSAEIHHGAAGPEPKPMEMKPVNYDNSNLEHYNIRNKASSSSSSSSSFLLLLLLLLYK
KLILRFCKILPFQLSTEFLLFSGFVFVSVYL