; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g1751 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g1751
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionNHL domain-containing protein
Genome locationMC06:24802774..24811354
RNA-Seq ExpressionMC06g1751
SyntenyMC06g1751
Gene Ontology termsNA
InterPro domainsIPR011042 - Six-bladed beta-propeller, TolB-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571580.1 Protein SUPPRESSOR OF QUENCHING 1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.077.84Show/hide
Query:  MAFRVRRLKEISRHLSRI-SGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTY
        MAFR RRL+EIS+ L +  SGY HQHH   AVSSL  +V+P + SEG+ +RI++ GRH LRFSTT ELQCESS AN++LSFIKSTLDESEGPNH WLN  
Subjt:  MAFRVRRLKEISRHLSRI-SGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTY

Query:  DGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRN
        DG KG+SEKDGI+LILADQFL M SS+SV LVENVKFLQHRFPQLHVIG QCS++LS AEK++MIQFIM+EYVSFPILLS K  E+ RGLCYIISK+F N
Subjt:  DGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRN

Query:  PLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS
        PLL+ ER+ D   +RKAIEEL E E+EK   PN GRTTYLK AEI  EPYSCSFMQNF+LHFP CISADEKGGRLFLSDSNHNRI+IFN NGKILD+IGS
Subjt:  PLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS

Query:  YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRY
        YPGF+DGEFELVKLARPAASFYHATQ+CLYFVDSENHAIRKADL KRVVETLYP NYSS KSTQLWSWI D+ GLGS  +RE+EDFNPQSLMFPWH+IRY
Subjt:  YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRY

Query:  VDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKY
        +DDRLLILNRSL TLW MDL SGKIIEVV+G S IME YGQL  D +SV+KQIP G LQ    A  +TG LPYLDLLSS TPFQNC+IICDSVGQVI+KY
Subjt:  VDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKY

Query:  HRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPS
        +R SGESSS QFSNFGVLGLPYWFA PPEKVI+TADSF+GAGIDH+ FFRLLPGKVGI INVDLP DIELVES+QEDSIWRQ RGTATE  IVE V+G S
Subjt:  HRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPS

Query:  EKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKDAE---RIAGFLYPRNGGKIRK
        EKVGSAQQWYDELDSLAFSP +SE+VED  R  NHIGD+K QIECAVNTSPGTSEVIVYAA+YLR RR QDWEGN +K A    RIA  LYP + GK  K
Subjt:  EKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKDAE---RIAGFLYPRNGGKIRK

Query:  ECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
        E CIQFL+  KR LRE+IFVKPLHVRIKLD+L HPKADNSKGIILTDSSVE+NLSLAS
Subjt:  ECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS

XP_008465459.1 PREDICTED: uncharacterized protein LOC103503064 isoform X1 [Cucumis melo]0.079.07Show/hide
Query:  MAFRVRRLKEISRHLSRI-SGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTY
        MAFR RRLKEISR L +I SGY HQHH    VSSL L+VAPFH SEGI++R+ D+GRHF RFSTTTELQCESS  N+I SFI STLDESEGPNH WLNT 
Subjt:  MAFRVRRLKEISRHLSRI-SGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTY

Query:  DGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEVRGLCYIISKNFRNP
        +GNKG+ E+DG++LILA+QFL M SS+S+ LVENVKFLQ RFP LHVIGFQC S+LS AEK+ MIQFIM+EY+SFPILLS K FEV G C IISK+  NP
Subjt:  DGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEVRGLCYIISKNFRNP

Query:  LLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSY
        LL+ ER+MD   + KAIEEL E E+EK    N G+TTYLKQAE+  EP SCSFM NFLLH+PGCISADE+GGRLFLSDSNHNRI+I NS GKILD+IGSY
Subjt:  LLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSY

Query:  PGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYV
        PGF+DGEFELVKLARPAASFYH+TQNCLYFVDSENHAIRKADL KRVVETLYPENYS+KKSTQLWSWIMDKFGLGS  +RE+EDFNPQSLMFPWH+IRY+
Subjt:  PGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYV

Query:  DDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYH
        DDRLLILNRSL TLW MDLVSGKIIEVVRGLS IME YG LI D++SV+KQIP G LQ+ S ANI TGG PY+DLLSS TPF+NCIIICDSVGQV+LKY+
Subjt:  DDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYH

Query:  RNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSE
        + SG  SS QFSNFGVLGLPYWFA PPEKVITTA+ FRGAGIDHLQFFRLLPG+VGIQINVDLP+DIELVESL +DSIWRQARGTATE  IVE VAGPSE
Subjt:  RNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSE

Query:  KVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKD-AERIAGFLYPRNGGKIRKECC
        KVGSAQQWYDELDSLAFSP +SEMVEDN R  N+IGDNKV IECAVNTSPGTSEVIVYAALYLRLRRNQD+EGN +K  A RIA FLY R  GKI KE C
Subjt:  KVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKD-AERIAGFLYPRNGGKIRKECC

Query:  IQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
        IQFL+  KR LRELIFVKPLHVRIKLDSL HPKA+NSK IILT SSVEVN+SL+S
Subjt:  IQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS

XP_022159641.1 uncharacterized protein LOC111025991 isoform X1 [Momordica charantia]0.099.73Show/hide
Query:  MAFRVRRLKEISRHLSRISGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYD
        MAFRVRRLKEISRHLSRISGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYD
Subjt:  MAFRVRRLKEISRHLSRISGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYD

Query:  GNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRNP
        GNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFE+ RGLCYIISKNFRNP
Subjt:  GNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRNP

Query:  LLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSY
        LLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSY
Subjt:  LLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSY

Query:  PGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYV
        PGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYV
Subjt:  PGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYV

Query:  DDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYH
        DDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYH
Subjt:  DDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYH

Query:  RNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSE
        RNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSE
Subjt:  RNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSE

Query:  KVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCI
        KVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCI
Subjt:  KVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCI

Query:  QFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
        QFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
Subjt:  QFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS

XP_022159642.1 uncharacterized protein LOC111025991 isoform X2 [Momordica charantia]0.099.57Show/hide
Query:  FLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYDGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSA
        F RFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYDGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSA
Subjt:  FLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYDGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSA

Query:  AEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRNPLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNF
        AEKTDMIQFIMKEYVSFPILLSKKDFE+ RGLCYIISKNFRNPLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNF
Subjt:  AEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRNPLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNF

Query:  LLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYS
        LLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYS
Subjt:  LLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYS

Query:  SKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYVDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWL
        SKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYVDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWL
Subjt:  SKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYVDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWL

Query:  QRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYHRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGI
        QRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYHRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGI
Subjt:  QRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYHRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGI

Query:  QINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSEKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIV
        QINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSEKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIV
Subjt:  QINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSEKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIV

Query:  YAALYLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
        YAALYLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
Subjt:  YAALYLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS

XP_038888990.1 uncharacterized protein LOC120078755 isoform X1 [Benincasa hispida]0.080.69Show/hide
Query:  MAFRVRRLKEISRHLSRI-SGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTY
        MAFR RRLKEI R L +I SGY HQHH   AVSSLAL+V+P H SEGI++R++DDGRHFLRFSTTT LQ ESS AN+I SFIKSTLDESEGPNH WLNT 
Subjt:  MAFRVRRLKEISRHLSRI-SGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTY

Query:  DGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRN
        +GNKG+ EKDG++LILADQFL M S++SV LVENVKFLQ RFP LHVIGFQCSS+LSAAEK+DMIQFIM+EY+SFPILLS K FEV  GLCYIISK+  N
Subjt:  DGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRN

Query:  PLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS
        PLL+  R MD   +RKAIEEL E E+EK   P+ GRTTY+KQAEI  EP SCSFMQNFLLHFPGCISADE+G RLFLSDSNHNRI+IFNS+GKILD+IGS
Subjt:  PLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS

Query:  YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRY
        YPGF+DGEFELVKLARPAASFYH+TQNCLYFVDSENHAIRKADL KRVVETLYPENYS+K STQLWSWIMDKFG+GS  +RE+EDFNPQSLMFPWH+IRY
Subjt:  YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRY

Query:  VDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKY
        +DDRLLIL+RSL TLW MDL SGKIIE+VRGLS IME YGQLI D++SV+KQIP G LQ  + ANI TGGLPYLDLLSS TPFQNCIIICDSVGQV+LKY
Subjt:  VDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKY

Query:  HRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPS
        +R SGESSSFQFSNFGVLGLPYW A PPEKVI  AD+F+GA IDHLQFFRLLPGKVGIQINVDLP DIELVESL EDSIWRQARGTATE  IVE+VA PS
Subjt:  HRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPS

Query:  EKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKD-AERIAGFLYPRNGGKIRKEC
        EKVGSAQQWYDELDSLAFSP +SEMVEDN R  N+IGDNKV IECAVNTSPGTSEVIVYAALYLRLRRNQD EGNE+K  A RIA FLYP N GK+ KE 
Subjt:  EKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKD-AERIAGFLYPRNGGKIRKEC

Query:  CIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
        CI+FL+  KR LRELIFVKPLHVRIKLDSL HPKA+NSKGIILTDSSVEVN+SLAS
Subjt:  CIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS

TrEMBL top hitse value%identityAlignment
A0A1S3CQE6 uncharacterized protein LOC103503064 isoform X10.079.07Show/hide
Query:  MAFRVRRLKEISRHLSRI-SGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTY
        MAFR RRLKEISR L +I SGY HQHH    VSSL L+VAPFH SEGI++R+ D+GRHF RFSTTTELQCESS  N+I SFI STLDESEGPNH WLNT 
Subjt:  MAFRVRRLKEISRHLSRI-SGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTY

Query:  DGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEVRGLCYIISKNFRNP
        +GNKG+ E+DG++LILA+QFL M SS+S+ LVENVKFLQ RFP LHVIGFQC S+LS AEK+ MIQFIM+EY+SFPILLS K FEV G C IISK+  NP
Subjt:  DGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEVRGLCYIISKNFRNP

Query:  LLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSY
        LL+ ER+MD   + KAIEEL E E+EK    N G+TTYLKQAE+  EP SCSFM NFLLH+PGCISADE+GGRLFLSDSNHNRI+I NS GKILD+IGSY
Subjt:  LLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSY

Query:  PGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYV
        PGF+DGEFELVKLARPAASFYH+TQNCLYFVDSENHAIRKADL KRVVETLYPENYS+KKSTQLWSWIMDKFGLGS  +RE+EDFNPQSLMFPWH+IRY+
Subjt:  PGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYV

Query:  DDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYH
        DDRLLILNRSL TLW MDLVSGKIIEVVRGLS IME YG LI D++SV+KQIP G LQ+ S ANI TGG PY+DLLSS TPF+NCIIICDSVGQV+LKY+
Subjt:  DDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYH

Query:  RNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSE
        + SG  SS QFSNFGVLGLPYWFA PPEKVITTA+ FRGAGIDHLQFFRLLPG+VGIQINVDLP+DIELVESL +DSIWRQARGTATE  IVE VAGPSE
Subjt:  RNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSE

Query:  KVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKD-AERIAGFLYPRNGGKIRKECC
        KVGSAQQWYDELDSLAFSP +SEMVEDN R  N+IGDNKV IECAVNTSPGTSEVIVYAALYLRLRRNQD+EGN +K  A RIA FLY R  GKI KE C
Subjt:  KVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKD-AERIAGFLYPRNGGKIRKECC

Query:  IQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
        IQFL+  KR LRELIFVKPLHVRIKLDSL HPKA+NSK IILT SSVEVN+SL+S
Subjt:  IQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS

A0A6J1E0D7 uncharacterized protein LOC111025991 isoform X20.099.57Show/hide
Query:  FLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYDGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSA
        F RFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYDGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSA
Subjt:  FLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYDGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSA

Query:  AEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRNPLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNF
        AEKTDMIQFIMKEYVSFPILLSKKDFE+ RGLCYIISKNFRNPLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNF
Subjt:  AEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRNPLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNF

Query:  LLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYS
        LLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYS
Subjt:  LLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYS

Query:  SKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYVDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWL
        SKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYVDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWL
Subjt:  SKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYVDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWL

Query:  QRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYHRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGI
        QRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYHRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGI
Subjt:  QRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYHRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGI

Query:  QINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSEKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIV
        QINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSEKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIV
Subjt:  QINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSEKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIV

Query:  YAALYLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
        YAALYLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
Subjt:  YAALYLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS

A0A6J1E4J1 uncharacterized protein LOC111025991 isoform X10.099.73Show/hide
Query:  MAFRVRRLKEISRHLSRISGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYD
        MAFRVRRLKEISRHLSRISGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYD
Subjt:  MAFRVRRLKEISRHLSRISGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYD

Query:  GNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRNP
        GNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFE+ RGLCYIISKNFRNP
Subjt:  GNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRNP

Query:  LLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSY
        LLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSY
Subjt:  LLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSY

Query:  PGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYV
        PGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYV
Subjt:  PGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYV

Query:  DDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYH
        DDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYH
Subjt:  DDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYH

Query:  RNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSE
        RNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSE
Subjt:  RNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSE

Query:  KVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCI
        KVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCI
Subjt:  KVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCI

Query:  QFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
        QFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
Subjt:  QFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS

A0A6J1HJ63 uncharacterized protein LOC111464050 isoform X10.077.44Show/hide
Query:  MAFRVRRLKEISRHLSRI-SGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTY
        MAFR RRL+EIS+ L +  SGY HQ+H   AVSSL  +VAP + SEG+ +RI++ GRH LRFSTT ELQCESS  N++LSFIKSTLD+SEGPNH WLN  
Subjt:  MAFRVRRLKEISRHLSRI-SGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTY

Query:  DGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRN
        DGNKG+SEKDGI+LILADQFL M SS+SV LVENVKFLQHRFPQLHVIG QCS++ S AEK++MIQFIM+EYVSFPILLS K  E+ RG CYIISK+F N
Subjt:  DGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRN

Query:  PLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS
        PLL+ ER+ D   +RKAIEEL E E+EK   PN GRTTYLK AEI  EPYSC FMQNF+LHFPGCISADEKGGRLFLSDSNHNRI+IFN NGKILD+IGS
Subjt:  PLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS

Query:  YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRY
        YPGF+DGEFELVKLARPAASFYHATQ+CLYFVDSENHAIRKADL KRVVETLYP NYSS KSTQLWSWI D+ GLGS  +RE+EDFNPQSLMFPWH+IRY
Subjt:  YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRY

Query:  VDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKY
        +DDRLLILNRSL TLW MDL SGKIIEVV+G S IME YGQL  D +SV+KQIP G LQ    A  +TG LPYLDLLSS TPFQNC+IICDSVGQVI+KY
Subjt:  VDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKY

Query:  HRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPS
        +R SGESSS QFSNFGVLGLPYWFA PPEKVI+TADSF+GAGIDH+ FFRLLPGKVGI INVDLP DIELVES+QEDSIWRQ RGTATE  IVE V+G S
Subjt:  HRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPS

Query:  EKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKDAE---RIAGFLYPRNGGKIRK
        EKVGSAQQWYDELDSLAFSP +SE+VEDN R  NHIGD+K QIECAVNTSPGTSEVIVYAA+YLR RR QDWEGN  K A    RIA  LYP + GK  K
Subjt:  EKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKDAE---RIAGFLYPRNGGKIRK

Query:  ECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
        E CIQFL+  KR LRE+IFVKPLHVRIKLD++ HPKADNSKGIILTDSSVE+NLSLAS
Subjt:  ECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS

A0A6J1HW28 uncharacterized protein LOC111466804 isoform X10.077.6Show/hide
Query:  MAFRVRRLKEISRHLSRI-SGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTY
        MAFR RRL+EIS+ L +  SGY HQHH   AVSSL  +VA  + SEG+++RI+D G H  RFSTTTELQC+SS AN+ILSFIKSTLDESEGPNH WLN  
Subjt:  MAFRVRRLKEISRHLSRI-SGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTY

Query:  DGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRN
        DGNKG+SEKD I+LILADQFL M SS+SV LVENVKFLQHRFPQLHVIG QCS++LS  EK++MIQFIM+EYVSFPILLS K FE+ RGLCYIISK++ N
Subjt:  DGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEV-RGLCYIISKNFRN

Query:  PLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS
        PLL+ ER+ D   +RKAIEEL E E+EK   PN GRTTYLK AEI  EPYSCSFMQNF+LHFPGCISADEKGGRLFLSDSNHNRIIIFN NGKILD+IGS
Subjt:  PLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS

Query:  YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRY
        YPGF+DGEFELVKLARPAASFYHATQ+CLYFVDSENHAIRKADL KRVVETLYP NYSS KSTQLWSWI D+ GLGS  +RE+EDFNPQSLMFPWH+I+Y
Subjt:  YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRY

Query:  VDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKY
        +DDRLLILNRSL TLW MDL SGKIIEVV+G S IME Y QL  D+VSV+KQIP G LQ    A  VTGGLPYLDLLSS T FQNC+IICDSVGQVI+KY
Subjt:  VDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKY

Query:  HRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPS
        +R SGESSS +FSNFGVLGLPYWFA PPEKVI+TADSF+GAGIDH+ FFRLLPGKVGI INVDLP DIELVES+QEDSIWRQ RGTATE  IVE V+  S
Subjt:  HRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPS

Query:  EKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKDAE---RIAGFLYPRNGGKIRK
        EKVGSAQQWYDELDSLAFSP +SE+VEDN R  NHIGD+K QIECAVNTSPGTSEVIVYAA+YLR RR+QDWEGN +K A    RIA  LYP + GK  K
Subjt:  EKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWEGNEEKDAE---RIAGFLYPRNGGKIRK

Query:  ECCIQFLLKR-KRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS
        E CIQFLL   KR LRE++FVKPLHVRIKLD++ HPKADNSKGIILTDSSVE+NLSLAS
Subjt:  ECCIQFLLKR-KRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS

SwissProt top hitse value%identityAlignment
A4IF69 NHL repeat-containing protein 21.1e-0838.3Show/hide
Query:  LHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKI-LDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETL
        L FPG I+ D    RL ++D+ H+RI++   NG+I   I G  PG  DG F       P         N +Y  D+ENH IRK DL+  +V T+
Subjt:  LHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKI-LDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETL

Q5ZI67 NHL repeat-containing protein 22.5e-1129.38Show/hide
Query:  LKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS-YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHA
        +K   I  + Y  S   + LL FPG ++ D+ G RL ++D+ H+RI++   NG+IL  IG    G  DG F       P         N +Y  D+ENH 
Subjt:  LKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS-YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHA

Query:  IRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHII-------RYVDDRLLILNRSLQTLWVMDLVSGKI
        IRK DL+  +V T+                 +DK G G+  E        Q +  PW ++          DD L I    +  +W + L  GK+
Subjt:  IRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHII-------RYVDDRLLILNRSLQTLWVMDLVSGKI

Q8BZW8 NHL repeat-containing protein 21.0e-0938.3Show/hide
Query:  LHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS-YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETL
        L FPG ++ D   GRL ++D+ H+RI++   NG+I   IG   PG  DG F       P         N +Y  D+ENH IRK DL+   V T+
Subjt:  LHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS-YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETL

Q8NBF2 NHL repeat-containing protein 25.7e-0837.23Show/hide
Query:  LHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKI-LDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETL
        L FPG ++ D+   RL ++D+ H+RI++   NG+I   I G  PG  DG F       P         N +Y  D+ENH IRK DL+   V T+
Subjt:  LHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKI-LDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETL

Q8VZ10 Protein SUPPRESSOR OF QUENCHING 1, chloroplastic4.8e-1546.32Show/hide
Query:  LHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS--YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETL
        L FPG ++ D    RLF+SDSNHNRII+ +  G  +  IGS    GF DG FE     RP    Y+A +N LY  D+ENHA+R+ D     V+TL
Subjt:  LHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS--YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETL

Arabidopsis top hitse value%identityAlignment
AT1G56500.1 haloacid dehalogenase-like hydrolase family protein3.4e-1646.32Show/hide
Query:  LHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS--YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETL
        L FPG ++ D    RLF+SDSNHNRII+ +  G  +  IGS    GF DG FE     RP    Y+A +N LY  D+ENHA+R+ D     V+TL
Subjt:  LHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGS--YPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETL

AT3G07060.1 NHL domain-containing protein3.2e-17643.97Show/hide
Query:  MAFRVRRLKEISRHLSRISGYSHQHHRSDAVSSLALAVAP-FHASEGINKRIVDDGRHFLRFSTTTELQCESSAAN--------NILSFIKSTLDESEGP
        M+ R   LK+IS   SRI   +       ++++ A  +AP       I  + + + R    F++   +   SS+++        ++LSFIK++LD+ EGP
Subjt:  MAFRVRRLKEISRHLSRISGYSHQHHRSDAVSSLALAVAP-FHASEGINKRIVDDGRHFLRFSTTTELQCESSAAN--------NILSFIKSTLDESEGP

Query:  NHRWLNTYDGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAA-EKTDMIQFIMKEYVSFPILLSKKDF-----EV
        +H WLN   GNK L +  G +++LA   L   S  S F  E +K LQ R P +  +G   S     A ++T + + I+KEY++FP+LLS+K+F     EV
Subjt:  NHRWLNTYDGNKGLSEKDGIFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAA-EKTDMIQFIMKEYVSFPILLSKKDF-----EV

Query:  RGLCYIISKNFRNPLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIII
        R   YI+ K+F+NPL+  E+++D  ++ KA++ L  Q++EK         T+ KQAE   E +  SF Q+ LL+FPGCISADE G RLFLSD+NH+RIII
Subjt:  RGLCYIISKNFRNPLLLYERNMDPFTMRKAIEELQEQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIII

Query:  FNSNGKILDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGS------NAER
        F ++GKI+D IG +PGF+DG+FE  K+ RP  + Y   ++CLY VDSENHAIR+A+++ RV+ET+YP+    KK+  LWSWIM+K GLG       +A+ 
Subjt:  FNSNGKILDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGS------NAER

Query:  ELEDFNPQSLMFPWHIIRYVDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCT
        + E+F+ +SL+FPWHI++  D+ LL++N+S   LW+++  SG+I EVV G S I+E  GQ IT+K+SV++ +P+ WLQ+ + A       P   LLSS T
Subjt:  ELEDFNPQSLMFPWHIIRYVDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQLITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCT

Query:  PFQNCIIICDSVGQVILKYHRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWR
           + I++ D   Q +LK +R+SG  SS QFSN G+LGLPYW   P E+V   A+  + A + H Q  RLLPGK+ I++N+++P   ELVE +QE  IWR
Subjt:  PFQNCIIICDSVGQVILKYHRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLLPGKVGIQINVDLPADIELVESLQEDSIWR

Query:  QARGTATESLIVEDVAGPSEKVGSAQQWYDELDSLAFSPPDSEMVED------NTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWE-G
        Q RG  +E         PSEK+G +QQWYDELDSLA    + E  E+      N    +   D ++ I+C V TSPG+SE+IVYAALYLRL RN++ E  
Subjt:  QARGTATESLIVEDVAGPSEKVGSAQQWYDELDSLAFSPPDSEMVED------NTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAALYLRLRRNQDWE-G

Query:  NEEKDAERIAGFLYP-RNGGKIRKECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSL
         +E+ A +IA  L P RN   ++++  +  L K KR LR+++F+KP+HVRI+LDS DHPKADNS+ +ILTDSSVEV++SL
Subjt:  NEEKDAERIAGFLYP-RNGGKIRKECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTTAGGGTTCGCCGACTCAAAGAAATTTCTAGGCATTTGTCCCGAATCTCCGGATATTCTCATCAGCATCACCGTAGTGATGCTGTCAGCTCTTTGGCATTGGC
CGTTGCTCCATTTCATGCATCTGAAGGAATCAATAAAAGGATTGTAGACGATGGACGTCACTTTCTGCGGTTTTCCACTACAACAGAGCTACAATGCGAGTCTTCTGCAG
CAAATAATATTTTATCCTTCATTAAGTCAACCTTAGATGAGTCCGAAGGGCCTAACCACCGTTGGTTGAATACATATGATGGAAATAAAGGACTTTCAGAGAAGGATGGT
ATCTTCTTAATTCTTGCCGATCAATTTCTTGCAATGGCAAGCTCTGAATCTGTTTTTCTGGTTGAAAATGTAAAGTTCCTTCAGCACAGGTTTCCTCAGCTTCATGTTAT
TGGGTTTCAGTGTTCCAGTTCTCTATCTGCTGCTGAAAAAACTGACATGATCCAATTTATAATGAAGGAATATGTTTCGTTTCCCATTTTGTTGTCCAAGAAGGATTTTG
AGGTGAGGGGGCTCTGTTATATTATCTCCAAAAACTTCAGAAATCCTTTACTCCTCTATGAGAGGAACATGGATCCTTTCACTATGAGGAAAGCTATCGAGGAGTTGCAA
GAACAAGAAAGTGAGAAATTTAGTCAGCCCAATAATGGGAGAACCACTTACCTAAAGCAGGCGGAGATCACAACAGAACCATATTCATGTTCATTCATGCAGAATTTTCT
TCTCCACTTTCCAGGCTGTATATCTGCAGATGAAAAGGGTGGCCGACTCTTCCTTTCAGACAGCAATCATAACCGGATTATCATATTTAACAGCAATGGGAAGATTCTGG
ACATTATTGGTTCTTATCCAGGTTTTGATGATGGAGAATTTGAATTGGTCAAATTAGCTCGTCCAGCAGCTTCCTTTTATCATGCTACTCAGAATTGCTTGTATTTTGTG
GACTCTGAGAACCATGCTATTAGGAAAGCTGATTTGGACAAGCGGGTTGTGGAAACTCTCTATCCAGAAAACTACTCGAGTAAGAAGAGTACACAGTTATGGAGCTGGAT
TATGGACAAATTTGGTCTTGGAAGTAATGCTGAGAGAGAATTAGAAGACTTCAATCCGCAGTCTCTGATGTTTCCTTGGCACATCATTAGATATGTGGATGATAGATTAT
TAATTTTAAACCGCAGTCTTCAAACACTATGGGTCATGGATTTGGTGTCAGGAAAAATTATTGAAGTTGTTAGAGGCCTTTCAAATATCATGGAGACCTATGGACAGTTG
ATCACAGACAAAGTGTCTGTTATGAAACAGATCCCGACTGGTTGGTTGCAGCGGCTAAGTCATGCAAATATTGTCACAGGGGGGCTACCATACCTGGATCTTTTATCTTC
CTGTACACCCTTCCAGAACTGCATAATCATTTGCGACTCAGTTGGACAGGTGATATTGAAATATCATAGAAATTCTGGTGAGAGCTCAAGCTTCCAATTTTCAAATTTTG
GGGTTCTTGGATTACCATATTGGTTTGCTTCACCTCCGGAGAAGGTCATAACCACTGCTGATAGCTTTCGAGGAGCAGGGATTGATCATCTTCAGTTTTTCAGACTGCTG
CCTGGTAAGGTTGGTATACAGATCAATGTTGATCTGCCTGCTGATATTGAACTTGTGGAATCATTACAAGAAGACAGCATATGGCGACAAGCAAGAGGAACTGCAACTGA
AAGCTTAATTGTCGAGGATGTAGCTGGGCCCTCAGAAAAGGTTGGTTCTGCTCAACAGTGGTATGATGAGTTGGATAGCCTAGCCTTTTCACCGCCAGATTCAGAAATGG
TGGAAGATAATACGAGAACTTCTAATCATATAGGAGACAATAAAGTGCAGATTGAGTGTGCTGTCAATACAAGTCCTGGAACTAGCGAGGTTATAGTTTATGCTGCCCTG
TATTTAAGGCTGAGAAGAAACCAAGATTGGGAAGGCAACGAGGAGAAAGATGCAGAAAGGATAGCAGGTTTTTTGTACCCAAGAAATGGAGGGAAGATAAGAAAAGAGTG
CTGCATTCAGTTCCTTTTAAAACGTAAAAGGTATTTGAGAGAGCTCATTTTCGTGAAACCTCTTCATGTCAGGATTAAGTTGGATTCTCTCGATCACCCTAAAGCTGATA
ATTCCAAAGGTATTATCCTCACAGACTCTTCAGTTGAAGTCAATCTATCACTTGCCTCC
mRNA sequenceShow/hide mRNA sequence
ATGGCTTTTAGGGTTCGCCGACTCAAAGAAATTTCTAGGCATTTGTCCCGAATCTCCGGATATTCTCATCAGCATCACCGTAGTGATGCTGTCAGCTCTTTGGCATTGGC
CGTTGCTCCATTTCATGCATCTGAAGGAATCAATAAAAGGATTGTAGACGATGGACGTCACTTTCTGCGGTTTTCCACTACAACAGAGCTACAATGCGAGTCTTCTGCAG
CAAATAATATTTTATCCTTCATTAAGTCAACCTTAGATGAGTCCGAAGGGCCTAACCACCGTTGGTTGAATACATATGATGGAAATAAAGGACTTTCAGAGAAGGATGGT
ATCTTCTTAATTCTTGCCGATCAATTTCTTGCAATGGCAAGCTCTGAATCTGTTTTTCTGGTTGAAAATGTAAAGTTCCTTCAGCACAGGTTTCCTCAGCTTCATGTTAT
TGGGTTTCAGTGTTCCAGTTCTCTATCTGCTGCTGAAAAAACTGACATGATCCAATTTATAATGAAGGAATATGTTTCGTTTCCCATTTTGTTGTCCAAGAAGGATTTTG
AGGTGAGGGGGCTCTGTTATATTATCTCCAAAAACTTCAGAAATCCTTTACTCCTCTATGAGAGGAACATGGATCCTTTCACTATGAGGAAAGCTATCGAGGAGTTGCAA
GAACAAGAAAGTGAGAAATTTAGTCAGCCCAATAATGGGAGAACCACTTACCTAAAGCAGGCGGAGATCACAACAGAACCATATTCATGTTCATTCATGCAGAATTTTCT
TCTCCACTTTCCAGGCTGTATATCTGCAGATGAAAAGGGTGGCCGACTCTTCCTTTCAGACAGCAATCATAACCGGATTATCATATTTAACAGCAATGGGAAGATTCTGG
ACATTATTGGTTCTTATCCAGGTTTTGATGATGGAGAATTTGAATTGGTCAAATTAGCTCGTCCAGCAGCTTCCTTTTATCATGCTACTCAGAATTGCTTGTATTTTGTG
GACTCTGAGAACCATGCTATTAGGAAAGCTGATTTGGACAAGCGGGTTGTGGAAACTCTCTATCCAGAAAACTACTCGAGTAAGAAGAGTACACAGTTATGGAGCTGGAT
TATGGACAAATTTGGTCTTGGAAGTAATGCTGAGAGAGAATTAGAAGACTTCAATCCGCAGTCTCTGATGTTTCCTTGGCACATCATTAGATATGTGGATGATAGATTAT
TAATTTTAAACCGCAGTCTTCAAACACTATGGGTCATGGATTTGGTGTCAGGAAAAATTATTGAAGTTGTTAGAGGCCTTTCAAATATCATGGAGACCTATGGACAGTTG
ATCACAGACAAAGTGTCTGTTATGAAACAGATCCCGACTGGTTGGTTGCAGCGGCTAAGTCATGCAAATATTGTCACAGGGGGGCTACCATACCTGGATCTTTTATCTTC
CTGTACACCCTTCCAGAACTGCATAATCATTTGCGACTCAGTTGGACAGGTGATATTGAAATATCATAGAAATTCTGGTGAGAGCTCAAGCTTCCAATTTTCAAATTTTG
GGGTTCTTGGATTACCATATTGGTTTGCTTCACCTCCGGAGAAGGTCATAACCACTGCTGATAGCTTTCGAGGAGCAGGGATTGATCATCTTCAGTTTTTCAGACTGCTG
CCTGGTAAGGTTGGTATACAGATCAATGTTGATCTGCCTGCTGATATTGAACTTGTGGAATCATTACAAGAAGACAGCATATGGCGACAAGCAAGAGGAACTGCAACTGA
AAGCTTAATTGTCGAGGATGTAGCTGGGCCCTCAGAAAAGGTTGGTTCTGCTCAACAGTGGTATGATGAGTTGGATAGCCTAGCCTTTTCACCGCCAGATTCAGAAATGG
TGGAAGATAATACGAGAACTTCTAATCATATAGGAGACAATAAAGTGCAGATTGAGTGTGCTGTCAATACAAGTCCTGGAACTAGCGAGGTTATAGTTTATGCTGCCCTG
TATTTAAGGCTGAGAAGAAACCAAGATTGGGAAGGCAACGAGGAGAAAGATGCAGAAAGGATAGCAGGTTTTTTGTACCCAAGAAATGGAGGGAAGATAAGAAAAGAGTG
CTGCATTCAGTTCCTTTTAAAACGTAAAAGGTATTTGAGAGAGCTCATTTTCGTGAAACCTCTTCATGTCAGGATTAAGTTGGATTCTCTCGATCACCCTAAAGCTGATA
ATTCCAAAGGTATTATCCTCACAGACTCTTCAGTTGAAGTCAATCTATCACTTGCCTCC
Protein sequenceShow/hide protein sequence
MAFRVRRLKEISRHLSRISGYSHQHHRSDAVSSLALAVAPFHASEGINKRIVDDGRHFLRFSTTTELQCESSAANNILSFIKSTLDESEGPNHRWLNTYDGNKGLSEKDG
IFLILADQFLAMASSESVFLVENVKFLQHRFPQLHVIGFQCSSSLSAAEKTDMIQFIMKEYVSFPILLSKKDFEVRGLCYIISKNFRNPLLLYERNMDPFTMRKAIEELQ
EQESEKFSQPNNGRTTYLKQAEITTEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNRIIIFNSNGKILDIIGSYPGFDDGEFELVKLARPAASFYHATQNCLYFV
DSENHAIRKADLDKRVVETLYPENYSSKKSTQLWSWIMDKFGLGSNAERELEDFNPQSLMFPWHIIRYVDDRLLILNRSLQTLWVMDLVSGKIIEVVRGLSNIMETYGQL
ITDKVSVMKQIPTGWLQRLSHANIVTGGLPYLDLLSSCTPFQNCIIICDSVGQVILKYHRNSGESSSFQFSNFGVLGLPYWFASPPEKVITTADSFRGAGIDHLQFFRLL
PGKVGIQINVDLPADIELVESLQEDSIWRQARGTATESLIVEDVAGPSEKVGSAQQWYDELDSLAFSPPDSEMVEDNTRTSNHIGDNKVQIECAVNTSPGTSEVIVYAAL
YLRLRRNQDWEGNEEKDAERIAGFLYPRNGGKIRKECCIQFLLKRKRYLRELIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNLSLAS