; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr015039 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr015039
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionNHL domain-containing protein
Genome locationtig00002854:136004..148233
RNA-Seq ExpressionSgr015039
SyntenySgr015039
Gene Ontology termsNA
InterPro domainsIPR011042 - Six-bladed beta-propeller, TolB-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571580.1 Protein SUPPRESSOR OF QUENCHING 1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0080.08Show/hide
Query:  MAFRFRRLKEISRHLSRM-SGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTS
        MAFRFRRL+EIS+ L +  SGY HQHH  +AV+SL  SV+PS+VSEG  +RI+++GRHLL FSTT ELQCESSPAND+LSFIKST+DESEGPNH W N  
Subjt:  MAFRFRRLKEISRHLSRM-SGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTS

Query:  DGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNN
        DG KG+SEKDGI+LILADQFLEM+SSDSV+L+ENVKFLQHRFPQLHV G QCS++LS AEKS+MI FIM+EYVSFPILLS KI EM RGL YIISKDF+N
Subjt:  DGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNN

Query:  PLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS
        PLL+ ER+ DL +LRKAIEEL E ENEKSGLPN GRTTYLK AEIIKEPYSCSFMQNF+L FP CISADEKGGRLFLSDSNHNRI+IFNGNGKILD+IGS
Subjt:  PLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS

Query:  YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRY
        YPGFEDGEF+LVKLARPAASFYHATQ+CLYFVDSENHAIRKADLGKRVVETLYP NYSS KSTQLWSWI D+ G+GS  DRE+ED NPQSLMFPWHMIRY
Subjt:  YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRY

Query:  VDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKY
        +DDRLLILNRSL TLWTMDLASGKIIEVV+G SKIMENYGQL MD +S+LKQIP+G LQ   DA  +TG LPY+DLLSSLTPFQNC+IICDSVGQV++KY
Subjt:  VDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKY

Query:  HRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPS
        +R+S ESSS QFSNFGVLGLPYWFAPPPEKVI+TADSFQGAGIDH+ FF+LLPG+VGI INVDLP DIELVES+QE SIWRQ RGTATEI ++E V+G S
Subjt:  HRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPS

Query:  EKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKY---AARIADFLYPGSTGKLRK
        EKVGSAQQWYDELDSLAFS  ESE+VED +R  NHIGD+K  IECAVNTSPGTSEVIVYAA+YLR RR QD EGNGDK    AARIAD LYPGS GK  K
Subjt:  EKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKY---AARIADFLYPGSTGKLRK

Query:  ECCIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS
        E CIQFL+  K DLRE+IFVKPLHVRI+LD+L HPKADNSKGIILTDSSVE+N+SL S
Subjt:  ECCIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS

XP_022159641.1 uncharacterized protein LOC111025991 isoform X1 [Momordica charantia]0.0e+0087.27Show/hide
Query:  MAFRFRRLKEISRHLSRMSGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTSD
        MAFR RRLKEISRHLSR+SGY HQHH+S AV+SLAL+VAP H SEG NKRIVD+GRH L FSTTTELQCESS AN+ILSFIKST+DESEGPNH W NT D
Subjt:  MAFRFRRLKEISRHLSRMSGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTSD

Query:  GNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNNP
        GNKGLSEKDGIFLILADQFL M+SS+SV L+ENVKFLQHRFPQLHV GFQCSSSLSAAEK+DMI FIMKEYVSFPILLSKK FEMARGL YIISK+F NP
Subjt:  GNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNNP

Query:  LLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGSY
        LLLYERNMD   +RKAIEEL EQE+EK   PNNGRTTYLKQAEI  EPYSCSFMQNFLL FPGCISADEKGGRLFLSDSNHNRIIIFN NGKILDIIGSY
Subjt:  LLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGSY

Query:  PGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRYV
        PGF+DGEF+LVKLARPAASFYHATQNCLYFVDSENHAIRKADL KRVVETLYPENYSSKKSTQLWSWIMDKFG+GSNA+RELED NPQSLMFPWH+IRYV
Subjt:  PGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRYV

Query:  DDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKYH
        DDRLLILNRSLQTLW MDL SGKIIEVVRGLS IME YGQLI DKVS++KQIP GWLQ+LS AN+VTGGLPY+DLLSS TPFQNCIIICDSVGQV+LKYH
Subjt:  DDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKYH

Query:  RQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPSE
        R S ESSSFQFSNFGVLGLPYWFA PPEKVITTADSF+GAGIDHLQFF+LLPG+VGIQINVDLP DIELVESLQE SIWRQARGTATE L++EDVAGPSE
Subjt:  RQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPSE

Query:  KVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKYAARIADFLYPGSTGKLRKECCI
        KVGSAQQWYDELDSLAFS P+SEMVED+ RTSNHIGDNKV IECAVNTSPGTSEVIVYAALYLRLRRNQD EGN +K A RIA FLYP + GK+RKECCI
Subjt:  KVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKYAARIADFLYPGSTGKLRKECCI

Query:  QFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS
        QFLLKRK  LRELIFVKPLHVRI+LDSLDHPKADNSKGIILTDSSVEVN+SL S
Subjt:  QFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS

XP_022159642.1 uncharacterized protein LOC111025991 isoform X2 [Momordica charantia]0.0e+0088.04Show/hide
Query:  FSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEK
        FSTTTELQCESS AN+ILSFIKST+DESEGPNH W NT DGNKGLSEKDGIFLILADQFL M+SS+SV L+ENVKFLQHRFPQLHV GFQCSSSLSAAEK
Subjt:  FSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEK

Query:  SDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQ
        +DMI FIMKEYVSFPILLSKK FEMARGL YIISK+F NPLLLYERNMD   +RKAIEEL EQE+EK   PNNGRTTYLKQAEI  EPYSCSFMQNFLL 
Subjt:  SDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQ

Query:  FPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKK
        FPGCISADEKGGRLFLSDSNHNRIIIFN NGKILDIIGSYPGF+DGEF+LVKLARPAASFYHATQNCLYFVDSENHAIRKADL KRVVETLYPENYSSKK
Subjt:  FPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKK

Query:  STQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQL
        STQLWSWIMDKFG+GSNA+RELED NPQSLMFPWH+IRYVDDRLLILNRSLQTLW MDL SGKIIEVVRGLS IME YGQLI DKVS++KQIP GWLQ+L
Subjt:  STQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQL

Query:  SDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKYHRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQIN
        S AN+VTGGLPY+DLLSS TPFQNCIIICDSVGQV+LKYHR S ESSSFQFSNFGVLGLPYWFA PPEKVITTADSF+GAGIDHLQFF+LLPG+VGIQIN
Subjt:  SDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKYHRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQIN

Query:  VDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPSEKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAA
        VDLP DIELVESLQE SIWRQARGTATE L++EDVAGPSEKVGSAQQWYDELDSLAFS P+SEMVED+ RTSNHIGDNKV IECAVNTSPGTSEVIVYAA
Subjt:  VDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPSEKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAA

Query:  LYLRLRRNQDSEGNGDKYAARIADFLYPGSTGKLRKECCIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS
        LYLRLRRNQD EGN +K A RIA FLYP + GK+RKECCIQFLLKRK  LRELIFVKPLHVRI+LDSLDHPKADNSKGIILTDSSVEVN+SL S
Subjt:  LYLRLRRNQDSEGNGDKYAARIADFLYPGSTGKLRKECCIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS

XP_022963868.1 uncharacterized protein LOC111464050 isoform X1 [Cucurbita moschata]0.0e+0079.37Show/hide
Query:  SFGSSLTNMAFRFRRLKEISRHLSRM-SGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGP
        SF SS TNMAFRFRRL+EIS+ L +  SGY HQ+H  +AV+SL  SVAPS+VSEG  +RI+++GRHLL FSTT ELQCESSP ND+LSFIKST+D+SEGP
Subjt:  SFGSSLTNMAFRFRRLKEISRHLSRM-SGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGP

Query:  NHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSY
        NH W N  DGNKG+SEKDGI+LILADQFLEM+SSDSV+L+ENVKFLQHRFPQLHV G QCS++ S AEKS+MI FIM+EYVSFPILLS KI EM RG  Y
Subjt:  NHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSY

Query:  IISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNG
        IISKDF+NPLL+ ER+ DL +LRKAIEEL E ENEKSGLPN GRTTYLK AEIIKEPYSC FMQNF+L FPGCISADEKGGRLFLSDSNHNRI+IFNGNG
Subjt:  IISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNG

Query:  KILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLM
        KILD+IGSYPGFEDGEF+LVKLARPAASFYHATQ+CLYFVDSENHAIRKADLGKRVVETLYP NYSS KSTQLWSWI D+ G+GS  DRE+ED NPQSLM
Subjt:  KILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLM

Query:  FPWHMIRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDS
        FPWHMIRY+DDRLLILNRSL TLWTMDLASGKIIEVV+G SKIMENYGQL MD +S+LKQIP+G LQ   DA  +TG LPY+DLLSSLTPFQNC+IICDS
Subjt:  FPWHMIRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDS

Query:  VGQVVLKYHRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILV
        VGQV++KY+R+S ESSS QFSNFGVLGLPYWFAPPPEKVI+TADSFQGAGIDH+ FF+LLPG+VGI INVDLP DIELVES+QE SIWRQ RGTATEI +
Subjt:  VGQVVLKYHRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILV

Query:  IEDVAGPSEKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKY---AARIADFLYP
        +E V+G SEKVGSAQQWYDELDSLAFS  ESE+VED++R  NHIGD+K  IECAVNTSPGTSEVIVYAA+YLR RR QD EGNG K    AARIAD LYP
Subjt:  IEDVAGPSEKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKY---AARIADFLYP

Query:  GSTGKLRKECCIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS
        GS GK  KE CIQFL+  K DLRE+IFVKPLHVRI+LD++ HPKADNSKGIILTDSSVE+N+SL S
Subjt:  GSTGKLRKECCIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS

XP_038888990.1 uncharacterized protein LOC120078755 isoform X1 [Benincasa hispida]0.0e+0082.28Show/hide
Query:  MAFRFRRLKEISRHLSRM-SGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTS
        MAFRFRRLKEI R L ++ SGY HQHH  YAV+SLALSV+PSHVSEG ++R++D+GRH L FSTTT LQ ESSPANDI SFIKST+DESEGPNH W NTS
Subjt:  MAFRFRRLKEISRHLSRM-SGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTS

Query:  DGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNN
        +GNKG+ EKDG++LILADQFLEM+S+DSV+L+ENVKFLQ RFP LHV GFQCSS+LSAAEKSDMI FIM+EY+SFPILLS KIFE+  GL YIISKD +N
Subjt:  DGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNN

Query:  PLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS
        PLL+  R MDL ILRKAIEELHE ENEKSGLP+ GRTTY+KQAEI+KEP SCSFMQNFLL FPGCISADE+G RLFLSDSNHNRI+IFN +GKILD+IGS
Subjt:  PLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS

Query:  YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRY
        YPGFEDGEF+LVKLARPAASFYH+TQNCLYFVDSENHAIRKADLGKRVVETLYPENYS+K STQLWSWIMDKFG+GS  DRE+ED NPQSLMFPWHMIRY
Subjt:  YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRY

Query:  VDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKY
        +DDRLLIL+RSL TLWTMDLASGKIIE+VRGLS+IMENYGQLIMD++S+LKQIP+G LQ  +DAN+ TGGLPY+DLLSSLTPFQNCIIICDSVGQVVLKY
Subjt:  VDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKY

Query:  HRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPS
        +R+S ESSSFQFSNFGVLGLPYW APPPEKVI  AD+FQGA IDHLQFF+LLPG+VGIQINVDLP DIELVESL E SIWRQARGTATEI ++E+VA PS
Subjt:  HRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPS

Query:  EKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDK-YAARIADFLYPGSTGKLRKEC
        EKVGSAQQWYDELDSLAFS  ESEMVED++R  N+IGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGN DK +A RIADFLYPG+ GK+ KE 
Subjt:  EKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDK-YAARIADFLYPGSTGKLRKEC

Query:  CIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS
        CI+FL+  K DLRELIFVKPLHVRI+LDSL HPKA+NSKGIILTDSSVEVNVSL S
Subjt:  CIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS

TrEMBL top hitse value%identityAlignment
A0A1S3CQE6 uncharacterized protein LOC103503064 isoform X10.0e+0080.42Show/hide
Query:  MAFRFRRLKEISRHLSRM-SGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTS
        MAFRFRRLKEISR L ++ SGY HQHH  Y V+SL LSVAP HVSEG ++R+ DNGRH   FSTTTELQCESSP NDI SFI ST+DESEGPNH W NTS
Subjt:  MAFRFRRLKEISRHLSRM-SGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTS

Query:  DGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNN
        +GNKG+ E+DG++LILA+QFLEM+SSDS+ L+ENVKFLQ RFP LHV GFQC S+LS AEKS MI FIM+EY+SFPILLS KIFE+A     IISKD +N
Subjt:  DGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNN

Query:  PLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS
        PLL+ ER+MDL IL KAIEELHE ENEKSGL N G+TTYLKQAE+IKEP SCSFM NFLL +PGCISADE+GGRLFLSDSNHNRI+I N  GKILD+IGS
Subjt:  PLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS

Query:  YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRY
        YPGFEDGEF+LVKLARPAASFYH+TQNCLYFVDSENHAIRKADLGKRVVETLYPENYS+KKSTQLWSWIMDKFG+GS  DRE+ED NPQSLMFPWH+IRY
Subjt:  YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRY

Query:  VDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKY
        +DDRLLILNRSL TLWTMDL SGKIIEVVRGLS+IMENYG LIMD++S+LKQIP+G LQQ SDAN+ TGG PYMDLLSSLTPF+NCIIICDSVGQVVLKY
Subjt:  VDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKY

Query:  HRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPS
        ++QS   SS QFSNFGVLGLPYWFAPPPEKVITTA+ F+GAGIDHLQFF+LLPG VGIQINVDLP DIELVESL + SIWRQARGTATEI ++E VAGPS
Subjt:  HRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPS

Query:  EKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKY-AARIADFLYPGSTGKLRKEC
        EKVGSAQQWYDELDSLAFS  ESEMVED++R  N+IGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQD EGNG K+ A RIADFLY    GK+ KE 
Subjt:  EKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKY-AARIADFLYPGSTGKLRKEC

Query:  CIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS
        CIQFL+  K DLRELIFVKPLHVRI+LDSL HPKA+NSK IILT SSVEVNVSL S
Subjt:  CIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS

A0A6J1E0D7 uncharacterized protein LOC111025991 isoform X20.0e+0088.04Show/hide
Query:  FSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEK
        FSTTTELQCESS AN+ILSFIKST+DESEGPNH W NT DGNKGLSEKDGIFLILADQFL M+SS+SV L+ENVKFLQHRFPQLHV GFQCSSSLSAAEK
Subjt:  FSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEK

Query:  SDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQ
        +DMI FIMKEYVSFPILLSKK FEMARGL YIISK+F NPLLLYERNMD   +RKAIEEL EQE+EK   PNNGRTTYLKQAEI  EPYSCSFMQNFLL 
Subjt:  SDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQ

Query:  FPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKK
        FPGCISADEKGGRLFLSDSNHNRIIIFN NGKILDIIGSYPGF+DGEF+LVKLARPAASFYHATQNCLYFVDSENHAIRKADL KRVVETLYPENYSSKK
Subjt:  FPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKK

Query:  STQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQL
        STQLWSWIMDKFG+GSNA+RELED NPQSLMFPWH+IRYVDDRLLILNRSLQTLW MDL SGKIIEVVRGLS IME YGQLI DKVS++KQIP GWLQ+L
Subjt:  STQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQL

Query:  SDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKYHRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQIN
        S AN+VTGGLPY+DLLSS TPFQNCIIICDSVGQV+LKYHR S ESSSFQFSNFGVLGLPYWFA PPEKVITTADSF+GAGIDHLQFF+LLPG+VGIQIN
Subjt:  SDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKYHRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQIN

Query:  VDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPSEKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAA
        VDLP DIELVESLQE SIWRQARGTATE L++EDVAGPSEKVGSAQQWYDELDSLAFS P+SEMVED+ RTSNHIGDNKV IECAVNTSPGTSEVIVYAA
Subjt:  VDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPSEKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAA

Query:  LYLRLRRNQDSEGNGDKYAARIADFLYPGSTGKLRKECCIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS
        LYLRLRRNQD EGN +K A RIA FLYP + GK+RKECCIQFLLKRK  LRELIFVKPLHVRI+LDSLDHPKADNSKGIILTDSSVEVN+SL S
Subjt:  LYLRLRRNQDSEGNGDKYAARIADFLYPGSTGKLRKECCIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS

A0A6J1E4J1 uncharacterized protein LOC111025991 isoform X10.0e+0087.27Show/hide
Query:  MAFRFRRLKEISRHLSRMSGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTSD
        MAFR RRLKEISRHLSR+SGY HQHH+S AV+SLAL+VAP H SEG NKRIVD+GRH L FSTTTELQCESS AN+ILSFIKST+DESEGPNH W NT D
Subjt:  MAFRFRRLKEISRHLSRMSGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGPNHCWFNTSD

Query:  GNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNNP
        GNKGLSEKDGIFLILADQFL M+SS+SV L+ENVKFLQHRFPQLHV GFQCSSSLSAAEK+DMI FIMKEYVSFPILLSKK FEMARGL YIISK+F NP
Subjt:  GNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSYIISKDFNNP

Query:  LLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGSY
        LLLYERNMD   +RKAIEEL EQE+EK   PNNGRTTYLKQAEI  EPYSCSFMQNFLL FPGCISADEKGGRLFLSDSNHNRIIIFN NGKILDIIGSY
Subjt:  LLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGSY

Query:  PGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRYV
        PGF+DGEF+LVKLARPAASFYHATQNCLYFVDSENHAIRKADL KRVVETLYPENYSSKKSTQLWSWIMDKFG+GSNA+RELED NPQSLMFPWH+IRYV
Subjt:  PGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMIRYV

Query:  DDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKYH
        DDRLLILNRSLQTLW MDL SGKIIEVVRGLS IME YGQLI DKVS++KQIP GWLQ+LS AN+VTGGLPY+DLLSS TPFQNCIIICDSVGQV+LKYH
Subjt:  DDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKYH

Query:  RQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPSE
        R S ESSSFQFSNFGVLGLPYWFA PPEKVITTADSF+GAGIDHLQFF+LLPG+VGIQINVDLP DIELVESLQE SIWRQARGTATE L++EDVAGPSE
Subjt:  RQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPSE

Query:  KVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKYAARIADFLYPGSTGKLRKECCI
        KVGSAQQWYDELDSLAFS P+SEMVED+ RTSNHIGDNKV IECAVNTSPGTSEVIVYAALYLRLRRNQD EGN +K A RIA FLYP + GK+RKECCI
Subjt:  KVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKYAARIADFLYPGSTGKLRKECCI

Query:  QFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS
        QFLLKRK  LRELIFVKPLHVRI+LDSLDHPKADNSKGIILTDSSVEVN+SL S
Subjt:  QFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS

A0A6J1HJ63 uncharacterized protein LOC111464050 isoform X10.0e+0079.37Show/hide
Query:  SFGSSLTNMAFRFRRLKEISRHLSRM-SGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGP
        SF SS TNMAFRFRRL+EIS+ L +  SGY HQ+H  +AV+SL  SVAPS+VSEG  +RI+++GRHLL FSTT ELQCESSP ND+LSFIKST+D+SEGP
Subjt:  SFGSSLTNMAFRFRRLKEISRHLSRM-SGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGP

Query:  NHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSY
        NH W N  DGNKG+SEKDGI+LILADQFLEM+SSDSV+L+ENVKFLQHRFPQLHV G QCS++ S AEKS+MI FIM+EYVSFPILLS KI EM RG  Y
Subjt:  NHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSY

Query:  IISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNG
        IISKDF+NPLL+ ER+ DL +LRKAIEEL E ENEKSGLPN GRTTYLK AEIIKEPYSC FMQNF+L FPGCISADEKGGRLFLSDSNHNRI+IFNGNG
Subjt:  IISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNG

Query:  KILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLM
        KILD+IGSYPGFEDGEF+LVKLARPAASFYHATQ+CLYFVDSENHAIRKADLGKRVVETLYP NYSS KSTQLWSWI D+ G+GS  DRE+ED NPQSLM
Subjt:  KILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLM

Query:  FPWHMIRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDS
        FPWHMIRY+DDRLLILNRSL TLWTMDLASGKIIEVV+G SKIMENYGQL MD +S+LKQIP+G LQ   DA  +TG LPY+DLLSSLTPFQNC+IICDS
Subjt:  FPWHMIRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDS

Query:  VGQVVLKYHRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILV
        VGQV++KY+R+S ESSS QFSNFGVLGLPYWFAPPPEKVI+TADSFQGAGIDH+ FF+LLPG+VGI INVDLP DIELVES+QE SIWRQ RGTATEI +
Subjt:  VGQVVLKYHRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILV

Query:  IEDVAGPSEKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKY---AARIADFLYP
        +E V+G SEKVGSAQQWYDELDSLAFS  ESE+VED++R  NHIGD+K  IECAVNTSPGTSEVIVYAA+YLR RR QD EGNG K    AARIAD LYP
Subjt:  IEDVAGPSEKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKY---AARIADFLYP

Query:  GSTGKLRKECCIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS
        GS GK  KE CIQFL+  K DLRE+IFVKPLHVRI+LD++ HPKADNSKGIILTDSSVE+N+SL S
Subjt:  GSTGKLRKECCIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS

A0A6J1HW28 uncharacterized protein LOC111466804 isoform X10.0e+0079.53Show/hide
Query:  SFGSSLTNMAFRFRRLKEISRHLSRM-SGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGP
        SF SS  NMAFRFRRL+EIS+ L +  SGY HQHH  +AV+SL  SVA S+VSEG ++RI+D+G HL  FSTTTELQC+SSPANDILSFIKST+DESEGP
Subjt:  SFGSSLTNMAFRFRRLKEISRHLSRM-SGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTELQCESSPANDILSFIKSTVDESEGP

Query:  NHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSY
        NH W N  DGNKG+SEKD I+LILADQFLEM+SSDSV+L+ENVKFLQHRFPQLHV G QCS++LS  EKS+MI FIM+EYVSFPILLS KIFEM RGL Y
Subjt:  NHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPILLSKKIFEMARGLSY

Query:  IISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNG
        IISKD++NPLL+ ER+ DL +LRKAIEEL E ENEKSGLPN GRTTYLK AEIIKEPYSCSFMQNF+L FPGCISADEKGGRLFLSDSNHNRIIIFNGNG
Subjt:  IISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIIIFNGNG

Query:  KILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLM
        KILD+IGSYPGFEDGEF+LVKLARPAASFYHATQ+CLYFVDSENHAIRKADLGKRVVETLYP NYSS KSTQLWSWI D+ G+GS  DRE+ED NPQSLM
Subjt:  KILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLM

Query:  FPWHMIRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDS
        FPWHMI+Y+DDRLLILNRSL TLWTMDLASGKIIEVV+G SKIMENY QL MD+VS+LKQIP+G LQ   DA  VTGGLPY+DLLSSLT FQNC+IICDS
Subjt:  FPWHMIRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDS

Query:  VGQVVLKYHRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILV
        VGQV++KY+R+S ESSS +FSNFGVLGLPYWFAPPPEKVI+TADSFQGAGIDH+ FF+LLPG+VGI INVDLP DIELVES+QE SIWRQ RGTATEI +
Subjt:  VGQVVLKYHRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILV

Query:  IEDVAGPSEKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKY---AARIADFLYP
        +E V+  SEKVGSAQQWYDELDSLAFS  ESE+VED++R  NHIGD+K  IECAVNTSPGTSEVIVYAA+YLR RR+QD EGNGDK    AARIAD LYP
Subjt:  IEDVAGPSEKVGSAQQWYDELDSLAFSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKY---AARIADFLYP

Query:  GSTGKLRKECCIQFLLKR-KMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS
        GS GK  KE CIQFLL   K DLRE++FVKPLHVRI+LD++ HPKADNSKGIILTDSSVE+N+SL S
Subjt:  GSTGKLRKECCIQFLLKR-KMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSLGS

SwissProt top hitse value%identityAlignment
A4IF69 NHL repeat-containing protein 22.3e-0838.3Show/hide
Query:  LQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKI-LDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL
        L FPG I+ D    RL ++D+ H+RI++   NG+I   I G  PG +DG F       P         N +Y  D+ENH IRK DL   +V T+
Subjt:  LQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKI-LDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL

Q5ZI67 NHL repeat-containing protein 21.9e-1029.31Show/hide
Query:  LQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS-YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYS
        L FPG ++ D+ G RL ++D+ H+RI++   NG+IL  IG    G +DG F       P         N +Y  D+ENH IRK DL   +V T+      
Subjt:  LQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS-YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYS

Query:  SKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMI-------RYVDDRLLILNRSLQTLWTMDLASGKI
                   +DK G G+  +        Q +  PW ++          DD L I    +  +W + L  GK+
Subjt:  SKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHMI-------RYVDDRLLILNRSLQTLWTMDLASGKI

Q8BZW8 NHL repeat-containing protein 22.1e-0938.3Show/hide
Query:  LQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS-YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL
        L FPG ++ D   GRL ++D+ H+RI++   NG+I   IG   PG +DG F       P         N +Y  D+ENH IRK DL    V T+
Subjt:  LQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS-YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL

Q8NBF2 NHL repeat-containing protein 21.2e-0737.23Show/hide
Query:  LQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKI-LDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL
        L FPG ++ D+   RL ++D+ H+RI++   NG+I   I G  PG +DG F       P         N +Y  D+ENH IRK DL    V T+
Subjt:  LQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKI-LDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL

Q8VZ10 Protein SUPPRESSOR OF QUENCHING 1, chloroplastic5.8e-1545.26Show/hide
Query:  LQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS--YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL
        L+FPG ++ D    RLF+SDSNHNRII+ +  G  +  IGS    GF+DG F+     RP    Y+A +N LY  D+ENHA+R+ D     V+TL
Subjt:  LQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS--YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL

Arabidopsis top hitse value%identityAlignment
AT1G56500.1 haloacid dehalogenase-like hydrolase family protein4.1e-1645.26Show/hide
Query:  LQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS--YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL
        L+FPG ++ D    RLF+SDSNHNRII+ +  G  +  IGS    GF+DG F+     RP    Y+A +N LY  D+ENHA+R+ D     V+TL
Subjt:  LQFPGCISADEKGGRLFLSDSNHNRIIIFNGNGKILDIIGS--YPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL

AT3G07060.1 NHL domain-containing protein4.6e-17747.13Show/hide
Query:  SSPANDILSFIKSTVDESEGPNHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAA-EKSDMICFIMK
        SSP  D+LSFIK+++D+ EGP+H W N   GNK L +  G +++LA   L+  +SD     E +K LQ R P +   G   S     A +++ +   I+K
Subjt:  SSPANDILSFIKSTVDESEGPNHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAA-EKSDMICFIMK

Query:  EYVSFPILLSKKIFEMARG-LSYIISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISAD
        EY++FP+LLS+K F    G + YI+ KDF NPL+  E+++D+  + KA++ L  Q+ EKS        T+ KQAE IKE +  SF Q+ LL FPGCISAD
Subjt:  EYVSFPILLSKKIFEMARG-LSYIISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISAD

Query:  EKGGRLFLSDSNHNRIIIFNGNGKILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWI
        E G RLFLSD+NH+RIIIF  +GKI+D IG +PGFEDG+F+  K+ RP  + Y   ++CLY VDSENHAIR+A++  RV+ET+YP+    KK+  LWSWI
Subjt:  EKGGRLFLSDSNHNRIIIFNGNGKILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWI

Query:  MDKFGVGS------NADRELEDLNPQSLMFPWHMIRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSD
        M+K G+G       +AD + E+ + +SL+FPWH+++  D+ LL++N+S   LW ++ ASG+I EVV G SKI+E  GQ I +K+S+L+ +P+ WLQQ + 
Subjt:  MDKFGVGS------NADRELEDLNPQSLMFPWHMIRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSD

Query:  ANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKYHRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVD
        A       P   LLSS T   + I++ D   Q VLK +R S   SS QFSN G+LGLPYW   P E+V   A+  Q A + H Q  +LLPG++ I++N++
Subjt:  ANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKYHRQSDESSSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVD

Query:  LPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPSEKVGSAQQWYDELDSLAFSL--PES--EMVEDDMRTS--NHIGDNKVHIECAVNTSPGTSEVI
        +P   ELVE +QE  IWRQ RG  +E         PSEK+G +QQWYDELDSLA  +  PE+  E  E+D+  S  +   D ++HI+C V TSPG+SE+I
Subjt:  LPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPSEKVGSAQQWYDELDSLAFSL--PES--EMVEDDMRTS--NHIGDNKVHIECAVNTSPGTSEVI

Query:  VYAALYLRLRRNQDSE-GNGDKYAARIADFLYP-GSTGKLRKECCIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSL
        VYAALYLRL RN+++E    ++ A +IA  L P  +   ++++  +  L K K +LR+++F+KP+HVRIRLDS DHPKADNS+ +ILTDSSVEV+VSL
Subjt:  VYAALYLRLRRNQDSE-GNGDKYAARIADFLYP-GSTGKLRKECCIQFLLKRKMDLRELIFVKPLHVRIRLDSLDHPKADNSKGIILTDSSVEVNVSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCTTCCTCAAAGTCGCAGAGACGACGTCTGATACATTCCTCGAACAATCCGCCGCAGAAACACTTGAATCCAGAGAGGTCAGACTCGGAATCCAACTCACTGCA
AGCTTCTCCGATAACCAGCGGCCGGCGGCGGCGGAGAGACGATCGTCCGTTTGAAGCTGCGGCTCACGAATTTAGGTGTTTGATGAGATGGTGGTGGGGGATAAGAAACA
AAAAAAATGTTTGGGAGATTTCGGTGCATGTGAGTGTGCTGAAATCTGAGACGTCTGATCGGAGTTCAATCAAAGAAGGGGATGCAGAAAATTATGATATGGAAGGGGGG
AAGAAATTAGATGGAGGGCAGAAGCAGAATGGAGGGTCCGGATTGGCAGGTAGGGCAGAGCCTCCGTCGACGCCGCGTTCAGGTTATGAAGTTCCCGTCGGCTCGCCGTC
ATTTGGTTCTTCTCTAACGAATATGGCTTTCAGGTTCCGGCGACTTAAAGAAATTTCGAGGCATTTGTCCCGAATGTCCGGATATTGTCATCAGCATCATCAGAGTTATG
CTGTCAACTCCTTGGCATTGTCCGTCGCTCCATCTCATGTATCTGAAGGAACCAATAAAAGGATTGTAGACAATGGACGCCACCTTCTGTGGTTTTCCACAACGACGGAG
CTGCAATGCGAGTCTTCTCCGGCAAATGACATTTTATCCTTCATTAAGTCAACCGTAGATGAGTCTGAAGGCCCTAACCACTGTTGGTTCAATACATCTGATGGAAATAA
AGGACTTTCAGAGAAGGATGGAATCTTCTTAATTCTTGCCGATCAATTTCTTGAAATGTCGAGCTCTGATTCTGTTCTTTTGATTGAAAATGTAAAGTTCCTTCAGCACA
GGTTTCCTCAGCTTCATGTTTTTGGGTTTCAGTGTTCCAGTTCTCTATCTGCTGCTGAAAAAAGTGACATGATCTGTTTTATAATGAAGGAGTATGTTTCCTTTCCCATT
TTGTTATCCAAGAAGATTTTTGAGATGGCGAGGGGGCTCTCTTATATTATCTCCAAGGACTTCAATAATCCTTTGCTCCTCTATGAGAGGAACATGGACCTTAGGATTCT
TAGGAAAGCTATTGAGGAGTTGCATGAACAAGAAAATGAGAAATCTGGTCTGCCCAATAATGGGAGAACCACTTATCTGAAGCAAGCAGAGATCATCAAAGAACCATATT
CATGTTCTTTCATGCAGAATTTTCTTCTCCAGTTTCCAGGCTGTATATCCGCAGATGAAAAGGGTGGCCGACTCTTCCTTTCGGACAGCAATCATAATCGGATTATTATA
TTCAATGGCAATGGGAAGATTCTGGATATTATTGGTTCTTATCCAGGTTTTGAGGATGGAGAATTTGATTTGGTCAAATTAGCTCGCCCAGCAGCTTCCTTTTATCATGC
TACTCAGAATTGCTTGTATTTTGTGGACTCTGAGAACCATGCAATTAGGAAAGCTGATTTGGGAAAGCGGGTTGTGGAAACTCTCTATCCAGAAAACTACTCAAGTAAGA
AGAGTACTCAGTTATGGAGCTGGATTATGGACAAATTTGGTGTGGGAAGCAATGCTGACAGGGAATTAGAAGACTTAAATCCGCAGTCTTTGATGTTTCCTTGGCACATG
ATTAGATATGTGGATGATAGATTATTAATTTTAAACCGCAGTCTTCAAACGCTATGGACCATGGATTTGGCTTCAGGAAAAATTATTGAAGTTGTTAGGGGGCTTTCAAA
GATTATGGAGAACTATGGACAGTTGATCATGGACAAAGTGTCTATTCTGAAGCAGATACCCAATGGTTGGTTGCAGCAGCTAAGTGATGCAAATGTTGTCACAGGGGGGC
TACCATACATGGATCTTTTATCTTCATTAACACCCTTTCAGAATTGCATAATCATTTGCGACTCAGTTGGACAGGTGGTCTTGAAATATCATAGACAATCTGATGAGAGC
TCAAGCTTCCAATTTTCAAACTTTGGGGTTCTTGGATTACCGTATTGGTTTGCTCCACCTCCGGAGAAGGTTATAACCACTGCTGATAGTTTTCAAGGAGCAGGGATTGA
TCATCTTCAGTTTTTCAAACTGCTGCCTGGTAGAGTTGGCATACAGATCAATGTTGATCTTCCTATAGATATTGAACTTGTGGAATCATTACAAGAAGGCAGCATATGGC
GACAAGCGAGAGGAACTGCAACTGAAATCTTAGTCATCGAGGATGTAGCTGGACCCTCAGAAAAGGTTGGTTCTGCTCAACAGTGGTATGATGAATTGGATAGTCTAGCC
TTTTCACTGCCAGAATCAGAAATGGTGGAAGATGATATGAGAACTTCTAACCATATAGGAGACAATAAGGTGCACATTGAGTGTGCTGTCAATACAAGTCCTGGAACTAG
CGAGGTTATAGTTTATGCAGCCCTATATTTAAGGCTTAGAAGAAACCAAGATTCCGAAGGCAATGGGGACAAATATGCAGCAAGGATAGCAGATTTTTTGTACCCAGGAA
GTACAGGGAAGCTAAGAAAAGAGTGTTGCATTCAGTTCCTTTTAAAACGTAAAATGGATTTGAGAGAGCTCATTTTCGTGAAACCTCTGCATGTGAGGATAAGGTTGGAT
TCTCTTGATCACCCTAAAGCTGATAATTCCAAAGGTATTATCCTCACAGACTCCTCAGTTGAAGTCAATGTATCACTTGGTTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGGCTTCCTCAAAGTCGCAGAGACGACGTCTGATACATTCCTCGAACAATCCGCCGCAGAAACACTTGAATCCAGAGAGGTCAGACTCGGAATCCAACTCACTGCA
AGCTTCTCCGATAACCAGCGGCCGGCGGCGGCGGAGAGACGATCGTCCGTTTGAAGCTGCGGCTCACGAATTTAGGTGTTTGATGAGATGGTGGTGGGGGATAAGAAACA
AAAAAAATGTTTGGGAGATTTCGGTGCATGTGAGTGTGCTGAAATCTGAGACGTCTGATCGGAGTTCAATCAAAGAAGGGGATGCAGAAAATTATGATATGGAAGGGGGG
AAGAAATTAGATGGAGGGCAGAAGCAGAATGGAGGGTCCGGATTGGCAGGTAGGGCAGAGCCTCCGTCGACGCCGCGTTCAGGTTATGAAGTTCCCGTCGGCTCGCCGTC
ATTTGGTTCTTCTCTAACGAATATGGCTTTCAGGTTCCGGCGACTTAAAGAAATTTCGAGGCATTTGTCCCGAATGTCCGGATATTGTCATCAGCATCATCAGAGTTATG
CTGTCAACTCCTTGGCATTGTCCGTCGCTCCATCTCATGTATCTGAAGGAACCAATAAAAGGATTGTAGACAATGGACGCCACCTTCTGTGGTTTTCCACAACGACGGAG
CTGCAATGCGAGTCTTCTCCGGCAAATGACATTTTATCCTTCATTAAGTCAACCGTAGATGAGTCTGAAGGCCCTAACCACTGTTGGTTCAATACATCTGATGGAAATAA
AGGACTTTCAGAGAAGGATGGAATCTTCTTAATTCTTGCCGATCAATTTCTTGAAATGTCGAGCTCTGATTCTGTTCTTTTGATTGAAAATGTAAAGTTCCTTCAGCACA
GGTTTCCTCAGCTTCATGTTTTTGGGTTTCAGTGTTCCAGTTCTCTATCTGCTGCTGAAAAAAGTGACATGATCTGTTTTATAATGAAGGAGTATGTTTCCTTTCCCATT
TTGTTATCCAAGAAGATTTTTGAGATGGCGAGGGGGCTCTCTTATATTATCTCCAAGGACTTCAATAATCCTTTGCTCCTCTATGAGAGGAACATGGACCTTAGGATTCT
TAGGAAAGCTATTGAGGAGTTGCATGAACAAGAAAATGAGAAATCTGGTCTGCCCAATAATGGGAGAACCACTTATCTGAAGCAAGCAGAGATCATCAAAGAACCATATT
CATGTTCTTTCATGCAGAATTTTCTTCTCCAGTTTCCAGGCTGTATATCCGCAGATGAAAAGGGTGGCCGACTCTTCCTTTCGGACAGCAATCATAATCGGATTATTATA
TTCAATGGCAATGGGAAGATTCTGGATATTATTGGTTCTTATCCAGGTTTTGAGGATGGAGAATTTGATTTGGTCAAATTAGCTCGCCCAGCAGCTTCCTTTTATCATGC
TACTCAGAATTGCTTGTATTTTGTGGACTCTGAGAACCATGCAATTAGGAAAGCTGATTTGGGAAAGCGGGTTGTGGAAACTCTCTATCCAGAAAACTACTCAAGTAAGA
AGAGTACTCAGTTATGGAGCTGGATTATGGACAAATTTGGTGTGGGAAGCAATGCTGACAGGGAATTAGAAGACTTAAATCCGCAGTCTTTGATGTTTCCTTGGCACATG
ATTAGATATGTGGATGATAGATTATTAATTTTAAACCGCAGTCTTCAAACGCTATGGACCATGGATTTGGCTTCAGGAAAAATTATTGAAGTTGTTAGGGGGCTTTCAAA
GATTATGGAGAACTATGGACAGTTGATCATGGACAAAGTGTCTATTCTGAAGCAGATACCCAATGGTTGGTTGCAGCAGCTAAGTGATGCAAATGTTGTCACAGGGGGGC
TACCATACATGGATCTTTTATCTTCATTAACACCCTTTCAGAATTGCATAATCATTTGCGACTCAGTTGGACAGGTGGTCTTGAAATATCATAGACAATCTGATGAGAGC
TCAAGCTTCCAATTTTCAAACTTTGGGGTTCTTGGATTACCGTATTGGTTTGCTCCACCTCCGGAGAAGGTTATAACCACTGCTGATAGTTTTCAAGGAGCAGGGATTGA
TCATCTTCAGTTTTTCAAACTGCTGCCTGGTAGAGTTGGCATACAGATCAATGTTGATCTTCCTATAGATATTGAACTTGTGGAATCATTACAAGAAGGCAGCATATGGC
GACAAGCGAGAGGAACTGCAACTGAAATCTTAGTCATCGAGGATGTAGCTGGACCCTCAGAAAAGGTTGGTTCTGCTCAACAGTGGTATGATGAATTGGATAGTCTAGCC
TTTTCACTGCCAGAATCAGAAATGGTGGAAGATGATATGAGAACTTCTAACCATATAGGAGACAATAAGGTGCACATTGAGTGTGCTGTCAATACAAGTCCTGGAACTAG
CGAGGTTATAGTTTATGCAGCCCTATATTTAAGGCTTAGAAGAAACCAAGATTCCGAAGGCAATGGGGACAAATATGCAGCAAGGATAGCAGATTTTTTGTACCCAGGAA
GTACAGGGAAGCTAAGAAAAGAGTGTTGCATTCAGTTCCTTTTAAAACGTAAAATGGATTTGAGAGAGCTCATTTTCGTGAAACCTCTGCATGTGAGGATAAGGTTGGAT
TCTCTTGATCACCCTAAAGCTGATAATTCCAAAGGTATTATCCTCACAGACTCCTCAGTTGAAGTCAATGTATCACTTGGTTCCTAA
Protein sequenceShow/hide protein sequence
MAASSKSQRRRLIHSSNNPPQKHLNPERSDSESNSLQASPITSGRRRRRDDRPFEAAAHEFRCLMRWWWGIRNKKNVWEISVHVSVLKSETSDRSSIKEGDAENYDMEGG
KKLDGGQKQNGGSGLAGRAEPPSTPRSGYEVPVGSPSFGSSLTNMAFRFRRLKEISRHLSRMSGYCHQHHQSYAVNSLALSVAPSHVSEGTNKRIVDNGRHLLWFSTTTE
LQCESSPANDILSFIKSTVDESEGPNHCWFNTSDGNKGLSEKDGIFLILADQFLEMSSSDSVLLIENVKFLQHRFPQLHVFGFQCSSSLSAAEKSDMICFIMKEYVSFPI
LLSKKIFEMARGLSYIISKDFNNPLLLYERNMDLRILRKAIEELHEQENEKSGLPNNGRTTYLKQAEIIKEPYSCSFMQNFLLQFPGCISADEKGGRLFLSDSNHNRIII
FNGNGKILDIIGSYPGFEDGEFDLVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENYSSKKSTQLWSWIMDKFGVGSNADRELEDLNPQSLMFPWHM
IRYVDDRLLILNRSLQTLWTMDLASGKIIEVVRGLSKIMENYGQLIMDKVSILKQIPNGWLQQLSDANVVTGGLPYMDLLSSLTPFQNCIIICDSVGQVVLKYHRQSDES
SSFQFSNFGVLGLPYWFAPPPEKVITTADSFQGAGIDHLQFFKLLPGRVGIQINVDLPIDIELVESLQEGSIWRQARGTATEILVIEDVAGPSEKVGSAQQWYDELDSLA
FSLPESEMVEDDMRTSNHIGDNKVHIECAVNTSPGTSEVIVYAALYLRLRRNQDSEGNGDKYAARIADFLYPGSTGKLRKECCIQFLLKRKMDLRELIFVKPLHVRIRLD
SLDHPKADNSKGIILTDSSVEVNVSLGS