; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg037897 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg037897
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionNHL domain-containing protein
Genome locationscaffold12:41707472..41720733
RNA-Seq ExpressionSpg037897
SyntenySpg037897
Gene Ontology termsNA
InterPro domainsIPR011042 - Six-bladed beta-propeller, TolB-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6571580.1 Protein SUPPRESSOR OF QUENCHING 1, chloroplastic, partial [Cucurbita argyrosperma subsp. sororia]0.0e+0079.4Show/hide
Query:  MAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTIVEIFVFRFS
        MAFRFRRL++IS+SLPQFYSGYYHQ+HH +A+SSL  SV+PS+V EG+ +RIL++GRHLL                                     RFS
Subjt:  MAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTIVEIFVFRFS

Query:  TTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAAEKSD
        TT ELQCESSPAND+LSFIKST DESEGPNH WLN  DG KGISEKDGIYLILA+QFLE TSSDSVVLVENVKFLQHRFPQLHVIG QCS+TLS AEKS+
Subjt:  TTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAAEKSD

Query:  MIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFP
        MIQFIMREYVSFPILLS KI EM  GLCYIISKDFSNPLL+ E+D DL++LRKAIEEL E ENEKS LP+ GRTTYLK AEIIKEPYSCSFMQNF+LHFP
Subjt:  MIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFP

Query:  GCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKST
         CISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQ+CLYFVDSENHAIRKADLGKRVVETLYP N+SS KST
Subjt:  GCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKST

Query:  QLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSV
        QLW WI D+ GLGSIPDREVEDFNPQSL+FPWHMIRYMDD LLILNRS  TLWTMDLASG+IIEVV+G SKIMENYGQL MD +SVLKQIPDGMLQ    
Subjt:  QLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSV

Query:  ANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVD
        A  +TG LPY+DLLSSLTPFQ+C IICDSVGQV++KY+ +S ESSS QFSNFGVLGLPYWFAPPPE+VISTADSFQG GIDH+ FFRLLPGKVGI INVD
Subjt:  ANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVD

Query:  LPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIVYAPL
        LPTDIELVES+QED IWRQ RGTATEI IVE V+G SEKVGSAQQWYDELDSLAFSPQESE++ED  IRA+NHIGD+K  IECAVNTSPGTSEVIVYA +
Subjt:  LPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIVYAPL

Query:  YLRLRRTKDSEGDRDKH---AARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSLDS
        YLR RR +D EG+ DK    AARIAD LYPGS GKKIKESCIQFL+N K DLRE IFVKPLHVRIKLD+L HPKADNSKGIILTDSSVE+N+SL S
Subjt:  YLRLRRTKDSEGDRDKH---AARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSLDS

XP_022159641.1 uncharacterized protein LOC111025991 isoform X1 [Momordica charantia]0.0e+0078.81Show/hide
Query:  MAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTIVEIFVFRFS
        MAFR RRLK+ISR L +  SGY HQ+H S A+SSLAL+VAP H  EGI KRI+D+GRH L                                     RFS
Subjt:  MAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTIVEIFVFRFS

Query:  TTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAAEKSD
        TTTELQCESS AN+ILSFIKST DESEGPNH WLNT DGNKG+SEKDGI+LILA+QFL   SS+SV LVENVKFLQHRFPQLHVIGFQCSS+LSAAEK+D
Subjt:  TTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAAEKSD

Query:  MIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFP
        MIQFIM+EYVSFPILLSKK FEMA GLCYIISK+F NPLLL E++MD   +RKAIEEL E+E+EK S P+NGRTTYLKQAEI  EPYSCSFMQNFLLHFP
Subjt:  MIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFP

Query:  GCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKST
        GCISADEKGGRLFLSDSNHNRI+IFN NGKILD+IGSYPGF+DGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADL KRVVETLYPEN+SSKKST
Subjt:  GCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKST

Query:  QLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSV
        QLW WIMDKFGLGS  +RE+EDFNPQSL+FPWH+IRY+DD LLILNRS +TLW MDL SG+IIEVVRG S IME YGQLI D+VSV+KQIP G LQ+LS 
Subjt:  QLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSV

Query:  ANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVD
        ANIVTGGLPY+DLLSS TPFQ+C IICDSVGQV+LKYH  S ESSSFQFSNFGVLGLPYWFA PPE+VI+TADSF+G GIDHLQFFRLLPGKVGIQINVD
Subjt:  ANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVD

Query:  LPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIVYAPL
        LP DIELVESLQED IWRQARGTATE LIVEDVAGPSEKVGSAQQWYDELDSLAFSP +SEM+EDN  R  NHIGDNKV IECAVNTSPGTSEVIVYA L
Subjt:  LPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIVYAPL

Query:  YLRLRRTKDSEGDRDKHAARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSLDS
        YLRLRR +D EG+ +K A RIA FLYP + GK  KE CIQFLL RK  LRE IFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVN+SL S
Subjt:  YLRLRRTKDSEGDRDKHAARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSLDS

XP_022963868.1 uncharacterized protein LOC111464050 isoform X1 [Cucurbita moschata]0.0e+0079.13Show/hide
Query:  MSFRCSATNMAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTI
        MSFR SATNMAFRFRRL++IS+SLPQFYSGYYHQ HH +A+SSL  SVAPS+V EG+ +RIL++GRHLL                               
Subjt:  MSFRCSATNMAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTI

Query:  VEIFVFRFSTTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSS
              RFSTT ELQCESSP ND+LSFIKST D+SEGPNH WLN  DGNKGISEKDGIYLILA+QFLE TSSDSVVLVENVKFLQHRFPQLHVIG QCS+
Subjt:  VEIFVFRFSTTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSS

Query:  TLSAAEKSDMIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSF
        T S AEKS+MIQFIMREYVSFPILLS KI EM  G CYIISKDFSNPLL+ E+D DL++LRKAIEEL E ENEKS LP+ GRTTYLK AEIIKEPYSC F
Subjt:  TLSAAEKSDMIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSF

Query:  MQNFLLHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYP
        MQNF+LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQ+CLYFVDSENHAIRKADLGKRVVETLYP
Subjt:  MQNFLLHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYP

Query:  ENFSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIP
         N+SS KSTQLW WI D+ GLGSIPDREVEDFNPQSL+FPWHMIRYMDD LLILNRS  TLWTMDLASG+IIEVV+G SKIMENYGQL MD +SVLKQIP
Subjt:  ENFSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIP

Query:  DGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPG
        DGMLQ    A  +TG LPY+DLLSSLTPFQ+C IICDSVGQV++KY+ +S ESSS QFSNFGVLGLPYWFAPPPE+VISTADSFQG GIDH+ FFRLLPG
Subjt:  DGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPG

Query:  KVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGT
        KVGI INVDLPTDIELVES+QED IWRQ RGTATEI IVE V+G SEKVGSAQQWYDELDSLAFSPQESE++ED NIRA+NHIGD+K  IECAVNTSPGT
Subjt:  KVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGT

Query:  SEVIVYAPLYLRLRRTKDSEGDRDKH---AARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVN
        SEVIVYA +YLR RR +D EG+  K    AARIAD LYPGS GKKIKESCIQFL+N K DLRE IFVKPLHVRIKLD++ HPKADNSKGIILTDSSVE+N
Subjt:  SEVIVYAPLYLRLRRTKDSEGDRDKH---AARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVN

Query:  VSLDS
        +SL S
Subjt:  VSLDS

XP_022967189.1 uncharacterized protein LOC111466804 isoform X1 [Cucurbita maxima]0.0e+0078.66Show/hide
Query:  MSFRCSATNMAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTI
        MSFR SA NMAFRFRRL++IS+SLPQFYSGYYHQ+HH +A+SSL  SVA S+V EG+ +RILD+G HL                                
Subjt:  MSFRCSATNMAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTI

Query:  VEIFVFRFSTTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSS
              RFSTTTELQC+SSPANDILSFIKST DESEGPNH WLN  DGNKGISEKD IYLILA+QFLE TSSDSVVLVENVKFLQHRFPQLHVIG QCS+
Subjt:  VEIFVFRFSTTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSS

Query:  TLSAAEKSDMIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSF
        TLS  EKS+MIQFIMREYVSFPILLS KIFEM  GLCYIISKD+SNPLL+ E+D DL++LRKAIEEL E ENEKS LP+ GRTTYLK AEIIKEPYSCSF
Subjt:  TLSAAEKSDMIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSF

Query:  MQNFLLHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYP
        MQNF+LHFPGCISADEKGGRLFLSDSNHNRI+IFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQ+CLYFVDSENHAIRKADLGKRVVETLYP
Subjt:  MQNFLLHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYP

Query:  ENFSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIP
         N+SS KSTQLW WI D+ GLGS+PDREVEDFNPQSL+FPWHMI+YMDD LLILNRS  TLWTMDLASG+IIEVV+G SKIMENY QL MDRVSVLKQIP
Subjt:  ENFSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIP

Query:  DGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPG
        DGMLQ    A  VTGGLPY+DLLSSLT FQ+C IICDSVGQV++KY+ +S ESSS +FSNFGVLGLPYWFAPPPE+VISTADSFQG GIDH+ FFRLLPG
Subjt:  DGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPG

Query:  KVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGT
        KVGI INVDLPTDIELVES+QED IWRQ RGTATEI IVE V+  SEKVGSAQQWYDELDSLAFSPQESE++ED N+RA+NHIGD+K  IECAVNTSPGT
Subjt:  KVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGT

Query:  SEVIVYAPLYLRLRRTKDSEGDRDKH---AARIADFLYPGSTGKKIKESCIQFLL-NRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEV
        SEVIVYA +YLR RR +D EG+ DK    AARIAD LYPGS GK+IKESCIQFLL N K DLRE +FVKPLHVRIKLD++ HPKADNSKGIILTDSSVE+
Subjt:  SEVIVYAPLYLRLRRTKDSEGDRDKH---AARIADFLYPGSTGKKIKESCIQFLL-NRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEV

Query:  NVSLDS
        N+SL S
Subjt:  NVSLDS

XP_038888990.1 uncharacterized protein LOC120078755 isoform X1 [Benincasa hispida]0.0e+0081.49Show/hide
Query:  MAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTIVEIFVFRFS
        MAFRFRRLK+I RSLPQ YSGYYHQ+HH YA+SSLALSV+PSHV EGI +R+LD+GRH L                                     RFS
Subjt:  MAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTIVEIFVFRFS

Query:  TTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAAEKSD
        TTT LQ ESSPANDI SFIKST DESEGPNH WLNTS+GNKGI EKDG+YLILA+QFLE TS+DSVVLVENVKFLQ RFP LHVIGFQCSSTLSAAEKSD
Subjt:  TTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAAEKSD

Query:  MIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFP
        MIQFIMREY+SFPILLS KIFE+  GLCYIISKD SNPLL+  ++MDLSILRKAIEELHE ENEKS LPS GRTTY+KQAEI+KEP SCSFMQNFLLHFP
Subjt:  MIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFP

Query:  GCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKST
        GCISADE+G RLFLSDSNHNRIVIFN +GKILDMIGSYPGFEDGEFELVKLARPAASFYH+TQNCLYFVDSENHAIRKADLGKRVVETLYPEN+S+K ST
Subjt:  GCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKST

Query:  QLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSV
        QLW WIMDKFG+GSIPDREVEDFNPQSL+FPWHMIRYMDD LLIL+RS  TLWTMDLASG+IIE+VRG S+IMENYGQLIMDR+SVLKQIPDGMLQ  + 
Subjt:  QLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSV

Query:  ANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVD
        ANI TGGLPY+DLLSSLTPFQ+C IICDSVGQVVLKY+ KS ESSSFQFSNFGVLGLPYW APPPE+VI+ AD+FQG  IDHLQFFRLLPGKVGIQINVD
Subjt:  ANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVD

Query:  LPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIVYAPL
        LPTDIELVESL ED IWRQARGTATEI IVE+VA PSEKVGSAQQWYDELDSLAFSPQESEM+ED N+RA N+IGDNKVHIECAVNTSPGTSEVIVYA L
Subjt:  LPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIVYAPL

Query:  YLRLRRTKDSEGDRDK-HAARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSLDS
        YLRLRR +DSEG+ DK HA RIADFLYPG+ GK IKE CI+FL+N K DLRE IFVKPLHVRIKLDSL HPKA+NSKGIILTDSSVEVNVSL S
Subjt:  YLRLRRTKDSEGDRDK-HAARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSLDS

TrEMBL top hitse value%identityAlignment
A0A1S3CQE6 uncharacterized protein LOC103503064 isoform X10.0e+0078.84Show/hide
Query:  MAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTIVEIFVFRFS
        MAFRFRRLK+ISRSLPQ YSGYYHQ+HH Y +SSL LSVAP HV EGI +R+ DNGRH                                       RFS
Subjt:  MAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTIVEIFVFRFS

Query:  TTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAAEKSD
        TTTELQCESSP NDI SFI ST DESEGPNH WLNTS+GNKGI E+DG+YLILANQFLE TSSDS+ LVENVKFLQ RFP LHVIGFQC STLS AEKS 
Subjt:  TTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAAEKSD

Query:  MIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFP
        MIQFIMREY+SFPILLS KIFE+A   C IISKD SNPLL+ E+DMDLSIL KAIEELHE ENEKS L + G+TTYLKQAE+IKEP SCSFM NFLLH+P
Subjt:  MIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFP

Query:  GCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKST
        GCISADE+GGRLFLSDSNHNRIVI N  GKILDMIGSYPGFEDGEFELVKLARPAASFYH+TQNCLYFVDSENHAIRKADLGKRVVETLYPEN+S+KKST
Subjt:  GCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKST

Query:  QLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSV
        QLW WIMDKFGLGSIPDREVEDFNPQSL+FPWH+IRYMDD LLILNRS  TLWTMDL SG+IIEVVRG S+IMENYG LIMDR+SVLKQIPDG LQQ S 
Subjt:  QLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSV

Query:  ANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVD
        ANI TGG PYMDLLSSLTPF++C IICDSVGQVVLKY+ +S   SS QFSNFGVLGLPYWFAPPPE+VI+TA+ F+G GIDHLQFFRLLPG+VGIQINVD
Subjt:  ANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVD

Query:  LPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIVYAPL
        LP+DIELVESL +D IWRQARGTATEI IVE VAGPSEKVGSAQQWYDELDSLAFSPQESEM+ED N+RA N+IGDNKVHIECAVNTSPGTSEVIVYA L
Subjt:  LPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIVYAPL

Query:  YLRLRRTKDSEGDRDKH-AARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSLDS
        YLRLRR +D EG+  KH A RIADFLY      KIKE+CIQFL+N K DLRE IFVKPLHVRIKLDSL HPKA+NSK IILT SSVEVNVSL S
Subjt:  YLRLRRTKDSEGDRDKH-AARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSLDS

A0A6J1E0D7 uncharacterized protein LOC111025991 isoform X20.0e+0084.07Show/hide
Query:  FRFSTTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAA
        FRFSTTTELQCESS AN+ILSFIKST DESEGPNH WLNT DGNKG+SEKDGI+LILA+QFL   SS+SV LVENVKFLQHRFPQLHVIGFQCSS+LSAA
Subjt:  FRFSTTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAA

Query:  EKSDMIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFL
        EK+DMIQFIM+EYVSFPILLSKK FEMA GLCYIISK+F NPLLL E++MD   +RKAIEEL E+E+EK S P+NGRTTYLKQAEI  EPYSCSFMQNFL
Subjt:  EKSDMIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFL

Query:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSS
        LHFPGCISADEKGGRLFLSDSNHNRI+IFN NGKILD+IGSYPGF+DGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADL KRVVETLYPEN+SS
Subjt:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSS

Query:  KKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQ
        KKSTQLW WIMDKFGLGS  +RE+EDFNPQSL+FPWH+IRY+DD LLILNRS +TLW MDL SG+IIEVVRG S IME YGQLI D+VSV+KQIP G LQ
Subjt:  KKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQ

Query:  QLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQ
        +LS ANIVTGGLPY+DLLSS TPFQ+C IICDSVGQV+LKYH  S ESSSFQFSNFGVLGLPYWFA PPE+VI+TADSF+G GIDHLQFFRLLPGKVGIQ
Subjt:  QLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQ

Query:  INVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIV
        INVDLP DIELVESLQED IWRQARGTATE LIVEDVAGPSEKVGSAQQWYDELDSLAFSP +SEM+EDN  R  NHIGDNKV IECAVNTSPGTSEVIV
Subjt:  INVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIV

Query:  YAPLYLRLRRTKDSEGDRDKHAARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSLDS
        YA LYLRLRR +D EG+ +K A RIA FLYP + GK  KE CIQFLL RK  LRE IFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVN+SL S
Subjt:  YAPLYLRLRRTKDSEGDRDKHAARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSLDS

A0A6J1E4J1 uncharacterized protein LOC111025991 isoform X10.0e+0078.81Show/hide
Query:  MAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTIVEIFVFRFS
        MAFR RRLK+ISR L +  SGY HQ+H S A+SSLAL+VAP H  EGI KRI+D+GRH L                                     RFS
Subjt:  MAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTIVEIFVFRFS

Query:  TTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAAEKSD
        TTTELQCESS AN+ILSFIKST DESEGPNH WLNT DGNKG+SEKDGI+LILA+QFL   SS+SV LVENVKFLQHRFPQLHVIGFQCSS+LSAAEK+D
Subjt:  TTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAAEKSD

Query:  MIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFP
        MIQFIM+EYVSFPILLSKK FEMA GLCYIISK+F NPLLL E++MD   +RKAIEEL E+E+EK S P+NGRTTYLKQAEI  EPYSCSFMQNFLLHFP
Subjt:  MIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFP

Query:  GCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKST
        GCISADEKGGRLFLSDSNHNRI+IFN NGKILD+IGSYPGF+DGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADL KRVVETLYPEN+SSKKST
Subjt:  GCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKST

Query:  QLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSV
        QLW WIMDKFGLGS  +RE+EDFNPQSL+FPWH+IRY+DD LLILNRS +TLW MDL SG+IIEVVRG S IME YGQLI D+VSV+KQIP G LQ+LS 
Subjt:  QLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSV

Query:  ANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVD
        ANIVTGGLPY+DLLSS TPFQ+C IICDSVGQV+LKYH  S ESSSFQFSNFGVLGLPYWFA PPE+VI+TADSF+G GIDHLQFFRLLPGKVGIQINVD
Subjt:  ANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVD

Query:  LPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIVYAPL
        LP DIELVESLQED IWRQARGTATE LIVEDVAGPSEKVGSAQQWYDELDSLAFSP +SEM+EDN  R  NHIGDNKV IECAVNTSPGTSEVIVYA L
Subjt:  LPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIVYAPL

Query:  YLRLRRTKDSEGDRDKHAARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSLDS
        YLRLRR +D EG+ +K A RIA FLYP + GK  KE CIQFLL RK  LRE IFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVN+SL S
Subjt:  YLRLRRTKDSEGDRDKHAARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSLDS

A0A6J1HJ63 uncharacterized protein LOC111464050 isoform X10.0e+0079.13Show/hide
Query:  MSFRCSATNMAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTI
        MSFR SATNMAFRFRRL++IS+SLPQFYSGYYHQ HH +A+SSL  SVAPS+V EG+ +RIL++GRHLL                               
Subjt:  MSFRCSATNMAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTI

Query:  VEIFVFRFSTTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSS
              RFSTT ELQCESSP ND+LSFIKST D+SEGPNH WLN  DGNKGISEKDGIYLILA+QFLE TSSDSVVLVENVKFLQHRFPQLHVIG QCS+
Subjt:  VEIFVFRFSTTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSS

Query:  TLSAAEKSDMIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSF
        T S AEKS+MIQFIMREYVSFPILLS KI EM  G CYIISKDFSNPLL+ E+D DL++LRKAIEEL E ENEKS LP+ GRTTYLK AEIIKEPYSC F
Subjt:  TLSAAEKSDMIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSF

Query:  MQNFLLHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYP
        MQNF+LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQ+CLYFVDSENHAIRKADLGKRVVETLYP
Subjt:  MQNFLLHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYP

Query:  ENFSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIP
         N+SS KSTQLW WI D+ GLGSIPDREVEDFNPQSL+FPWHMIRYMDD LLILNRS  TLWTMDLASG+IIEVV+G SKIMENYGQL MD +SVLKQIP
Subjt:  ENFSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIP

Query:  DGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPG
        DGMLQ    A  +TG LPY+DLLSSLTPFQ+C IICDSVGQV++KY+ +S ESSS QFSNFGVLGLPYWFAPPPE+VISTADSFQG GIDH+ FFRLLPG
Subjt:  DGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPG

Query:  KVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGT
        KVGI INVDLPTDIELVES+QED IWRQ RGTATEI IVE V+G SEKVGSAQQWYDELDSLAFSPQESE++ED NIRA+NHIGD+K  IECAVNTSPGT
Subjt:  KVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGT

Query:  SEVIVYAPLYLRLRRTKDSEGDRDKH---AARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVN
        SEVIVYA +YLR RR +D EG+  K    AARIAD LYPGS GKKIKESCIQFL+N K DLRE IFVKPLHVRIKLD++ HPKADNSKGIILTDSSVE+N
Subjt:  SEVIVYAPLYLRLRRTKDSEGDRDKH---AARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVN

Query:  VSLDS
        +SL S
Subjt:  VSLDS

A0A6J1HW28 uncharacterized protein LOC111466804 isoform X10.0e+0078.66Show/hide
Query:  MSFRCSATNMAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTI
        MSFR SA NMAFRFRRL++IS+SLPQFYSGYYHQ+HH +A+SSL  SVA S+V EG+ +RILD+G HL                                
Subjt:  MSFRCSATNMAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTI

Query:  VEIFVFRFSTTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSS
              RFSTTTELQC+SSPANDILSFIKST DESEGPNH WLN  DGNKGISEKD IYLILA+QFLE TSSDSVVLVENVKFLQHRFPQLHVIG QCS+
Subjt:  VEIFVFRFSTTTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSS

Query:  TLSAAEKSDMIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSF
        TLS  EKS+MIQFIMREYVSFPILLS KIFEM  GLCYIISKD+SNPLL+ E+D DL++LRKAIEEL E ENEKS LP+ GRTTYLK AEIIKEPYSCSF
Subjt:  TLSAAEKSDMIQFIMREYVSFPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSF

Query:  MQNFLLHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYP
        MQNF+LHFPGCISADEKGGRLFLSDSNHNRI+IFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQ+CLYFVDSENHAIRKADLGKRVVETLYP
Subjt:  MQNFLLHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYP

Query:  ENFSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIP
         N+SS KSTQLW WI D+ GLGS+PDREVEDFNPQSL+FPWHMI+YMDD LLILNRS  TLWTMDLASG+IIEVV+G SKIMENY QL MDRVSVLKQIP
Subjt:  ENFSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIP

Query:  DGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPG
        DGMLQ    A  VTGGLPY+DLLSSLT FQ+C IICDSVGQV++KY+ +S ESSS +FSNFGVLGLPYWFAPPPE+VISTADSFQG GIDH+ FFRLLPG
Subjt:  DGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPG

Query:  KVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGT
        KVGI INVDLPTDIELVES+QED IWRQ RGTATEI IVE V+  SEKVGSAQQWYDELDSLAFSPQESE++ED N+RA+NHIGD+K  IECAVNTSPGT
Subjt:  KVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGT

Query:  SEVIVYAPLYLRLRRTKDSEGDRDKH---AARIADFLYPGSTGKKIKESCIQFLL-NRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEV
        SEVIVYA +YLR RR +D EG+ DK    AARIAD LYPGS GK+IKESCIQFLL N K DLRE +FVKPLHVRIKLD++ HPKADNSKGIILTDSSVE+
Subjt:  SEVIVYAPLYLRLRRTKDSEGDRDKH---AARIADFLYPGSTGKKIKESCIQFLL-NRKGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEV

Query:  NVSLDS
        N+SL S
Subjt:  NVSLDS

SwissProt top hitse value%identityAlignment
A4IF69 NHL repeat-containing protein 23.5e-0838.3Show/hide
Query:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL
        L FPG I+ D    RL ++D+ H+RI++   NG+I   IG   PG +DG F       P         N +Y  D+ENH IRK DL   +V T+
Subjt:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL

Q5ZI67 NHL repeat-containing protein 22.5e-0938.3Show/hide
Query:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL
        L FPG ++ D+ G RL ++D+ H+RI++   NG+IL  IG    G +DG F       P         N +Y  D+ENH IRK DL   +V T+
Subjt:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL

Q8BZW8 NHL repeat-containing protein 22.5e-0938.3Show/hide
Query:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL
        L FPG ++ D   GRL ++D+ H+RI++   NG+I   IG   PG +DG F       P         N +Y  D+ENH IRK DL    V T+
Subjt:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL

Q8NBF2 NHL repeat-containing protein 21.8e-0737.23Show/hide
Query:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL
        L FPG ++ D+   RL ++D+ H+RI++   NG+I   IG   PG +DG F       P         N +Y  D+ENH IRK DL    V T+
Subjt:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL

Q8VZ10 Protein SUPPRESSOR OF QUENCHING 1, chloroplastic1.8e-1545.26Show/hide
Query:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS--YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL
        L FPG ++ D    RLF+SDSNHNRI++ +  G  +  IGS    GF+DG FE     RP    Y+A +N LY  D+ENHA+R+ D     V+TL
Subjt:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS--YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL

Arabidopsis top hitse value%identityAlignment
AT1G56500.1 haloacid dehalogenase-like hydrolase family protein1.2e-1645.26Show/hide
Query:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS--YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL
        L FPG ++ D    RLF+SDSNHNRI++ +  G  +  IGS    GF+DG FE     RP    Y+A +N LY  D+ENHA+R+ D     V+TL
Subjt:  LHFPGCISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS--YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETL

AT3G07060.1 NHL domain-containing protein4.8e-17848.14Show/hide
Query:  SSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAA-EKSDMIQFIMR
        SSP  D+LSFIK++ D+ EGP+H WLN   GNK + +  G Y++LA   L+ T SD     E +K LQ R P +  +G   S     A +++ + + I++
Subjt:  SSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAA-EKSDMIQFIMR

Query:  EYVSFPILLSKKIFEMAPG-LCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGCISAD
        EY++FP+LLS+K F    G + YI+ KDF NPL+  EKD+D++ + KA++ L  ++ EKS        T+ KQAE IKE +  SF Q+ LL+FPGCISAD
Subjt:  EYVSFPILLSKKIFEMAPG-LCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGCISAD

Query:  EKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKSTQLWRWI
        E G RLFLSD+NH+RI+IF  +GKI+D IG +PGFEDG+FE  K+ RP  + Y   ++CLY VDSENHAIR+A++  RV+ET+YP+    KK+  LW WI
Subjt:  EKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKSTQLWRWI

Query:  MDKFGLGSIPDREV------EDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSV
        M+K GLG   D  V      E+F+ +SLLFPWH+++  D+SLL++N+SF  LW ++ ASG I EVV GFSKI+E  GQ I +++SVL+ +P   LQQ + 
Subjt:  MDKFGLGSIPDREV------EDFNPQSLLFPWHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSV

Query:  ANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVD
        A       P   LLSS T   D  ++ D   Q VLK +  S   SS QFSN G+LGLPYW   P ERV + A+  Q   + H Q  RLLPGK+ I++N++
Subjt:  ANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVD

Query:  LPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLA---FSPQ--ESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVI
        +P   ELVE +QE  IWRQ RG  +E         PSEK+G +QQWYDELDSLA    +P+  E E  ED N   ++   D ++HI+C V TSPG+SE+I
Subjt:  LPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLA---FSPQ--ESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVI

Query:  VYAPLYLRLRRTKDSEGDRDKHAAR-IADFLYPGSTGKKIKESCIQFLLNR-KGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSL
        VYA LYLRL R +++E    +  AR IA  L P      +KE     LL++ K +LR+ +F+KP+HVRI+LDS DHPKADNS+ +ILTDSSVEV+VSL
Subjt:  VYAPLYLRLRRTKDSEGDRDKHAAR-IADFLYPGSTGKKIKESCIQFLLNR-KGDLRESIFVKPLHVRIKLDSLDHPKADNSKGIILTDSSVEVNVSL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCATTTCGTTGTTCTGCAACGAATATGGCTTTCAGGTTCCGGCGACTCAAACAAATCTCAAGGTCTTTGCCTCAATTCTACTCCGGATATTATCATCAGTATCACCA
TAGTTATGCTATTAGCTCATTGGCATTGTCCGTCGCTCCATCTCATGTACCTGAAGGAATCGGTAAAAGGATTTTAGACAATGGACGCCACCTTCTGCGGACAAGTATCT
CCATTGGTATGAGGCCTTTTGGGGTGAACCAAAAGCAAAGCCATGAGGGTTATGCCCAAAGTGGACAATATCATACCATTGTGGAGATATTCGTTTTCAGGTTTTCTACA
ACAACAGAGCTGCAATGCGAGTCGTCTCCTGCAAATGACATCTTATCCTTCATTAAGTCAACCTCAGATGAGTCTGAAGGCCCTAACCACTGTTGGTTGAATACATCTGA
TGGAAATAAAGGAATTTCAGAAAAGGATGGAATCTACTTAATTCTTGCCAATCAATTTCTAGAGACAACAAGCTCTGATTCGGTTGTTTTGGTTGAAAATGTAAAGTTCC
TTCAGCACAGGTTTCCTCAGCTTCATGTGATTGGGTTTCAGTGTTCCAGTACTTTATCTGCCGCTGAAAAAAGTGACATGATCCAATTTATAATGAGGGAATATGTTTCC
TTTCCCATTTTGTTATCCAAGAAGATTTTTGAGATGGCGCCAGGGCTCTGTTATATTATCTCTAAGGACTTCAGTAATCCTTTGCTTCTCCGTGAGAAAGACATGGACCT
TAGCATTCTTCGGAAAGCTATTGAGGAGTTGCATGAACGAGAAAATGAGAAATCTAGTCTGCCCAGTAACGGGAGAACTACTTATCTTAAACAAGCGGAGATCATCAAAG
AACCATATTCATGTTCTTTCATGCAGAATTTTCTTCTCCACTTTCCAGGCTGTATATCTGCAGATGAAAAGGGTGGCAGACTCTTCCTTTCAGACAGCAATCATAACCGG
ATTGTTATATTCAATGGCAATGGGAAGATTCTGGACATGATTGGTTCTTATCCAGGTTTTGAGGATGGAGAATTTGAATTGGTCAAATTAGCTCGTCCTGCAGCTTCCTT
TTATCATGCAACTCAGAATTGCTTGTATTTTGTGGACTCTGAGAACCATGCCATTAGGAAAGCTGATTTGGGTAAGCGCGTAGTTGAAACTCTCTATCCAGAAAACTTCT
CAAGCAAGAAGAGTACTCAGTTATGGAGATGGATTATGGACAAATTTGGTCTGGGAAGCATTCCTGACAGAGAAGTAGAAGATTTCAATCCGCAGTCTCTGCTGTTTCCT
TGGCACATGATTAGATATATGGATGATAGTTTATTAATTTTAAATCGCAGTTTTCGGACACTCTGGACCATGGATTTGGCTTCAGGAAGAATTATTGAAGTTGTTAGAGG
GTTTTCAAAGATTATGGAGAACTATGGACAGTTGATCATGGACAGAGTATCTGTTCTTAAACAGATACCCGATGGTATGTTGCAGCAGCTAAGTGTTGCAAATATTGTCA
CAGGGGGGCTACCATACATGGATCTTTTATCTTCTCTAACACCCTTCCAGGATTGCACAATCATTTGTGATTCCGTTGGACAGGTGGTTTTGAAATATCATAGTAAATCT
AGTGAGAGCTCAAGCTTCCAATTTTCAAATTTTGGGGTCCTTGGATTACCATATTGGTTTGCTCCACCTCCGGAGAGGGTTATAAGCACTGCTGATAGTTTCCAAGGAGT
AGGGATTGATCATCTTCAGTTTTTCAGACTGCTGCCTGGAAAGGTTGGTATACAGATCAATGTTGATCTTCCTACAGATATTGAACTTGTGGAATCATTACAAGAAGACA
TCATATGGCGACAAGCAAGAGGAACTGCAACTGAAATCTTAATTGTTGAGGATGTAGCTGGGCCCTCAGAAAAGGTCGGTTCTGCTCAACAGTGGTATGATGAATTGGAC
AGTTTAGCCTTTTCACCGCAAGAATCAGAAATGATGGAAGATAATAACATAAGAGCTCTTAACCATATAGGAGACAATAAAGTTCACATTGAGTGTGCTGTCAATACAAG
TCCTGGAACTAGCGAGGTTATAGTTTATGCACCCCTATATTTGAGGCTCAGAAGAACCAAAGATTCCGAAGGCGATCGGGACAAACATGCAGCAAGGATAGCGGATTTCT
TGTACCCAGGAAGTACTGGGAAGAAGATAAAAGAAAGTTGCATTCAGTTCCTTCTAAACCGTAAAGGAGATCTGAGAGAGAGCATTTTTGTGAAACCTCTGCATGTCAGG
ATAAAGTTGGATTCTCTTGATCACCCTAAAGCTGATAATTCCAAAGGTATTATCCTCACAGACTCCTCAGTTGAAGTCAATGTATCATTAGATTCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCATTTCGTTGTTCTGCAACGAATATGGCTTTCAGGTTCCGGCGACTCAAACAAATCTCAAGGTCTTTGCCTCAATTCTACTCCGGATATTATCATCAGTATCACCA
TAGTTATGCTATTAGCTCATTGGCATTGTCCGTCGCTCCATCTCATGTACCTGAAGGAATCGGTAAAAGGATTTTAGACAATGGACGCCACCTTCTGCGGACAAGTATCT
CCATTGGTATGAGGCCTTTTGGGGTGAACCAAAAGCAAAGCCATGAGGGTTATGCCCAAAGTGGACAATATCATACCATTGTGGAGATATTCGTTTTCAGGTTTTCTACA
ACAACAGAGCTGCAATGCGAGTCGTCTCCTGCAAATGACATCTTATCCTTCATTAAGTCAACCTCAGATGAGTCTGAAGGCCCTAACCACTGTTGGTTGAATACATCTGA
TGGAAATAAAGGAATTTCAGAAAAGGATGGAATCTACTTAATTCTTGCCAATCAATTTCTAGAGACAACAAGCTCTGATTCGGTTGTTTTGGTTGAAAATGTAAAGTTCC
TTCAGCACAGGTTTCCTCAGCTTCATGTGATTGGGTTTCAGTGTTCCAGTACTTTATCTGCCGCTGAAAAAAGTGACATGATCCAATTTATAATGAGGGAATATGTTTCC
TTTCCCATTTTGTTATCCAAGAAGATTTTTGAGATGGCGCCAGGGCTCTGTTATATTATCTCTAAGGACTTCAGTAATCCTTTGCTTCTCCGTGAGAAAGACATGGACCT
TAGCATTCTTCGGAAAGCTATTGAGGAGTTGCATGAACGAGAAAATGAGAAATCTAGTCTGCCCAGTAACGGGAGAACTACTTATCTTAAACAAGCGGAGATCATCAAAG
AACCATATTCATGTTCTTTCATGCAGAATTTTCTTCTCCACTTTCCAGGCTGTATATCTGCAGATGAAAAGGGTGGCAGACTCTTCCTTTCAGACAGCAATCATAACCGG
ATTGTTATATTCAATGGCAATGGGAAGATTCTGGACATGATTGGTTCTTATCCAGGTTTTGAGGATGGAGAATTTGAATTGGTCAAATTAGCTCGTCCTGCAGCTTCCTT
TTATCATGCAACTCAGAATTGCTTGTATTTTGTGGACTCTGAGAACCATGCCATTAGGAAAGCTGATTTGGGTAAGCGCGTAGTTGAAACTCTCTATCCAGAAAACTTCT
CAAGCAAGAAGAGTACTCAGTTATGGAGATGGATTATGGACAAATTTGGTCTGGGAAGCATTCCTGACAGAGAAGTAGAAGATTTCAATCCGCAGTCTCTGCTGTTTCCT
TGGCACATGATTAGATATATGGATGATAGTTTATTAATTTTAAATCGCAGTTTTCGGACACTCTGGACCATGGATTTGGCTTCAGGAAGAATTATTGAAGTTGTTAGAGG
GTTTTCAAAGATTATGGAGAACTATGGACAGTTGATCATGGACAGAGTATCTGTTCTTAAACAGATACCCGATGGTATGTTGCAGCAGCTAAGTGTTGCAAATATTGTCA
CAGGGGGGCTACCATACATGGATCTTTTATCTTCTCTAACACCCTTCCAGGATTGCACAATCATTTGTGATTCCGTTGGACAGGTGGTTTTGAAATATCATAGTAAATCT
AGTGAGAGCTCAAGCTTCCAATTTTCAAATTTTGGGGTCCTTGGATTACCATATTGGTTTGCTCCACCTCCGGAGAGGGTTATAAGCACTGCTGATAGTTTCCAAGGAGT
AGGGATTGATCATCTTCAGTTTTTCAGACTGCTGCCTGGAAAGGTTGGTATACAGATCAATGTTGATCTTCCTACAGATATTGAACTTGTGGAATCATTACAAGAAGACA
TCATATGGCGACAAGCAAGAGGAACTGCAACTGAAATCTTAATTGTTGAGGATGTAGCTGGGCCCTCAGAAAAGGTCGGTTCTGCTCAACAGTGGTATGATGAATTGGAC
AGTTTAGCCTTTTCACCGCAAGAATCAGAAATGATGGAAGATAATAACATAAGAGCTCTTAACCATATAGGAGACAATAAAGTTCACATTGAGTGTGCTGTCAATACAAG
TCCTGGAACTAGCGAGGTTATAGTTTATGCACCCCTATATTTGAGGCTCAGAAGAACCAAAGATTCCGAAGGCGATCGGGACAAACATGCAGCAAGGATAGCGGATTTCT
TGTACCCAGGAAGTACTGGGAAGAAGATAAAAGAAAGTTGCATTCAGTTCCTTCTAAACCGTAAAGGAGATCTGAGAGAGAGCATTTTTGTGAAACCTCTGCATGTCAGG
ATAAAGTTGGATTCTCTTGATCACCCTAAAGCTGATAATTCCAAAGGTATTATCCTCACAGACTCCTCAGTTGAAGTCAATGTATCATTAGATTCCTAA
Protein sequenceShow/hide protein sequence
MSFRCSATNMAFRFRRLKQISRSLPQFYSGYYHQYHHSYAISSLALSVAPSHVPEGIGKRILDNGRHLLRTSISIGMRPFGVNQKQSHEGYAQSGQYHTIVEIFVFRFST
TTELQCESSPANDILSFIKSTSDESEGPNHCWLNTSDGNKGISEKDGIYLILANQFLETTSSDSVVLVENVKFLQHRFPQLHVIGFQCSSTLSAAEKSDMIQFIMREYVS
FPILLSKKIFEMAPGLCYIISKDFSNPLLLREKDMDLSILRKAIEELHERENEKSSLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGCISADEKGGRLFLSDSNHNR
IVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRVVETLYPENFSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFP
WHMIRYMDDSLLILNRSFRTLWTMDLASGRIIEVVRGFSKIMENYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKS
SESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELD
SLAFSPQESEMMEDNNIRALNHIGDNKVHIECAVNTSPGTSEVIVYAPLYLRLRRTKDSEGDRDKHAARIADFLYPGSTGKKIKESCIQFLLNRKGDLRESIFVKPLHVR
IKLDSLDHPKADNSKGIILTDSSVEVNVSLDS