CuGenDBv2

Lag0033160 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0033160
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: NHL domain-containing protein
Genome location: chr11:41305438..41315231
RNA-Seq Expression: Lag0033160
Synteny: Lag0033160
Gene Ontology terms: GO:0005515 - protein binding (molecular function)
InterPro domains: IPR001258 - NHL repeat
                  IPR011042 - Six-bladed beta-propeller, TolB-like
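
As a minimal sketch (not part of the CuGenDB record), the genome location string can be split into chromosome, start, and end. The helper name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which the page does not state.

```python
import re


def parse_location(loc):
    """Split a 'chrom:start..end' string into its parts.

    Returns (chrom, start, end, span). The span is end - start + 1,
    which ASSUMES 1-based inclusive coordinates.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr11:41305438..41315231")
print(chrom, start, end, span)
```

Under that assumption the gene spans 9794 bp on chr11.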


Homology
GenBank top hits | e value | % identity | Alignment
XP_011651650.1 uncharacterized protein LOC101209700 isoform X2 [Cucumis sativus] | 2.0e-254 | 83.59
Query:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
        AIEELHE ENEKSGL + G+TTYLKQAE+IKEP SCSFM NFLLH+PG ISADE+GGRLFLSDSNHNRIVIFN  GKILDMIGSYPGFEDGEFELVKLAR
Subjt:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR

Query:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW
        PAASFYH+TQNCLYFVDSENHAIRKADLGKR VETL+PENYS+KKSTQ W WIMDKFGLGSIPDREV+DFNPQS++FPWHMIRYMDDRLLILNRS  TLW
Subjt:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW

Query:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG
        TMDL SG+IIEVVRG S IM++YGQLIMDR+SV+KQIPDGMLQ+ S ANI  GG PY+DLLSSLT F++C IICDSVGQVVLK + KS E SSFQFSNFG
Subjt:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG

Query:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL
        VLGLPYWFAP PE+VI+TA+ F+G GIDHLQFF+LLPG+VGIQINVDLP+DIELVESL ED IWRQARGTATEI IVE VAGPSEKVGSAQQWYDELDSL
Subjt:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL

Query:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDK-HAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREI
        AFSPQESE +ED N+RA N+IGD+KVHIECAVNTSPGTSEVIVYA LYLRLRRN+DSEG+ +K HA RIADFLYP S GKMIKE+CIQFL+N K DLRE+
Subjt:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDK-HAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREI

Query:  IFVKPLHVRIKLDSLDHPKADNSK
        IFVKPLHVRIKLDS  HPKA+NSK
Subjt:  IFVKPLHVRIKLDSLDHPKADNSK

XP_022159641.1 uncharacterized protein LOC111025991 isoform X1 [Momordica charantia] | 2.8e-256 | 83.94
Query:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
        AIEEL EQE+EK   P+NGRTTYLKQAEI  EPYSCSFMQNFLLHFPG ISADEKGGRLFLSDSNHNRI+IFN NGKILD+IGSYPGF+DGEFELVKLAR
Subjt:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR

Query:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW
        PAASFYHATQNCLYFVDSENHAIRKADL KR VETL+PENYSSKKSTQLW WIMDKFGLGS  +RE+EDFNPQSL+FPWH+IRY+DDRLLILNRS QTLW
Subjt:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW

Query:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG
         MDL SG+IIEVVRG SNIM+ YGQLI D+VSV+KQIP G LQ+LS ANIVTGGLPY+DLLSS TPFQ+C IICDSVGQV+LKYH  S ESSSFQFSNFG
Subjt:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG

Query:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL
        VLGLPYWFA PPE+VI+TADSF+G GIDHLQFFRLLPGKVGIQINVDLP DIELVESLQED IWRQARGTATE LIVEDVAGPSEKVGSAQQWYDELDSL
Subjt:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL

Query:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKHAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREII
        AFSP +SE +EDN  R  NHIGD+KV IECAVNTSPGTSEVIVYA LYLRLRRN+D EG+ +K A RIA FLYP + GK+ KE CIQFLL  K  LRE+I
Subjt:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKHAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREII

Query:  FVKPLHVRIKLDSLDHPKADNSK
        FVKPLHVRIKLDSLDHPKADNSK
Subjt:  FVKPLHVRIKLDSLDHPKADNSK

XP_022159642.1 uncharacterized protein LOC111025991 isoform X2 [Momordica charantia] | 2.8e-256 | 83.94
Query:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
        AIEEL EQE+EK   P+NGRTTYLKQAEI  EPYSCSFMQNFLLHFPG ISADEKGGRLFLSDSNHNRI+IFN NGKILD+IGSYPGF+DGEFELVKLAR
Subjt:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR

Query:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW
        PAASFYHATQNCLYFVDSENHAIRKADL KR VETL+PENYSSKKSTQLW WIMDKFGLGS  +RE+EDFNPQSL+FPWH+IRY+DDRLLILNRS QTLW
Subjt:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW

Query:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG
         MDL SG+IIEVVRG SNIM+ YGQLI D+VSV+KQIP G LQ+LS ANIVTGGLPY+DLLSS TPFQ+C IICDSVGQV+LKYH  S ESSSFQFSNFG
Subjt:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG

Query:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL
        VLGLPYWFA PPE+VI+TADSF+G GIDHLQFFRLLPGKVGIQINVDLP DIELVESLQED IWRQARGTATE LIVEDVAGPSEKVGSAQQWYDELDSL
Subjt:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL

Query:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKHAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREII
        AFSP +SE +EDN  R  NHIGD+KV IECAVNTSPGTSEVIVYA LYLRLRRN+D EG+ +K A RIA FLYP + GK+ KE CIQFLL  K  LRE+I
Subjt:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKHAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREII

Query:  FVKPLHVRIKLDSLDHPKADNSK
        FVKPLHVRIKLDSLDHPKADNSK
Subjt:  FVKPLHVRIKLDSLDHPKADNSK

XP_038888990.1 uncharacterized protein LOC120078755 isoform X1 [Benincasa hispida] | 6.2e-264 | 86.07
Query:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
        AIEELHE ENEKSGLPS GRTTY+KQAEI+KEP SCSFMQNFLLHFPG ISADE+G RLFLSDSNHNRIVIFN +GKILDMIGSYPGFEDGEFELVKLAR
Subjt:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR

Query:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW
        PAASFYH+TQNCLYFVDSENHAIRKADLGKR VETL+PENYS+K STQLW WIMDKFG+GSIPDREVEDFNPQSL+FPWHMIRYMDDRLLIL+RS  TLW
Subjt:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW

Query:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG
        TMDLASG+IIE+VRG S IM+NYGQLIMDR+SVLKQIPDGMLQ  + ANI TGGLPY+DLLSSLTPFQ+C IICDSVGQVVLKY+ KS ESSSFQFSNFG
Subjt:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG

Query:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL
        VLGLPYW APPPE+VI+ AD+FQG  IDHLQFFRLLPGKVGIQINVDLPTDIELVESL ED IWRQARGTATEI IVE+VA PSEKVGSAQQWYDELDSL
Subjt:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL

Query:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDK-HAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREI
        AFSPQESE +ED N+RA N+IGD+KVHIECAVNTSPGTSEVIVYA LYLRLRRN+DSEG+ DK HA RIADFLYPG+ GKMIKE CI+FL+N K DLRE+
Subjt:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDK-HAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREI

Query:  IFVKPLHVRIKLDSLDHPKADNSK
        IFVKPLHVRIKLDSL HPKA+NSK
Subjt:  IFVKPLHVRIKLDSLDHPKADNSK

XP_038888991.1 uncharacterized protein LOC120078755 isoform X2 [Benincasa hispida] | 6.2e-264 | 86.07
Query:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
        AIEELHE ENEKSGLPS GRTTY+KQAEI+KEP SCSFMQNFLLHFPG ISADE+G RLFLSDSNHNRIVIFN +GKILDMIGSYPGFEDGEFELVKLAR
Subjt:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR

Query:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW
        PAASFYH+TQNCLYFVDSENHAIRKADLGKR VETL+PENYS+K STQLW WIMDKFG+GSIPDREVEDFNPQSL+FPWHMIRYMDDRLLIL+RS  TLW
Subjt:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW

Query:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG
        TMDLASG+IIE+VRG S IM+NYGQLIMDR+SVLKQIPDGMLQ  + ANI TGGLPY+DLLSSLTPFQ+C IICDSVGQVVLKY+ KS ESSSFQFSNFG
Subjt:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG

Query:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL
        VLGLPYW APPPE+VI+ AD+FQG  IDHLQFFRLLPGKVGIQINVDLPTDIELVESL ED IWRQARGTATEI IVE+VA PSEKVGSAQQWYDELDSL
Subjt:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL

Query:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDK-HAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREI
        AFSPQESE +ED N+RA N+IGD+KVHIECAVNTSPGTSEVIVYA LYLRLRRN+DSEG+ DK HA RIADFLYPG+ GKMIKE CI+FL+N K DLRE+
Subjt:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDK-HAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREI

Query:  IFVKPLHVRIKLDSLDHPKADNSK
        IFVKPLHVRIKLDSL HPKA+NSK
Subjt:  IFVKPLHVRIKLDSLDHPKADNSK

TrEMBL top hits | e value | % identity | Alignment
A0A6J1E0D7 uncharacterized protein LOC111025991 isoform X2 | 1.4e-256 | 83.94
Query:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
        AIEEL EQE+EK   P+NGRTTYLKQAEI  EPYSCSFMQNFLLHFPG ISADEKGGRLFLSDSNHNRI+IFN NGKILD+IGSYPGF+DGEFELVKLAR
Subjt:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR

Query:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW
        PAASFYHATQNCLYFVDSENHAIRKADL KR VETL+PENYSSKKSTQLW WIMDKFGLGS  +RE+EDFNPQSL+FPWH+IRY+DDRLLILNRS QTLW
Subjt:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW

Query:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG
         MDL SG+IIEVVRG SNIM+ YGQLI D+VSV+KQIP G LQ+LS ANIVTGGLPY+DLLSS TPFQ+C IICDSVGQV+LKYH  S ESSSFQFSNFG
Subjt:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG

Query:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL
        VLGLPYWFA PPE+VI+TADSF+G GIDHLQFFRLLPGKVGIQINVDLP DIELVESLQED IWRQARGTATE LIVEDVAGPSEKVGSAQQWYDELDSL
Subjt:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL

Query:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKHAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREII
        AFSP +SE +EDN  R  NHIGD+KV IECAVNTSPGTSEVIVYA LYLRLRRN+D EG+ +K A RIA FLYP + GK+ KE CIQFLL  K  LRE+I
Subjt:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKHAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREII

Query:  FVKPLHVRIKLDSLDHPKADNSK
        FVKPLHVRIKLDSLDHPKADNSK
Subjt:  FVKPLHVRIKLDSLDHPKADNSK

A0A6J1E4J1 uncharacterized protein LOC111025991 isoform X1 | 1.4e-256 | 83.94
Query:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
        AIEEL EQE+EK   P+NGRTTYLKQAEI  EPYSCSFMQNFLLHFPG ISADEKGGRLFLSDSNHNRI+IFN NGKILD+IGSYPGF+DGEFELVKLAR
Subjt:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR

Query:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW
        PAASFYHATQNCLYFVDSENHAIRKADL KR VETL+PENYSSKKSTQLW WIMDKFGLGS  +RE+EDFNPQSL+FPWH+IRY+DDRLLILNRS QTLW
Subjt:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW

Query:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG
         MDL SG+IIEVVRG SNIM+ YGQLI D+VSV+KQIP G LQ+LS ANIVTGGLPY+DLLSS TPFQ+C IICDSVGQV+LKYH  S ESSSFQFSNFG
Subjt:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG

Query:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL
        VLGLPYWFA PPE+VI+TADSF+G GIDHLQFFRLLPGKVGIQINVDLP DIELVESLQED IWRQARGTATE LIVEDVAGPSEKVGSAQQWYDELDSL
Subjt:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL

Query:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKHAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREII
        AFSP +SE +EDN  R  NHIGD+KV IECAVNTSPGTSEVIVYA LYLRLRRN+D EG+ +K A RIA FLYP + GK+ KE CIQFLL  K  LRE+I
Subjt:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKHAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREII

Query:  FVKPLHVRIKLDSLDHPKADNSK
        FVKPLHVRIKLDSLDHPKADNSK
Subjt:  FVKPLHVRIKLDSLDHPKADNSK

A0A6J1HJ63 uncharacterized protein LOC111464050 isoform X1 | 1.1e-253 | 83.27
Query:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
        AIEEL E ENEKSGLP+ GRTTYLK AEIIKEPYSC FMQNF+LHFPG ISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
Subjt:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR

Query:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW
        PAASFYHATQ+CLYFVDSENHAIRKADLGKR VETL+P NYSS KSTQLW WI D+ GLGSIPDREVEDFNPQSL+FPWHMIRYMDDRLLILNRS  TLW
Subjt:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW

Query:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG
        TMDLASG+IIEVV+G S IM+NYGQL MD +SVLKQIPDGMLQ    A  +TG LPY+DLLSSLTPFQ+C IICDSVGQV++KY+ +S ESSS QFSNFG
Subjt:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG

Query:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL
        VLGLPYWFAPPPE+VISTADSFQG GIDH+ FFRLLPGKVGI INVDLPTDIELVES+QED IWRQ RGTATEI IVE V+G SEKVGSAQQWYDELDSL
Subjt:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL

Query:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKH---AARIADFLYPGSTGKMIKESCIQFLLNHKGDLR
        AFSPQESE +ED NIRA+NHIGD K  IECAVNTSPGTSEVIVYA +YLR RR +D EG+  K    AARIAD LYPGS GK IKESCIQFL+N K DLR
Subjt:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKH---AARIADFLYPGSTGKMIKESCIQFLLNHKGDLR

Query:  EIIFVKPLHVRIKLDSLDHPKADNSK
        E+IFVKPLHVRIKLD++ HPKADNSK
Subjt:  EIIFVKPLHVRIKLDSLDHPKADNSK

A0A6J1HLG1 uncharacterized protein LOC111464050 isoform X2 | 1.1e-253 | 83.27
Query:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
        AIEEL E ENEKSGLP+ GRTTYLK AEIIKEPYSC FMQNF+LHFPG ISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
Subjt:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR

Query:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW
        PAASFYHATQ+CLYFVDSENHAIRKADLGKR VETL+P NYSS KSTQLW WI D+ GLGSIPDREVEDFNPQSL+FPWHMIRYMDDRLLILNRS  TLW
Subjt:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW

Query:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG
        TMDLASG+IIEVV+G S IM+NYGQL MD +SVLKQIPDGMLQ    A  +TG LPY+DLLSSLTPFQ+C IICDSVGQV++KY+ +S ESSS QFSNFG
Subjt:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG

Query:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL
        VLGLPYWFAPPPE+VISTADSFQG GIDH+ FFRLLPGKVGI INVDLPTDIELVES+QED IWRQ RGTATEI IVE V+G SEKVGSAQQWYDELDSL
Subjt:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL

Query:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKH---AARIADFLYPGSTGKMIKESCIQFLLNHKGDLR
        AFSPQESE +ED NIRA+NHIGD K  IECAVNTSPGTSEVIVYA +YLR RR +D EG+  K    AARIAD LYPGS GK IKESCIQFL+N K DLR
Subjt:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKH---AARIADFLYPGSTGKMIKESCIQFLLNHKGDLR

Query:  EIIFVKPLHVRIKLDSLDHPKADNSK
        E+IFVKPLHVRIKLD++ HPKADNSK
Subjt:  EIIFVKPLHVRIKLDSLDHPKADNSK

A0A6J1HW28 uncharacterized protein LOC111466804 isoform X1 | 2.7e-252 | 82.73
Query:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
        AIEEL E ENEKSGLP+ GRTTYLK AEIIKEPYSCSFMQNF+LHFPG ISADEKGGRLFLSDSNHNRI+IFNGNGKILDMIGSYPGFEDGEFELVKLAR
Subjt:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR

Query:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW
        PAASFYHATQ+CLYFVDSENHAIRKADLGKR VETL+P NYSS KSTQLW WI D+ GLGS+PDREVEDFNPQSL+FPWHMI+YMDDRLLILNRS  TLW
Subjt:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLW

Query:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG
        TMDLASG+IIEVV+G S IM+NY QL MDRVSVLKQIPDGMLQ    A  VTGGLPY+DLLSSLT FQ+C IICDSVGQV++KY+ +S ESSS +FSNFG
Subjt:  TMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFG

Query:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL
        VLGLPYWFAPPPE+VISTADSFQG GIDH+ FFRLLPGKVGI INVDLPTDIELVES+QED IWRQ RGTATEI IVE V+  SEKVGSAQQWYDELDSL
Subjt:  VLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSL

Query:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKH---AARIADFLYPGSTGKMIKESCIQFLL-NHKGDL
        AFSPQESE +ED N+RA+NHIGD K  IECAVNTSPGTSEVIVYA +YLR RR++D EG+ DK    AARIAD LYPGS GK IKESCIQFLL N K DL
Subjt:  AFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKH---AARIADFLYPGSTGKMIKESCIQFLL-NHKGDL

Query:  REIIFVKPLHVRIKLDSLDHPKADNSK
        RE++FVKPLHVRIKLD++ HPKADNSK
Subjt:  REIIFVKPLHVRIKLDSLDHPKADNSK

SwissProt top hits | e value | % identity | Alignment
A4IF69 NHL repeat-containing protein 2 | 7.0e-08 | 38.3
Query:  LHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRSVETL
        L FPG I+ D    RL ++D+ H+RI++   NG+I   IG   PG +DG F       P         N +Y  D+ENH IRK DL    V T+
Subjt:  LHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRSVETL

Q8BZW8 NHL repeat-containing protein 2 | 1.7e-09 | 38.3
Query:  LHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRSVETL
        L FPG ++ D   GRL ++D+ H+RI++   NG+I   IG   PG +DG F       P         N +Y  D+ENH IRK DL    V T+
Subjt:  LHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRSVETL

Q8NBF2 NHL repeat-containing protein 2 | 1.2e-07 | 37.23
Query:  LHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRSVETL
        L FPG ++ D+   RL ++D+ H+RI++   NG+I   IG   PG +DG F       P         N +Y  D+ENH IRK DL    V T+
Subjt:  LHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS-YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRSVETL

Q8VZ10 Protein SUPPRESSOR OF QUENCHING 1, chloroplastic | 1.2e-15 | 45.26
Query:  LHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS--YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRSVETL
        L FPG ++ D    RLF+SDSNHNRI++ +  G  +  IGS    GF+DG FE     RP    Y+A +N LY  D+ENHA+R+ D     V+TL
Subjt:  LHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS--YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRSVETL

Arabidopsis top hits | e value | % identity | Alignment
AT1G56500.1 haloacid dehalogenase-like hydrolase family protein | 8.4e-17 | 45.26
Query:  LHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS--YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRSVETL
        L FPG ++ D    RLF+SDSNHNRI++ +  G  +  IGS    GF+DG FE     RP    Y+A +N LY  D+ENHA+R+ D     V+TL
Subjt:  LHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGS--YPGFEDGEFELVKLARPAASFYHATQNCLYFVDSENHAIRKADLGKRSVETL

AT3G07060.1 NHL domain-containing protein | 4.3e-138 | 48.32
Query:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR
        A++ L  Q+ EKS        T+ KQAE IKE +  SF Q+ LL+FPG ISADE G RLFLSD+NH+RI+IF  +GKI+D IG +PGFEDG+FE  K+ R
Subjt:  AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLAR

Query:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREV------EDFNPQSLLFPWHMIRYMDDRLLILNR
        P  + Y   ++CLY VDSENHAIR+A++  R +ET++P+    KK+  LW WIM+K GLG   D  V      E+F+ +SLLFPWH+++  D+ LL++N+
Subjt:  PAASFYHATQNCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREV------EDFNPQSLLFPWHMIRYMDDRLLILNR

Query:  SFQTLWTMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSF
        SF  LW ++ ASG I EVV GFS I++  GQ I +++SVL+ +P   LQQ + A       P   LLSS T   D  ++ D   Q VLK +  S   SS 
Subjt:  SFQTLWTMDLASGRIIEVVRGFSNIMKNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSF

Query:  QFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWY
        QFSN G+LGLPYW   P ERV + A+  Q   + H Q  RLLPGK+ I++N+++P   ELVE +QE  IWRQ RG  +E         PSEK+G +QQWY
Subjt:  QFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHLQFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWY

Query:  DELDSLA---FSPQ--ESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKHAAR-IADFLYPGSTGKMIKESC-IQ
        DELDSLA    +P+  E E  ED N   ++   D ++HI+C V TSPG+SE+IVYA LYLRL RN+++E    +  AR IA  L P      +KE   + 
Subjt:  DELDSLA---FSPQ--ESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSEVIVYAPLYLRLRRNKDSEGDRDKHAAR-IADFLYPGSTGKMIKESC-IQ

Query:  FLLNHKGDLREIIFVKPLHVRIKLDSLDHPKADNSK
         L   K +LR+I+F+KP+HVRI+LDS DHPKADNS+
Subjt:  FLLNHKGDLREIIFVKPLHVRIKLDSLDHPKADNSK
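
The alignment blocks above can be summarised from their middle line. The sketch below assumes the usual BLAST convention, which the page does not spell out: a letter in the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `midline_stats` is hypothetical.

```python
def midline_stats(query, midline):
    """Count identical and positive columns in one alignment segment.

    ASSUMES the BLAST-style convention: letter = identity,
    '+' = conservative substitution, space = mismatch or gap.
    Returns (identical, positives, columns).
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    positives = identical + midline.count("+")
    return identical, positives, len(query)


# Final segment of the first GenBank hit (XP_011651650.1).
query = "IFVKPLHVRIKLDSLDHPKADNSK"
midline = "IFVKPLHVRIKLDS  HPKA+NSK"
identical, positives, columns = midline_stats(query, midline)
print(f"{identical}/{columns} identical ({100 * identical / columns:.1f}%)")
```

For this one segment the counts are 21 identical and 22 positive columns out of 24. The % identity reported for a hit covers the whole alignment, so a single segment will generally differ from it.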


Sequences
CDS sequence
ATGGGTCGGGAAGTTCTCAGTCAAATCTGCTTTACAGGTTTTCAAGCTAGGAAGGCAGTGCTGGACGGTAGCTTATCACCCCAAATTTGGGAAGGAAAAGTCCCAAATAG
GGTGAAATTTTCTTGTGGACGGTGGCTCATAAAAGCATCAACACGATGGACAAGGTGCAAAAGAGATATCCCAGTCTCTCTATCTCCCCACATATATGTATGCTTTGCCA
TAGAAGTGAGGAGACAGCATCCCACTTATTGCTTCATTGTGATTTGCAAAGGAAGTGTGGAATTATTTTGGGGAACCCTTTGGTATTCAGGGGTGCAAGCCCAGTTGTGT
GCTATTGAGGAGTTGCATGAACAAGAAAATGAGAAATCTGGTCTGCCCAGTAACGGGAGAACTACTTATCTTAAACAAGCGGAGATCATCAAAGAACCATATTCATGTTC
TTTCATGCAGAATTTTCTTCTCCACTTTCCAGGCGGTATATCTGCAGATGAAAAGGGTGGCAGACTCTTCCTTTCAGACAGCAATCATAACCGGATTGTTATATTCAATG
GCAATGGGAAGATTCTGGACATGATTGGTTCTTATCCAGGTTTTGAGGATGGAGAATTTGAATTGGTCAAATTAGCTCGTCCTGCAGCTTCCTTTTATCATGCTACTCAG
AATTGCTTGTATTTTGTGGACTCTGAGAACCATGCCATTAGGAAAGCTGATTTGGGTAAGCGCTCAGTTGAAACTCTCCATCCAGAAAACTACTCAAGCAAGAAGAGTAC
TCAGTTATGGAGATGGATTATGGACAAATTTGGTCTGGGAAGCATTCCTGACAGAGAAGTAGAAGATTTCAATCCGCAGTCTCTGCTGTTTCCTTGGCACATGATTAGAT
ATATGGATGATAGATTATTAATTTTAAATCGCAGTTTTCAGACACTATGGACCATGGATTTGGCTTCAGGAAGAATTATTGAAGTTGTTAGAGGGTTTTCAAATATTATG
AAGAACTATGGACAGTTGATCATGGACAGAGTATCTGTTCTTAAACAGATACCCGATGGTATGTTGCAGCAGCTAAGTGTTGCAAATATTGTCACAGGGGGGCTACCATA
CATGGATCTTTTATCTTCTCTAACACCCTTCCAGGATTGCACAATCATCTGCGATTCCGTTGGACAGGTGGTTTTGAAATATCATAGTAAATCCAGTGAGAGCTCAAGCT
TCCAATTTTCAAATTTTGGGGTCCTTGGACTACCATATTGGTTTGCTCCACCTCCGGAGAGGGTTATAAGCACTGCTGATAGTTTCCAAGGAGTAGGGATTGATCATCTT
CAGTTTTTCAGACTTCTGCCTGGAAAGGTTGGTATACAGATCAATGTTGATCTTCCTACAGATATTGAACTTGTGGAATCATTACAAGAAGACATCATATGGCGACAAGC
AAGAGGAACTGCAACTGAAATCTTAATTGTTGAGGATGTAGCTGGGCCCTCAGAAAAGGTTGGTTCTGCTCAACAGTGGTATGATGAATTGGATAGTCTAGCCTTTTCAC
CGCAAGAATCAGAAACGATGGAAGATAATAATATAAGAGCTCTTAACCATATAGGAGACCATAAAGTTCACATTGAGTGTGCTGTCAATACAAGTCCTGGAACTAGCGAG
GTTATAGTTTATGCACCCCTATATTTGAGGCTCAGAAGAAACAAAGATTCCGAAGGCGATCGGGACAAACATGCAGCAAGGATAGCGGATTTCTTGTACCCAGGAAGTAC
TGGGAAGATGATAAAAGAAAGTTGCATTCAGTTCCTTCTAAACCATAAAGGAGATCTGAGAGAGATCATTTTTGTGAAACCTTTGCATGTCAGGATAAAGTTGGATTCTC
TTGATCACCCCAAAGCTGATAATTCCAAAGTCACTGCCATTGCCATGAAGCCGAAGAATATTTTCTCTCCATCCATTCTCTGGGATCCTGTTGATTCGCCGCCGTGGTTG
GTTTTCTTCTCCGATTCTTTCATTCTTCACTTCGATTCGTCGTCGCTCCAGAATTCGCGTGATTTCAGTCTTCCAAATGCATCTCTTTCACAAGGACTAGAAAAATCGAC
GACTGGAATTCATGCTGTGCATAGGAAGATGATAATTCAATAA
mRNA sequence
ATGGGTCGGGAAGTTCTCAGTCAAATCTGCTTTACAGGTTTTCAAGCTAGGAAGGCAGTGCTGGACGGTAGCTTATCACCCCAAATTTGGGAAGGAAAAGTCCCAAATAG
GGTGAAATTTTCTTGTGGACGGTGGCTCATAAAAGCATCAACACGATGGACAAGGTGCAAAAGAGATATCCCAGTCTCTCTATCTCCCCACATATATGTATGCTTTGCCA
TAGAAGTGAGGAGACAGCATCCCACTTATTGCTTCATTGTGATTTGCAAAGGAAGTGTGGAATTATTTTGGGGAACCCTTTGGTATTCAGGGGTGCAAGCCCAGTTGTGT
GCTATTGAGGAGTTGCATGAACAAGAAAATGAGAAATCTGGTCTGCCCAGTAACGGGAGAACTACTTATCTTAAACAAGCGGAGATCATCAAAGAACCATATTCATGTTC
TTTCATGCAGAATTTTCTTCTCCACTTTCCAGGCGGTATATCTGCAGATGAAAAGGGTGGCAGACTCTTCCTTTCAGACAGCAATCATAACCGGATTGTTATATTCAATG
GCAATGGGAAGATTCTGGACATGATTGGTTCTTATCCAGGTTTTGAGGATGGAGAATTTGAATTGGTCAAATTAGCTCGTCCTGCAGCTTCCTTTTATCATGCTACTCAG
AATTGCTTGTATTTTGTGGACTCTGAGAACCATGCCATTAGGAAAGCTGATTTGGGTAAGCGCTCAGTTGAAACTCTCCATCCAGAAAACTACTCAAGCAAGAAGAGTAC
TCAGTTATGGAGATGGATTATGGACAAATTTGGTCTGGGAAGCATTCCTGACAGAGAAGTAGAAGATTTCAATCCGCAGTCTCTGCTGTTTCCTTGGCACATGATTAGAT
ATATGGATGATAGATTATTAATTTTAAATCGCAGTTTTCAGACACTATGGACCATGGATTTGGCTTCAGGAAGAATTATTGAAGTTGTTAGAGGGTTTTCAAATATTATG
AAGAACTATGGACAGTTGATCATGGACAGAGTATCTGTTCTTAAACAGATACCCGATGGTATGTTGCAGCAGCTAAGTGTTGCAAATATTGTCACAGGGGGGCTACCATA
CATGGATCTTTTATCTTCTCTAACACCCTTCCAGGATTGCACAATCATCTGCGATTCCGTTGGACAGGTGGTTTTGAAATATCATAGTAAATCCAGTGAGAGCTCAAGCT
TCCAATTTTCAAATTTTGGGGTCCTTGGACTACCATATTGGTTTGCTCCACCTCCGGAGAGGGTTATAAGCACTGCTGATAGTTTCCAAGGAGTAGGGATTGATCATCTT
CAGTTTTTCAGACTTCTGCCTGGAAAGGTTGGTATACAGATCAATGTTGATCTTCCTACAGATATTGAACTTGTGGAATCATTACAAGAAGACATCATATGGCGACAAGC
AAGAGGAACTGCAACTGAAATCTTAATTGTTGAGGATGTAGCTGGGCCCTCAGAAAAGGTTGGTTCTGCTCAACAGTGGTATGATGAATTGGATAGTCTAGCCTTTTCAC
CGCAAGAATCAGAAACGATGGAAGATAATAATATAAGAGCTCTTAACCATATAGGAGACCATAAAGTTCACATTGAGTGTGCTGTCAATACAAGTCCTGGAACTAGCGAG
GTTATAGTTTATGCACCCCTATATTTGAGGCTCAGAAGAAACAAAGATTCCGAAGGCGATCGGGACAAACATGCAGCAAGGATAGCGGATTTCTTGTACCCAGGAAGTAC
TGGGAAGATGATAAAAGAAAGTTGCATTCAGTTCCTTCTAAACCATAAAGGAGATCTGAGAGAGATCATTTTTGTGAAACCTTTGCATGTCAGGATAAAGTTGGATTCTC
TTGATCACCCCAAAGCTGATAATTCCAAAGTCACTGCCATTGCCATGAAGCCGAAGAATATTTTCTCTCCATCCATTCTCTGGGATCCTGTTGATTCGCCGCCGTGGTTG
GTTTTCTTCTCCGATTCTTTCATTCTTCACTTCGATTCGTCGTCGCTCCAGAATTCGCGTGATTTCAGTCTTCCAAATGCATCTCTTTCACAAGGACTAGAAAAATCGAC
GACTGGAATTCATGCTGTGCATAGGAAGATGATAATTCAATAA
Protein sequence
MGREVLSQICFTGFQARKAVLDGSLSPQIWEGKVPNRVKFSCGRWLIKASTRWTRCKRDIPVSLSPHIYVCFAIEVRRQHPTYCFIVICKGSVELFWGTLWYSGVQAQLC
AIEELHEQENEKSGLPSNGRTTYLKQAEIIKEPYSCSFMQNFLLHFPGGISADEKGGRLFLSDSNHNRIVIFNGNGKILDMIGSYPGFEDGEFELVKLARPAASFYHATQ
NCLYFVDSENHAIRKADLGKRSVETLHPENYSSKKSTQLWRWIMDKFGLGSIPDREVEDFNPQSLLFPWHMIRYMDDRLLILNRSFQTLWTMDLASGRIIEVVRGFSNIM
KNYGQLIMDRVSVLKQIPDGMLQQLSVANIVTGGLPYMDLLSSLTPFQDCTIICDSVGQVVLKYHSKSSESSSFQFSNFGVLGLPYWFAPPPERVISTADSFQGVGIDHL
QFFRLLPGKVGIQINVDLPTDIELVESLQEDIIWRQARGTATEILIVEDVAGPSEKVGSAQQWYDELDSLAFSPQESETMEDNNIRALNHIGDHKVHIECAVNTSPGTSE
VIVYAPLYLRLRRNKDSEGDRDKHAARIADFLYPGSTGKMIKESCIQFLLNHKGDLREIIFVKPLHVRIKLDSLDHPKADNSKVTAIAMKPKNIFSPSILWDPVDSPPWL
VFFSDSFILHFDSSSLQNSRDFSLPNASLSQGLEKSTTGIHAVHRKMIIQ
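
The relationship between the CDS and the protein sequence can be checked with a short translation routine. This is a sketch under one stated assumption: that the standard genetic code applies (the page does not name a translation table). The name `translate` is hypothetical, and only the first and last codons of the CDS are used here for brevity.

```python
from itertools import product

# Standard genetic code (ASSUMED), with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate an in-frame coding sequence, stopping at the first stop codon.

    Raises ValueError if the length is not a multiple of three and
    KeyError if a codon contains characters other than A, C, G, T.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length must be a multiple of 3")
    residues = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First six codons and last eight codons of the CDS listed above.
print(translate("ATGGGTCGGGAAGTTCTC"))
print(translate("CATAGGAAGATGATAATTCAATAA"))
```

The two fragments translate to `MGREVL` and `HRKMIIQ`, which match the start and end of the protein sequence listed above.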