; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038065 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038065
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationchr2:12188865..12193824
RNA-Seq ExpressionLag0038065
SyntenyLag0038065
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6607786.1 hypothetical protein SDJN03_01128, partial [Cucurbita argyrosperma subsp. sororia]2.8e-22291.57Show/hide
Query:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL
        MGD +REALIS+KA VFNRS SHA DEL SFRSYLRWMCVDQSDIW+AGLSWS+F LFAI+VPATSHF LACSSCDSHHARPFDRVVQLSLSSVATVSFL
Subjt:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL

Query:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC
        CLSNFIRRYGLRRFLFFD+LC+ESETVR GYTNK NRSLRVLSAFV+PCFAAESAYKIWWYASGASQIPFLGNVIVSDAVAC+MEL+SWLYRTTVIFLVC
Subjt:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC

Query:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
        ILFRLICDLQILRLQDFATVFQVDSDVGSVL+EHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLL+TTKSSS LNIYIAGELALCSMTLLTSLMILL
Subjt:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL

Query:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
        RSATKITHKAQSVT+LAAKWHVCATL+SFDVTDGETPMAA A   G  +FPVT RG EES+ ++GCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
Subjt:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN

Query:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
        NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
Subjt:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS

XP_022940637.1 uncharacterized protein LOC111446173 [Cucurbita moschata]5.6e-22391.8Show/hide
Query:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL
        MGD +REALIS+KA VFNRS SHA DEL SFRSYLRWMCVDQSDIW+AGLSWS+F LFAI+VPATSHF LACSSCDSHHARPFDRVVQLSLSSVATVSFL
Subjt:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL

Query:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC
        CLSNFIRRYGLRRFLFFD+LC+ESETVRRGYTNK NRSLRVLSAFV+PCFAAESAYKIWWYASGASQIPFLGNVIVSDAVAC+MEL+SWLYRTTVIFLVC
Subjt:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC

Query:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
        ILFRLICDLQILRLQDFATVFQVDSDVGSVL+EHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLL+TTKSSS LNIYIAGELALCSMTLLTSLMILL
Subjt:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL

Query:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
        RSATKITHKAQSVT+LAAKWHVCATL+SFDVTDGETPMAA A   G  +FPVT RG EES+ ++GCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
Subjt:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN

Query:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
        NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
Subjt:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS

XP_022981472.1 uncharacterized protein LOC111480583 [Cucurbita maxima]6.7e-22492.26Show/hide
Query:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL
        MGD +REALIS+KA VFNRS SHA DEL SFRSYLRWMCVDQSDIW+AGLSWS+F LFAI+VPATSHF LACSSCDSHHARPFDRVVQLSLSSVATVSFL
Subjt:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL

Query:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC
        CLSNFIRRYGLRRFLFFD+LC+ESETVRRGYTNK NRSLRVLSAFV+PCFAAESAYKIWWYASGASQIPFLGNVIVSDAVAC+MEL+SWLYRTTVIFLVC
Subjt:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC

Query:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
        ILFRLICDLQILRLQDFATVFQVDSDVGSVL+EHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLL+TTKSSS LNIYIAGELALCSMTLLTSLMILL
Subjt:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL

Query:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
        RSATKITHKAQSVT+LAAKWHVCATL+SFDVTDGETPMAA A A G  +FPVT RG EESD ++GCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
Subjt:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN

Query:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
        NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
Subjt:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS

XP_023523310.1 uncharacterized protein LOC111787544 [Cucurbita pepo subsp. pepo]1.1e-22191.59Show/hide
Query:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL
        MGD +REALIS+KA VFNRS SHA DEL SFRSYLRWMCVDQSDIW+AGLSWS+F LFAI+VPA SHF LACSSCDSHHARPFDRVVQLSLSSVATVSFL
Subjt:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL

Query:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC
        CLSNFIRRYGLRRFLFFD+LC+ESETVRRGYTNK NRSLRVLSAFV+PCFAAESAYKIWWYASGASQIPFLGNVIVSDAVAC+MEL+SWLYRTTVIFLVC
Subjt:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC

Query:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
        ILFRLICDLQILRLQDFATVFQVDSDVGSVL+EHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLL+TTKSSS LNIYIAGELALCSMTLLTSLMILL
Subjt:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL

Query:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAA-GPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFE
        RSATKITHKAQSVT+LAAKWHVCATL+SFDVTDGETPMAA A +A G  +FPVT RG EES+ ++GCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFE
Subjt:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAA-GPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFE

Query:  NNRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
        NNRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
Subjt:  NNRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS

XP_038898198.1 uncharacterized protein LOC120085939 [Benincasa hispida]1.7e-21990.89Show/hide
Query:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL
        MGD+NREALIS+K+SVF RS SHAHDEL+SFRSYLRWMCVDQSDIWTAGLSWSMF LFAIIVPATSHF+LACSSCDS+HARPFDRVVQLSLSS+ATVSFL
Subjt:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL

Query:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC
        CLS FIRRYGLRRFLFFDKLC+ESETVR GYTNK NRSLRVLS FVIPCFAAESAYKIWWYASGASQIPFLGNVIVSD VAC MEL+SWLYRTT+IFLVC
Subjt:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC

Query:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
        ILFRLICDLQILRLQDFATVFQVDSDV SVL+EHLRIRRHLRIISHRYRAFIL SL+LVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
Subjt:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL

Query:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
        RSATKITHKAQSVTSLAAKWHVCATL+SFDVTDGETPMA+  Q    QVFP T  GG+ES   +GCDEED+LDNTKLIPAYAYSTISFQKRQALVTYFEN
Subjt:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN

Query:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
        NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
Subjt:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS

TrEMBL top hitse value%identityAlignment
A0A0A0K352 Uncharacterized protein5.0e-21790.66Show/hide
Query:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL
        MGD+NREALIS+K+SVF RSVSHAHDEL SFRSYLRWMCVDQSDIWTAGLSWSMF LFAIIVPATSHF+LACSSCDS+HARPFDRVVQLSLSSVATVSFL
Subjt:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL

Query:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC
        CLS+FIRRYGLRRFLFFDKLC+ESETVRRGYT K NRSLRVLS FVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVAC MEL+SWLYRTTVIFLVC
Subjt:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC

Query:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
        ILFRLICDLQILRLQDFATVFQVDSDV SVL+EHLRIRRHLRIISHRYR FIL SL+LVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
Subjt:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL

Query:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
        RSATKITHKAQSVT+LAAKWHVCATL+SFDVTDGETPMA+       Q FP     GEES+ D+GCD ED+LDNTKLIPAYAYSTISFQKRQALVTYFEN
Subjt:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN

Query:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
        NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
Subjt:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS

A0A1S3BI23 uncharacterized protein LOC1034901253.8e-21790.66Show/hide
Query:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL
        MGD+NREALI++K+SVF RSVSHAHDEL SFRSYLRWMCVDQSDIWTAGLSWSMF LFAIIVPATSHFLLACSSCDS+HARPFDRVVQLSLSSVATVSFL
Subjt:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL

Query:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC
        CLS+FIRRYGLRRFLFFDKLC+ESETVRRGYT KLNRSLRVLS FVIPCFAAESAYKIWWYASGASQIPFLGNVIVSD VAC MEL+SWLYRTTVIFLVC
Subjt:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC

Query:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
        ILFRLICDLQILRLQDFATVFQVDSDV SVL+EHLRIRRHLRIISHRYR FIL SL+LVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
Subjt:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL

Query:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
        RSATKITHKAQSVT+LAAKWHVCATL+SFDVTDGETPMA+       Q FP     GEES+ D+GCD ED+LDNTKLIPAYAYSTISFQKRQALVTYFEN
Subjt:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN

Query:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
        NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
Subjt:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS

A0A6J1CCA0 uncharacterized protein LOC1110103619.7e-21388.89Show/hide
Query:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL
        MGDN  EAL+++    + RS SHAHDEL+SFRSYLRWMCVDQSDIWTAGLSWSMF LFA+IVPATSHFLLAC+SCDS+HARPFDRVVQLSLS VATVSFL
Subjt:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL

Query:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC
        CLSNFIRRYGLRRFLFFDKLC+ESETVRRGYT KLNRSLRVLS FVIPCFAAESAYKIWWYASGASQIPFLGNVIVSD VAC MEL+SWLYRTTVIFLVC
Subjt:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC

Query:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
        ILFRLICDLQILRLQDFA+VFQVDSDVGSVL+EHLRIRRHLRIISHRYRAFIL +L+LVTGSQF SLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
Subjt:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL

Query:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTR--GGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYF
        RSATKITHKAQSVTSLAAKWHVCATL+SFDVTDGETPMAA     G  VFPVT    G  E+D D+GCDEED+LDNTKLIPAYAYSTISFQKRQALVTYF
Subjt:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTR--GGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYF

Query:  ENNRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
        ENNRAGITIYGFTLDR+TLHTIFGIELSLVLWLLGKTIGFS
Subjt:  ENNRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS

A0A6J1FK66 uncharacterized protein LOC1114461732.7e-22391.8Show/hide
Query:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL
        MGD +REALIS+KA VFNRS SHA DEL SFRSYLRWMCVDQSDIW+AGLSWS+F LFAI+VPATSHF LACSSCDSHHARPFDRVVQLSLSSVATVSFL
Subjt:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL

Query:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC
        CLSNFIRRYGLRRFLFFD+LC+ESETVRRGYTNK NRSLRVLSAFV+PCFAAESAYKIWWYASGASQIPFLGNVIVSDAVAC+MEL+SWLYRTTVIFLVC
Subjt:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC

Query:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
        ILFRLICDLQILRLQDFATVFQVDSDVGSVL+EHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLL+TTKSSS LNIYIAGELALCSMTLLTSLMILL
Subjt:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL

Query:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
        RSATKITHKAQSVT+LAAKWHVCATL+SFDVTDGETPMAA A   G  +FPVT RG EES+ ++GCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
Subjt:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN

Query:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
        NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
Subjt:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS

A0A6J1IU24 uncharacterized protein LOC1114805833.2e-22492.26Show/hide
Query:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL
        MGD +REALIS+KA VFNRS SHA DEL SFRSYLRWMCVDQSDIW+AGLSWS+F LFAI+VPATSHF LACSSCDSHHARPFDRVVQLSLSSVATVSFL
Subjt:  MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFL

Query:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC
        CLSNFIRRYGLRRFLFFD+LC+ESETVRRGYTNK NRSLRVLSAFV+PCFAAESAYKIWWYASGASQIPFLGNVIVSDAVAC+MEL+SWLYRTTVIFLVC
Subjt:  CLSNFIRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVC

Query:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL
        ILFRLICDLQILRLQDFATVFQVDSDVGSVL+EHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLL+TTKSSS LNIYIAGELALCSMTLLTSLMILL
Subjt:  ILFRLICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILL

Query:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
        RSATKITHKAQSVT+LAAKWHVCATL+SFDVTDGETPMAA A A G  +FPVT RG EESD ++GCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
Subjt:  RSATKITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN

Query:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
        NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
Subjt:  NRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)3.6e-17573.55Show/hide
Query:  VFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFL
        +FNR VSH  DEL SFR YLRWMCVD S  WTA LSW+MF++F ++VPA SHFLLAC+ CDS+H+RP+D VVQLSLSSVATVSFLCL+ F+ +YGLRRFL
Subjt:  VFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFL

Query:  FFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVCILFRLICDLQILRLQ
        FFDKL +ESETVRR YTN+LN SL ++S FVIPCF+A SAYKIWWYASG S+IPFLGN ++SD VAC MEL SWLYRTTVIFLVC+LFRLIC LQILRLQ
Subjt:  FFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVCILFRLICDLQILRLQ

Query:  DFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILLRSATKITHKAQSVTS
        DFA +FQ+DSDVGS+L+EHLRIRRHLRIISHRYR+FIL  LILVTGSQF+SLLITTK+ + +NIY AGELALCSMTL+T+L+ILLRSA+KITHKAQ+VT 
Subjt:  DFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILLRSATKITHKAQSVTS

Query:  LAAKWHVCATLESFDVT------DGETP-MAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIY
        LAAKWHVCATLESFD T        ETP + A        V  V T    ESD D+  DEED+LDN  +IP YA+ST+SFQKRQALV+YFENN AGIT+Y
Subjt:  LAAKWHVCATLESFDVT------DGETP-MAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIY

Query:  GFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
        GFTLDR TLHTIFG+ELSLVLWLLGKTIG S
Subjt:  GFTLDRTTLHTIFGIELSLVLWLLGKTIGFS

AT1G50630.2 Protein of unknown function (DUF3537)7.2e-15271.17Show/hide
Query:  VFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFL
        +FNR VSH  DEL SFR YLRWMCVD S  WTA LSW+MF++F ++VPA SHFLLAC+ CDS+H+RP+D VVQLSLSSVATVSFLCL+ F+ +YGLRRFL
Subjt:  VFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFL

Query:  FFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVCILFRLICDLQILRLQ
        FFDKL +ESETVRR YTN+LN SL ++S FVIPCF+A SAYKIWWYASG S+IPFLGN ++SD VAC MEL SWLYRTTVIFLVC+LFRLIC LQILRLQ
Subjt:  FFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVCILFRLICDLQILRLQ

Query:  DFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILLRSATKITHKAQSVTS
        DFA +FQ+DSDVGS+L+EHLRIRRHLRIISHRYR+FIL  LILVTGSQF+SLLITTK+ + +NIY AGELALCSMTL+T+L+ILLRSA+KITHKAQ+VT 
Subjt:  DFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILLRSATKITHKAQSVTS

Query:  LAAKWHVCATLESFDVT------DGETP-MAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN
        LAAKWHVCATLESFD T        ETP + A        V  V T    ESD D+  DEED+LDN  +IP YA+ST+SFQKRQAL    +N
Subjt:  LAAKWHVCATLESFDVT------DGETP-MAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFEN

AT3G20300.1 Protein of unknown function (DUF3537)1.3e-18576.96Show/hide
Query:  REALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNF
        RE LI+++ + F RSVSHA DEL SFR YLRWMCVDQS  WTA LSWSMFV+F ++VPATSHF+LACS CDSHH+RP+D VVQLSLSS A +SFLCLS F
Subjt:  REALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNF

Query:  IRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVCILFRL
        + +YGLRRFLFFDKL +ESETVR GYTN+LNRSL++LS FV PCF A S+YKIWWYASGASQIPFLGNVI+SD VAC MEL SWLYRTTVIFLVC+LFRL
Subjt:  IRRYGLRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVCILFRL

Query:  ICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILLRSATK
        IC LQILRLQDFA VFQ+DSDVGS+L+EHLRIRRHLRIISHRYR FIL SLILVTGSQF SLLITTK+ + LNIY AGELALCSMTL+T+L+ILLRSA+K
Subjt:  ICDLQILRLQDFATVFQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILLRSATK

Query:  ITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGI
        ITHKAQ+VT LAAKWHVCAT+ESF+  DGETP   V +A+G   +P     G ESD +D  DEED+ DN  LIPAYAYSTISFQKRQALV YFENNR+GI
Subjt:  ITHKAQSVTSLAAKWHVCATLESFDVTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGI

Query:  TIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS
        T++GFTLDR+TLHTIFGIE+SLVLWLLGKTIG S
Subjt:  TIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS

AT4G03820.1 Protein of unknown function (DUF3537)4.6e-10651.59Show/hide
Query:  SFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDKLCNESETVRR
        SF     W   DQS+     LSWS+F L A+IVP  SHF+L C+ CD  H RP+D +VQLSLS  A +SF+ LS++ ++YG+RRFLFFDKL + S+ VR 
Subjt:  SFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDKLCNESETVRR

Query:  GYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVD-SDVG
        GY  K+ RS+++L+ FV+P    ++ Y+IWWYASG +QIP++ N  +S  +ACT++L SWLYRT++  + CIL++ IC LQ+LRL +FA  F  +  D  
Subjt:  GYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVD-SDVG

Query:  SVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILLRSATKITHKAQSVTSLAAKWHVCATLES
        S+L EHL+IRR L+I+SHR+R FIL SL  VT +QF +LL T ++S   NIY  GELALCS +L++ L I L+SAT++THKAQSVTS+A KW+VCA+L++
Subjt:  SVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILLRSATKITHKAQSVTSLAAKWHVCATLES

Query:  FDVT-DGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIFGIELSL
        FDV  DGETP          Q+        + SD D+  + +D  ++ ++ P +A   IS QKRQALVTY ENNRAGIT+YGF +D+T L  IF IEL+L
Subjt:  FDVT-DGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIFGIELSL

Query:  VLWLLGKTI
        +LWLL KTI
Subjt:  VLWLLGKTI

AT4G22270.1 Protein of unknown function (DUF3537)5.1e-12157.32Show/hide
Query:  SFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDKLCNESETVRR
        +F S + W   DQS+  TA LSWS+F L  +IVP  SHFLL CS CD HH RP+D +VQLSLS  A +SF+ LS + R++G+RRFLF DKL + S+ VR 
Subjt:  SFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYGLRRFLFFDKLCNESETVRR

Query:  GYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVD-SDVG
         Y  ++ RSL+ L  FV+P    E+ Y+IWWY SG +QIP++ N I+S  VACT++L SWLYR ++  +VCIL+++ C LQ LRL DFA  F  + +DV 
Subjt:  GYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVCILFRLICDLQILRLQDFATVFQVD-SDVG

Query:  SVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILLRSATKITHKAQSVTSLAAKWHVCATLES
        S L EH +IRR+LRI+SHR+R FIL SLILVT +QF +LL TT++S  +NIY  GELALCS++L+T + I LRSATKITHKAQSVTSLAAKW+VCAT++S
Subjt:  SVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILLRSATKITHKAQSVTSLAAKWHVCATLES

Query:  FDVTDGETPMAAVAQAAGPQVFPVTTRGG--EESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIFGIELS
        FD  DGETP  ++ ++       V+ RG   E SD ++G + +D+LDNTK+ P YA +TIS+QKRQALVTY ENN+AGIT+YGF +DR+ L+TIFGIEL+
Subjt:  FDVTDGETPMAAVAQAAGPQVFPVTTRGG--EESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIFGIELS

Query:  LVLWLLGKTI
        L+LWLL KTI
Subjt:  LVLWLLGKTI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGACAACAACAGAGAGGCGTTGATCAGCCAAAAAGCGAGCGTGTTCAACCGCTCGGTCTCTCACGCTCATGACGAATTGAAGAGCTTCAGATCGTATCTCCGATG
GATGTGCGTCGATCAATCGGACATTTGGACGGCCGGATTGTCGTGGTCGATGTTCGTCCTCTTCGCCATCATCGTTCCGGCGACGTCGCATTTCCTTCTGGCTTGTTCTT
CCTGCGATAGCCATCACGCTAGGCCGTTCGATCGCGTCGTGCAATTGTCGCTCAGCAGCGTGGCGACGGTTTCGTTCCTCTGCCTTTCGAATTTCATCAGGAGGTACGGA
CTCAGGAGATTCTTGTTCTTCGATAAGCTCTGCAACGAAAGCGAAACTGTGAGAAGAGGATACACGAACAAGCTCAATAGATCACTGCGTGTACTGTCGGCATTCGTGAT
CCCATGCTTCGCAGCGGAGAGCGCGTACAAGATCTGGTGGTACGCATCAGGCGCGTCGCAAATCCCGTTCCTGGGGAACGTTATAGTGAGCGACGCAGTAGCGTGCACGA
TGGAGCTGGTGTCGTGGCTGTACAGAACGACGGTGATCTTCCTGGTGTGCATCCTGTTCCGTCTGATCTGCGACCTCCAGATCCTACGCCTGCAGGACTTCGCCACCGTG
TTCCAGGTGGACTCCGACGTCGGCTCGGTGCTCACGGAGCATTTGAGGATCCGCCGCCATCTCAGAATCATCAGCCACCGCTACCGCGCCTTCATTCTCTGGTCCCTCAT
CCTCGTCACCGGCAGCCAGTTCACTTCTTTACTTATTACTACCAAGTCCTCTTCCAACCTCAATATCTACATTGCCGGCGAACTTGCGCTATGTTCGATGACGCTTCTTA
CAAGTCTGATGATATTACTAAGGAGTGCAACTAAAATCACTCACAAAGCGCAGTCGGTGACGTCGCTTGCTGCCAAGTGGCACGTGTGTGCCACGTTGGAATCTTTCGAC
GTCACAGACGGCGAGACGCCGATGGCTGCCGTCGCTCAGGCCGCCGGACCGCAGGTTTTTCCGGTGACGACCCGCGGCGGAGAAGAATCGGACGTCGACGACGGTTGCGA
TGAAGAAGATGAGCTGGACAACACCAAATTGATTCCAGCTTACGCTTACAGCACCATCTCCTTCCAAAAGAGACAGGCCTTAGTGACGTATTTCGAGAATAACAGAGCTG
GGATAACGATATACGGGTTTACCCTGGATAGGACTACACTCCACACCATCTTTGGAATTGAGTTATCCTTGGTTCTTTGGCTGCTTGGCAAAACAATTGGTTTTTCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGACAACAACAGAGAGGCGTTGATCAGCCAAAAAGCGAGCGTGTTCAACCGCTCGGTCTCTCACGCTCATGACGAATTGAAGAGCTTCAGATCGTATCTCCGATG
GATGTGCGTCGATCAATCGGACATTTGGACGGCCGGATTGTCGTGGTCGATGTTCGTCCTCTTCGCCATCATCGTTCCGGCGACGTCGCATTTCCTTCTGGCTTGTTCTT
CCTGCGATAGCCATCACGCTAGGCCGTTCGATCGCGTCGTGCAATTGTCGCTCAGCAGCGTGGCGACGGTTTCGTTCCTCTGCCTTTCGAATTTCATCAGGAGGTACGGA
CTCAGGAGATTCTTGTTCTTCGATAAGCTCTGCAACGAAAGCGAAACTGTGAGAAGAGGATACACGAACAAGCTCAATAGATCACTGCGTGTACTGTCGGCATTCGTGAT
CCCATGCTTCGCAGCGGAGAGCGCGTACAAGATCTGGTGGTACGCATCAGGCGCGTCGCAAATCCCGTTCCTGGGGAACGTTATAGTGAGCGACGCAGTAGCGTGCACGA
TGGAGCTGGTGTCGTGGCTGTACAGAACGACGGTGATCTTCCTGGTGTGCATCCTGTTCCGTCTGATCTGCGACCTCCAGATCCTACGCCTGCAGGACTTCGCCACCGTG
TTCCAGGTGGACTCCGACGTCGGCTCGGTGCTCACGGAGCATTTGAGGATCCGCCGCCATCTCAGAATCATCAGCCACCGCTACCGCGCCTTCATTCTCTGGTCCCTCAT
CCTCGTCACCGGCAGCCAGTTCACTTCTTTACTTATTACTACCAAGTCCTCTTCCAACCTCAATATCTACATTGCCGGCGAACTTGCGCTATGTTCGATGACGCTTCTTA
CAAGTCTGATGATATTACTAAGGAGTGCAACTAAAATCACTCACAAAGCGCAGTCGGTGACGTCGCTTGCTGCCAAGTGGCACGTGTGTGCCACGTTGGAATCTTTCGAC
GTCACAGACGGCGAGACGCCGATGGCTGCCGTCGCTCAGGCCGCCGGACCGCAGGTTTTTCCGGTGACGACCCGCGGCGGAGAAGAATCGGACGTCGACGACGGTTGCGA
TGAAGAAGATGAGCTGGACAACACCAAATTGATTCCAGCTTACGCTTACAGCACCATCTCCTTCCAAAAGAGACAGGCCTTAGTGACGTATTTCGAGAATAACAGAGCTG
GGATAACGATATACGGGTTTACCCTGGATAGGACTACACTCCACACCATCTTTGGAATTGAGTTATCCTTGGTTCTTTGGCTGCTTGGCAAAACAATTGGTTTTTCTTAG
Protein sequenceShow/hide protein sequence
MGDNNREALISQKASVFNRSVSHAHDELKSFRSYLRWMCVDQSDIWTAGLSWSMFVLFAIIVPATSHFLLACSSCDSHHARPFDRVVQLSLSSVATVSFLCLSNFIRRYG
LRRFLFFDKLCNESETVRRGYTNKLNRSLRVLSAFVIPCFAAESAYKIWWYASGASQIPFLGNVIVSDAVACTMELVSWLYRTTVIFLVCILFRLICDLQILRLQDFATV
FQVDSDVGSVLTEHLRIRRHLRIISHRYRAFILWSLILVTGSQFTSLLITTKSSSNLNIYIAGELALCSMTLLTSLMILLRSATKITHKAQSVTSLAAKWHVCATLESFD
VTDGETPMAAVAQAAGPQVFPVTTRGGEESDVDDGCDEEDELDNTKLIPAYAYSTISFQKRQALVTYFENNRAGITIYGFTLDRTTLHTIFGIELSLVLWLLGKTIGFS