; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC05g0006 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC05g0006
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionNAC domain-containing protein 101-like
Genome locationMC05:52471..55594
RNA-Seq ExpressionMC05g0006
SyntenyMC05g0006
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0016020 - membrane (cellular component)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR003441 - NAC domain
IPR036093 - NAC domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK10206.1 protein CUP-SHAPED COTYLEDON 3-like [Cucumis melo var. makuwa]4.95e-20561.97Show/hide
Query:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG
        MANC      +L+PVGFRFHPTD+EL NHYLKNK++G+ESLVQYI QVDIC ++PWELPSLSN  TG+ QWFFFSAQDFKYSNGRRSNR T TGYWKSTG
Subjt:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG

Query:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN
        KDRKIMA GTK LIGTKKTLVF+ GRV +GI+TNWVIHEYHLH D N   L+SFVIC LKRK +E+DVL+  EAE NG++ ST  NV +T ++ +E  GN
Subjt:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN

Query:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQ---GFSG
        G SLFQ DLQV DY   +L+S LFSDPEPTSMDFQ  NSYGTH+NG +DED      VN +LVDDENCF EGTPNSS ++FN+EE+L L++     GFSG
Subjt:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQ---GFSG

Query:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-
        +TG+++A  W+ DHAS S  GE   SR+QT+S+PMI++SRRSPLTSKIL+                         H DDSD+       Q  PKSHK + 
Subjt:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-

Query:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC
                Q +RET ++VVSS  KA+ VPKRIK+V+P  TSNKD + DQ+SEVEN    R+KSTG G L+S  +CSSSILTTKCI HK SPAS Y ARAC
Subjt:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC

Query:  IGFILFIMVARQALLYGN
        +GFILFI VAR+ LLYGN
Subjt:  IGFILFIMVARQALLYGN

XP_008450706.1 PREDICTED: protein CUP-SHAPED COTYLEDON 3-like [Cucumis melo]1.73e-20562.16Show/hide
Query:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG
        MANC      +L+PVGFRFHPTD+EL NHYLKNK++G+ESLVQYI QVDIC ++PWELPSLSN  TG+ QWFFFSAQDFKYSNGRRSNR T TGYWKSTG
Subjt:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG

Query:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN
        KDRKIMA GTK LIGTKKTLVF+ GRV +GI+TNWVIHEYHLH D N   L+SFVIC LKRK +E+DVL+  EAE NG++ ST  NV +T ++ +E  GN
Subjt:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN

Query:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQ---GFSG
        G SLFQ DLQV DY   +L+S LFSDPEPTSMDFQ  NSYGTH+NG +DED      VN +LVDDENCF EGTPNSS ++FN+EE+L L++     GFSG
Subjt:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQ---GFSG

Query:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-
        +TG+++A  W+ DHAS S  GE   SR+QT+S+PMI++SRRSPLTSKIL+                         H DDSD+       Q  PKSHK + 
Subjt:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-

Query:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC
                Q +RETQ++VVSS  KA+ VPKRIK+V+P  TSNKD + DQ+SEVEN    R+KSTG G L+S  +CSSSILTTKCI HK SPAS Y ARAC
Subjt:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC

Query:  IGFILFIMVARQALLYGN
        +GFILFI VAR+ LLYGN
Subjt:  IGFILFIMVARQALLYGN

XP_022144282.1 NAC domain-containing protein 101-like [Momordica charantia]0.0100Show/hide
Query:  MANCTHRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDR
        MANCTHRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDR
Subjt:  MANCTHRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDR

Query:  KIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGNGES
        KIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGNGES
Subjt:  KIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGNGES

Query:  LFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQGFSGDTGMNS
        LFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQGFSGDTGMNS
Subjt:  LFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQGFSGDTGMNS

Query:  AFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQGPKSHKVVAQLERETQ
        AFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQGPKSHKVVAQLERETQ
Subjt:  AFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQGPKSHKVVAQLERETQ

Query:  LRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARACIGFILFIMVARQALL
        LRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARACIGFILFIMVARQALL
Subjt:  LRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARACIGFILFIMVARQALL

Query:  YGNW
        YGNW
Subjt:  YGNW

XP_022974303.1 uncharacterized protein LOC111472945 isoform X1 [Cucurbita maxima]1.34e-20362.62Show/hide
Query:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG
        MA+C      DL PVGFRFHPTD+ELINHYLKNKMLG+ESLV YI QVDICKY+PW+LP LSN  T EQ+WFFF+AQD KYSN RRSNR T TGYWKSTG
Subjt:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG

Query:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN
        KDRKI+AP TK LIGTKKTLVF+ GRV NGIRTNWVIHEYHLHSD  F  LR FVIC LK+K DENDVLIC+EAE NG++N TF N + ++++ +EI GN
Subjt:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN

Query:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVS-HQG--FSG
        G+SLF P+LQV  YDF  L+S LF D EP SMDFQA NSYGT V+  +DE+ID EE             HEGTPNSS + FN++ELL  V   QG    G
Subjt:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVS-HQG--FSG

Query:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-
        DTGM +  + Q +  S S+  EH+ SRV TQ+ P+I++SRRSPLTSKI KYGGE+ +SS RPR D LQ N I SY  DDSD+E  TLR Q  P+S KVV 
Subjt:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-

Query:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC
                Q +R+TQL+VVSS DKAK V K  KV     TSNKD + D+++EV++ +A++  STGRG +ESR K  SSILTTKCIHH+SS ASAYVAR C
Subjt:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC

Query:  IGFILFIMVARQALLYGNW
        IG ILF +VAR  LL GNW
Subjt:  IGFILFIMVARQALLYGNW

XP_038878284.1 NAC domain-containing protein 101-like [Benincasa hispida]3.75e-21465.62Show/hide
Query:  RDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPG
        ++L PVGFRFHPTD+EL NHYLKNK++G+E LVQYI QVDIC Y+PWELP LSN  TG+QQWFFFSAQDFKYSNGRRSNR T TGYWKSTGKDRKI+A G
Subjt:  RDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPG

Query:  TKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGNGESLFQPDL
        TK LIGTKKTLVF+ GRV NG RTNWVIHEYHLH D N   L+SFVIC LKRK DE+DVL+C EAE NG++ S+  N+ S  ++ Q I GNG+SLFQ DL
Subjt:  TKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGNGESLFQPDL

Query:  QVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNY--EELLHLVSHQ---GFSGDTGMNSA
        QV DYD  +LQSPLFSDPEPTSMDFQ  NSY THVNG +DED      VNSILVDDENC+HEGT NSSI DFN+  EEL  LV+     G  GDTGM++A
Subjt:  QVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNY--EELLHLVSHQ---GFSGDTGMNSA

Query:  FDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV--------
          WQ DHAS S+  E   SR+QT  MPMI++SRRSPLTSKI KYG  K         D LQ NC+  YH DDSDKE      Q  PKSHKV+        
Subjt:  FDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV--------

Query:  AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSL-ANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARACIGFILF
         Q   ETQ +VV S  KA+ VPKRIKV +   TSN+DS+EDQ+SEV+N + A   KSTG G  ES  KC SSILTTKCIHHK SPAS Y ARAC+GFILF
Subjt:  AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSL-ANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARACIGFILF

Query:  IMVARQALLYGN
        I +ARQ LLYGN
Subjt:  IMVARQALLYGN

TrEMBL top hitse value%identityAlignment
A0A1S3BQH1 protein CUP-SHAPED COTYLEDON 3-like8.39e-20662.16Show/hide
Query:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG
        MANC      +L+PVGFRFHPTD+EL NHYLKNK++G+ESLVQYI QVDIC ++PWELPSLSN  TG+ QWFFFSAQDFKYSNGRRSNR T TGYWKSTG
Subjt:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG

Query:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN
        KDRKIMA GTK LIGTKKTLVF+ GRV +GI+TNWVIHEYHLH D N   L+SFVIC LKRK +E+DVL+  EAE NG++ ST  NV +T ++ +E  GN
Subjt:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN

Query:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQ---GFSG
        G SLFQ DLQV DY   +L+S LFSDPEPTSMDFQ  NSYGTH+NG +DED      VN +LVDDENCF EGTPNSS ++FN+EE+L L++     GFSG
Subjt:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQ---GFSG

Query:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-
        +TG+++A  W+ DHAS S  GE   SR+QT+S+PMI++SRRSPLTSKIL+                         H DDSD+       Q  PKSHK + 
Subjt:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-

Query:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC
                Q +RETQ++VVSS  KA+ VPKRIK+V+P  TSNKD + DQ+SEVEN    R+KSTG G L+S  +CSSSILTTKCI HK SPAS Y ARAC
Subjt:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC

Query:  IGFILFIMVARQALLYGN
        +GFILFI VAR+ LLYGN
Subjt:  IGFILFIMVARQALLYGN

A0A5D3CIK9 Protein CUP-SHAPED COTYLEDON 3-like2.40e-20561.97Show/hide
Query:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG
        MANC      +L+PVGFRFHPTD+EL NHYLKNK++G+ESLVQYI QVDIC ++PWELPSLSN  TG+ QWFFFSAQDFKYSNGRRSNR T TGYWKSTG
Subjt:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG

Query:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN
        KDRKIMA GTK LIGTKKTLVF+ GRV +GI+TNWVIHEYHLH D N   L+SFVIC LKRK +E+DVL+  EAE NG++ ST  NV +T ++ +E  GN
Subjt:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN

Query:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQ---GFSG
        G SLFQ DLQV DY   +L+S LFSDPEPTSMDFQ  NSYGTH+NG +DED      VN +LVDDENCF EGTPNSS ++FN+EE+L L++     GFSG
Subjt:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQ---GFSG

Query:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-
        +TG+++A  W+ DHAS S  GE   SR+QT+S+PMI++SRRSPLTSKIL+                         H DDSD+       Q  PKSHK + 
Subjt:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-

Query:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC
                Q +RET ++VVSS  KA+ VPKRIK+V+P  TSNKD + DQ+SEVEN    R+KSTG G L+S  +CSSSILTTKCI HK SPAS Y ARAC
Subjt:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC

Query:  IGFILFIMVARQALLYGN
        +GFILFI VAR+ LLYGN
Subjt:  IGFILFIMVARQALLYGN

A0A6J1CT89 NAC domain-containing protein 101-like0.0100Show/hide
Query:  MANCTHRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDR
        MANCTHRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDR
Subjt:  MANCTHRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDR

Query:  KIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGNGES
        KIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGNGES
Subjt:  KIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGNGES

Query:  LFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQGFSGDTGMNS
        LFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQGFSGDTGMNS
Subjt:  LFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQGFSGDTGMNS

Query:  AFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQGPKSHKVVAQLERETQ
        AFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQGPKSHKVVAQLERETQ
Subjt:  AFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQGPKSHKVVAQLERETQ

Query:  LRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARACIGFILFIMVARQALL
        LRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARACIGFILFIMVARQALL
Subjt:  LRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARACIGFILFIMVARQALL

Query:  YGNW
        YGNW
Subjt:  YGNW

A0A6J1IAW9 NAC domain-containing protein 53-like3.86e-20262.43Show/hide
Query:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG
        MA+C      DL PVGFRFHPTD+ELINHYLKNKMLG+ESLV YI QVDICKY+PW+LP LSN  T EQ+WFFF+AQD KYSN RRSNR T TGYWKSTG
Subjt:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG

Query:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN
        KDRKI+AP TK LIGTKKTLVF+ GRV NGIRTNWVIHEYHLHSD  F  LR FVIC LK+K DENDVLIC+EAE NG++N TF N  S +    +I GN
Subjt:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN

Query:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVS-HQG--FSG
        G+SLF P+LQV  YDF  L+S LF D EP SMDFQA NSYGT V+  +DE+ID EE             HEGTPNSS + FN++ELL  V   QG    G
Subjt:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVS-HQG--FSG

Query:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-
        DTGM +  + Q +  S S+  EH+ SRV TQ+ P+I++SRRSPLTSKI KYGGE+ +SS RPR D LQ N I SY  DDSD+E  TLR Q  P+S KVV 
Subjt:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-

Query:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC
                Q +R++QL+VVSS DKAK V K  KV     TSNKD + D+++EV++ +A++  STGRG +ESR K  SSILTTKCIHH+SS ASAYVAR C
Subjt:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC

Query:  IGFILFIMVARQALLYGNW
        IG ILF +VAR  LL GNW
Subjt:  IGFILFIMVARQALLYGNW

A0A6J1IFU1 uncharacterized protein LOC111472945 isoform X16.47e-20462.62Show/hide
Query:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG
        MA+C      DL PVGFRFHPTD+ELINHYLKNKMLG+ESLV YI QVDICKY+PW+LP LSN  T EQ+WFFF+AQD KYSN RRSNR T TGYWKSTG
Subjt:  MANCT---HRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTG

Query:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN
        KDRKI+AP TK LIGTKKTLVF+ GRV NGIRTNWVIHEYHLHSD  F  LR FVIC LK+K DENDVLIC+EAE NG++N TF N + ++++ +EI GN
Subjt:  KDRKIMAPGTKTLIGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGN

Query:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVS-HQG--FSG
        G+SLF P+LQV  YDF  L+S LF D EP SMDFQA NSYGT V+  +DE+ID EE             HEGTPNSS + FN++ELL  V   QG    G
Subjt:  GESLFQPDLQVLDYDFPDLQSPLFSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVS-HQG--FSG

Query:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-
        DTGM +  + Q +  S S+  EH+ SRV TQ+ P+I++SRRSPLTSKI KYGGE+ +SS RPR D LQ N I SY  DDSD+E  TLR Q  P+S KVV 
Subjt:  DTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMIRKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQG-PKSHKVV-

Query:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC
                Q +R+TQL+VVSS DKAK V K  KV     TSNKD + D+++EV++ +A++  STGRG +ESR K  SSILTTKCIHH+SS ASAYVAR C
Subjt:  -------AQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVENSLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARAC

Query:  IGFILFIMVARQALLYGNW
        IG ILF +VAR  LL GNW
Subjt:  IGFILFIMVARQALLYGNW

SwissProt top hitse value%identityAlignment
B5X570 NAC domain-containing protein 142.4e-4250Show/hide
Query:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL
        P+GFRF PTD+ELINHYL+ K+ G++  V+ IP++D+CK++PW+LP LS   T +Q+WFFF  +D KY +G RSNR T  GYWK+TGKDR I +   K +
Subjt:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL

Query:  IGTKKTLVFHRGRVLNGIRTNWVIHEYHL---HSDANFTNLRSFVICRLKRK-SDENDVLICNEAESNGIVNST
        IG KKTLVF+RGR   G RTNW++HEY       D        +V+CRL  K SD  D   C E E      +T
Subjt:  IGTKKTLVFHRGRVLNGIRTNWVIHEYHL---HSDANFTNLRSFVICRLKRK-SDENDVLICNEAESNGIVNST

F4JN35 Protein NTM1-like 91.1e-4253.55Show/hide
Query:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL
        P+GFRF PTD+EL+NHYL+ K+ G+ S V+ IP +D+CK++PW+LP+LS   T + +WFFF  +D KY NG RSNR T +GYWK+TGKDR I +   KTL
Subjt:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL

Query:  IGTKKTLVFHRGRVLNGIRTNWVIHEYH---LHSDANFTNLRSFVICRLKRKSDE
        IG KKTLVF+RGR   G RTNW++HEY       D        +V+CRL  K D+
Subjt:  IGTKKTLVFHRGRVLNGIRTNWVIHEYH---LHSDANFTNLRSFVICRLKRKSDE

Q9FFI5 NAC domain-containing protein 861.3e-3750.33Show/hide
Query:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL
        P GFRFHPTD+ELI +YLK K+ G+E  ++ IP+VD+ K +PW+LP  S   + +Q+WFFFS +D KY NG R+NR T  GYWK+TGKDR++        
Subjt:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL

Query:  IGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSD----ANFTNLRSFVICRLKRK
        IGTKKTLV++RGR  +GIRT WV+HEY L       + F    ++ +CR+ +K
Subjt:  IGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSD----ANFTNLRSFVICRLKRK

Q9SCK6 NAC domain-containing protein 626.0e-3844.56Show/hide
Query:  DLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGT
        D  PVG RF PTD+ELI +YL+ K+ G +  V+ I ++DICK++PW+LP  S   T + +W +F   D KY +G R NR T  GYWK+TGKDRKI + G 
Subjt:  DLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGT

Query:  KTLIGTKKTLVFHRGRVLNGIRTNWVIHEYH-LHSDANFTN--LRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKN--VQSTMDEIQEIL
          +IG K+TLVFH GR   G RTNW+IHEY     D + TN     FVIC+L +K  E  VL   +++S+ +      +  V+ T  E+ E++
Subjt:  KTLIGTKKTLVFHRGRVLNGIRTNWVIHEYH-LHSDANFTN--LRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKN--VQSTMDEIQEIL

Q9SQX9 NAC domain containing protein 507.8e-3851.63Show/hide
Query:  GFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTLIG
        GFRFHPTD+EL+++YLK K+LGK      I +VDI K++PW+L   S   T +Q+W+FFSA D KY NG R NR T  GYWK+TGKDR+I       L+G
Subjt:  GFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTLIG

Query:  TKKTLVFHRGRVLNGIRTNWVIHEYHL---HSDANFTNLR-SFVICRLKRKSD
         KKTLVFH GR  +G+RTNWV+HEY L    ++ N + L+ ++V+CR+  K++
Subjt:  TKKTLVFHRGRVLNGIRTNWVIHEYHL---HSDANFTNLR-SFVICRLKRKSD

Arabidopsis top hitse value%identityAlignment
AT1G33060.1 NAC 0141.7e-4350Show/hide
Query:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL
        P+GFRF PTD+ELINHYL+ K+ G++  V+ IP++D+CK++PW+LP LS   T +Q+WFFF  +D KY +G RSNR T  GYWK+TGKDR I +   K +
Subjt:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL

Query:  IGTKKTLVFHRGRVLNGIRTNWVIHEYHL---HSDANFTNLRSFVICRLKRK-SDENDVLICNEAESNGIVNST
        IG KKTLVF+RGR   G RTNW++HEY       D        +V+CRL  K SD  D   C E E      +T
Subjt:  IGTKKTLVFHRGRVLNGIRTNWVIHEYHL---HSDANFTNLRSFVICRLKRK-SDENDVLICNEAESNGIVNST

AT1G33060.2 NAC 0141.7e-4350Show/hide
Query:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL
        P+GFRF PTD+ELINHYL+ K+ G++  V+ IP++D+CK++PW+LP LS   T +Q+WFFF  +D KY +G RSNR T  GYWK+TGKDR I +   K +
Subjt:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL

Query:  IGTKKTLVFHRGRVLNGIRTNWVIHEYHL---HSDANFTNLRSFVICRLKRK-SDENDVLICNEAESNGIVNST
        IG KKTLVF+RGR   G RTNW++HEY       D        +V+CRL  K SD  D   C E E      +T
Subjt:  IGTKKTLVFHRGRVLNGIRTNWVIHEYHL---HSDANFTNLRSFVICRLKRK-SDENDVLICNEAESNGIVNST

AT4G35580.1 NAC transcription factor-like 97.5e-4453.55Show/hide
Query:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL
        P+GFRF PTD+EL+NHYL+ K+ G+ S V+ IP +D+CK++PW+LP+LS   T + +WFFF  +D KY NG RSNR T +GYWK+TGKDR I +   KTL
Subjt:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL

Query:  IGTKKTLVFHRGRVLNGIRTNWVIHEYH---LHSDANFTNLRSFVICRLKRKSDE
        IG KKTLVF+RGR   G RTNW++HEY       D        +V+CRL  K D+
Subjt:  IGTKKTLVFHRGRVLNGIRTNWVIHEYH---LHSDANFTNLRSFVICRLKRKSDE

AT4G35580.2 NAC transcription factor-like 97.5e-4453.55Show/hide
Query:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL
        P+GFRF PTD+EL+NHYL+ K+ G+ S V+ IP +D+CK++PW+LP+LS   T + +WFFF  +D KY NG RSNR T +GYWK+TGKDR I +   KTL
Subjt:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL

Query:  IGTKKTLVFHRGRVLNGIRTNWVIHEYH---LHSDANFTNLRSFVICRLKRKSDE
        IG KKTLVF+RGR   G RTNW++HEY       D        +V+CRL  K D+
Subjt:  IGTKKTLVFHRGRVLNGIRTNWVIHEYH---LHSDANFTNLRSFVICRLKRKSDE

AT4G35580.3 NAC transcription factor-like 97.5e-4453.55Show/hide
Query:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL
        P+GFRF PTD+EL+NHYL+ K+ G+ S V+ IP +D+CK++PW+LP+LS   T + +WFFF  +D KY NG RSNR T +GYWK+TGKDR I +   KTL
Subjt:  PVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL

Query:  IGTKKTLVFHRGRVLNGIRTNWVIHEYH---LHSDANFTNLRSFVICRLKRKSDE
        IG KKTLVF+RGR   G RTNW++HEY       D        +V+CRL  K D+
Subjt:  IGTKKTLVFHRGRVLNGIRTNWVIHEYH---LHSDANFTNLRSFVICRLKRKSDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAATTGCACCCATCGGGATCTATGGCCGGTTGGATTCCGATTCCATCCAACAGACGACGAGCTAATTAATCACTACCTGAAGAACAAGATGCTTGGTAAGGAATC
GCTTGTTCAATACATCCCTCAAGTTGATATCTGCAAGTATGATCCATGGGAACTCCCCAGCCTATCAAACGGCGATACTGGTGAGCAGCAGTGGTTTTTCTTCTCTGCGC
AAGATTTCAAGTATTCCAATGGCCGTCGATCCAATAGGACCACAGGAACCGGCTACTGGAAATCCACGGGCAAGGACCGCAAAATCATGGCTCCAGGAACCAAGACGTTG
ATTGGTACCAAAAAAACTCTGGTCTTCCACCGCGGCCGGGTTTTAAATGGGATCAGGACTAATTGGGTCATACATGAGTACCATCTTCACTCGGACGCCAACTTCACGAA
CCTGAGGTCGTTCGTTATTTGTCGCCTAAAGAGAAAGTCAGACGAGAATGATGTTTTGATATGTAATGAAGCTGAATCAAACGGTATTGTGAATTCCACCTTCAAAAATG
TGCAAAGCACAATGGACGAAATCCAAGAGATTTTAGGAAATGGAGAGTCATTGTTTCAACCGGACCTCCAGGTTTTGGATTATGATTTTCCCGATTTGCAGTCCCCATTG
TTTTCTGACCCCGAGCCTACTTCGATGGATTTCCAAGCTACCAATAGCTATGGGACTCACGTAAATGGAGCTTCTGATGAGGACATTGATACTGAGGAATTTGTAAACTC
AATCCTTGTTGATGATGAAAATTGTTTCCATGAAGGGACACCAAATAGTTCGATCAATGATTTCAACTATGAGGAATTGCTCCACTTGGTTAGTCATCAGGGATTCAGCG
GTGACACTGGCATGAATTCAGCTTTCGATTGGCAGAATGATCATGCTTCTCGCAGCTTGTTGGGTGAACATACCCGTTCAAGGGTACAAACTCAAAGTATGCCGATGATT
AGAAAATCACGTAGAAGCCCACTCACTTCAAAGATTCTCAAGTACGGAGGTGAAAAAGTTGTTTCCTCTCCCCGCCCTCGTGAAGATACACTTCAAATCAATTGCATCTT
TTCATACCACGGTGATGACTCTGACAAAGAAAGAACAACTTTGAGACCTCAAGGACCTAAATCCCATAAGGTTGTAGCCCAATTGGAGAGGGAAACTCAACTCCGAGTAG
TATCATCCCATGACAAGGCAAAACCAGTTCCTAAACGGATTAAAGTTGTCAAACCAGTCAAAACATCTAACAAAGATTCTATTGAAGACCAAAAAAGTGAAGTTGAGAAT
AGTTTAGCCAACCGGATCAAGTCAACTGGTCGTGGGTGTCTTGAGAGTCGAGTAAAGTGTTCCTCAAGTATCCTGACTACAAAATGCATTCACCACAAATCAAGTCCAGC
ATCAGCTTATGTTGCCAGAGCATGTATAGGCTTCATTTTGTTCATCATGGTTGCAAGACAAGCGCTGCTGTATGGAAATTGGTGA
mRNA sequenceShow/hide mRNA sequence
CTTTTCTTTAGCTTCTTCACCAAGTAGAAGACCTCCGCTGGAGTTGGTCAATTTCCAAGTCAATGGAGTTCCCTGTATTTAATGCTCTCCTGCATCCCCACTTCTCTACA
CTCGCTTCCTTCCTGCTCCTCTACTCACTTCTTCTTCCTACAAACCATCCACCCCGAAGCTTTTCGTCAAATACATGGCCAATTGCACCCATCGGGATCTATGGCCGGTT
GGATTCCGATTCCATCCAACAGACGACGAGCTAATTAATCACTACCTGAAGAACAAGATGCTTGGTAAGGAATCGCTTGTTCAATACATCCCTCAAGTTGATATCTGCAA
GTATGATCCATGGGAACTCCCCAGCCTATCAAACGGCGATACTGGTGAGCAGCAGTGGTTTTTCTTCTCTGCGCAAGATTTCAAGTATTCCAATGGCCGTCGATCCAATA
GGACCACAGGAACCGGCTACTGGAAATCCACGGGCAAGGACCGCAAAATCATGGCTCCAGGAACCAAGACGTTGATTGGTACCAAAAAAACTCTGGTCTTCCACCGCGGC
CGGGTTTTAAATGGGATCAGGACTAATTGGGTCATACATGAGTACCATCTTCACTCGGACGCCAACTTCACGAACCTGAGGTCGTTCGTTATTTGTCGCCTAAAGAGAAA
GTCAGACGAGAATGATGTTTTGATATGTAATGAAGCTGAATCAAACGGTATTGTGAATTCCACCTTCAAAAATGTGCAAAGCACAATGGACGAAATCCAAGAGATTTTAG
GAAATGGAGAGTCATTGTTTCAACCGGACCTCCAGGTTTTGGATTATGATTTTCCCGATTTGCAGTCCCCATTGTTTTCTGACCCCGAGCCTACTTCGATGGATTTCCAA
GCTACCAATAGCTATGGGACTCACGTAAATGGAGCTTCTGATGAGGACATTGATACTGAGGAATTTGTAAACTCAATCCTTGTTGATGATGAAAATTGTTTCCATGAAGG
GACACCAAATAGTTCGATCAATGATTTCAACTATGAGGAATTGCTCCACTTGGTTAGTCATCAGGGATTCAGCGGTGACACTGGCATGAATTCAGCTTTCGATTGGCAGA
ATGATCATGCTTCTCGCAGCTTGTTGGGTGAACATACCCGTTCAAGGGTACAAACTCAAAGTATGCCGATGATTAGAAAATCACGTAGAAGCCCACTCACTTCAAAGATT
CTCAAGTACGGAGGTGAAAAAGTTGTTTCCTCTCCCCGCCCTCGTGAAGATACACTTCAAATCAATTGCATCTTTTCATACCACGGTGATGACTCTGACAAAGAAAGAAC
AACTTTGAGACCTCAAGGACCTAAATCCCATAAGGTTGTAGCCCAATTGGAGAGGGAAACTCAACTCCGAGTAGTATCATCCCATGACAAGGCAAAACCAGTTCCTAAAC
GGATTAAAGTTGTCAAACCAGTCAAAACATCTAACAAAGATTCTATTGAAGACCAAAAAAGTGAAGTTGAGAATAGTTTAGCCAACCGGATCAAGTCAACTGGTCGTGGG
TGTCTTGAGAGTCGAGTAAAGTGTTCCTCAAGTATCCTGACTACAAAATGCATTCACCACAAATCAAGTCCAGCATCAGCTTATGTTGCCAGAGCATGTATAGGCTTCAT
TTTGTTCATCATGGTTGCAAGACAAGCGCTGCTGTATGGAAATTGGTGATAAATGTTAGTACCCACGTGGTACAGGTAACAGGTGGTTATACTCGTACTAGCAAGTTACT
ATGTTTCGTAATGTATATGAAGGGAGGTGTCCCGACTCCTAGGCAATCGATCATTCGTAGGCATCATCACCAAATGAGAGAAAAAACGGGACAGAAATGAGGCATGACCT
TCATAAAACGTGCCAGTTTGATTGTTTCTTTTAGTGCCATGGTTGGATTCATGTTGTATAGGCGATGGTGTTCTAGTAGTGATGGCTGATGGGAAAATATTAGATCAAGT
TTTCTGTCTTTTTCTTTCTTTTCCTGTATTCTGCTGTTGATGTGGGAGAAGGGCATCCGCATCTATACTGAAATTTGGTGGCTACTCCAAGATGCAAAGAGGAGAGAGGA
AGAGATCAATCTGGTTTCGAAGGTGCACATCCGACGAGATGCAAATGGCGCGGCCCATGGTATTGTCCGAAGAGCTATGGTGTACACTATGACAGATGAGTGGATTTTTT
AGTTTTCCATCTGGTTAATGGAGTTAGCACGCATTGTTTAGTTTTTTTCTATTTTTGTTCTCAAGAAAGAAAAAAAAAGTACATCTGTTTTATTTAATAGTTTTTTTTTT
TTTAAAATACACTTTGGTTAATGGAGTTAGCACGCATTGTTTAGTTTTTTTCTATTTTTGTTCTCAAGAAAGAAAA
Protein sequenceShow/hide protein sequence
MANCTHRDLWPVGFRFHPTDDELINHYLKNKMLGKESLVQYIPQVDICKYDPWELPSLSNGDTGEQQWFFFSAQDFKYSNGRRSNRTTGTGYWKSTGKDRKIMAPGTKTL
IGTKKTLVFHRGRVLNGIRTNWVIHEYHLHSDANFTNLRSFVICRLKRKSDENDVLICNEAESNGIVNSTFKNVQSTMDEIQEILGNGESLFQPDLQVLDYDFPDLQSPL
FSDPEPTSMDFQATNSYGTHVNGASDEDIDTEEFVNSILVDDENCFHEGTPNSSINDFNYEELLHLVSHQGFSGDTGMNSAFDWQNDHASRSLLGEHTRSRVQTQSMPMI
RKSRRSPLTSKILKYGGEKVVSSPRPREDTLQINCIFSYHGDDSDKERTTLRPQGPKSHKVVAQLERETQLRVVSSHDKAKPVPKRIKVVKPVKTSNKDSIEDQKSEVEN
SLANRIKSTGRGCLESRVKCSSSILTTKCIHHKSSPASAYVARACIGFILFIMVARQALLYGNW