; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009082 (gene) of Snake gourd v1 genome

Gene IDTan0009082
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionNAC domain-containing protein
Genome locationLG10:398822..400728
RNA-Seq ExpressionTan0009082
SyntenyTan0009082
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR003441 - NAC domain
IPR036093 - NAC domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144282.1 NAC domain-containing protein 101-like [Momordica charantia]7.3e-15462.93Show/hide
Query:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG
        MA+C     R LWPVGFRFHPTD+ELINHYLKNK+LG                       LSN  T + +WFFFSAQ+FKYSNGRRSNR T TGYWKSTG
Subjt:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG

Query:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEND-----EAEPNDLVDSTFESV-ATVNQNQEIP
        KDRKIM PGTK LIGTKKTLVF+ GRV NGIRTNWVIHEYHLHSD +F  L  RSFVIC LKRK DEND     EAE N +V+STF++V +T+++ QEI 
Subjt:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEND-----EAEPNDLVDSTFESV-ATVNQNQEIP

Query:  GNGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTE----------ENCLHEGTPNSSISDFNWEELLHSVNQNQGGGF
        GNG+SLFQPD QVLDYDFPDLQSPLFSDPEP SMDFQA N YG   NG +DE+IDTE          ENC HEGTPNSSI+DFN+EELLH V+     GF
Subjt:  GNGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTE----------ENCLHEGTPNSSISDFNWEELLHSVNQNQGGGF

Query:  SGDTGMDATLSWQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCI-SYLSDESDREITTIRPQRQHKSHKV
        SGDTGM++   WQ D AS SL  EHTRSRVQ+QSMP+I++SRRSPLTSKILKYGGE              INCI SY  D+SD+E TT+RPQ   KSHKV
Subjt:  SGDTGMDATLSWQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCI-SYLSDESDREITTIRPQRQHKSHKV

Query:  VNASQSKRETQLQVVSSPDKAKEV------------SNEDIV-----------ACRSKSTTGHGHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARAC
        V  +Q +RETQL+VVSS DKAK V            SN+D +           A R KST   G G +ESR KCSS S LT KCIHHKSS ASAYVARAC
Subjt:  VNASQSKRETQLQVVSSPDKAKEV------------SNEDIV-----------ACRSKSTTGHGHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARAC

Query:  IGLILFVTVARQVLLYGN
        IG ILF+ VARQ LLYGN
Subjt:  IGLILFVTVARQVLLYGN

XP_022974281.1 NAC domain-containing protein 53-like [Cucurbita maxima]2.1e-14561.69Show/hide
Query:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG
        MADCIVLP   L PVGFRFHPTDEELINHYLKNK+LG                       LSND TS+ EWFFF+AQ+ KYSN RRSNRAT TGYWKSTG
Subjt:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG

Query:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEN-----DEAEPNDLVDSTFESVATVNQNQEIPG
        KDRKI+ P TKKLIGTKKTLVFYSGRV NGIRTNWVIHEYHLHSDP F +L P  FVICLLK+K DEN     DEAE N L++ TF++  +  Q   IPG
Subjt:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEN-----DEAEPNDLVDSTFESVATVNQNQEIPG

Query:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLS
        NGQSLF P+ QV  YDF  L+S LF D EPISMDFQA N YG   + PTDEEID EE  LHEGTPNSS S FNW+ELL  V+ +QGG   GDTGM ATL+
Subjt:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLS

Query:  WQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCISYLSDESDREITTIRPQRQHKSHKVVNAS------QS
         Q +  SHS+F EH+ SRV +Q+ PIIKESRRSPLTSKI KYGGE               N ISYLSD+SDRE  T+R Q Q +S KVVN +      Q 
Subjt:  WQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCISYLSDESDREITTIRPQRQHKSHKVVNAS------QS

Query:  KRETQLQVVSSPDKAKEV------------SNEDIVACRS---KSTTGH-----GHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTV
        KR++QLQVVSSPDKAK V            SN+DIV  R    KS   H     G GSVESR K  S S LT KCIHH+SS+ASAYVAR CIGLILF  V
Subjt:  KRETQLQVVSSPDKAKEV------------SNEDIVACRS---KSTTGH-----GHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTV

Query:  ARQVLLYGN
        AR +LL GN
Subjt:  ARQVLLYGN

XP_022974303.1 uncharacterized protein LOC111472945 isoform X1 [Cucurbita maxima]1.3e-15062.67Show/hide
Query:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG
        MADCIVLP   L PVGFRFHPTDEELINHYLKNK+LG                       LSND TS+ EWFFF+AQ+ KYSN RRSNRAT TGYWKSTG
Subjt:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG

Query:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEN-----DEAEPNDLVDSTFESVATVNQNQEIPG
        KDRKI+ P TKKLIGTKKTLVFYSGRV NGIRTNWVIHEYHLHSDP F +L P  FVICLLK+K DEN     DEAE N L++ TF++  +VNQN+EIPG
Subjt:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEN-----DEAEPNDLVDSTFESVATVNQNQEIPG

Query:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLS
        NGQSLF P+ QV  YDF  L+S LF D EPISMDFQA N YG   + PTDEEID EE  LHEGTPNSS S FNW+ELL  V+ +QGG   GDTGM ATL+
Subjt:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLS

Query:  WQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCISYLSDESDREITTIRPQRQHKSHKVVNAS------QS
         Q +  SHS+F EH+ SRV +Q+ PIIKESRRSPLTSKI KYGGE               N ISYLSD+SDRE  T+R Q Q +S KVVN +      Q 
Subjt:  WQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCISYLSDESDREITTIRPQRQHKSHKVVNAS------QS

Query:  KRETQLQVVSSPDKAKEV------------SNEDIVACRS---KSTTGH-----GHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTV
        KR+TQLQVVSSPDKAK V            SN+DIV  R    KS   H     G GSVESR K  S S LT KCIHH+SS+ASAYVAR CIGLILF  V
Subjt:  KRETQLQVVSSPDKAKEV------------SNEDIVACRS---KSTTGH-----GHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTV

Query:  ARQVLLYGN
        AR +LL GN
Subjt:  ARQVLLYGN

XP_022974304.1 NAC domain-containing protein 53-like isoform X2 [Cucurbita maxima]1.1e-14160.51Show/hide
Query:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG
        MADCIVLP   L PVGFRFHPTDEELINHYLKNK+LG                       LSND TS+ EWFFF+AQ+ KYSN RRSNRAT TGYWKSTG
Subjt:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG

Query:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEN-----DEAEPNDLVDSTFESVATVNQNQEIPG
        KDRKI+ P TKKLIGTKKTLVFYSGRV NGIRTNWVIHEYHLHSDP F +L P  FVICLLK+K DEN     DEAE N L++ TF++  +VNQN+E+ G
Subjt:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEN-----DEAEPNDLVDSTFESVATVNQNQEIPG

Query:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLS
                      YDF  L+S LF D EPISMDFQA N YG   + PTDEEID EE  LHEGTPNSS S FNW+ELL  V+ +QGG   GDTGM ATL+
Subjt:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLS

Query:  WQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCISYLSDESDREITTIRPQRQHKSHKVVNAS------QS
         Q +  SHS+F EH+ SRV +Q+ PIIKESRRSPLTSKI KYGGE               N ISYLSD+SDRE  T+R Q Q +S KVVN +      Q 
Subjt:  WQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCISYLSDESDREITTIRPQRQHKSHKVVNAS------QS

Query:  KRETQLQVVSSPDKAKEV------------SNEDIVACRS---KSTTGH-----GHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTV
        KR+TQLQVVSSPDKAK V            SN+DIV  R    KS   H     G GSVESR K  S S LT KCIHH+SS+ASAYVAR CIGLILF  V
Subjt:  KRETQLQVVSSPDKAKEV------------SNEDIVACRS---KSTTGH-----GHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTV

Query:  ARQVLLYGN
        AR +LL GN
Subjt:  ARQVLLYGN

XP_038878284.1 NAC domain-containing protein 101-like [Benincasa hispida]1.0e-14760.78Show/hide
Query:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG
        MA+ ++LPP++L PVGFRFHPTDEEL NHYLKNKI+G                       LSND+T D +WFFFSAQ+FKYSNGRRSNRAT TGYWKSTG
Subjt:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG

Query:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEND-----EAEPNDLVDSTFESVATVNQNQEIPG
        KDRKI+  GTK LIGTKKTLVFYSGRVPNG RTNWVIHEYHLH DP+  +L  +SFVICLLKRK DE+D     EAEPN L+ S+  ++ + NQNQ IPG
Subjt:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEND-----EAEPNDLVDSTFESVATVNQNQEIPG

Query:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDT----EENCLHEGTPNSSISD--FNWEELLHSVNQNQGGGFSGDTG
        NGQSLFQ D QV DYD  +LQSPLFSDPEP SMDFQ  N Y    NGPTDE++++    +ENC HEGT NSSI D  FNWEEL   VN++QGGG  GDTG
Subjt:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDT----EENCLHEGTPNSSISD--FNWEELLHSVNQNQGGGFSGDTG

Query:  MDATLSWQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGG-----EINCISYLSDESDREITTIRPQRQHKSHKVVN---AS---QSKRE
        MD  L WQYD ASHS+FSE   SR+Q+  MP+IKESRRSPLTSKI KYG      + NC+ Y SD+SD+E      Q Q KSHKV+N   AS   QS+ E
Subjt:  MDATLSWQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGG-----EINCISYLSDESDREITTIRPQRQHKSHKVVN---AS---QSKRE

Query:  TQLQVVSSPDKAKEV------------SNED------------IVACRSKSTTGHGHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVT
        TQ QVV S  KA+ V            SNED            +VA  SKST   GHGS ES  KC S S LT KCIHHK S AS Y ARAC+G ILF+T
Subjt:  TQLQVVSSPDKAKEV------------SNED------------IVACRSKSTTGHGHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVT

Query:  VARQVLLYGN
        +ARQVLLYGN
Subjt:  VARQVLLYGN

TrEMBL top hitse value%identityAlignment
A0A1S3BQH1 protein CUP-SHAPED COTYLEDON 3-like1.2e-14158.87Show/hide
Query:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG
        MA+C++LPP +L+PVGFRFHPTDEEL NHYLKNKI+G                       LSND+T DH+WFFFSAQ+FKYSNGRRSNRAT TGYWKSTG
Subjt:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG

Query:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEND-----EAEPNDLVDSTFESVATVNQNQEIPG
        KDRKIM  GTK LIGTKKTLVFYSGRVP+GI+TNWVIHEYHLH DP+   L  +SFVIC+LKRK +E+D     EAEPN L+ ST  +VAT NQN+E PG
Subjt:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEND-----EAEPNDLVDSTFESVATVNQNQEIPG

Query:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEID----TEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMD
        NG SLFQ D QV DY   +L+S LFSDPEP SMDFQ  N YG   NG TDE+++     +ENC  EGTPNSS S+FNWEE+L  +N +QGGGFSG+TG+D
Subjt:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEID----TEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMD

Query:  ATLSWQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGEINCISYLSDESDREITTIRPQRQHKSHKVVN------ASQSKRETQLQVVS
          LSW+YD AS S F E   SR+Q++S+P+IKESRRSPLTSKIL+           SD+SDR       Q Q KSHK ++        QSKRETQ+QVVS
Subjt:  ATLSWQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGEINCISYLSDESDREITTIRPQRQHKSHKVVN------ASQSKRETQLQVVS

Query:  SPDKAKEV------------SNEDIVA-----CRSKSTTGHGHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTVARQVLLYGN
        S  KA+ V            SN+DIV        ++     GHGS++S  +CSS S LT KCI HK S AS Y ARAC+G ILF+TVAR+VLLYGN
Subjt:  SPDKAKEV------------SNEDIVA-----CRSKSTTGHGHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTVARQVLLYGN

A0A6J1CT89 NAC domain-containing protein 101-like3.5e-15462.93Show/hide
Query:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG
        MA+C     R LWPVGFRFHPTD+ELINHYLKNK+LG                       LSN  T + +WFFFSAQ+FKYSNGRRSNR T TGYWKSTG
Subjt:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG

Query:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEND-----EAEPNDLVDSTFESV-ATVNQNQEIP
        KDRKIM PGTK LIGTKKTLVF+ GRV NGIRTNWVIHEYHLHSD +F  L  RSFVIC LKRK DEND     EAE N +V+STF++V +T+++ QEI 
Subjt:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEND-----EAEPNDLVDSTFESV-ATVNQNQEIP

Query:  GNGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTE----------ENCLHEGTPNSSISDFNWEELLHSVNQNQGGGF
        GNG+SLFQPD QVLDYDFPDLQSPLFSDPEP SMDFQA N YG   NG +DE+IDTE          ENC HEGTPNSSI+DFN+EELLH V+     GF
Subjt:  GNGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTE----------ENCLHEGTPNSSISDFNWEELLHSVNQNQGGGF

Query:  SGDTGMDATLSWQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCI-SYLSDESDREITTIRPQRQHKSHKV
        SGDTGM++   WQ D AS SL  EHTRSRVQ+QSMP+I++SRRSPLTSKILKYGGE              INCI SY  D+SD+E TT+RPQ   KSHKV
Subjt:  SGDTGMDATLSWQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCI-SYLSDESDREITTIRPQRQHKSHKV

Query:  VNASQSKRETQLQVVSSPDKAKEV------------SNEDIV-----------ACRSKSTTGHGHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARAC
        V  +Q +RETQL+VVSS DKAK V            SN+D +           A R KST   G G +ESR KCSS S LT KCIHHKSS ASAYVARAC
Subjt:  VNASQSKRETQLQVVSSPDKAKEV------------SNEDIV-----------ACRSKSTTGHGHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARAC

Query:  IGLILFVTVARQVLLYGN
        IG ILF+ VARQ LLYGN
Subjt:  IGLILFVTVARQVLLYGN

A0A6J1IAW9 NAC domain-containing protein 53-like1.0e-14561.69Show/hide
Query:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG
        MADCIVLP   L PVGFRFHPTDEELINHYLKNK+LG                       LSND TS+ EWFFF+AQ+ KYSN RRSNRAT TGYWKSTG
Subjt:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG

Query:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEN-----DEAEPNDLVDSTFESVATVNQNQEIPG
        KDRKI+ P TKKLIGTKKTLVFYSGRV NGIRTNWVIHEYHLHSDP F +L P  FVICLLK+K DEN     DEAE N L++ TF++  +  Q   IPG
Subjt:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEN-----DEAEPNDLVDSTFESVATVNQNQEIPG

Query:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLS
        NGQSLF P+ QV  YDF  L+S LF D EPISMDFQA N YG   + PTDEEID EE  LHEGTPNSS S FNW+ELL  V+ +QGG   GDTGM ATL+
Subjt:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLS

Query:  WQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCISYLSDESDREITTIRPQRQHKSHKVVNAS------QS
         Q +  SHS+F EH+ SRV +Q+ PIIKESRRSPLTSKI KYGGE               N ISYLSD+SDRE  T+R Q Q +S KVVN +      Q 
Subjt:  WQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCISYLSDESDREITTIRPQRQHKSHKVVNAS------QS

Query:  KRETQLQVVSSPDKAKEV------------SNEDIVACRS---KSTTGH-----GHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTV
        KR++QLQVVSSPDKAK V            SN+DIV  R    KS   H     G GSVESR K  S S LT KCIHH+SS+ASAYVAR CIGLILF  V
Subjt:  KRETQLQVVSSPDKAKEV------------SNEDIVACRS---KSTTGH-----GHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTV

Query:  ARQVLLYGN
        AR +LL GN
Subjt:  ARQVLLYGN

A0A6J1IFU1 uncharacterized protein LOC111472945 isoform X16.2e-15162.67Show/hide
Query:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG
        MADCIVLP   L PVGFRFHPTDEELINHYLKNK+LG                       LSND TS+ EWFFF+AQ+ KYSN RRSNRAT TGYWKSTG
Subjt:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG

Query:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEN-----DEAEPNDLVDSTFESVATVNQNQEIPG
        KDRKI+ P TKKLIGTKKTLVFYSGRV NGIRTNWVIHEYHLHSDP F +L P  FVICLLK+K DEN     DEAE N L++ TF++  +VNQN+EIPG
Subjt:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEN-----DEAEPNDLVDSTFESVATVNQNQEIPG

Query:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLS
        NGQSLF P+ QV  YDF  L+S LF D EPISMDFQA N YG   + PTDEEID EE  LHEGTPNSS S FNW+ELL  V+ +QGG   GDTGM ATL+
Subjt:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLS

Query:  WQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCISYLSDESDREITTIRPQRQHKSHKVVNAS------QS
         Q +  SHS+F EH+ SRV +Q+ PIIKESRRSPLTSKI KYGGE               N ISYLSD+SDRE  T+R Q Q +S KVVN +      Q 
Subjt:  WQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCISYLSDESDREITTIRPQRQHKSHKVVNAS------QS

Query:  KRETQLQVVSSPDKAKEV------------SNEDIVACRS---KSTTGH-----GHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTV
        KR+TQLQVVSSPDKAK V            SN+DIV  R    KS   H     G GSVESR K  S S LT KCIHH+SS+ASAYVAR CIGLILF  V
Subjt:  KRETQLQVVSSPDKAKEV------------SNEDIVACRS---KSTTGH-----GHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTV

Query:  ARQVLLYGN
        AR +LL GN
Subjt:  ARQVLLYGN

A0A6J1IH80 NAC domain-containing protein 53-like isoform X25.3e-14260.51Show/hide
Query:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG
        MADCIVLP   L PVGFRFHPTDEELINHYLKNK+LG                       LSND TS+ EWFFF+AQ+ KYSN RRSNRAT TGYWKSTG
Subjt:  MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILG-----------------------LSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTG

Query:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEN-----DEAEPNDLVDSTFESVATVNQNQEIPG
        KDRKI+ P TKKLIGTKKTLVFYSGRV NGIRTNWVIHEYHLHSDP F +L P  FVICLLK+K DEN     DEAE N L++ TF++  +VNQN+E+ G
Subjt:  KDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDEN-----DEAEPNDLVDSTFESVATVNQNQEIPG

Query:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLS
                      YDF  L+S LF D EPISMDFQA N YG   + PTDEEID EE  LHEGTPNSS S FNW+ELL  V+ +QGG   GDTGM ATL+
Subjt:  NGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYG---NGPTDEEIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLS

Query:  WQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCISYLSDESDREITTIRPQRQHKSHKVVNAS------QS
         Q +  SHS+F EH+ SRV +Q+ PIIKESRRSPLTSKI KYGGE               N ISYLSD+SDRE  T+R Q Q +S KVVN +      Q 
Subjt:  WQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGE--------------INCISYLSDESDREITTIRPQRQHKSHKVVNAS------QS

Query:  KRETQLQVVSSPDKAKEV------------SNEDIVACRS---KSTTGH-----GHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTV
        KR+TQLQVVSSPDKAK V            SN+DIV  R    KS   H     G GSVESR K  S S LT KCIHH+SS+ASAYVAR CIGLILF  V
Subjt:  KRETQLQVVSSPDKAKEV------------SNEDIVACRS---KSTTGH-----GHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTV

Query:  ARQVLLYGN
        AR +LL GN
Subjt:  ARQVLLYGN

SwissProt top hitse value%identityAlignment
A4VCM0 NAC domain-containing protein 459.0e-3042.78Show/hide
Query:  PVGFRFHPTDEELINHYLKNKILGLSND-----------------------RTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL
        P GFRFHPTDEELI +YLK KI GL  +                        + D EW+FFS ++ KY NG R+NRAT  GYWK+TGKDR++      + 
Subjt:  PVGFRFHPTDEELINHYLKNKILGLSND-----------------------RTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL

Query:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHL---HSDPSFVRLSPRSFVICLLKRKPDENDEAEPNDLVDSTFESVATVNQN
        IGTKKTLV+Y GR P+GIRT WV+HEY L     +PS   +   ++ +C + +K     EA+P D   S   +++ V+ N
Subjt:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHL---HSDPSFVRLSPRSFVICLLKRKPDENDEAEPNDLVDSTFESVATVNQN

B5X570 NAC domain-containing protein 141.1e-3045.45Show/hide
Query:  PVGFRFHPTDEELINHYLKNKI-----------------------LGLSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL
        P+GFRF PTDEELINHYL+ KI                        GLS  +T D EWFFF  ++ KY +G RSNRAT  GYWK+TGKDR I     K +
Subjt:  PVGFRFHPTDEELINHYLKNKI-----------------------LGLSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL

Query:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSP--RSFVICLLKRKP-DENDEAEPNDLVDSTFESVAT
        IG KKTLVFY GR P G RTNW++HEY   +D       P    +V+C L  KP D  D A   ++    F    T
Subjt:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSP--RSFVICLLKRKP-DENDEAEPNDLVDSTFESVAT

F4JN35 Protein NTM1-like 97.3e-3246.71Show/hide
Query:  PVGFRFHPTDEELINHYLKNKILGLSND-----------------------RTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL
        P+GFRF PTDEEL+NHYL+ KI G  +D                       +T D EWFFF  ++ KY NG RSNRAT +GYWK+TGKDR I     K L
Subjt:  PVGFRFHPTDEELINHYLKNKILGLSND-----------------------RTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL

Query:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRS-----FVICLLKRKPDENDEAEPND
        IG KKTLVFY GR P G RTNW++HEY     P+   L   S     +V+C L  KPD+      +D
Subjt:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRS-----FVICLLKRKPDENDEAEPND

Q9LKG8 NAC domain-containing protein 911.1e-3045.73Show/hide
Query:  PVGFRFHPTDEELINHYLKNKILGLSND-----------------------RTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL
        PVGFRF PTDEEL+ +YL+ KI G  ND                       +T+D EW FF   + KY +G R NRAT  GYWK+TGKDRKI   G  K+
Subjt:  PVGFRFHPTDEELINHYLKNKILGLSND-----------------------RTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL

Query:  IGTKKTLVFYSGRVPNGIRTNWVIHEYH-LHSDPSFVRLSPRSFVICLLKRKPD-ENDEAEPND
        IG K+TLVFY+GR P G RT W++HEY     D    +     FV+C L +K D  N  AEP +
Subjt:  IGTKKTLVFYSGRVPNGIRTNWVIHEYH-LHSDPSFVRLSPRSFVICLLKRKPD-ENDEAEPND

Q9S851 Protein CUP-SHAPED COTYLEDON 31.8e-3045.64Show/hide
Query:  PVGFRFHPTDEELINHYLKNKIL--GLSNDRTS-------------------DHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKLIG
        P GFRFHPTDEELI  YL +KI   GLS    S                   + EW+F+S ++ KY  G R+NRAT  GYWK+TGKD+++   G  +L+G
Subjt:  PVGFRFHPTDEELINHYLKNKIL--GLSNDRTS-------------------DHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKLIG

Query:  TKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRK
         KKTLVFY GR P G++T WV+HEY L +D S        +VIC +  K
Subjt:  TKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRSFVICLLKRK

Arabidopsis top hitse value%identityAlignment
AT1G33060.1 NAC 0147.5e-3245.45Show/hide
Query:  PVGFRFHPTDEELINHYLKNKI-----------------------LGLSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL
        P+GFRF PTDEELINHYL+ KI                        GLS  +T D EWFFF  ++ KY +G RSNRAT  GYWK+TGKDR I     K +
Subjt:  PVGFRFHPTDEELINHYLKNKI-----------------------LGLSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL

Query:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSP--RSFVICLLKRKP-DENDEAEPNDLVDSTFESVAT
        IG KKTLVFY GR P G RTNW++HEY   +D       P    +V+C L  KP D  D A   ++    F    T
Subjt:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSP--RSFVICLLKRKP-DENDEAEPNDLVDSTFESVAT

AT1G33060.2 NAC 0147.5e-3245.45Show/hide
Query:  PVGFRFHPTDEELINHYLKNKI-----------------------LGLSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL
        P+GFRF PTDEELINHYL+ KI                        GLS  +T D EWFFF  ++ KY +G RSNRAT  GYWK+TGKDR I     K +
Subjt:  PVGFRFHPTDEELINHYLKNKI-----------------------LGLSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL

Query:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSP--RSFVICLLKRKP-DENDEAEPNDLVDSTFESVAT
        IG KKTLVFY GR P G RTNW++HEY   +D       P    +V+C L  KP D  D A   ++    F    T
Subjt:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSP--RSFVICLLKRKP-DENDEAEPNDLVDSTFESVAT

AT4G35580.1 NAC transcription factor-like 95.2e-3346.71Show/hide
Query:  PVGFRFHPTDEELINHYLKNKILGLSND-----------------------RTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL
        P+GFRF PTDEEL+NHYL+ KI G  +D                       +T D EWFFF  ++ KY NG RSNRAT +GYWK+TGKDR I     K L
Subjt:  PVGFRFHPTDEELINHYLKNKILGLSND-----------------------RTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL

Query:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRS-----FVICLLKRKPDENDEAEPND
        IG KKTLVFY GR P G RTNW++HEY     P+   L   S     +V+C L  KPD+      +D
Subjt:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRS-----FVICLLKRKPDENDEAEPND

AT4G35580.2 NAC transcription factor-like 95.2e-3346.71Show/hide
Query:  PVGFRFHPTDEELINHYLKNKILGLSND-----------------------RTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL
        P+GFRF PTDEEL+NHYL+ KI G  +D                       +T D EWFFF  ++ KY NG RSNRAT +GYWK+TGKDR I     K L
Subjt:  PVGFRFHPTDEELINHYLKNKILGLSND-----------------------RTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL

Query:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRS-----FVICLLKRKPDENDEAEPND
        IG KKTLVFY GR P G RTNW++HEY     P+   L   S     +V+C L  KPD+      +D
Subjt:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRS-----FVICLLKRKPDENDEAEPND

AT4G35580.3 NAC transcription factor-like 95.2e-3346.71Show/hide
Query:  PVGFRFHPTDEELINHYLKNKILGLSND-----------------------RTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL
        P+GFRF PTDEEL+NHYL+ KI G  +D                       +T D EWFFF  ++ KY NG RSNRAT +GYWK+TGKDR I     K L
Subjt:  PVGFRFHPTDEELINHYLKNKILGLSND-----------------------RTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKL

Query:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRS-----FVICLLKRKPDENDEAEPND
        IG KKTLVFY GR P G RTNW++HEY     P+   L   S     +V+C L  KPD+      +D
Subjt:  IGTKKTLVFYSGRVPNGIRTNWVIHEYHLHSDPSFVRLSPRS-----FVICLLKRKPDENDEAEPND


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGATTGCATCGTGCTTCCTCCTCGGCATCTATGGCCGGTTGGATTCCGCTTCCATCCAACCGATGAGGAGCTAATCAATCACTACCTGAAGAACAAGATACTTGG
CCTATCAAACGACCGTACAAGTGATCATGAGTGGTTTTTCTTTTCTGCTCAAAACTTCAAGTATTCCAATGGCCGTCGATCCAATAGGGCCACTGCAACCGGCTATTGGA
AATCCACGGGCAAGGACCGCAAAATCATGCCTCCAGGAACCAAGAAGTTGATTGGTACCAAAAAAACTCTGGTCTTCTACAGCGGCCGGGTTCCAAATGGGATCAGGACC
AATTGGGTCATACATGAGTATCATCTTCACTCCGACCCCAGCTTCGTTCGTTTGTCCCCTAGGTCGTTTGTTATTTGTCTCCTAAAGAGAAAACCGGACGAGAATGATGA
AGCTGAACCAAATGATCTTGTGGATTCGACCTTCGAAAGTGTAGCCACAGTGAACCAAAATCAAGAGATTCCAGGAAATGGACAATCATTGTTTCAACCAGATTTCCAGG
TTTTGGATTATGATTTTCCCGATTTGCAGTCCCCTTTGTTTTCTGACCCGGAACCCATTTCAATGGATTTCCAAGCTTTTAATTGCTATGGGAATGGACCTACTGATGAG
GAGATTGATACTGAGGAAAATTGTTTGCATGAAGGGACACCAAATAGTTCGATCAGTGATTTCAACTGGGAGGAATTGCTGCACTCGGTTAATCAAAATCAGGGTGGGGG
ATTCAGCGGTGACACGGGCATGGATGCAACTTTAAGCTGGCAGTATGATCCTGCTTCTCACAGCTTGTTCAGTGAACATACCCGTTCAAGGGTACAAAGTCAAAGTATGC
CGATCATTAAAGAATCGCGTAGAAGCCCACTCACTTCAAAGATTCTCAAGTACGGAGGGGAAATCAATTGCATTTCGTACCTCAGTGATGAGTCTGACAGGGAAATAACA
ACTATCAGACCTCAACGTCAACATAAATCCCATAAGGTTGTAAATGCTTCACAATCGAAGAGGGAAACTCAACTCCAAGTAGTGTCATCCCCTGACAAGGCAAAAGAAGT
TTCAAATGAAGACATTGTAGCCTGCCGGTCCAAGTCAACAACTGGGCATGGGCATGGGTCTGTTGAGAGTCGAGAAAAGTGTTCTTCTACTAGTTTCCTGACTAAAAAAT
GCATTCACCACAAATCAAGTGCAGCATCAGCCTATGTTGCCAGAGCATGTATAGGCCTCATTTTATTCGTCACAGTTGCAAGACAAGTGCTGCTGTATGGAAATTCAAAT
TGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGATTGCATCGTGCTTCCTCCTCGGCATCTATGGCCGGTTGGATTCCGCTTCCATCCAACCGATGAGGAGCTAATCAATCACTACCTGAAGAACAAGATACTTGG
CCTATCAAACGACCGTACAAGTGATCATGAGTGGTTTTTCTTTTCTGCTCAAAACTTCAAGTATTCCAATGGCCGTCGATCCAATAGGGCCACTGCAACCGGCTATTGGA
AATCCACGGGCAAGGACCGCAAAATCATGCCTCCAGGAACCAAGAAGTTGATTGGTACCAAAAAAACTCTGGTCTTCTACAGCGGCCGGGTTCCAAATGGGATCAGGACC
AATTGGGTCATACATGAGTATCATCTTCACTCCGACCCCAGCTTCGTTCGTTTGTCCCCTAGGTCGTTTGTTATTTGTCTCCTAAAGAGAAAACCGGACGAGAATGATGA
AGCTGAACCAAATGATCTTGTGGATTCGACCTTCGAAAGTGTAGCCACAGTGAACCAAAATCAAGAGATTCCAGGAAATGGACAATCATTGTTTCAACCAGATTTCCAGG
TTTTGGATTATGATTTTCCCGATTTGCAGTCCCCTTTGTTTTCTGACCCGGAACCCATTTCAATGGATTTCCAAGCTTTTAATTGCTATGGGAATGGACCTACTGATGAG
GAGATTGATACTGAGGAAAATTGTTTGCATGAAGGGACACCAAATAGTTCGATCAGTGATTTCAACTGGGAGGAATTGCTGCACTCGGTTAATCAAAATCAGGGTGGGGG
ATTCAGCGGTGACACGGGCATGGATGCAACTTTAAGCTGGCAGTATGATCCTGCTTCTCACAGCTTGTTCAGTGAACATACCCGTTCAAGGGTACAAAGTCAAAGTATGC
CGATCATTAAAGAATCGCGTAGAAGCCCACTCACTTCAAAGATTCTCAAGTACGGAGGGGAAATCAATTGCATTTCGTACCTCAGTGATGAGTCTGACAGGGAAATAACA
ACTATCAGACCTCAACGTCAACATAAATCCCATAAGGTTGTAAATGCTTCACAATCGAAGAGGGAAACTCAACTCCAAGTAGTGTCATCCCCTGACAAGGCAAAAGAAGT
TTCAAATGAAGACATTGTAGCCTGCCGGTCCAAGTCAACAACTGGGCATGGGCATGGGTCTGTTGAGAGTCGAGAAAAGTGTTCTTCTACTAGTTTCCTGACTAAAAAAT
GCATTCACCACAAATCAAGTGCAGCATCAGCCTATGTTGCCAGAGCATGTATAGGCCTCATTTTATTCGTCACAGTTGCAAGACAAGTGCTGCTGTATGGAAATTCAAAT
TGTTGA
Protein sequenceShow/hide protein sequence
MADCIVLPPRHLWPVGFRFHPTDEELINHYLKNKILGLSNDRTSDHEWFFFSAQNFKYSNGRRSNRATATGYWKSTGKDRKIMPPGTKKLIGTKKTLVFYSGRVPNGIRT
NWVIHEYHLHSDPSFVRLSPRSFVICLLKRKPDENDEAEPNDLVDSTFESVATVNQNQEIPGNGQSLFQPDFQVLDYDFPDLQSPLFSDPEPISMDFQAFNCYGNGPTDE
EIDTEENCLHEGTPNSSISDFNWEELLHSVNQNQGGGFSGDTGMDATLSWQYDPASHSLFSEHTRSRVQSQSMPIIKESRRSPLTSKILKYGGEINCISYLSDESDREIT
TIRPQRQHKSHKVVNASQSKRETQLQVVSSPDKAKEVSNEDIVACRSKSTTGHGHGSVESREKCSSTSFLTKKCIHHKSSAASAYVARACIGLILFVTVARQVLLYGNSN
C