; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0021344 (gene) of Snake gourd v1 genome

Gene IDTan0021344
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionNAC domain-containing protein
Genome locationLG02:94524872..94527986
RNA-Seq ExpressionTan0021344
SyntenyTan0021344
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0003677 - DNA binding (molecular function)
InterPro domainsIPR003441 - NAC domain
IPR036093 - NAC domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585403.1 Protein CUP-SHAPED COTYLEDON 2, partial [Cucurbita argyrosperma subsp. sororia]1.4e-18283.9Show/hide
Query:  MQVPSGNNSVMEKEDGKRSEVKKKNTMAESDHHHCHRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK
        MQV SG+N   EK DGKRSEV KKNT+AESD     R+HQD  Q+Q+QTLPPGFRFHPSDEELIT+YLLNKISDANFTGRA+TDVDLNKFEPWELPGKAK
Subjt:  MQVPSGNNSVMEKEDGKRSEVKKKNTMAESDHHHCHRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK

Query:  MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSA
        MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFN++TSELVGMKKTLVFYKGRAPRGEK+NWVMHEYRLHSKTAFRTAKDEWVVCR+FQKSA
Subjt:  MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSA

Query:  GPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRVGG---GSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLN
        GPKKYPPNQ RAVNPYVNLEIAPPLLPP +MQLGD AAQYGYGRN+I+  +LAELNRVLR GG   GSGGSTHGINLSMQPQFNY PAGGCFTISGLNLN
Subjt:  GPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRVGG---GSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLN

Query:  LGGASSQPVLRPMPPPPVATAMAIGQQDIASSVMTSTIA----------PENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW
        LGGASSQPVLRPM PPPVA  M IGQQD  SSVM +T+A          PE AYGAEI+NN  GPGNRFMNMDHCMDLENYW NW
Subjt:  LGGASSQPVLRPMPPPPVATAMAIGQQDIASSVMTSTIA----------PENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW

XP_008445052.1 PREDICTED: NAC domain-containing protein 92 [Cucumis melo]4.4e-18484.62Show/hide
Query:  MQVPSGNNSVMEKEDGK-RSEVKKKNTM--AESDHHHCHRHH--------QDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNK
        MQVPSGNN + EKEDGK RSEVKKKNTM   ESDHH  HR H        Q++ QD++QTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNK
Subjt:  MQVPSGNNSVMEKEDGK-RSEVKKKNTM--AESDHHHCHRHH--------QDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNK

Query:  FEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDE
        FEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSEL+GMKKTLVFYKGRAPRGEK+NWVMHEYRLHSKTAFRTAKDE
Subjt:  FEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDE

Query:  WVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGD-HAAQYGYGRNFITATELAELNRVLRVGGG----SGGSTHGINLSMQPQFNYPP
        WVVCR+FQKS G KKYPPNQ RAVNPYVNLEI   LLPP IMQLGD  A QYGYGRN+IT+TELAELNRVLR GGG     GGSTHGINLSMQPQFNYP 
Subjt:  WVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGD-HAAQYGYGRNFITATELAELNRVLRVGGG----SGGSTHGINLSMQPQFNYPP

Query:  AGGCFTISGLNLNLGGASSQPVLR--PMPPPPVATAMAIGQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW
         GGCFTISGLNLNLGG  SQPVLR  P+PPPPVAT M IG QDIASSVM STI PE AYG EINNNA GP NRFMNMDHCMDLENYWPNW
Subjt:  AGGCFTISGLNLNLGGASSQPVLR--PMPPPPVATAMAIGQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW

XP_022951842.1 protein CUP-SHAPED COTYLEDON 3-like [Cucurbita moschata]8.2e-18384.16Show/hide
Query:  MQVPSGNNSVMEKEDGKRSEVKKKNTMAESDHHHCHRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK
        MQV SG+N   EK DGK  EV KKNT+AESD     R+HQD  Q+Q+QTLPPGFRFHPSDEELIT+YLLNKISDANFTGRA+TDVDLNKFEPWELPGKAK
Subjt:  MQVPSGNNSVMEKEDGKRSEVKKKNTMAESDHHHCHRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK

Query:  MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSA
        MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFN++TSELVGMKKTLVFYKGRAPRGEK+NWVMHEYRLHSKTAFRTAKDEWVVCR+FQKSA
Subjt:  MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSA

Query:  GPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRVGG---GSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLN
        GPKKYPPNQ RAVNPYVNLEIAPPLLPP +MQLGD AAQYGYGRN+IT  +LAELNRVLR GG   GSGGSTHGINLSMQPQFNY PAGGCFTISGLNLN
Subjt:  GPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRVGG---GSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLN

Query:  LGGASSQPVLRPMPPPPVATAMAIGQQDIASSVM----------TSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW
        LGGASSQPVLRPM PPPVA  M IGQQDI SSVM           STI PE AYGAEI+NN  GPGNRFMNMDHCMDLENYW NW
Subjt:  LGGASSQPVLRPMPPPPVATAMAIGQQDIASSVM----------TSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW

XP_023002385.1 protein CUP-SHAPED COTYLEDON 3 [Cucurbita maxima]7.4e-18484.42Show/hide
Query:  MQVPSGNNSVMEKEDGKRSEVKKKNTMAESDHHHCHRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK
        MQV SG+N   EK DGKRSEV KKNT+AESD     R+HQD  Q+Q+QTLPPGFRFHPSDEELIT+YLLNKISDANFTGRA+TDVDLNKFEPWELPGKAK
Subjt:  MQVPSGNNSVMEKEDGKRSEVKKKNTMAESDHHHCHRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK

Query:  MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSA
        MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFN++TSELVGMKKTLVFYKGRAPRGEK+NWVMHEYRLHSKTAFRTAKDEWVVCR+FQKSA
Subjt:  MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSA

Query:  GPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRVGG---GSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLN
        GPKKYPPNQ RAVNPYVNLEIAPPLLPP +MQLGD AAQYGYGRN+IT  +LAELNRVLR GG   GSGGSTHGINLSMQPQFNY PAGGCFTISGLNLN
Subjt:  GPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRVGG---GSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLN

Query:  LGGASSQPVLRPMPPPPVATAMAIGQQDIASSVMTSTIA----------PENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW
        LGGASSQPVLRPM PPPVA  M IGQQDI SSVM +T+A          PE AYGAEI+NN  GPGNRFMNMDHCMDLENYW NW
Subjt:  LGGASSQPVLRPMPPPPVATAMAIGQQDIASSVMTSTIA----------PENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW

XP_038885415.1 NAC domain-containing protein 92 [Benincasa hispida]5.0e-18887.17Show/hide
Query:  MQVPSGNNSVMEKEDGK-RSEVKKKNTMAESDHHHCHRHHQ------DRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPW
        MQVP+GNNS  EKEDGK R EVKKKNTM ESDHH   RHHQ      D+ QD++QTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPW
Subjt:  MQVPSGNNSVMEKEDGK-RSEVKKKNTMAESDHHHCHRHHQ------DRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPW

Query:  ELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVC
        ELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSEL+GMKKTLVFYKGRAPRGEK+NWVMHEYRLHSKTAFRTAKDEWVVC
Subjt:  ELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVC

Query:  RIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRV---GGGSGGSTHGINLSMQPQFNYPPAGGCFT
        R+FQKS G KKYPPNQ RAVNPYVNLEIAP LLPPSIMQLGD AAQYGYGRN IT+TELAELNRVLR    G GSGGSTHGINLSMQPQFNYP  GGCFT
Subjt:  RIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRV---GGGSGGSTHGINLSMQPQFNYPPAGGCFT

Query:  ISGLNLNLGGASSQPVLRPMPPPPVATAMAIGQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW
        ISGLNLNLGGA SQPVLRPM PPPVAT M I   DIASSVM STIAPE AYG EINNNA GP NRFMNMDHCMDLENYWP W
Subjt:  ISGLNLNLGGASSQPVLRPMPPPPVATAMAIGQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW

TrEMBL top hitse value%identityAlignment
A0A0A0LS52 NAC domain-containing protein7.8e-17983.81Show/hide
Query:  MQVPSGNNSVMEKEDGK-RSEVKKKNTM-AESDHHHCH-----RHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPW
        MQV SGNN + EKEDGK RSEVKKKNTM  ESD H  H        Q+  QD++QTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPW
Subjt:  MQVPSGNNSVMEKEDGK-RSEVKKKNTM-AESDHHHCH-----RHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPW

Query:  ELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVC
        ELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSEL+GMKKTLVFYKGRAPRGEK+NWVMHEYRLHSKTAFRTAKDEWVVC
Subjt:  ELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVC

Query:  RIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGD-HAAQYGYGRNFITATELAELNRVLRV---GGGSGGSTHGINLSMQPQFNYPPAGGCF
        R+FQKS G KKYPPNQ RAVNPYVNLEI   LLPP IMQLGD  A QYGYGRN+IT+TELAELNRVLR    GG +GGST GINLSMQPQFNYP  GGCF
Subjt:  RIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGD-HAAQYGYGRNFITATELAELNRVLRV---GGGSGGSTHGINLSMQPQFNYPPAGGCF

Query:  TISGLNLNLGGASSQPVLRPMPPPPVATAMAIGQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW
        TISGLNLNLGG  SQPVLR +PPPPVAT M IG QDIASS+M STI PE AYGAEINNNA G  NRFMNMDHCMDLENYWP W
Subjt:  TISGLNLNLGGASSQPVLRPMPPPPVATAMAIGQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW

A0A1S3BCI4 NAC domain-containing protein 922.1e-18484.62Show/hide
Query:  MQVPSGNNSVMEKEDGK-RSEVKKKNTM--AESDHHHCHRHH--------QDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNK
        MQVPSGNN + EKEDGK RSEVKKKNTM   ESDHH  HR H        Q++ QD++QTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNK
Subjt:  MQVPSGNNSVMEKEDGK-RSEVKKKNTM--AESDHHHCHRHH--------QDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNK

Query:  FEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDE
        FEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSEL+GMKKTLVFYKGRAPRGEK+NWVMHEYRLHSKTAFRTAKDE
Subjt:  FEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDE

Query:  WVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGD-HAAQYGYGRNFITATELAELNRVLRVGGG----SGGSTHGINLSMQPQFNYPP
        WVVCR+FQKS G KKYPPNQ RAVNPYVNLEI   LLPP IMQLGD  A QYGYGRN+IT+TELAELNRVLR GGG     GGSTHGINLSMQPQFNYP 
Subjt:  WVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGD-HAAQYGYGRNFITATELAELNRVLRVGGG----SGGSTHGINLSMQPQFNYPP

Query:  AGGCFTISGLNLNLGGASSQPVLR--PMPPPPVATAMAIGQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW
         GGCFTISGLNLNLGG  SQPVLR  P+PPPPVAT M IG QDIASSVM STI PE AYG EINNNA GP NRFMNMDHCMDLENYWPNW
Subjt:  AGGCFTISGLNLNLGGASSQPVLR--PMPPPPVATAMAIGQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW

A0A5A7VHQ8 NAC domain-containing protein 929.2e-18084.96Show/hide
Query:  EKEDGK-RSEVKKKNTM--AESDHHHCHRHH--------QDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK
        EKEDGK RSEVKKKNTM   ESDHH  HR H        Q++ QD++QTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK
Subjt:  EKEDGK-RSEVKKKNTM--AESDHHHCHRHH--------QDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK

Query:  MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSA
        MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSEL+GMKKTLVFYKGRAPRGEK+NWVMHEYRLHSKTAFRTAKDEWVVCR+FQKS 
Subjt:  MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSA

Query:  GPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGD-HAAQYGYGRNFITATELAELNRVLRVGGG----SGGSTHGINLSMQPQFNYPPAGGCFTISGLN
        G KKYPPNQ RAVNPYVNLEI   LLPP IMQLGD  A QYGYGRN+IT+TELAELNRVLR GGG     GGSTHGINLSMQPQFNYP  GGCFTISGLN
Subjt:  GPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGD-HAAQYGYGRNFITATELAELNRVLRVGGG----SGGSTHGINLSMQPQFNYPPAGGCFTISGLN

Query:  LNLGGASSQPVLR--PMPPPPVATAMAIGQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW
        LNLGG  SQPVLR  P+PPPPVAT M IG QDIASSVM STI PE AYG EINNNA GP NRFMNMDHCMDLENYWPNW
Subjt:  LNLGGASSQPVLR--PMPPPPVATAMAIGQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW

A0A6J1GIS6 protein CUP-SHAPED COTYLEDON 3-like4.0e-18384.16Show/hide
Query:  MQVPSGNNSVMEKEDGKRSEVKKKNTMAESDHHHCHRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK
        MQV SG+N   EK DGK  EV KKNT+AESD     R+HQD  Q+Q+QTLPPGFRFHPSDEELIT+YLLNKISDANFTGRA+TDVDLNKFEPWELPGKAK
Subjt:  MQVPSGNNSVMEKEDGKRSEVKKKNTMAESDHHHCHRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK

Query:  MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSA
        MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFN++TSELVGMKKTLVFYKGRAPRGEK+NWVMHEYRLHSKTAFRTAKDEWVVCR+FQKSA
Subjt:  MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSA

Query:  GPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRVGG---GSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLN
        GPKKYPPNQ RAVNPYVNLEIAPPLLPP +MQLGD AAQYGYGRN+IT  +LAELNRVLR GG   GSGGSTHGINLSMQPQFNY PAGGCFTISGLNLN
Subjt:  GPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRVGG---GSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLN

Query:  LGGASSQPVLRPMPPPPVATAMAIGQQDIASSVM----------TSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW
        LGGASSQPVLRPM PPPVA  M IGQQDI SSVM           STI PE AYGAEI+NN  GPGNRFMNMDHCMDLENYW NW
Subjt:  LGGASSQPVLRPMPPPPVATAMAIGQQDIASSVM----------TSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW

A0A6J1KNS7 protein CUP-SHAPED COTYLEDON 33.6e-18484.42Show/hide
Query:  MQVPSGNNSVMEKEDGKRSEVKKKNTMAESDHHHCHRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK
        MQV SG+N   EK DGKRSEV KKNT+AESD     R+HQD  Q+Q+QTLPPGFRFHPSDEELIT+YLLNKISDANFTGRA+TDVDLNKFEPWELPGKAK
Subjt:  MQVPSGNNSVMEKEDGKRSEVKKKNTMAESDHHHCHRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAK

Query:  MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSA
        MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFN++TSELVGMKKTLVFYKGRAPRGEK+NWVMHEYRLHSKTAFRTAKDEWVVCR+FQKSA
Subjt:  MGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSA

Query:  GPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRVGG---GSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLN
        GPKKYPPNQ RAVNPYVNLEIAPPLLPP +MQLGD AAQYGYGRN+IT  +LAELNRVLR GG   GSGGSTHGINLSMQPQFNY PAGGCFTISGLNLN
Subjt:  GPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRVGG---GSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLN

Query:  LGGASSQPVLRPMPPPPVATAMAIGQQDIASSVMTSTIA----------PENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW
        LGGASSQPVLRPM PPPVA  M IGQQDI SSVM +T+A          PE AYGAEI+NN  GPGNRFMNMDHCMDLENYW NW
Subjt:  LGGASSQPVLRPMPPPPVATAMAIGQQDIASSVMTSTIA----------PENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW

SwissProt top hitse value%identityAlignment
O04017 Protein CUP-SHAPED COTYLEDON 22.9e-6671.69Show/hide
Query:  HHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKD
        +H D   D  Q LPPGFRFHP+DEELIT YLL K+ D  F+ RAI +VDLNK EPW+LPG+AKMGEKEWYFFSLRDRKYPTG+RTNRAT  GYWK TGKD
Subjt:  HHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKD

Query:  KEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAF----RTAKDEWVVCRIFQKS
        +EIF+S T  LVGMKKTLVFYKGRAP+GEKSNWVMHEYRL  K ++    R++KDEWV+ R+FQK+
Subjt:  KEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAF----RTAKDEWVVCRIFQKS

Q9FKA0 NAC domain-containing protein 923.7e-6162.43Show/hide
Query:  QDQEQ-TLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFN
        +D+E   LPPGFRFHP+DEELIT YL  K+ +  F+  AI +VDLNK EPW+LP KAKMGEKEWYFF +RDRKYPTG+RTNRAT  GYWK TGKDKEIF 
Subjt:  QDQEQ-TLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFN

Query:  SNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAF----RTAKDEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLP
          +  LVGMKKTLVFYKGRAP+G K+NWVMHEYRL  K       +TAK+EWV+CR+FQK A   K P +    ++P++N  + P  LP
Subjt:  SNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAF----RTAKDEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLP

Q9FLJ2 NAC domain-containing protein 1004.7e-6462.23Show/hide
Query:  QDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNS
        ++++  LPPGFRFHP+DEELIT YL  K+ D +F+ +AI +VDLNK EPWELP  AKMGEKEWYFF +RDRKYPTG+RTNRAT  GYWK TGKDKEI+  
Subjt:  QDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNS

Query:  NTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAF----RTAKDEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLP
         +  LVGMKKTLVFY+GRAP+G+K+NWVMHEYRL  K +     +TAK+EWV+CR+FQKSAG KK P +    +   +  +  P LLP
Subjt:  NTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAF----RTAKDEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLP

Q9FLR3 NAC domain-containing protein 792.8e-6461.46Show/hide
Query:  RPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIF
        +  D++  LPPGFRFHP+DEELIT YL  K+ D  F+ +AI +VDLNK EPWELP KAK+GEKEWYFF +RDRKYPTG+RTNRAT  GYWK TGKDKEIF
Subjt:  RPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIF

Query:  NSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAF----RTAKDEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPS
           +  LVGMKKTLVFY+GRAP+G+K+NWVMHEYRL  K +     +TAK+EWV+CR+F K+AG KK P +    +  Y      PPL   S
Subjt:  NSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAF----RTAKDEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPS

Q9FRV4 Protein CUP-SHAPED COTYLEDON 18.9e-6369.33Show/hide
Query:  RPQ-DQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEI
        RP+ + E  +PPGFRFHP+DEELIT+YLL K+ D+NF+  AI+ VDLNK EPWELP KAKMGEKEWYFF+LRDRKYPTG+RTNRAT  GYWK TGKD+EI
Subjt:  RPQ-DQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEI

Query:  FNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFR----TAKDEWVVCRIFQKS
         +S T  L+GMKKTLVFYKGRAP+GEKS WVMHEYRL  K ++     +AKDEWV+C++  KS
Subjt:  FNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFR----TAKDEWVVCRIFQKS

Arabidopsis top hitse value%identityAlignment
AT2G24430.1 NAC domain containing protein 383.7e-8851Show/hide
Query:  HRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTG
        H+ H    + +E+ LPPGFRFHP+DEELI++YL+NKI+D NFTG+AI DVDLNK EPWELP KAKMG KEWYFFSLRDRKYPTGVRTNRATNTGYWKTTG
Subjt:  HRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTG

Query:  KDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAK-DEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPS-----
        KDKEIFNS TSELVGMKKTLVFY+GRAPRGEK+ WVMHEYRLHSK+++RT+K DEWVVCR+F+K+   KKY      + + + N      +L  +     
Subjt:  KDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAK-DEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPS-----

Query:  ----IMQLGDHAAQY---GYGRNFI-TATELAELNRVLRVGGGSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLNLGGASSQPVLRPMPPPPVATAMAI
            ++QL  H   +      ++ +  A  LAEL+RV R       ++  ++ S Q   NY        +SGLNLNLGGA  Q       PPPV     +
Subjt:  ----IMQLGDHAAQY---GYGRNFI-TATELAELNRVLRVGGGSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLNLGGASSQPVLRPMPPPPVATAMAI

Query:  GQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW
          +D+A+  ++++   EN +G              + M  CMDL+ YWP++
Subjt:  GQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW

AT2G24430.2 NAC domain containing protein 383.7e-8851Show/hide
Query:  HRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTG
        H+ H    + +E+ LPPGFRFHP+DEELI++YL+NKI+D NFTG+AI DVDLNK EPWELP KAKMG KEWYFFSLRDRKYPTGVRTNRATNTGYWKTTG
Subjt:  HRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTG

Query:  KDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAK-DEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPS-----
        KDKEIFNS TSELVGMKKTLVFY+GRAPRGEK+ WVMHEYRLHSK+++RT+K DEWVVCR+F+K+   KKY      + + + N      +L  +     
Subjt:  KDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAK-DEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPS-----

Query:  ----IMQLGDHAAQY---GYGRNFI-TATELAELNRVLRVGGGSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLNLGGASSQPVLRPMPPPPVATAMAI
            ++QL  H   +      ++ +  A  LAEL+RV R       ++  ++ S Q   NY        +SGLNLNLGGA  Q       PPPV     +
Subjt:  ----IMQLGDHAAQY---GYGRNFI-TATELAELNRVLRVGGGSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLNLGGASSQPVLRPMPPPPVATAMAI

Query:  GQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW
          +D+A+  ++++   EN +G              + M  CMDL+ YWP++
Subjt:  GQQDIASSVMTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW

AT3G18400.1 NAC domain containing protein 586.9e-7153.1Show/hide
Query:  EQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTS
        E+ LPPGFRFHP+DEELIT YL  K+SD  FTG+A+ DVDLNK EPW+LP KA MGEKEWYFFS RDRKYPTG+RTNRAT  GYWKTTGKDKEI+ S   
Subjt:  EQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTS

Query:  ELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFR-TAKDEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRN
         LVGMKKTLVFYKGRAP+GEKSNWVMHEYRL SK  F  T K+EWVVCR+F+KS   KK    QP++  P        P    S M              
Subjt:  ELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFR-TAKDEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPSIMQLGDHAAQYGYGRN

Query:  FITATELAELNRVLRVGGGSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLNLGGASS
           A E  +++ +  +   S    +  ++    Q N        + +GLN+N+  AS+
Subjt:  FITATELAELNRVLRVGGGSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLNLGGASS

AT5G07680.1 NAC domain containing protein 802.0e-6561.46Show/hide
Query:  RPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIF
        +  D++  LPPGFRFHP+DEELIT YL  K+ D  F+ +AI +VDLNK EPWELP KAK+GEKEWYFF +RDRKYPTG+RTNRAT  GYWK TGKDKEIF
Subjt:  RPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKDKEIF

Query:  NSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAF----RTAKDEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPS
           +  LVGMKKTLVFY+GRAP+G+K+NWVMHEYRL  K +     +TAK+EWV+CR+F K+AG KK P +    +  Y      PPL   S
Subjt:  NSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAF----RTAKDEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLEIAPPLLPPS

AT5G53950.1 NAC (No Apical Meristem) domain transcriptional regulator superfamily protein2.1e-6771.69Show/hide
Query:  HHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKD
        +H D   D  Q LPPGFRFHP+DEELIT YLL K+ D  F+ RAI +VDLNK EPW+LPG+AKMGEKEWYFFSLRDRKYPTG+RTNRAT  GYWK TGKD
Subjt:  HHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFSLRDRKYPTGVRTNRATNTGYWKTTGKD

Query:  KEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAF----RTAKDEWVVCRIFQKS
        +EIF+S T  LVGMKKTLVFYKGRAP+GEKSNWVMHEYRL  K ++    R++KDEWV+ R+FQK+
Subjt:  KEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAF----RTAKDEWVVCRIFQKS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAGTACCGAGCGGTAATAATTCGGTGATGGAGAAGGAGGACGGGAAGAGGTCGGAGGTGAAGAAGAAGAATACTATGGCGGAATCAGATCATCATCATTGTCATCG
TCATCATCAAGATCGTCCTCAAGATCAAGAACAAACATTGCCTCCAGGGTTTAGGTTTCATCCTTCTGATGAAGAGTTGATCACTTTTTATCTCTTGAATAAGATATCAG
ATGCTAATTTTACAGGAAGGGCTATTACAGATGTTGATCTCAACAAATTTGAACCTTGGGAACTACCAGGGAAGGCTAAAATGGGAGAAAAAGAATGGTATTTTTTCAGC
CTACGAGACAGAAAATACCCAACTGGAGTAAGAACAAACAGAGCAACAAACACAGGATATTGGAAAACAACTGGAAAAGATAAAGAAATTTTCAATAGCAATACTTCAGA
ATTGGTTGGGATGAAAAAGACATTGGTGTTTTATAAAGGGAGAGCCCCAAGAGGAGAGAAATCCAATTGGGTTATGCATGAATATCGACTTCACTCCAAAACTGCTTTTA
GAACAGCTAAGGACGAATGGGTGGTTTGCCGAATATTTCAAAAGAGTGCTGGACCAAAAAAGTATCCTCCAAATCAACCAAGAGCAGTGAATCCCTATGTCAACCTGGAA
ATTGCCCCTCCTCTTCTCCCACCATCCATCATGCAGCTCGGAGACCATGCTGCTCAATATGGCTATGGTCGAAACTTCATCACAGCCACAGAGCTGGCTGAGCTCAACCG
AGTCCTGAGAGTCGGTGGAGGCAGTGGAGGCTCGACTCACGGTATCAATTTATCGATGCAGCCCCAGTTTAACTATCCACCTGCAGGAGGTTGCTTCACAATATCCGGGT
TGAACTTGAACCTCGGAGGAGCTTCATCGCAACCAGTTCTGCGACCGATGCCACCACCCCCAGTGGCGACAGCAATGGCAATTGGGCAACAAGATATCGCTTCGTCCGTG
ATGACAAGTACGATTGCTCCTGAGAATGCCTACGGGGCAGAGATCAACAACAATGCCGTTGGACCTGGCAACAGATTTATGAACATGGATCATTGCATGGACCTGGAGAA
TTACTGGCCTAACTGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGCAAGTACCGAGCGGTAATAATTCGGTGATGGAGAAGGAGGACGGGAAGAGGTCGGAGGTGAAGAAGAAGAATACTATGGCGGAATCAGATCATCATCATTGTCATCG
TCATCATCAAGATCGTCCTCAAGATCAAGAACAAACATTGCCTCCAGGGTTTAGGTTTCATCCTTCTGATGAAGAGTTGATCACTTTTTATCTCTTGAATAAGATATCAG
ATGCTAATTTTACAGGAAGGGCTATTACAGATGTTGATCTCAACAAATTTGAACCTTGGGAACTACCAGGGAAGGCTAAAATGGGAGAAAAAGAATGGTATTTTTTCAGC
CTACGAGACAGAAAATACCCAACTGGAGTAAGAACAAACAGAGCAACAAACACAGGATATTGGAAAACAACTGGAAAAGATAAAGAAATTTTCAATAGCAATACTTCAGA
ATTGGTTGGGATGAAAAAGACATTGGTGTTTTATAAAGGGAGAGCCCCAAGAGGAGAGAAATCCAATTGGGTTATGCATGAATATCGACTTCACTCCAAAACTGCTTTTA
GAACAGCTAAGGACGAATGGGTGGTTTGCCGAATATTTCAAAAGAGTGCTGGACCAAAAAAGTATCCTCCAAATCAACCAAGAGCAGTGAATCCCTATGTCAACCTGGAA
ATTGCCCCTCCTCTTCTCCCACCATCCATCATGCAGCTCGGAGACCATGCTGCTCAATATGGCTATGGTCGAAACTTCATCACAGCCACAGAGCTGGCTGAGCTCAACCG
AGTCCTGAGAGTCGGTGGAGGCAGTGGAGGCTCGACTCACGGTATCAATTTATCGATGCAGCCCCAGTTTAACTATCCACCTGCAGGAGGTTGCTTCACAATATCCGGGT
TGAACTTGAACCTCGGAGGAGCTTCATCGCAACCAGTTCTGCGACCGATGCCACCACCCCCAGTGGCGACAGCAATGGCAATTGGGCAACAAGATATCGCTTCGTCCGTG
ATGACAAGTACGATTGCTCCTGAGAATGCCTACGGGGCAGAGATCAACAACAATGCCGTTGGACCTGGCAACAGATTTATGAACATGGATCATTGCATGGACCTGGAGAA
TTACTGGCCTAACTGGTAA
Protein sequenceShow/hide protein sequence
MQVPSGNNSVMEKEDGKRSEVKKKNTMAESDHHHCHRHHQDRPQDQEQTLPPGFRFHPSDEELITFYLLNKISDANFTGRAITDVDLNKFEPWELPGKAKMGEKEWYFFS
LRDRKYPTGVRTNRATNTGYWKTTGKDKEIFNSNTSELVGMKKTLVFYKGRAPRGEKSNWVMHEYRLHSKTAFRTAKDEWVVCRIFQKSAGPKKYPPNQPRAVNPYVNLE
IAPPLLPPSIMQLGDHAAQYGYGRNFITATELAELNRVLRVGGGSGGSTHGINLSMQPQFNYPPAGGCFTISGLNLNLGGASSQPVLRPMPPPPVATAMAIGQQDIASSV
MTSTIAPENAYGAEINNNAVGPGNRFMNMDHCMDLENYWPNW