; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G4622 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G4622
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionGAGA-binding transcriptional activator
Genome locationctg1227:847487..851510
RNA-Seq ExpressionCucsat.G4622
SyntenyCucsat.G4622
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009723 - response to ethylene (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0042803 - protein homodimerization activity (molecular function)
InterPro domainsIPR010409 - GAGA-binding transcriptional activator


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152657.1 protein BASIC PENTACYSTEINE6 [Cucumis sativus]5.97e-249100Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_008444782.1 PREDICTED: protein BASIC PENTACYSTEINE6 [Cucumis melo]5.49e-24598.52Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSR+MLASNDPCPTSPVASESTKARRNKR KEGKTV TPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMS GDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_022144218.1 protein BASIC PENTACYSTEINE6 [Momordica charantia]3.24e-22291.84Show/hide
Query:  MDDSGHRENGRHKPD-QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-
        MDDSGHRENGRHK + QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK ALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ 
Subjt:  MDDSGHRENGRHKPD-QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-

Query:  NNNLSCPPGCQIARGVKHIHHPQQQ---HTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNK
        +NNLSCPPGCQIARGVKHIHHPQQQ   HTHHVPHMNE  YN REM   ND CP SPVASESTKARRNKR KE K V TPNKKVSK PRKVKREAEDLNK
Subjt:  NNNLSCPPGCQIARGVKHIHHPQQQ---HTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNK

Query:  IMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGR
        IMLGKSQEWKD IGIMS GDDLNKQLVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+GGR
Subjt:  IMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGR

Query:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_038885713.1 protein BASIC PENTACYSTEINE6 isoform X1 [Benincasa hispida]1.53e-23794.78Show/hide
Query:  VLESQMDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRE
        VLESQMDDSGHRENGRHKPDQYK AQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRE
Subjt:  VLESQMDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRE

Query:  NSI--NNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDL
        NS+  NNN+SCPPGCQIARGVKHIHHPQQQHTHHVPHMNE+NYNSREMLASNDPCP SPVASESTKARRNKR KEGKTV TP+KKV +GPRKVKRE EDL
Subjt:  NSI--NNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDL

Query:  NKIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLG
        NKIMLGKSQEWKDGIGIM  GDDLNKQLV SKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLP VPNKRHARLG
Subjt:  NKIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLG

Query:  GRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_038885714.1 protein BASIC PENTACYSTEINE6 isoform X2 [Benincasa hispida]3.77e-23494.71Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDDSGHRENGRHKPDQYK AQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  NNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIML
        NNN+SCPPGCQIARGVKHIHHPQQQHTHHVPHMNE+NYNSREMLASNDPCP SPVASESTKARRNKR KEGKTV TP+KKV +GPRKVKRE EDLNKIML
Subjt:  NNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIML

Query:  GKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMS
        GKSQEWKDGIGIM  GDDLNKQLV SKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLP VPNKRHARLGGRKMS
Subjt:  GKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMS

Query:  GSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

TrEMBL top hitse value%identityAlignment
A0A0A0LLL3 GAGA-binding transcriptional activator2.89e-249100Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A1S3BBW9 GAGA-binding transcriptional activator2.66e-24598.52Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSR+MLASNDPCPTSPVASESTKARRNKR KEGKTV TPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMS GDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A5A7VDN8 GAGA-binding transcriptional activator2.66e-24598.52Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSR+MLASNDPCPTSPVASESTKARRNKR KEGKTV TPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMS GDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A6J1CR14 GAGA-binding transcriptional activator1.57e-22291.84Show/hide
Query:  MDDSGHRENGRHKPD-QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-
        MDDSGHRENGRHK + QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK ALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ 
Subjt:  MDDSGHRENGRHKPD-QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-

Query:  NNNLSCPPGCQIARGVKHIHHPQQQ---HTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNK
        +NNLSCPPGCQIARGVKHIHHPQQQ   HTHHVPHMNE  YN REM   ND CP SPVASESTKARRNKR KE K V TPNKKVSK PRKVKREAEDLNK
Subjt:  NNNLSCPPGCQIARGVKHIHHPQQQ---HTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLNK

Query:  IMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGR
        IMLGKSQEWKD IGIMS GDDLNKQLVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+GGR
Subjt:  IMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGR

Query:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A6J1KJF5 GAGA-binding transcriptional activator2.38e-22091.04Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-N
        MDDSGHRENGRHK +QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAA+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ N
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-N

Query:  NNLSCPPGCQIARGVKHIHHPQQQ----HTH-HVPHMNENNYNSREMLASNDPCPTSP-VASESTKARRNKRPKE-GKTVPTPNKKVSKGPRKVKREAED
        NNLSCPPGCQIARGVKHIHHPQQQ    HTH HVPHMNE+NYNSRE+ A ND CP  P VASESTK RRNKR KE  KTV  PNKKVSK PRKVKREAED
Subjt:  NNLSCPPGCQIARGVKHIHHPQQQ----HTH-HVPHMNENNYNSREMLASNDPCPTSP-VASESTKARRNKRPKE-GKTVPTPNKKVSKGPRKVKREAED

Query:  LNKIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARL
        LNKIMLGKSQEWKDGIGI+S GDDLNK LVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+
Subjt:  LNKIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARL

Query:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

SwissProt top hitse value%identityAlignment
F4JUI3 Protein BASIC PENTACYSTEINE58.6e-7145.35Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM----QHQPSM--KQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYR
        M+  G  ENGR+KPD YK  Q   +M    QH   +  K+I++I+AERDAA++ERN A++  K ALA RD A  QRD A++ER+NA++E ++A+  L+YR
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM----QHQPSM--KQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYR

Query:  ENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLN
        EN++N  LSC       RG       ++ H                     +P P S +  E+   R  KR KE K              + K+  EDLN
Subjt:  ENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLN

Query:  KIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGG
        + +    ++                    S+ DW   D+    V FDE TMP P+C+CTG  RQCYKWGNGGWQS+CCTTTLS YPLP +PNKRH+R+GG
Subjt:  KIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGG

Query:  RKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        RKMSGS F++LLSRLA EGH+LS+PVDLKN+WA+HGTNRYITIK
Subjt:  RKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

O82286 Protein BASIC PENTACYSTEINE72.8e-3759.13Show/hide
Query:  SKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLK
        +K + K  D+ ++  +FD S +P P+CSCTGV R CYKWG GGWQS+CCT ++S YPLP    +  ARL GRKMS  A+ KLL+RLA EG+DLS P+DLK
Subjt:  SKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLK

Query:  NHWAKHGTNRYITIK
        NHWA+HGTN+++TIK
Subjt:  NHWAKHGTNRYITIK

Q5VSA8 Barley B recombinant-like protein D4.9e-9054.86Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM-QHQPSMK-----QIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQY-
        MD+ GHRENGR +PDQYK    QWMM Q Q  +K      ++A+M +RD AI+ER+ AL+EKKAA+AERDMA+ QRDAA+AERN A++ERDNA+A L+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM-QHQPSMK-----QIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQY-

Query:  REN--SINNNLSCPPGCQIARGVKHIHHPQQ-QHTHHVP-HMNENNY-NSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKR
        R N  ++NN    P G     G K+IHH  Q  H    P  + ++ Y ++REM  S       P+++    A + KRPK+  +  +P K+ S   RK K+
Subjt:  REN--SINNNLSCPPGCQIARGVKHIHHPQQ-QHTHHVP-HMNENNY-NSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKR

Query:  EAEDLNKIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKR
         + D           WK+ +G+   GDD +    V K++WK Q+LGLNQVAFD+STMPAP CSCTG +RQCYKWGNGGWQS+CCT  +SMYPLP +PNKR
Subjt:  EAEDLNKIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKR

Query:  HARLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        HAR+GGRKMSG AF KLLSRLAAEGHDLS PVDLK+HWAKHGTNRYITI+
Subjt:  HARLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

Q8L999 Protein BASIC PENTACYSTEINE68.2e-12266.48Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDD GHRENGRHK     + QGQW+MQHQPSMKQ+M+I+AERDAAIQERNLA+SEKKAA+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS---ESTKARRNKRPKEGKTVPT-PNKKVSKGPRKVKR
            N  +CPPGCQI+RGVKH+HHP   H    HH+P + EN Y +REM   ND  PTSP A    ES K +R KR     T  T  NK+  K  RKVK+
Subjt:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS---ESTKARRNKRPKEGKTVPT-PNKKVSKGPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDGIGIMSAGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV
        E+E DLNKIM  K + ++ D        +D +K +++ SKSDWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTTLSMYPLPA+
Subjt:  EAE-DLNKIMLGK-SQEWKDGIGIMSAGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV

Query:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        PNKRHAR+GGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK

Q8S8C6 Protein BASIC PENTACYSTEINE45.8e-7548.28Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT
        M++ G  +N R KPD +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A 
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT

Query:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREA
        LQ+ ENS+N  LS    C        I  P +                        P  T P    +TK   NKR KE K          +G  KVK+  
Subjt:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREA

Query:  EDLNKIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA
        EDLN+ +    ++                    S++DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTTLS YPLP +PNKRH+
Subjt:  EDLNKIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA

Query:  RLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  RLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

Arabidopsis top hitse value%identityAlignment
AT2G21240.1 basic pentacysteine 44.1e-7648.28Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT
        M++ G  +N R KPD +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A 
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT

Query:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREA
        LQ+ ENS+N  LS    C        I  P +                        P  T P    +TK   NKR KE K          +G  KVK+  
Subjt:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREA

Query:  EDLNKIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA
        EDLN+ +    ++                    S++DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTTLS YPLP +PNKRH+
Subjt:  EDLNKIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA

Query:  RLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  RLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

AT2G21240.2 basic pentacysteine 44.1e-7648.28Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT
        M++ G  +N R KPD +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A 
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT

Query:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREA
        LQ+ ENS+N  LS    C        I  P +                        P  T P    +TK   NKR KE K          +G  KVK+  
Subjt:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREA

Query:  EDLNKIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA
        EDLN+ +    ++                    S++DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTTLS YPLP +PNKRH+
Subjt:  EDLNKIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA

Query:  RLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  RLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

AT5G42520.1 basic pentacysteine 65.9e-12366.48Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDD GHRENGRHK     + QGQW+MQHQPSMKQ+M+I+AERDAAIQERNLA+SEKKAA+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS---ESTKARRNKRPKEGKTVPT-PNKKVSKGPRKVKR
            N  +CPPGCQI+RGVKH+HHP   H    HH+P + EN Y +REM   ND  PTSP A    ES K +R KR     T  T  NK+  K  RKVK+
Subjt:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS---ESTKARRNKRPKEGKTVPT-PNKKVSKGPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDGIGIMSAGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV
        E+E DLNKIM  K + ++ D        +D +K +++ SKSDWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTTLSMYPLPA+
Subjt:  EAE-DLNKIMLGK-SQEWKDGIGIMSAGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV

Query:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        PNKRHAR+GGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK

AT5G42520.2 basic pentacysteine 61.1e-11865.63Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDD GHRENGRHK     + QG    QHQPSMKQ+M+I+AERDAAIQERNLA+SEKKAA+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS---ESTKARRNKRPKEGKTVPT-PNKKVSKGPRKVKR
            N  +CPPGCQI+RGVKH+HHP   H    HH+P + EN Y +REM   ND  PTSP A    ES K +R KR     T  T  NK+  K  RKVK+
Subjt:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS---ESTKARRNKRPKEGKTVPT-PNKKVSKGPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDGIGIMSAGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV
        E+E DLNKIM  K + ++ D        +D +K +++ SKSDWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTTLSMYPLPA+
Subjt:  EAE-DLNKIMLGK-SQEWKDGIGIMSAGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV

Query:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        PNKRHAR+GGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK

AT5G42520.3 basic pentacysteine 62.0e-9463.92Show/hide
Query:  MAYLQRDAAIAERNNALLERDNAIATLQYRENSI-----NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS-
        MA+LQRD AIAERNNA++ERD+A+  LQYRENS+      N  +CPPGCQI+RGVKH+HHP   H    HH+P + EN Y +REM   ND  PTSP A  
Subjt:  MAYLQRDAAIAERNNALLERDNAIATLQYRENSI-----NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS-

Query:  --ESTKARRNKRPKEGKTVPT-PNKKVSKGPRKVKREAE-DLNKIMLGK-SQEWKDGIGIMSAGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAP
          ES K +R KR     T  T  NK+  K  RKVK+E+E DLNKIM  K + ++ D        +D +K +++ SKSDWK Q++ GLNQV +DE+TMP P
Subjt:  --ESTKARRNKRPKEGKTVPT-PNKKVSKGPRKVKREAE-DLNKIMLGK-SQEWKDGIGIMSAGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAP

Query:  ICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        +CSCTGV+RQCYKWGNGGWQS+CCTTTLSMYPLPA+PNKRHAR+GGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  ICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGATTGCCATTTTGTTTGCAGTGGAGCTTTTTCATTACTGTATCTGTGTTGGTTATTCGTTTTGGAGTCACAAATGGATGACAGTGGACACCGTGAGAATGGAAG
GCACAAACCAGATCAGTATAAGTCAGCTCAGGGTCAGTGGATGATGCAGCATCAGCCCTCAATGAAGCAGATAATGGCAATTATGGCTGAAAGAGACGCAGCCATTCAAG
AAAGAAATTTGGCCCTCTCGGAGAAAAAGGCTGCACTGGCAGAGCGAGACATGGCATATCTACAGCGAGACGCTGCAATTGCAGAGAGGAACAATGCCCTTTTGGAACGA
GACAACGCCATTGCTACTCTTCAGTATCGTGAAAACTCCATAAATAACAATTTATCATGTCCACCAGGATGCCAAATTGCTAGGGGAGTGAAGCATATACATCATCCACA
GCAGCAACACACACATCATGTGCCTCACATGAATGAGAATAATTACAATTCAAGAGAAATGCTTGCTTCCAACGACCCTTGCCCAACATCCCCTGTCGCTTCTGAATCAA
CAAAGGCACGACGAAACAAGCGTCCAAAAGAGGGAAAGACAGTCCCAACACCAAACAAGAAAGTTTCAAAAGGTCCACGGAAGGTCAAAAGGGAGGCTGAAGACTTGAAC
AAAATAATGTTGGGGAAGTCACAAGAATGGAAGGATGGAATTGGTATAATGAGTGCAGGTGACGATCTTAATAAACAGTTGGTAGTATCAAAATCAGATTGGAAAGGCCA
GGATTTAGGATTAAACCAGGTTGCATTTGACGAATCAACCATGCCAGCTCCTATATGCTCCTGCACAGGAGTAATAAGACAATGCTACAAATGGGGGAATGGTGGATGGC
AATCTGCATGTTGTACTACCACCCTCTCAATGTATCCATTACCTGCCGTTCCCAACAAACGACATGCTCGACTTGGCGGTCGGAAAATGAGCGGAAGTGCTTTTAACAAA
CTGCTTAGCCGCCTTGCAGCCGAAGGCCACGACCTATCCGCTCCAGTTGATCTTAAAAATCACTGGGCAAAGCATGGAACAAATCGTTACATCACCATCAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAATGATTGCCATTTTGTTTGCAGTGGAGCTTTTTCATTACTGTATCTGTGTTGGTTATTCGTTTTGGAGTCACAAATGGATGACAGTGGACACCGTGAGAATGGAAG
GCACAAACCAGATCAGTATAAGTCAGCTCAGGGTCAGTGGATGATGCAGCATCAGCCCTCAATGAAGCAGATAATGGCAATTATGGCTGAAAGAGACGCAGCCATTCAAG
AAAGAAATTTGGCCCTCTCGGAGAAAAAGGCTGCACTGGCAGAGCGAGACATGGCATATCTACAGCGAGACGCTGCAATTGCAGAGAGGAACAATGCCCTTTTGGAACGA
GACAACGCCATTGCTACTCTTCAGTATCGTGAAAACTCCATAAATAACAATTTATCATGTCCACCAGGATGCCAAATTGCTAGGGGAGTGAAGCATATACATCATCCACA
GCAGCAACACACACATCATGTGCCTCACATGAATGAGAATAATTACAATTCAAGAGAAATGCTTGCTTCCAACGACCCTTGCCCAACATCCCCTGTCGCTTCTGAATCAA
CAAAGGCACGACGAAACAAGCGTCCAAAAGAGGGAAAGACAGTCCCAACACCAAACAAGAAAGTTTCAAAAGGTCCACGGAAGGTCAAAAGGGAGGCTGAAGACTTGAAC
AAAATAATGTTGGGGAAGTCACAAGAATGGAAGGATGGAATTGGTATAATGAGTGCAGGTGACGATCTTAATAAACAGTTGGTAGTATCAAAATCAGATTGGAAAGGCCA
GGATTTAGGATTAAACCAGGTTGCATTTGACGAATCAACCATGCCAGCTCCTATATGCTCCTGCACAGGAGTAATAAGACAATGCTACAAATGGGGGAATGGTGGATGGC
AATCTGCATGTTGTACTACCACCCTCTCAATGTATCCATTACCTGCCGTTCCCAACAAACGACATGCTCGACTTGGCGGTCGGAAAATGAGCGGAAGTGCTTTTAACAAA
CTGCTTAGCCGCCTTGCAGCCGAAGGCCACGACCTATCCGCTCCAGTTGATCTTAAAAATCACTGGGCAAAGCATGGAACAAATCGTTACATCACCATCAAGTAG
Protein sequenceShow/hide protein sequence
MNDCHFVCSGAFSLLYLCWLFVLESQMDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLER
DNAIATLQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVPTPNKKVSKGPRKVKREAEDLN
KIMLGKSQEWKDGIGIMSAGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNK
LLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK