; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0026717 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0026717
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGAGA-binding transcriptional activator
Genome locationchr03:25231777..25236566
RNA-Seq ExpressionIVF0026717
SyntenyIVF0026717
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009723 - response to ethylene (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0042803 - protein homodimerization activity (molecular function)
InterPro domainsIPR010409 - GAGA-binding transcriptional activator


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152657.1 protein BASIC PENTACYSTEINE6 [Cucumis sativus]7.83e-24598.52Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSR+MLASNDPCPTSPVASESTKARRNKR KEGKTV TPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMS GDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

XP_008444782.1 PREDICTED: protein BASIC PENTACYSTEINE6 [Cucumis melo]1.21e-248100Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

XP_022144218.1 protein BASIC PENTACYSTEINE6 [Momordica charantia]6.92e-22492.13Show/hide
Query:  MDDSGHRENGRHKPD-QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-
        MDDSGHRENGRHK + QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK ALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ 
Subjt:  MDDSGHRENGRHKPD-QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-

Query:  NNNLSCPPGCQIARGVKHIHHPQQQ---HTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNK
        +NNLSCPPGCQIARGVKHIHHPQQQ   HTHHVPHMNE  YN R+M   ND CP SPVASESTKARRNKRTKE K VTTPNKKVSK PRKVKREAEDLNK
Subjt:  NNNLSCPPGCQIARGVKHIHHPQQQ---HTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNK

Query:  IMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGR
        IMLGKSQEWKD IGIMSGGDDLNKQLVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+GGR
Subjt:  IMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGR

Query:  KMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        KMSGSAFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  KMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

XP_038885713.1 protein BASIC PENTACYSTEINE6 isoform X1 [Benincasa hispida]8.54e-23594.71Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDDSGHRENGRHKPDQYK AQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  NNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIML
        NNN+SCPPGCQIARGVKHIHHPQQQHTHHVPHMNE+NYNSR+MLASNDPCP SPVASESTKARRNKR KEGKTVTTP+KKV +GPRKVKRE EDLNKIML
Subjt:  NNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIML

Query:  GKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMS
        GKSQEWKDGIGIM GGDDLNKQLV SKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLP VPNKRHARLGGRKMS
Subjt:  GKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMS

Query:  GSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        GSAFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  GSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

XP_038885714.1 protein BASIC PENTACYSTEINE6 isoform X2 [Benincasa hispida]6.58e-23594.71Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDDSGHRENGRHKPDQYK AQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  NNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIML
        NNN+SCPPGCQIARGVKHIHHPQQQHTHHVPHMNE+NYNSR+MLASNDPCP SPVASESTKARRNKR KEGKTVTTP+KKV +GPRKVKRE EDLNKIML
Subjt:  NNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIML

Query:  GKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMS
        GKSQEWKDGIGIM GGDDLNKQLV SKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLP VPNKRHARLGGRKMS
Subjt:  GKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMS

Query:  GSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        GSAFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  GSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

TrEMBL top hitse value%identityAlignment
A0A0A0LLL3 GAGA-binding transcriptional activator5.2e-19098.52Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSR+MLASNDPCPTSPVASESTKARRNKR KEGKTV TPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMS GDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

A0A1S3BBW9 GAGA-binding transcriptional activator6.6e-193100Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

A0A5A7VDN8 GAGA-binding transcriptional activator6.6e-193100Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

A0A6J1CR14 GAGA-binding transcriptional activator4.0e-17492.13Show/hide
Query:  MDDSGHRENGRHKPD-QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-
        MDDSGHRENGRHK + QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK ALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ 
Subjt:  MDDSGHRENGRHKPD-QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-

Query:  NNNLSCPPGCQIARGVKHIHHP---QQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNK
        +NNLSCPPGCQIARGVKHIHHP   QQQHTHHVPHMNE  YN R+M   ND CP SPVASESTKARRNKRTKE K VTTPNKKVSK PRKVKREAEDLNK
Subjt:  NNNLSCPPGCQIARGVKHIHHP---QQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNK

Query:  IMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGR
        IMLGKSQEWKD IGIMSGGDDLNKQLVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+GGR
Subjt:  IMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGR

Query:  KMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        KMSGSAFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  KMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

A0A6J1KJF5 GAGA-binding transcriptional activator2.2e-17291.33Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-N
        MDDSGHRENGRHK +QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAA+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ N
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-N

Query:  NNLSCPPGCQIARGVKHIHHP----QQQHTH-HVPHMNENNYNSRDMLASNDPCPTSP-VASESTKARRNKRTKE-GKTVTTPNKKVSKGPRKVKREAED
        NNLSCPPGCQIARGVKHIHHP    QQQHTH HVPHMNE+NYNSR+ L +ND CP  P VASESTK RRNKRTKE  KTVT PNKKVSK PRKVKREAED
Subjt:  NNLSCPPGCQIARGVKHIHHP----QQQHTH-HVPHMNENNYNSRDMLASNDPCPTSP-VASESTKARRNKRTKE-GKTVTTPNKKVSKGPRKVKREAED

Query:  LNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARL
        LNKIMLGKSQEWKDGIGI+SGGDDLNK LVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+
Subjt:  LNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARL

Query:  GGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        GGRKMSGSAFNKLLSRL AEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  GGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

SwissProt top hitse value%identityAlignment
F4JUI3 Protein BASIC PENTACYSTEINE51.4e-7045.64Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM----QHQPSM--KQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYR
        M+  G  ENGR+KPD YK  Q   +M    QH   +  K+I++I+AERDAA++ERN A++  K ALA RD A  QRD A++ER+NA++E ++A+  L+YR
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM----QHQPSM--KQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYR

Query:  ENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLN
        EN++N  LSC       RG       ++ H                     +P P S +  E+   R  KR KE K              + K+  EDLN
Subjt:  ENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLN

Query:  KIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGG
        + +    ++                    S+ DW   D+    V FDE TMP P+C+CTG  RQCYKWGNGGWQS+CCTTTLS YPLP +PNKRH+R+GG
Subjt:  KIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGG

Query:  RKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        RKMSGS F++LLSRLA EGH+LSSPVDLKN+WA+HGTNRYITIK
Subjt:  RKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

O82286 Protein BASIC PENTACYSTEINE79.9e-3745.4Show/hide
Query:  RTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGN
        R     T T   K V++ P          +K +  K Q  K  +   S        +  +K + K  D+ ++  +FD S +P P+CSCTGV R CYKWG 
Subjt:  RTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGN

Query:  GGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        GGWQS+CCT ++S YPLP    +  ARL GRKMS  A+ KLL+RLA EG+DLS P+DLKNHWA+HGTN+++TIK
Subjt:  GGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

Q5VSA8 Barley B recombinant-like protein D7.2e-8854.29Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM-QHQPSMK-----QIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQY-
        MD+ GHRENGR +PDQYK    QWMM Q Q  +K      ++A+M +RD AI+ER+ AL+EKKAA+AERDMA+ QRDAA+AERN A++ERDNA+A L+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM-QHQPSMK-----QIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQY-

Query:  REN--SINNNLSCPPGCQIARGVKHIHHPQQ-QHTHHVP-HMNENNY-NSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKR
        R N  ++NN    P G     G K+IHH  Q  H    P  + ++ Y ++R+M  S       P+++    A + KR K+  +  +P K+ S   RK K+
Subjt:  REN--SINNNLSCPPGCQIARGVKHIHHPQQ-QHTHHVP-HMNENNY-NSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKR

Query:  EAEDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKR
         + D           WK+ +G+   GDD +    V K++WK Q+LGLNQVAFD+STMPAP CSCTG +RQCYKWGNGGWQS+CCT  +SMYPLP +PNKR
Subjt:  EAEDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKR

Query:  HARLGGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        HAR+GGRKMSG AF KLLSRLAAEGHDLS+PVDLK+HWAKHGTNRYITI+
Subjt:  HARLGGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

Q8L999 Protein BASIC PENTACYSTEINE67.7e-12266.2Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDD GHRENGRHK     + QGQW+MQHQPSMKQ+M+I+AERDAAIQERNLA+SEKKAA+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSRDMLASNDPCPTSPVAS---ESTKARRNKRTK-EGKTVTTPNKKVSKGPRKVKR
            N  +CPPGCQI+RGVKH+HHP   H    HH+P + EN Y +R+M   ND  PTSP A    ES K +R KR   +  T T  NK+  K  RKVK+
Subjt:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSRDMLASNDPCPTSPVAS---ESTKARRNKRTK-EGKTVTTPNKKVSKGPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV
        E+E DLNKIM  K + ++ D        +D +K +++ SKSDWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTTLSMYPLPA+
Subjt:  EAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV

Query:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSSPVDLKNHWAKHGTNRYITIK
        PNKRHAR+GGRKMSGSAFNKLLSRLAAEG HDLS+PVDLK+HWAKHGTNRYITIK
Subjt:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSSPVDLKNHWAKHGTNRYITIK

Q8S8C6 Protein BASIC PENTACYSTEINE42.7e-7448.28Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT
        M++ G  +N R KPD +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A 
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT

Query:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREA
        LQ+ ENS+N  LS    C        I  P +                        P  T P    +TK   NKR KE K          +G  KVK+  
Subjt:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREA

Query:  EDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA
        EDLN+ +    ++                    S++DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTTLS YPLP +PNKRH+
Subjt:  EDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA

Query:  RLGGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  RLGGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

Arabidopsis top hitse value%identityAlignment
AT2G21240.1 basic pentacysteine 41.9e-7548.28Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT
        M++ G  +N R KPD +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A 
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT

Query:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREA
        LQ+ ENS+N  LS    C        I  P +                        P  T P    +TK   NKR KE K          +G  KVK+  
Subjt:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREA

Query:  EDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA
        EDLN+ +    ++                    S++DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTTLS YPLP +PNKRH+
Subjt:  EDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA

Query:  RLGGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  RLGGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

AT2G21240.2 basic pentacysteine 41.9e-7548.28Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT
        M++ G  +N R KPD +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A 
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT

Query:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREA
        LQ+ ENS+N  LS    C        I  P +                        P  T P    +TK   NKR KE K          +G  KVK+  
Subjt:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREA

Query:  EDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA
        EDLN+ +    ++                    S++DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTTLS YPLP +PNKRH+
Subjt:  EDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA

Query:  RLGGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK
        R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  RLGGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHGTNRYITIK

AT5G42520.1 basic pentacysteine 65.4e-12366.2Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDD GHRENGRHK     + QGQW+MQHQPSMKQ+M+I+AERDAAIQERNLA+SEKKAA+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSRDMLASNDPCPTSPVAS---ESTKARRNKRTK-EGKTVTTPNKKVSKGPRKVKR
            N  +CPPGCQI+RGVKH+HHP   H    HH+P + EN Y +R+M   ND  PTSP A    ES K +R KR   +  T T  NK+  K  RKVK+
Subjt:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSRDMLASNDPCPTSPVAS---ESTKARRNKRTK-EGKTVTTPNKKVSKGPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV
        E+E DLNKIM  K + ++ D        +D +K +++ SKSDWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTTLSMYPLPA+
Subjt:  EAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV

Query:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSSPVDLKNHWAKHGTNRYITIK
        PNKRHAR+GGRKMSGSAFNKLLSRLAAEG HDLS+PVDLK+HWAKHGTNRYITIK
Subjt:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSSPVDLKNHWAKHGTNRYITIK

AT5G42520.2 basic pentacysteine 61.1e-11865.35Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDD GHRENGRHK     + QG    QHQPSMKQ+M+I+AERDAAIQERNLA+SEKKAA+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSRDMLASNDPCPTSPVAS---ESTKARRNKRTK-EGKTVTTPNKKVSKGPRKVKR
            N  +CPPGCQI+RGVKH+HHP   H    HH+P + EN Y +R+M   ND  PTSP A    ES K +R KR   +  T T  NK+  K  RKVK+
Subjt:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSRDMLASNDPCPTSPVAS---ESTKARRNKRTK-EGKTVTTPNKKVSKGPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV
        E+E DLNKIM  K + ++ D        +D +K +++ SKSDWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTTLSMYPLPA+
Subjt:  EAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV

Query:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSSPVDLKNHWAKHGTNRYITIK
        PNKRHAR+GGRKMSGSAFNKLLSRLAAEG HDLS+PVDLK+HWAKHGTNRYITIK
Subjt:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSSPVDLKNHWAKHGTNRYITIK

AT5G42520.3 basic pentacysteine 64.8e-9563.57Show/hide
Query:  MAYLQRDAAIAERNNALLERDNAIATLQYRENSI-----NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSRDMLASNDPCPTSPVAS-
        MA+LQRD AIAERNNA++ERD+A+  LQYRENS+      N  +CPPGCQI+RGVKH+HHP   H    HH+P + EN Y +R+M   ND  PTSP A  
Subjt:  MAYLQRDAAIAERNNALLERDNAIATLQYRENSI-----NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSRDMLASNDPCPTSPVAS-

Query:  --ESTKARRNKRTK-EGKTVTTPNKKVSKGPRKVKREAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAP
          ES K +R KR   +  T T  NK+  K  RKVK+E+E DLNKIM  K + ++ D        +D +K +++ SKSDWK Q++ GLNQV +DE+TMP P
Subjt:  --ESTKARRNKRTK-EGKTVTTPNKKVSKGPRKVKREAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAP

Query:  ICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSSPVDLKNHWAKHGTNRYITIK
        +CSCTGV+RQCYKWGNGGWQS+CCTTTLSMYPLPA+PNKRHAR+GGRKMSGSAFNKLLSRLAAEG HDLS+PVDLK+HWAKHGTNRYITIK
Subjt:  ICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSSPVDLKNHWAKHGTNRYITIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGACAGTGGACACCGTGAGAATGGAAGGCACAAACCAGATCAGTATAAGTCAGCTCAGGGTCAGTGGATGATGCAGCATCAGCCCTCAATGAAGCAGATAATGGC
AATCATGGCCGAAAGAGATGCAGCCATTCAAGAGAGAAATTTGGCCCTCTCGGAGAAAAAGGCTGCACTAGCAGAGCGAGACATGGCATATCTACAGCGAGACGCTGCAA
TTGCAGAGAGGAACAATGCCCTTTTGGAACGAGACAATGCCATTGCTACTCTTCAGTATCGTGAAAACTCTATAAATAACAATTTATCATGTCCACCAGGATGCCAAATT
GCTAGGGGAGTGAAGCATATACATCATCCGCAGCAGCAACACACACATCATGTGCCCCACATGAATGAGAATAATTACAATTCCAGAGATATGCTTGCTTCCAACGACCC
TTGCCCAACATCCCCCGTTGCTTCTGAATCAACAAAGGCACGACGAAACAAGCGTACAAAAGAGGGAAAGACAGTCACAACACCAAACAAGAAAGTTTCAAAAGGTCCAC
GGAAGGTCAAAAGGGAGGCTGAAGACTTGAACAAGATAATGTTGGGGAAGTCGCAAGAATGGAAGGATGGAATTGGTATAATGAGTGGAGGTGACGATCTTAATAAACAG
TTGGTAGTATCAAAATCAGATTGGAAAGGCCAGGATTTAGGATTAAACCAGGTTGCCTTTGACGAATCAACCATGCCAGCTCCAATATGTTCCTGCACAGGAGTAATAAG
ACAATGCTACAAATGGGGGAATGGTGGATGGCAATCTGCATGTTGTACTACCACCCTCTCAATGTATCCATTACCTGCCGTTCCCAACAAACGACATGCTCGGCTTGGTG
GTCGTAAAATGAGCGGAAGTGCTTTTAACAAACTGCTTAGCCGCCTTGCAGCCGAAGGCCACGATTTATCCTCTCCAGTTGATCTTAAAAATCACTGGGCAAAGCATGGA
ACAAATCGTTACATTACCATCAAGTAG
mRNA sequenceShow/hide mRNA sequence
AAATGACACAAAGCTAACAAAAAAAAAAAGAAAAAGAAAAAGAAAAAGAAAATAGAGAGAGATTCCTTTGTCCCTCTGCTCTGCTTACAACTGTGCTTTCCACAGATTTT
CAATCCTTTTCTTTTTCTCCATCTTCATCTCTTTTCCTTTTTCTTCTTCTTTCTCCAGTTCTTCATCTCTCCTTCTCACTGATGATCTGATGGAGGATGAGGTGGGGGAA
GGTGGGTCAGCCATTTGAGCTGACTAGAGGTCGAAAATTCTAACGATGGTTTCTTAAGCGGATTCTTCGTTTGATCTTCTGATTCCAGTGGAGCTTTTTCATTACTGTAT
CTGTGTTGGTTATTCGTTTTGGAGTCACAAATGGATGACAGTGGACACCGTGAGAATGGAAGGCACAAACCAGATCAGTATAAGTCAGCTCAGGGTCAGTGGATGATGCA
GCATCAGCCCTCAATGAAGCAGATAATGGCAATCATGGCCGAAAGAGATGCAGCCATTCAAGAGAGAAATTTGGCCCTCTCGGAGAAAAAGGCTGCACTAGCAGAGCGAG
ACATGGCATATCTACAGCGAGACGCTGCAATTGCAGAGAGGAACAATGCCCTTTTGGAACGAGACAATGCCATTGCTACTCTTCAGTATCGTGAAAACTCTATAAATAAC
AATTTATCATGTCCACCAGGATGCCAAATTGCTAGGGGAGTGAAGCATATACATCATCCGCAGCAGCAACACACACATCATGTGCCCCACATGAATGAGAATAATTACAA
TTCCAGAGATATGCTTGCTTCCAACGACCCTTGCCCAACATCCCCCGTTGCTTCTGAATCAACAAAGGCACGACGAAACAAGCGTACAAAAGAGGGAAAGACAGTCACAA
CACCAAACAAGAAAGTTTCAAAAGGTCCACGGAAGGTCAAAAGGGAGGCTGAAGACTTGAACAAGATAATGTTGGGGAAGTCGCAAGAATGGAAGGATGGAATTGGTATA
ATGAGTGGAGGTGACGATCTTAATAAACAGTTGGTAGTATCAAAATCAGATTGGAAAGGCCAGGATTTAGGATTAAACCAGGTTGCCTTTGACGAATCAACCATGCCAGC
TCCAATATGTTCCTGCACAGGAGTAATAAGACAATGCTACAAATGGGGGAATGGTGGATGGCAATCTGCATGTTGTACTACCACCCTCTCAATGTATCCATTACCTGCCG
TTCCCAACAAACGACATGCTCGGCTTGGTGGTCGTAAAATGAGCGGAAGTGCTTTTAACAAACTGCTTAGCCGCCTTGCAGCCGAAGGCCACGATTTATCCTCTCCAGTT
GATCTTAAAAATCACTGGGCAAAGCATGGAACAAATCGTTACATTACCATCAAGTAGAGTCACTAGCTTATTAAGGAAGAGTAGCTTACAAAGAGTAGAAAAAAAATGGA
AGGAATTCCCAAACTAGCCAAGGCAAACAACTGCCTCAGAATTTCCTTTCAGTTCCAGCAAAAGAATAAAGGATAGCATAACATCCATAAAGAAAAGGAGTTGGAAGCTG
TCTCTTGCCTTCAACTTCAAAGGTTTCTCGGCCTCTCTCACATTCATCAAGTTTGTTCGAGAGTAAGGACAGAGCCAAGTTCTTCAAAAAGCCATCAAAATTCTTTACTT
TCCTCTTTTTTTTGAAGTGCTTTTCTCTGTAAAGCCATGGCGAGATCCTACTTGGTTCAAATAACTGTAAAGGTATGTGTTTATTGGTATTATGTTGAAGGTTGTACTTA
AATATCAGAATGTGACCCGTCATCATTTTGTAGTATGAAGTCTGTATTATCTCACAAGATGCATCTGTTTTATGCATCTCCAGTGAAACTCAATCCATTAGGTACAATGT
TATCATCATCAGCAGCTCATAAACTCTTCCTCTGTCAATATCTTAAATGTTCGTCTTAGGTTATTTAACAGTCATATTCATCTCCCAGTTAATCAAACCTGTAATTGAAT
TAGCCAAGCAGATTGGTTTGAAGGGTAACATTGGTATTTGGGGTTGAGAAGAGTAAGATAAAAAAAAGTTGCAATGGGTTTTCGAGTTTATGAACTAAAAGCAAAATCTT
CATCAATTATTGAAAATTGCTTAATCAAGTTTATGTCAGATCATGAATCATCAGCATCGAAAAATCAGACAGGGATAACTGATATTTCATGTTTTCATCACCCACATCAC
TTGAACAAGAATTAAACACCAAACTTTACAATATCCCTTAAAATCGGCAATTCACTCCTAAACAATTGAATAAGAAAAAAATCCAAAGAAAATATTTCATGAAAAACTGA
TGAATGAGTTTCAGAGTAGAAAAGCAAAAGGGTGGTGAGCCTCAGACCGTCCTAGTTCAAAAAAGTTTCAACACCAATGCAAATATTACATTTTTGCATCTCTGTCAACC
AAAAAACTGTTCATCACCGGCCACACCTAAGAACAAAGTGAACAAAAGAAGTAAAACAAACCCAAAGGAAAAAAAACCGATGAAAAAACAATAATTTCCAAAAACTATAC
ACTGTCAGAGTTTGAAATATCAGCACTTGGAATCAGCCAGGGATAACTGTACATTTCATGGTTTCTCATCATCCAGAGGAAGAAAAAATCACAAGCAGAAACACTTTCAA
ATAAAACAGCGAGTTCAAATCCATTAAAATTGAGTTTTAAAGAAAAACGGGAGCGGAAGAAAACTAAATGAATTTGCGTAGGAGAGGCATCAGATCTTCCTATACAAAGT
GGTTAATCACAATGCTCCAAGTTCGTAAACTAGAGCATTCTACCATTCGTAAGTTCGTCATTTGCGCTCTTCCGAGTACGAACCCAAAATCAATTGCAGAAGTCGCAAGA
TGAAAAGCCGATCGCACCGGCGTAGTCTGGAAGATCTAAACATGAACGTGAAAAAGAAAGAACTGAATTGAAGAAGACATGAAAAAGAAAATCAAAGCAGCTGAGGATCT
TGGAAAGGGCGGCGTTCTACAAT
Protein sequenceShow/hide protein sequence
MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINNNLSCPPGCQI
ARGVKHIHHPQQQHTHHVPHMNENNYNSRDMLASNDPCPTSPVASESTKARRNKRTKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGKSQEWKDGIGIMSGGDDLNKQ
LVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEGHDLSSPVDLKNHWAKHG
TNRYITIK