; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Chy3G063790 (gene) of Cucumber (hystrix) v1 genome

Gene IDChy3G063790
OrganismCucumis hystrix (Cucumber (hystrix) v1)
DescriptionGAGA-binding transcriptional activator
Genome locationchrH03:16759840..16769247
RNA-Seq ExpressionChy3G063790
SyntenyChy3G063790
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009723 - response to ethylene (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0042803 - protein homodimerization activity (molecular function)
InterPro domainsIPR010409 - GAGA-binding transcriptional activator


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004152657.1 protein BASIC PENTACYSTEINE6 [Cucumis sativus]2.53e-24599.41Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTV TPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMS GDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_008444782.1 PREDICTED: protein BASIC PENTACYSTEINE6 [Cucumis melo]5.10e-24599.11Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSR+MLASNDPCPTSPVASESTKARRNKR KEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_022144218.1 protein BASIC PENTACYSTEINE6 [Momordica charantia]2.82e-22292.42Show/hide
Query:  MDDSGHRENGRHKPD-QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-
        MDDSGHRENGRHK + QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK ALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ 
Subjt:  MDDSGHRENGRHKPD-QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-

Query:  NNNLSCPPGCQIARGVKHIHHPQQQ---HTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNK
        +NNLSCPPGCQIARGVKHIHHPQQQ   HTHHVPHMNE  YN REM   ND CP SPVASESTKARRNKR KE K VTTPNKKVSK PRKVKREAEDLNK
Subjt:  NNNLSCPPGCQIARGVKHIHHPQQQ---HTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNK

Query:  IMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGR
        IMLGKSQEWKD IGIMSGGDDLNKQLVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+GGR
Subjt:  IMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGR

Query:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_038885713.1 protein BASIC PENTACYSTEINE6 isoform X1 [Benincasa hispida]1.98e-23795.36Show/hide
Query:  VLESQMDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRE
        VLESQMDDSGHRENGRHKPDQYK AQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRE
Subjt:  VLESQMDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRE

Query:  NSI--NNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDL
        NS+  NNN+SCPPGCQIARGVKHIHHPQQQHTHHVPHMNE+NYNSREMLASNDPCP SPVASESTKARRNKR KEGKTVTTP+KKV +GPRKVKRE EDL
Subjt:  NSI--NNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDL

Query:  NKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLG
        NKIMLGKSQEWKDGIGIM GGDDLNKQLV SKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLP VPNKRHARLG
Subjt:  NKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLG

Query:  GRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_038885714.1 protein BASIC PENTACYSTEINE6 isoform X2 [Benincasa hispida]4.83e-23495.29Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDDSGHRENGRHKPDQYK AQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  NNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIML
        NNN+SCPPGCQIARGVKHIHHPQQQHTHHVPHMNE+NYNSREMLASNDPCP SPVASESTKARRNKR KEGKTVTTP+KKV +GPRKVKRE EDLNKIML
Subjt:  NNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIML

Query:  GKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMS
        GKSQEWKDGIGIM GGDDLNKQLV SKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLP VPNKRHARLGGRKMS
Subjt:  GKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMS

Query:  GSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

TrEMBL top hitse value%identityAlignment
A0A0A0LLL3 GAGA-binding transcriptional activator2.8e-19299.41Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTV TPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMS GDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A1S3BBW9 GAGA-binding transcriptional activator4.8e-19299.11Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSR+MLASNDPCPTSPVASESTKARRNKR KEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A5A7VDN8 GAGA-binding transcriptional activator4.8e-19299.11Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
        MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSINN

Query:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
        NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSR+MLASNDPCPTSPVASESTKARRNKR KEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK
Subjt:  NLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGK

Query:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
        SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS
Subjt:  SQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGS

Query:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        AFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  AFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A6J1CR14 GAGA-binding transcriptional activator9.1e-17592.42Show/hide
Query:  MDDSGHRENGRHKPD-QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-
        MDDSGHRENGRHK + QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK ALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ 
Subjt:  MDDSGHRENGRHKPD-QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-

Query:  NNNLSCPPGCQIARGVKHIHHP---QQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNK
        +NNLSCPPGCQIARGVKHIHHP   QQQHTHHVPHMNE  YN REM   ND CP SPVASESTKARRNKR KE K VTTPNKKVSK PRKVKREAEDLNK
Subjt:  NNNLSCPPGCQIARGVKHIHHP---QQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLNK

Query:  IMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGR
        IMLGKSQEWKD IGIMSGGDDLNKQLVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+GGR
Subjt:  IMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGR

Query:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A6J1KJF5 GAGA-binding transcriptional activator3.8e-17391.62Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-N
        MDDSGHRENGRHK +QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAA+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ N
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI-N

Query:  NNLSCPPGCQIARGVKHIHHP----QQQHTH-HVPHMNENNYNSREMLASNDPCPTSP-VASESTKARRNKRPKE-GKTVTTPNKKVSKGPRKVKREAED
        NNLSCPPGCQIARGVKHIHHP    QQQHTH HVPHMNE+NYNSRE L +ND CP  P VASESTK RRNKR KE  KTVT PNKKVSK PRKVKREAED
Subjt:  NNLSCPPGCQIARGVKHIHHP----QQQHTH-HVPHMNENNYNSREMLASNDPCPTSP-VASESTKARRNKRPKE-GKTVTTPNKKVSKGPRKVKREAED

Query:  LNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARL
        LNKIMLGKSQEWKDGIGI+SGGDDLNK LVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+
Subjt:  LNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARL

Query:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

SwissProt top hitse value%identityAlignment
F4JUI3 Protein BASIC PENTACYSTEINE56.9e-7145.35Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM----QHQPSM--KQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYR
        M+  G  ENGR+KPD YK  Q   +M    QH   +  K+I++I+AERDAA++ERN A++  K ALA RD A  QRD A++ER+NA++E ++A+  L+YR
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM----QHQPSM--KQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYR

Query:  ENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLN
        EN++N  LSC       RG       ++ H                     +P P S +  E+   R  KR KE K              + K+  EDLN
Subjt:  ENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREAEDLN

Query:  KIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGG
        + +    ++                    S+ DW   D+    V FDE TMP P+C+CTG  RQCYKWGNGGWQS+CCTTTLS YPLP +PNKRH+R+GG
Subjt:  KIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGG

Query:  RKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        RKMSGS F++LLSRLA EGH+LS+PVDLKN+WA+HGTNRYITIK
Subjt:  RKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

O82286 Protein BASIC PENTACYSTEINE71.1e-3645.4Show/hide
Query:  RPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGN
        R     T T   K V++ P          +K +  K Q  K  +   S        +  +K + K  D+ ++  +FD S +P P+CSCTGV R CYKWG 
Subjt:  RPKEGKTVTTPNKKVSKGPRKVKREAEDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGN

Query:  GGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GGWQS+CCT ++S YPLP    +  ARL GRKMS  A+ KLL+RLA EG+DLS P+DLKNHWA+HGTN+++TIK
Subjt:  GGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

Q5VSA8 Barley B recombinant-like protein D8.7e-9054.86Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM-QHQPSMK-----QIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQY-
        MD+ GHRENGR +PDQYK    QWMM Q Q  +K      ++A+M +RD AI+ER+ AL+EKKAA+AERDMA+ QRDAA+AERN A++ERDNA+A L+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM-QHQPSMK-----QIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQY-

Query:  REN--SINNNLSCPPGCQIARGVKHIHHPQQ-QHTHHVP-HMNENNY-NSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKR
        R N  ++NN    P G     G K+IHH  Q  H    P  + ++ Y ++REM  S       P+++    A + KRPK+  +  +P K+ S   RK K+
Subjt:  REN--SINNNLSCPPGCQIARGVKHIHHPQQ-QHTHHVP-HMNENNY-NSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKR

Query:  EAEDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKR
         + D           WK+ +G+   GDD +    V K++WK Q+LGLNQVAFD+STMPAP CSCTG +RQCYKWGNGGWQS+CCT  +SMYPLP +PNKR
Subjt:  EAEDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKR

Query:  HARLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        HAR+GGRKMSG AF KLLSRLAAEGHDLS PVDLK+HWAKHGTNRYITI+
Subjt:  HARLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

Q8L999 Protein BASIC PENTACYSTEINE63.0e-12266.48Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDD GHRENGRHK     + QGQW+MQHQPSMKQ+M+I+AERDAAIQERNLA+SEKKAA+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS---ESTKARRNKRPK-EGKTVTTPNKKVSKGPRKVKR
            N  +CPPGCQI+RGVKH+HHP   H    HH+P + EN Y +REM   ND  PTSP A    ES K +R KR   +  T T  NK+  K  RKVK+
Subjt:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS---ESTKARRNKRPK-EGKTVTTPNKKVSKGPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV
        E+E DLNKIM  K + ++ D        +D +K +++ SKSDWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTTLSMYPLPA+
Subjt:  EAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV

Query:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        PNKRHAR+GGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK

Q8S8C6 Protein BASIC PENTACYSTEINE44.6e-7548.28Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT
        M++ G  +N R KPD +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A 
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT

Query:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREA
        LQ+ ENS+N  LS    C        I  P +                        P  T P    +TK   NKR KE K          +G  KVK+  
Subjt:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREA

Query:  EDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA
        EDLN+ +    ++                    S++DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTTLS YPLP +PNKRH+
Subjt:  EDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA

Query:  RLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  RLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

Arabidopsis top hitse value%identityAlignment
AT2G21240.1 basic pentacysteine 43.3e-7648.28Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT
        M++ G  +N R KPD +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A 
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT

Query:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREA
        LQ+ ENS+N  LS    C        I  P +                        P  T P    +TK   NKR KE K          +G  KVK+  
Subjt:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREA

Query:  EDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA
        EDLN+ +    ++                    S++DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTTLS YPLP +PNKRH+
Subjt:  EDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA

Query:  RLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  RLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

AT2G21240.2 basic pentacysteine 43.3e-7648.28Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT
        M++ G  +N R KPD +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A 
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIAT

Query:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREA
        LQ+ ENS+N  LS    C        I  P +                        P  T P    +TK   NKR KE K          +G  KVK+  
Subjt:  LQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVTTPNKKVSKGPRKVKREA

Query:  EDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA
        EDLN+ +    ++                    S++DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTTLS YPLP +PNKRH+
Subjt:  EDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHA

Query:  RLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  RLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

AT5G42520.1 basic pentacysteine 62.1e-12366.48Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDD GHRENGRHK     + QGQW+MQHQPSMKQ+M+I+AERDAAIQERNLA+SEKKAA+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS---ESTKARRNKRPK-EGKTVTTPNKKVSKGPRKVKR
            N  +CPPGCQI+RGVKH+HHP   H    HH+P + EN Y +REM   ND  PTSP A    ES K +R KR   +  T T  NK+  K  RKVK+
Subjt:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS---ESTKARRNKRPK-EGKTVTTPNKKVSKGPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV
        E+E DLNKIM  K + ++ D        +D +K +++ SKSDWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTTLSMYPLPA+
Subjt:  EAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV

Query:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        PNKRHAR+GGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK

AT5G42520.2 basic pentacysteine 64.1e-11965.63Show/hide
Query:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--
        MDD GHRENGRHK     + QG    QHQPSMKQ+M+I+AERDAAIQERNLA+SEKKAA+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+  
Subjt:  MDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSI--

Query:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS---ESTKARRNKRPK-EGKTVTTPNKKVSKGPRKVKR
            N  +CPPGCQI+RGVKH+HHP   H    HH+P + EN Y +REM   ND  PTSP A    ES K +R KR   +  T T  NK+  K  RKVK+
Subjt:  ---NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS---ESTKARRNKRPK-EGKTVTTPNKKVSKGPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV
        E+E DLNKIM  K + ++ D        +D +K +++ SKSDWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTTLSMYPLPA+
Subjt:  EAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAV

Query:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        PNKRHAR+GGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  PNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK

AT5G42520.3 basic pentacysteine 67.0e-9563.92Show/hide
Query:  MAYLQRDAAIAERNNALLERDNAIATLQYRENSI-----NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS-
        MA+LQRD AIAERNNA++ERD+A+  LQYRENS+      N  +CPPGCQI+RGVKH+HHP   H    HH+P + EN Y +REM   ND  PTSP A  
Subjt:  MAYLQRDAAIAERNNALLERDNAIATLQYRENSI-----NNNLSCPPGCQIARGVKHIHHPQQQH---THHVPHMNENNYNSREMLASNDPCPTSPVAS-

Query:  --ESTKARRNKRPK-EGKTVTTPNKKVSKGPRKVKREAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAP
          ES K +R KR   +  T T  NK+  K  RKVK+E+E DLNKIM  K + ++ D        +D +K +++ SKSDWK Q++ GLNQV +DE+TMP P
Subjt:  --ESTKARRNKRPK-EGKTVTTPNKKVSKGPRKVKREAE-DLNKIMLGK-SQEWKDGIGIMSGGDDLNKQLVV-SKSDWKGQDL-GLNQVAFDESTMPAP

Query:  ICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        +CSCTGV+RQCYKWGNGGWQS+CCTTTLSMYPLPA+PNKRHAR+GGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  ICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPAVPNKRHARLGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGGAGTAAGGGTTACGTGTGGGCAATTTCAGCCGGTCTCAACGCTGCGTTTGCCGCCATTGCAGCCAAGCTTTTCTCCTATACGTTAATTAGGTATGGTCTGGT
AATTGCGTTCAACTTGGCAATGTGGGGATGTTATGTCAATAGCTTAAAAGCTCTCTCGTCATTACAAGCTACAGGGACAAACTTTTCAGCTAACTTTCTTTGTTCTGGTC
TAGCCGGTTTCTTGTTGTTCGAAGAAGCATTATCATTTCGGCCGTTACAGTTTCAAGCATCTAACACATTCAAACCTACATCTCTTAACATCAATGCACTCTTTTCCAGT
GGTGAAGTATCTACTGCTCCGAACTTTCGTGTTTTGGCTTATGTACATCTTGTTTCTGCTTATGATCACTTGCTTAGACTGTGGATAAATGGAGCTTTTTCATTACTGTA
TCTGTGTTGGTTGTTCGTTTTGGAGTCACAAATGGATGACAGTGGACACCGTGAGAATGGAAGGCACAAACCAGATCAGTATAAGTCAGCTCAGGGTCAGTGGATGATGC
AGCATCAGCCCTCAATGAAGCAGATAATGGCAATTATGGCTGAAAGAGATGCAGCCATTCAAGAAAGAAATTTGGCCCTCTCGGAGAAAAAGGCTGCACTAGCAGAGCGA
GACATGGCATATCTACAGCGAGACGCTGCAATTGCAGAGAGGAACAATGCCCTTTTGGAACGAGACAACGCCATTGCTACTCTTCAGTATCGTGAAAACTCCATAAATAA
CAATTTATCATGTCCACCAGGATGCCAAATTGCTAGGGGAGTGAAGCATATACATCATCCACAGCAGCAACACACACATCATGTGCCCCACATGAATGAGAATAATTACA
ATTCAAGAGAAATGCTTGCTTCCAACGACCCTTGCCCAACATCCCCTGTCGCTTCTGAATCAACAAAGGCACGACGAAACAAGCGTCCAAAAGAGGGAAAGACAGTCACA
ACACCAAACAAGAAAGTTTCAAAAGGTCCACGGAAGGTCAAAAGGGAGGCTGAAGACTTGAACAAAATAATGTTGGGGAAGTCACAAGAATGGAAGGATGGAATTGGTAT
AATGAGTGGAGGTGACGATCTTAATAAACAGTTGGTAGTATCAAAATCAGATTGGAAAGGGCAGGATTTAGGATTAAACCAGGTTGCATTTGACGAATCAACCATGCCAG
CTCCAATATGCTCCTGCACAGGAGTAATAAGACAATGCTACAAATGGGGAAATGGTGGATGGCAATCTGCATGTTGTACTACCACCCTCTCAATGTATCCATTACCTGCC
GTTCCCAACAAACGACACGCTCGGCTCGGCGGTCGGAAAATGAGTGGAAGTGCTTTTAACAAACTGCTTAGCCGCCTTGCAGCCGAAGGCCACGACCTATCCGCTCCAGT
TGATCTTAAAAATCACTGGGCAAAGCATGGAACAAATCGTTACATCACCATCAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGGAGTAAGGGTTACGTGTGGGCAATTTCAGCCGGTCTCAACGCTGCGTTTGCCGCCATTGCAGCCAAGCTTTTCTCCTATACGTTAATTAGGTATGGTCTGGT
AATTGCGTTCAACTTGGCAATGTGGGGATGTTATGTCAATAGCTTAAAAGCTCTCTCGTCATTACAAGCTACAGGGACAAACTTTTCAGCTAACTTTCTTTGTTCTGGTC
TAGCCGGTTTCTTGTTGTTCGAAGAAGCATTATCATTTCGGCCGTTACAGTTTCAAGCATCTAACACATTCAAACCTACATCTCTTAACATCAATGCACTCTTTTCCAGT
GGTGAAGTATCTACTGCTCCGAACTTTCGTGTTTTGGCTTATGTACATCTTGTTTCTGCTTATGATCACTTGCTTAGACTGTGGATAAATGGAGCTTTTTCATTACTGTA
TCTGTGTTGGTTGTTCGTTTTGGAGTCACAAATGGATGACAGTGGACACCGTGAGAATGGAAGGCACAAACCAGATCAGTATAAGTCAGCTCAGGGTCAGTGGATGATGC
AGCATCAGCCCTCAATGAAGCAGATAATGGCAATTATGGCTGAAAGAGATGCAGCCATTCAAGAAAGAAATTTGGCCCTCTCGGAGAAAAAGGCTGCACTAGCAGAGCGA
GACATGGCATATCTACAGCGAGACGCTGCAATTGCAGAGAGGAACAATGCCCTTTTGGAACGAGACAACGCCATTGCTACTCTTCAGTATCGTGAAAACTCCATAAATAA
CAATTTATCATGTCCACCAGGATGCCAAATTGCTAGGGGAGTGAAGCATATACATCATCCACAGCAGCAACACACACATCATGTGCCCCACATGAATGAGAATAATTACA
ATTCAAGAGAAATGCTTGCTTCCAACGACCCTTGCCCAACATCCCCTGTCGCTTCTGAATCAACAAAGGCACGACGAAACAAGCGTCCAAAAGAGGGAAAGACAGTCACA
ACACCAAACAAGAAAGTTTCAAAAGGTCCACGGAAGGTCAAAAGGGAGGCTGAAGACTTGAACAAAATAATGTTGGGGAAGTCACAAGAATGGAAGGATGGAATTGGTAT
AATGAGTGGAGGTGACGATCTTAATAAACAGTTGGTAGTATCAAAATCAGATTGGAAAGGGCAGGATTTAGGATTAAACCAGGTTGCATTTGACGAATCAACCATGCCAG
CTCCAATATGCTCCTGCACAGGAGTAATAAGACAATGCTACAAATGGGGAAATGGTGGATGGCAATCTGCATGTTGTACTACCACCCTCTCAATGTATCCATTACCTGCC
GTTCCCAACAAACGACACGCTCGGCTCGGCGGTCGGAAAATGAGTGGAAGTGCTTTTAACAAACTGCTTAGCCGCCTTGCAGCCGAAGGCCACGACCTATCCGCTCCAGT
TGATCTTAAAAATCACTGGGCAAAGCATGGAACAAATCGTTACATCACCATCAAGTAG
Protein sequenceShow/hide protein sequence
MEGSKGYVWAISAGLNAAFAAIAAKLFSYTLIRYGLVIAFNLAMWGCYVNSLKALSSLQATGTNFSANFLCSGLAGFLLFEEALSFRPLQFQASNTFKPTSLNINALFSS
GEVSTAPNFRVLAYVHLVSAYDHLLRLWINGAFSLLYLCWLFVLESQMDDSGHRENGRHKPDQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKAALAER
DMAYLQRDAAIAERNNALLERDNAIATLQYRENSINNNLSCPPGCQIARGVKHIHHPQQQHTHHVPHMNENNYNSREMLASNDPCPTSPVASESTKARRNKRPKEGKTVT
TPNKKVSKGPRKVKREAEDLNKIMLGKSQEWKDGIGIMSGGDDLNKQLVVSKSDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTLSMYPLPA
VPNKRHARLGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK