; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC08g0729 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC08g0729
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionGAGA-binding transcriptional activator
Genome locationMC08:5915172..5919847
RNA-Seq ExpressionMC08g0729
SyntenyMC08g0729
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009723 - response to ethylene (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0042803 - protein homodimerization activity (molecular function)
InterPro domainsIPR010409 - GAGA-binding transcriptional activator


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7020463.1 Protein BASIC PENTACYSTEINE6, partial [Cucurbita argyrosperma subsp. argyrosperma]9.08e-22492.49Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAE QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK A+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQ-HTHH-VPHMNETTYNPREMHPNDACPVSP-VASESTKARRNKRTKEA-KAVTTPNKKVSKAPRKVKREAED
        +NNLSCPPGCQIARGVKHIHHPQQQQQQ HTHH VPHMN++ YN RE+H NDACPVS  VASESTK RRNKRTKE  K +T PNKKVSKAPRKVKREAED
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQ-HTHH-VPHMNETTYNPREMHPNDACPVSP-VASESTKARRNKRTKEA-KAVTTPNKKVSKAPRKVKREAED

Query:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV
        LNKIMLGKSQEWKD IGI+SGGDDLNK LVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHARV
Subjt:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV

Query:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_022144218.1 protein BASIC PENTACYSTEINE6 [Momordica charantia]4.06e-249100Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKI
        SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKI
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKI

Query:  MLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRK
        MLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRK
Subjt:  MLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRK

Query:  MSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        MSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  MSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_022951895.1 protein BASIC PENTACYSTEINE6-like [Cucurbita moschata]6.63e-22492.8Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAE QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK A+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQ--HTHH-VPHMNETTYNPREMHPNDACPVSP-VASESTKARRNKRTKEA-KAVTTPNKKVSKAPRKVKREAE
        +NNLSCPPGCQIARGVKHIHHPQQQQQQ  HTHH VPHMNE+ YN RE+H NDACPVS  VASESTKARRNKRTKE  K VT PNKKVSKAPRKVKREAE
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQ--HTHH-VPHMNETTYNPREMHPNDACPVSP-VASESTKARRNKRTKEA-KAVTTPNKKVSKAPRKVKREAE

Query:  DLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHAR
        DLNKI LGKSQEWKD IGI+SGGDDLNK LVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR
Subjt:  DLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHAR

Query:  VGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        VGGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  VGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_023002409.1 protein BASIC PENTACYSTEINE6-like [Cucurbita maxima]1.11e-22492.77Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAE QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK A+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQ-HTH-HVPHMNETTYNPREMHPNDACPVSP-VASESTKARRNKRTKEA-KAVTTPNKKVSKAPRKVKREAED
        +NNLSCPPGCQIARGVKHIHHPQQQQQQ HTH HVPHMNE+ YN RE+H NDACP+ P VASESTK RRNKRTKE  K VT PNKKVSKAPRKVKREAED
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQ-HTH-HVPHMNETTYNPREMHPNDACPVSP-VASESTKARRNKRTKEA-KAVTTPNKKVSKAPRKVKREAED

Query:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV
        LNKIMLGKSQEWKD IGI+SGGDDLNK LVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHARV
Subjt:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV

Query:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_023537342.1 protein BASIC PENTACYSTEINE6-like [Cucurbita pepo subsp. pepo]5.48e-22593.06Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAE QYKSAQGQWMMQHQPSMKQIMAIMAERDAA+QERNLALSEKK A+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQ-HTHH-VPHMNETTYNPREMHPNDACPVSP-VASESTKARRNKRTKEA-KAVTTPNKKVSKAPRKVKREAED
        +NNLSCPPGCQIARGVKHIHHPQQQQQQ HTHH VPHMNE+TYN RE+H NDACPVS  VASESTK RRNKRTKE  K VT PNKKVSKAPRKVKREAED
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQ-HTHH-VPHMNETTYNPREMHPNDACPVSP-VASESTKARRNKRTKEA-KAVTTPNKKVSKAPRKVKREAED

Query:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV
        LNKIMLGKSQEWKD IGI+SGGDDLNK LVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHARV
Subjt:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV

Query:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

TrEMBL top hitse value%identityAlignment
A0A1S3BBW9 GAGA-binding transcriptional activator5.61e-22392.13Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHK + QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK ALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ 
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREM-HPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNK
        +NNLSCPPGCQIARGVKHIHHPQQQ   HTHHVPHMNE  YN R+M   ND CP SPVASESTKARRNKRTKE K VTTPNKKVSK PRKVKREAEDLNK
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREM-HPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNK

Query:  IMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGR
        IMLGKSQEWKD IGIMSGGDDLNKQLVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+GGR
Subjt:  IMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGR

Query:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        KMSGSAFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A5A7VDN8 GAGA-binding transcriptional activator5.61e-22392.13Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHK + QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK ALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ 
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREM-HPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNK
        +NNLSCPPGCQIARGVKHIHHPQQQ   HTHHVPHMNE  YN R+M   ND CP SPVASESTKARRNKRTKE K VTTPNKKVSK PRKVKREAEDLNK
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREM-HPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNK

Query:  IMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGR
        IMLGKSQEWKD IGIMSGGDDLNKQLVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+GGR
Subjt:  IMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGR

Query:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        KMSGSAFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A6J1CR14 GAGA-binding transcriptional activator1.96e-249100Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKI
        SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKI
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKI

Query:  MLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRK
        MLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRK
Subjt:  MLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRK

Query:  MSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        MSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  MSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A6J1GIX2 GAGA-binding transcriptional activator3.21e-22492.8Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAE QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK A+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQ--HTHH-VPHMNETTYNPREMHPNDACPVSP-VASESTKARRNKRTKEA-KAVTTPNKKVSKAPRKVKREAE
        +NNLSCPPGCQIARGVKHIHHPQQQQQQ  HTHH VPHMNE+ YN RE+H NDACPVS  VASESTKARRNKRTKE  K VT PNKKVSKAPRKVKREAE
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQ--HTHH-VPHMNETTYNPREMHPNDACPVSP-VASESTKARRNKRTKEA-KAVTTPNKKVSKAPRKVKREAE

Query:  DLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHAR
        DLNKI LGKSQEWKD IGI+SGGDDLNK LVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR
Subjt:  DLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHAR

Query:  VGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        VGGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  VGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A6J1KJF5 GAGA-binding transcriptional activator5.35e-22592.77Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAE QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK A+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQ-HTH-HVPHMNETTYNPREMHPNDACPVSP-VASESTKARRNKRTKEA-KAVTTPNKKVSKAPRKVKREAED
        +NNLSCPPGCQIARGVKHIHHPQQQQQQ HTH HVPHMNE+ YN RE+H NDACP+ P VASESTK RRNKRTKE  K VT PNKKVSKAPRKVKREAED
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQ-HTH-HVPHMNETTYNPREMHPNDACPVSP-VASESTKARRNKRTKEA-KAVTTPNKKVSKAPRKVKREAED

Query:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV
        LNKIMLGKSQEWKD IGI+SGGDDLNK LVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHARV
Subjt:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV

Query:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

SwissProt top hitse value%identityAlignment
F4JUI3 Protein BASIC PENTACYSTEINE52.1e-6644.25Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMM----QHQPSM--KQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQY
        M+  G  ENGR+K ++ YK  Q   +M    QH   +  K+I++I+AERDAA++ERN A++  K ALA RD A  QRD A++ER+NA++E ++A+  L+Y
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMM----QHQPSM--KQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQY

Query:  RENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREA
        REN+L +  LSC       RG                         +   E H  +  P+S +  E+   R  KR KE+K              + K+  
Subjt:  RENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREA

Query:  EDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHA
        EDLN+ +    ++                    S+ DW   D+    V FDE TMP P+C+CTG  RQCYKWGNGGWQS+CCTTT+S YPLP +PNKRH+
Subjt:  EDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHA

Query:  RVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        RVGGRKMSGS F++LLSRLA EGH+LS+PVDLKN+WA+HGTNRYITIK
Subjt:  RVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

Q5VSA8 Barley B recombinant-like protein D2.1e-8753.87Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMM-QHQPSMK-----QIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQY
        MD+ GHRENGR + + QYK    QWMM Q Q  +K      ++A+M +RD AI+ER+ AL+EKK A+AERDMA+ QRDAA+AERN A++ERDNA+A L+ 
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMM-QHQPSMK-----QIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQY

Query:  -RENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKRE
         R N L  NN +  P   ++ G K+IHH  Q     +  +   +    + REMH ++A P+S     + KA   KR K+  +  +P K+ S   RK K+ 
Subjt:  -RENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKRE

Query:  AEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRH
        + D           WK+ +G+   GDD +    V K +WK Q+LGLNQVAFD+STMPAP CSCTG +RQCYKWGNGGWQS+CCT  +SMYPLP +PNKRH
Subjt:  AEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRH

Query:  ARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        AR+GGRKMSG AF KLLSRLAAEGHDLS PVDLK+HWAKHGTNRYITI+
Subjt:  ARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

Q8L999 Protein BASIC PENTACYSTEINE61.4e-12366.1Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSL-
        MDD GHRENGRHKA     + QGQW+MQHQPSMKQ+M+I+AERDAAIQERNLA+SEKK A+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+ 
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSL-

Query:  ---TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVAS---ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKR
            +N  +CPPGCQI+RGVKH+HHP        HH+P + E  Y  REM PND  P SP A    ES K +R KR   +A   T  NK+  K  RKVK+
Subjt:  ---TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVAS---ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP
        E+E DLNKIM  K + ++ DE       D     L+ SK DWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTT+SMYPLPA+P
Subjt:  EAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP

Query:  NKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        NKRHARVGGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  NKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK

Q8S8C6 Protein BASIC PENTACYSTEINE44.8e-7145.89Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIA
        M++ G  +N R K ++ +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIA

Query:  TLQYRENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKAR-RNKRTKEAKAVTTPNKKVSKAPRK
         LQ+ ENSL   N +   G  +                         +  +   E H  +  P+S +  E T  +  NKR KE K          +   K
Subjt:  TLQYRENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKAR-RNKRTKEAKAVTTPNKKVSKAPRK

Query:  VKREAEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP
        VK+  EDLN+ +    ++                    S+ DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTT+S YPLP +P
Subjt:  VKREAEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP

Query:  NKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        NKRH+R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  NKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

Q9C9X6 Protein BASIC PENTACYSTEINE31.1e-3557.01Show/hide
Query:  DLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGT
        ++ +N V+ D   +P P+CSCTG+ +QCY+WG GGWQSACCTT +SMYPLP    +R AR+ GRKMS  AF K+L +L+++G D S P+DLK+HWAKHGT
Subjt:  DLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGT

Query:  NRYITIK
        N+++TI+
Subjt:  NRYITIK

Arabidopsis top hitse value%identityAlignment
AT2G21240.1 basic pentacysteine 43.4e-7245.89Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIA
        M++ G  +N R K ++ +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIA

Query:  TLQYRENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKAR-RNKRTKEAKAVTTPNKKVSKAPRK
         LQ+ ENSL   N +   G  +                         +  +   E H  +  P+S +  E T  +  NKR KE K          +   K
Subjt:  TLQYRENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKAR-RNKRTKEAKAVTTPNKKVSKAPRK

Query:  VKREAEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP
        VK+  EDLN+ +    ++                    S+ DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTT+S YPLP +P
Subjt:  VKREAEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP

Query:  NKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        NKRH+R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  NKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

AT2G21240.2 basic pentacysteine 43.4e-7245.89Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIA
        M++ G  +N R K ++ +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIA

Query:  TLQYRENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKAR-RNKRTKEAKAVTTPNKKVSKAPRK
         LQ+ ENSL   N +   G  +                         +  +   E H  +  P+S +  E T  +  NKR KE K          +   K
Subjt:  TLQYRENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKAR-RNKRTKEAKAVTTPNKKVSKAPRK

Query:  VKREAEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP
        VK+  EDLN+ +    ++                    S+ DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTT+S YPLP +P
Subjt:  VKREAEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP

Query:  NKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        NKRH+R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  NKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

AT5G42520.1 basic pentacysteine 61.0e-12466.1Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSL-
        MDD GHRENGRHKA     + QGQW+MQHQPSMKQ+M+I+AERDAAIQERNLA+SEKK A+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+ 
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSL-

Query:  ---TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVAS---ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKR
            +N  +CPPGCQI+RGVKH+HHP        HH+P + E  Y  REM PND  P SP A    ES K +R KR   +A   T  NK+  K  RKVK+
Subjt:  ---TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVAS---ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP
        E+E DLNKIM  K + ++ DE       D     L+ SK DWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTT+SMYPLPA+P
Subjt:  EAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP

Query:  NKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        NKRHARVGGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  NKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK

AT5G42520.2 basic pentacysteine 61.5e-12065.25Show/hide
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSL-
        MDD GHRENGRHKA     + QG    QHQPSMKQ+M+I+AERDAAIQERNLA+SEKK A+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+ 
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSL-

Query:  ---TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVAS---ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKR
            +N  +CPPGCQI+RGVKH+HHP        HH+P + E  Y  REM PND  P SP A    ES K +R KR   +A   T  NK+  K  RKVK+
Subjt:  ---TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVAS---ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP
        E+E DLNKIM  K + ++ DE       D     L+ SK DWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTT+SMYPLPA+P
Subjt:  EAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP

Query:  NKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        NKRHARVGGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  NKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK

AT5G42520.3 basic pentacysteine 68.8e-9763.67Show/hide
Query:  MAYLQRDAAIAERNNALLERDNAIATLQYRENSL----TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVAS--
        MA+LQRD AIAERNNA++ERD+A+  LQYRENS+     +N  +CPPGCQI+RGVKH+HHP        HH+P + E  Y  REM PND  P SP A   
Subjt:  MAYLQRDAAIAERNNALLERDNAIATLQYRENSL----TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVAS--

Query:  -ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKREAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPIC
         ES K +R KR   +A   T  NK+  K  RKVK+E+E DLNKIM  K + ++ DE       D     L+ SK DWK Q++ GLNQV +DE+TMP P+C
Subjt:  -ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKREAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPIC

Query:  SCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        SCTGV+RQCYKWGNGGWQS+CCTTT+SMYPLPA+PNKRHARVGGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  SCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATGATAGTGGGCATCGTGAGAATGGAAGGCACAAAGCAGAGCATCAGTACAAGTCGGCTCAGGGTCAGTGGATGATGCAGCATCAGCCATCAATGAAGCAGATAAT
GGCAATTATGGCTGAAAGAGATGCAGCCATTCAAGAGAGAAATTTGGCCCTTTCCGAGAAGAAGACAGCATTGGCAGAGCGAGACATGGCTTATCTACAACGAGATGCTG
CAATTGCAGAGAGGAACAATGCCCTTTTGGAAAGAGACAATGCCATTGCAACTCTTCAGTATCGTGAAAACTCCCTAACGAGTAACAATTTATCATGTCCACCAGGGTGC
CAAATTGCTAGGGGAGTGAAGCATATACACCACCCACAGCAGCAGCAGCAGCAACACACGCATCATGTGCCTCACATGAACGAAACTACTTACAATCCAAGAGAAATGCA
TCCCAACGATGCCTGCCCAGTATCCCCAGTTGCTTCTGAATCAACAAAGGCAAGGAGAAACAAGCGAACAAAGGAGGCAAAAGCAGTCACAACACCTAACAAGAAGGTTT
CAAAAGCTCCAAGGAAGGTCAAAAGGGAGGCTGAAGACTTGAATAAGATAATGCTGGGCAAATCACAAGAATGGAAGGATGAAATTGGCATCATGAGTGGAGGTGATGAT
CTTAACAAACAGTTGGTAGTGTCAAAGTTGGATTGGAAGGGTCAGGATCTGGGGCTAAACCAGGTTGCCTTTGATGAATCTACAATGCCAGCCCCAATATGCTCCTGCAC
AGGAGTTATAAGACAATGCTACAAATGGGGAAATGGTGGATGGCAATCCGCATGTTGTACCACCACCATGTCAATGTATCCATTGCCTGCAGTTCCCAACAAACGACACG
CCCGAGTTGGGGGTCGAAAGATGAGCGGAAGCGCCTTTAACAAGTTGCTCAGCCGGCTTGCAGCTGAAGGTCATGATTTATCTGCTCCAGTTGATCTCAAAAATCACTGG
GCAAAGCATGGAACAAATCGTTACATCACAATTAAGTAA
mRNA sequenceShow/hide mRNA sequence
CCCGCCCGACCCGTTTTCCTGTGTAATAATAATTCCCCTATTATATTAATTATTATCATTACTCTTTTATTTCTCATAAAAACCGCAAAAAAATATTTAAACCAAAAGAA
GGGTGAAAAAAATTGTAGCAGAATTTGGAAATGTGAACAGAGACAGAAAGCTAGAAAATAAAAATGAAAAAAGAAATTGAGAAATAGAGAGAGAAAAACGAAGAGAGAGA
GAGAGATTCCTTTGTCCCTCTGCTCTGCTTACAACTGTTCTTTCCACAGATTTTCAATCCTTTTTTCCGTCTTCTTCTCTTTTTCATCAGTCGCGAGTTCTTCTTCCTCT
CTTTCCTTCTCACTGGTGATCCGATGGAGGAAGAGATGGGGAAGGTGGGTCAGCCATTAGAGCTGACTAGAGGTAAACAATTCTAGCGAATGACTTGTTCAATCTTCTGA
TTTCAGTGGAGCTTTTTATTGCTGTGTCTGTGTTGGTTATTAGTTTTTGGGAGTCATAAATGGATGATAGTGGGCATCGTGAGAATGGAAGGCACAAAGCAGAGCATCAG
TACAAGTCGGCTCAGGGTCAGTGGATGATGCAGCATCAGCCATCAATGAAGCAGATAATGGCAATTATGGCTGAAAGAGATGCAGCCATTCAAGAGAGAAATTTGGCCCT
TTCCGAGAAGAAGACAGCATTGGCAGAGCGAGACATGGCTTATCTACAACGAGATGCTGCAATTGCAGAGAGGAACAATGCCCTTTTGGAAAGAGACAATGCCATTGCAA
CTCTTCAGTATCGTGAAAACTCCCTAACGAGTAACAATTTATCATGTCCACCAGGGTGCCAAATTGCTAGGGGAGTGAAGCATATACACCACCCACAGCAGCAGCAGCAG
CAACACACGCATCATGTGCCTCACATGAACGAAACTACTTACAATCCAAGAGAAATGCATCCCAACGATGCCTGCCCAGTATCCCCAGTTGCTTCTGAATCAACAAAGGC
AAGGAGAAACAAGCGAACAAAGGAGGCAAAAGCAGTCACAACACCTAACAAGAAGGTTTCAAAAGCTCCAAGGAAGGTCAAAAGGGAGGCTGAAGACTTGAATAAGATAA
TGCTGGGCAAATCACAAGAATGGAAGGATGAAATTGGCATCATGAGTGGAGGTGATGATCTTAACAAACAGTTGGTAGTGTCAAAGTTGGATTGGAAGGGTCAGGATCTG
GGGCTAAACCAGGTTGCCTTTGATGAATCTACAATGCCAGCCCCAATATGCTCCTGCACAGGAGTTATAAGACAATGCTACAAATGGGGAAATGGTGGATGGCAATCCGC
ATGTTGTACCACCACCATGTCAATGTATCCATTGCCTGCAGTTCCCAACAAACGACACGCCCGAGTTGGGGGTCGAAAGATGAGCGGAAGCGCCTTTAACAAGTTGCTCA
GCCGGCTTGCAGCTGAAGGTCATGATTTATCTGCTCCAGTTGATCTCAAAAATCACTGGGCAAAGCATGGAACAAATCGTTACATCACAATTAAGTAACCACTCTAGTTT
ACTCACGGAGTAGTAGCATGAAGAGTAGAAAAATGGAGGGAATGGCCAAGCTAGCAAAGGACACTACCTCAAAATTTCCCTCCACTTTCAGCAAAACAATAAAGGATAGC
ATAGCATCCAAAAAAAGGTTAGCAGCTATCGATATCTTCAGTTTAATTATGGGTTCTGATCCTCTCTCTCAATCAGCAAGTTTGTTGGAGAGAATTGCATGAGCCAAGTA
CTTCAAAGCCATGTCGATTCTTTTAGCACTTTTGAAGTGGCTTCTTTCAAGCCCTTATGCAAGCCAAGTCGTCCTCCAAATAATTGTAAAGGTATGTATGTTTCTATTAT
TAAGATCAGGGTTGTATTAAAAATATCAGAGTATGGCCACCATTTTGTAGTATTCTGTATCATCTTTCAAGATGTATCTGTTGTCGATGAATCTCCAATGAAACTCAGTC
CACTAGGTAAAATTAAGACAAATTCTTGCCACTTTAATTATCTTCACCTATTAGCTCTAGTTTCTTTATTGATCCAATCAGGCCTTGCTTATGTGATTGGTGTATTTCGA
ATTTTCTTGAAATTATGAAGCTCAAATAATTGTTGGATCTTGGGTTAGGGAAAACCATTTGAGGAAACATCTAAAACTAGTTGAAAAATACTAAAGTGTGTCAGATCATG
AATCATCAGCATCGAAAAATCAGACAGGGATAACTGATATTTCTTTGTTTCATCACCCAAATCACTTGAACAAACCAGAGGTAAAAAGCTTCATTCAAAAACTAAATTAT
CACACGAATTCATTCAATTTTCAATCCAAAAGAGCAATGTGAATCAATTGAAATTGTTGAAAGAGTTTCATAACTTGTTAACAAGGGAAATCAGAAAGGTATGAGCCTCA
GACCGTCCTAGTTCAAAATATTTTCAACACAGATGCAAATATTTCAATTTTGCATCTCAGTGAACCAAAAAATTCTCCATCACTGGCCACATCTAAGAGCGAAGAGAAAA
AAAAGATAAACATGAAACGAAAGAAGAAAAACACAAAAGTAATCATTCGAAGAATCAAAACACTGTCAGAGTGTGAAATATCAGCACTGTGAATCAGCCAGGGATAACTG
TATATTTCATGTTTTCTCATCATCCAGAACCAAAAAAATAAAAGAAACAGTGATTTCAAATCCAGAAGAACAAAGTTAAAGAAAACAAAGAAATCAAGAGGAAAAAAATC
TAGACGAAATGATTTGAAGAGGCATCAGATCTTCCTATACAAAGTGGTTAATCAGCATACTCCAAGTTCGCGACTGGAGCATTCTACCATTCGAAAGTTCGTCATCTGCC
CTTTTCCGAGTTGGAACTCATAACCATTAGCAAAAATCGAAGAAAAAAAGTTGCGCTTAAACCCTAACGCCAACCAAGATCGAAAGAAAGAAGAACGAGCTGAGAAAATT
TGAAAGGGCTGGAGAAGATGAAAAGACGGCGTTTCCATGGGAGGTTCCAGTGGGTCTAGGGTTTTTTTGATAGGAAAGAAGAACAGATTGCGGGCTTTATAACAGTTCGA
CCCGGTGATTTTGGATTTGGACCGGAAATCGGACGGCTGCGATTTGTAGATGGAAAACCCGCG
Protein sequenceShow/hide protein sequence
MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLTSNNLSCPPGC
QIARGVKHIHHPQQQQQQHTHHVPHMNETTYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKIMLGKSQEWKDEIGIMSGGDD
LNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHW
AKHGTNRYITIK