CuGenDBv2

MS016881 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS016881
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: GAGA-binding transcriptional activator
Genome location: scaffold9_1:746148..747272
RNA-Seq Expression: MS016881
Synteny: MS016881
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009723 - response to ethylene (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0000976 - transcription regulatory region sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0042803 - protein homodimerization activity (molecular function)
InterPro domains: IPR010409 - GAGA-binding transcriptional activator
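
The genome location field above uses a `seqid:start..end` layout. As a minimal sketch, the snippet below splits it into its parts and computes the span length. It assumes the coordinates are 1-based and inclusive, which this page does not state; the names `GenomeLocation` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location string."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location such as 'scaffold9_1:746148..747272'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return GenomeLocation(seqid, int(start), int(end))


loc = parse_location("scaffold9_1:746148..747272")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 1125 bases on scaffold9_1.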


Homology
GenBank top hits (e value, % identity, alignment)
KAG7020463.1 Protein BASIC PENTACYSTEINE6, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 1.7e-174; % identity: 92.49)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAE QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK A+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHP-QQQQQQHT-HHVPHMNETAYNPREMHPNDACPV-SPVASESTKARRNKRTKE-AKAVTTPNKKVSKAPRKVKREAED
        +NNLSCPPGCQIARGVKHIHHP QQQQQQHT HHVPHMN++ YN RE+H NDACPV S VASESTK RRNKRTKE  K +T PNKKVSKAPRKVKREAED
Subjt:  SNNLSCPPGCQIARGVKHIHHP-QQQQQQHT-HHVPHMNETAYNPREMHPNDACPV-SPVASESTKARRNKRTKE-AKAVTTPNKKVSKAPRKVKREAED

Query:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV
        LNKIMLGKSQEWKD IGI+SGGDDLNK LVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHARV
Subjt:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV

Query:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_022144218.1 protein BASIC PENTACYSTEINE6 [Momordica charantia] (e value: 2.1e-193; % identity: 99.71)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKI
        SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNET YNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKI
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKI

Query:  MLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRK
        MLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRK
Subjt:  MLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRK

Query:  MSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        MSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  MSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_022951895.1 protein BASIC PENTACYSTEINE6-like [Cucurbita moschata] (e value: 1.3e-174; % identity: 92.8)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAE QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK A+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHP--QQQQQQHT-HHVPHMNETAYNPREMHPNDACPV-SPVASESTKARRNKRTKE-AKAVTTPNKKVSKAPRKVKREAE
        +NNLSCPPGCQIARGVKHIHHP  QQQQQQHT HHVPHMNE+ YN RE+H NDACPV S VASESTKARRNKRTKE  K VT PNKKVSKAPRKVKREAE
Subjt:  SNNLSCPPGCQIARGVKHIHHP--QQQQQQHT-HHVPHMNETAYNPREMHPNDACPV-SPVASESTKARRNKRTKE-AKAVTTPNKKVSKAPRKVKREAE

Query:  DLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHAR
        DLNKI LGKSQEWKD IGI+SGGDDLNK LVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR
Subjt:  DLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHAR

Query:  VGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        VGGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  VGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_023002409.1 protein BASIC PENTACYSTEINE6-like [Cucurbita maxima] (e value: 3.4e-175; % identity: 92.77)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAE QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK A+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHP-QQQQQQHTH-HVPHMNETAYNPREMHPNDACPVSP-VASESTKARRNKRTKE-AKAVTTPNKKVSKAPRKVKREAED
        +NNLSCPPGCQIARGVKHIHHP QQQQQQHTH HVPHMNE+ YN RE+H NDACP+ P VASESTK RRNKRTKE  K VT PNKKVSKAPRKVKREAED
Subjt:  SNNLSCPPGCQIARGVKHIHHP-QQQQQQHTH-HVPHMNETAYNPREMHPNDACPVSP-VASESTKARRNKRTKE-AKAVTTPNKKVSKAPRKVKREAED

Query:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV
        LNKIMLGKSQEWKD IGI+SGGDDLNK LVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHARV
Subjt:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV

Query:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

XP_023537342.1 protein BASIC PENTACYSTEINE6-like [Cucurbita pepo subsp. pepo] (e value: 4.4e-175; % identity: 92.77)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAE QYKSAQGQWMMQHQPSMKQIMAIMAERDAA+QERNLALSEKK A+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHP-QQQQQQHT-HHVPHMNETAYNPREMHPNDACPV-SPVASESTKARRNKRTKE-AKAVTTPNKKVSKAPRKVKREAED
        +NNLSCPPGCQIARGVKHIHHP QQQQQQHT HHVPHMNE+ YN RE+H NDACPV S VASESTK RRNKRTKE  K VT PNKKVSKAPRKVKREAED
Subjt:  SNNLSCPPGCQIARGVKHIHHP-QQQQQQHT-HHVPHMNETAYNPREMHPNDACPV-SPVASESTKARRNKRTKE-AKAVTTPNKKVSKAPRKVKREAED

Query:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV
        LNKIMLGKSQEWKD IGI+SGGDDLNK LVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHARV
Subjt:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV

Query:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
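
In BLAST-style pairwise output, the unlabeled middle row of each alignment block marks identical positions with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. The sketch below counts those classes for one block. The function name `midline_counts` is hypothetical, and how the page derives its per-hit % identity is not stated here, so this only illustrates reading a single block, not the database's own calculation.

```python
def midline_counts(midline: str) -> tuple[int, int, int]:
    """Return (identical, positive, columns) for one alignment midline.

    Assumes the BLAST convention: letter = identical residue,
    '+' = conservative substitution, space = mismatch or gap.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = identical + midline.count("+")
    return identical, positive, len(midline)


# Midline of the final block of the KAG7020463.1 alignment above.
midline = "GGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK"
identical, positive, columns = midline_counts(midline)
print(f"{identical} of {columns} columns identical")
```

Summing these counts over every block of a hit gives the identical-column total for the whole alignment; trailing spaces on a midline need to be preserved for the column count to be right.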

TrEMBL top hits (e value, % identity, alignment)
A0A1S3BBW9 GAGA-binding transcriptional activator (e value: 9.1e-174; % identity: 92.13)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHK + QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK ALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ 
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREM-HPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNK
        +NNLSCPPGCQIARGVKHIHHP   QQQHTHHVPHMNE  YN R+M   ND CP SPVASESTKARRNKRTKE K VTTPNKKVSK PRKVKREAEDLNK
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREM-HPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNK

Query:  IMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGR
        IMLGKSQEWKD IGIMSGGDDLNKQLVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+GGR
Subjt:  IMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGR

Query:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        KMSGSAFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A5A7VDN8 GAGA-binding transcriptional activator (e value: 9.1e-174; % identity: 92.13)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHK + QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK ALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENS+ 
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREM-HPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNK
        +NNLSCPPGCQIARGVKHIHHP   QQQHTHHVPHMNE  YN R+M   ND CP SPVASESTKARRNKRTKE K VTTPNKKVSK PRKVKREAEDLNK
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREM-HPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNK

Query:  IMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGR
        IMLGKSQEWKD IGIMSGGDDLNKQLVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR+GGR
Subjt:  IMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGR

Query:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        KMSGSAFNKLLSRLAAEGHDLS+PVDLKNHWAKHGTNRYITIK
Subjt:  KMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A6J1CR14 GAGA-binding transcriptional activator (e value: 1.0e-193; % identity: 99.71)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKI
        SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNET YNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKI
Subjt:  SNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKI

Query:  MLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRK
        MLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRK
Subjt:  MLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRK

Query:  MSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        MSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  MSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A6J1GIX2 GAGA-binding transcriptional activator (e value: 6.3e-175; % identity: 92.8)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAE QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK A+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHP--QQQQQQHT-HHVPHMNETAYNPREMHPNDACPV-SPVASESTKARRNKRTKE-AKAVTTPNKKVSKAPRKVKREAE
        +NNLSCPPGCQIARGVKHIHHP  QQQQQQHT HHVPHMNE+ YN RE+H NDACPV S VASESTKARRNKRTKE  K VT PNKKVSKAPRKVKREAE
Subjt:  SNNLSCPPGCQIARGVKHIHHP--QQQQQQHT-HHVPHMNETAYNPREMHPNDACPV-SPVASESTKARRNKRTKE-AKAVTTPNKKVSKAPRKVKREAE

Query:  DLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHAR
        DLNKI LGKSQEWKD IGI+SGGDDLNK LVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHAR
Subjt:  DLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHAR

Query:  VGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        VGGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  VGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

A0A6J1KJF5 GAGA-binding transcriptional activator (e value: 1.6e-175; % identity: 92.77)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
        MDDSGHRENGRHKAE QYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKK A+AERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLT

Query:  SNNLSCPPGCQIARGVKHIHHP-QQQQQQHTH-HVPHMNETAYNPREMHPNDACPVSP-VASESTKARRNKRTKE-AKAVTTPNKKVSKAPRKVKREAED
        +NNLSCPPGCQIARGVKHIHHP QQQQQQHTH HVPHMNE+ YN RE+H NDACP+ P VASESTK RRNKRTKE  K VT PNKKVSKAPRKVKREAED
Subjt:  SNNLSCPPGCQIARGVKHIHHP-QQQQQQHTH-HVPHMNETAYNPREMHPNDACPVSP-VASESTKARRNKRTKE-AKAVTTPNKKVSKAPRKVKREAED

Query:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV
        LNKIMLGKSQEWKD IGI+SGGDDLNK LVVSK DWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTT+SMYPLPAVPNKRHARV
Subjt:  LNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARV

Query:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        GGRKMSGSAFNKLLSRL AEGHDLSAPVDLKNHWAKHGTNRYITIK
Subjt:  GGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

SwissProt top hits (e value, % identity, alignment)
F4JUI3 Protein BASIC PENTACYSTEINE5 (e value: 2.1e-66; % identity: 44.25)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMM----QHQPSM--KQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQY
        M+  G  ENGR+K ++ YK  Q   +M    QH   +  K+I++I+AERDAA++ERN A++  K ALA RD A  QRD A++ER+NA++E ++A+  L+Y
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMM----QHQPSM--KQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQY

Query:  RENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREA
        REN+L +  LSC       RG                         +   E H  +  P+S +  E+   R  KR KE+K              + K+  
Subjt:  RENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREA

Query:  EDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHA
        EDLN+ +    ++                    S+ DW   D+    V FDE TMP P+C+CTG  RQCYKWGNGGWQS+CCTTT+S YPLP +PNKRH+
Subjt:  EDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHA

Query:  RVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        RVGGRKMSGS F++LLSRLA EGH+LS+PVDLKN+WA+HGTNRYITIK
Subjt:  RVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

Q5VSA8 Barley B recombinant-like protein D (e value: 1.6e-87; % identity: 53.87)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMM-QHQPSMK-----QIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQY
        MD+ GHRENGR + + QYK    QWMM Q Q  +K      ++A+M +RD AI+ER+ AL+EKK A+AERDMA+ QRDAA+AERN A++ERDNA+A L+ 
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMM-QHQPSMK-----QIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQY

Query:  -RENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKRE
         R N L  NN +  P   ++ G K+IHH  Q     +  +   +    + REMH ++A P+S     + KA   KR K+  +  +P K+ S   RK K+ 
Subjt:  -RENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKRE

Query:  AEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRH
        + D           WK+ +G+   GDD +    V K +WK Q+LGLNQVAFD+STMPAP CSCTG +RQCYKWGNGGWQS+CCT  +SMYPLP +PNKRH
Subjt:  AEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRH

Query:  ARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        AR+GGRKMSG AF KLLSRLAAEGHDLS PVDLK+HWAKHGTNRYITI+
Subjt:  ARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

Q8L999 Protein BASIC PENTACYSTEINE6 (e value: 3.7e-124; % identity: 66.38)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSL-
        MDD GHRENGRHKA     + QGQW+MQHQPSMKQ+M+I+AERDAAIQERNLA+SEKK A+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+ 
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSL-

Query:  ---TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVAS---ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKR
            +N  +CPPGCQI+RGVKH+HHP        HH+P + E AY  REM PND  P SP A    ES K +R KR   +A   T  NK+  K  RKVK+
Subjt:  ---TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVAS---ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP
        E+E DLNKIM  K + ++ DE       D     L+ SK DWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTT+SMYPLPA+P
Subjt:  EAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP

Query:  NKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        NKRHARVGGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  NKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK

Q8S8C6 Protein BASIC PENTACYSTEINE4 (e value: 2.8e-71; % identity: 45.89)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIA
        M++ G  +N R K ++ +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIA

Query:  TLQYRENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKAR-RNKRTKEAKAVTTPNKKVSKAPRK
         LQ+ ENSL   N +   G  +                         +  +   E H  +  P+S +  E T  +  NKR KE K          +   K
Subjt:  TLQYRENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKAR-RNKRTKEAKAVTTPNKKVSKAPRK

Query:  VKREAEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP
        VK+  EDLN+ +    ++                    S+ DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTT+S YPLP +P
Subjt:  VKREAEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP

Query:  NKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        NKRH+R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  NKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

Q9C9X6 Protein BASIC PENTACYSTEINE3 (e value: 1.1e-35; % identity: 57.01)
Query:  DLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGT
        ++ +N V+ D   +P P+CSCTG+ +QCY+WG GGWQSACCTT +SMYPLP    +R AR+ GRKMS  AF K+L +L+++G D S P+DLK+HWAKHGT
Subjt:  DLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGT

Query:  NRYITIK
        N+++TI+
Subjt:  NRYITIK

Arabidopsis top hits (e value, % identity, alignment)
AT2G21240.1 basic pentacysteine 4 (e value: 2.0e-72; % identity: 45.89)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIA
        M++ G  +N R K ++ +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIA

Query:  TLQYRENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKAR-RNKRTKEAKAVTTPNKKVSKAPRK
         LQ+ ENSL   N +   G  +                         +  +   E H  +  P+S +  E T  +  NKR KE K          +   K
Subjt:  TLQYRENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKAR-RNKRTKEAKAVTTPNKKVSKAPRK

Query:  VKREAEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP
        VK+  EDLN+ +    ++                    S+ DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTT+S YPLP +P
Subjt:  VKREAEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP

Query:  NKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        NKRH+R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  NKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

AT2G21240.2 basic pentacysteine 4 (e value: 2.0e-72; % identity: 45.89)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIA
        M++ G  +N R K ++ +K AQ  W M  QHQ           K+IM+I+AERDAA+ ERN A+S KK A+A RD A  QRD A++ER+ AL+ERDNA A
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMM--QHQ--------PSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIA

Query:  TLQYRENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKAR-RNKRTKEAKAVTTPNKKVSKAPRK
         LQ+ ENSL   N +   G  +                         +  +   E H  +  P+S +  E T  +  NKR KE K          +   K
Subjt:  TLQYRENSLTSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKAR-RNKRTKEAKAVTTPNKKVSKAPRK

Query:  VKREAEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP
        VK+  EDLN+ +    ++                    S+ DW  QD+GLN V FDE+TMP P+CSCTG  RQCYKWGNGGWQS+CCTTT+S YPLP +P
Subjt:  VKREAEDLNKIMLGKSQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP

Query:  NKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK
        NKRH+R+GGRKMSG+ F++LLSRL+AEG+DLS PVDLK++WA+HGTNRYITIK
Subjt:  NKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHWAKHGTNRYITIK

AT5G42520.1 basic pentacysteine 6 (e value: 2.6e-125; % identity: 66.38)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSL-
        MDD GHRENGRHKA     + QGQW+MQHQPSMKQ+M+I+AERDAAIQERNLA+SEKK A+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+ 
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSL-

Query:  ---TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVAS---ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKR
            +N  +CPPGCQI+RGVKH+HHP        HH+P + E AY  REM PND  P SP A    ES K +R KR   +A   T  NK+  K  RKVK+
Subjt:  ---TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVAS---ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP
        E+E DLNKIM  K + ++ DE       D     L+ SK DWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTT+SMYPLPA+P
Subjt:  EAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP

Query:  NKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        NKRHARVGGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  NKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK

AT5G42520.2 basic pentacysteine 6 (e value: 5.1e-121; % identity: 65.54)
Query:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSL-
        MDD GHRENGRHKA     + QG    QHQPSMKQ+M+I+AERDAAIQERNLA+SEKK A+AERDMA+LQRD AIAERNNA++ERD+A+  LQYRENS+ 
Subjt:  MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSL-

Query:  ---TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVAS---ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKR
            +N  +CPPGCQI+RGVKH+HHP        HH+P + E AY  REM PND  P SP A    ES K +R KR   +A   T  NK+  K  RKVK+
Subjt:  ---TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVAS---ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKR

Query:  EAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP
        E+E DLNKIM  K + ++ DE       D     L+ SK DWK Q++ GLNQV +DE+TMP P+CSCTGV+RQCYKWGNGGWQS+CCTTT+SMYPLPA+P
Subjt:  EAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVP

Query:  NKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        NKRHARVGGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  NKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK

AT5G42520.3 basic pentacysteine 6 (e value: 3.0e-97; % identity: 64.01)
Query:  MAYLQRDAAIAERNNALLERDNAIATLQYRENSL----TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVAS--
        MA+LQRD AIAERNNA++ERD+A+  LQYRENS+     +N  +CPPGCQI+RGVKH+HHP        HH+P + E AY  REM PND  P SP A   
Subjt:  MAYLQRDAAIAERNNALLERDNAIATLQYRENSL----TSNNLSCPPGCQIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVAS--

Query:  -ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKREAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPIC
         ES K +R KR   +A   T  NK+  K  RKVK+E+E DLNKIM  K + ++ DE       D     L+ SK DWK Q++ GLNQV +DE+TMP P+C
Subjt:  -ESTKARRNKRTK-EAKAVTTPNKKVSKAPRKVKREAE-DLNKIMLGK-SQEWKDEIGIMSGGDDLNKQLVVSKLDWKGQDL-GLNQVAFDESTMPAPIC

Query:  SCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK
        SCTGV+RQCYKWGNGGWQS+CCTTT+SMYPLPA+PNKRHARVGGRKMSGSAFNKLLSRLAAEG HDLS PVDLK+HWAKHGTNRYITIK
Subjt:  SCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRKMSGSAFNKLLSRLAAEG-HDLSAPVDLKNHWAKHGTNRYITIK


Sequences
CDS sequence
ATGGATGATAGTGGGCATCGTGAGAATGGAAGGCACAAAGCAGAGCATCAGTACAAGTCGGCTCAGGGTCAGTGGATGATGCAGCATCAGCCATCAATGAAGCAGATAAT
GGCAATTATGGCTGAAAGAGATGCAGCCATTCAAGAGAGAAATTTGGCCCTTTCCGAGAAGAAGACAGCGTTGGCGGAGCGAGACATGGCTTATCTACAACGAGATGCTG
CAATTGCAGAGAGGAACAATGCCCTTTTGGAAAGAGACAATGCCATTGCAACTCTTCAGTATCGTGAAAACTCCCTAACGAGTAACAATTTATCATGTCCACCAGGGTGC
CAAATTGCTAGGGGAGTGAAGCATATACACCACCCACAGCAGCAGCAGCAGCAACACACGCATCATGTGCCTCACATGAACGAAACTGCTTACAATCCAAGAGAAATGCA
TCCCAACGATGCCTGCCCAGTATCCCCAGTTGCTTCTGAATCAACAAAGGCAAGGAGAAACAAGCGAACAAAGGAGGCAAAAGCAGTCACAACACCTAACAAGAAGGTTT
CAAAAGCTCCAAGGAAGGTCAAAAGGGAGGCTGAAGACTTGAATAAGATAATGCTGGGCAAATCACAAGAATGGAAGGATGAAATTGGCATCATGAGTGGAGGTGATGAT
CTTAACAAACAGTTGGTAGTGTCAAAGTTGGATTGGAAGGGTCAGGATCTGGGGCTAAACCAGGTTGCCTTTGATGAATCTACAATGCCAGCCCCAATATGCTCCTGCAC
AGGAGTTATAAGACAATGCTACAAATGGGGAAATGGTGGATGGCAATCCGCATGTTGTACCACCACCATGTCAATGTATCCATTGCCTGCAGTTCCCAACAAACGACACG
CCCGAGTTGGGGGTCGAAAGATGAGCGGAAGCGCCTTTAACAAGTTGCTCAGCCGGCTTGCAGCTGAAGGTCATGATTTGTCTGCTCCAGTTGATCTCAAAAATCACTGG
GCAAAGCATGGAACAAATCGTTACATCACAATTAAG
mRNA sequence
ATGGATGATAGTGGGCATCGTGAGAATGGAAGGCACAAAGCAGAGCATCAGTACAAGTCGGCTCAGGGTCAGTGGATGATGCAGCATCAGCCATCAATGAAGCAGATAAT
GGCAATTATGGCTGAAAGAGATGCAGCCATTCAAGAGAGAAATTTGGCCCTTTCCGAGAAGAAGACAGCGTTGGCGGAGCGAGACATGGCTTATCTACAACGAGATGCTG
CAATTGCAGAGAGGAACAATGCCCTTTTGGAAAGAGACAATGCCATTGCAACTCTTCAGTATCGTGAAAACTCCCTAACGAGTAACAATTTATCATGTCCACCAGGGTGC
CAAATTGCTAGGGGAGTGAAGCATATACACCACCCACAGCAGCAGCAGCAGCAACACACGCATCATGTGCCTCACATGAACGAAACTGCTTACAATCCAAGAGAAATGCA
TCCCAACGATGCCTGCCCAGTATCCCCAGTTGCTTCTGAATCAACAAAGGCAAGGAGAAACAAGCGAACAAAGGAGGCAAAAGCAGTCACAACACCTAACAAGAAGGTTT
CAAAAGCTCCAAGGAAGGTCAAAAGGGAGGCTGAAGACTTGAATAAGATAATGCTGGGCAAATCACAAGAATGGAAGGATGAAATTGGCATCATGAGTGGAGGTGATGAT
CTTAACAAACAGTTGGTAGTGTCAAAGTTGGATTGGAAGGGTCAGGATCTGGGGCTAAACCAGGTTGCCTTTGATGAATCTACAATGCCAGCCCCAATATGCTCCTGCAC
AGGAGTTATAAGACAATGCTACAAATGGGGAAATGGTGGATGGCAATCCGCATGTTGTACCACCACCATGTCAATGTATCCATTGCCTGCAGTTCCCAACAAACGACACG
CCCGAGTTGGGGGTCGAAAGATGAGCGGAAGCGCCTTTAACAAGTTGCTCAGCCGGCTTGCAGCTGAAGGTCATGATTTGTCTGCTCCAGTTGATCTCAAAAATCACTGG
GCAAAGCATGGAACAAATCGTTACATCACAATTAAG
Protein sequence
MDDSGHRENGRHKAEHQYKSAQGQWMMQHQPSMKQIMAIMAERDAAIQERNLALSEKKTALAERDMAYLQRDAAIAERNNALLERDNAIATLQYRENSLTSNNLSCPPGC
QIARGVKHIHHPQQQQQQHTHHVPHMNETAYNPREMHPNDACPVSPVASESTKARRNKRTKEAKAVTTPNKKVSKAPRKVKREAEDLNKIMLGKSQEWKDEIGIMSGGDD
LNKQLVVSKLDWKGQDLGLNQVAFDESTMPAPICSCTGVIRQCYKWGNGGWQSACCTTTMSMYPLPAVPNKRHARVGGRKMSGSAFNKLLSRLAAEGHDLSAPVDLKNHW
AKHGTNRYITIK
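
The CDS and protein sequences above can be cross-checked by translating the coding sequence codon by codon. The sketch below does this for the first 30 bases of the CDS only, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used, and `translate` is a hypothetical helper rather than a CuGenDB function.

```python
from itertools import product

# Standard genetic code (assumed), with codons ordered over the bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; 'X' marks unknown codons."""
    cds = cds.upper().replace("U", "T")
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First 30 bases of the CDS sequence listed above.
cds_prefix = "ATGGATGATAGTGGGCATCGTGAGAATGGA"
protein_prefix = translate(cds_prefix)
print(protein_prefix)  # MDDSGHRENG
```

The result matches the first ten residues of the protein sequence listed above; passing the full CDS (with line breaks removed) to the same function would extend the check to the whole record.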