CuGenDBv2

Lag0001962 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0001962
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: MADS-box protein 04g005320-like
Genome location: chr4:37573476..37620218
RNA-Seq Expression: Lag0001962
Synteny: Lag0001962
Gene Ontology terms:
GO:0045944 - positive regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
IPR002100 - Transcription factor, MADS-box
IPR033896 - MADS MEF2-like
IPR036879 - Transcription factor, MADS-box superfamily


Homology
GenBank top hits | e value | % identity
XP_022155417.1 MADS-box transcription factor 8-like isoform X3 [Momordica charantia] | 8.8e-55 | 76.43
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDT----KDQESDET
        MGRKKIEVKRIED CNRHVTFCKRRSGLIKKARELSVLCDVE+GL++FTNRGRLYEFC GNSL +II RYQSH++G+SQS +  DIDT    +D +S++T
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDT----KDQESDET

Query:  LLASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETD
        +L SLGKLLQTIQSQVEEP+FKKLNVT+M+QLENQLEATLDKIK QR    A +E D
Subjt:  LLASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETD

XP_022959860.1 truncated transcription factor CAULIFLOWER D-like [Cucurbita moschata] | 5.2e-55 | 75.93
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDTKD-QESDETLLA
        MGRKKIE+KRIED CNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG+SL +II RYQSH EGRSQS +  ++DTK+ +ES+ETL+ 
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDTKD-QESDETLLA

Query:  SLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETDGF-DAMDV
        S GKLLQTIQS VEEP+FKKLNV DMV LENQLEA+LDKIK+QR    A +E   + D MD+
Subjt:  SLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETDGF-DAMDV

XP_023004227.1 MADS-box protein 04g005320-like [Cucurbita maxima] | 1.6e-56 | 76.25
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKD-QESDETLLASL
        MGRKKIE+KRIED CNRHVTFCKRRSGLIKKARELSVLCDVEVGLV+FTNRGRLYEFCSG+SL +II RYQSH EGRS+S  ++DTK+ +ESDETL+ S 
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKD-QESDETLLASL

Query:  GKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETDGF-DAMDV
        GKLLQTIQS VEEP+FKKLNV DMV LENQLEA+LDKIK+QR    A +E   + D MD+
Subjt:  GKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETDGF-DAMDV

XP_023514206.1 MADS-box protein 04g005320-like [Cucurbita pepo subsp. pepo] | 1.2e-54 | 75.31
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDTKD-QESDETLLA
        MGRKKIE+KRIED CNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG+SL +II RYQSH EGRSQS +  ++DTK+ +ES++TL+ 
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDTKD-QESDETLLA

Query:  SLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETDGF-DAMDV
        S GKLLQTIQS VEEP+FKKLNV DMV LENQLEA+LDKIK+QR    A +E   + D MD+
Subjt:  SLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETDGF-DAMDV

XP_023514717.1 MADS-box protein EJ2-like [Cucurbita pepo subsp. pepo] | 7.7e-59 | 80.79
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKDQESDETLLASLG
        MGRKKIEVKRIEDRCNRHVTFCKRR+GLIKKARELSVLCDVEVGLV+FTNRGRLYEFCSGNSLS++I RYQSH+EGR+++L D DTKD ESDET+L SLG
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKDQESDETLLASLG

Query:  KLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETD
        KLLQTIQSQV EPNFKKLN  D++QLENQL+ATL KIK QR    A +E D
Subjt:  KLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETD

TrEMBL top hits | e value | % identity
A0A6J1DME3 MADS-box protein FLOWERING LOCUS C-like isoform X1 | 5.6e-55 | 78.52
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDT----KDQESDET
        MGRKKIEVKRIED CNRHVTFCKRRSGLIKKARELSVLCDVE+GL++FTNRGRLYEFC GNSL +II RYQSH++G+SQS +  DIDT    +D +S++T
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDT----KDQESDET

Query:  LLASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYH
        +L SLGKLLQTIQSQVEEP+FKKLNVT+M+QLENQLEATLDKIK QR +
Subjt:  LLASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYH

A0A6J1DPA0 MADS-box transcription factor 8-like isoform X3 | 4.3e-55 | 76.43
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDT----KDQESDET
        MGRKKIEVKRIED CNRHVTFCKRRSGLIKKARELSVLCDVE+GL++FTNRGRLYEFC GNSL +II RYQSH++G+SQS +  DIDT    +D +S++T
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDT----KDQESDET

Query:  LLASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETD
        +L SLGKLLQTIQSQVEEP+FKKLNVT+M+QLENQLEATLDKIK QR    A +E D
Subjt:  LLASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETD

A0A6J1DRL9 MADS-box protein FLOWERING LOCUS C-like isoform X2 | 5.6e-55 | 78.52
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDT----KDQESDET
        MGRKKIEVKRIED CNRHVTFCKRRSGLIKKARELSVLCDVE+GL++FTNRGRLYEFC GNSL +II RYQSH++G+SQS +  DIDT    +D +S++T
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDT----KDQESDET

Query:  LLASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYH
        +L SLGKLLQTIQSQVEEP+FKKLNVT+M+QLENQLEATLDKIK QR +
Subjt:  LLASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYH

A0A6J1H7I9 truncated transcription factor CAULIFLOWER D-like | 2.5e-55 | 75.93
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDTKD-QESDETLLA
        MGRKKIE+KRIED CNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSG+SL +II RYQSH EGRSQS +  ++DTK+ +ES+ETL+ 
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLT--DIDTKD-QESDETLLA

Query:  SLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETDGF-DAMDV
        S GKLLQTIQS VEEP+FKKLNV DMV LENQLEA+LDKIK+QR    A +E   + D MD+
Subjt:  SLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETDGF-DAMDV

A0A6J1KPU7 MADS-box protein 04g005320-like | 7.8e-57 | 76.25
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKD-QESDETLLASL
        MGRKKIE+KRIED CNRHVTFCKRRSGLIKKARELSVLCDVEVGLV+FTNRGRLYEFCSG+SL +II RYQSH EGRS+S  ++DTK+ +ESDETL+ S 
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKD-QESDETLLASL

Query:  GKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETDGF-DAMDV
        GKLLQTIQS VEEP+FKKLNV DMV LENQLEA+LDKIK+QR    A +E   + D MD+
Subjt:  GKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETDGF-DAMDV

SwissProt top hits | e value | % identity
K4BND8 MADS-box protein 04g005320 | 3.4e-25 | 43.45
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQS-LTDIDTKDQESDETLLASL
        MGR K+E+KRIE++ NR VTF KRR+GL+KKA ELS+LC+ EV L++F+NRG+LYEFCS +S+SD + RY     G  ++  +  D+++   +   L + 
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQS-LTDIDTKDQESDETLLASL

Query:  GKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHN
         ++LQ  Q  +   +  +LN  D+ QLE QL+++L  I+++R  N
Subjt:  GKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHN

P29383 Agamous-like MADS-box protein AGL3 | 1.2e-22 | 42.07
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNS-LSDIINRYQSH---VEGRSQSLTDIDTKDQESDETLL
        MGR K+E+KRIE++ NR VTF KRR+GL+KKA ELSVLCD E+ L++F+NRG+LYEFCS  S ++  +++Y+ H       +QS  D+  +D+  D   L
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNS-LSDIINRYQSH---VEGRSQSLTDIDTKDQESDETLL

Query:  ASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR
         S  ++LQ  Q  +      +++V ++  LE Q++A+L +I++ +
Subjt:  ASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR

Q03489 Agamous-like MADS-box protein AGL9 homolog | 1.2e-22 | 42.47
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKD-----QESDETL
        MGR ++E+KRIE++ NR VTF KRR+GL+KKA ELSVLCD EV L++F+NRG+LYEFCS +S+   + RYQ    G  +  T+I T++      + +   
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKD-----QESDETL

Query:  LASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR
        L +  + LQ  Q  +   +   LN  ++  LE QL+ +L +I++ R
Subjt:  LASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR

Q7Y040 MADS-box protein EJ2 | 2.2e-24 | 46.21
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQ----SHVEGRSQSLTDIDTKDQESDETLL
        MGR ++E+KRIE++ NR VTF KRR+GL+KKA ELSVLCD EV L++F+NRG+LYEFCS +S+   I +YQ    + +E  +QS+T  DT++   +   L
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQ----SHVEGRSQSLTDIDTKDQESDETLL

Query:  ASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR
         +  +LLQ  Q      +   L+  D+ QLENQLE++L +I++++
Subjt:  ASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR

Q9SAR1 MADS-box transcription factor 8 | 1.4e-23 | 42.68
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKDQESDETLLASLG
        MGR ++E+KRIE++ NR VTF KRR+GL+KKA ELSVLCD EV L++F+NRG+LYEFCSG S++  + RYQ    G     T I  K+ E  ++      
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKDQESDETLLASLG

Query:  KL------LQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETD
        KL      LQ  Q  +   +   L + ++ QLE QL+++L  I++ R  +     TD
Subjt:  KL------LQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETD

Arabidopsis top hits | e value | % identity
AT1G77080.4 K-box region and MADS-box transcription factor family protein | 2.5e-23 | 42.55
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKDQESDETLLASLG
        MGR+KIE+KRIE++ +R VTF KRR+GLI KAR+LS+LC+  V +VV +  G+LY+  SG+ +S II+RY+       ++L D++ K Q           
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKDQESDETLLASLG

Query:  KLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR
        +LL+T+QS++EEPN   ++V  ++ LE QLE  L   + ++
Subjt:  KLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR

AT2G03710.1 K-box region and MADS-box transcription factor family protein | 8.6e-24 | 42.07
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNS-LSDIINRYQSH---VEGRSQSLTDIDTKDQESDETLL
        MGR K+E+KRIE++ NR VTF KRR+GL+KKA ELSVLCD E+ L++F+NRG+LYEFCS  S ++  +++Y+ H       +QS  D+  +D+  D   L
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNS-LSDIINRYQSH---VEGRSQSLTDIDTKDQESDETLL

Query:  ASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR
         S  ++LQ  Q  +      +++V ++  LE Q++A+L +I++ +
Subjt:  ASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR

AT2G03710.2 K-box region and MADS-box transcription factor family protein | 8.6e-24 | 42.07
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNS-LSDIINRYQSH---VEGRSQSLTDIDTKDQESDETLL
        MGR K+E+KRIE++ NR VTF KRR+GL+KKA ELSVLCD E+ L++F+NRG+LYEFCS  S ++  +++Y+ H       +QS  D+  +D+  D   L
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNS-LSDIINRYQSH---VEGRSQSLTDIDTKDQESDETLL

Query:  ASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR
         S  ++LQ  Q  +      +++V ++  LE Q++A+L +I++ +
Subjt:  ASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR

AT2G03710.3 K-box region and MADS-box transcription factor family protein | 8.6e-24 | 42.07
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNS-LSDIINRYQSH---VEGRSQSLTDIDTKDQESDETLL
        MGR K+E+KRIE++ NR VTF KRR+GL+KKA ELSVLCD E+ L++F+NRG+LYEFCS  S ++  +++Y+ H       +QS  D+  +D+  D   L
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNS-LSDIINRYQSH---VEGRSQSLTDIDTKDQESDETLL

Query:  ASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR
         S  ++LQ  Q  +      +++V ++  LE Q++A+L +I++ +
Subjt:  ASLGKLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR

AT5G65070.1 K-box region and MADS-box transcription factor family protein | 1.5e-23 | 39.72
Query:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKDQESDETLLASLG
        MGR+K+E+KRIE++ +R VTFCKRR+GL++KAR+LS+LC+  V L++ +  GRLY F SG+S++ I++RY+       +   D+ T D E       S  
Subjt:  MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKDQESDETLLASLG

Query:  KLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR
        +LL+TIQ ++EE     +++  +  LE QL+  L   + ++
Subjt:  KLLQTIQSQVEEPNFKKLNVTDMVQLENQLEATLDKIKTQR


Sequences
CDS sequence
ATGGGACGGAAGAAGATCGAGGTGAAGCGAATCGAAGATAGGTGTAATCGCCATGTGACTTTCTGCAAGAGAAGATCTGGATTGATCAAGAAGGCTCGAGAGCTCTCTGT
TCTGTGCGATGTTGAGGTCGGACTCGTCGTCTTCACCAACCGCGGCCGCCTCTATGAGTTTTGCAGCGGAAACAGCTTATCGGATATTATTAATCGCTACCAAAGCCATG
TTGAAGGGAGAAGCCAAAGTCTAACTGACATCGATACGAAGGACCAAGAATCTGATGAGACACTCTTGGCGTCACTTGGAAAGCTCCTACAAACTATTCAAAGTCAGGTT
GAAGAACCAAATTTCAAGAAGCTCAATGTAACCGATATGGTGCAATTGGAGAATCAACTTGAAGCCACACTTGACAAGATCAAAACTCAAAGATATCATAACAATGCCAC
TTTAGAAACAGATGGCTTTGATGCCATGGACGTGTACAGTTTCTTTCTGAATTATGATGTCCCTCTTGATCTAGGTCCATTTCTGCTAAAGAGAGCCAAGGAAGAAGGGG
GACCATTAACCCTTATTCGTTGTGAACTCTATCAAATTTTGGAGAAAATTGGCAGCGTGGCATACAAGCTGGACCTTCCCCCAGCTGCCTCTATCCCCAATGTCTTTTAT
TTACAAATGGGTTTGTTGAACCGTTGGGTGTGTTGGGAATACGTTGAAATTTGGAGTCATATCAAGAAGAATGGATTGTGCATTGGAAAGAGTCCACTGACCTTGTCCAG
CGCCAATTTCCGGACTTCTACTCGAGGACAAGTTCTTGGGATATTGCTCAATAGAACAGTGTTCTTTCCCTTGGTAGCTGCTGTGTCGTGTTTTACTGAGTTTGACAGAC
TAGTAGCTGCTCCTTGGGCTGAAGTTTTGGAAATGCCAAATCACTCCTTTGAAATTGATATCAAGCCCCTGGTCATCTACCTGGACCTTCGTATTTTCAGAAACACCAAA
TCACCTATCAGAAAATGCTCGTATATGGTTCCTTCTTGCGCTGACAAAGACCTCATAATTAACACAAGCCAGAATTCTTCCCAAGTTCCGAGAAGTTCTCTTCTCTACCC
ATTACCATCATGCTCATGCTTTCCTTACAATCCAGCAAAGTACTTTGGCCTAGAATAG
mRNA sequence
ATGGGACGGAAGAAGATCGAGGTGAAGCGAATCGAAGATAGGTGTAATCGCCATGTGACTTTCTGCAAGAGAAGATCTGGATTGATCAAGAAGGCTCGAGAGCTCTCTGT
TCTGTGCGATGTTGAGGTCGGACTCGTCGTCTTCACCAACCGCGGCCGCCTCTATGAGTTTTGCAGCGGAAACAGCTTATCGGATATTATTAATCGCTACCAAAGCCATG
TTGAAGGGAGAAGCCAAAGTCTAACTGACATCGATACGAAGGACCAAGAATCTGATGAGACACTCTTGGCGTCACTTGGAAAGCTCCTACAAACTATTCAAAGTCAGGTT
GAAGAACCAAATTTCAAGAAGCTCAATGTAACCGATATGGTGCAATTGGAGAATCAACTTGAAGCCACACTTGACAAGATCAAAACTCAAAGATATCATAACAATGCCAC
TTTAGAAACAGATGGCTTTGATGCCATGGACGTGTACAGTTTCTTTCTGAATTATGATGTCCCTCTTGATCTAGGTCCATTTCTGCTAAAGAGAGCCAAGGAAGAAGGGG
GACCATTAACCCTTATTCGTTGTGAACTCTATCAAATTTTGGAGAAAATTGGCAGCGTGGCATACAAGCTGGACCTTCCCCCAGCTGCCTCTATCCCCAATGTCTTTTAT
TTACAAATGGGTTTGTTGAACCGTTGGGTGTGTTGGGAATACGTTGAAATTTGGAGTCATATCAAGAAGAATGGATTGTGCATTGGAAAGAGTCCACTGACCTTGTCCAG
CGCCAATTTCCGGACTTCTACTCGAGGACAAGTTCTTGGGATATTGCTCAATAGAACAGTGTTCTTTCCCTTGGTAGCTGCTGTGTCGTGTTTTACTGAGTTTGACAGAC
TAGTAGCTGCTCCTTGGGCTGAAGTTTTGGAAATGCCAAATCACTCCTTTGAAATTGATATCAAGCCCCTGGTCATCTACCTGGACCTTCGTATTTTCAGAAACACCAAA
TCACCTATCAGAAAATGCTCGTATATGGTTCCTTCTTGCGCTGACAAAGACCTCATAATTAACACAAGCCAGAATTCTTCCCAAGTTCCGAGAAGTTCTCTTCTCTACCC
ATTACCATCATGCTCATGCTTTCCTTACAATCCAGCAAAGTACTTTGGCCTAGAATAG
Protein sequence
MGRKKIEVKRIEDRCNRHVTFCKRRSGLIKKARELSVLCDVEVGLVVFTNRGRLYEFCSGNSLSDIINRYQSHVEGRSQSLTDIDTKDQESDETLLASLGKLLQTIQSQV
EEPNFKKLNVTDMVQLENQLEATLDKIKTQRYHNNATLETDGFDAMDVYSFFLNYDVPLDLGPFLLKRAKEEGGPLTLIRCELYQILEKIGSVAYKLDLPPAASIPNVFY
LQMGLLNRWVCWEYVEIWSHIKKNGLCIGKSPLTLSSANFRTSTRGQVLGILLNRTVFFPLVAAVSCFTEFDRLVAAPWAEVLEMPNHSFEIDIKPLVIYLDLRIFRNTK
SPIRKCSYMVPSCADKDLIINTSQNSSQVPRSSLLYPLPSCSCFPYNPAKYFGLE
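
The protein sequence above should correspond to a translation of the CDS sequence under the standard genetic code. The following is a minimal Python sketch of that check, not part of the database record: the `translate` function and the `CODON_TABLE` name are illustrative, and the example input is only the first 39 bases of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (third base varies fastest); '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the sequence can be
    pasted as shown in the record. Unknown codons are rendered as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 bases of the CDS above
print(translate("ATGGGACGGAAGAAGATCGAGGTGAAGCGAATCGAAGAT"))  # MGRKKIEVKRIED
```

Passing the full CDS (all lines joined) to `translate` and comparing the result with the protein sequence is one way to confirm that the two listings are consistent.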