; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g04370 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g04370
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionhomeobox-leucine zipper protein HAT22-like
Genome locationchr6:3124653..3131278
RNA-Seq ExpressionMoc06g04370
SyntenyMoc06g04370
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR001356 - Homeobox domain
IPR003106 - Leucine zipper, homeobox-associated
IPR009057 - Homeobox-like domain superfamily
IPR017970 - Homeobox, conserved site


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587716.1 Homeobox-leucine zipper protein HAT4, partial [Cucurbita argyrosperma subsp. sororia]1.1e-4657.08Show/hide
Query:  MGGDDEICNIRLGLGLGFG-EEYVPKK--KIN-NH-NPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRL
        MG  D++CNIRLGL LG G  EYVPKK  KIN NH NPK F+DLSFTL+PK+E   + I +S++                   ++S+      ERKKLRL
Subjt:  MGGDDEICNIRLGLGLGFG-EEYVPKK--KIN-NH-NPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRL

Query:  SKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSL--ELWDMKGAMRVAQSTILVLWIIYA
        SKEQ+ LLEESFKLHTTLNPAQKQALA QLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENR L  EL +++     A    + L     
Subjt:  SKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSL--ELWDMKGAMRVAQSTILVLWIIYA

Query:  ILICIGSVVGARKMLFPAHNGSEDDN
        + IC       R    PA N + D N
Subjt:  ILICIGSVVGARKMLFPAHNGSEDDN

XP_004150745.1 homeobox-leucine zipper protein HOX18 [Cucumis sativus]1.4e-4661.97Show/hide
Query:  DDEICNIR-LGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEA-----INVEI---EASSS----DHDHLKRIRSNNN--------QDQI-----R
        +DEICNI  L LGLGFG++YVPKK   N   +    LSFTLIPKEE       N+EI   EA+SS    DH  +KRIRS+NN        QD       R
Subjt:  DDEICNIR-LGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEA-----INVEI---EASSS----DHDHLKRIRSNNN--------QDQI-----R

Query:  DSSSIVINGSS--------------------ERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKK
         SS   IN S                     ERKKLRLSKEQS LLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKK
Subjt:  DSSSIVINGSS--------------------ERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKK

Query:  CCERLNEENRSLE
        CCERLNEENR L+
Subjt:  CCERLNEENRSLE

XP_008453538.1 PREDICTED: homeobox-leucine zipper protein HOX18-like [Cucumis melo]1.9e-4659.81Show/hide
Query:  DDEICNIR-LGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEA-----INVEIE-----ASSSDHDH--LKRIRSNNN------------------
        +DEICNI  L LGLGFG++YVPKK   N + +    +SFTLIPKEE       N+EI+     +S  D DH  +KRIRSNNN                  
Subjt:  DDEICNIR-LGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEA-----INVEIE-----ASSSDHDH--LKRIRSNNN------------------

Query:  ---QDQIRDSSSIV-----------INGSS--ERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLK
            DQ  +++ IV            +GS   ERKKLRLSKEQS LLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLK
Subjt:  ---QDQIRDSSSIV-----------INGSS--ERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLK

Query:  KCCERLNEENRSLE
        KCCERLNEENR L+
Subjt:  KCCERLNEENRSLE

XP_022134791.1 homeobox-leucine zipper protein HAT22-like [Momordica charantia]7.0e-8698.82Show/hide
Query:  MGGDDEICNIRLGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRLSKEQS
        MGGDDEICNIRLGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRLSKEQS
Subjt:  MGGDDEICNIRLGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRLSKEQS

Query:  NLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE
        NLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENR L+
Subjt:  NLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE

XP_023531587.1 homeobox-leucine zipper protein HAT3-like [Cucurbita pepo subsp. pepo]3.2e-4660.29Show/hide
Query:  MGGDDEICNIRLGLGLGFG-EEYVPKK--KIN-NH-NPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRL
        MG  D++CNIRLGL LG G  EYVPKK  KIN NH NPK F+DLSFTL+PK+E   + I +S++                   ++S+      ERKKLRL
Subjt:  MGGDDEICNIRLGLGLGFG-EEYVPKK--KIN-NH-NPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRL

Query:  SKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSL--ELWDMKGAMRVAQSTILVLWIIYA
        SKEQ+ LLEESFKLHTTLNPAQKQALA QLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENR L  EL +++     A    + L     
Subjt:  SKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSL--ELWDMKGAMRVAQSTILVLWIIYA

Query:  ILIC
        + IC
Subjt:  ILIC

TrEMBL top hitse value%identityAlignment
A0A0A0M012 Homeobox domain-containing protein6.9e-4761.97Show/hide
Query:  DDEICNIR-LGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEA-----INVEI---EASSS----DHDHLKRIRSNNN--------QDQI-----R
        +DEICNI  L LGLGFG++YVPKK   N   +    LSFTLIPKEE       N+EI   EA+SS    DH  +KRIRS+NN        QD       R
Subjt:  DDEICNIR-LGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEA-----INVEI---EASSS----DHDHLKRIRSNNN--------QDQI-----R

Query:  DSSSIVINGSS--------------------ERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKK
         SS   IN S                     ERKKLRLSKEQS LLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKK
Subjt:  DSSSIVINGSS--------------------ERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKK

Query:  CCERLNEENRSLE
        CCERLNEENR L+
Subjt:  CCERLNEENRSLE

A0A1S3BWH2 homeobox-leucine zipper protein HOX18-like9.0e-4759.81Show/hide
Query:  DDEICNIR-LGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEA-----INVEIE-----ASSSDHDH--LKRIRSNNN------------------
        +DEICNI  L LGLGFG++YVPKK   N + +    +SFTLIPKEE       N+EI+     +S  D DH  +KRIRSNNN                  
Subjt:  DDEICNIR-LGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEA-----INVEIE-----ASSSDHDH--LKRIRSNNN------------------

Query:  ---QDQIRDSSSIV-----------INGSS--ERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLK
            DQ  +++ IV            +GS   ERKKLRLSKEQS LLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLK
Subjt:  ---QDQIRDSSSIV-----------INGSS--ERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLK

Query:  KCCERLNEENRSLE
        KCCERLNEENR L+
Subjt:  KCCERLNEENRSLE

A0A5A7USQ4 Homeobox-leucine zipper protein HOX18-like9.0e-4759.81Show/hide
Query:  DDEICNIR-LGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEA-----INVEIE-----ASSSDHDH--LKRIRSNNN------------------
        +DEICNI  L LGLGFG++YVPKK   N + +    +SFTLIPKEE       N+EI+     +S  D DH  +KRIRSNNN                  
Subjt:  DDEICNIR-LGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEA-----INVEIE-----ASSSDHDH--LKRIRSNNN------------------

Query:  ---QDQIRDSSSIV-----------INGSS--ERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLK
            DQ  +++ IV            +GS   ERKKLRLSKEQS LLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLK
Subjt:  ---QDQIRDSSSIV-----------INGSS--ERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLK

Query:  KCCERLNEENRSLE
        KCCERLNEENR L+
Subjt:  KCCERLNEENRSLE

A0A6J1BYS3 homeobox-leucine zipper protein HAT22-like3.4e-8698.82Show/hide
Query:  MGGDDEICNIRLGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRLSKEQS
        MGGDDEICNIRLGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRLSKEQS
Subjt:  MGGDDEICNIRLGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRLSKEQS

Query:  NLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE
        NLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENR L+
Subjt:  NLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE

A0A6J1EUD1 homeobox-leucine zipper protein HAT3-like1.3e-4559.8Show/hide
Query:  MGGDDEICNIRLGLGLGFG-EEYVPKK--KIN-NH-NPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRL
        MG  D++CNIRLGL LG G  EYVPKK  KIN NH NPK  +DLSFTL+PK+E   + I +S++                   ++S+      ERKKLRL
Subjt:  MGGDDEICNIRLGLGLGFG-EEYVPKK--KIN-NH-NPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRL

Query:  SKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSL--ELWDMKGAMRVAQSTILVLWIIYA
        SKEQ+ LLEESFKLHTTLNPAQKQALA QLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENR L  EL +++     A    + L     
Subjt:  SKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSL--ELWDMKGAMRVAQSTILVLWIIYA

Query:  ILIC
        + IC
Subjt:  ILIC

SwissProt top hitse value%identityAlignment
A2YW03 Homeobox-leucine zipper protein HOX272.8e-2980.95Show/hide
Query:  GSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSL
        G+S RKKLRLSKEQS  LEESFK H+TLNP QK ALA+QLNL+ RQVEVWFQNRRARTKLKQTEVDCE+LK+CCE L EENR L
Subjt:  GSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSL

A2Z1U1 Homeobox-leucine zipper protein HOX112.1e-2979.07Show/hide
Query:  NGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE
        +G S RKKLRLSKEQS  LEESFK H+TLNP QK ALA+QLNL+ RQVEVWFQNRRARTKLKQTEVDCE+LK+CCE L EENR L+
Subjt:  NGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE

P46604 Homeobox-leucine zipper protein HAT224.6e-3244.04Show/hide
Query:  GDDEICNIRLGLGLGFGE-----EYVPKKKINNHNPKFFS-DLSFTLIPKEEAINVEIEASSSD--------HDHL-----------KRIRSNNNQDQIR
        G D+ CN  L LGLG         +  KK  +  + +F   D S TL    E+  ++  A + D        H  +           + I   + +++  
Subjt:  GDDEICNIRLGLGLGFGE-----EYVPKKKINNHNPKFFS-DLSFTLIPKEEAINVEIEASSSD--------HDHL-----------KRIRSNNNQDQIR

Query:  DSSSIVI-----------NGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEEN
        +++  V+            G S RKKLRL+K+QS LLE++FKLH+TLNP QKQALA+QLNL+ RQVEVWFQNRRARTKLKQTEVDCEFLKKCCE L +EN
Subjt:  DSSSIVI-----------NGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEEN

Query:  RSL--ELWDMKGAMRVAQ
        R L  EL D+K A++++Q
Subjt:  RSL--ELWDMKGAMRVAQ

Q67UE2 Homeobox-leucine zipper protein HOX112.1e-2979.07Show/hide
Query:  NGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE
        +G S RKKLRLSKEQS  LEESFK H+TLNP QK ALA+QLNL+ RQVEVWFQNRRARTKLKQTEVDCE+LK+CCE L EENR L+
Subjt:  NGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE

Q6YPD0 Homeobox-leucine zipper protein HOX272.8e-2980.95Show/hide
Query:  GSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSL
        G+S RKKLRLSKEQS  LEESFK H+TLNP QK ALA+QLNL+ RQVEVWFQNRRARTKLKQTEVDCE+LK+CCE L EENR L
Subjt:  GSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSL

Arabidopsis top hitse value%identityAlignment
AT2G22800.1 Homeobox-leucine zipper protein family2.2e-2962.07Show/hide
Query:  EASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEF
        E S  + +  +R+ S+ ++D+          G S RKKLRL+K+QS LLEESFK H+TLNP QKQ LA+QLNL+ RQVEVWFQNRRARTKLKQTEVDCEF
Subjt:  EASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEF

Query:  LKKCCERLNEENRSLE
        LKKCCE L +EN  L+
Subjt:  LKKCCERLNEENRSLE

AT2G44910.1 homeobox-leucine zipper protein 41.3e-2974.42Show/hide
Query:  NGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE
        NG   RKKLRLSK+Q+ +LEE+FK H+TLNP QK ALA+QLNL+ RQVEVWFQNRRARTKLKQTEVDCE+LK+CC+ L EENR L+
Subjt:  NGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE

AT3G60390.1 homeobox-leucine zipper protein 34.4e-3078.31Show/hide
Query:  SERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE
        S RKKLRLSKEQ+ +LEE+FK H+TLNP QK ALA+QLNL+TRQVEVWFQNRRARTKLKQTEVDCE+LK+CCE L +ENR L+
Subjt:  SERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE

AT4G16780.1 homeobox protein 24.4e-3075.58Show/hide
Query:  NGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE
        +G + RKKLRLSK+QS +LEE+FK H+TLNP QKQALA+QL L+ RQVEVWFQNRRARTKLKQTEVDCEFL++CCE L EENR L+
Subjt:  NGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLE

AT4G37790.1 Homeobox-leucine zipper protein family3.3e-3344.04Show/hide
Query:  GDDEICNIRLGLGLGFGE-----EYVPKKKINNHNPKFFS-DLSFTLIPKEEAINVEIEASSSD--------HDHL-----------KRIRSNNNQDQIR
        G D+ CN  L LGLG         +  KK  +  + +F   D S TL    E+  ++  A + D        H  +           + I   + +++  
Subjt:  GDDEICNIRLGLGLGFGE-----EYVPKKKINNHNPKFFS-DLSFTLIPKEEAINVEIEASSSD--------HDHL-----------KRIRSNNNQDQIR

Query:  DSSSIVI-----------NGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEEN
        +++  V+            G S RKKLRL+K+QS LLE++FKLH+TLNP QKQALA+QLNL+ RQVEVWFQNRRARTKLKQTEVDCEFLKKCCE L +EN
Subjt:  DSSSIVI-----------NGSSERKKLRLSKEQSNLLEESFKLHTTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEEN

Query:  RSL--ELWDMKGAMRVAQ
        R L  EL D+K A++++Q
Subjt:  RSL--ELWDMKGAMRVAQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAGGAGACGATGAAATCTGCAACATCAGGCTCGGCCTAGGATTGGGTTTTGGCGAAGAATATGTTCCAAAGAAGAAGATTAATAATCATAACCCAAAATTCTTCTC
TGACCTCTCTTTCACTCTCATTCCAAAGGAGGAAGCGATTAACGTGGAGATTGAAGCTTCTTCAAGCGATCATGATCATTTGAAGAGAATTAGAAGCAATAATAATCAAG
ATCAAATTAGAGATTCCAGCTCCATTGTAATCAATGGATCATCAGAGAGGAAAAAACTCAGGCTTTCCAAAGAACAGTCAAATTTGCTCGAAGAAAGCTTCAAACTTCAC
ACGACCTTGAATCCGGCTCAGAAGCAGGCACTTGCCCAACAATTAAACCTCAAAACACGACAAGTGGAAGTTTGGTTTCAAAACCGACGAGCAAGGACGAAACTGAAACA
AACGGAAGTAGATTGCGAGTTTCTGAAGAAATGCTGCGAAAGGTTGAACGAAGAGAATCGAAGTTTGGAACTTTGGGATATGAAGGGTGCAATGAGAGTAGCTCAGTCCA
CCATTTTGGTGCTTTGGATAATTTATGCTATTTTAATCTGCATTGGCTCGGTCGTCGGAGCTCGGAAGATGTTGTTCCCGGCTCACAACGGTAGCGAGGACGACAACGAC
AGCTTAACGACTACAATGGAGACGACGACACTCGGTTCTTGCAAGTCAAAGAACCTCCATTGTTGTGAGTGCTCCTTCTCGAGCTCTGAGGAGATGGCCACGATCAATAA
TGGAAGCAGCAGCTCTGAGGAGAAAAGGGTTGTCCCCACTGGCCCTAACCCACTTCACAATAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGAGGAGACGATGAAATCTGCAACATCAGGCTCGGCCTAGGATTGGGTTTTGGCGAAGAATATGTTCCAAAGAAGAAGATTAATAATCATAACCCAAAATTCTTCTC
TGACCTCTCTTTCACTCTCATTCCAAAGGAGGAAGCGATTAACGTGGAGATTGAAGCTTCTTCAAGCGATCATGATCATTTGAAGAGAATTAGAAGCAATAATAATCAAG
ATCAAATTAGAGATTCCAGCTCCATTGTAATCAATGGATCATCAGAGAGGAAAAAACTCAGGCTTTCCAAAGAACAGTCAAATTTGCTCGAAGAAAGCTTCAAACTTCAC
ACGACCTTGAATCCGGCTCAGAAGCAGGCACTTGCCCAACAATTAAACCTCAAAACACGACAAGTGGAAGTTTGGTTTCAAAACCGACGAGCAAGGACGAAACTGAAACA
AACGGAAGTAGATTGCGAGTTTCTGAAGAAATGCTGCGAAAGGTTGAACGAAGAGAATCGAAGTTTGGAACTTTGGGATATGAAGGGTGCAATGAGAGTAGCTCAGTCCA
CCATTTTGGTGCTTTGGATAATTTATGCTATTTTAATCTGCATTGGCTCGGTCGTCGGAGCTCGGAAGATGTTGTTCCCGGCTCACAACGGTAGCGAGGACGACAACGAC
AGCTTAACGACTACAATGGAGACGACGACACTCGGTTCTTGCAAGTCAAAGAACCTCCATTGTTGTGAGTGCTCCTTCTCGAGCTCTGAGGAGATGGCCACGATCAATAA
TGGAAGCAGCAGCTCTGAGGAGAAAAGGGTTGTCCCCACTGGCCCTAACCCACTTCACAATAAGTAG
Protein sequenceShow/hide protein sequence
MGGDDEICNIRLGLGLGFGEEYVPKKKINNHNPKFFSDLSFTLIPKEEAINVEIEASSSDHDHLKRIRSNNNQDQIRDSSSIVINGSSERKKLRLSKEQSNLLEESFKLH
TTLNPAQKQALAQQLNLKTRQVEVWFQNRRARTKLKQTEVDCEFLKKCCERLNEENRSLELWDMKGAMRVAQSTILVLWIIYAILICIGSVVGARKMLFPAHNGSEDDND
SLTTTMETTTLGSCKSKNLHCCECSFSSSEEMATINNGSSSSEEKRVVPTGPNPLHNK