CuGenDBv2

MC06g1733 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC06g1733
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: ethylene-responsive transcription factor-like protein isoform X1
Genome location: MC06:24702362..24706470
RNA-Seq Expression: MC06g1733
Synteny: MC06g1733
Gene Ontology terms:
GO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
IPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily
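
The genome location in this record uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state the convention), the span covered by the gene model could be computed as below. The helper name `parse_location` is hypothetical and not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("MC06:24702362..24706470")
# Assumption: 1-based inclusive coordinates, hence the +1.
span = end - start + 1
print(chrom, span)
```

Under that assumption the gene model spans 4109 bp on MC06, which would include any introns and untranslated regions.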


Homology
GenBank top hits
XP_011651656.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucumis sativus] (e value: 2.39e-120; % identity: 77.87)
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH
        MVSLRRRKLLG  SGK SF+APV KFSENLT E+ +HCT+FV V+PICSD +NKI+ENP AN EPESS  V+VLDTSKE+    N+E IADPPV+ RKRH
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH

Query:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN
         RK FPDE FLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPEEEKQELRK NWD+FLA+TR+ ITNRKQKRLSPES 
Subjt:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN

Query:  KSKLPSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
        KS+L S GN DS+KRH KF + S LED++P ASTS
Subjt:  KSKLPSSGNHDSDKRHGKFSNLSTLEDMKPEASTS

XP_022159538.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Momordica charantia] (e value: 4.11e-166; % identity: 100)
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
        MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR

Query:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL
        FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL
Subjt:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL

Query:  PSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
        PSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
Subjt:  PSSGNHDSDKRHGKFSNLSTLEDMKPEASTS

XP_022159539.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Momordica charantia] (e value: 1.71e-134; % identity: 99.47)
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
        MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR

Query:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQ
        FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRK+
Subjt:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQ

XP_031738473.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Cucumis sativus] (e value: 1.49e-112; % identity: 75.32)
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH
        MVSLRRRKLLG  S        V KFSENLT E+ +HCT+FV V+PICSD +NKI+ENP AN EPESS  V+VLDTSKE+    N+E IADPPV+ RKRH
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH

Query:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN
         RK FPDE FLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPEEEKQELRK NWD+FLA+TR+ ITNRKQKRLSPES 
Subjt:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN

Query:  KSKLPSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
        KS+L S GN DS+KRH KF + S LED++P ASTS
Subjt:  KSKLPSSGNHDSDKRHGKFSNLSTLEDMKPEASTS

XP_038887390.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Benincasa hispida] (e value: 8.35e-124; % identity: 78.15)
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEES-----IADPPVQCRKR
        MVSLRRRKLLG C+GKGSF+APV KFSENLT E+ +HCTNFVSV+PICSD +NKIKENPIAN EPESS  V+VLDTS+E+N+ +      ADPP++ RKR
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEES-----IADPPVQCRKR

Query:  HWRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPES
        H RK FPDE FLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPEEEK+ELRK NWD+FLA+TRH ITNRKQKRLSPES
Subjt:  HWRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPES

Query:  NKSKLPSSGNHDSD--KRHGKFSNLSTLEDMKPEASTS
        NKSKL S GN D D  KRH +F + S LEDM+P ASTS
Subjt:  NKSKLPSSGNHDSD--KRHGKFSNLSTLEDMKPEASTS

TrEMBL top hits
A0A0A0LCE7 AP2/ERF domain-containing protein (e value: 1.16e-120; % identity: 77.87)
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH
        MVSLRRRKLLG  SGK SF+APV KFSENLT E+ +HCT+FV V+PICSD +NKI+ENP AN EPESS  V+VLDTSKE+    N+E IADPPV+ RKRH
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH

Query:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN
         RK FPDE FLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPEEEKQELRK NWD+FLA+TR+ ITNRKQKRLSPES 
Subjt:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN

Query:  KSKLPSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
        KS+L S GN DS+KRH KF + S LED++P ASTS
Subjt:  KSKLPSSGNHDSDKRHGKFSNLSTLEDMKPEASTS

A0A1S4E2T2 ethylene-responsive transcription factor-like protein At4g13040 isoform X6 (e value: 1.77e-110; % identity: 78.6)
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH
        MVSLRRRKLLG  SGK SF+APV KFSENLT E  +HCT+ V V+PICSD++NKI+ENPIAN EPESS  V+VLDTSKE+    N E IADPPV+ RKRH
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH

Query:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN
         RK FPDE FLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGR+PNFELPEEEKQELRK NWD+FLA+TR+ ITNRKQKRL+PES 
Subjt:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN

Query:  KSKLPSSGNHDSDKR
        KS+L S GN DS+KR
Subjt:  KSKLPSSGNHDSDKR

A0A6J1DZ33 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 (e value: 1.99e-166; % identity: 100)
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
        MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR

Query:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL
        FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL
Subjt:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL

Query:  PSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
        PSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
Subjt:  PSSGNHDSDKRHGKFSNLSTLEDMKPEASTS

A0A6J1E2M8 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 (e value: 8.30e-135; % identity: 99.47)
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
        MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR

Query:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQ
        FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRK+
Subjt:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQ

A0A6J1ESB0 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 (e value: 2.76e-111; % identity: 73.45)
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
        MVSLRRRKLLG C+GKGSF APV K SEN T E+  HCTNF+SVHPICS++ N+I+ENP+AN E ESS RV+VLDTSKEK++E  A+PPV+ RKRH RK+
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR

Query:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL
        FP+E FLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGR+PNFELPE EK+ELRK NWD+FLA+TR  I N+KQKR+SPES  SKL
Subjt:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL

Query:  PSSGNHDSDKRHGKFSNLSTLEDMKP
        P  GN D +KR  +F +LS  ED++P
Subjt:  PSSGNHDSDKRHGKFSNLSTLEDMKP

SwissProt top hits
Q56XP9 Ethylene-responsive transcription factor-like protein At4g13040 (e value: 1.5e-36; % identity: 45.13)
Query:  MVSLRRRKLLGFCSGKGSFLAPV-----HKFSENLTTENPIHCTNFVSVHPICSDDINK--IKENPIANTEPESSSRVTVLDTSKEKNEESIAD--PPVQ
        MVSLRRR+LLG C G   ++ P+      +    +   N     N      + +  + K  I+E            R    D S   N +SI+    P +
Subjt:  MVSLRRRKLLGFCSGKGSFLAPV-----HKFSENLTTENPIHCTNFVSVHPICSDDINK--IKENPIANTEPESSSRVTVLDTSKEKNEESIAD--PPVQ

Query:  CRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQK-
         RK+H RKR  + EP LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGR+PNFEL EE  +EL++ +W++FL  TR  ITN+K K 
Subjt:  CRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQK-

Query:  RLSPESNKSKL-----PSSGNHDSDK
        R+  E  K        P     DSDK
Subjt:  RLSPESNKSKL-----PSSGNHDSDK

Arabidopsis top hits
AT4G13040.1 Integrase-type DNA-binding superfamily protein (e value: 1.1e-37; % identity: 45.13)
Query:  MVSLRRRKLLGFCSGKGSFLAPV-----HKFSENLTTENPIHCTNFVSVHPICSDDINK--IKENPIANTEPESSSRVTVLDTSKEKNEESIAD--PPVQ
        MVSLRRR+LLG C G   ++ P+      +    +   N     N      + +  + K  I+E            R    D S   N +SI+    P +
Subjt:  MVSLRRRKLLGFCSGKGSFLAPV-----HKFSENLTTENPIHCTNFVSVHPICSDDINK--IKENPIANTEPESSSRVTVLDTSKEKNEESIAD--PPVQ

Query:  CRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQK-
         RK+H RKR  + EP LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGR+PNFEL EE  +EL++ +W++FL  TR  ITN+K K 
Subjt:  CRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQK-

Query:  RLSPESNKSKL-----PSSGNHDSDK
        R+  E  K        P     DSDK
Subjt:  RLSPESNKSKL-----PSSGNHDSDK

AT4G13040.2 Integrase-type DNA-binding superfamily protein (e value: 2.9e-35; % identity: 60.15)
Query:  IADPPVQCRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVI
        I+D P + RK+H RKR  + EP LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGR+PNFEL EE  +EL++ +W++FL  TR  I
Subjt:  IADPPVQCRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVI

Query:  TNRKQK-RLSPESNKSKL-----PSSGNHDSDK
        TN+K K R+  E  K        P     DSDK
Subjt:  TNRKQK-RLSPESNKSKL-----PSSGNHDSDK

AT4G13040.3 Integrase-type DNA-binding superfamily protein (e value: 1.1e-37; % identity: 45.13)
Query:  MVSLRRRKLLGFCSGKGSFLAPV-----HKFSENLTTENPIHCTNFVSVHPICSDDINK--IKENPIANTEPESSSRVTVLDTSKEKNEESIAD--PPVQ
        MVSLRRR+LLG C G   ++ P+      +    +   N     N      + +  + K  I+E            R    D S   N +SI+    P +
Subjt:  MVSLRRRKLLGFCSGKGSFLAPV-----HKFSENLTTENPIHCTNFVSVHPICSDDINK--IKENPIANTEPESSSRVTVLDTSKEKNEESIAD--PPVQ

Query:  CRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQK-
         RK+H RKR  + EP LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGR+PNFEL EE  +EL++ +W++FL  TR  ITN+K K 
Subjt:  CRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQK-

Query:  RLSPESNKSKL-----PSSGNHDSDK
        R+  E  K        P     DSDK
Subjt:  RLSPESNKSKL-----PSSGNHDSDK
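
The % identity reported for each hit can be approximated from the alignment text itself. The sketch below assumes the BLAST-style convention in which the middle line of each block repeats the residue at identical positions, shows `+` for conservative substitutions, and is blank otherwise, and it assumes identity is the number of identical positions divided by the alignment length with gap columns included. The function name `percent_identity`, the 8-character label width, and the two toy blocks are illustrative assumptions, not data from this record.

```python
LABEL_WIDTH = len("Query:  ")  # width of the 'Query:  ' / 'Subjt:  ' label

def percent_identity(blocks):
    """blocks: iterable of (query_line, midline) pairs from one alignment."""
    identical = 0
    aligned = 0
    for query_line, midline in blocks:
        query = query_line[LABEL_WIDTH:]
        # Pad the midline: trailing blanks (mismatches) may have been stripped.
        mid = midline[LABEL_WIDTH:].ljust(len(query))
        aligned += len(query)  # gap columns ('-') count toward the length
        # A letter in the midline marks an identical residue.
        identical += sum(1 for m in mid[:len(query)] if m.isalpha())
    return 100.0 * identical / aligned

# Toy blocks for illustration only (not taken from the hits above).
blocks = [
    ("Query:  MVSLRRRKLLG", "        MVSLRRR+LLG"),
    ("Query:  FCSGK-GSF",   "         C G   ++"),
]
print(round(percent_identity(blocks), 2))
```

To apply this to a real hit, pass each Query line together with the unlabeled line directly beneath it. The Subjt lines are not needed because the middle line already encodes the matches.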


Sequences
CDS sequence
ATGGTGAGCTTAAGAAGGCGTAAACTCCTGGGATTTTGCTCTGGGAAAGGCTCATTTCTTGCTCCAGTTCACAAGTTTTCTGAAAATTTGACTACCGAAAATCCCATACA
TTGTACAAACTTTGTTAGCGTGCATCCGATCTGTTCAGACGACATTAACAAGATAAAGGAGAATCCCATTGCAAATACAGAGCCTGAATCTTCATCAAGGGTAACTGTTT
TGGATACATCAAAAGAGAAAAATGAGGAGTCAATTGCAGACCCGCCCGTACAGTGCAGAAAGAGACACTGGAGAAAGCGTTTTCCAGATGAACCTTTCTTAATGAGAGGG
GTCTATTTCAAGAACATGAAATGGCAAGCTGCAATAAAGGTTGACAAGAAACAAATACACTTGGGAACTGTAGGATCACAAGAAGAAGCTGCTCATTTGTATGACAGAGC
TGCTTTCATGTGTGGAAGGAAACCAAACTTTGAGCTCCCAGAGGAGGAGAAGCAAGAACTGAGAAAGTTAAATTGGGACCAATTTTTAGCAGTCACTCGCCACGTCATTA
CTAATAGAAAACAGAAGAGGCTCAGCCCAGAATCAAACAAGTCTAAGCTTCCTTCGTCGGGAAATCATGACTCGGACAAAAGACATGGCAAGTTCAGTAACCTCTCAACT
CTAGAAGATATGAAACCAGAAGCCTCTACCTCTTGA
mRNA sequence
GAAATACGTAAATGTTGGAAAATTTCCAGCACATGGACTGTGGTTGAAATTTCTTTCAGTGAAGCGCATTGAAACTACGTTCTCTCTCTCTTTGTAGTGTTTGTATCCAT
TTTAGACCACCACTGTTCATCACCTTCCCCTCAGACTCCAAACAACAATTTTACAAAAACAGGGGAATTTCAATTTCCAAGGAAACAAAAAGGTGGTGGTGGTAGAAGTA
GATTGTGCATGAGTTCAGGATGCTCATAAATCGTTTTATTTCTAATGTTCGCCACCTTCCCTTCTCTTCCTCTTCCTAATCTCAATCCCCTTTTGTTTCGAATCGCTTTT
CTCCGCCGTGCGCTTCCCACAGCCGCCGCCTCCTCAGATGTGAGGTAATACCACTGAGAAGAAGATCGAACCAAACAATTCCATTTCCATTCCTCTGATAAGAAGATTGA
AGCTATAATCATGGTGAGCTTAAGAAGGCGTAAACTCCTGGGATTTTGCTCTGGGAAAGGCTCATTTCTTGCTCCAGTTCACAAGTTTTCTGAAAATTTGACTACCGAAA
ATCCCATACATTGTACAAACTTTGTTAGCGTGCATCCGATCTGTTCAGACGACATTAACAAGATAAAGGAGAATCCCATTGCAAATACAGAGCCTGAATCTTCATCAAGG
GTAACTGTTTTGGATACATCAAAAGAGAAAAATGAGGAGTCAATTGCAGACCCGCCCGTACAGTGCAGAAAGAGACACTGGAGAAAGCGTTTTCCAGATGAACCTTTCTT
AATGAGAGGGGTCTATTTCAAGAACATGAAATGGCAAGCTGCAATAAAGGTTGACAAGAAACAAATACACTTGGGAACTGTAGGATCACAAGAAGAAGCTGCTCATTTGT
ATGACAGAGCTGCTTTCATGTGTGGAAGGAAACCAAACTTTGAGCTCCCAGAGGAGGAGAAGCAAGAACTGAGAAAGTTAAATTGGGACCAATTTTTAGCAGTCACTCGC
CACGTCATTACTAATAGAAAACAGAAGAGGCTCAGCCCAGAATCAAACAAGTCTAAGCTTCCTTCGTCGGGAAATCATGACTCGGACAAAAGACATGGCAAGTTCAGTAA
CCTCTCAACTCTAGAAGATATGAAACCAGAAGCCTCTACCTCTTGAAGATTAGAATTCAAGAAGAAAAAGTTTAGTTTCCTTATCTCTTCCTCAAGTATGATGGAGATCT
GCAGTTTTGATTTTCCTCTGGAATTTTGGATGTACATCATTTTAGTGATTTTTAGGTAAATCCTTCAGAATTCGAGCCAATAAAAAGATCGATCGACTCGGGAGGGTTCT
TTCCATGCCTTCTTATTTGGACATTACTGCAAGATGCTTCTTACGACGGGGCTCGAGCTATTTTCCAGGCTCGGAACCCATAGGTTCAGCCATCCCTTGTTGCAGTGATG
TTTTTCACATGATCAAGGAACCTTGTTTCCATGGATGAAGAAGATACCACTATTAGCCATCTTATGTTCATAGCTATGCCTTTATTTCAAGTTTCCATTGTTATTAAATT
ATTATTTGAAGGTCCCCTTGAATCCCTCAATAATATGACATTGTTGTTTTTCTTTTTTTCTAATCATTTTACTATGTCTTAAAAGGAATAAAAGTGTAAAACC
Protein sequence
MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKRFPDEPFLMRG
VYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKLPSSGNHDSDKRHGKFSNLST
LEDMKPEASTS
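
The protein sequence is expected to be the conceptual translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1) and using only the first 39 nucleotides of the CDS for brevity. The helper `translate` is hypothetical and is not a CuGenDB tool.

```python
from itertools import product

BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Standard genetic code, with codons enumerated in TCAG order.
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing partial codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGTGAGCTTAAGAAGGCGTAAACTCCTGGGATTTTGC"  # first 39 nt of the CDS
print(translate(cds_start))
```

The result can be compared with the first 13 residues of the protein sequence (MVSLRRRKLLGFC). Passing the full CDS with line breaks removed extends the check to the whole protein.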