; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g36700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g36700
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionethylene-responsive transcription factor-like protein isoform X1
Genome locationchr6:28191238..28193202
RNA-Seq ExpressionMoc06g36700
SyntenyMoc06g36700
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011651656.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucumis sativus]5.8e-9477.87Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH
        MVSLRRRKLLG  SGK SF+APV KFSENLT E+ +HCT+FV V+PICSD +NKI+ENP AN EPESS  V+VLDTSKE+    N+E IADPPV+ RKRH
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH

Query:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN
         RK FPDE FLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPEEEKQELRK NWD+FLA+TR+ ITNRKQKRLSPES 
Subjt:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN

Query:  KSKLPSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
        KS+L S GN DS+KRH KF + S LED++P ASTS
Subjt:  KSKLPSSGNHDSDKRHGKFSNLSTLEDMKPEASTS

XP_022159538.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Momordica charantia]9.5e-129100Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
        MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR

Query:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL
        FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL
Subjt:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL

Query:  PSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
        PSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
Subjt:  PSSGNHDSDKRHGKFSNLSTLEDMKPEASTS

XP_022159539.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Momordica charantia]3.6e-10499.47Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
        MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR

Query:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQ
        FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRK+
Subjt:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQ

XP_031738473.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Cucumis sativus]1.1e-8775.32Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH
        MVSLRRRKLLG  S        V KFSENLT E+ +HCT+FV V+PICSD +NKI+ENP AN EPESS  V+VLDTSKE+    N+E IADPPV+ RKRH
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH

Query:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN
         RK FPDE FLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPEEEKQELRK NWD+FLA+TR+ ITNRKQKRLSPES 
Subjt:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN

Query:  KSKLPSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
        KS+L S GN DS+KRH KF + S LED++P ASTS
Subjt:  KSKLPSSGNHDSDKRHGKFSNLSTLEDMKPEASTS

XP_038887390.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Benincasa hispida]1.2e-9678.57Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKN-----EESIADPPVQCRKR
        MVSLRRRKLLG C+GKGSF+APV KFSENLT E+ +HCTNFVSV+PICSD +NKIKENPIAN EPESS  V+VLDTS+E+N     E   ADPP++ RKR
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKN-----EESIADPPVQCRKR

Query:  HWRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPES
        H RK FPDE FLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPEEEK+ELRK NWD+FLA+TRH ITNRKQKRLSPES
Subjt:  HWRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPES

Query:  NKSKLPSSGNHDSD--KRHGKFSNLSTLEDMKPEASTS
        NKSKL S GN D D  KRH +F + S LEDM+P ASTS
Subjt:  NKSKLPSSGNHDSD--KRHGKFSNLSTLEDMKPEASTS

TrEMBL top hitse value%identityAlignment
A0A0A0LCE7 AP2/ERF domain-containing protein2.8e-9477.87Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH
        MVSLRRRKLLG  SGK SF+APV KFSENLT E+ +HCT+FV V+PICSD +NKI+ENP AN EPESS  V+VLDTSKE+    N+E IADPPV+ RKRH
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH

Query:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN
         RK FPDE FLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPEEEKQELRK NWD+FLA+TR+ ITNRKQKRLSPES 
Subjt:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN

Query:  KSKLPSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
        KS+L S GN DS+KRH KF + S LED++P ASTS
Subjt:  KSKLPSSGNHDSDKRHGKFSNLSTLEDMKPEASTS

A0A1S4E2T2 ethylene-responsive transcription factor-like protein At4g13040 isoform X62.8e-8678.6Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH
        MVSLRRRKLLG  SGK SF+APV KFSENLT E  +HCT+ V V+PICSD++NKI+ENPIAN EPESS  V+VLDTSKE+    N E IADPPV+ RKRH
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEK----NEESIADPPVQCRKRH

Query:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN
         RK FPDE FLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGR+PNFELPEEEKQELRK NWD+FLA+TR+ ITNRKQKRL+PES 
Subjt:  WRKRFPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESN

Query:  KSKLPSSGNHDSDKR
        KS+L S GN DS+KR
Subjt:  KSKLPSSGNHDSDKR

A0A6J1DZ33 ethylene-responsive transcription factor-like protein At4g13040 isoform X14.6e-129100Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
        MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR

Query:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL
        FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL
Subjt:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL

Query:  PSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
        PSSGNHDSDKRHGKFSNLSTLEDMKPEASTS
Subjt:  PSSGNHDSDKRHGKFSNLSTLEDMKPEASTS

A0A6J1E2M8 ethylene-responsive transcription factor-like protein At4g13040 isoform X21.8e-10499.47Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
        MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR

Query:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQ
        FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRK+
Subjt:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQ

A0A6J1ESB0 ethylene-responsive transcription factor-like protein At4g13040 isoform X17.4e-8773.25Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR
        MVSLRRRKLLG C+GKGSF APV K SEN T E+  HCTNF+SVHPICS++ N+I+ENP+AN E E SSRV+VLDTSKEK++E  A+PPV+ RKRH RK+
Subjt:  MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKR

Query:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL
        FP+E FLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGR+PNFELPE EK+ELRK NWD+FLA+TR  I N+KQKR+SPES  SKL
Subjt:  FPDEPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKL

Query:  PSSGNHDSDKRHGKFSNLSTLEDMKPEA
        P  GN D +KR  +F +LS  ED++P A
Subjt:  PSSGNHDSDKRHGKFSNLSTLEDMKPEA

SwissProt top hitse value%identityAlignment
Q56XP9 Ethylene-responsive transcription factor-like protein At4g130401.5e-3645.13Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPV-----HKFSENLTTENPIHCTNFVSVHPICSDDINK--IKENPIANTEPESSSRVTVLDTSKEKNEESIAD--PPVQ
        MVSLRRR+LLG C G   ++ P+      +    +   N     N      + +  + K  I+E            R    D S   N +SI+    P +
Subjt:  MVSLRRRKLLGFCSGKGSFLAPV-----HKFSENLTTENPIHCTNFVSVHPICSDDINK--IKENPIANTEPESSSRVTVLDTSKEKNEESIAD--PPVQ

Query:  CRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQK-
         RK+H RKR  + EP LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGR+PNFEL EE  +EL++ +W++FL  TR  ITN+K K 
Subjt:  CRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQK-

Query:  RLSPESNKSKL-----PSSGNHDSDK
        R+  E  K        P     DSDK
Subjt:  RLSPESNKSKL-----PSSGNHDSDK

Arabidopsis top hitse value%identityAlignment
AT4G13040.1 Integrase-type DNA-binding superfamily protein1.1e-3745.13Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPV-----HKFSENLTTENPIHCTNFVSVHPICSDDINK--IKENPIANTEPESSSRVTVLDTSKEKNEESIAD--PPVQ
        MVSLRRR+LLG C G   ++ P+      +    +   N     N      + +  + K  I+E            R    D S   N +SI+    P +
Subjt:  MVSLRRRKLLGFCSGKGSFLAPV-----HKFSENLTTENPIHCTNFVSVHPICSDDINK--IKENPIANTEPESSSRVTVLDTSKEKNEESIAD--PPVQ

Query:  CRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQK-
         RK+H RKR  + EP LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGR+PNFEL EE  +EL++ +W++FL  TR  ITN+K K 
Subjt:  CRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQK-

Query:  RLSPESNKSKL-----PSSGNHDSDK
        R+  E  K        P     DSDK
Subjt:  RLSPESNKSKL-----PSSGNHDSDK

AT4G13040.2 Integrase-type DNA-binding superfamily protein2.9e-3560.15Show/hide
Query:  IADPPVQCRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVI
        I+D P + RK+H RKR  + EP LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGR+PNFEL EE  +EL++ +W++FL  TR  I
Subjt:  IADPPVQCRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVI

Query:  TNRKQK-RLSPESNKSKL-----PSSGNHDSDK
        TN+K K R+  E  K        P     DSDK
Subjt:  TNRKQK-RLSPESNKSKL-----PSSGNHDSDK

AT4G13040.3 Integrase-type DNA-binding superfamily protein1.1e-3745.13Show/hide
Query:  MVSLRRRKLLGFCSGKGSFLAPV-----HKFSENLTTENPIHCTNFVSVHPICSDDINK--IKENPIANTEPESSSRVTVLDTSKEKNEESIAD--PPVQ
        MVSLRRR+LLG C G   ++ P+      +    +   N     N      + +  + K  I+E            R    D S   N +SI+    P +
Subjt:  MVSLRRRKLLGFCSGKGSFLAPV-----HKFSENLTTENPIHCTNFVSVHPICSDDINK--IKENPIANTEPESSSRVTVLDTSKEKNEESIAD--PPVQ

Query:  CRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQK-
         RK+H RKR  + EP LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGR+PNFEL EE  +EL++ +W++FL  TR  ITN+K K 
Subjt:  CRKRHWRKRFPD-EPFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQK-

Query:  RLSPESNKSKL-----PSSGNHDSDK
        R+  E  K        P     DSDK
Subjt:  RLSPESNKSKL-----PSSGNHDSDK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAGCTTAAGAAGGCGTAAACTCCTGGGATTTTGCTCTGGGAAAGGCTCATTTCTTGCTCCAGTTCACAAGTTTTCTGAAAATTTGACTACCGAAAATCCCATACA
TTGTACAAACTTTGTTAGCGTGCATCCGATCTGTTCAGACGACATTAACAAGATAAAGGAGAATCCCATTGCAAATACAGAGCCTGAATCTTCATCAAGGGTAACTGTTT
TGGATACATCAAAAGAGAAAAATGAGGAGTCAATTGCAGACCCGCCCGTACAGTGCAGAAAGAGACACTGGAGAAAGCGTTTTCCAGATGAACCTTTCTTAATGAGAGGG
GTCTATTTCAAGAACATGAAATGGCAAGCTGCAATAAAGGTTGACAAGAAACAAATACACTTGGGAACTGTAGGATCACAAGAAGAAGCTGCTCATTTGTATGACAGAGC
TGCTTTCATGTGTGGAAGGAAACCAAACTTTGAGCTCCCAGAGGAGGAGAAGCAAGAACTGAGAAAGTTAAATTGGGACCAATTTTTAGCAGTCACTCGCCACGTCATTA
CTAATAGAAAACAGAAGAGGCTCAGCCCAGAATCAAACAAGTCTAAGCTTCCTTCGTCGGGAAATCATGACTCGGACAAAAGACATGGCAAGTTCAGTAACCTCTCAACT
CTAGAAGATATGAAACCAGAAGCCTCTACCTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGAGCTTAAGAAGGCGTAAACTCCTGGGATTTTGCTCTGGGAAAGGCTCATTTCTTGCTCCAGTTCACAAGTTTTCTGAAAATTTGACTACCGAAAATCCCATACA
TTGTACAAACTTTGTTAGCGTGCATCCGATCTGTTCAGACGACATTAACAAGATAAAGGAGAATCCCATTGCAAATACAGAGCCTGAATCTTCATCAAGGGTAACTGTTT
TGGATACATCAAAAGAGAAAAATGAGGAGTCAATTGCAGACCCGCCCGTACAGTGCAGAAAGAGACACTGGAGAAAGCGTTTTCCAGATGAACCTTTCTTAATGAGAGGG
GTCTATTTCAAGAACATGAAATGGCAAGCTGCAATAAAGGTTGACAAGAAACAAATACACTTGGGAACTGTAGGATCACAAGAAGAAGCTGCTCATTTGTATGACAGAGC
TGCTTTCATGTGTGGAAGGAAACCAAACTTTGAGCTCCCAGAGGAGGAGAAGCAAGAACTGAGAAAGTTAAATTGGGACCAATTTTTAGCAGTCACTCGCCACGTCATTA
CTAATAGAAAACAGAAGAGGCTCAGCCCAGAATCAAACAAGTCTAAGCTTCCTTCGTCGGGAAATCATGACTCGGACAAAAGACATGGCAAGTTCAGTAACCTCTCAACT
CTAGAAGATATGAAACCAGAAGCCTCTACCTCTTGA
Protein sequenceShow/hide protein sequence
MVSLRRRKLLGFCSGKGSFLAPVHKFSENLTTENPIHCTNFVSVHPICSDDINKIKENPIANTEPESSSRVTVLDTSKEKNEESIADPPVQCRKRHWRKRFPDEPFLMRG
VYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGRKPNFELPEEEKQELRKLNWDQFLAVTRHVITNRKQKRLSPESNKSKLPSSGNHDSDKRHGKFSNLST
LEDMKPEASTS