; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr004369 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr004369
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionethylene-responsive transcription factor-like protein isoform X1
Genome locationtig00002854:47166..52198
RNA-Seq ExpressionSgr004369
SyntenySgr004369
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011651656.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucumis sativus]6.0e-9982.91Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEK----NDEPIAEPPVKRRKRHR
        MVSLRRRKLLGL SGK SF+APV KFS+NLT E+HVHCT+FV V+PICSD VNKI+E+P AN  PESS VS LDTSKE+    NDEPIA+PPVKRRKRHR
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEK----NDEPIAEPPVKRRKRHR

Query:  RKHFPDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKK
        RKHFPDE  LMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKF WDEFLAMTR  ITNRKQKRLSPESKK
Subjt:  RKHFPDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKK

Query:  SRLPSSRNDDSDRRHDEFSDLSALEDVEPDASTS
        S L S  NDDS++RHD+F D S LEDVEP ASTS
Subjt:  SRLPSSRNDDSDRRHDEFSDLSALEDVEPDASTS

XP_022159538.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Momordica charantia]9.8e-10283.55Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPE-SSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKH
        MVSLRRRKLLG CSGKGSFLAPV KFS+NLTTEN +HCTNFVSVHPICSDD+NKIKE+PIANT PE SSRV+ LDTSKEKN+E IA+PPV+ RKRH RK 
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPE-SSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKH

Query:  FPDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKSRL
        FPDEP LMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPEEEKQELRK  WD+FLA+TR  ITNRKQKRLSPES KS+L
Subjt:  FPDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKSRL

Query:  PSSRNDDSDRRHDEFSDLSALEDVEPDASTS
        PSS N DSD+RH +FS+LS LED++P+ASTS
Subjt:  PSSRNDDSDRRHDEFSDLSALEDVEPDASTS

XP_022930940.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucurbita moschata]4.1e-9278.67Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKHF
        MVSLRRRKLLGLC+GKGSF APV K S+N T E+  HCTNF+SVHPICS++ N+I+E+P+AN   ESSRVS LDTSKEK+DEP AEPPVKRRKRHRRK F
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKHF

Query:  PDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKSRLP
        P+E  LMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPE EK+ELRKF WDEFLAMTR AI N+KQKR+SPESK S+LP
Subjt:  PDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKSRLP

Query:  SSRNDDSDRRHDEFSDLSALEDVEP
           NDD ++R DEF DLSA ED+EP
Subjt:  SSRNDDSDRRHDEFSDLSALEDVEP

XP_031738473.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Cucumis sativus]8.3e-9380.34Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEK----NDEPIAEPPVKRRKRHR
        MVSLRRRKLLGL S        V KFS+NLT E+HVHCT+FV V+PICSD VNKI+E+P AN  PESS VS LDTSKE+    NDEPIA+PPVKRRKRHR
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEK----NDEPIAEPPVKRRKRHR

Query:  RKHFPDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKK
        RKHFPDE  LMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKF WDEFLAMTR  ITNRKQKRLSPESKK
Subjt:  RKHFPDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKK

Query:  SRLPSSRNDDSDRRHDEFSDLSALEDVEPDASTS
        S L S  NDDS++RHD+F D S LEDVEP ASTS
Subjt:  SRLPSSRNDDSDRRHDEFSDLSALEDVEPDASTS

XP_038887390.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Benincasa hispida]4.1e-10082.7Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEK----NDEPI-AEPPVKRRKRH
        MVSLRRRKLLGLC+GKGSF+APV KFS+NLT E+HVHCTNFVSV+PICSD VNKIKE+PIAN  PESS VS LDTS+E+    NDEPI A+PP+KRRKRH
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEK----NDEPI-AEPPVKRRKRH

Query:  RRKHFPDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESK
        RRKHFPDE  LMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEK+ELRKF WDEFLAMTR AITNRKQKRLSPES 
Subjt:  RRKHFPDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESK

Query:  KSRL--PSSRNDDSDRRHDEFSDLSALEDVEPDASTS
        KS+L  P + +DDS++RHDEF D SALED+EP ASTS
Subjt:  KSRL--PSSRNDDSDRRHDEFSDLSALEDVEPDASTS

TrEMBL top hitse value%identityAlignment
A0A0A0LCE7 AP2/ERF domain-containing protein2.9e-9982.91Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEK----NDEPIAEPPVKRRKRHR
        MVSLRRRKLLGL SGK SF+APV KFS+NLT E+HVHCT+FV V+PICSD VNKI+E+P AN  PESS VS LDTSKE+    NDEPIA+PPVKRRKRHR
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEK----NDEPIAEPPVKRRKRHR

Query:  RKHFPDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKK
        RKHFPDE  LMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKF WDEFLAMTR  ITNRKQKRLSPESKK
Subjt:  RKHFPDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKK

Query:  SRLPSSRNDDSDRRHDEFSDLSALEDVEPDASTS
        S L S  NDDS++RHD+F D S LEDVEP ASTS
Subjt:  SRLPSSRNDDSDRRHDEFSDLSALEDVEPDASTS

A0A6J1DZ33 ethylene-responsive transcription factor-like protein At4g13040 isoform X14.8e-10283.55Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPE-SSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKH
        MVSLRRRKLLG CSGKGSFLAPV KFS+NLTTEN +HCTNFVSVHPICSDD+NKIKE+PIANT PE SSRV+ LDTSKEKN+E IA+PPV+ RKRH RK 
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPE-SSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKH

Query:  FPDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKSRL
        FPDEP LMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPEEEKQELRK  WD+FLA+TR  ITNRKQKRLSPES KS+L
Subjt:  FPDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKSRL

Query:  PSSRNDDSDRRHDEFSDLSALEDVEPDASTS
        PSS N DSD+RH +FS+LS LED++P+ASTS
Subjt:  PSSRNDDSDRRHDEFSDLSALEDVEPDASTS

A0A6J1ESB0 ethylene-responsive transcription factor-like protein At4g13040 isoform X12.0e-9278.67Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKHF
        MVSLRRRKLLGLC+GKGSF APV K S+N T E+  HCTNF+SVHPICS++ N+I+E+P+AN   ESSRVS LDTSKEK+DEP AEPPVKRRKRHRRK F
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKHF

Query:  PDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKSRLP
        P+E  LMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPE EK+ELRKF WDEFLAMTR AI N+KQKR+SPESK S+LP
Subjt:  PDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKSRLP

Query:  SSRNDDSDRRHDEFSDLSALEDVEP
           NDD ++R DEF DLSA ED+EP
Subjt:  SSRNDDSDRRHDEFSDLSALEDVEP

A0A6J1EWY2 ethylene-responsive transcription factor-like protein At4g13040 isoform X28.4e-9178.67Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKHF
        MVSLRRRKLLGLC+GKGSF APV K S+N T E+  HCTNF+SVHPICS++ N+I E+P+AN   ESSRVS LDTSKEK+DEP AEPPVKRRKRHRRK F
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKHF

Query:  PDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKSRLP
        P+E  LMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPE EK+ELRKF WDEFLAMTR AI N+KQKR+SPESK S+LP
Subjt:  PDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKSRLP

Query:  SSRNDDSDRRHDEFSDLSALEDVEP
           NDD ++R DEF DLSA ED+EP
Subjt:  SSRNDDSDRRHDEFSDLSALEDVEP

A0A6J1HQI0 ethylene-responsive transcription factor-like protein At4g13040 isoform X24.2e-9085.28Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKHF
        MVSLRRRKLLGLC+GKGSF+APV KFS++LTTE+HV+ T+F+SVHP+CSD VNKIKE+P+A   PE S VS LDTSKEKNDEP+A+PPVKRRKRHRRK F
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKHF

Query:  PDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKS
         DEP LMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNF+LPE EKQELR+F WDEFLAMTRRAITNRKQKRLSPESKKS
Subjt:  PDEPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKS

SwissProt top hitse value%identityAlignment
Q56XP9 Ethylene-responsive transcription factor-like protein At4g130408.5e-4047.35Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDV------NKIKEDPIANTGPE----SSRVSGLDTSKEKNDEPIAEPPVK
        MVSLRRR+LLGLC G   ++ P+P  +         +     + +P  ++ V       K  E+    T  +     S  S +      +  P  +PP K
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDV------NKIKEDPIANTGPE----SSRVSGLDTSKEKNDEPIAEPPVK

Query:  RRKRHRRKHFPD-EPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQK-
        RRK+HRRK   + EPCLMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL EE  +EL++  W+EFL  TRR ITN+K K 
Subjt:  RRKRHRRKHFPD-EPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQK-

Query:  RLSPESKK-----SRLPSSRNDDSDR
        R+  E  K        P     DSD+
Subjt:  RLSPESKK-----SRLPSSRNDDSDR

Arabidopsis top hitse value%identityAlignment
AT4G13040.1 Integrase-type DNA-binding superfamily protein6.0e-4147.35Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDV------NKIKEDPIANTGPE----SSRVSGLDTSKEKNDEPIAEPPVK
        MVSLRRR+LLGLC G   ++ P+P  +         +     + +P  ++ V       K  E+    T  +     S  S +      +  P  +PP K
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDV------NKIKEDPIANTGPE----SSRVSGLDTSKEKNDEPIAEPPVK

Query:  RRKRHRRKHFPD-EPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQK-
        RRK+HRRK   + EPCLMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL EE  +EL++  W+EFL  TRR ITN+K K 
Subjt:  RRKRHRRKHFPD-EPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQK-

Query:  RLSPESKK-----SRLPSSRNDDSDR
        R+  E  K        P     DSD+
Subjt:  RLSPESKK-----SRLPSSRNDDSDR

AT4G13040.2 Integrase-type DNA-binding superfamily protein3.1e-3763.16Show/hide
Query:  IAEPPVKRRKRHRRKHFPD-EPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAI
        I++ P KRRK+HRRK   + EPCLMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL EE  +EL++  W+EFL  TRR I
Subjt:  IAEPPVKRRKRHRRKHFPD-EPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAI

Query:  TNRKQK-RLSPESKK-----SRLPSSRNDDSDR
        TN+K K R+  E  K        P     DSD+
Subjt:  TNRKQK-RLSPESKK-----SRLPSSRNDDSDR

AT4G13040.3 Integrase-type DNA-binding superfamily protein6.0e-4147.35Show/hide
Query:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDV------NKIKEDPIANTGPE----SSRVSGLDTSKEKNDEPIAEPPVK
        MVSLRRR+LLGLC G   ++ P+P  +         +     + +P  ++ V       K  E+    T  +     S  S +      +  P  +PP K
Subjt:  MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDV------NKIKEDPIANTGPE----SSRVSGLDTSKEKNDEPIAEPPVK

Query:  RRKRHRRKHFPD-EPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQK-
        RRK+HRRK   + EPCLMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL EE  +EL++  W+EFL  TRR ITN+K K 
Subjt:  RRKRHRRKHFPD-EPCLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQK-

Query:  RLSPESKK-----SRLPSSRNDDSDR
        R+  E  K        P     DSD+
Subjt:  RLSPESKK-----SRLPSSRNDDSDR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGAGCTTAAGAAGGCGTAAACTCCTGGGACTTTGCTCTGGCAAAGGCTCATTCCTTGCTCCAGTTCCTAAGTTTTCTGATAATTTGACTACCGAAAATCACGTGCA
CTGTACAAACTTCGTTAGTGTGCATCCCATCTGTTCAGATGACGTTAACAAGATAAAGGAGGATCCTATTGCAAATACCGGGCCTGAATCATCAAGGGTATCTGGTTTGG
ATACATCAAAAGAGAAAAATGATGAGCCAATTGCAGAGCCACCCGTAAAGCGCAGAAAGAGACACCGGAGAAAGCATTTTCCAGATGAACCTTGCTTAATGAGAGGTGTT
TATTTCAAGAACATGAAATGGCAGGCTGCTATAAAGGTTGACAAGAAACAAATACACTTGGGAACTGTAGGATCACAAGAAGAAGCTGCTCATTTGTATGACAGAGCTGC
TTTCATGTGTGGAAGGGAACCCAACTTTGAGCTCCCAGAGGAGGAGAAGCAAGAACTGAGAAAGTTCAAATGGGATGAATTTTTAGCAATGACTCGCCGTGCAATTACTA
ATAGAAAACAGAAGAGGCTCAGCCCAGAATCAAAGAAGTCTAGACTTCCCTCGTCGAGGAATGACGACTCGGACAGAAGACACGATGAGTTCAGTGACCTCTCAGCTCTA
GAAGACGTGGAACCAGATGCCTCTACCTCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGAGCTTAAGAAGGCGTAAACTCCTGGGACTTTGCTCTGGCAAAGGCTCATTCCTTGCTCCAGTTCCTAAGTTTTCTGATAATTTGACTACCGAAAATCACGTGCA
CTGTACAAACTTCGTTAGTGTGCATCCCATCTGTTCAGATGACGTTAACAAGATAAAGGAGGATCCTATTGCAAATACCGGGCCTGAATCATCAAGGGTATCTGGTTTGG
ATACATCAAAAGAGAAAAATGATGAGCCAATTGCAGAGCCACCCGTAAAGCGCAGAAAGAGACACCGGAGAAAGCATTTTCCAGATGAACCTTGCTTAATGAGAGGTGTT
TATTTCAAGAACATGAAATGGCAGGCTGCTATAAAGGTTGACAAGAAACAAATACACTTGGGAACTGTAGGATCACAAGAAGAAGCTGCTCATTTGTATGACAGAGCTGC
TTTCATGTGTGGAAGGGAACCCAACTTTGAGCTCCCAGAGGAGGAGAAGCAAGAACTGAGAAAGTTCAAATGGGATGAATTTTTAGCAATGACTCGCCGTGCAATTACTA
ATAGAAAACAGAAGAGGCTCAGCCCAGAATCAAAGAAGTCTAGACTTCCCTCGTCGAGGAATGACGACTCGGACAGAAGACACGATGAGTTCAGTGACCTCTCAGCTCTA
GAAGACGTGGAACCAGATGCCTCTACCTCTTGA
Protein sequenceShow/hide protein sequence
MVSLRRRKLLGLCSGKGSFLAPVPKFSDNLTTENHVHCTNFVSVHPICSDDVNKIKEDPIANTGPESSRVSGLDTSKEKNDEPIAEPPVKRRKRHRRKHFPDEPCLMRGV
YFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKQELRKFKWDEFLAMTRRAITNRKQKRLSPESKKSRLPSSRNDDSDRRHDEFSDLSAL
EDVEPDASTS