CuGenDBv2

Spg037932 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg037932
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Ethylene-responsive transcription factor-like protein
Genome location: scaffold12:41601291..41605812
RNA-Seq Expression: Spg037932
Synteny: Spg037932
Gene Ontology terms:
    GO:0006355 - regulation of transcription, DNA-templated (biological process)
    GO:0005634 - nucleus (cellular component)
    GO:0003677 - DNA binding (molecular function)
    GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
    IPR001471 - AP2/ERF domain
    IPR016177 - DNA-binding domain superfamily
    IPR036955 - AP2/ERF domain superfamily
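
The genome location field above has the form scaffold:start..end. As a minimal sketch, the string can be split into its parts as shown below. The names `Location` and `parse_location` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval (hypothetical helper type, not part of the database)."""
    scaffold: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'scaffold:start..end' string into its three parts."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    scaffold, start, end = match.groups()
    return Location(scaffold, int(start), int(end))


loc = parse_location("scaffold12:41601291..41605812")
print(loc.scaffold, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 4522 bp on scaffold12.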


Homology
GenBank top hits (e value, % identity, alignment)
XP_008460375.1 PREDICTED: ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucumis melo] (e value: 1.8e-97; identity: 83.84%)
Query:  KKNQTVPLITRLKLIMVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKVD----
        +K QT+PLIT  KLIMVSLRRRKLLGL +GK SFVAPV KFSENLTAE HVHCT+ V V+PICSDEVNKI+ENPIANIEPE S VSVLDTSKE++D    
Subjt:  KKNQTVPLITRLKLIMVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKVD----

Query:  EPIADLPVKRRKRHRRKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHA
        EPIAD PVKRRKRHRRK+FPDE    RGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEEEK+ELRKFNWDEFLAMTR+ 
Subjt:  EPIADLPVKRRKRHRRKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHA

Query:  ITNRKQKRLSPESKKSKLPSPWNDDSNKR
        ITNRKQKRL+PESKKS+L SP NDDSNKR
Subjt:  ITNRKQKRLSPESKKSKLPSPWNDDSNKR

XP_008460376.1 PREDICTED: ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Cucumis melo] (e value: 1.8e-97; identity: 83.84%)
Query:  KKNQTVPLITRLKLIMVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKVD----
        +K QT+PLIT  KLIMVSLRRRKLLGL +GK SFVAPV KFSENLTAE HVHCT+ V V+PICSDEVNKI+ENPIANIEPE S VSVLDTSKE++D    
Subjt:  KKNQTVPLITRLKLIMVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKVD----

Query:  EPIADLPVKRRKRHRRKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHA
        EPIAD PVKRRKRHRRK+FPDE    RGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEEEK+ELRKFNWDEFLAMTR+ 
Subjt:  EPIADLPVKRRKRHRRKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHA

Query:  ITNRKQKRLSPESKKSKLPSPWNDDSNKR
        ITNRKQKRL+PESKKS+L SP NDDSNKR
Subjt:  ITNRKQKRLSPESKKSKLPSPWNDDSNKR

XP_011651656.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucumis sativus] (e value: 1.4e-102; identity: 85.04%)
Query:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKV----DEPIADLPVKRRKRHR
        MVSLRRRKLLGL +GK SFVAPV KFSENLTAEDHVHCT+FV V+PICSD+VNKI+ENP ANIEPE S VSVLDTSKE++    DEPIAD PVKRRKRHR
Subjt:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKV----DEPIADLPVKRRKRHR

Query:  RKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPESKK
        RK+FPDE    RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEK+ELRKFNWDEFLAMTR+ ITNRKQKRLSPESKK
Subjt:  RKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPESKK

Query:  SKLPSPWNDDSNKRHDEFSDLSAIEDVEPVASTS
        S+L SP NDDSNKRHD+F D S +EDVEPVASTS
Subjt:  SKLPSPWNDDSNKRHDEFSDLSAIEDVEPVASTS

XP_022159538.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Momordica charantia] (e value: 1.1e-97; identity: 80.95%)
Query:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPE-PSRVSVLDTSKEKVDEPIADLPVKRRKRHRRKN
        MVSLRRRKLLG C+GKGSF+APV KFSENLT E+ +HCTNFVSVHPICSD++NKIKENPIAN EPE  SRV+VLDTSKEK +E IAD PV+ RKRH RK 
Subjt:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPE-PSRVSVLDTSKEKVDEPIADLPVKRRKRHRRKN

Query:  FPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPESKKSKL
        FPDEP   RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPEEEK+ELRK NWD+FLA+TRH ITNRKQKRLSPES KSKL
Subjt:  FPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPESKKSKL

Query:  PSPWNDDSNKRHDEFSDLSAIEDVEPVASTS
        PS  N DS+KRH +FS+LS +ED++P ASTS
Subjt:  PSPWNDDSNKRHDEFSDLSAIEDVEPVASTS

XP_038887390.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Benincasa hispida] (e value: 1.1e-105; identity: 87.76%)
Query:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEK----VDEPI-ADLPVKRRKRH
        MVSLRRRKLLGLCTGKGSFVAPV KFSENLTAEDHVHCTNFVSV+PICSD+VNKIKENPIANIEPE S VSVLDTS+E+     DEPI AD P+KRRKRH
Subjt:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEK----VDEPI-ADLPVKRRKRH

Query:  RRKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPESK
        RRK+FPDE    RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPES 
Subjt:  RRKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPESK

Query:  KSKLPSPWN--DDSNKRHDEFSDLSAIEDVEPVASTS
        KSKL SP N  DDSNKRHDEF D SA+ED+EPVASTS
Subjt:  KSKLPSPWN--DDSNKRHDEFSDLSAIEDVEPVASTS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LCE7 AP2/ERF domain-containing protein (e value: 7.0e-103; identity: 85.04%)
Query:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKV----DEPIADLPVKRRKRHR
        MVSLRRRKLLGL +GK SFVAPV KFSENLTAEDHVHCT+FV V+PICSD+VNKI+ENP ANIEPE S VSVLDTSKE++    DEPIAD PVKRRKRHR
Subjt:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKV----DEPIADLPVKRRKRHR

Query:  RKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPESKK
        RK+FPDE    RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEK+ELRKFNWDEFLAMTR+ ITNRKQKRLSPESKK
Subjt:  RKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPESKK

Query:  SKLPSPWNDDSNKRHDEFSDLSAIEDVEPVASTS
        S+L SP NDDSNKRHD+F D S +EDVEPVASTS
Subjt:  SKLPSPWNDDSNKRHDEFSDLSAIEDVEPVASTS

A0A1S3CCB8 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 (e value: 8.8e-98; identity: 83.84%)
Query:  KKNQTVPLITRLKLIMVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKVD----
        +K QT+PLIT  KLIMVSLRRRKLLGL +GK SFVAPV KFSENLTAE HVHCT+ V V+PICSDEVNKI+ENPIANIEPE S VSVLDTSKE++D    
Subjt:  KKNQTVPLITRLKLIMVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKVD----

Query:  EPIADLPVKRRKRHRRKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHA
        EPIAD PVKRRKRHRRK+FPDE    RGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEEEK+ELRKFNWDEFLAMTR+ 
Subjt:  EPIADLPVKRRKRHRRKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHA

Query:  ITNRKQKRLSPESKKSKLPSPWNDDSNKR
        ITNRKQKRL+PESKKS+L SP NDDSNKR
Subjt:  ITNRKQKRLSPESKKSKLPSPWNDDSNKR

A0A1S3CCT4 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 (e value: 8.8e-98; identity: 83.84%)
Query:  KKNQTVPLITRLKLIMVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKVD----
        +K QT+PLIT  KLIMVSLRRRKLLGL +GK SFVAPV KFSENLTAE HVHCT+ V V+PICSDEVNKI+ENPIANIEPE S VSVLDTSKE++D    
Subjt:  KKNQTVPLITRLKLIMVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKVD----

Query:  EPIADLPVKRRKRHRRKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHA
        EPIAD PVKRRKRHRRK+FPDE    RGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPEEEK+ELRKFNWDEFLAMTR+ 
Subjt:  EPIADLPVKRRKRHRRKNFPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHA

Query:  ITNRKQKRLSPESKKSKLPSPWNDDSNKR
        ITNRKQKRL+PESKKS+L SP NDDSNKR
Subjt:  ITNRKQKRLSPESKKSKLPSPWNDDSNKR

A0A6J1DZ33 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 (e value: 5.2e-98; identity: 80.95%)
Query:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPE-PSRVSVLDTSKEKVDEPIADLPVKRRKRHRRKN
        MVSLRRRKLLG C+GKGSF+APV KFSENLT E+ +HCTNFVSVHPICSD++NKIKENPIAN EPE  SRV+VLDTSKEK +E IAD PV+ RKRH RK 
Subjt:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPE-PSRVSVLDTSKEKVDEPIADLPVKRRKRHRRKN

Query:  FPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPESKKSKL
        FPDEP   RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGR+PNFELPEEEK+ELRK NWD+FLA+TRH ITNRKQKRLSPES KSKL
Subjt:  FPDEP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPESKKSKL

Query:  PSPWNDDSNKRHDEFSDLSAIEDVEPVASTS
        PS  N DS+KRH +FS+LS +ED++P ASTS
Subjt:  PSPWNDDSNKRHDEFSDLSAIEDVEPVASTS

A0A6J1ESB0 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 (e value: 7.5e-97; identity: 82.38%)
Query:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKVDEPIADLPVKRRKRHRRKNF
        MVSLRRRKLLGLCTGKGSF APVSK SEN TAED  HCTNF+SVHPICS+E N+I+ENP+AN+E E SRVSVLDTSKEK DEP A+ PVKRRKRHRRK F
Subjt:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVLDTSKEKVDEPIADLPVKRRKRHRRKNF

Query:  PDE---PRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPESKKSKLP
        P+E    RGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAAHLYDRAAFMCGREPNFELPE EKKELRKFNWDEFLAMTR AI N+KQKR+SPESK SKLP
Subjt:  PDE---PRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAITNRKQKRLSPESKKSKLP

Query:  SPWNDDSNKRHDEFSDLSAIEDVEPVA
           NDD NKR DEF DLSA ED+EP+A
Subjt:  SPWNDDSNKRHDEFSDLSAIEDVEPVA

SwissProt top hits (e value, % identity, alignment)
Q56XP9 Ethylene-responsive transcription factor-like protein At4g13040 (e value: 7.8e-35; identity: 43.59%)
Query:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHV-----------------HCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVL--DTSKEKVD
        MVSLRRR+LLGLC G   +V P+      LTAE+ +                      V V     +E ++   +   +   + S +S +  D+   K  
Subjt:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHV-----------------HCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVL--DTSKEKVD

Query:  EPIADLPVKRRKRHRRKNFPD-EP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRH
         P      KRRK+HRRK   + EP   RGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL EE  +EL++ +W+EFL  TR 
Subjt:  EPIADLPVKRRKRHRRKNFPD-EP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRH

Query:  AITNRKQK-RLSPESKKSKLPSPWNDDSNKRHDE
         ITN+K K R+  E  K     P   +  ++  +
Subjt:  AITNRKQK-RLSPESKKSKLPSPWNDDSNKRHDE

Arabidopsis top hits (e value, % identity, alignment)
AT4G13040.1 Integrase-type DNA-binding superfamily protein (e value: 5.6e-36; identity: 43.59%)
Query:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHV-----------------HCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVL--DTSKEKVD
        MVSLRRR+LLGLC G   +V P+      LTAE+ +                      V V     +E ++   +   +   + S +S +  D+   K  
Subjt:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHV-----------------HCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVL--DTSKEKVD

Query:  EPIADLPVKRRKRHRRKNFPD-EP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRH
         P      KRRK+HRRK   + EP   RGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL EE  +EL++ +W+EFL  TR 
Subjt:  EPIADLPVKRRKRHRRKNFPD-EP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRH

Query:  AITNRKQK-RLSPESKKSKLPSPWNDDSNKRHDE
         ITN+K K R+  E  K     P   +  ++  +
Subjt:  AITNRKQK-RLSPESKKSKLPSPWNDDSNKRHDE

AT4G13040.2 Integrase-type DNA-binding superfamily protein (e value: 8.9e-34; identity: 59.09%)
Query:  IADLPVKRRKRHRRKNFPD-EP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAI
        I+D P KRRK+HRRK   + EP   RGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL EE  +EL++ +W+EFL  TR  I
Subjt:  IADLPVKRRKRHRRKNFPD-EP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRHAI

Query:  TNRKQK-RLSPESKKSKLPSPWNDDSNKRHDE
        TN+K K R+  E  K     P   +  ++  +
Subjt:  TNRKQK-RLSPESKKSKLPSPWNDDSNKRHDE

AT4G13040.3 Integrase-type DNA-binding superfamily protein (e value: 5.6e-36; identity: 43.59%)
Query:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHV-----------------HCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVL--DTSKEKVD
        MVSLRRR+LLGLC G   +V P+      LTAE+ +                      V V     +E ++   +   +   + S +S +  D+   K  
Subjt:  MVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHV-----------------HCTNFVSVHPICSDEVNKIKENPIANIEPEPSRVSVL--DTSKEKVD

Query:  EPIADLPVKRRKRHRRKNFPD-EP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRH
         P      KRRK+HRRK   + EP   RGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAA LYDRAAFMCGREPNFEL EE  +EL++ +W+EFL  TR 
Subjt:  EPIADLPVKRRKRHRRKNFPD-EP---RGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKELRKFNWDEFLAMTRH

Query:  AITNRKQK-RLSPESKKSKLPSPWNDDSNKRHDE
         ITN+K K R+  E  K     P   +  ++  +
Subjt:  AITNRKQK-RLSPESKKSKLPSPWNDDSNKRHDE
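
In the alignments above, the unlabelled middle line of each block summarises the match column by column. Assuming the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap), identities can be counted as in the sketch below. The function name `midline_stats` is hypothetical, and the example uses only the last block of the first GenBank hit, so its percentage is not expected to equal the whole-alignment value reported for that hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    Assumes the BLAST midline convention: letter = identity,
    '+' = conservative substitution, space = mismatch or gap.
    The column count is taken from the query line because trailing
    spaces of the midline may have been lost in the page dump.
    """
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return identities, positives, len(query)


query = "ITNRKQKRLSPESKKSKLPSPWNDDSNKR"
midline = "ITNRKQKRL+PESKKS+L SP NDDSNKR"

identities, positives, columns = midline_stats(query, midline)
print(f"{identities}/{columns} identities, {positives}/{columns} positives")
print(f"{100 * identities / columns:.1f}% identity in this block")
```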


Sequences
CDS sequence
ATGCTGATGAGTTGTTTGCTGTTTTGTTGTTTTGGGTTTCTGAGTTGGCGTCTGTTTCAGTATTGGGTCATCCTTATTGGTCGTTCTCAGAAACTTGCATTCTTTATCAC
CAATGTGAAGCGATCCAGGAAAAAGAACCAAACGGTACCACTGATAACAAGATTGAAGCTAATTATGGTGAGCTTAAGAAGACGTAAACTCCTGGGACTTTGCACTGGCA
AAGGCTCATTTGTTGCTCCAGTTTCGAAGTTTTCTGAAAATTTGACTGCTGAAGATCACGTGCATTGTACAAACTTCGTTAGTGTCCATCCCATCTGTTCGGACGAAGTT
AACAAGATAAAGGAGAATCCCATTGCAAATATAGAGCCTGAACCATCAAGGGTATCTGTTTTGGATACATCAAAAGAGAAAGTTGATGAGCCAATTGCAGACCTGCCCGT
AAAGCGCAGAAAGAGACACCGGAGAAAGAATTTTCCAGATGAACCAAGAGGTGTTTATTTCAAGAACATGAAATGGCAGGCTGCTATAAAGGTTGACAAGAAACAAATAC
ACTTGGGAACTGTAGGATCACAAGAAGAAGCTGCTCATTTGTATGACAGAGCTGCTTTCATGTGTGGAAGGGAACCCAACTTTGAGCTCCCAGAGGAGGAGAAGAAAGAA
CTGAGAAAGTTCAATTGGGACGAGTTTTTAGCAATGACTCGCCACGCGATTACTAATAGAAAACAGAAAAGGCTCAGCCCAGAATCAAAGAAGTCTAAACTTCCTTCGCC
GTGGAATGACGACTCGAACAAGAGACATGATGAGTTCAGTGACCTCTCAGCCATAGAAGACGTGGAACCTGTTGCCTCTACCTCCTGA
mRNA sequence
ATGCTGATGAGTTGTTTGCTGTTTTGTTGTTTTGGGTTTCTGAGTTGGCGTCTGTTTCAGTATTGGGTCATCCTTATTGGTCGTTCTCAGAAACTTGCATTCTTTATCAC
CAATGTGAAGCGATCCAGGAAAAAGAACCAAACGGTACCACTGATAACAAGATTGAAGCTAATTATGGTGAGCTTAAGAAGACGTAAACTCCTGGGACTTTGCACTGGCA
AAGGCTCATTTGTTGCTCCAGTTTCGAAGTTTTCTGAAAATTTGACTGCTGAAGATCACGTGCATTGTACAAACTTCGTTAGTGTCCATCCCATCTGTTCGGACGAAGTT
AACAAGATAAAGGAGAATCCCATTGCAAATATAGAGCCTGAACCATCAAGGGTATCTGTTTTGGATACATCAAAAGAGAAAGTTGATGAGCCAATTGCAGACCTGCCCGT
AAAGCGCAGAAAGAGACACCGGAGAAAGAATTTTCCAGATGAACCAAGAGGTGTTTATTTCAAGAACATGAAATGGCAGGCTGCTATAAAGGTTGACAAGAAACAAATAC
ACTTGGGAACTGTAGGATCACAAGAAGAAGCTGCTCATTTGTATGACAGAGCTGCTTTCATGTGTGGAAGGGAACCCAACTTTGAGCTCCCAGAGGAGGAGAAGAAAGAA
CTGAGAAAGTTCAATTGGGACGAGTTTTTAGCAATGACTCGCCACGCGATTACTAATAGAAAACAGAAAAGGCTCAGCCCAGAATCAAAGAAGTCTAAACTTCCTTCGCC
GTGGAATGACGACTCGAACAAGAGACATGATGAGTTCAGTGACCTCTCAGCCATAGAAGACGTGGAACCTGTTGCCTCTACCTCCTGA
Protein sequence
MLMSCLLFCCFGFLSWRLFQYWVILIGRSQKLAFFITNVKRSRKKNQTVPLITRLKLIMVSLRRRKLLGLCTGKGSFVAPVSKFSENLTAEDHVHCTNFVSVHPICSDEV
NKIKENPIANIEPEPSRVSVLDTSKEKVDEPIADLPVKRRKRHRRKNFPDEPRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAAHLYDRAAFMCGREPNFELPEEEKKE
LRKFNWDEFLAMTRHAITNRKQKRLSPESKKSKLPSPWNDDSNKRHDEFSDLSAIEDVEPVASTS
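
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check, assuming the standard genetic code and using only the first 30 and last 15 nucleotides of the CDS for brevity, is given below; `translate` is a hypothetical helper, not a tool provided by the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCTGATGAGTTGTTTGCTGTTTTGTTGT"  # first 30 nt of the CDS
cds_end = "GCCTCTACCTCCTGA"                  # last 15 nt of the CDS

print(translate(cds_start))  # first 10 residues of the protein
print(translate(cds_end))    # last 4 residues, before the TGA stop
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the listed protein sequence extends the same check to the whole record.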