CuGenDBv2

ClCG07G002100 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG07G002100
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Ethylene-responsive transcription factor-like protein
Genome location: CG_Chr07:2168938..2172236
RNA-Seq Expression: ClCG07G002100
Synteny: ClCG07G002100
Gene Ontology terms:
  GO:0006355 - regulation of transcription, DNA-templated (biological process)
  GO:0005634 - nucleus (cellular component)
  GO:0003677 - DNA binding (molecular function)
  GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domains:
  IPR001471 - AP2/ERF domain
  IPR016177 - DNA-binding domain superfamily
  IPR036955 - AP2/ERF domain superfamily
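
The genome location listed above follows a `chromosome:start..end` pattern. As a minimal sketch, and assuming the coordinates are 1-based and inclusive (a common convention, but not stated on this page), the field can be split into its parts and the gene span computed. The helper name `parse_location` is hypothetical.

```python
import re

def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, length)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: 1-based inclusive coordinates, so length = end - start + 1.
    return chrom, start, end, end - start + 1

print(parse_location("CG_Chr07:2168938..2172236"))
```

Under that assumption the gene spans 3299 bp on CG_Chr07.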


Homology
GenBank top hits (e value, % identity, alignment)
XP_008460375.1 PREDICTED: ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucumis melo] (e value: 1.1e-98; % identity: 86.76)
Query:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK
        MVSLRRRKLLGL +GK  FVAPVLKFSENLT E HVHCT+SV VYPICSDEVNKI+EN IAN+EPESSG+SVLDTSKE+ DTTN EPI   ADPP+KRRK
Subjt:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK

Query:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE
        RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAA LYDRAAFMCGREPNFELPE EKQELRKFNWDEFLAMTR+ ITNRKQKR++PE
Subjt:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE

Query:  SKKSKLSSPGNDDDYSNKR
        SKKS+LSSPGNDD  SNKR
Subjt:  SKKSKLSSPGNDDDYSNKR

XP_008460380.1 PREDICTED: ethylene-responsive transcription factor-like protein At4g13040 isoform X6 [Cucumis melo] (e value: 1.1e-98; % identity: 86.76)
Query:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK
        MVSLRRRKLLGL +GK  FVAPVLKFSENLT E HVHCT+SV VYPICSDEVNKI+EN IAN+EPESSG+SVLDTSKE+ DTTN EPI   ADPP+KRRK
Subjt:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK

Query:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE
        RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAA LYDRAAFMCGREPNFELPE EKQELRKFNWDEFLAMTR+ ITNRKQKR++PE
Subjt:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE

Query:  SKKSKLSSPGNDDDYSNKR
        SKKS+LSSPGNDD  SNKR
Subjt:  SKKSKLSSPGNDDDYSNKR

XP_011651656.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Cucumis sativus] (e value: 4.3e-108; % identity: 86.61)
Query:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK
        MVSLRRRKLLGL +GK  FVAPVLKFSENLT EDHVHCT+ V VYPICSD+VNKI+EN  AN+EPESSG+SVLDTSKE+ DTTNDEPI   ADPP+KRRK
Subjt:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK

Query:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE
        RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAA LYDRAAFMCGREPNFELPE EKQELRKFNWDEFLAMTR+ ITNRKQKR+SPE
Subjt:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE

Query:  SKKSKLSSPGNDDDYSNKRHDEFDDLSALEDVEPVASTS
        SKKS+LSSPGNDD  SNKRHD+F D S LEDVEPVASTS
Subjt:  SKKSKLSSPGNDDDYSNKRHDEFDDLSALEDVEPVASTS

XP_031738473.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 [Cucumis sativus] (e value: 3.5e-102; % identity: 84.1)
Query:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK
        MVSLRRRKLLGL +        VLKFSENLT EDHVHCT+ V VYPICSD+VNKI+EN  AN+EPESSG+SVLDTSKE+ DTTNDEPI   ADPP+KRRK
Subjt:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK

Query:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE
        RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAA LYDRAAFMCGREPNFELPE EKQELRKFNWDEFLAMTR+ ITNRKQKR+SPE
Subjt:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE

Query:  SKKSKLSSPGNDDDYSNKRHDEFDDLSALEDVEPVASTS
        SKKS+LSSPGNDD  SNKRHD+F D S LEDVEPVASTS
Subjt:  SKKSKLSSPGNDDDYSNKRHDEFDDLSALEDVEPVASTS

XP_038887390.1 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 [Benincasa hispida] (e value: 1.3e-117; % identity: 91.63)
Query:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK
        MVSLRRRKLLGLCTGKG FVAPVLKFSENLT EDHVHCTN VSVYPICSD+VNKIKEN IAN+EPESSG+SVLDTS+E+NDTTNDEPIA  ADPPIKRRK
Subjt:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK

Query:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE
        RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAA LYDRAAFMCGREPNFELPE EK+ELRKFNWDEFLAMTRHAITNRKQKR+SPE
Subjt:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE

Query:  SKKSKLSSPGNDDDYSNKRHDEFDDLSALEDVEPVASTS
        S KSKLSSPGNDDD SNKRHDEF D SALED+EPVASTS
Subjt:  SKKSKLSSPGNDDDYSNKRHDEFDDLSALEDVEPVASTS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LCE7 AP2/ERF domain-containing protein (e value: 2.1e-108; % identity: 86.61)
Query:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK
        MVSLRRRKLLGL +GK  FVAPVLKFSENLT EDHVHCT+ V VYPICSD+VNKI+EN  AN+EPESSG+SVLDTSKE+ DTTNDEPI   ADPP+KRRK
Subjt:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK

Query:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE
        RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAA LYDRAAFMCGREPNFELPE EKQELRKFNWDEFLAMTR+ ITNRKQKR+SPE
Subjt:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE

Query:  SKKSKLSSPGNDDDYSNKRHDEFDDLSALEDVEPVASTS
        SKKS+LSSPGNDD  SNKRHD+F D S LEDVEPVASTS
Subjt:  SKKSKLSSPGNDDDYSNKRHDEFDDLSALEDVEPVASTS

A0A1S3CCB8 ethylene-responsive transcription factor-like protein At4g13040 isoform X1 (e value: 5.1e-99; % identity: 86.76)
Query:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK
        MVSLRRRKLLGL +GK  FVAPVLKFSENLT E HVHCT+SV VYPICSDEVNKI+EN IAN+EPESSG+SVLDTSKE+ DTTN EPI   ADPP+KRRK
Subjt:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK

Query:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE
        RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAA LYDRAAFMCGREPNFELPE EKQELRKFNWDEFLAMTR+ ITNRKQKR++PE
Subjt:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE

Query:  SKKSKLSSPGNDDDYSNKR
        SKKS+LSSPGNDD  SNKR
Subjt:  SKKSKLSSPGNDDDYSNKR

A0A1S3CCT4 ethylene-responsive transcription factor-like protein At4g13040 isoform X2 (e value: 5.1e-99; % identity: 86.76)
Query:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK
        MVSLRRRKLLGL +GK  FVAPVLKFSENLT E HVHCT+SV VYPICSDEVNKI+EN IAN+EPESSG+SVLDTSKE+ DTTN EPI   ADPP+KRRK
Subjt:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK

Query:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE
        RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAA LYDRAAFMCGREPNFELPE EKQELRKFNWDEFLAMTR+ ITNRKQKR++PE
Subjt:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE

Query:  SKKSKLSSPGNDDDYSNKR
        SKKS+LSSPGNDD  SNKR
Subjt:  SKKSKLSSPGNDDDYSNKR

A0A1S4E2T2 ethylene-responsive transcription factor-like protein At4g13040 isoform X6 (e value: 5.1e-99; % identity: 86.76)
Query:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK
        MVSLRRRKLLGL +GK  FVAPVLKFSENLT E HVHCT+SV VYPICSDEVNKI+EN IAN+EPESSG+SVLDTSKE+ DTTN EPI   ADPP+KRRK
Subjt:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK

Query:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE
        RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAA LYDRAAFMCGREPNFELPE EKQELRKFNWDEFLAMTR+ ITNRKQKR++PE
Subjt:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE

Query:  SKKSKLSSPGNDDDYSNKR
        SKKS+LSSPGNDD  SNKR
Subjt:  SKKSKLSSPGNDDDYSNKR

A0A5A7VNB7 Ethylene-responsive transcription factor-like protein (e value: 5.1e-99; % identity: 86.76)
Query:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK
        MVSLRRRKLLGL +GK  FVAPVLKFSENLT E HVHCT+SV VYPICSDEVNKI+EN IAN+EPESSG+SVLDTSKE+ DTTN EPI   ADPP+KRRK
Subjt:  MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRK

Query:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE
        RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTV SQEEAA LYDRAAFMCGREPNFELPE EKQELRKFNWDEFLAMTR+ ITNRKQKR++PE
Subjt:  RHRRKHFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPE

Query:  SKKSKLSSPGNDDDYSNKR
        SKKS+LSSPGNDD  SNKR
Subjt:  SKKSKLSSPGNDDDYSNKR

SwissProt top hits (e value, % identity, alignment)
Q56XP9 Ethylene-responsive transcription factor-like protein At4g13040 (e value: 6.3e-38; % identity: 46.58)
Query:  MVSLRRRKLLGLCTGKGLFVAPV--LKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKN-DTTNDEPIAVFADPPIK
        MVSLRRR+LLGLC G   +V P+  L   E +T   + +   + +  P  +     +++ +I      +    +   S   +    N + I+    PP K
Subjt:  MVSLRRRKLLGLCTGKGLFVAPV--LKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKN-DTTNDEPIAVFADPPIK

Query:  RRKRHRRKHFPD-ESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQK-
        RRK+HRRK   + E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAARLYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  ITN+K K 
Subjt:  RRKRHRRKHFPD-ESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQK-

Query:  RVSPESKKSKLSSPGNDDD
        R+  E  K   S P   ++
Subjt:  RVSPESKKSKLSSPGNDDD

Arabidopsis top hits (e value, % identity, alignment)
AT4G13040.1 Integrase-type DNA-binding superfamily protein (e value: 4.5e-39; % identity: 46.58)
Query:  MVSLRRRKLLGLCTGKGLFVAPV--LKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKN-DTTNDEPIAVFADPPIK
        MVSLRRR+LLGLC G   +V P+  L   E +T   + +   + +  P  +     +++ +I      +    +   S   +    N + I+    PP K
Subjt:  MVSLRRRKLLGLCTGKGLFVAPV--LKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKN-DTTNDEPIAVFADPPIK

Query:  RRKRHRRKHFPD-ESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQK-
        RRK+HRRK   + E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAARLYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  ITN+K K 
Subjt:  RRKRHRRKHFPD-ESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQK-

Query:  RVSPESKKSKLSSPGNDDD
        R+  E  K   S P   ++
Subjt:  RVSPESKKSKLSSPGNDDD

AT4G13040.2 Integrase-type DNA-binding superfamily protein (e value: 1.4e-35; % identity: 63.2)
Query:  ADPPIKRRKRHRRKHFPD-ESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAIT
        +D P KRRK+HRRK   + E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAARLYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  IT
Subjt:  ADPPIKRRKRHRRKHFPD-ESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAIT

Query:  NRKQK-RVSPESKKSKLSSPGNDDD
        N+K K R+  E  K   S P   ++
Subjt:  NRKQK-RVSPESKKSKLSSPGNDDD

AT4G13040.3 Integrase-type DNA-binding superfamily protein (e value: 4.5e-39; % identity: 46.58)
Query:  MVSLRRRKLLGLCTGKGLFVAPV--LKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKN-DTTNDEPIAVFADPPIK
        MVSLRRR+LLGLC G   +V P+  L   E +T   + +   + +  P  +     +++ +I      +    +   S   +    N + I+    PP K
Subjt:  MVSLRRRKLLGLCTGKGLFVAPV--LKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKN-DTTNDEPIAVFADPPIK

Query:  RRKRHRRKHFPD-ESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQK-
        RRK+HRRK   + E  LMRGVY+KNMKWQAAIKV+KKQIHLGT  SQEEAARLYDRAAFMCGREPNFEL E   +EL++ +W+EFL  TR  ITN+K K 
Subjt:  RRKRHRRKHFPD-ESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQK-

Query:  RVSPESKKSKLSSPGNDDD
        R+  E  K   S P   ++
Subjt:  RVSPESKKSKLSSPGNDDD

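
In the alignments above, the unlabeled line between each Query and Subjt pair is the match line: in the usual BLAST protein layout a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below counts identities for a single alignment block. The function name `midline_identity` is hypothetical, and the % identity values reported on this page cover the whole alignment, so one block will not necessarily reproduce them.

```python
def midline_identity(midline, query):
    """Return (identical, aligned_length) for one alignment block."""
    # Trailing spaces in the match line are often trimmed, so pad it
    # back to the length of the query line before counting.
    padded = midline.ljust(len(query))
    identical = sum(1 for ch in padded if ch.isalpha())
    return identical, len(query)

# Last block of the first GenBank hit above.
query = "SKKSKLSSPGNDDDYSNKR"
midline = "SKKS+LSSPGNDD  SNKR"
same, length = midline_identity(midline, query)
print(f"{same} of {length} positions identical")
```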

Sequences
CDS sequence
ATGGTGAGCTTAAGAAGGCGAAAACTCCTTGGACTTTGCACCGGCAAAGGCTTATTTGTTGCTCCAGTTTTGAAGTTTTCTGAAAATTTGACTACTGAAGATCAC
GTGCATTGTACAAACTCCGTTAGTGTCTATCCCATCTGTTCAGACGAAGTTAACAAGATAAAGGAGAATTCAATTGCAAATTTAGAGCCTGAATCATCAGGGTTA
TCGGTTTTGGATACATCAAAAGAGAAAAATGATACGACAAATGATGAGCCAATTGCAGTATTTGCAGACCCACCCATAAAGCGCAGAAAGAGACACCGGAGAAAG
CATTTTCCCGACGAATCTTTCTTAATGAGAGGTGTTTATTTCAAGAACATGAAATGGCAGGCTGCAATAAAGGTTGACAAAAAACAAATACACTTGGGGACTGTA
GGATCACAAGAAGAAGCTGCTCGTTTGTATGACAGAGCTGCTTTCATGTGTGGAAGGGAACCTAACTTTGAGCTCCCAGAAGCGGAGAAGCAAGAACTGAGAAAG
TTTAATTGGGATGAATTTCTAGCAATGACTCGCCACGCGATTACTAATAGAAAACAGAAGAGGGTCAGCCCCGAATCAAAGAAGTCTAAACTTTCTTCGCCGGGG
AATGACGATGACTACTCTAACAAAAGACATGATGAGTTCGATGACCTCTCAGCTCTAGAAGATGTGGAACCAGTTGCCTCAACATCTTGA
mRNA sequence
AGATGTGCAGTGATCCGGGAAAAAGAACTAAACAATAGCACTGATAATAGGATTGAAGCTAATTATGGTGAGCTTAAGAAGGCGAAAACTCCTTGGACTTTGCAC
CGGCAAAGGCTTATTTGTTGCTCCAGTTTTGAAGTTTTCTGAAAATTTGACTACTGAAGATCACGTGCATTGTACAAACTCCGTTAGTGTCTATCCCATCTGTTC
AGACGAAGTTAACAAGATAAAGGAGAATTCAATTGCAAATTTAGAGCCTGAATCATCAGGGTTATCGGTTTTGGATACATCAAAAGAGAAAAATGATACGACAAA
TGATGAGCCAATTGCAGTATTTGCAGACCCACCCATAAAGCGCAGAAAGAGACACCGGAGAAAGCATTTTCCCGACGAATCTTTCTTAATGAGAGGTGTTTATTT
CAAGAACATGAAATGGCAGGCTGCAATAAAGGTTGACAAAAAACAAATACACTTGGGGACTGTAGGATCACAAGAAGAAGCTGCTCGTTTGTATGACAGAGCTGC
TTTCATGTGTGGAAGGGAACCTAACTTTGAGCTCCCAGAAGCGGAGAAGCAAGAACTGAGAAAGTTTAATTGGGATGAATTTCTAGCAATGACTCGCCACGCGAT
TACTAATAGAAAACAGAAGAGGGTCAGCCCCGAATCAAAGAAGTCTAAACTTTCTTCGCCGGGGAATGACGATGACTACTCTAACAAAAGACATGATGAGTTCGA
TGACCTCTCAGCTCTAGAAGATGTGGAACCAGTTGCCTCAACATCTTGAAATTCAAGAAGAAAAAGTTTAGTTTTCCCATCTCTAACATCTCAGTTTAAGTATGG
AGACCTGCAGTTTTGATTTTTTATGTTCAAAGGCCTGTCAAATCAACCTCCTAGAAGTTTTGGATGTACATCATTTTAGTAGTTTAGGCAAATCTTTCAATCTTT
GAGCCGATCGAAAGAGCAATTTACAGTGATAGCCAGTGATACGAGAGGCCCCTTCTGAATCTACTCGACGATACTTCATGTCGCAAGATGCCTCATGATGAGTTC
GGGCTATTTTCCAGCCTAGAATACATAACTCTAGCTATCCCTTGTTTCCATGAAGGAAGAAGAT
Protein sequence
MVSLRRRKLLGLCTGKGLFVAPVLKFSENLTTEDHVHCTNSVSVYPICSDEVNKIKENSIANLEPESSGLSVLDTSKEKNDTTNDEPIAVFADPPIKRRKRHRRK
HFPDESFLMRGVYFKNMKWQAAIKVDKKQIHLGTVGSQEEAARLYDRAAFMCGREPNFELPEAEKQELRKFNWDEFLAMTRHAITNRKQKRVSPESKKSKLSSPG
NDDDYSNKRHDEFDDLSALEDVEPVASTS
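
The CDS and protein sequences above should correspond through codon translation. As a rough sketch, assuming the standard genetic code (NCBI translation table 1) applies to this nuclear gene, the check below translates a CDS string and stops at the first stop codon. The name `translate` is hypothetical, and the example input is only the first 45 nucleotides of the CDS listed above.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds):
    """Translate a CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]  # KeyError on non-ACGT
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

print(translate("ATGGTGAGCTTAAGAAGGCGAAAACTCCTTGGACTTTGCACCGGC"))
```

The printed peptide can be compared with the start of the protein sequence; pasting the full CDS extends the same comparison to the whole record.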