; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg006445 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg006445
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionEthylene-responsive transcription factor
Genome locationscaffold2:47136549..47147630
RNA-Seq ExpressionSpg006445
SyntenySpg006445
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0009873 - ethylene-activated signaling pathway (biological process)
GO:0005634 - nucleus (cellular component)
GO:0003677 - DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR001471 - AP2/ERF domain
IPR016177 - DNA-binding domain superfamily
IPR036955 - AP2/ERF domain superfamily
IPR044808 - Ethylene-responsive transcription factor


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579632.1 Ethylene-responsive transcription factor, partial [Cucurbita argyrosperma subsp. sororia]1.6e-3771.55Show/hide
Query:  EDLYESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGK
        E   ++P+SH+  QS +SSSPEA +       DGGRRYRGVR++PWGKFAA+IRDPSRKG+RVWLGTFDSDVD ARAYDSAAFK+RGRKAKLNF LDAG 
Subjt:  EDLYESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGK

Query:  SEPPLGNGRKKRKEMT
        S+PP  NGRKKR+E T
Subjt:  SEPPLGNGRKKRKEMT

KAG6605642.1 Ethylene-responsive transcription factor, partial [Cucurbita argyrosperma subsp. sororia]7.1e-3873.28Show/hide
Query:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSE--
        E P   +PEQS +SSSPE ++AEE+SS DGGR+YRGVR++PWGKFAA+IRDPSRKG+RVWLGTF+SDVD ARAYDSAAFKMRGRKAKLNF LDAGKSE  
Subjt:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSE--

Query:  ----PPLGNGRKKRKE
            PP GNGR + +E
Subjt:  ----PPLGNGRKKRKE

KAG7035552.1 Ethylene-responsive transcription factor, partial [Cucurbita argyrosperma subsp. argyrosperma]7.1e-3873.28Show/hide
Query:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSE--
        E P   +PEQS +SSSPE ++AEE+SS DGGR+YRGVR++PWGKFAA+IRDPSRKG+RVWLGTF+SDVD ARAYDSAAFKMRGRKAKLNF LDAGKSE  
Subjt:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSE--

Query:  ----PPLGNGRKKRKE
            PP GNGR + +E
Subjt:  ----PPLGNGRKKRKE

XP_022957677.1 ethylene-responsive transcription factor ERF106-like [Cucurbita moschata]7.1e-3873.28Show/hide
Query:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSE--
        E P   +PEQS +SSSPE ++AEE+SS DGGR+YRGVR++PWGKFAA+IRDPSRKG+RVWLGTF+SDVD ARAYDSAAFKMRGRKAKLNF LDAGKSE  
Subjt:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSE--

Query:  ----PPLGNGRKKRKE
            PP GNGR + +E
Subjt:  ----PPLGNGRKKRKE

XP_023533021.1 ethylene-responsive transcription factor ERF106-like [Cucurbita pepo subsp. pepo]4.9e-3976.79Show/hide
Query:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSE--
        E P   +PEQS +SSSPE ++AEE+SS DGGR+YRGVR++PWGKFAA+IRDPSRKG+RVWLGTF+SDVD ARAYDSAAFKMRGRKAKLNF LDAGKSE  
Subjt:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSE--

Query:  PPLGNGRKKRKE
        PP GNGR +R+E
Subjt:  PPLGNGRKKRKE

TrEMBL top hitse value%identityAlignment
A0A1S3ASX7 ethylene-responsive transcription factor ERF106-like1.7e-3772.41Show/hide
Query:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPP
        +SPK H   QSP+SSSP           DGGRRYRGVR++PWGKFAA+IRDPSRKG+RVWLGTFDSDVD ARAYDSAAFK+RGRKAKLNF LDAGKS+PP
Subjt:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPP

Query:  LGNGRKKRKEMTNPDA
         GNGRKK++  TN +A
Subjt:  LGNGRKKRKEMTNPDA

A0A5A7UNP1 Ethylene-responsive transcription factor ERF106-like protein1.7e-3772.41Show/hide
Query:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPP
        +SPK H   QSP+SSSP           DGGRRYRGVR++PWGKFAA+IRDPSRKG+RVWLGTFDSDVD ARAYDSAAFK+RGRKAKLNF LDAGKS+PP
Subjt:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPP

Query:  LGNGRKKRKEMTNPDA
         GNGRKK++  TN +A
Subjt:  LGNGRKKRKEMTNPDA

A0A6J1H0X6 ethylene-responsive transcription factor ERF106-like3.4e-3873.28Show/hide
Query:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSE--
        E P   +PEQS +SSSPE ++AEE+SS DGGR+YRGVR++PWGKFAA+IRDPSRKG+RVWLGTF+SDVD ARAYDSAAFKMRGRKAKLNF LDAGKSE  
Subjt:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSE--

Query:  ----PPLGNGRKKRKE
            PP GNGR + +E
Subjt:  ----PPLGNGRKKRKE

A0A6J1K7L0 ethylene-responsive transcription factor ERF106-like1.7e-3771.55Show/hide
Query:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKS---
        E P   +PEQS +SSSPE ++AEE+ S DGGRRYRGVR++PWGKF A+IRDPSRKG+RVWLGTF+SDV+ ARAYDSAAF+MRGRKAKLNF LDAGKS   
Subjt:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKS---

Query:  EPPLGNGRKKRKEMTN
         PP GNGR +RK   N
Subjt:  EPPLGNGRKKRKEMTN

E5GCU9 AP2/ERF domain-containing transcription factor1.7e-3772.41Show/hide
Query:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPP
        +SPK H   QSP+SSSP           DGGRRYRGVR++PWGKFAA+IRDPSRKG+RVWLGTFDSDVD ARAYDSAAFK+RGRKAKLNF LDAGKS+PP
Subjt:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPP

Query:  LGNGRKKRKEMTNPDA
         GNGRKK++  TN +A
Subjt:  LGNGRKKRKEMTNPDA

SwissProt top hitse value%identityAlignment
O80341 Ethylene-responsive transcription factor 56.0e-2459.09Show/hide
Query:  RRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPPLGNGRKKRKEMTNPDALVV
        + YRGVRQ+PWGKFAA+IRDP+++G+RVWLGTFD+ ++ ARAYD AAF++RG KA LNF L+ GK +P    G KKRK   +    VV
Subjt:  RRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPPLGNGRKKRKEMTNPDALVV

Q40478 Ethylene-responsive transcription factor 51.3e-2152.13Show/hide
Query:  RRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSE-------PPLGNGRKKRKEMTNPDALV
        + YRGVRQ+PWGKFAA+IRDP+RKG RVWLGTFD+ ++ A+AYD AAFK+RG KA +NF L+    +        P  +GRK+ +E  N + ++
Subjt:  RRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSE-------PPLGNGRKKRKEMTNPDALV

Q8VY90 Ethylene-responsive transcription factor ERF1051.6e-2158.89Show/hide
Query:  RRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPPLGNGRK------KRKEMTNPD
        R YRGVR++PWGK+AA+IRDP++KG RVWLGTFD+ ++ AR YD AAFK+RG KA LNF L+AGK E  LG+ +K      KRK     D
Subjt:  RRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPPLGNGRK------KRKEMTNPD

Q9FKG2 Ethylene-responsive transcription factor ERF1074.9e-2655.05Show/hide
Query:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPP
        ES  S     SP+  +   S  ++    +  R YRGVR++PWGKFAA+IRDP++KG+R+WLGTF+SD+D ARAYD AAFK+RGRKA LNF LDAGK + P
Subjt:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPP

Query:  LGNGRKKRK
        + + RK+R+
Subjt:  LGNGRKKRK

Q9LY05 Ethylene-responsive transcription factor ERF1062.4e-2858.41Show/hide
Query:  ESPKSHRPEQSPQSSSPE----ASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGK
        ES  S  PE +  SS+ E       AE        R YRGVR++PWGKFAA+IRDP++KG+R+WLGTF+SDVD ARAYD AAFK+RGRKA LNF LDAGK
Subjt:  ESPKSHRPEQSPQSSSPE----ASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGK

Query:  SEPPLGNGRKKRK
         E P  +GRK+++
Subjt:  SEPPLGNGRKKRK

Arabidopsis top hitse value%identityAlignment
AT5G07580.1 Integrase-type DNA-binding superfamily protein1.7e-2958.41Show/hide
Query:  ESPKSHRPEQSPQSSSPE----ASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGK
        ES  S  PE +  SS+ E       AE        R YRGVR++PWGKFAA+IRDP++KG+R+WLGTF+SDVD ARAYD AAFK+RGRKA LNF LDAGK
Subjt:  ESPKSHRPEQSPQSSSPE----ASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGK

Query:  SEPPLGNGRKKRK
         E P  +GRK+++
Subjt:  SEPPLGNGRKKRK

AT5G47230.1 ethylene responsive element binding factor 54.3e-2559.09Show/hide
Query:  RRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPPLGNGRKKRKEMTNPDALVV
        + YRGVRQ+PWGKFAA+IRDP+++G+RVWLGTFD+ ++ ARAYD AAF++RG KA LNF L+ GK +P    G KKRK   +    VV
Subjt:  RRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPPLGNGRKKRKEMTNPDALVV

AT5G51190.1 Integrase-type DNA-binding superfamily protein1.2e-2258.89Show/hide
Query:  RRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPPLGNGRK------KRKEMTNPD
        R YRGVR++PWGK+AA+IRDP++KG RVWLGTFD+ ++ AR YD AAFK+RG KA LNF L+AGK E  LG+ +K      KRK     D
Subjt:  RRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPPLGNGRK------KRKEMTNPD

AT5G61590.1 Integrase-type DNA-binding superfamily protein3.5e-2755.05Show/hide
Query:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPP
        ES  S     SP+  +   S  ++    +  R YRGVR++PWGKFAA+IRDP++KG+R+WLGTF+SD+D ARAYD AAFK+RGRKA LNF LDAGK + P
Subjt:  ESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPP

Query:  LGNGRKKRK
        + + RK+R+
Subjt:  LGNGRKKRK

AT5G61600.1 ethylene response factor 1041.7e-2144Show/hide
Query:  ESPKSHRPEQSPQSSSPEASVAEELSSL----DGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAG-
        +SPK +      +   P  SV+  +S+     +  R YRGVR++PWGK+AA+IRDP++KG R+WLGT+D+ V+  RAYD AAF++RGRKA LNF LD   
Subjt:  ESPKSHRPEQSPQSSSPEASVAEELSSL----DGGRRYRGVRQQPWGKFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAG-

Query:  -----KSEPPLGNGRKKRKEMTNPD
               E  +G G++KR + + P+
Subjt:  -----KSEPPLGNGRKKRKEMTNPD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGGCTCCAATCATCTTCAAATCTCTCTGCAGAACTGAAGAACAAAGCTCAGATCCTCTTCAAATCCCGTTACAGAATCGAAGATCGAATTGACTGTGAGAGCTCAGT
TTCAAGCACGTTGTTGTGCGCTCTCTCAGTTCAGGACAGTGTTGAGCTTACGATTAGACACGTTATTCGTGCACCAGATGGCACCCAAGCATATTTTCATGCACTAGCTG
TTCCACCGTATCTCATGTCTATGTTTCAGAGGTATTGTGTTGGGAGAGAACGTCTGGTGACTGGCGAGGGTTGTGACTTGGTTGGCTATGGGATTGCAGTTGCAGTTTGT
GTCTTTTTCCTTGAGGATTTCGGTCTCCTTTATCTTAGGATAGAATCTCTCATTTTTAGACCTTATGTTATGGCTAGAGTAAAAAAAAGTGCAAACAATGATAATAGACG
GAGAAGTCAAAGAATATTGGGTAGTGGTAGCAGATTGCCTACAAGGAAACAGAATGAAAATGACGTTATCGATATAGATTCCGAGGACGAAGTTGATAGATCAACAAACC
TCGGTGTACCAACAAACTCATCGTTCTTCGATCGGTTGTTTGGGGATATGGCTACTGAGATTCGCCCGACAACACATAGAAGGGATGACGAGGATGATAGACAAGCAGAA
CCTATACCAATCAAGATTATAGACCAATTTGCTGATTTAGCAAACTGTCAAATAGTTCCATATGGATATTGCTTTGAGGAAAAGACAATGAGGGATGTTGAAGAGGGAAA
ACATGAGGAAACGATACAACAGGAGGAAGAGGTTTTAGATGATGAAGTTGCAAATGAGCAGGAAGAAGAAGATCAAGAGCTGGTCAATGCAGAAACGTGGATGACCAGGA
AGAACATGATGTTCAGGTTGCAACAGAAGAAGATCAACATGGTAATGATAAAATGGAACCAGGGGACTCATTCTGAGGAACAAAATGAAACTCGTGATGCGGGAAATGTG
GATGTTAGAGAATATGAATCTGAGGAGGATGTTATCGTCCTGGACGACACGACAACTAGAGAAGTAGTTGGGAGGGGGAAAAGAAAAAGGACAGTAGAAGTTGTCAAGAT
GGACCAAAAGAGTGAAACTACGAATAAGGCGATCAAGATGGAACGATTCTCAAGAGGATCGGTGTTTGGAATTAACAGAGAGGATTTGTATGAGAGTCCGAAGAGCCACC
GCCCGGAACAATCGCCGCAGTCGAGCTCACCAGAGGCGTCGGTGGCGGAGGAGCTGTCGAGTTTGGACGGTGGGAGGAGGTACAGAGGCGTACGACAGCAGCCGTGGGGG
AAGTTTGCGGCGAAGATCCGAGATCCGAGTAGGAAAGGGAACAGAGTTTGGTTGGGGACTTTTGATTCGGATGTGGATGTGGCTAGGGCTTATGATTCTGCTGCGTTTAA
GATGAGAGGAAGAAAGGCTAAGTTGAATTTTCTCTTGGATGCCGGAAAGTCCGAGCCACCGCTGGGGAATGGCCGGAAGAAGAGGAAAGAGATGACAAATCCTGATGCAT
TGGTGGTTGGGATTGCTGGATTCCCACATGCATTACTTGTGTGGGCGTATGAATGCATCCTTATGCTTTCTAGACCAACAATCATTTGTGCACAGAAAATTTCAAATGGA
ATGCCGAAGATGAATAGCTGGGTGGCAGACGTACATCCAGAATGGAAAGACTTGGCCATGAAGGTCTTTGAACACCCAAATTTTCAAATATATGAGATGGTTGATGAGAA
TGTTGTCGAGAAAAGAAGGTCATCAATTCAGAAAATACAAGAGTCACAACAGGAATTGAATATGAAGATGTCCACGTTTTTCCGAGAGATTGCTGAAATGAAGAAAATGA
TGGCCACCAAGCAATCTGGGGGACATAATGCTGACAATAATGAGAATGATGACGATTACGAATGTGACAACAGGGATGGAGATGTTGGTAATGGTGGTGAGAATGAAAAT
AGGAATGAATCTAATGCAGGAGGGCAAAAAGAGGGAAACATTGAAGGTGAAGGCAATGAAAACAACGAGAAGGATGACACCATGGAAGAAGTGGATGAAGATATTGTCAG
TAATAGCTTCTTAGTTGAAGTCGATAAAATTGAAAAAGCTGCAATCTCAAATATTCGTAAGGGAAAAACGGTGCTGAAGAAAAGTAAAGAAAATGTGTTTGATTCAAATG
GGGCATATTGTCATGCTTTGAGAGGTACTGGACCGTTTGCAGATAATCAAACTTTTTTAAGGAGAGAACGAAAGAAGATCATACCATCAAAACTAATGAAGTCCCCCTTC
ACATCGAAGTTTGGGTCTGTTGAAGAAAAGAAACGGCCAAAGATAGAGAAGATCTTGACAACACAAGATTTCAATGCACCAAATTTTAATTTGCTTTCTCCACCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGGCTCCAATCATCTTCAAATCTCTCTGCAGAACTGAAGAACAAAGCTCAGATCCTCTTCAAATCCCGTTACAGAATCGAAGATCGAATTGACTGTGAGAGCTCAGT
TTCAAGCACGTTGTTGTGCGCTCTCTCAGTTCAGGACAGTGTTGAGCTTACGATTAGACACGTTATTCGTGCACCAGATGGCACCCAAGCATATTTTCATGCACTAGCTG
TTCCACCGTATCTCATGTCTATGTTTCAGAGGTATTGTGTTGGGAGAGAACGTCTGGTGACTGGCGAGGGTTGTGACTTGGTTGGCTATGGGATTGCAGTTGCAGTTTGT
GTCTTTTTCCTTGAGGATTTCGGTCTCCTTTATCTTAGGATAGAATCTCTCATTTTTAGACCTTATGTTATGGCTAGAGTAAAAAAAAGTGCAAACAATGATAATAGACG
GAGAAGTCAAAGAATATTGGGTAGTGGTAGCAGATTGCCTACAAGGAAACAGAATGAAAATGACGTTATCGATATAGATTCCGAGGACGAAGTTGATAGATCAACAAACC
TCGGTGTACCAACAAACTCATCGTTCTTCGATCGGTTGTTTGGGGATATGGCTACTGAGATTCGCCCGACAACACATAGAAGGGATGACGAGGATGATAGACAAGCAGAA
CCTATACCAATCAAGATTATAGACCAATTTGCTGATTTAGCAAACTGTCAAATAGTTCCATATGGATATTGCTTTGAGGAAAAGACAATGAGGGATGTTGAAGAGGGAAA
ACATGAGGAAACGATACAACAGGAGGAAGAGGTTTTAGATGATGAAGTTGCAAATGAGCAGGAAGAAGAAGATCAAGAGCTGGTCAATGCAGAAACGTGGATGACCAGGA
AGAACATGATGTTCAGGTTGCAACAGAAGAAGATCAACATGGTAATGATAAAATGGAACCAGGGGACTCATTCTGAGGAACAAAATGAAACTCGTGATGCGGGAAATGTG
GATGTTAGAGAATATGAATCTGAGGAGGATGTTATCGTCCTGGACGACACGACAACTAGAGAAGTAGTTGGGAGGGGGAAAAGAAAAAGGACAGTAGAAGTTGTCAAGAT
GGACCAAAAGAGTGAAACTACGAATAAGGCGATCAAGATGGAACGATTCTCAAGAGGATCGGTGTTTGGAATTAACAGAGAGGATTTGTATGAGAGTCCGAAGAGCCACC
GCCCGGAACAATCGCCGCAGTCGAGCTCACCAGAGGCGTCGGTGGCGGAGGAGCTGTCGAGTTTGGACGGTGGGAGGAGGTACAGAGGCGTACGACAGCAGCCGTGGGGG
AAGTTTGCGGCGAAGATCCGAGATCCGAGTAGGAAAGGGAACAGAGTTTGGTTGGGGACTTTTGATTCGGATGTGGATGTGGCTAGGGCTTATGATTCTGCTGCGTTTAA
GATGAGAGGAAGAAAGGCTAAGTTGAATTTTCTCTTGGATGCCGGAAAGTCCGAGCCACCGCTGGGGAATGGCCGGAAGAAGAGGAAAGAGATGACAAATCCTGATGCAT
TGGTGGTTGGGATTGCTGGATTCCCACATGCATTACTTGTGTGGGCGTATGAATGCATCCTTATGCTTTCTAGACCAACAATCATTTGTGCACAGAAAATTTCAAATGGA
ATGCCGAAGATGAATAGCTGGGTGGCAGACGTACATCCAGAATGGAAAGACTTGGCCATGAAGGTCTTTGAACACCCAAATTTTCAAATATATGAGATGGTTGATGAGAA
TGTTGTCGAGAAAAGAAGGTCATCAATTCAGAAAATACAAGAGTCACAACAGGAATTGAATATGAAGATGTCCACGTTTTTCCGAGAGATTGCTGAAATGAAGAAAATGA
TGGCCACCAAGCAATCTGGGGGACATAATGCTGACAATAATGAGAATGATGACGATTACGAATGTGACAACAGGGATGGAGATGTTGGTAATGGTGGTGAGAATGAAAAT
AGGAATGAATCTAATGCAGGAGGGCAAAAAGAGGGAAACATTGAAGGTGAAGGCAATGAAAACAACGAGAAGGATGACACCATGGAAGAAGTGGATGAAGATATTGTCAG
TAATAGCTTCTTAGTTGAAGTCGATAAAATTGAAAAAGCTGCAATCTCAAATATTCGTAAGGGAAAAACGGTGCTGAAGAAAAGTAAAGAAAATGTGTTTGATTCAAATG
GGGCATATTGTCATGCTTTGAGAGGTACTGGACCGTTTGCAGATAATCAAACTTTTTTAAGGAGAGAACGAAAGAAGATCATACCATCAAAACTAATGAAGTCCCCCTTC
ACATCGAAGTTTGGGTCTGTTGAAGAAAAGAAACGGCCAAAGATAGAGAAGATCTTGACAACACAAGATTTCAATGCACCAAATTTTAATTTGCTTTCTCCACCTTGA
Protein sequenceShow/hide protein sequence
MRLQSSSNLSAELKNKAQILFKSRYRIEDRIDCESSVSSTLLCALSVQDSVELTIRHVIRAPDGTQAYFHALAVPPYLMSMFQRYCVGRERLVTGEGCDLVGYGIAVAVC
VFFLEDFGLLYLRIESLIFRPYVMARVKKSANNDNRRRSQRILGSGSRLPTRKQNENDVIDIDSEDEVDRSTNLGVPTNSSFFDRLFGDMATEIRPTTHRRDDEDDRQAE
PIPIKIIDQFADLANCQIVPYGYCFEEKTMRDVEEGKHEETIQQEEEVLDDEVANEQEEEDQELVNAETWMTRKNMMFRLQQKKINMVMIKWNQGTHSEEQNETRDAGNV
DVREYESEEDVIVLDDTTTREVVGRGKRKRTVEVVKMDQKSETTNKAIKMERFSRGSVFGINREDLYESPKSHRPEQSPQSSSPEASVAEELSSLDGGRRYRGVRQQPWG
KFAAKIRDPSRKGNRVWLGTFDSDVDVARAYDSAAFKMRGRKAKLNFLLDAGKSEPPLGNGRKKRKEMTNPDALVVGIAGFPHALLVWAYECILMLSRPTIICAQKISNG
MPKMNSWVADVHPEWKDLAMKVFEHPNFQIYEMVDENVVEKRRSSIQKIQESQQELNMKMSTFFREIAEMKKMMATKQSGGHNADNNENDDDYECDNRDGDVGNGGENEN
RNESNAGGQKEGNIEGEGNENNEKDDTMEEVDEDIVSNSFLVEVDKIEKAAISNIRKGKTVLKKSKENVFDSNGAYCHALRGTGPFADNQTFLRRERKKIIPSKLMKSPF
TSKFGSVEEKKRPKIEKILTTQDFNAPNFNLLSPP