; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10021253 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10021253
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionnuclear envelope pore membrane protein POM 121-like
Genome locationChr05:7019953..7023392
RNA-Seq ExpressionHG10021253
SyntenyHG10021253
Gene Ontology termsGO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR040356 - SPEAR family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6583971.1 hypothetical protein SDJN03_19903, partial [Cucurbita argyrosperma subsp. sororia]9.6e-8759.48Show/hide
Query:  MKKTQAKRHRIPRRGPGVAELEKILKEQE---GGGDGGAGQGHDSFI---------TQFQDISPPPSLNPPRPPPPPPPPPLVPIITPP--RDYASWSNN
        MKKT  KRHRIPRRGPGVAELEKILKEQ+   GGG+GG  Q H S +          +    + P SLNPPRPPPPPPPPPL+P   PP  RDYASWS N
Subjt:  MKKTQAKRHRIPRRGPGVAELEKILKEQE---GGGDGGAGQGHDSFI---------TQFQDISPPPSLNPPRPPPPPPPPPLVPIITPP--RDYASWSNN

Query:  LPLFPTFEFIPPPLPT-----AAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLNL--------------EYYNSM-------------------
        LPLFPT +FIPP LPT     AAAA+ +KPLFPTTR SD+Q+N PPHFFP+FQYSASS NL               YY  +                   
Subjt:  LPLFPTFEFIPPPLPT-----AAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLNL--------------EYYNSM-------------------

Query:  ----MVNAKRVRAILEESHREGNIESRGQIFAKMATKD---SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLA-EDSNLMVLGSS
            MV+AKRVR  L+E HR+ N ESR   F  MATK+   SSSSSSSSSM+MNRSP + DSN RGTKRGF G+LM  +KRS  YQLA E S+LM LG S
Subjt:  ----MVNAKRVRAILEESHREGNIESRGQIFAKMATKD---SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLA-EDSNLMVLGSS

Query:  SSSAPNEI-APFNFHLPQETMEEASQHRD-GGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL
        SSSAPNE+ A FN H PQETM EASQ+RD GG AS+Y +VTFNSSS     S  KGNKEI IGS    EVEVEAEA+GIDL+LKL
Subjt:  SSSAPNEI-APFNFHLPQETMEEASQHRD-GGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL

KAG7019594.1 hypothetical protein SDJN02_18557, partial [Cucurbita argyrosperma subsp. argyrosperma]1.5e-9265.62Show/hide
Query:  MKKTQAKRHRIPRRGPGVAELEKILKEQE---GGGDGGAGQGHDSFI---------TQFQDISPPPSLNPPRPPPPPPPPPLVPIITPP--RDYASWSNN
        MKKT  KRHRIPRRGPGVAELEKILKEQ+   GGG+GG  Q H S +          +    + P SLNPPRPPPPPPPPPL+P   PP  RDYASWS N
Subjt:  MKKTQAKRHRIPRRGPGVAELEKILKEQE---GGGDGGAGQGHDSFI---------TQFQDISPPPSLNPPRPPPPPPPPPLVPIITPP--RDYASWSNN

Query:  LPLFPTFEFIPP--PLPTAAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLNLEYYNSMMVNAKRVRAILEESHREGNIESRGQIFAKMATKD--
        LPLFPT +FIPP  P PT AAA+ +KPLFPTTR SD+Q+N PPHFFP+FQYSASS     +N MMV+AKRVR  L+E HR+ N ESR   F  MATK+  
Subjt:  LPLFPTFEFIPP--PLPTAAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLNLEYYNSMMVNAKRVRAILEESHREGNIESRGQIFAKMATKD--

Query:  -SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLA-EDSNLMVLGSSSSSAPNEI-APFNFHLPQETMEEASQHRD-GGCASNYYKV
         SSSSSSSSSM+MNRSPF+ DSN RGTKRGF G+LM  +KRS  YQLA E S+LM LG SSSSAPNE+ A FN H PQETM EASQ+RD GG AS+Y KV
Subjt:  -SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLA-EDSNLMVLGSSSSSAPNEI-APFNFHLPQETMEEASQHRD-GGCASNYYKV

Query:  TFNSSSLYESNSNSKGNKEIAIGS----EARAEVEVEAEAKGIDLNLKL
        TFNSSS     S  KGNKEI IGS    EA AE E EAEA+GIDL+LKL
Subjt:  TFNSSSLYESNSNSKGNKEIAIGS----EARAEVEVEAEAKGIDLNLKL

XP_022927210.1 nuclear envelope pore membrane protein POM 121-like [Cucurbita moschata]7.9e-8960.47Show/hide
Query:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHD----------SFITQFQDISP---PPSLNPPRPPPPPPPPPLVPIITPP--RDYASWSN
        MKKTQ KRHRIPRRGPGVAELEKILKEQEGG  GG G G            SF    +  SP   P SLNPPRPPPPPPPPPL+P   PP  RDYASWS 
Subjt:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHD----------SFITQFQDISP---PPSLNPPRPPPPPPPPPLVPIITPP--RDYASWSN

Query:  NLPLFPTFEFIPP--PLPTAAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLN--------------LEYYNSM---------------------
        NLPLFPT +FIPP  P PT AAA+ +KPLFPTTR SD+Q+N PPHFFP+FQYSASS N                YY  +                     
Subjt:  NLPLFPTFEFIPP--PLPTAAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLN--------------LEYYNSM---------------------

Query:  --MVNAKRVRAILEESHREGNIESRGQIFAKMATKD---SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLA-EDSNLMVLGSSSS
          MV+AKRVR  L+E HR+ N ESR   F  MATK+   SSSSSSSSSM+MNRSPF+ DSN RGTKRGF G+LM  +KRS  YQLA E S+LM LG SSS
Subjt:  --MVNAKRVRAILEESHREGNIESRGQIFAKMATKD---SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLA-EDSNLMVLGSSSS

Query:  SAPNEI-APFNFHLPQETMEEASQHRDGGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL
        SAPNE+ A FN H PQET  EASQ+RDGG AS+Y KVTFNSSS     S  KGNKEI IGS    EVE EAEA+GIDL+LKL
Subjt:  SAPNEI-APFNFHLPQETMEEASQHRDGGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL

XP_023001726.1 trinucleotide repeat-containing gene 18 protein-like [Cucurbita maxima]1.9e-9061.98Show/hide
Query:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHD---------SFITQFQDISP---PPSLNPPRPPPPPPPPPLVPIITPP--RDYASWSNN
        MKKT  KRHRIPRRGPGVAELEKILKEQEGG  GG G G D         SF    +  SP   P SLNPPRPPPPPPPPPL+P   PP  RDYASWS N
Subjt:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHD---------SFITQFQDISP---PPSLNPPRPPPPPPPPPLVPIITPP--RDYASWSNN

Query:  LPLFPTFEFIPPPLPT----AAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLNL--------------EYYNSM--------------------
        LPLFPT +FIPP LPT    AAAA+A+KPLFPTTR SD+Q+N PPHFFP+FQYSASS NL               YY  +                    
Subjt:  LPLFPTFEFIPPPLPT----AAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLNL--------------EYYNSM--------------------

Query:  ---MVNAKRVRAILEESHREGNIESRGQIFAKMATKD---SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLA-EDSNLMVLGSSS
           MV+AKRVR  L+E HR+ N ESR   F  MATK+   SSSSSSSSSM+MNRSPFN DSN RGTKRGF G+LM  +KRS  YQLA E S+LM LG SS
Subjt:  ---MVNAKRVRAILEESHREGNIESRGQIFAKMATKD---SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLA-EDSNLMVLGSSS

Query:  SSAPNEI-APFNFHLPQETMEEASQHRD-GGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL
        SSAPNE+ A FN H PQETM EASQ+RD GG AS+Y KVTFNSSS     S SKGNKEI IGS    EVE EAEA+GIDLNLKL
Subjt:  SSAPNEI-APFNFHLPQETMEEASQHRD-GGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL

XP_038895159.1 uncharacterized protein DDB_G0271670-like [Benincasa hispida]1.2e-11368.02Show/hide
Query:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHDSFITQFQDISPPP---------SLNPPRPPPPPPPPPLVPIITPPRDY-ASWSNNLPLF
        MKK QAKR RIPRRGPGVAELEKILKEQEG                 QDI  PP         SLNPPRPPPPPPPPPLV    P  DY ASWSNNLPLF
Subjt:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHDSFITQFQDISPPP---------SLNPPRPPPPPPPPPLVPIITPPRDY-ASWSNNLPLF

Query:  PTFEFIPPPLPTAAA---ASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLNLEYYNSM---------------------------------------
        PT EFIPPPLP +AA   ASAKKPLFPTTRISDSQL F PHFFPSFQYSASSLN+E YNSM                                       
Subjt:  PTFEFIPPPLPTAAA---ASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLNLEYYNSM---------------------------------------

Query:  ---------MVNAKRVRAILEESHREGNIESRGQIFAKMATKD-------SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLAEDS
                 MV+AKRVRA LEESHRE NIES+G IF  MATKD       SSSSSSSSSMEMN SPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLAEDS
Subjt:  ---------MVNAKRVRAILEESHREGNIESRGQIFAKMATKD-------SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLAEDS

Query:  NLMVLGSSSSSAPNEIAPFNFHLPQETMEEASQHRDGGCASNYYKVTFN-SSSLYESNSNSKGNKEIAIGS--EARAEVEVEAEAKGIDLNLKL
        NLM LGSSSSSAPNEIA FNFHLPQETM EASQHRDGGC S+YYKVTFN SSS+YESNSNSKGNKE  IGS   A A+VE EAEA GIDLNLKL
Subjt:  NLMVLGSSSSSAPNEIAPFNFHLPQETMEEASQHRDGGCASNYYKVTFN-SSSLYESNSNSKGNKEIAIGS--EARAEVEVEAEAKGIDLNLKL

TrEMBL top hitse value%identityAlignment
A0A0A0LSF6 Uncharacterized protein9.7e-8557.72Show/hide
Query:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGG----DGGAGQGHDSFITQFQDISPPPSLNPPRPPPPPPPPPLVPIITP-----PRDYASWSNNLPLFP
        M+KTQAKR RIPRRGPGVAELEKILKEQE G        +    +S  T     + P SLNPPRPPPPPPPPPLV I+TP     PRDY +WSNNLPLFP
Subjt:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGG----DGGAGQGHDSFITQFQDISPPPSLNPPRPPPPPPPPPLVPIITP-----PRDYASWSNNLPLFP

Query:  TFEFIPPP-LPTAAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLNL-EYYNSM-----------------------------------------
        T EFIPPP LPT      +KPLFPTTR+S+SQLN  P+F PSFQYSASS N  +YYN M                                         
Subjt:  TFEFIPPP-LPTAAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLNL-EYYNSM-----------------------------------------

Query:  --------MVNAKRVRAILEESHR-EGNIESRGQIFAK-------MATKDSSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQ-LMSSTKRSGSYQLA-E
                MVNAKRV   LEESHR E N  +   I  K       M TKD SSSSSSSSME N SPF+F SNFRGTKRG  GQ  MS+TKRSG YQL  +
Subjt:  --------MVNAKRVRAILEESHR-EGNIESRGQIFAK-------MATKDSSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQ-LMSSTKRSGSYQLA-E

Query:  DSNLMVLGSSSSSAPNEIAPFN-FHLPQETMEEASQHRD-GGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL
        +SNLM LGSSSSSAPNEI  FN FHLPQETM E  QHRD GGC S+YYK+ FN SSLYESNSN+KGN+E+ + S   A  E EAEA+GIDLNLKL
Subjt:  DSNLMVLGSSSSSAPNEIAPFN-FHLPQETMEEASQHRD-GGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL

A0A1S3B6J8 uncharacterized protein DDB_G0271670-like1.3e-8458.44Show/hide
Query:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHDSFITQFQDISP--PPSLNPPRPPPPPPPPPLVPIIT-----PPRDYASWSNNLPLFPTF
        M+KTQAKR RIPRRGPGVAELEKILKEQE G  G +   H S  T     +   P SLNPP PPPPPPPPPLV I+T     PPRDY SW+NNLPLFPT 
Subjt:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHDSFITQFQDISP--PPSLNPPRPPPPPPPPPLVPIIT-----PPRDYASWSNNLPLFPTF

Query:  EFIPPP-LPTAAAASAKKPLFPTTRISD-SQLNFPPHFFPSFQYSASSLNL-EYYNSM------------------------------------------
        EFIPPP LPT A   A+KPLFPTTRIS+ SQLN  P+F P+FQ+SASS N  +YYN M                                          
Subjt:  EFIPPP-LPTAAAASAKKPLFPTTRISD-SQLNFPPHFFPSFQYSASSLNL-EYYNSM------------------------------------------

Query:  -------MVNAKRVRAILEESHR-EGNIESRGQIFAK-------MATKD----SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLM-SSTKRSGSYQL
               MVNAKRV   LEESHR E N  +   I  K       M TKD    SSSSSSSSSME+N SPF+  SNFRGTKRG AGQLM ++TKRSG YQL
Subjt:  -------MVNAKRVRAILEESHR-EGNIESRGQIFAK-------MATKD----SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLM-SSTKRSGSYQL

Query:  A-EDSNLMVLGSSSSSAPNEIAPFN-FHLPQETMEEASQHRDGGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL
          ++SNLM LGSSSSSAPNEI  FN FHLP+ETM E  QHRDGGC S+ YK+ FN SSLYESNSN+KGN+EIAI S A A  E EAE +GIDLNLKL
Subjt:  A-EDSNLMVLGSSSSSAPNEIAPFN-FHLPQETMEEASQHRDGGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL

A0A5A7UM91 CREB-regulated transcription coactivator 1-like1.5e-3765.62Show/hide
Query:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHDSFITQFQDISP--PPSLNPPRPPPPPPPPPLVPIIT-----PPRDYASWSNNLPLFPTF
        M+KTQAKR RIPRRGPGVAELEKILKEQE G  G +   H S  T     +   P SLNPP PPPPPPPPPLV I+T     PPRDY SW+NNLPLFPT 
Subjt:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHDSFITQFQDISP--PPSLNPPRPPPPPPPPPLVPIIT-----PPRDYASWSNNLPLFPTF

Query:  EFIPPP-LPTAAAASAKKPLFPTTRISD-SQLNFPPHFFPSFQYSASSLNLEYYNSMMVN
        EFIPPP LPT A   A+KPLFPTTRIS+ SQLN  P+F P+FQ+SASS N + Y + MVN
Subjt:  EFIPPP-LPTAAAASAKKPLFPTTRISD-SQLNFPPHFFPSFQYSASSLNLEYYNSMMVN

A0A6J1EGJ1 nuclear envelope pore membrane protein POM 121-like3.8e-8960.47Show/hide
Query:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHD----------SFITQFQDISP---PPSLNPPRPPPPPPPPPLVPIITPP--RDYASWSN
        MKKTQ KRHRIPRRGPGVAELEKILKEQEGG  GG G G            SF    +  SP   P SLNPPRPPPPPPPPPL+P   PP  RDYASWS 
Subjt:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHD----------SFITQFQDISP---PPSLNPPRPPPPPPPPPLVPIITPP--RDYASWSN

Query:  NLPLFPTFEFIPP--PLPTAAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLN--------------LEYYNSM---------------------
        NLPLFPT +FIPP  P PT AAA+ +KPLFPTTR SD+Q+N PPHFFP+FQYSASS N                YY  +                     
Subjt:  NLPLFPTFEFIPP--PLPTAAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLN--------------LEYYNSM---------------------

Query:  --MVNAKRVRAILEESHREGNIESRGQIFAKMATKD---SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLA-EDSNLMVLGSSSS
          MV+AKRVR  L+E HR+ N ESR   F  MATK+   SSSSSSSSSM+MNRSPF+ DSN RGTKRGF G+LM  +KRS  YQLA E S+LM LG SSS
Subjt:  --MVNAKRVRAILEESHREGNIESRGQIFAKMATKD---SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLA-EDSNLMVLGSSSS

Query:  SAPNEI-APFNFHLPQETMEEASQHRDGGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL
        SAPNE+ A FN H PQET  EASQ+RDGG AS+Y KVTFNSSS     S  KGNKEI IGS    EVE EAEA+GIDL+LKL
Subjt:  SAPNEI-APFNFHLPQETMEEASQHRDGGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL

A0A6J1KHF7 trinucleotide repeat-containing gene 18 protein-like9.1e-9161.98Show/hide
Query:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHD---------SFITQFQDISP---PPSLNPPRPPPPPPPPPLVPIITPP--RDYASWSNN
        MKKT  KRHRIPRRGPGVAELEKILKEQEGG  GG G G D         SF    +  SP   P SLNPPRPPPPPPPPPL+P   PP  RDYASWS N
Subjt:  MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHD---------SFITQFQDISP---PPSLNPPRPPPPPPPPPLVPIITPP--RDYASWSNN

Query:  LPLFPTFEFIPPPLPT----AAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLNL--------------EYYNSM--------------------
        LPLFPT +FIPP LPT    AAAA+A+KPLFPTTR SD+Q+N PPHFFP+FQYSASS NL               YY  +                    
Subjt:  LPLFPTFEFIPPPLPT----AAAASAKKPLFPTTRISDSQLNFPPHFFPSFQYSASSLNL--------------EYYNSM--------------------

Query:  ---MVNAKRVRAILEESHREGNIESRGQIFAKMATKD---SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLA-EDSNLMVLGSSS
           MV+AKRVR  L+E HR+ N ESR   F  MATK+   SSSSSSSSSM+MNRSPFN DSN RGTKRGF G+LM  +KRS  YQLA E S+LM LG SS
Subjt:  ---MVNAKRVRAILEESHREGNIESRGQIFAKMATKD---SSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSSTKRSGSYQLA-EDSNLMVLGSSS

Query:  SSAPNEI-APFNFHLPQETMEEASQHRD-GGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL
        SSAPNE+ A FN H PQETM EASQ+RD GG AS+Y KVTFNSSS     S SKGNKEI IGS    EVE EAEA+GIDLNLKL
Subjt:  SSAPNEI-APFNFHLPQETMEEASQHRD-GGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGAAAACTCAAGCCAAACGACATCGTATTCCCAGGAGAGGACCTGGCGTTGCAGAGCTTGAGAAGATTTTGAAGGAACAAGAAGGCGGCGGTGACGGCGGTGCCGG
CCAAGGTCATGACTCCTTCATTACCCAATTTCAAGACATTTCTCCACCGCCGTCGTTAAACCCTCCCCGGCCACCGCCGCCGCCGCCTCCTCCGCCACTTGTGCCAATAA
TCACGCCGCCGCGTGACTATGCTTCTTGGTCAAATAATCTTCCGTTGTTCCCAACTTTTGAATTCATTCCGCCCCCTCTACCAACCGCCGCCGCTGCCTCCGCCAAAAAA
CCCTTGTTTCCAACAACCCGAATATCCGATTCCCAATTGAATTTTCCTCCGCATTTTTTCCCCAGTTTTCAATATTCGGCTTCTTCTCTCAATCTTGAATATTATAATTC
AATGATGGTAAATGCAAAGAGGGTGAGAGCCATTTTGGAAGAAAGCCATAGGGAAGGAAATATAGAGAGCAGAGGTCAAATTTTCGCAAAAATGGCAACAAAAGACTCAT
CTTCCTCTTCTTCTTCTTCTTCAATGGAAATGAATCGTTCTCCATTTAATTTCGATTCAAACTTCAGAGGTACAAAAAGGGGTTTTGCTGGGCAGTTAATGAGCTCCACC
AAACGAAGTGGAAGCTATCAATTAGCAGAAGACAGCAATTTGATGGTATTAGGATCTTCATCATCATCAGCTCCAAATGAAATTGCACCATTCAATTTCCATCTCCCCCA
AGAAACTATGGAGGAGGCTTCACAGCATAGAGATGGAGGATGTGCCTCAAATTACTACAAAGTTACATTTAACTCCTCCTCCTTGTATGAATCAAACTCAAACTCAAAGG
GCAACAAAGAAATTGCCATCGGCTCCGAAGCTAGAGCCGAAGTCGAAGTCGAAGCCGAAGCCAAAGGCATCGATCTGAATTTGAAGCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAGAAAACTCAAGCCAAACGACATCGTATTCCCAGGAGAGGACCTGGCGTTGCAGAGCTTGAGAAGATTTTGAAGGAACAAGAAGGCGGCGGTGACGGCGGTGCCGG
CCAAGGTCATGACTCCTTCATTACCCAATTTCAAGACATTTCTCCACCGCCGTCGTTAAACCCTCCCCGGCCACCGCCGCCGCCGCCTCCTCCGCCACTTGTGCCAATAA
TCACGCCGCCGCGTGACTATGCTTCTTGGTCAAATAATCTTCCGTTGTTCCCAACTTTTGAATTCATTCCGCCCCCTCTACCAACCGCCGCCGCTGCCTCCGCCAAAAAA
CCCTTGTTTCCAACAACCCGAATATCCGATTCCCAATTGAATTTTCCTCCGCATTTTTTCCCCAGTTTTCAATATTCGGCTTCTTCTCTCAATCTTGAATATTATAATTC
AATGATGGTAAATGCAAAGAGGGTGAGAGCCATTTTGGAAGAAAGCCATAGGGAAGGAAATATAGAGAGCAGAGGTCAAATTTTCGCAAAAATGGCAACAAAAGACTCAT
CTTCCTCTTCTTCTTCTTCTTCAATGGAAATGAATCGTTCTCCATTTAATTTCGATTCAAACTTCAGAGGTACAAAAAGGGGTTTTGCTGGGCAGTTAATGAGCTCCACC
AAACGAAGTGGAAGCTATCAATTAGCAGAAGACAGCAATTTGATGGTATTAGGATCTTCATCATCATCAGCTCCAAATGAAATTGCACCATTCAATTTCCATCTCCCCCA
AGAAACTATGGAGGAGGCTTCACAGCATAGAGATGGAGGATGTGCCTCAAATTACTACAAAGTTACATTTAACTCCTCCTCCTTGTATGAATCAAACTCAAACTCAAAGG
GCAACAAAGAAATTGCCATCGGCTCCGAAGCTAGAGCCGAAGTCGAAGTCGAAGCCGAAGCCAAAGGCATCGATCTGAATTTGAAGCTGTAA
Protein sequenceShow/hide protein sequence
MKKTQAKRHRIPRRGPGVAELEKILKEQEGGGDGGAGQGHDSFITQFQDISPPPSLNPPRPPPPPPPPPLVPIITPPRDYASWSNNLPLFPTFEFIPPPLPTAAAASAKK
PLFPTTRISDSQLNFPPHFFPSFQYSASSLNLEYYNSMMVNAKRVRAILEESHREGNIESRGQIFAKMATKDSSSSSSSSSMEMNRSPFNFDSNFRGTKRGFAGQLMSST
KRSGSYQLAEDSNLMVLGSSSSSAPNEIAPFNFHLPQETMEEASQHRDGGCASNYYKVTFNSSSLYESNSNSKGNKEIAIGSEARAEVEVEAEAKGIDLNLKL